Optimization and Control
Showing new listings for Thursday, 23 April 2026
- [1] arXiv:2604.19981 [pdf, html, other]
Title: Debiasing optimal transport: classical and entropic
Subjects: Optimization and Control (math.OC); Functional Analysis (math.FA)
We study the notion of debiasability for cost functions arising in optimal transport. We call a symmetric cost function $c:\mathscr{X}\times\mathscr{X}\to\mathbb{R}\cup\{+\infty\}$ debiasable if it satisfies $c(x,y)\ge \tfrac{1}{2}c(x,x)+\tfrac{1}{2}c(y,y)$ for all $x,y\in\mathscr{X}$. Building on an equivalent characterization by an inf-representation $c(x,y)=\inf_{z\in\mathscr{Z}}\psi(x,z)+\psi(y,z)$ for some set $\mathscr{Z}$ and some function $\psi: \mathscr{X}\times \mathscr{Z} \to \mathbb{R} \cup \{+\infty\}$, interpreted as a generalization of the midpoint identity for squared geodesic distances, we investigate the debiasability of costs defined on spaces of probability measures. Our primary focus is the entropic regularization of optimal transport across different regimes of the regularization parameter $\varepsilon \in [0,+\infty]$, encompassing classical optimal transport ($\varepsilon=0$), entropic optimal transport ($\varepsilon>0$), and the Maximum Mean Discrepancy ($\varepsilon=+\infty$). For $\varepsilon \in (0,+\infty]$, we investigate sufficient conditions, such as negative definiteness of the ground cost or continuity and positive definiteness of the induced kernel, handled then via a convex-nonconcave minimax argument. All our results extend naturally to unbalanced optimal transport settings and we generalize in this way the findings of \cite{feydy2019interpolating} and \cite{sejourne2019sinkhorn}. As a byproduct, we derive novel decomposition formulas for entropic optimal transport, which may be of independent interest.
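As a small worked instance of the inf-representation mentioned above, consider the squared Euclidean cost $c(x,y)=\|x-y\|^2$ on $\mathbb{R}^d$. This choice, together with $\mathscr{Z}=\mathbb{R}^d$ and $\psi(x,z)=2\|x-z\|^2$, is an illustrative assumption and is not taken from the paper. The objective is strictly convex in $z$, and setting its gradient $4(z-x)+4(z-y)$ to zero gives the midpoint, so

$$
\inf_{z\in\mathbb{R}^d}\bigl\{2\|x-z\|^2+2\|y-z\|^2\bigr\}
= 2\cdot\frac{\|x-y\|^2}{4}+2\cdot\frac{\|x-y\|^2}{4}
= \|x-y\|^2,
\qquad \text{attained at } z=\tfrac{1}{2}(x+y).
$$

Since $c(x,x)=c(y,y)=0$ here, the debiasability inequality $c(x,y)\ge \tfrac{1}{2}c(x,x)+\tfrac{1}{2}c(y,y)$ reduces to $\|x-y\|^2\ge 0$.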
- [2] arXiv:2604.19994 [pdf, html, other]
Title: Covariance Steering of Discrete-Time Markov Jump Linear Systems with Multiplicative Noise
Comments: Submitted to a journal; 28 pages, 3 figures
Subjects: Optimization and Control (math.OC); Systems and Control (eess.SY)
We study a finite-horizon covariance steering problem for discrete-time Markov jump linear systems (MJLS) with both state- and control-dependent multiplicative noise. The objective is to minimize a quadratic running cost while steering the system from given mode-conditioned initial means and covariances to a prescribed terminal mean and covariance. We first show that, without loss of generality, feasible controls may be represented by mode-dependent linear feedback together with feedforward and independent random components, and we highlight that, in contrast to the case without multiplicative noise, a purely affine state-feedback law does not in general suffice. To this end, we introduce a lifted-state formulation that embeds the mean and covariance information into a unified second-moment description, and we prove that the resulting lifted problem is equivalent to the original covariance steering problem formulation. This leads to a lossless relaxation in moment variables and an SDP reformulation for the unconstrained case. We further study chance-constrained covariance steering with ball and half-space constraints on the state and control, derive tractable sufficient convex surrogates, and establish an iterative reference-update scheme to reduce conservatism. Numerical experiments on a finance application illustrate our results.
- [3] arXiv:2604.20029 [pdf, other]
Title: Forward-looking evolutionary game dynamics subject to exploration cost
Subjects: Optimization and Control (math.OC); Systems and Control (eess.SY)
We extend classical evolutionary game dynamics based on the momentary action choices of agents by accounting for two elements: forward-looking behavior and exploration cost. We focus on pairwise comparison protocols that cover major evolutionary game dynamics, such as replicator and logit models. In the proposed mathematical framework, agents update their actions by paying a cost so that a utility or its relative difference is maximized. We show that forward-looking behavior can be modeled as a coupling between the evolutionary game dynamic and static Hamilton-Jacobi-Bellman equation: a mean field game. The exploration cost and its constraint are naturally related to these equations as a function of the optimal Lagrangian multiplier serving as a relaxation parameter, and it is incorporated into the game as a constraint. We show that under certain conditions, our evolutionary game dynamic admits a unique solution. Finally, we computationally investigate one- and two-dimensional problems.
- [4] arXiv:2604.20031 [pdf, html, other]
Title: Decision-Focused Federated Learning Under Heterogeneous Objectives and Constraints
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
We consider what we refer to as the Decision-Focused Federated Learning (DFFL) framework, i.e., a predict-then-optimize approach employed by a collection of agents, where each agent's predictive model is an input to a downstream linear optimization problem, and no direct exchange of raw data is allowed. Importantly, clients can differ both in objective functions and in feasibility constraints. We build on the well-known SPO+ approach and develop heterogeneity bounds for the SPO+ surrogate loss in this case. This is accomplished by employing a support function representation of the feasible region, separating (i) objective shift via norm distances between the cost vectors and (ii) feasible-set shift via shape distances between the constraint sets. In the case of strongly convex feasible regions, sharper bounds are derived due to the optimizer stability. Building on these results, we define a heuristic local-versus-federated excess risk decision rule which, under SPO+ risk, gives a condition for when federation can be expected to improve decision quality: the heterogeneity penalty must be smaller than the statistical advantage of pooling data. We implement a FedAvg-style DFFL set of experiments on both polyhedral and strongly convex problems and show that federation is broadly robust in the strongly convex setting, while performance in the polyhedral setting degrades primarily with constraint heterogeneity, especially for clients with many samples. In other words, especially for the strongly convex case, an approach following a direct implementation of FedAvg and SPO+ can still yield promising performance even when the downstream optimization problems are noticeably different.
- [5] arXiv:2604.20107 [pdf, html, other]
Title: A Benchmark of 25 Nonlinear Functions with Domain-Induced Discontinuity for Global Optimization
Subjects: Optimization and Control (math.OC)
A benchmark of 25 nonlinear optimization problems with domain-induced discontinuity is proposed to support the performance evaluation of global optimization algorithms under feasibility-scarce and structurally discontinuous landscapes. Referred to as the CPC Benchmark (Challenging Problems for Computation), the test suite consists of functions that are continuous on their natural domains, while infeasible regions and undefined evaluations are implicitly embedded in the objective, creating substantial challenges for global minimization. Six representative algorithms from diverse methodological paradigms are assessed to examine the structural complexity and discriminative capability of the benchmark. Numerical results show that many functions possess extremely small feasible regions and strong precision sensitivity near feasibility boundaries, complicating initialization, feasibility discovery, and reliable objective assessment. The findings demonstrate that the CPC benchmark provides clear discriminative power across algorithmic paradigms and offers a rigorous, software-oriented testbed for advancing research in global optimization.
- [6] arXiv:2604.20147 [pdf, html, other]
Title: Robust Out-of-Distribution Stochastic Optimization
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
Data-driven decision-making under uncertainty typically presumes the collection of historical data from an unknown target probability distribution. However, one may have no access to any data from the target distribution prior to decision-making. To address this challenge, we propose robust out-of-distribution stochastic optimization, a novel data-driven framework that effectively utilizes relevant data distributions for robust decision-making under unseen distributions. A key feature of our framework is that all data distributions are assumed to be randomly generated from a meta-distribution over distributions. To describe uncertainty in distribution generation, we propose to learn a data-driven uncertainty set in a reproducing kernel Hilbert space (RKHS) from relevant data distributions, with adjustable conservatism. We then incorporate this set into a min-max stochastic program to derive robust decisions. Notably, under randomness of distribution generation, we establish rigorous out-of-distribution generalization guarantees for the uncertainty set as well as the solution. To ease problem-solving in RKHS, an approximate parametrization with a provably bounded suboptimality and a row generation strategy are presented. Extensive numerical experiments on multi-item newsvendor and portfolio optimization demonstrate the superior out-of-distribution performance of our decision-making framework under unseen data distribution, even when only a small or moderate number of relevant sources are available.
- [7] arXiv:2604.20433 [pdf, html, other]
Title: On Reward-Balancing Methods for Reinforcement Learning
Subjects: Optimization and Control (math.OC); Systems and Control (eess.SY)
This paper investigates the so-called reward-balancing methods, a novel class of algorithms for solving discounted-return reinforcement learning (RL) problems. These methods consist of iteratively adjusting the reward function to transform the RL problem into an equivalent one in which the optimal policies are greedy. For this procedure, referred to as the normalization process, we provide a theoretical analysis of the involved transformations, emphasizing their algebraic structure. Then, we introduce a control-theoretic reformulation, recasting the reward-balancing procedure into an optimal control framework. The approach is further extended to address model uncertainty through stochastic model sampling, yielding normalization guarantees and probabilistic bounds on stochastic fluctuations. Using the proposed optimal control framework within a scenario model predictive control (MPC) setting, we demonstrate, through simulation studies, performance improvements over the current state-of-the-art.
- [8] arXiv:2604.20506 [pdf, html, other]
Title: A unified framework for inexact adaptive stepsizes in the gradient methods, the conjugate gradient methods and the quasi-Newton methods for strictly convex quadratic optimization
Subjects: Optimization and Control (math.OC)
Inexact adaptive stepsizes for the conjugate gradient method and the quasi-Newton method are very rare. The exact stepsizes in the gradient method, the conjugate gradient method and the quasi-Newton method for strictly convex quadratic optimization have a unified framework, whereas a unified framework for inexact adaptive stepsizes in these three methods remains unknown. Based on these observations, we propose a unified framework for inexact adaptive stepsizes in the gradient method, the conjugate gradient method and the quasi-Newton method for strictly convex quadratic optimization, which is called the approximately optimal stepsize. The global convergence and the convergence rate of the gradient method with the approximately optimal stepsize are established by exploring the relation between the approximately optimal stepsize and the well-known Barzilai-Borwein (BB) stepsizes. Numerical results are presented, which confirm the numerical advantage of the gradient method, the conjugate gradient method and the quasi-Newton method with the unified framework for inexact adaptive stepsizes. Some open problems about the gradient method, the conjugate gradient method and the quasi-Newton method with the approximately optimal stepsize are raised.
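For orientation, the classical BB stepsize that this abstract relates its framework to can be sketched as follows. This is a minimal illustration of the standard BB1 rule on a strictly convex quadratic $f(x)=\tfrac{1}{2}x^\top A x - b^\top x$, not the paper's approximately optimal stepsize; the function name `bb_gradient` and the choice of an exact first step are assumptions made for the example.

```python
import numpy as np

def bb_gradient(A, b, x0, tol=1e-10, max_iter=1000):
    """Gradient method with the classical BB1 stepsize (illustrative sketch).

    Minimizes 0.5 * x^T A x - b^T x for a symmetric positive definite A.
    """
    x = np.asarray(x0, dtype=float)
    g = A @ x - b                      # gradient of the quadratic
    if np.linalg.norm(g) <= tol:
        return x, 0
    # First iteration: exact steepest-descent stepsize (an assumption of this sketch).
    alpha = (g @ g) / (g @ (A @ g))
    for k in range(max_iter):
        if np.linalg.norm(g) <= tol:
            return x, k
        x_new = x - alpha * g
        g_new = A @ x_new - b
        s = x_new - x                  # change in iterate
        y = g_new - g                  # change in gradient, equals A @ s
        alpha = (s @ s) / (s @ y)      # BB1 stepsize; s @ y > 0 since A is SPD
        x, g = x_new, g_new
    return x, max_iter

A = np.array([[4.0, 1.0], [1.0, 3.0]])
b = np.array([1.0, 2.0])
x_star, iters = bb_gradient(A, b, x0=np.zeros(2))
```

The BB1 stepsize needs only the two most recent iterates and gradients, which is why it is a natural reference point for inexact adaptive rules.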
- [9] arXiv:2604.20532 [pdf, html, other]
Title: Reliability as a Design Principle: A Systematic Review and Integrated Framework for Renewable-Based Microgrids
Comments: Accepted by Energy Conversion and Management: X, April 17, 2026
Journal-ref: Energy Conversion and Management: X, 2026
Subjects: Optimization and Control (math.OC)
Reliable operation is a central motivation for deploying renewable-based microgrids. This paper presents a systematic rapid review that positions reliability as the central organizing principle for microgrid design. Specifically, this review systematically synthesizes recent literature to examine how planning assumptions, optimization formulations, operational flexibility mechanisms, and reliability assessment frameworks jointly shape reliability outcomes. The synthesis shows that reliability in renewable-based microgrids is governed primarily by chronological, time-coupled energy adequacy rather than installed capacity alone, with Dunkelflaute events emerging as a key determinant of adequacy failure. Reliability outcomes are shaped by the joint interaction of resource portfolios, storage operating policies, and state trajectories, network features, and protection feasibility under inverter-dominated operation. The review further demonstrates that reliability indices inherited from conventional power systems are poorly suited for renewable-based microgrids, as they compress performance into single dimensions and obscure temporal, spatial, and service-critical risk concentrations. Across optimization practice, reliability is increasingly embedded through multi-objective and constrained formulations; however, persistent gaps remain in representing correlated renewable scarcity, mission-profile-dependent component reliability, and interruption valuation (e.g., value of lost load and customer damage functions) in a consistent and decision-relevant manner. Overall, this review consolidates planning factors, optimization approaches, reliability evaluation methods, and metric suitability into an integrated roadmap for reliability-centered microgrid planning, and outlines future directions toward state-aware, service-oriented planning and assessment frameworks.
- [10] arXiv:2604.20657 [pdf, html, other]
Title: Importance Sampling in Expensive Finite-Sum Optimization via Contextual Bandit Methods
Subjects: Optimization and Control (math.OC)
In computational science workflows, it is often the case that 1) objective functions for optimization involve multiple simulation outputs, and 2) those simulations can be performed (at least partially) in parallel. In this work, we reexamine past work on a class of randomized algorithms, stochastic average model (SAM) methods. SAM methods are conceptually similar to stochastic average gradient methods, and effectively require that only randomized subsets of simulation outputs be locally modeled in each iteration of a model-based optimization method. This work focuses on the question of how best to perform this randomization of subset selection, especially in settings where there exists useful side information such as alternative lower-fidelity simulations, pre-trained emulators or domain expertise from humans or AI models. In particular, we consider the problem of generating sampling distributions for SAM methods as a contextual bandit problem and we apply the Exponential weights algorithm for Exploration and Exploitation with Experts (Exp4). We provide some preliminary numerical results on synthetic problems.
- [11] arXiv:2604.20724 [pdf, other]
Title: Evaluation of Various Objective Functions for Optimal Reactive Power Flow Including Transformer Tap Changer Optimisation
Comments: Personal copy supplementing a paper submitted to the 10th Hybrid Power Plants & Systems Workshop 2026
Subjects: Optimization and Control (math.OC)
Modern distribution grids with high penetration of renewable generation provide substantial flexibility through distributed reactive power sources and transformer tap changers. This high degree of freedom can be exploited for optimisation. However, choosing an objective function for optimisation is not trivial: for example, minimising grid losses may lead to overvoltages, and minimising voltage deviations may lead to higher reactive power flows to neighbouring system operators. Thus, this paper deals with the design of an objective function for the centralised optimisation of distributed reactive power sources and transformer tap changers. Different objectives for characteristic network quantities are investigated for the optimisation and optimised in a combined manner and separately. The consequences of optimising conflicting target values are then analysed. For the optimisation, various grid usage cases of a 110 kV benchmark power grid from SimBench are examined. The investigated power grid is characterised by a high proportion of renewable energy plants. The optimisation is carried out in a data-driven, object-oriented manner using the interior point method with open source software. At the end of the paper, a meaningful optimisation function with combined weighted objectives is derived and the results are analysed.
- [12] arXiv:2604.20832 [pdf, html, other]
Title: Solving Minimax Problems with Bilinear Objectives with ADMM
Comments: 9 pages, 1 figure (color)
Subjects: Optimization and Control (math.OC); Methodology (stat.ME)
We consider minimax (saddle-point) problems of the form $\max_{c \in C} \min_{\beta \in S} g(c; \beta)$, where $C$ and $S$ are compact convex sets, and $g$ is concave-convex. Applying the Alternating Direction Method of Multipliers (ADMM) requires evaluating a proximal operator that is, in general, as hard as the original problem. We show that when the outcome function $g$ is bilinear, i.e. $g(c; \beta) = c^\top A \beta$, the proximal operator reduces to a generalized projection onto the confidence region $S$. This reduction is exact -- it involves no approximation or linearization. The resulting ADMM algorithm alternates between (i) a generalized projection onto $S$ and (ii) a Euclidean projection onto $C$. We describe the derivation, state the algorithm, and discuss convergence.
New submissions (showing 12 of 12 entries)
- [13] arXiv:2604.20109 (cross-list from cs.LG) [pdf, html, other]
Title: Learning to Solve the Quadratic Assignment Problem with Warm-Started MCMC Finetuning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC)
The quadratic assignment problem (QAP) is a fundamental NP-hard task that poses significant challenges for both traditional heuristics and modern learning-based solvers. Existing QAP solvers still struggle to achieve consistently competitive performance across structurally diverse real-world instances. To bridge this performance gap, we propose PLMA, an innovative permutation learning framework. PLMA features an efficient warm-started MCMC finetuning procedure to enhance deployment-time performance, leveraging short Markov chains to anchor the adaptation to the promising regions previously explored. For rapid exploration via MCMC over the permutation space, we design an additive energy-based model (EBM) that enables an $O(1)$-time 2-swap Metropolis-Hastings sampling step. Moreover, the neural network used to parameterize the EBM incorporates a scalable and flexible cross-graph attention mechanism to model interactions between facilities and locations in the QAP. Extensive experiments demonstrate that PLMA consistently outperforms state-of-the-art baselines across various benchmarks. In particular, PLMA achieves a near-zero average optimality gap on QAPLIB, exhibits remarkably superior robustness on the notoriously difficult Taixxeyy instances, and also serves as an effective QAP solver in bandwidth minimization.
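To illustrate why an additive energy permits a constant-time 2-swap Metropolis-Hastings step, here is a minimal sketch. It assumes an energy of the form $E(\pi)=\sum_i W[i][\pi(i)]$ with a fixed score matrix `W`; that form, the temperature parameter, and all function names are hypothetical and are not taken from PLMA. Swapping two positions changes only two terms of the sum, so the energy difference costs four table lookups regardless of the problem size.

```python
import math
import random

def energy(W, perm):
    """Additive energy: sum of W[i][perm[i]] over all positions (O(n))."""
    return sum(W[i][perm[i]] for i in range(len(perm)))

def swap_delta(W, perm, i, j):
    """Energy change from swapping perm[i] and perm[j], in O(1)."""
    return (W[i][perm[j]] + W[j][perm[i]]) - (W[i][perm[i]] + W[j][perm[j]])

def mh_two_swap(W, perm, n_steps, temperature=1.0, seed=0):
    """Run a 2-swap Metropolis-Hastings chain targeting exp(-E / temperature)."""
    rng = random.Random(seed)
    perm = list(perm)
    e = energy(W, perm)                     # computed once; updated incrementally
    for _ in range(n_steps):
        i, j = rng.sample(range(len(perm)), 2)   # symmetric proposal
        d = swap_delta(W, perm, i, j)
        # Metropolis acceptance: always accept downhill, uphill with prob exp(-d/T).
        if d <= 0 or rng.random() < math.exp(-d / temperature):
            perm[i], perm[j] = perm[j], perm[i]
            e += d
    return perm, e

W = [[3.0, 1.0, 2.0], [2.0, 4.0, 0.5], [1.5, 2.5, 3.5]]
final_perm, final_e = mh_two_swap(W, [0, 1, 2], n_steps=200, temperature=0.5)
```

Because the proposal is symmetric, the acceptance ratio depends only on the energy difference, which is what keeps each step constant-time under the additive assumption.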
- [14] arXiv:2604.20137 (cross-list from cs.CG) [pdf, html, other]
Title: Optimization of Constrained Quasiconformal Mapping for Origami Design
Subjects: Computational Geometry (cs.CG); Optimization and Control (math.OC)
Origami structures, particularly Miura-ori patterns, offer unique capabilities for surface approximation and deployable designs. In this study, a constrained mapping optimization algorithm is developed for designing surface-aligned Miura-ori via a narrow band approximation of the input surface. The Miura-fold, embedded in the narrow band, is parameterized to a planar domain, and a mapping is computed on the parameter pattern by optimizing certain energy terms and constraints. Extensive experiments are conducted, showing the significance and flexibility of our methods.
- [15] arXiv:2604.20369 (cross-list from cs.IT) [pdf, html, other]
Title: Rate-Cost Tradeoffs in Nonlinear Control
Comments: 11 pages, 5 figures
Subjects: Information Theory (cs.IT); Systems and Control (eess.SY); Optimization and Control (math.OC)
We study the rate-cost tradeoff in rate-limited control of general stochastic control systems, including nonlinear systems, over a finite horizon. At each time step, an encoder observes the state and transmits a description to a controller, which then selects the control action. For an average control-cost threshold $D$, we characterize the minimum achievable communication rate $R_n(D)$ via a nonasymptotic bound: $R_n(D)$ lies within an additive logarithmic gap of the optimal value of a directed-information minimization $F_n(D)$, namely, we show that $F_n(D) \le R_n(D) \le F_n(D)+\log \bigl(F_n(D)+3.4\bigr)+2+\frac{1}{n}$, in bits. This establishes directed information as the operationally relevant quantity governing rate-limited control, thereby broadening its utility beyond its previously established roles in causal source coding and linear quadratic Gaussian (LQG) control to general nonlinear control systems. We prove the upper bound constructively by building an encoding-and-control policy using the strong functional representation lemma at each time step. As special cases of our setting, our framework yields nonasymptotic bounds for sequential (causal) rate-distortion and LQG control.
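To get a feel for the size of the additive gap in the stated bound, the two sides can be evaluated numerically. The sketch below assumes the logarithm is base 2 (suggested by the phrase "in bits", but not stated explicitly in the abstract) and takes a value of $F_n(D)$ as given; the function name `rate_bounds` and the sample inputs are hypothetical.

```python
import math

def rate_bounds(f_n, n):
    """Return (lower, upper) bounds on R_n(D) given F_n(D) = f_n, in bits.

    Assumes log base 2; the base is an assumption of this sketch.
    """
    lower = f_n
    upper = f_n + math.log2(f_n + 3.4) + 2.0 + 1.0 / n
    return lower, upper

lower, upper = rate_bounds(f_n=10.0, n=100)
gap = upper - lower   # additive gap, logarithmic in F_n(D)
```

The gap grows only logarithmically in $F_n(D)$, so its relative size shrinks as the directed-information value increases.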
- [16] arXiv:2604.20509 (cross-list from eess.SY) [pdf, html, other]
Title: Approximate Simulation-based Hierarchical Control of Nonlinear Systems
Comments: 14 Pages
Subjects: Systems and Control (eess.SY); Optimization and Control (math.OC)
Controlling complex dynamical systems to satisfy sophisticated specifications remains a significant challenge in modern engineering. A promising approach to this problem is the approximate simulation-based hierarchical control (ASHC) technique. In this method, a simplified representation of the complex system, called the abstract system, is first designed and controlled. An interface function is then designed to translate the control law into the input of the complex system, thereby achieving approximate control synthesis. However, most existing results in ASHC are only for linear systems. This paper proposes a constructive method for solving the ASHC problem for nonlinear systems. To this end, we propose invariance equation-based methods to achieve the two classical requirements of the ASHC technique, namely the bounded output discrepancy and the $m$-relation. We then study the solvability conditions of the problem and summarise the overall design procedures. We illustrate the results with a practical example, providing step-by-step solutions to the ASHC problem of a DC-to-DC Ćuk converter.
- [17] arXiv:2604.20517 (cross-list from math.DS) [pdf, html, other]
Title: Bounding Transient Instability in Sensor Data Injected Nonlinear Stochastic Flight Dynamics
Subjects: Dynamical Systems (math.DS); Optimization and Control (math.OC); Computation (stat.CO)
Transient instability in nonlinear stochastic dynamical systems is a fundamental limitation in safety-critical aerospace applications, particularly during powered descent and landing where failure is driven by finite-time excursions rather than asymptotic divergence. Classical notions of mean-square or asymptotic stability are therefore insufficient for certification and design. This paper develops a logarithmic-norm-based framework for finite-time transient stability analysis of nonlinear Ito stochastic differential equations. The approach extends matrix measures to nonlinear mappings in a Lipschitz sense, enabling efficient characterization of instantaneous perturbation growth without local linearization. Using Ito calculus, bounds on the mean and variance of transient growth are derived, providing conditions for non-positive finite-time mean growth and probabilistic bounds on instability events. The analysis highlights a key distinction between mean and sample-path behavior, showing that stability in expectation does not guarantee pathwise finite-time safety, and that almost-sure transient stability cannot generally be ensured under stochastic diffusion. The framework is extended to data-constrained stochastic dynamics in navigation and estimation, revealing a trade-off between estimation consistency and transient robustness due to continuous data injection. Demonstrations with flight-like lunar lander telemetry show that similar mean trajectories can exhibit significantly different transient stability behaviour, and that mission failure correlates with accumulation of transient instability over short critical intervals. These results motivate probabilistic finite-time stability metrics for safety-critical autonomous systems.
- [18] arXiv:2604.20614 (cross-list from cs.LG) [pdf, html, other]
Title: Too Sharp, Too Sure: When Calibration Follows Curvature
Comments: 33 pages, 23 figures
Subjects: Machine Learning (cs.LG); Dynamical Systems (math.DS); Optimization and Control (math.OC); Machine Learning (stat.ML)
Modern neural networks can achieve high accuracy while remaining poorly calibrated, producing confidence estimates that do not match empirical correctness. Yet calibration is often treated as a post-hoc attribute. We take a different perspective: we study calibration as a training-time phenomenon on small vision tasks, and ask whether calibrated solutions can be obtained reliably by intervening on the training procedure. We identify a tight coupling between calibration, curvature, and margins during training of deep networks under multiple gradient-based methods. Empirically, Expected Calibration Error (ECE) closely tracks curvature-based sharpness throughout optimization. Mathematically, we show that both ECE and Gauss--Newton curvature are controlled, up to problem-specific constants, by the same margin-dependent exponential tail functional along the trajectory. Guided by this mechanism, we introduce a margin-aware training objective that explicitly targets robust-margin tails and local smoothness, yielding improved out-of-sample calibration across optimizers without sacrificing accuracy.
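For readers unfamiliar with the metric, Expected Calibration Error is commonly estimated by binning predictions by confidence and averaging the per-bin gap between accuracy and mean confidence, weighted by bin size. The sketch below uses equal-width bins; the bin count, the right-closed binning convention, and the function name are assumptions of this example and may differ from the estimator used in the paper.

```python
import numpy as np

def expected_calibration_error(confidences, correct, n_bins=10):
    """Equal-width-bin ECE estimate (illustrative sketch).

    confidences: predicted top-class probabilities in [0, 1]
    correct:     1 if the prediction was right, else 0
    """
    confidences = np.asarray(confidences, dtype=float)
    correct = np.asarray(correct, dtype=float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    # Interior edges give bin indices 0..n_bins-1 with right-closed bins.
    idx = np.digitize(confidences, edges[1:-1], right=True)
    ece = 0.0
    for b in range(n_bins):
        mask = idx == b
        if mask.any():
            weight = mask.mean()                  # fraction of samples in the bin
            accuracy = correct[mask].mean()
            avg_conf = confidences[mask].mean()
            ece += weight * abs(accuracy - avg_conf)
    return float(ece)

ece_overconfident = expected_calibration_error([0.9, 0.9, 0.9, 0.9], [1, 1, 1, 0])
ece_calibrated = expected_calibration_error([1.0, 1.0], [1, 1])
```

In the first call the model reports 0.9 confidence but is right three times out of four, so the estimate reflects overconfidence; in the second call confidence and accuracy agree.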
- [19] arXiv:2604.20827 (cross-list from math.PR) [pdf, html, other]
Title: Failure of ambient closed-set large-deviation upper bounds in entropic optimal transport
Subjects: Probability (math.PR); Optimization and Control (math.OC)
Large-deviation upper bounds on compact sets do not, in general, extend to arbitrary closed sets without additional tightness. We show that this obstruction already occurs in static entropic optimal transport. More precisely, we construct a fixed-cost model with continuous cost and nonatomic marginals for which the entropic minimisers converge in total variation to an optimal plan with noncompact support, the known compact-set upper bound remains valid, but the corresponding closed-set upper bound fails on a specific closed subset of the ambient space. For a fixed closed set, we identify the exact tail criterion for passing from compact to closed sets. We show that there does not exist a full large-deviation principle (LDP) on the ambient space at speed $1/\varepsilon$ with an arbitrary lower semicontinuous rate function.
Cross submissions (showing 7 of 7 entries)
- [20] arXiv:2501.17350 (replaced) [pdf, html, other]
Title: On Min-Max Robust Data-Driven Predictive Control Considering Non-Unique Solutions to Behavioral Representation
Comments: 8 pages
Subjects: Optimization and Control (math.OC)
Direct data-driven control methods are known to be vulnerable to uncertainty in stochastic systems. In this paper, we propose a new robust data-driven predictive control (DDPC) framework. By analyzing non-unique solutions to behavioral representation, we gain insight into the inherent lack of robustness in subspace predictive control (SPC) and its projection-based regularized variant. This stimulates us to construct an uncertainty set that captures all admissible output trajectories deviating from nominal subspace predictions, which results in a min-max robust formulation of DDPC that endows control sequences with robustness against such unknown deviations. We establish theoretical performance guarantees under bounded additive noise and develop tractable convex reformulations. To mitigate the conservatism of robust design, a feedback robust DDPC scheme is further proposed by incorporating an affine feedback policy. Simulation studies show that the proposed methods effectively robustify SPC and outperform the projection-based regularization.
- [21] arXiv:2503.08927 (replaced) [pdf, html, other]
Title: Ensemble optimal control for managing drug resistance in cancer therapies
Comments: 34 pages, 7 figures, 7 tables. In Section 2 a broader class of models is considered; Correction of typos and bibliography extension
Subjects: Optimization and Control (math.OC); Numerical Analysis (math.NA)
In this paper, we explore the application of ensemble optimal control to derive enhanced strategies for pharmacological cancer treatment, and we tackle the problem of the long-term management of the disease, i.e., when the complete eradication of the tumor is not achievable. In particular, we focus on moving beyond the classical clinical approach of giving the patient the maximal tolerated drug dose (MTD), which does not properly exploit the fight among sensitive and resistant cells for the available resources. Here, we employ a Lotka-Volterra model to describe the competing subpopulations, and we enclose this system within the ensemble control framework. In the first part, we establish general results suitable for application to various cancers. Then, we carry out numerical simulations in the setting of prostate cancer treated with androgen deprivation therapy, yielding a computed policy that is reminiscent of the medical 'active surveillance' paradigm. Finally, inspired by the numerical evidence, we propose a variant of the celebrated adaptive therapy (AT), which we call 'Off-On' AT.
- [22] arXiv:2505.13106 (replaced) [pdf, other]
Title: How to optimise tournament draws: The case of the FIFA World Cup
Comments: 32 pages, 8 figures, 6 tables
Journal-ref: International Transactions in Operational Research, 2026, forthcoming
Subjects: Optimization and Control (math.OC); Physics and Society (physics.soc-ph); Applications (stat.AP)
The organisers of major sports competitions use different policies with respect to constraints in the group draw. Our paper aims to rationalise these choices by analysing the trade-off between attractiveness (the number of games played by teams from the same geographic zone) and fairness (the departure of the draw mechanism from a uniform distribution). A parametric optimisation model is formulated and applied to the 2018 and 2022 FIFA World Cup draws. A flaw of the draw procedure is identified: the pre-assignment of the host to a group unnecessarily increases the distortions. All Pareto efficient sets of draw constraints are determined via simulations. The proposed framework can be used to find the optimal draw rules and justify the non-uniformity of the draw procedure for the stakeholders.
- [23] arXiv:2506.20910 (replaced) [pdf, html, other]
Title: Faster Fixed-Point Methods for Multichain MDPs
Journal-ref: NeurIPS 2025
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
We study value-iteration (VI) algorithms for solving general (a.k.a. multichain) Markov decision processes (MDPs) under the average-reward criterion, a fundamental but theoretically challenging setting. Beyond the difficulties inherent to all average-reward problems posed by the lack of contractivity and non-uniqueness of solutions to the Bellman operator, in the multichain setting an optimal policy must solve the navigation subproblem of steering towards the best connected component, in addition to optimizing long-run performance within each component. We develop algorithms which better solve this navigational subproblem in order to achieve faster convergence for multichain MDPs, obtaining improved rates of convergence and sharper measures of complexity relative to prior work. Many key components of our results are of potential independent interest, including novel connections between average-reward and discounted problems, optimal fixed-point methods for discounted VI which extend to general Banach spaces, new sublinear convergence rates for the discounted value error, and refined suboptimality decompositions for multichain MDPs. Overall our results yield faster convergence rates for discounted and average-reward problems and expand the theoretical foundations of VI approaches.
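The navigation subproblem mentioned in the abstract can be illustrated with plain undiscounted value iteration on a toy multichain MDP. This is a minimal sketch and not the paper's algorithm: the three-state MDP, the `transitions` table, and the function names are all hypothetical. State 0 must choose which absorbing component to enter, and the per-state gain is estimated as the n-th iterate divided by n.

```python
# Hypothetical 3-state multichain MDP (illustrative only, not from the paper).
# transitions[state][action] = (reward, {next_state: probability})
transitions = {
    0: {"to_1": (0.0, {1: 1.0}), "to_2": (0.0, {2: 1.0})},
    1: {"stay": (1.0, {1: 1.0})},  # absorbing component with reward 1 per step
    2: {"stay": (2.0, {2: 1.0})},  # absorbing component with reward 2 per step
}


def bellman_update(v):
    """Apply the undiscounted Bellman optimality operator once."""
    return {
        s: max(
            r + sum(p * v[t] for t, p in nxt.items())
            for r, nxt in actions.values()
        )
        for s, actions in transitions.items()
    }


def estimate_gain(n_iters):
    """Estimate the optimal average reward per state as v_n / n, with v_0 = 0."""
    v = {s: 0.0 for s in transitions}
    for _ in range(n_iters):
        v = bellman_update(v)
    return {s: v[s] / n_iters for s in transitions}


gain = estimate_gain(1000)
print(gain)
```

In this toy instance the estimate for state 0 approaches the gain of the better component (state 2), while state 1 keeps its own lower gain, so the optimal gain is not constant across states. The estimate for state 0 is off by a term of order 1/n, which is the kind of slow convergence that motivates faster methods.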
- [24] arXiv:2508.18435 (replaced) [pdf, html, other]
Title: A second-order cone representable class of nonconvex quadratic programs
Subjects: Optimization and Control (math.OC)
We consider the problem of minimizing a sparse nonconvex quadratic function over the unit hypercube. By developing an extension of the Reformulation-Linearization Technique (RLT) to continuous quadratic sets, we propose a novel second-order cone (SOC) representable relaxation for this problem. By exploiting the sparsity of the quadratic function, we establish a sufficient condition under which the convex hull of the feasible region of the lifted quadratic program is SOC-representable. While the proposed formulation may be of exponential size in general, we identify additional structural conditions that guarantee the existence of a polynomial-size SOC-representable formulation, which can be constructed in polynomial time. Under these conditions, the optimal value of the nonconvex quadratic program coincides with that of a polynomial-size second-order cone program. Our results serve as a starting point for bridging the gap between the Boolean quadric polytope of sparse problems and its continuous counterpart.
- [25] arXiv:2601.16683 (replaced) [pdf, html, other]
Title: Projected Gradient Methods with Momentum
Subjects: Optimization and Control (math.OC)
We consider optimization problems with smooth, possibly nonconvex objectives and a convex constraint set for which the Euclidean projection is practically available. In this setting, we carry out a general convergence and complexity analysis for algorithmic frameworks. Consequently, we discuss theoretically sound strategies to integrate momentum information within classical projected gradient type algorithms. One of these approaches is then developed in detail, up to the definition of a tailored algorithm with both theoretical guarantees and reasonable per-iteration cost. The proposed method is finally shown to outperform the standard (spectral) projected gradient method in two different experimental benchmarks, indicating that the addition of momentum terms is as beneficial in the constrained setting as it is in the unconstrained scenario.
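One generic way to combine momentum with a projected gradient step can be sketched as follows. This is only an illustration under stated assumptions and not the tailored algorithm developed in the paper: the heavy-ball extrapolation, the fixed step size `alpha`, the momentum weight `beta`, the box constraint, and the simple convex quadratic objective are all choices made here so that the result is easy to check by hand.

```python
def project_box(x, lo=0.0, hi=1.0):
    """Euclidean projection onto the box [lo, hi]^n (componentwise clipping)."""
    return [min(hi, max(lo, xi)) for xi in x]


def grad_f(x, c):
    """Gradient of the illustrative objective f(x) = 0.5 * ||x - c||^2."""
    return [xi - ci for xi, ci in zip(x, c)]


def projected_gradient_momentum(x0, c, alpha=0.5, beta=0.3, n_iters=50):
    """Heavy-ball style iteration: extrapolate, take a gradient step, project."""
    x_prev = list(x0)
    x = list(x0)
    for _ in range(n_iters):
        g = grad_f(x, c)
        # Momentum extrapolation followed by a gradient step from the current point.
        y = [xi + beta * (xi - xpi) - alpha * gi
             for xi, xpi, gi in zip(x, x_prev, g)]
        x_prev, x = x, project_box(y)
    return x


# The unconstrained minimizer c lies outside the box, so the constrained
# solution is the projection of c onto [0, 1]^2, namely (1, 0).
x_star = projected_gradient_momentum(x0=[0.5, 0.5], c=[2.0, -1.0])
print(x_star)
```

Setting `beta=0.0` recovers the classical projected gradient method, which makes the sketch convenient for side-by-side comparisons on small problems.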
- [26] arXiv:2603.26503 (replaced) [pdf, html, other]
Title: The adjoint state method for parametric definable optimization without smoothness or uniqueness
Comments: 27 pages, 1 figure
Subjects: Optimization and Control (math.OC); Numerical Analysis (math.NA)
We establish that nonconvex definable parametric optimization problems with possibly nonsmooth objectives, inequality constraints, conic constraint systems, and non-unique primal and dual solutions admit an adjoint state formula under a mere qualification condition. The adjoint construction yields a selection of a conservative field for the value function, providing a computable first-order object without requiring differentiation of the solution mapping. Through examples, we show that even in smooth problems, the formal adjoint construction fails without conservativity or definability, illustrating the relevance of these concepts to grasp theoretical aspects of the method. This work provides a tool which can be directly combined with existing primal-dual solvers for a wide range of parametric optimization problems.
- [27] arXiv:2604.18726 (replaced) [pdf, other]
Title: CCOpt: an Open-Source Solver for Large-Scale Mathematical Programs with Complementarity Constraints
Comments: Submitted to Mathematical Programming Computation. Typo in abstract causing accidental URL parsing corrected
Subjects: Optimization and Control (math.OC)
This paper presents the Julia package CCOpt, built on top of the interior-point solver MadNLP. CCOpt implements a suite of algorithms for Mathematical Programs with Complementarity Constraints (MPCCs). The solver additionally comes with interfaces for use in Matlab, Python, and C++. MPCCs have recently gained renewed attention in engineering optimization, as complementarity provides a powerful modeling tool for nonsmooth functions and logical conditions. These problems are inherently challenging since their nonlinear programming reformulations violate classical regularity conditions at all feasible points, complicating both theoretical analysis and numerical treatment. Consequently, specialized algorithms are required to handle this degeneracy, and several approaches have been proposed. We implement a toolbox of methods, including relaxation and penalty approaches, as well as a crossover to recently proposed active-set methods. Our solver is based on nonlinear interior-point algorithms that couple the relaxation or penalty parameter with the barrier parameter, yielding substantial speedups compared to standard implementations. Both monotone and nonmonotone strategies for updating this joint parameter are proposed and investigated. In addition, we propose regularization techniques that improve the conditioning of the KKT system for small relaxation parameters, enhancing robustness and computational efficiency. The implementation is validated on the classical MacMPEC benchmark, large-scale problems in security-constrained optimal power flow, optimal control of nonsmooth systems, as well as on quadratic programs with complementarity constraints arising in model predictive control. This benchmarking reveals an algorithmically driven improvement, often of an entire order of magnitude, over other methods, including commercial solvers.
- [28] arXiv:2409.08347 (replaced) [pdf, html, other]
Title: Sensitivity analysis of the perturbed utility stochastic traffic equilibrium
Subjects: Econometrics (econ.EM); Computer Science and Game Theory (cs.GT); Optimization and Control (math.OC)
This paper develops a sensitivity analysis framework for the perturbed utility route choice (PURC) model and the accompanying stochastic traffic equilibrium model. We derive analytical sensitivity expressions for the Jacobian of the individual optimal PURC flow and equilibrium link flows with respect to link cost parameters under general assumptions. This allows us to determine the marginal change in link flows following a marginal change in link costs across the network. We show how to implement these results while exploiting the sparsity generated by the PURC model. Numerical examples illustrate the use of our method for estimating equilibrium link flows after link cost shifts, identifying critical design parameters, and quantifying uncertainty in performance predictions. Finally, we demonstrate the method in a large-scale example. The findings have implications for network design, pricing strategies, and policy analysis in transportation planning and economics, providing a bridge between theoretical models and real-world applications.
- [29] arXiv:2504.09657 (replaced) [pdf, html, other]
Title: Online Aging-Aware Energy Optimization for Vehicle-Home-Grid Integration
Comments: Accepted for publication in the proceedings of the 2026 IFAC World Congress
Subjects: Systems and Control (eess.SY); Optimization and Control (math.OC)
This paper investigates the economic impact of vehicle-home-grid integration through an online optimization algorithm that manages energy flows between an electric vehicle, a household, and the electrical grid. The algorithm exploits vehicle-to-home (V2H) for self-consumption and vehicle-to-grid (V2G) for energy trading, adapting in real-time via a hybrid long short-term memory (LSTM) network for household load prediction and a nonlinear battery degradation model including cycle and calendar aging. Simulations show annual economic benefits up to EUR 3046.81 compared to smart unidirectional charging, despite a modest 1.96% increase in battery aging. Even under unfavorable market conditions, with no V2G revenue, V2H alone provides yearly savings of EUR 425.48. Sensitivity analyses on battery capacity, household load, and price ratios confirm the consistent benefits of bidirectional energy exchange, highlighting the role of EVs as active energy nodes for sustainable management.
- [30] arXiv:2506.20904 (replaced) [pdf, html, other]
Title: Optimal Single-Policy Sample Complexity and Transient Coverage for Average-Reward Offline RL
Journal-ref: NeurIPS 2025
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Optimization and Control (math.OC); Machine Learning (stat.ML)
We study offline reinforcement learning in average-reward MDPs, which presents increased challenges from the perspectives of distribution shift and non-uniform coverage, and has been relatively underexamined from a theoretical perspective. While previous work obtains performance guarantees under single-policy data coverage assumptions, such guarantees utilize additional complexity measures which are uniform over all policies, such as the uniform mixing time. We develop sharp guarantees depending only on the target policy, specifically the bias span and a novel policy hitting radius, yielding the first fully single-policy sample complexity bound for average-reward offline RL. We are also the first to handle general weakly communicating MDPs, contrasting restrictive structural assumptions made in prior work. To achieve this, we introduce an algorithm based on pessimistic discounted value iteration enhanced by a novel quantile clipping technique, which enables the use of a sharper empirical-span-based penalty function. Our algorithm also does not require any prior parameter knowledge for its implementation. Remarkably, we show via hard examples that learning under our conditions requires coverage assumptions beyond the stationary distribution of the target policy, distinguishing single-policy complexity measures from previously examined cases. We also develop lower bounds nearly matching our main result.
- [31] arXiv:2507.06769 (replaced) [pdf, html, other]
Title: Constraint Optimized Multichannel Mixer-limiter Design
Comments: Accepted at ICASSP 2026
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP); Optimization and Control (math.OC)
Multichannel audio mixer and limiter designs are conventionally decoupled for content reproduction over loudspeaker arrays due to high computational complexity and run-time costs. We propose a coupled mixer-limiter-envelope design formulated as an efficient linear-constrained quadratic program that minimizes a distortion objective over multichannel gain variables subject to sample mixture constraints. Novel methods for asymmetric constant overlap-add window optimization, objective function approximation, variable and constraint reduction are presented. Experiments demonstrate distortion reduction of the coupled design, and computational trade-offs required for efficient real-time processing.
- [32] arXiv:2512.10118 (replaced) [pdf, html, other]
Title: Explicit Control Barrier Function-based Safety Filters and their Resource-Aware Computation
Subjects: Systems and Control (eess.SY); Optimization and Control (math.OC)
This paper studies the efficient implementation of safety filters that are designed using control barrier functions (CBFs), which minimally modify a nominal controller to render it safe with respect to a prescribed set of states. Although CBF-based safety filters are often implemented by solving a quadratic program (QP) in real time, the use of off-the-shelf solvers for such optimization problems poses a challenge in applications where control actions need to be computed efficiently at very high frequencies. In this paper, we introduce a closed-form expression for controllers obtained through CBF-based safety filters. This expression is obtained by partitioning the state-space into different regions, with a different closed-form solution in each region. We leverage this formula to introduce a resource-aware implementation of CBF-based safety filters that detects changes in the partition region and uses the closed-form expression between changes. We showcase the applicability of our approach in examples ranging from aerospace control to safe reinforcement learning.
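For intuition, the simplest instance of such a closed-form filter is the textbook case of a single affine constraint, where the QP reduces to projecting the nominal input onto a half-space. The sketch below covers only that case and is not the paper's general expression. It assumes the constraint is written as a·u + b >= 0, with `a` and `b` as hypothetical stand-ins for the Lie-derivative terms evaluated at the current state; the two branches play the role of two partition regions.

```python
def cbf_safety_filter(u_nom, a, b):
    """Closed-form solution of  min ||u - u_nom||^2  s.t.  a . u + b >= 0.

    u_nom: nominal control input (list of floats)
    a, b:  coefficients of the single affine safety constraint (assumed given)
    """
    slack = sum(ai * ui for ai, ui in zip(a, u_nom)) + b
    if slack >= 0.0:
        # Region 1: the nominal input already satisfies the constraint.
        return list(u_nom)
    norm_sq = sum(ai * ai for ai in a)
    if norm_sq == 0.0:
        # a = 0 and b < 0: no input can satisfy the constraint.
        raise ValueError("safety constraint is infeasible at this state")
    # Region 2: project u_nom onto the boundary of the half-space.
    scale = slack / norm_sq
    return [ui - scale * ai for ui, ai in zip(u_nom, a)]


# Nominal input violates the constraint -u_0 + 0.5 >= 0, so it is corrected.
u_safe = cbf_safety_filter(u_nom=[1.0, 0.0], a=[-1.0, 0.0], b=0.5)
# Nominal input already satisfies the constraint, so it is returned unchanged.
u_same = cbf_safety_filter(u_nom=[0.2, 0.0], a=[-1.0, 0.0], b=0.5)
print(u_safe, u_same)
```

Because each branch is a handful of arithmetic operations, no QP solver is needed once the active region is known.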