Stochastic Optimization
Optimisation of objective functions that depend on random variables — covering scenario-based stochastic programming, sample-path methods, distributionally robust formulations, and the stochastic-gradient family.
Stochastic optimization is the discipline of choosing a decision that minimises (or maximises) an objective whose value depends on a random variable. Formally one studies problems of the form , where the expectation is intractable, the distribution may itself be unknown, and the feasible set may carry its own random constraints. Methodological work in the field organises around four interacting axes: how the randomness is represented (closed-form distributions, scenario trees, sample-average approximations, or distributionally robust ambiguity sets), how the expectation is evaluated (deterministic equivalents, Monte Carlo, importance sampling, surrogate models), how iterates are updated (stochastic-gradient descent and its variance-reduced and accelerated descendants, trust-region schemes, primal-dual splitting), and how risk and constraints are encoded (chance constraints, CVaR, two-stage recourse, distributionally robust counterparts). The papers below trace the recent methodological frontier across these axes; each addresses a concrete bottleneck without retreating to a narrow domain application.
Scenario representation and two-stage programming
Two-stage stochastic programming is the workhorse formulation for decisions made under uncertainty with a recourse opportunity, but its practical effectiveness depends almost entirely on the quality of the discrete scenario set used to approximate the underlying distribution. Narum (2024) attacks the scenario-generation step directly with a problem-based method that is agnostic to the specific stochastic program and to the family of distributions involved. Rather than sampling from in isolation, the method studies how the output distribution of the recourse problem varies across candidate first-stage decisions and extracts a small number of components that span this family of output distributions; scenarios are then constructed to span these components rather than the input space. The result is a scenario set whose cardinality is calibrated to the structure of the recourse problem rather than to the ambient dimensionality of , which is the operative quantity for stochastic-programming solvers.
Stochastic gradients and constrained quantum subroutines
The stochastic-gradient family generalises beyond classical convex optimisation in two directions that recent work makes concrete. Gentinetta et al. (2023) extend the Pegasos stochastic-gradient algorithm for support vector machines to the quantum setting, addressing quantum kernel alignment: given a parametrised quantum kernel and a labelled dataset, find kernel parameters whose induced SVM minimises classification risk. The Pegasos update structure transfers cleanly because it depends only on subgradient estimates of the hinge loss, which the quantum subroutine produces unbiasedly. The contribution to stochastic optimisation theory is showing that an unbiased gradient oracle is enough; the inner workings of the oracle can be quantum without breaking SGD’s convergence machinery. Le et al. (2024) tackle the dual question on the optimisation side: how to handle constraints inside a variational quantum eigensolver (VQE), which has historically been formulated as an unconstrained energy-minimisation procedure. Their VQEC framework adapts classical penalty and Lagrangian techniques to the variational quantum setting, producing a hybrid quantum-classical loop that respects equality and inequality constraints during optimisation rather than relying on post-hoc projection.
Trajectory-level objectives and behavioural connections
Most stochastic optimisation procedures evaluate candidate decisions by the expected objective. Fadikar et al. (2023) propose trajectory-oriented optimisation, in which the criterion is not the mean simulation behaviour but the typicality of full sample paths against empirical observations. The framework targets stochastic epidemiological models, where two parameter settings producing the same mean curve can produce wildly different realised trajectories, and shows that calibrating to trajectory similarity rather than to moment statistics changes the inferred parameter posterior. Methodologically this is a clean instance of moving objectives one level up the distributional hierarchy: from to a functional of the distribution of . In a complementary direction, Giron et al. (2023) ask what stochastic optimisation has to say about human learning: using behavioural data from 281 participants ages five to fifty-five, they show that developmental change in exploration is best described as joint optimisation of multiple algorithmic parameters (reward generalisation, uncertainty-directed exploration, random temperature) rather than as a single one-dimensional cooling schedule. The result reframes a long-standing analogy as a quantitative empirical claim about which parameter trajectories a real adaptive system follows.
Open methodological questions cut across the axes above. When does problem-based scenario generation provably beat distributionally robust formulations at the same sample budget? Can quantum oracles deliver variance reduction comparable to classical control-variate schemes, or is the noise floor of NISQ devices the binding constraint? And do trajectory-level objectives admit tractable surrogate gradients, or must they remain the province of derivative-free and Bayesian optimisation?
Prerequisites
Sources
- paper · primary · 2023giron-2023
- paper · primary · 2024narum-2024
-
- paper · primary · 2024le-thinh-2024
- paper · supporting · 2023fadikar-2023
In context
Where this topic sits in the prerequisite graph. Click any node to jump.
Explore
- 01
Stochastic Gradient Methods
SGD convergence theory, variance reduction, and adaptive methods.
- 02
Sample Average Approximation
Statistical guarantees for SAA in stochastic programs.
- 03
Online Convex Optimization
Regret bounds, mirror descent, and bandit settings.
- 04
Robust and Distributionally Robust Optimization
Optimization with ambiguity sets and Wasserstein-DRO.
Review this topic
This page was drafted by an agent and is waiting on expert review. Spotted a wrong prerequisite, a missing concept, a misattributed source, or a factual slip? Tell us — your review opens a tracked issue maintainers act on.