Chance-Constrained Optimization: A Review of Mixed-Integer Conic Formulations and Applications

Abstract

Chance-constrained programming (CCP) is one of the most difficult classes of optimization problems that has attracted the attention of researchers since the 1950s. In this survey, we first review recent developments in mixed-integer linear formulations of chance-constrained programs that arise from finite discrete distributions (or sample average approximation). We highlight successful reformulations and decomposition techniques that enable the solution of large-scale instances. We then review active research in distributionally robust CCP, which is a framework to address the ambiguity in the distribution of the random data. The focal point of our review is scalable formulations that can be readily implemented with state-of-the-art optimization software. However, we also discuss alternative approaches and specialized algorithms. Furthermore, we highlight the prevalence of CCPs with a review of applications across multiple domains.



Simge Küçükyavuz * Ruiwei Jiang † January 22, 2021


* Industrial Engineering and Management Sciences, Northwestern University, Evanston, IL 60208, USA.
† Industrial and Operations Engineering, University of Michigan, Ann Arbor, MI 48109, USA.
Preprint: arXiv [math.OC].

Most optimization models in practice involve problem parameters that are uncertain. Furthermore, in some cases these uncertain parameters involve risky outcomes with low probability. Therefore, requiring feasibility of a solution for every possible outcome may lead to overly conservative solutions. To remedy this, chance-constrained programming (CCP) has emerged as a powerful paradigm to model system failure/reliability considerations and to address the conservatism of a solution given a certain tolerance for risky outcomes.

For example, in power systems, production levels need to be determined so as to meet peak load (demand) [93]. This problem is complicated by uncertainties in both generator availabilities (especially with renewables) and loads. The utility company's aim is to minimize the expected cost of power production while ensuring that the loss-of-load probability (i.e., the probability that the available generator capacity is insufficient to meet the peak load) is below an acceptable reliability level [163]. In supply chain problems, service level constraints are introduced to limit the probability of stock-outs [40]. In portfolio optimization problems, there is interest in restricting the downside risk at a certain threshold (value-at-risk) [53]. Finally, in communications network design problems, a certain quality of service (QoS) with respect to packet losses needs to be ensured [148]. Such risk, service, or reliability constraints are modeled using CCPs. We will discuss more applications of CCPs in Section 4.

Formally, for a given probability space $(\Omega, \mathcal{F}, \mathbb{P})$, a chance-constrained program (CCP) is given by

\begin{align}
\min_{x} \quad & c^\top x \notag \\
\text{s.t.} \quad & \mathbb{P}\big(x \in \mathcal{P}(\omega)\big) \ge 1 - \epsilon, \tag{1a} \\
& x \in \mathcal{X}, \tag{1b}
\end{align}

where $c \in \mathbb{R}^n$ is a cost vector, $\mathcal{X} \subset \mathbb{R}^n$ represents a compact set defined by deterministic constraints on the decision variables $x$, possibly including integrality restrictions on some variables, $\omega \in \Omega \subset \mathbb{R}^d$ is a random vector with a true distribution $\mathbb{P}$, $\mathcal{P}(\omega)$ represents, for a given $\omega$, the set of solutions that are safe or desirable, and $\epsilon \in (0,1)$ is the risk tolerance for the decision vector $x$ being unsafe. For risk-averse decision makers, typical choices for the risk level are small values of $\epsilon$. In this survey, we mainly focus on linear chance constraints, i.e., polyhedral $\mathcal{P}(\omega)$. More precisely, let

\begin{equation}
\mathcal{P}(\omega) := \{ x : T(\omega) x \ge r(\omega) \}, \tag{2}
\end{equation}

where $T(\omega)$ is an $m \times n$ matrix of random constraint coefficients, and $r(\omega) \in \mathbb{R}^m$ is a vector of random right-hand sides.

Next, we introduce the taxonomy of CCPs. Constraint (1a) is said to be an individual chance constraint for $m = 1$, and a joint chance constraint for $m > 1$. If, for all $\omega \in \Omega$, we have $T(\omega) = T$ for some deterministic $m \times n$ matrix $T$, and only $r(\omega)$ is random, we say that the CCP has right-hand side (RHS) uncertainty. In contrast, if the so-called technology matrix $T(\omega)$ is random, we say that the CCP has left-hand side (LHS) uncertainty, regardless of whether $r(\omega)$ is a fixed vector or is random. Most of the work in CCP can be seen as single-stage (i.e., static) decision-making problems where the decisions are made here and now, and there are no recourse actions once the uncertainty is revealed. In Section 2.4, we discuss extensions to two-stage CCPs. Finally, in many problems of interest, the decision vector $x$ is pure binary, and this structure can be exploited to obtain stronger formulations and specialized algorithms. We refer to such CCPs with pure binary variables as chance-constrained combinatorial optimization problems.

CCP dates back to the early work of Charnes and Cooper [38], Charnes et al. [39], Miller and Wagner [152], Prékopa [182], and Prékopa [183], who first consider problems with individual or joint chance constraints. We refer the reader to [25, 59, 104, 185, 186, 202] for textbook treatment and detailed reviews that describe the earlier developments in this area. This survey is aimed at reviewing the developments in the past two decades, primarily from a mixed-integer conic reformulations perspective.

Despite long-standing interest and ubiquity in practice, CCP remains one of the most challenging classes of problems in general. There are two main challenges with CCPs.

1. Difficulty of evaluating the probability of an undesirable solution.

In practice, the distribution $\mathbb{P}$ in the chance constraint is not fully specified. In rare cases when $\mathbb{P}$ is a known continuous distribution, calculating the joint probability of several events requires evaluation of a multi-dimensional integral, which is hard to compute accurately [4]. Ben-Tal and Nemirovski [19], Calafiore and Campi [29, 30], and Nemirovski and Shapiro [161, 162] approximate the non-convex chance constraint with convex constraints such that the solution to this approximation is feasible with high probability. However, such methods could yield highly conservative solutions [4] (see Section 2.5). Finally, a black-box simulation model or an oracle may be available to evaluate $\mathbb{P}$ for a given solution $x$; however, it is not straightforward to integrate such an oracle within the optimization model, and the number of feasible solutions to evaluate is typically huge [228]. In this survey, we focus on two main approaches to address this difficulty, namely the Sample Average Approximation (SAA) approach (Section 2) and the distributionally robust approach (Section 3).

2. Non-convexity of the feasible set.

For certain special cases such as joint CCPs with RHS uncertainty involving quasi-concave or log-concave distributions [182, 185, 226, 227], or individual chance constraints with LHS uncertainty under a certain log-concave distribution and choice of $\epsilon$ [116], such as normal [105], there is an equivalent convex representation of the corresponding CCP. In general, however, chance constraints, even in the case with continuous $x$, polyhedral $\mathcal{P}$, and only RHS uncertainty, result in non-convex feasible regions in their original variable space. We illustrate this challenge with an example.

Example 1. [Adapted from [198]] Let $\omega_1$ and $\omega_2$ be dependent random variables with the joint probability density function given in Table 1. Consider the CCP with RHS uncertainty

\begin{align*}
\min \quad & x_1 + x_2 \\
\text{s.t.} \quad & \mathbb{P}\left\{ \begin{array}{l} x_1 - x_2 \ge \omega_1 \\ x_1 + 2 x_2 \ge \omega_2 \end{array} \right\} \ge 1 - \epsilon, \\
& x \ge 0.
\end{align*}

The feasible region of this problem is non-convex, as illustrated in Figure 1. □

Table 1: Joint probability density function of $\omega$ (scenarios 1 through 9, with rows $\omega_1$ and $\omega_2$).

Figure 1: The feasible region of the example CCP (axes $x_1$ and $x_2$; boundary lines labeled by values of $\omega_1$ and $\omega_2$).

Indeed, the resulting problems are NP-hard, in general [145, 162].

There has been a renewed and growing interest in CCP since the early 2000s [61, 196] to tackle these challenges. Capitalizing on the enormous success of mixed-integer programming (MIP) and conic optimization solvers since the early 2000s, our focal point is on reformulations that aim to circumvent the aforementioned challenges and enable progress towards the solution of this difficult class of problems.
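The non-convexity described above can be seen on an even smaller instance than Example 1. The following is a minimal sketch with hypothetical data (two equiprobable scenarios and a risk level of 0.5, none of which come from Table 1): two points each satisfy the chance constraint, yet their midpoint does not. The function names `satisfies` and `chance_feasible` are illustrative only.

```python
def satisfies(x, omega):
    """True if x >= omega componentwise (a toy polyhedral P(omega))."""
    return all(xi >= wi for xi, wi in zip(x, omega))


def chance_feasible(x, scenarios, eps):
    """Check the empirical chance constraint: the fraction of violated
    equiprobable scenarios must not exceed eps."""
    violated = sum(1 for w in scenarios if not satisfies(x, w))
    return violated / len(scenarios) <= eps + 1e-12


# Hypothetical data: two equiprobable scenarios, risk level 0.5.
scenarios = [(1.0, 0.0), (0.0, 1.0)]
eps = 0.5

a = (1.0, 0.0)  # satisfies scenario 1 only
b = (0.0, 1.0)  # satisfies scenario 2 only
mid = tuple((u + v) / 2 for u, v in zip(a, b))  # satisfies neither scenario

print(chance_feasible(a, scenarios, eps))
print(chance_feasible(b, scenarios, eps))
print(chance_feasible(mid, scenarios, eps))
```

Both endpoints are feasible while their convex combination is not, so the feasible set of this toy instance is a union of two orthants rather than a convex set.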

We next present two relevant definitions pertaining to the risk associated with a univariate random variable that will be used in our discussion. We refer the reader to [176, 177, 192] for a more detailed treatment of these risk measures.

Definition 1. For a univariate random variable $X$ with cumulative distribution function $F_X$, the value-at-risk (VaR) at confidence level $(1-\epsilon)$, also known as the $(1-\epsilon)$-quantile, is given by

\begin{equation}
\mathrm{VaR}_{1-\epsilon}(X) = \min \{ \eta : F_X(\eta) \ge 1 - \epsilon \}. \tag{3}
\end{equation}
□

It follows from (3) that, for any $x \in \mathbb{R}$, the inequalities $\mathrm{VaR}_{1-\epsilon}(X) \le x$ and $\mathbb{P}(X \le x) \ge 1 - \epsilon$ are equivalent. That is, a chance constraint on the random variable $X$ can be equivalently represented as a constraint on its VaR.

Definition 2 ([193, 194]). The conditional value-at-risk (CVaR) at confidence level $(1-\epsilon) \in (0,1)$ is given by

\begin{equation}
\mathrm{CVaR}_{1-\epsilon}(X) = \min \left\{ \eta + \frac{1}{\epsilon} \mathbb{E}\big([X - \eta]_+\big) : \eta \in \mathbb{R} \right\}, \tag{4}
\end{equation}

where $[a]_+ := \max\{0, a\}$. □

It is well known that the minimum in definition (4) is attained at the VaR at confidence level $(1-\epsilon)$. CVaR, introduced by Rockafellar and Uryasev [193], satisfies the axioms of coherent risk measures, such as law invariance and sub-additivity, as defined in [9]. It has other desirable properties, such as tractability: for finite distributions, CVaR can be formulated as a linear program and embedded in an optimization model [192]. More precisely, suppose $X$ is a random variable with realizations $X_1, \dots, X_N$ and corresponding probabilities $p_1, \dots, p_N$. Throughout, for $a \in \mathbb{Z}_+$, let $[a] := \{1, \dots, a\}$. The optimization problem in (4) can equivalently be formulated as the linear program (LP):

\begin{equation}
\min \left\{ \eta + \frac{1}{\epsilon} \sum_{i \in [N]} p_i w_i \; : \; w_i \ge X_i - \eta, \ \forall i \in [N], \ w \in \mathbb{R}^N_+ \right\}. \tag{5}
\end{equation}

Furthermore, let $\rho$ denote an ordering of the realizations such that $X_{\rho_1} \le X_{\rho_2} \le \cdots \le X_{\rho_N}$. Then, for a given $\epsilon \in (0,1)$, we have

\begin{equation}
\mathrm{VaR}_{1-\epsilon}(X) = X_{\rho_q}, \quad \text{where } q = \min \left\{ j \in [N] : \sum_{i \in [j]} p_{\rho_i} \ge 1 - \epsilon \right\}. \tag{6}
\end{equation}

Our survey is organized as follows. In the first part of this survey, in Section 2, we consider CCPs under a finite discrete distribution. We consider a natural MIP formulation and valid inequalities for both RHS and LHS uncertainty in Sections 2.1 and 2.2, respectively. In Section 2.3, we review alternative formulations and specialized methods for CCPs under a finite distribution. In Section 2.4, we describe a two-stage CCP and a Benders decomposition method for its solution. In Section 2.5, we describe approximations of CCPs. In the second part of this survey, in Section 3, we consider distributionally robust CCPs, primarily under two types of uncertainty sets: moment-based (Section 3.1) and Wasserstein ambiguity sets (Section 3.2). We give an overview of a wide range of applications in Section 4, and conclude in Section 5.
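As a concrete check of the finite-distribution formulas (4) and (6) above, the following minimal sketch computes VaR by the sorted cumulative-probability rule and then evaluates the CVaR objective at that value of $\eta$, relying on the stated fact that the minimum in (4) is attained at the VaR. The function name `var_cvar` and the small numerical tolerance are assumptions made for illustration.

```python
def var_cvar(values, probs, eps):
    """Return (VaR, CVaR) at confidence level 1 - eps for a finite
    distribution with the given realizations and probabilities."""
    # Ordering rho: indices sorted by nondecreasing realization, as in (6).
    order = sorted(range(len(values)), key=lambda i: values[i])
    cumulative = 0.0
    var = values[order[-1]]
    for i in order:
        cumulative += probs[i]
        # Small tolerance guards against floating-point round-off.
        if cumulative >= 1 - eps - 1e-12:
            var = values[i]
            break
    # Evaluate the objective of (4) at eta = VaR.
    excess = sum(p * max(v - var, 0.0) for v, p in zip(values, probs))
    return var, var + excess / eps


var, cvar = var_cvar([1, 2, 3, 4], [0.25, 0.25, 0.25, 0.25], 0.25)
print(var, cvar)
```

For four equiprobable realizations 1, 2, 3, 4 and $\epsilon = 0.25$, the cumulative probability first reaches 0.75 at the realization 3, and the CVaR equals the mean of the worst 25% of outcomes, which is 4.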

In this section, we consider CCPs under a finite discrete probability space $(\Omega, 2^{\Omega}, \mathbb{P}_N)$, where $\Omega = \{\omega^1, \dots, \omega^N\}$ and $p_i = \mathbb{P}_N(\omega = \omega^i)$. Of particular interest are such CCPs that result from the Sample Average Approximation (SAA) approach [144, 173], which approximates $\mathbb{P}$ via a finite empirical distribution, $\mathbb{P}_N$.

For ease of exposition, we will assume that the samples are independent and identically distributed (i.i.d.) and consider the SAA formulation of CCP (i.e., $p_i = 1/N$, $i \in [N]$). The methods we discuss can be adapted to the case of non-i.i.d. scenarios, for example those that are obtained via importance sampling [17].

The SAA formulation of (1) is

\begin{align}
\min_{x} \quad & c^\top x \tag{7a} \\
\text{s.t.} \quad & \frac{1}{N} \sum_{i \in [N]} \mathbb{1}\big(x \notin \mathcal{P}(\omega^i)\big) \le \epsilon, \tag{7b} \\
& x \in \mathcal{X}, \tag{7c}
\end{align}

where $\mathbb{1}(\cdot)$ is the indicator function. From this formulation, it is apparent that the use of a finite discrete distribution circumvents the first difficulty of evaluating high-dimensional integrals. Under non-equal probability scenarios, constraint (7b) is simply

\[
\sum_{i \in [N]} p_i \, \mathbb{1}\big(x \notin \mathcal{P}(\omega^i)\big) \le \epsilon.
\]

When $\mathcal{P}(\cdot)$ is polyhedral as given by (2), formulation (7) for CCP under a discrete distribution lends itself to an equivalent mixed-integer linear program (MIP) via the introduction of binary variables and big-M constraints. Hence, the non-convex feasible region in the original space of variables can be represented as a MIP with additional binary variables. This addresses the second difficulty of non-convexity by enabling the immediate use of off-the-shelf MIP solvers. Next we present such MIP formulations for the RHS and LHS uncertainty cases.

First, let us consider the problem with RHS uncertainty. In this setting, the joint linear CCP (7) with RHS uncertainty is reformulated as a mixed-integer linear program [196]

\begin{align}
\min_{x,t,z} \quad & c^\top x \tag{8a} \\
\text{s.t.} \quad & x \in \mathcal{X}, \quad T x = \bar{r} + t, \tag{8b} \\
& t_j \ge r_{i,j} (1 - z_i), \quad \forall i \in [N], \ \forall j \in [m], \tag{8c} \\
& \frac{1}{N} \sum_{i \in [N]} z_i \le \epsilon, \tag{8d} \\
& t \in \mathbb{R}^m_+, \quad z \in \{0,1\}^N, \tag{8e}
\end{align}

where $\bar{r} \in \mathbb{R}^m$ is a chosen vector satisfying $r(\omega^i) \ge \bar{r}$ for all $i$, and $r_i = (r_{i,1}, \dots, r_{i,m})^\top$ denotes $r(\omega^i) - \bar{r}$. The choice of $\bar{r}$ ensures that the data vector $r_i$ is nonnegative for all $i \in [N]$. For $\epsilon < 1$, we have $T x \ge \bar{r}$ from (8c)-(8d), hence $t \ge 0$. The binary variable $z_i$ encodes the indicator function in (7b) to model the event $T x \ge r(\omega^i)$. In particular, if $z_i = 0$, then constraints (8c) enforce that $t \ge r_i$ holds and thus $T x \ge r(\omega^i)$ is satisfied. Otherwise, $z_i = 1$, and constraints (8c) reduce to the trivial relation $t \ge 0$. Finally, (8d) enforces that the probability of $x \notin \mathcal{P}(\omega)$ is within the risk threshold $\epsilon$. Note that this constraint is equivalent to a cardinality constraint on the binary variables, $\sum_{i \in [N]} z_i \le \lfloor \epsilon N \rfloor =: k$. In the non-equiprobable case, it is a knapsack constraint $\sum_{i \in [N]} p_i z_i \le \epsilon$.

In the case of individual chance constraints, when $m = 1$, we can linearize the single inequality in the chance constraint as $T x \ge F_{\omega}^{-1}(1 - \epsilon)$ to lower bound the LHS with the $(1-\epsilon)$-quantile. Therefore, under RHS uncertainty, problems with joint chance constraints ($m > 1$) are more challenging. In fact, Luedtke et al. [145] show that the problem is NP-hard for $m > 1$. Constraints (8c) are referred to as big-M constraints. Often, formulations with big-M constraints result in weak LP relaxation bounds, which hinder the convergence of branch-and-bound methods. Therefore, MIP approaches have focused on obtaining strong formulations for the SAA formulation to scale up the problem sizes that can be solved. To this end, an important substructure in formulation (8) is given by the constraints (8c) and (8e) for a fixed $j$. This particular substructure is a special case of the mixing set studied in [83], which involves general integer variables. Its specific form involving only binary variables is first considered in Atamtürk et al. [14] in the context of vertex covering.

We first consider strengthening based on an individual inequality in the chance constraint. More precisely, consider (8c) and (8e) for a fixed $j$. We will drop the dependence on $j$ for notational convenience. The resulting system is nothing but a mixing set with binary variables given by

\[
M := \left\{ (t, z) \in \mathbb{R}_+ \times \{0,1\}^N : t + r_i z_i \ge r_i, \ \forall i \in [N] \right\}.
\]

The (binary) mixing set $M$ involves $N$ inequalities that share a common continuous variable $t$, but independent binary variables $z_i$, $i \in [N]$.
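To make the roles of $t$ and $z$ in the big-M formulation with RHS uncertainty concrete, the sketch below recovers them for a fixed candidate $x$: it shifts the right-hand sides by a componentwise minimum (one valid choice of the vector called $\bar{r}$ in the text), sets $z_i = 1$ exactly when scenario $i$ is violated, and then checks the equiprobable budget constraint. The function names and the toy data are hypothetical; this evaluates a given point and is not a solver.

```python
def saa_indicators(x, T, r_scen):
    """For fixed x, return (t, z): t = T x - r_bar and z[i] = 1 when
    scenario i is violated, i.e. some t[j] < r_scen[i][j] - r_bar[j]."""
    m = len(T)
    # One valid choice of r_bar: the componentwise minimum over scenarios.
    r_bar = [min(r[j] for r in r_scen) for j in range(m)]
    Tx = [sum(T[j][l] * x[l] for l in range(len(x))) for j in range(m)]
    t = [Tx[j] - r_bar[j] for j in range(m)]
    z = [
        int(any(t[j] < r[j] - r_bar[j] - 1e-12 for j in range(m)))
        for r in r_scen
    ]
    return t, z


def within_risk_budget(z, eps):
    """Equiprobable budget check: (1/N) * sum(z) <= eps."""
    return sum(z) / len(z) <= eps + 1e-12


# Hypothetical data: identity T, four equiprobable scenarios, eps = 0.25.
T = [[1, 0], [0, 1]]
r_scen = [[1, 1], [2, 1], [1, 3], [4, 4]]

t, z = saa_indicators([2, 3], T, r_scen)
print(t, z, within_risk_budget(z, 0.25))
```

With these numbers only the fourth scenario is violated, so one of four scenarios is discarded and the budget of 0.25 is met exactly.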
The so-called mixing inequalities of Günlük and Pochet [83], specialized to the binary case and known to be equivalent to the star inequalities introduced in [14], are an exponential family of linear inequalities that provide the complete linear description of $\operatorname{conv}(M)$ (see also Pochet and Wolsey [179, Theorem 18]). Furthermore, this class of inequalities can be separated in polynomial time [10, 83]; hence formulation (8) can be strengthened using the mixing inequalities within a branch-and-cut framework. Somewhat surprisingly, Kılınç-Karzan et al. [106] uncover that the mixing set $M$ can be viewed as a polymatroid set corresponding to the epigraph of submodular functions. Indeed, the authors show that mixing inequalities are equivalent to extremal polymatroid inequalities as defined in Lovász [139] and Atamtürk and Narayanan [12, Proposition 1].

Luedtke et al. [145] further strengthen formulation (8) by exploiting the cardinality constraint (8d) and by studying the resulting set given by (8c)-(8e) for a fixed $j$. In this case, an immediate strengthening is that of the big-M. Consider the set

\[
M^C := \left\{ (t, z) \in \mathbb{R}_+ \times \{0,1\}^N : t + r_i z_i \ge r_i, \ \forall i \in [N], \ \sum_{i \in [N]} z_i \le k \right\}.
\]

Sort the values $r_i$ for $i \in [N]$ to obtain a permutation $\sigma$ such that

\[
r_{\sigma_1} \ge r_{\sigma_2} \ge \cdots \ge r_{\sigma_N}.
\]

Now observe that, due to the cardinality constraint $\sum_{i \in [N]} z_i \le k$, we must have $t \ge r_{\sigma_{k+1}}$. Therefore, we deduce that

\[
M^C = \left\{ (t, z) \in \mathbb{R}_+ \times \{0,1\}^N : t + (r_i - r_{\sigma_{k+1}}) z_i \ge r_i, \ \forall i \in [N], \ \sum_{i \in [N]} z_i \le k \right\}.
\]

Note that this is an immediate big-M coefficient strengthening that can be readily incorporated into the MIP formulation. This strengthening uses the quantile information that $t \ge r_{\sigma_{k+1}}$.

Due to their common usage, we next give a precise definition of the resulting mixing inequalities that make use of the cardinality-based strengthening. Consider a subset $S = \{s_1, s_2, \dots, s_\ell\} \subseteq \{\sigma_1, \sigma_2, \dots, \sigma_k\}$ such that $r_{s_i} \ge r_{s_{i+1}}$ for $i = 1, \dots, \ell$, where $s_1 = \sigma_1$ and $s_{\ell+1} = \sigma_{k+1}$. Luedtke et al. [145] show that a strong mixing inequality valid for $M^C$ is given by

\begin{equation}
t + \sum_{i=1}^{\ell} \left( r_{s_i} - r_{s_{i+1}} \right) z_{s_i} \ge r_{s_1}. \tag{9}
\end{equation}

This idea can be adapted to the non-equiprobable case by redefining $k$ as $k := \max \{ j : \sum_{i=1}^{j} p_{\sigma_i} \le \epsilon \}$. Furthermore, inequality (9) can be strengthened by further use of the cardinality relation, or for the case where the scenarios are not equiprobable and constraint (8d) is in the form of a knapsack inequality [1, 113, 145, 253].

Next, we illustrate this concept on our numerical example (Example 1). Consider the first inequality inside the chance constraint and note that $k = 3$ with respect to $\omega_1$. Note that the scenarios are already ordered in nonincreasing order with respect to the possible values of $r_1(\omega)$. Therefore, the quantile relation gives $t \ge r_{\sigma_{k+1}} = r_{\sigma_4}$. A possible strengthened mixing inequality, for a two-element set $S = \{s_1, s_2\}$, is given by

\[
t + (r_{s_1} - r_{s_2}) z_{s_1} + (r_{s_2} - r_{\sigma_4}) z_{s_2} \ge r_{s_1}.
\]

It is easy to see the validity of this inequality. If $z_{s_1} = 0$, then we must have $t \ge r_{s_1}$, which satisfies this inequality. If $z_{s_1} = 1$ and $z_{s_2} = 0$, then we must have $t \ge r_{s_2}$, which also satisfies it. Finally, when $z_{s_1} = z_{s_2} = 1$, the inequality reduces to $t \ge r_{\sigma_4}$, which holds due to the $(1-\epsilon)$-quantile relation.

So far, we reviewed inequalities based on an individual inequality inside the chance constraint. If we consider multiple inequalities inside the chance constraint jointly, the resulting set is an intersection of multiple mixing sets that share a common set of binary variables $z$, but independent continuous variables $t_j$, $j \in [m]$. For this case, Atamtürk et al. [14, Theorem 3] show that adding the mixing inequalities written for each set to the LP relaxation of the set defined by (8c) and (8e) is sufficient to obtain the convex hull of solutions. Furthermore, Kılınç-Karzan et al. [106] show how to extend their framework exploiting submodularity to recover this result, as well as extend it to propose the so-called aggregated mixing inequalities that incorporate lower bounds on the continuous variables based on the quantile relation. For the special case of two-sided chance constraints, the convex hull description provided in Liu et al. [133] is equivalent to the aggregated mixing inequalities. The aggregated mixing inequalities do not directly use the cardinality information, but use it indirectly through the lower bound on the continuous variables obtained from the quantile. In contrast, Küçükyavuz [113] and Zhao et al. [253] propose valid inequalities for a joint chance constraint by directly considering the cardinality/knapsack constraint.

Now consider the problem with uncertain data in both LHS and RHS. In this setting, the joint linear CCP (7) with LHS uncertainty is reformulated as a mixed-integer linear program [196]

\begin{align}
\min_{x,z} \quad & c^\top x \tag{10a} \\
\text{s.t.} \quad & x \in \mathcal{X}, \tag{10b} \\
& T(\omega^i) x \ge r(\omega^i) - M(\omega^i) z_i, \quad \forall i \in [N], \tag{10c} \\
& \frac{1}{N} \sum_{i \in [N]} z_i \le \epsilon, \tag{10d} \\
& z \in \{0,1\}^N, \tag{10e}
\end{align}

where $M(\omega^i)$, $i \in [N]$, is a vector of big-M coefficients such that when $z_i = 1$, inequality (10c) is redundant.

In Section 2.1 we exploited the mixing structure associated with (8c) and (8e) for a fixed $j$. In other words, we considered an individual inequality inside the (joint) chance constraint. Furthermore, we considered RHS uncertainty only. In contrast, in this section we will consider LHS as well as RHS uncertainty, and we will jointly consider the inequalities inside the chance constraints for any $m \ge 1$.

The mixing procedure described in Section 2.1 relies on the fact that all scenarios share the same LHS for a given $j \in [m]$, that is, $t = T_j x$, where $T_j$ is the $j$th row of $T$. Due to this, we arrive at a mixing structure with $N$ constraints that share the same continuous variable $t$ and different binary variables. In contrast, in the LHS uncertainty case, we no longer have a common continuous variable. Can we still apply the mixing procedure?

As it turns out, we can indeed extend the mixing procedure to generate other classes of valid inequalities for joint chance-constrained programs with LHS uncertainty. To do so, we solve the following single-scenario optimization problem for all scenarios $\omega \in \Omega$ and for a given $\phi \in \mathbb{R}^n$:

\begin{align}
q_{\omega}(\phi) = \min_{x} \quad & \phi^\top x \tag{11a} \\
\text{s.t.} \quad & x \in \mathcal{P}(\omega), \tag{11b} \\
& x \in \mathcal{X}. \tag{11c}
\end{align}

We sort the values $q_{\omega}(\phi)$ for $\omega \in \Omega$ to obtain a permutation $\sigma$ such that

\[
q_{\sigma_1}(\phi) \ge q_{\sigma_2}(\phi) \ge \cdots \ge q_{\sigma_N}(\phi).
\]

Observe that $\phi^\top x \ge q_{\sigma_{k+1}}(\phi)$ is a valid inequality. Furthermore, substituting $t = \phi^\top x$ and $r = q(\phi)$ in inequality (9), we obtain a valid inequality of the desired form. These inequalities are referred to as quantile cuts. This and related inequalities based on quantile information have been studied in [6, 131, 143, 189, 208, 235].
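The cardinality-strengthened mixing inequality (9), which the quantile cuts reuse with $t = \phi^\top x$ and $r = q(\phi)$, can be checked numerically on a small instance. The sketch below builds the coefficients of (9) for a chosen subset $S$ and verifies validity by brute-force enumeration of all binary vectors $z$ within the cardinality budget. The function names and the data are hypothetical, and the enumeration is only practical for very small $N$.

```python
from itertools import product


def mixing_inequality(r, k, S):
    """Coefficients of (9): t + sum(coef[s] * z[s]) >= rhs.
    r: nonnegative values r_i; k: cardinality bound with k < len(r);
    S: indices among the k largest values, including a largest one."""
    sigma = sorted(range(len(r)), key=lambda i: -r[i])
    assert set(S) <= set(sigma[:k])
    # s_1, ..., s_l in nonincreasing order of r, then s_{l+1} = sigma_{k+1}.
    s = sorted(S, key=lambda i: -r[i]) + [sigma[k]]
    assert r[s[0]] == r[sigma[0]]
    coef = {s[i]: r[s[i]] - r[s[i + 1]] for i in range(len(S))}
    return coef, r[s[0]]


def is_valid(r, k, coef, rhs):
    """Brute-force validity check over all z with sum(z) <= k."""
    n = len(r)
    for z in product((0, 1), repeat=n):
        if sum(z) > k:
            continue
        # Smallest feasible t: every scenario with z_i = 0 needs t >= r_i.
        t_min = max([0.0] + [r[i] for i in range(n) if z[i] == 0])
        lhs = t_min + sum(c * z[i] for i, c in coef.items())
        if lhs < rhs - 1e-9:
            return False
    return True


# Hypothetical data, already sorted in nonincreasing order; k = 3.
r = [0.75, 0.5, 0.5, 0.25, 0.25, 0.0]
coef, rhs = mixing_inequality(r, 3, [0, 1])
print(coef, rhs, is_valid(r, 3, coef, rhs))
```

Because the inequality has coefficient one on $t$, it suffices to test it at the smallest feasible $t$ for each $z$. Replacing the computed coefficients by smaller ones makes the check fail, which is a quick way to see that the coefficients in (9) cannot simply be reduced.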
These inequalities consider the interaction between the decision variables across multiple inequalities in the chance constraint, which results in improved computational performance. In another line of work, Tanner and Ntaimo [212] propose a class of cuts based on the irreducibly infeasible subsystems (IIS) of an LP that requires that a subset of scenarios be satisfied. The authors demonstrate the efficacy of this approach in a vaccine allocation application.

While we focus on natural big-M formulations that can be easily adopted by practitioners, it is important to note that there are alternative reformulations for this class of problems relying on the concept of $(1-\epsilon)$-efficient points, which are an exponential number of points representing the multivariate value-at-risk associated with the chance constraint (12b) to be specified later.

Definition 3 ([184]). Let $\nu \in \mathbb{R}^m$ be such that $F(\nu) \ge 1 - \epsilon$ and $F(\nu - \varepsilon) < 1 - \epsilon$ for all $\varepsilon \ge 0$, $\varepsilon \ne 0$. The point $\nu$ is called $(1-\epsilon)$-efficient. □

In Example 1, three points $\nu$ are $(1-\epsilon)$-efficient. The $(1-\epsilon)$-efficient points then prescribe the extreme points of the non-convex feasible region, as seen in Figure 1.

There are several methods in the literature that rely on the enumeration of the exponentially many $(1-\epsilon)$-efficient points [61, 111, 112, 119, 184, 198]. Such alternative formulations lead to specialized branch-and-bound algorithms described in [22, 23, 196, 197]. Sen [198] uses the $(1-\epsilon)$-efficient points to give a disjunctive programming reformulation of joint chance constraints with finite discrete distributions. Valid inequalities are proposed based on the extreme points of the reverse polar of the disjunctive program, which can be separated by a cut generation linear program (CGLP) [15]. Küçükyavuz [113] gives a compact and tight extended formulation based on disjunctive programming for $m = 1$. Vielma et al. [217] extend this formulation for varying $m > 1$ to obtain a hierarchy of stronger relaxations. Dentcheva et al. [61] use $(1-\epsilon)$-efficient points to obtain various reformulations of probabilistic programs with discrete random variables, and to derive valid bounds on the optimal objective function value. Ruszczyński [196] uses the concept of $(1-\epsilon)$-efficient points to derive consistent orders on different scenarios representing the discrete distribution. The consistent ordering is represented with precedence constraints, and valid inequalities for the resulting precedence-constrained knapsack set are proposed. Beraldi and Ruszczyński [22] propose a branch-and-bound method for probabilistic integer programs using a partial enumeration of the $(1-\epsilon)$-efficient points.

Alternatively, Ahmed et al. [6] and Jiang and Xie [101] consider a Lagrangian relaxation of the MIP formulation by creating copies of the variables and relaxing the non-anticipativity constraint that these variables are equal. The authors derive extended formulations whose relaxations achieve stronger bounds than the basic formulation (without mixing strengthening).

Furthermore, for problems with pure binary variables and special structures, i.e., for combinatorial CCPs, stronger formulations have been developed (see, e.g., [21, 95, 130, 206, 208, 228]). For example, Song et al. [208] study chance-constrained bin packing problems, and propose a formulation that does not involve additional indicator variables to represent (7b), based on the so-called lifted probabilistic cover inequalities. Later, Wang et al. [225] consider a closely related formulation with multiple chance constraints and derive lifted cover, clique, and projection inequalities based on a bilinear reformulation. In a related line of work, Wang et al. [224] consider a chance-constrained assignment problem and its distributionally robust variant, and propose lifted cover inequalities based on a bilinear reformulation of the problem. For chance-constrained knapsack problems, Yoda and Prékopa [243] provide sufficient conditions for the convexity of the formulation; Klopfenstein and Nace [110], De [54], Han et al. [85], and Joung and Lee [103] derive approximate but more tractable formulations that can provide near-optimal solutions; and Goyal and Ravi [82] derive a fully polynomial time approximation scheme when the random item sizes are independent and Gaussian. In addition, Nikolova [164] studies approximation algorithms for general chance-constrained combinatorial optimization problems with random parameters following either the Gaussian distribution or a general distribution. Xie and Ahmed [236] provide a bicriteria approximation algorithm for a class of chance-constrained covering problems and their distributionally robust variants that finds a solution within a constant factor of the violation probability and a constant factor of the optimal objective.

For chance-constrained set covering models with RHS uncertainty, Beraldi and Ruszczyński [23] and Saxena et al. [197] propose a specialized branch-and-bound algorithm based on the enumeration of $(1-\epsilon)$-efficient points. Subsequently, Saxena et al. [197] derive polarity cuts to improve the computational performance of this approach. For individual chance-constrained set-covering problems with LHS uncertainty, cutting plane approaches are developed in [73] for the case that all components of the Bernoulli random vector $\omega^i$ are independent. In addition, Wu and Küçükyavuz [228] propose an exact approach for a partial set covering problem for the case that there exists an oracle to retrieve the probability of any event under $\mathbb{P}$.
In another line of work, Goyal and Ravi [81] and Swamy [210] propose approximation algorithms for chance-constrained set-covering problems with optimality guarantees. In addition to the aforementioned combinatorial CCPs, Padberg and Rinaldi [172] and Campbell and Thomas [32] study chance-constrained traveling salesman problems, Song and Shen [207] incorporate a chance constraint into a bi-level shortest path interdiction problem, and Ishii et al. [98] and Geetha and Nair [77] study chance-constrained variants of the spanning tree problem.

The focus of this survey is on mixed-integer conic reformulations of CCPs, which yield provably optimal solutions at termination. However, it bears mentioning that there are recent nonlinear programming-based approaches to address the non-convexity of chance constraints. Cheon et al. [46] give a global optimization algorithm that successively partitions the non-convex feasible region until a global optimal solution is obtained. Tayur et al. [213] give an algebraic geometry algorithm for a scheduling problem with joint chance constraints that solves a series of chance-constrained integer programs with varying reliability levels. Peña-Ordieres et al. [175] derive smooth non-convex reformulations of the chance constraint based on the sampled empirical distribution. Other nonlinear programming approaches, which may result in solutions that are stationary points, include difference-of-convex optimization methods [94], sequential outer and inner approximations [78], and sequential cardinality-constrained quadratic optimization methods [50].

Finally, throughout, we have assumed that the risk level $\epsilon$ is fixed. However, in practice, the decision-maker may be interested in the trade-offs between risk level and the optimal objective. One way to assess this would be to solve the problem for multiple values of fixed $\epsilon$.
For example, Shen [204] proposes a novel v ariable risk thresholdmodel in which the risk tolerance is adjustable with an appropriate penalty function in the objective to preventhigh risk. The author proposes a MIP formulation for this problem for individual chance constraints. Xie et al.[237, Theorem 8] show that the corresponding optimization problem is strongly NP-hard. Elçi et al. [70] proposea stronger MIP formulation for this problem under RHS uncertainty. Finally, Lejeune and Shen [121] considerjoint chance constraints also with LHS uncertainty and propose a Boolean-based mathematical formulation for thismodel. Thus far, we have considered a decision-making problem that is static. In other words, the decisions are madehere-and-now before the revelation of the outcome of a random event. However, in most practical situations, thereare multiple decision stages—intervened by a probabilistic event—and the decision-maker takes recourse actions inthe later epochs based on the observed outcome of the event. In this section, we focus on problems that involvetwo stages. For example, in a power generation setting, the day-ahead problem determines the on/off status ofthe conventional generators a day before realizing the demand (load) or supply (in case of renewable generators).Then the second-stage problem ensures that the loss-of-load probability is no more than a pre-speciﬁed risk level (cid:15) ∈ (0 , . Therefore, a two-stage chance-constrained model is called for.As before, the random outcome ω is deﬁned on a probability space (Ω , Ω , P N ) . Let E [ · ] denote the expectationoperator taken with respect to ω . Liu et al. 
[131] propose the two-stage chance-constrained mixed-integer program
\begin{align}
\min_{x}\quad & c^\top x + \mathbb{P}_N\big(x \in \mathcal{P}(\omega)\big)\, \mathbb{E}\big[h(x,\omega) \mid x \in \mathcal{P}(\omega)\big] \tag{12a}\\
\text{s.t.}\quad & \mathbb{P}_N\big(x \in \mathcal{P}(\omega)\big) \ge 1 - \epsilon, \tag{12b}\\
& x \in \mathcal{X}, \tag{12c}
\end{align}
where $\mathcal{P}(\omega) = \{x : \exists\, y \text{ satisfying } W(\omega) y \ge r(\omega) - T(\omega) x,\ y \in \mathcal{Y}\}$ and the second-stage problem is given by
\begin{align}
h(x,\omega) = \min_{y}\quad & g(\omega)^\top y \tag{13a}\\
\text{s.t.}\quad & W(\omega) y \ge r(\omega) - T(\omega) x, \tag{13b}\\
& y \in \mathcal{Y}. \tag{13c}
\end{align}
Here, $g(\omega)$ is a vector of second-stage objective coefficients and $\mathcal{Y}$ is the domain of the second-stage decision vector $y$. For a related model that considers only the feasibility of the second-stage problem without an associated second-stage cost function $h(x,\omega)$, we refer the reader to [143].

The two-stage chance-constrained problem can be formulated as a large-scale mixed-integer program by introducing a big-$M$ term for each inequality in the chance constraint and a binary variable for each scenario. In particular, analogous to the static CCP, the deterministic equivalent formulation (DEF) of the two-stage CCP may be stated as
\begin{align}
\min_{x,y,z}\quad & c^\top x + \frac{1}{N} \sum_{i \in [N]} g(\omega_i)^\top y(\omega_i)\,(1 - z_i) \tag{14a}\\
\text{s.t.}\quad & T(\omega_i) x + W(\omega_i) y(\omega_i) \ge r(\omega_i) - M(\omega_i) z_i, \quad i \in [N], \tag{14b}\\
& \frac{1}{N} \sum_{i \in [N]} z_i \le \epsilon, \tag{14c}\\
& x \in \mathcal{X},\ y(\omega_i) \in \mathcal{Y}, \quad i \in [N], \tag{14d}\\
& z_i \in \{0, 1\}, \quad i \in [N], \tag{14e}
\end{align}
where $z_i$, $i \in [N]$, is a binary variable that equals 0 only if the second-stage problem for scenario $\omega_i$ has a feasible solution, and $M(\omega_i)$ is a vector of large enough constants that makes constraint (14b) redundant if $z_i = 1$, i.e., if the second-stage problem for scenario $\omega_i$ need not be feasible. The factor $(1 - z_i)$ in (14a) reflects that, as in (12a), only scenarios with $x \in \mathcal{P}(\omega_i)$ contribute a second-stage cost. The rest of the constraints are interpreted similarly as before.

This formulation poses multiple challenges in addition to the usual difficulties of a formulation with big-$M$ constraints (14b). First, the objective function (14a) is nonlinear. Second, the problem is large scale due to the copies of the variables $y(\omega_i)$ and the large number of binary variables $z_i$ for $i \in [N]$.
Nevertheless, the formulation (14) has a decomposable structure: for a fixed first-stage vector $x$, the problem decomposes into independent scenario problems. Furthermore, if $y$ is a continuous decision vector and $\mathcal{Y}$ is polyhedral, then the second-stage problems are linear programs. Next we describe a Benders-type decomposition algorithm that not only exploits this decomposable structure, but also replaces the weak big-$M$ constraints (14b) with stronger optimality and feasibility cuts, using the mixing structure.

The Benders method [20], or its specific use in classical two-stage stochastic programming (without chance constraints), referred to as the L-shaped method [215], is the method of choice for problems that have a similar structure and whose second-stage problems are linear programs. However, these methods are not immediately applicable to (14), since both the feasibility and optimality cuts of the Benders method assume that all second-stage problems must be feasible, which is not the case for two-stage CCPs. For general recourse problems, feasibility and optimality cuts different from the traditional Benders cuts must be developed.

Let $\eta_i$ represent a lower bounding approximation of the optimal objective function value of the second-stage problem under scenario $\omega_i$, $i \in [N]$. Without loss of generality, we assume that $\eta_i \ge 0$, $i \in [N]$. A Benders decomposition method solves a sequence of relaxed master problems (RMPs) of the form
\begin{align}
\min_{x,z,\eta}\quad & c^\top x + \frac{1}{N} \sum_{i \in [N]} \eta_i \tag{15a}\\
\text{s.t.}\quad & \frac{1}{N} \sum_{i \in [N]} z_i \le \epsilon, \tag{15b}\\
& (x, z) \in \mathcal{F}, \tag{15c}\\
& (x, z, \eta) \in \mathcal{O}, \tag{15d}\\
& x \in \mathcal{X}, \tag{15e}\\
& z \in \{0, 1\}^N, \tag{15f}
\end{align}
where $\mathcal{F}$ and $\mathcal{O}$ denote the sets of feasibility and optimality cuts, respectively, to be specified later.

At iteration $k$, let $(x^k, z^k)$ be the optimal solution to the RMP. Given this first-stage solution, suppose that we solve the LP (13) for outcome $\omega$ to obtain $h(x^k, \omega)$. The feasibility cuts in set $\mathcal{F}$ are derived from the solution to this LP.
If $z_i^k = 0$ for some $i \in [N]$, then the second-stage problem for scenario $\omega_i$ must be feasible. If it is infeasible for a scenario $j \in [N]$, then there exists an extreme ray $\psi_{\omega_j}$ associated with the dual of (13) for scenario $\omega_j$ that certifies this infeasibility. Then, letting $\phi = \psi_{\omega_j}^\top T(\omega_j)$ in (11) and following the mixing procedure gives a violated valid inequality that cuts off this infeasible solution $(x^k, z^k)$. If, on the other hand, for all $\omega \in \Omega$ such that $z^k(\omega) = 0$ the associated second-stage problem is indeed feasible, then the current solution $(x^k, z^k)$ is a feasible solution and no feasibility cuts are necessary. However, optimality cuts may be needed. Next we describe how to obtain valid optimality cuts.

Let $\psi_{\omega_j}$ be the dual vector associated with the optimal basis of the second-stage problem (13) for scenario $\omega_j$ at this iteration. One possible big-$M$ optimality cut is given by [221, 222]
\begin{equation}
\eta_j + M_j z_j \ge \psi_{\omega_j}^\top \big(r(\omega_j) - T(\omega_j) x\big), \tag{16}
\end{equation}
where $M_j$, $j \in [N]$, are big-$M$ coefficients.

Next we describe a stronger optimality cut proposed by [131] that leads to faster convergence to an optimal solution. Clearly, the traditional Benders optimality cut $\eta_j \ge \psi_{\omega_j}^\top (r(\omega_j) - T(\omega_j) x)$ is a valid optimality cut for $x \in \mathcal{X}$ (in fact for $x \in \mathcal{P}(\omega_j)$) if $z_j = 0$. However, it may not be valid for all $x \in \mathcal{X}$ for solutions with $z_j = 1$. To obtain a valid optimality cut, we solve the following secondary problem with $\phi = \psi_{\omega_j}^\top T(\omega_j)$:
\begin{equation*}
\bar{v}_{\omega_j}(\phi) = \min_{x, y} \{\phi x : x \in \mathcal{X},\ y \in \mathcal{Y}\}.
\end{equation*}
The resulting optimality cut is
\begin{equation}
\eta_j + \big(\psi_{\omega_j}^\top r(\omega_j) - \bar{v}_{\omega_j}(\phi)\big) z_j \ge \psi_{\omega_j}^\top \big(r(\omega_j) - T(\omega_j) x\big). \tag{17}
\end{equation}
To see the validity of this inequality at $z_j = 1$, note that in this case, the second-stage objective function contribution for scenario $\omega_j$ is zero. Furthermore, inequality (17) evaluated at $z_j = 1$ reduces to $\eta_j \ge \bar{v}_{\omega_j}(\phi) - \phi x$. Because $\bar{v}_{\omega_j}(\phi) - \phi x \le 0$ for all $x \in \mathcal{X}$ and $\eta_j \ge 0$, this inequality is trivially satisfied.
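The difference between the big-$M$ cut (16) and the strong cut (17) can be seen on a one-scenario toy instance. The sketch below is only illustrative: it assumes that $\mathcal{X}$ is a box (so the secondary problem is solved coordinate-wise), and the data `psi`, `r`, `T`, the box bounds, and the value `big_m` are made up rather than taken from [131].

```python
# Toy comparison of the big-M cut (16) and the strong cut (17) for one scenario.
# Assumptions (not from the survey): X is a box [lo, hi]; all numbers are made up.

def min_linear_over_box(phi, lo, hi):
    """v_bar(phi) = min{phi . x : lo <= x <= hi}, solved coordinate by coordinate."""
    return sum(p * (l if p >= 0 else h) for p, l, h in zip(phi, lo, hi))

def cut_rhs(psi, r, T, x):
    """Right-hand side psi^T (r - T x), which is shared by cuts (16) and (17)."""
    return sum(p * (ri - sum(tij * xj for tij, xj in zip(Ti, x)))
               for p, ri, Ti in zip(psi, r, T))

def eta_bound(coeff_z, psi, r, T, x, z):
    """Lower bound on eta_j implied by eta_j + coeff_z * z_j >= psi^T (r - T x)
    together with eta_j >= 0."""
    return max(0.0, cut_rhs(psi, r, T, x) - coeff_z * z)

psi = [2.0]             # hypothetical dual solution of the scenario LP
r = [5.0]               # hypothetical right-hand side r(omega_j)
T = [[1.0]]             # hypothetical technology matrix T(omega_j)
lo, hi = [0.0], [4.0]   # box X = [0, 4]

# phi = psi^T T(omega_j), then the secondary problem value v_bar(phi).
phi = [sum(psi[k] * T[k][j] for k in range(len(psi))) for j in range(len(lo))]
v_bar = min_linear_over_box(phi, lo, hi)

strong_coeff = sum(p * ri for p, ri in zip(psi, r)) - v_bar  # coefficient of z_j in (17)
big_m = 1000.0                                               # coefficient of z_j in (16)

# Bounds on eta_j at the fractional point x = 0, z_j = 0.5 of an LP relaxation.
strong_bound = eta_bound(strong_coeff, psi, r, T, [0.0], 0.5)
bigm_bound = eta_bound(big_m, psi, r, T, [0.0], 0.5)
print(strong_coeff, strong_bound, bigm_bound)
```

At the fractional point, the strong cut still forces a positive second-stage cost estimate, whereas the big-$M$ cut is inactive; at $z_j = 1$ the strong cut imposes nothing beyond $\eta_j \ge 0$ for any $x$ in the box, which is the validity argument given above.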
The finite convergence of the resulting algorithm is proven in [131] under certain assumptions.

In Table 2, we summarize a set of computational experiments that appear in [131] to show the effectiveness of the approaches discussed so far. The instances are based on a resource planning problem adapted from [143]. In the first stage, the number of servers among $s$ types of servers to employ is determined. The second-stage problem is to allocate the servers to clients of $\tau$ types, so that their demands are met with high probability ($1 - \epsilon$). Instances with various choices of $N, \epsilon, \tau, s$ are tested and we report the average statistics for three random instances generated for the combination reported in each row. We compare the proposed "Strong" decomposition algorithm, which uses the optimality cuts (17), with DEF (14) and with the decomposition approach (referred to as "Basic") that uses the mixing-based feasibility cuts and the big-$M$ optimality cuts (16) with an appropriate choice of big-$M$ as described in [131]. We report the solution times (in seconds) only for Strong decomposition, because for DEF and Basic, all instances tested reach the time limit of one hour. We also report the percentage optimality gap at termination under the Gap column. In most cases, DEF is unable to find a feasible solution to the LP relaxation, as indicated by a '-'. In cases when it is able to find a feasible solution, it ends with a gap ranging from 4% to 8%. On the other hand, Basic is able to find a feasible solution for all instances, but is unable to prove optimality for any of the 36 instances tested. It ends after an hour with optimality gaps ranging from 2% to 7%. In contrast, the Strong decomposition algorithm, based on the proposed strong optimality cuts, is able to solve most of the instances to optimality. For the two unsolved instances (indicated by a superscript under the Gap column), the average optimality gap is less than 0.1%.
These results highlight the importance of using strong formulations and decomposition for large-scale instances.

Table 2: Results for instances with random RHS.

  Instances                  DEF        Basic      Strong
  (N, ε)         (s, τ)      Gap (%)    Gap (%)    Time (s)   Gap (%)
  (2000, 0.05)   (5, 10)     4.60       2.34       166        0
                 (10, 20)    -          2.93       483        0
                 (15, 30)    -          2.69       1106       0
  (2500, 0.05)   (5, 10)     4.64       2.61       279        0
                 (10, 20)    -          3.08       711        0
                 (15, 30)    -          2.88       1819       0.09
  (2000, 0.1)    (5, 10)     7.1        5.46       723        0
                 (10, 20)    -          5.99       1069       0
                 (15, 30)    -          6.27       1032       0
  (2500, 0.1)    (5, 10)     7.63       5.32       641        0
                 (10, 20)    -          5.79       1198       0
                 (15, 30)    -          6.03       2112       0.02

It is important to note that in this model, the undesirable outcomes $\omega$ such that $x \notin \mathcal{P}(\omega)$ are simply ignored. Liu et al. [131] propose an extension of the two-stage model (12), where they allow so-called recovery decisions for the undesirable scenarios. They discuss how to resolve a potential time inconsistency in two-stage CCP. Furthermore, the Benders decomposition-based solution method is extended to operate in the case of recovery.

Elçi and Noyan [69] extend this framework to a two-stage chance-constrained optimization model with a mean-risk objective, using the conditional value-at-risk as a risk measure. They apply this framework to a humanitarian relief network design problem and demonstrate its effectiveness on a case study based on hurricane preparedness in the Southeastern United States. Lodi et al. [136] extend this two-stage framework to convex second-stage problems, motivated by hydro-power scheduling applications. They build an outer approximation of the nonlinear second-stage formulations to design a Benders-type algorithm that converges to an optimal solution under mild assumptions. They demonstrate the computational benefit of the decomposition algorithm on a case study based on hydroplant data from Greece.

We close this subsection by noting that the assumption of continuous second-stage variables can be lifted by leveraging the developments for decomposition algorithms for classical two-stage stochastic mixed-integer programs, where the second-stage problems also involve integer decisions [35, 75, 115, 117, 167–169, 187, 199–201, 245].
These methods rely on iteratively convexifying the second-stage problems and updating the feasibility and optimality cuts accordingly. They can be combined with the Benders-type algorithm we described to enable the solution of two-stage CCPs with integer variables at the second stage.

Given the difficulty of solving the exact formulations of CCPs or their SAA reformulations, one line of research has focused on inner and outer approximations of CCPs that are more tractable. This tractability often comes at the price of conservatism in the resulting solutions. Here we briefly review these formulations and refer the reader to [5] for a review of relaxations and approximations for CCPs.

• Scenario approximation. Scenario approximation (SA) [e.g., 29, 30, 33, 34, 55] entails sampling to approximate the true distribution $\mathbb{P}$ with a finite distribution $\mathbb{P}_N$ with a set of outcomes $\Omega = \{\omega_1, \ldots, \omega_N\}$. However, unlike the SAA model (7), a usual stochastic program (not chance-constrained) is solved, enforcing that the relations inside the chance constraint hold for each scenario. Thus, the scenario approximation problem is given by
\begin{align}
\min_{x}\quad & c^\top x \notag\\
\text{s.t.}\quad & x \in \mathcal{P}(\omega), \quad \omega \in \Omega, \tag{18a}\\
& x \in \mathcal{X}. \tag{18b}
\end{align}
As a result, for polyhedral $\mathcal{P}(\omega)$ and continuous $x$, the resulting SA formulation is a large-scale LP. The cited works give a finite sample guarantee that the solution to this problem is feasible to the original CCP with high probability. Interestingly, this sample size does not depend on $m$, under certain assumptions. Unfortunately, the required sample size is typically large and the resulting solution is overly conservative. The SAA approach [144, 173] is aimed at alleviating the conservatism of the SA approach by enforcing the chance constraint, with a smaller risk level, over the finite distribution $\mathbb{P}_N$, albeit as a MIP as opposed to an LP.

• CVaR approximation. From Definitions 2 and 3, it is readily apparent that for a univariate random variable $X$, $\mathrm{CVaR}_{1-\epsilon}(X) \ge \mathrm{VaR}_{1-\epsilon}(X)$. Therefore, for individual chance constraints ($m = 1$), one can approximate the constraint $\mathbb{P}(r(\omega) - T(\omega) x \le 0) \ge 1 - \epsilon$, or in other words, $\mathrm{VaR}_{1-\epsilon}(r(\omega) - T(\omega) x) \le 0$, with $\mathrm{CVaR}_{1-\epsilon}(r(\omega) - T(\omega) x) \le 0$. For the case of finite discrete distributions, this approximation leads to tractable reformulations due to the LP representation of CVaR given in (5). In particular, for the individual chance-constrained CCP (7), the CVaR approximation LP is
\begin{align*}
\min_{x,\eta,w}\quad & c^\top x\\
\text{s.t.}\quad & \eta + \frac{1}{\epsilon N} \sum_{i \in [N]} w_i \le 0,\\
& w_i \ge r(\omega_i) - T(\omega_i) x - \eta, \quad w_i \ge 0, \quad \forall i \in [N],\\
& x \in \mathcal{X}.
\end{align*}
In general, though, it is not possible to represent CVaR tractably [162]. Nevertheless, Nemirovski and Shapiro [162] give a family of safe (i.e., feasible with high probability) and, in some cases, tractable approximations, referred to as generator-based approximations, that include the Bernstein approximation [178]. They show that the tightest such approximation is a CVaR approximation. However, CVaR approximation is also conservative in some cases [7]. We refer the reader to [160], and references therein, for a survey on related safe tractable approximations for individual chance constraints.
In the case of joint chance constraints ($m > 1$), it is worthwhile to note that even for the discrete case, while a vector-valued multivariate VaR definition exists (Definition 3), there is no unified definition of multivariate CVaR [see, 150, and the discussions therein]. This poses challenges in formulating related CVaR-based approximations that are tractable. One approach is to scalarize the multivariate random vector $r(\omega) - T(\omega) x$ and use the corresponding univariate CVaR. Considering the ambiguity of the scalarization weights leads to a multivariate CVaR definition that can be represented as a challenging MIP with big-$M$ constraints [165]. MIP strengthening techniques can be used to improve the computational performance of the resulting multivariate CVaR formulations [114, 132, 166].

• Bonferroni approximation. Given that joint chance constraints are significantly harder than individual chance constraints, one approximation scheme that is commonly considered replaces the joint chance constraint with $m$ individual chance constraints. In this case, consider replacing the joint chance constraint $\mathbb{P}(T_j(\omega) x \ge r_j(\omega),\ j \in [m]) \ge 1 - \epsilon$ with
\begin{equation}
\mathbb{P}\big(T_j(\omega) x \ge r_j(\omega)\big) \ge 1 - \epsilon_j, \quad j \in [m], \tag{19}
\end{equation}
where
\begin{equation}
\sum_{j \in [m]} \epsilon_j \le \epsilon. \tag{20}
\end{equation}
From Bonferroni's inequality, it follows that any solution satisfying constraints (19)–(20) also satisfies the joint chance constraint [42, 162]. Because optimizing over $\epsilon_j$ is, in general, difficult, a common choice is $\epsilon_j = \epsilon/m$, $j \in [m]$. However, this is also known to be a conservative approach [41, 162].

Note that while these approximations provide some statistical guarantees for feasibility, they are known to be conservative and do not come with optimality guarantees. Indeed, Xie and Ahmed [236] show an inapproximability result for CCPs. Ahmed [2] uses a similar idea as [162], this time to obtain a convex (Bernstein) relaxation that yields deterministic lower bounds. Integrated chance constraints, proposed by Klein Haneveld [108], replace the non-convex chance constraints with a quantitative measure of shortfalls that leads to polyhedral representations [109] in the discrete case. In this case, they are equivalent to the LP relaxation of the MIP formulation of CCP. Alternatively, statistical lower bounds can be obtained by using order statistics based on SAA solutions [144, 173]. Such deterministic or statistical bounds are useful in assessing the quality of a solution obtained from an approximation.

The sample sizes required by the finite sample guarantees of sampling-based methods [29, 30, 34, 144, 173] are much too large, and the resulting solutions too conservative, in practice. On the other hand, for small $N$, the SAA solution may even be infeasible to the true problem when evaluated out of sample.
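The relative conservatism of the sampling-based approximations above can be seen on a one-dimensional toy problem, $\min\{x : \mathbb{P}(\omega \le x) \ge 1 - \epsilon\}$, i.e., $T(\omega) = 1$ and $r(\omega) = \omega$. The following sketch is an illustration under these assumptions only; the ten equally weighted samples and the function names are made up. It computes the scenario approximation (18), the CVaR approximation, and the SAA solution for the same sample.

```python
import math

# Toy 1-D problem: minimize x subject to P(omega <= x) >= 1 - eps,
# with N equally likely samples. Sample values and names are illustrative.

def sa_solution(samples):
    """Scenario approximation (18): the constraint must hold for every sample."""
    return max(samples)

def saa_solution(samples, eps):
    """SAA: at most floor(eps * N) samples may violate omega_i <= x."""
    n = len(samples)
    k = math.floor(eps * n + 1e-9)       # number of scenarios that may be ignored
    return sorted(samples)[n - k - 1]

def cvar_solution(samples, eps):
    """CVaR approximation: CVaR_{1-eps}(omega - x) <= 0 iff x >= CVaR_{1-eps}(omega).
    The empirical CVaR is min over eta of eta + sum(max(0, w - eta)) / (eps * N);
    this convex piecewise-linear function attains its minimum at a sample point."""
    n = len(samples)

    def value(eta):
        return eta + sum(max(0.0, w - eta) for w in samples) / (eps * n)

    return min(value(eta) for eta in samples)

samples = [float(i) for i in range(1, 11)]   # omega_i = 1, 2, ..., 10
eps = 0.2

x_sa = sa_solution(samples)
x_cvar = cvar_solution(samples, eps)
x_saa = saa_solution(samples, eps)
print(x_saa, x_cvar, x_sa)
```

On this sample the three solutions are ordered as SAA, then CVaR, then SA, matching the discussion above: SA protects against every sample, CVaR averages the worst $\epsilon$-fraction of samples, and SAA enforces the chance constraint exactly on the empirical distribution.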
In [228], for example, the authors consider a partial set covering problem when an oracle that can evaluate the true probability of the desired event is available. They observe that for sample sizes that lend themselves to a tractable solution of the resulting MIP, the SAA solution is often infeasible to the true problem. This is related to the over-fitting phenomenon in machine learning, when the solution of the problem is highly sensitive to the samples $\{\omega_i\}_{i \in [N]}$ used to obtain it. In the next section, we describe an approach that alleviates this problem.

Given the unavailability of the exact distribution $\mathbb{P}$ and the potential overfitting issues due to SAA-based approaches, there has been growing interest in modeling stochastic optimization problems that are distributionally robust [see, 190, and references therein].

Formally, a distributionally robust chance-constrained program (DRCCP) is modeled as
\begin{align}
\min_{x}\quad & c^\top x \tag{21a}\\
\text{s.t.}\quad & \sup_{\mathbb{P} \in \mathcal{F}(\beta)} \mathbb{P}\big(x \notin \mathcal{P}(\omega)\big) \le \epsilon, \tag{21b}\\
& x \in \mathcal{X}, \tag{21c}
\end{align}
where $\mathcal{F}(\beta)$ is an ambiguity set of distributions and $\beta$ is a set of parameters that describe the ambiguity set. Accordingly, the distributionally robust chance constraint (21b) ensures that the chance constraint is satisfied with respect to all distributions in $\mathcal{F}(\beta)$, even the worst possible one.

Several types of ambiguity sets have been studied in the literature based on various characteristics of the distribution, including moments, shape information (e.g., symmetry and unimodality), support, mixture models, and discrepancy measures (e.g., Wasserstein and $\phi$-divergence) [3, 31, 43, 68, 71, 87, 102, 118, 124, 162, 216, 232, 238, 240, 254]. These ambiguity sets lead to different computational tractability and conservatism of the corresponding DRCCP. In this survey, we will focus on moment-based ambiguity sets (Section 3.1) and Wasserstein ambiguity sets (Section 3.2).

There are many successful developments on the tractability of single and joint chance constraints with moment ambiguity sets, which characterize $\mathbb{P}$ based on moment information of $\mathbb{P}$ [31, 87, 88, 124, 233, 241, 254]. For known mean value $\mu$ and covariance matrix $\Sigma$, El Ghaoui et al. [68] characterize a moment ambiguity set
\begin{equation*}
\mathcal{F}(\mu, \Sigma) := \big\{\mathbb{P} : \mathbb{E}_{\mathbb{P}}[\omega] = \mu,\ \mathbb{E}_{\mathbb{P}}[(\omega - \mu)(\omega - \mu)^\top] = \Sigma\big\}.
\end{equation*}
All probability distributions in $\mathcal{F}(\mu, \Sigma)$ need to have the designated first two moments, and are otherwise allowed to have different distribution types (e.g., Gaussian, Gaussian mixture, etc.) or different support (e.g., discrete or continuous). Perhaps surprisingly, El Ghaoui et al. [68, Theorem 1] show that DRCCP is second-order conic representable for individual chance constraints (i.e., $m = 1$). Specifically, if $T(\omega) := \omega^\top A + T$ for some data matrix $A \in \mathbb{R}^{d \times n}$ and vector $T \in \mathbb{R}^{1 \times n}$, and $r(\omega) := b^\top \omega + r$ for some data vector $b \in \mathbb{R}^d$ and constant $r \in \mathbb{R}$, then constraint (21b) is equivalent to
\begin{equation}
\mu^\top (b - Ax) + \sqrt{\frac{1 - \epsilon}{\epsilon}}\, \big\|\Sigma^{1/2} (b - Ax)\big\|_2 \le Tx - r. \tag{22}
\end{equation}
This indicates that DRCCP may improve not only the out-of-sample performance of CCP when the sample size $N$ is small but also the computational tractability. The same result is also discovered by Calafiore and El Ghaoui [31] and Wagner [219]. In addition, Zymler et al. [254] point out the interesting fact that, for $m = 1$ and ambiguity set $\mathcal{F}(\mu, \Sigma)$, constraint (21b) is equivalent to its conservative approximation that replaces the chance constraint with CVaR, i.e., $\sup_{\mathbb{P} \in \mathcal{F}(\mu, \Sigma)} \mathrm{CVaR}_{1-\epsilon}(r(\omega) - T(\omega) x) \le 0$.

For individual chance constraints, the result of El Ghaoui et al. [68] can be extended in multiple directions while maintaining both exactness and computational tractability. For example, Cheng et al. [45] incorporate support information into $\mathcal{F}(\mu, \Sigma)$ (e.g., specifying that $\mathbb{P}$ is supported on a convex set) and derive an exact reformulation of (21b) based on linear matrix inequalities. Zhang et al. [248] consider potential errors of estimating the mean value $\mu$ and covariance matrix $\Sigma$, e.g., when this is done based on inadequate historical data. To address this, they adopt an alternative ambiguity set proposed by Delage and Ye [56] to allow the true mean value of $\omega$ to be within an ellipsoid centered at $\mu$ and the true covariance matrix to be bounded from above by $\Sigma$. For this extended ambiguity set, Zhang et al. [248] show that constraint (21b) is still second-order conic representable. For ambiguity set $\mathcal{F}(\mu, \Sigma)$, Xu et al. [238] study a distributionally robust variant of the stochastic dominance constraint (see, e.g., Dentcheva and Ruszczyński [60]), which requires different risk tolerances for violating a chance constraint with different magnitudes. More precisely, they study constraints $\sup_{\mathbb{P} \in \mathcal{F}(\mu, \Sigma)} \mathbb{P}[T(\omega) x < r(\omega) - s] \le \epsilon - \beta(s)$ for all $s \ge 0$, where $\beta(s)$ is a pre-specified non-decreasing function of $s$, and show that these constraints are conic representable for various $\beta(s)$ functions.
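Because constraint (22) is a single second-order cone inequality, it is straightforward to evaluate for a candidate $x$. The sketch below is a hedged illustration: the function names and the two-dimensional data are made up, and $\|\Sigma^{1/2} v\|_2$ is computed as $\sqrt{v^\top \Sigma v}$ to avoid a matrix square root. For context it also prints the multiplier $\Phi^{-1}(1-\epsilon)$ that would replace $\sqrt{(1-\epsilon)/\epsilon}$ if $\omega$ were known to be Gaussian; this Gaussian counterpart is a standard textbook fact and is not part of the passage above.

```python
import math
from statistics import NormalDist

def dr_multiplier(eps):
    """Multiplier sqrt((1 - eps) / eps) appearing in constraint (22)."""
    return math.sqrt((1.0 - eps) / eps)

def satisfies_22(x, mu, Sigma, A, b, T, r, eps):
    """Check mu^T (b - A x) + sqrt((1-eps)/eps) * ||Sigma^{1/2} (b - A x)||_2 <= T x - r."""
    d, n = len(b), len(x)
    v = [b[k] - sum(A[k][j] * x[j] for j in range(n)) for k in range(d)]  # b - A x
    mean_term = sum(mu[k] * v[k] for k in range(d))
    # ||Sigma^{1/2} v||_2 equals sqrt(v^T Sigma v) for a positive semidefinite Sigma.
    var_term = sum(v[k] * Sigma[k][l] * v[l] for k in range(d) for l in range(d))
    lhs = mean_term + dr_multiplier(eps) * math.sqrt(var_term)
    rhs = sum(T[j] * x[j] for j in range(n)) - r
    return lhs <= rhs

# Made-up instance: with A = I, T = 0, b = 0, r = 1 the chance constraint reads
# P(omega^T x >= 1) >= 1 - eps for every distribution with mean mu and covariance Sigma.
mu = [2.0, 3.0]
Sigma = [[0.01, 0.0], [0.0, 0.04]]
A = [[1.0, 0.0], [0.0, 1.0]]
b = [0.0, 0.0]
T = [0.0, 0.0]
r = 1.0
eps = 0.05

feasible_half = satisfies_22([0.5, 0.5], mu, Sigma, A, b, T, r, eps)
feasible_fifth = satisfies_22([0.2, 0.2], mu, Sigma, A, b, T, r, eps)
gaussian_multiplier = NormalDist().inv_cdf(1.0 - eps)  # Gaussian counterpart, for context
print(feasible_half, feasible_fifth, dr_multiplier(eps), gaussian_multiplier)
```

The gap between the two multipliers gives a rough sense of the price paid for protecting against every distribution in $\mathcal{F}(\mu, \Sigma)$ rather than a single assumed one.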
Furthermore, Yang and Xu [241] and Xie and Ahmed [233] consider an extension that allows the event $x \in \mathcal{P}(\omega)$ to depend non-linearly on $x$ and $\omega$, e.g., $x \in \mathcal{P}(\omega)$ if and only if $f(x, \omega) \ge 0$, where the function $f(x, \omega)$ is concave in $x$ and quasiconvex in $\omega$. For example, Yang and Xu [241, Corollary 2] recast (21b) as a linear matrix inequality if $r(\omega)$, as well as each entry of $T(\omega)$, is either convex quadratic or linear in $\omega$.

It is also possible to extend El Ghaoui et al. [68] by incorporating shape information into the ambiguity set $\mathcal{F}(\mu, \Sigma)$. For example, Calafiore and El Ghaoui [31, Lemma 3.1] strengthen $\mathcal{F}(\mu, \Sigma)$ by additionally requiring $\mathbb{P}$ to be centrally symmetric (that is, $\mathbb{P}[A] = \mathbb{P}[-A]$ for any Borel set $A \subseteq \mathbb{R}^d$) and derive a conservative approximation of constraint (21b). Hanasusanto [86] considers a similar ambiguity set and allows the true covariance matrix to be bounded from above by $\Sigma$ (instead of matching it exactly as in $\mathcal{F}(\mu, \Sigma)$). For this set, Hanasusanto [86, Theorem 3.4.3] recasts (21b) as a set of conic constraints. Different from [31], Li et al. [124, Theorem 1] strengthen $\mathcal{F}(\mu, \Sigma)$ by requiring that $\mathbb{P}$ is $\alpha$-unimodal (a generalized notion of unimodality; see Dharmadhikari and Joag-Dev [62] for the definition). They show that constraint (21b) is equivalent to a set of second-order conic constraints. Hanasusanto [86, Example 3.4.4] considers a similar ambiguity set, which bounds the true covariance matrix from above by $\Sigma$, and recasts (21b) as linear matrix inequalities. Stellato [209] also considers an ambiguity set similar to that of Li et al. [124] but requires $\mathbb{P}$ to be centered around $\mu$. In that case, Stellato [209, Section 4.1.1] recasts (21b) as a single second-order conic constraint. There are works that consider other shape information and provide tractable conservative approximations of (21b) (i.e., maintaining computational tractability at a potential cost of exactness). For example, Chen et al.
[42] replace the covariance information in $\mathcal{F}(\mu, \Sigma)$ with bounds on forward and backward deviations, which capture the asymmetry of $\mathbb{P}$, and derive a conservative approximation of (21b) via second-order conic constraints. Li et al. [123] drop the covariance restriction from $\mathcal{F}(\mu, \Sigma)$ while adding the requirement that $\mathbb{P}$ is log-concave and supported on an ellipsoid centered at $\mu$. For this case, Li et al. [123] derive conservative and relaxing approximations of (21b), all via second-order conic constraints. Postek et al. [180] replace the covariance information in $\mathcal{F}(\mu, \Sigma)$ with the mean absolute deviation (MAD) from the mean and further require that the components of $\omega$ are independent. For that case, Postek et al. [180] derive a conservative approximation of (21b) based on second-order conic constraints.

The special case of combinatorial DRCCPs with individual chance constraints is in general intractable because of the binary decision variables. Nevertheless, various formulation strengthening and algorithmic techniques can be applied to solve these problems more effectively. For example, Ahmed and Papageorgiou [3] exploit supermodularity of their distributionally robust set covering problem to derive a stronger and compact reformulation. Zhang et al. [248] derive a submodular relaxation of their DRCCP reformulation for a general binary packing problem and apply extended polymatroid inequalities. Zhang et al. [252] integrate various algorithmic techniques, including coefficient strengthening and structure-aware reformulation, into a branch-and-price algorithm to solve a bin packing problem.

Tractable reformulations for distributionally robust joint chance constraints, i.e., constraint (21b) with $m \ge 2$, are much scarcer than for individual chance constraints. Indeed, Hanasusanto et al.
[88, Section 2.3] show that DRCCP becomes NP-hard if the ambiguity set involves any non-homogeneous dispersion measure (e.g., covariance as in $\mathcal{F}(\mu, \Sigma)$) or any non-conic support (e.g., a hyperrectangle), or if $T(\omega)$ involves any uncertainty (i.e., if $T(\omega)$ is not a fixed data matrix $T \in \mathbb{R}^{m \times n}$). Nevertheless, tractable reformulations do exist for ambiguity sets different from $\mathcal{F}(\mu, \Sigma)$ or for chance constraints less general than (21b). For example, Hanasusanto et al. [88, Theorem 2] characterize an ambiguity set by the mean value, a positively homogeneous dispersion measure (e.g., MAD), and a conic support of $\omega$, and derive a second-order conic reformulation of constraint (21b), in which $T(\omega) = T$. Xie and Ahmed [234, Theorem 2] consider a two-sided variant of (21b) with $m = 2$ and $T_1(\omega) = -T_2(\omega)$ and derive a second-order conic reformulation of constraint (21b) with regard to ambiguity set $\mathcal{F}(\mu, \Sigma)$. Xie and Ahmed [233] derive exact and tractable reformulations of (21b) with regard to multiple types of ambiguity sets, e.g., when $\mathcal{F}(\beta)$ involves linear moment constraints only (i.e., on the mean value of $\omega$) or when $\mathcal{F}(\beta)$ consists of a single (possibly nonlinear) moment constraint. Xie et al. [237] consider a subclass of constraints (21b) with separable uncertainties across individual inequalities, i.e., each row of $[T(\omega); r(\omega)]$ involves a different set of uncertain parameters and, correspondingly, a different ambiguity set. They show that, if either $T(\omega)$ or $r(\omega)$ involves no uncertainty, then (21b) admits an exact and tractable reformulation by applying the Bonferroni approximation (or union bound; see Bonferroni [28]).

Various conservative approximations for distributionally robust joint chance constraints have been proposed. Chen et al. [41] propose to approximate the chance constraint in (21b) by using CVaR and subsequently approximate the resulting distributionally robust CVaR (DR-CVaR) constraint via a classical inequality of order statistics. These two layers of approximation lead to a set of second-order conic constraints. Later, Zymler et al. [254] show that the second-layer approximation can be circumvented by deriving an exact reformulation of the DR-CVaR constraint, yielding a linear matrix inequality approximation of (21b). The approximations of [41] and [254] can both be further improved by tuning certain scaling parameters. Unfortunately, it appears to be difficult to simultaneously optimize such scaling parameters and the decision $x$ in DRCCP. Cheng et al. [45] obtain a different approximation from that of [254] when different rows of $T(\omega)$ are independent.

Figure 2: Optimal values of ED-$\mathcal{F}(\mu, \Sigma)$ and ED-$\mathcal{F}(\mu, \Sigma, \alpha)$ with various $\phi$ and $\alpha$. (a) Optimal value vs. $\phi$. (b) Optimal value vs. $\alpha$.

In Figs. 2a–2b, we summarize a case study of a distributionally robust chance-constrained economic dispatch (ED) problem that appears in Li et al. [124] to demonstrate the difference between $\mathcal{F}(\mu, \Sigma)$ and an alternative ambiguity set that incorporates $\alpha$-unimodality into $\mathcal{F}(\mu, \Sigma)$, denoted by $\mathcal{F}(\mu, \Sigma, \alpha)$. Their case study uses the IEEE 30-bus system and incorporates two uncertain parameters, representing prediction errors of the forecast power outputs at two wind farms. The formulation and parameters of this problem can be found in [124, Section 5.1]. In particular, we assume that the uncertainties are $\alpha$-unimodal with a mode at $[0, 0]^\top$ and have a mean value $\mu = \phi [1, 1]^\top$ with $\phi \in \{-3, -2, \ldots, 3\}$. In Fig.
2a, we compare the optimal value of ED with regard to $\mathcal{F}(\mu, \Sigma)$ and that of ED with regard to $\mathcal{F}(\mu, \Sigma, \alpha)$ with $\alpha = 1$ and various $\phi$ values. From this figure, we observe that the optimal value of ED-$\mathcal{F}(\mu, \Sigma)$ is consistently higher than that of ED-$\mathcal{F}(\mu, \Sigma, \alpha)$. This confirms that incorporating unimodality into the ambiguity set makes DRCCP less conservative. In Fig. 2b, we compare the optimal values of ED-$\mathcal{F}(\mu, \Sigma)$ and ED-$\mathcal{F}(\mu, \Sigma, \alpha)$ with $\phi = 0$ and various $\alpha$ values. From this figure, we observe that, although the discrepancy between ED-$\mathcal{F}(\mu, \Sigma)$ and ED-$\mathcal{F}(\mu, \Sigma, \alpha)$ declines as $\alpha$ increases, the convergence is sub-linear (in fact, it takes place only once $\alpha$ exceeds a threshold, as shown in Fig. 2b). This demonstrates the significant influence of unimodality upon the ambiguity set and the corresponding DRCCP.

The case study just described highlights the utility of available distribution information in reducing the degree of conservatism. In this regard, moment ambiguity sets are known to be more conservative than their counterparts based on discrepancy measures (e.g., a Wasserstein ambiguity set) when more data samples are available. On the other hand, there is a trade-off between conservatism and tractability: unlike with moment-based ambiguity sets, DRCCP with a Wasserstein ambiguity set is not polynomially solvable in general [236]. However, there have been recent developments in MIP formulations for DRCCP under Wasserstein ambiguity, which we describe in the next section.

Due to its desirable statistical properties, the so-called
Wasserstein ambiguity set has witnessed an explosion of interest. The Wasserstein ambiguity set $\mathcal{F}(\mathbb{P}_N, \theta)$ is defined as the $\theta$-radius Wasserstein ball of distributions on $\mathbb{R}^d$ around the empirical distribution $\mathbb{P}_N$, where the 1-Wasserstein distance, based on a norm $\|\cdot\|$, between two distributions $\mathbb{P}$ and $\mathbb{P}'$ is used. This distance is defined as
\begin{equation*}
d_W(\mathbb{P}, \mathbb{P}') := \inf_{\Pi} \big\{\mathbb{E}_{(\omega, \omega') \sim \Pi}[\|\omega - \omega'\|] : \Pi \text{ has marginal distributions } \mathbb{P}, \mathbb{P}'\big\}.
\end{equation*}
The Wasserstein ambiguity set is then defined as $\mathcal{F}(\mathbb{P}_N, \theta) := \{\mathbb{P} : d_W(\mathbb{P}_N, \mathbb{P}) \le \theta\}$. Given a decision $x \in \mathcal{X}$ and random realization $\omega \in \mathbb{R}^d$, we first define a safety set, $\mathcal{S}(x)$, of outcomes such that $\mathcal{S}(x) = \{\omega \in \Omega : x \in \mathcal{P}(\omega)\}$. The distance from $\omega$ to the unsafe set is
\begin{equation}
\mathrm{dist}(\omega, \mathcal{S}(x)) := \inf_{\omega' \in \mathbb{R}^d} \big\{\|\omega - \omega'\| : \omega' \notin \mathcal{S}(x)\big\}. \tag{23}
\end{equation}
Chen et al. [43, Theorem 3] and Xie [232, Proposition 1] show that the formulation
\begin{align}
\min_{x,v,u}\quad & c^\top x \notag\\
\text{s.t.}\quad & x \in \mathcal{X},\ v \ge 0,\ u_i \ge 0, \quad i \in [N], \tag{24a}\\
& \mathrm{dist}(\omega_i, \mathcal{S}(x)) \ge v - u_i, \quad i \in [N], \tag{24b}\\
& \epsilon v \ge \theta + \frac{1}{N} \sum_{i \in [N]} u_i \tag{24c}
\end{align}
is an equivalent formulation of (21), by using the dual representation for the worst-case probability $\mathbb{P}[x \notin \mathcal{P}(\omega)]$ under the Wasserstein ambiguity set $\mathbb{P} \in \mathcal{F}(\mathbb{P}_N, \theta)$ provided in [27, 76, 153]. (See also Hota et al. [96] for a deterministic non-convex reformulation of (21) and a CVaR-based inner approximation of (21) for certain safety sets.) Note that formulation (24) is non-convex due to constraint (24b). However, for certain safety sets $\mathcal{S}(\cdot)$, MIP reformulations are possible [43, 99, 232]. Therefore, we can once again formulate a deterministic equivalent model and solve it using off-the-shelf optimization software, thereby enabling the usage of these models by practitioners.

In this section, we consider joint chance constraints with RHS uncertainty under a certain common form of the safety set. In particular, let
\begin{equation}
\mathcal{S}(x) := \{\omega : Tx \ge r(\omega)\}, \tag{25}
\end{equation}
where $r(\omega) := B\omega + e$ for a given $m \times d$ data matrix $B$ and $e \in \mathbb{R}^m$, and $T$ is a given $m \times n$ data matrix. For $m = 1$ (resp. $m > 1$), we say that the problem is an individual (resp. joint) chance-constrained problem with RHS uncertainty. Let $T_j$ and $B_j$ be the row vectors corresponding to the $j$th rows of $T$ and $B$, respectively. In this case, the distance function is evaluated as [43]
\begin{equation}
\mathrm{dist}(\omega, \mathcal{S}(x)) = \max\left\{0,\ \min_{j \in [m]} \frac{T_j x - B_j \omega - e_j}{\|B_j\|_*}\right\}, \tag{26}
\end{equation}
where $\|\cdot\|_*$ is the dual norm. We can then introduce binary variables $z$ to capture the non-convex constraint (24b) and arrive at the mixed-integer linear program [43, Proposition 2]
\begin{align}
\min_{z,u,v,x}\quad & c^\top x \tag{27a}\\
\text{s.t.}\quad & z \in \{0, 1\}^N,\ v \ge 0,\ u_i \ge 0,\ i \in [N],\ x \in \mathcal{X}, \tag{27b}\\
& \epsilon v \ge \theta + \frac{1}{N} \sum_{i \in [N]} u_i, \tag{27c}\\
& M_i (1 - z_i) \ge v - u_i, \quad i \in [N], \tag{27d}\\
& \frac{T_j x - B_j \omega_i - e_j}{\|B_j\|_*} + M_i z_i \ge v - u_i, \quad i \in [N],\ j \in [m], \tag{27e}
\end{align}
where $M_i$, $i \in [N]$, is a sufficiently large big-$M$ coefficient.

A few remarks are in order.
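One remark that may help intuition, sketched here under simplifying assumptions: for a fixed decision $x$, conditions (24b)–(24c) can be checked without a MIP solver. Setting $u_i = \max\{0, v - \mathrm{dist}(\omega_i, \mathcal{S}(x))\}$ is optimal for any $v$, so $x$ is feasible exactly when $\epsilon v - \frac{1}{N}\sum_i \max\{0, v - \mathrm{dist}(\omega_i, \mathcal{S}(x))\} \ge \theta$ for some $v \ge 0$; the left-hand side is concave and piecewise linear in $v$, so it suffices to test $v = 0$ and the sample distances. The code assumes $\theta > 0$ and a scalar safety set $\{\omega : x \ge \omega\}$ (that is, $m = d = 1$, $T = B = 1$, $e = 0$ in (25)–(26)); the function names and data are hypothetical.

```python
# Feasibility check of (24b)-(24c) for a fixed x, assuming theta > 0.
# Scalar setting (an assumption for illustration): safety set {omega : x >= omega}.

def dist_to_unsafe_rhs(x, omega):
    """Distance (26) with m = d = 1, T = B = 1, e = 0 and the absolute-value norm."""
    return max(0.0, x - omega)

def wasserstein_feasible(x, samples, eps, theta):
    """Return True if some v >= 0 satisfies (24c) once u_i = max(0, v - dist_i)."""
    n = len(samples)
    dists = [dist_to_unsafe_rhs(x, w) for w in samples]

    def slack(v):
        # eps * v - (1/N) * sum_i u_i with the smallest u_i allowed by (24b)
        return eps * v - sum(max(0.0, v - d) for d in dists) / n

    # slack is concave piecewise linear with breakpoints at the distances and
    # final slope eps - 1 < 0, so its maximum is at v = 0 or at a breakpoint.
    return max(slack(v) for v in [0.0] + dists) >= theta

samples = [float(i) for i in range(1, 11)]   # omega_i = 1, 2, ..., 10
eps, theta = 0.2, 0.05

print(wasserstein_feasible(8.0, samples, eps, theta))    # SAA solution for these samples
print(wasserstein_feasible(10.0, samples, eps, theta))   # largest sample
```

In this toy instance, $x = 8$ satisfies the nominal SAA constraint for these samples (exactly two of the ten samples exceed it) but fails the check for a positive radius, whereas $x = 10$ passes. This mirrors the relationship between the nominal SAA problem and the Wasserstein formulation.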
The computational studies of [43, 232] indicate that this MIP reformulation is difficult to solve in certain cases: state-of-the-art solvers terminate with large optimality gaps after a one-hour time limit. To address this challenge, Ho-Nguyen et al. [91] propose a number of results that yield an order-of-magnitude improvement in solution times. Note that formulation (27) is not immediately amenable to the improvements we described for the SAA counterpart. For example, constraints (27e) do not have the mixing structure that the SAA counterpart benefited greatly from. In particular, the continuous variables $u_i$ are not shared across scenarios, whereas the mixing set requires common continuous variables. On the other hand, as argued in [91], the SAA counterpart is a relaxation of (27). By making a key observation that relates the nominal SAA problem for $\mathbb{P}_N$ to formulation (27), Ho-Nguyen et al. [91] give a stronger formulation and valid inequalities based on the same set of binary variables $z$. Furthermore, this strengthening does have the mixing structure. They also use pre-processing techniques to reduce the formulation size drastically. On a related note, Ji and Lejeune [99] give a different MIP formulation of (21) under Wasserstein ambiguity, with additional assumptions on the support of $\omega$.

LHS uncertainty

In this section, we consider joint chance constraints with LHS uncertainty under a common form of safety set. In particular, let

\begin{equation}
S(x) := \{\omega : T(\omega) x \ge r(\omega)\}, \tag{28}
\end{equation}

where $r_j(\omega) := b^\top \omega_j + e_j$, $j \in [m]$, for a given vector $b \in \mathbb{R}^\kappa$; $\omega_j$, $j \in [m]$, is a projection of $\omega$ onto a $\kappa$-dimensional vector; and $e \in \mathbb{R}^m$. Also, let the $j$th row of $T(\omega)$ be given by $T_j(\omega) := \omega^\top A + T_j$ for some $n \times \kappa$ data matrix $A^\top$ and $T \in \mathbb{R}^{m \times n}$.
In this case, the distance function is measured by

\begin{equation}
\operatorname{dist}(\omega, S(x)) = \max\left\{0,\ \min_{j \in [m]} \frac{T_j(\omega) x - r_j(\omega)}{\|A^\top x - b\|_*}\right\}. \tag{29}
\end{equation}

We can then introduce binary variables $z$ to represent the non-convex constraint (24b) and make a transformation of variables to arrive at the mixed-integer conic program ([232, Theorem 2] and [44, Proposition 1] for $m = 1$)

\begin{align}
\min_{z,u,v,x}\quad & c^\top x \tag{30a}\\
\text{s.t.}\quad & z \in \{0,1\}^N,\ v \ge 0,\ u_i \ge 0,\ i \in [N],\ x \in X, \tag{30b}\\
& \epsilon v \ge \theta \|A^\top x - b\|_* + \frac{1}{N}\sum_{i \in [N]} u_i, \tag{30c}\\
& M_i (1 - z_i) \ge v - u_i,\quad i \in [N], \tag{30d}\\
& T_j(\omega_i) x - r_j(\omega_i) + M_i z_i \ge v - u_i,\quad i \in [N],\ j \in [m], \tag{30e}
\end{align}

where $M_i$, $i \in [N]$, is a sufficiently large Big-M coefficient, under the assumption that $A^\top x \ne b$ for any $x \in X$. This assumption can be relaxed with appropriate safeguards as described in [44, 92, 232].

As in the case of SAA, the computational studies show that the LHS uncertainty case is more challenging than the case of RHS uncertainty only. First, the resulting formulation is no longer linear, but conic. Furthermore, the coefficients of the common variables $x$ are scenario-dependent, unlike in the RHS uncertainty case. So it is not clear whether enhancements similar to those that Ho-Nguyen et al. [91] performed for the RHS uncertainty case can be made here. To this end, Ho-Nguyen et al. [92] establish the link between the DRCCP and its SAA counterpart for the LHS case to identify mixing-type valid inequalities and strengthen the formulation. This results in significant improvements in the performance of the resulting MIP formulation. Distributionally robust variants of the resource planning problem (described in Section 2.4) with $N = 100$ that are unsolvable, or terminate with large end gaps (40–80%), under the original formulation become solvable, or terminate with much smaller end gaps (<15%), with the enhancements proposed in [92].

For combinatorial DRCCPs, for which the decision variables are pure binary, further strengthening is possible.
Xie [232] observes the submodularity of the norm and the terms in the distance operator, and proposes the use of polymatroid inequalities to strengthen the formulation; significant improvements in the performance of the resulting algorithm are reported. Kılınç-Karzan et al. [107] show how the polymatroid inequalities derived from the conic constraint can be generalized to the case of mixed-binary decisions. In addition, Shen and Jiang [203] derive polymatroid inequalities when the random parameters are binary-valued and show how these inequalities can be further strengthened via mixing and lifting schemes. In a related line of work, Wang et al. [224] consider an assignment problem and derive lifted cover inequalities based on a bilinear reformulation of their DRCCP.

Conservative approximations for DRCCP with Wasserstein ambiguity are related to their SAA counterparts described in Section 2.5. The approach of Erdoğan and Iyengar [71] may be seen as a (robust) scenario approximation counterpart of [29, 33] with similar sample complexity results when the uncertainty set is defined by a Prohorov metric, which is related to a Wasserstein metric. Furthermore, for distributionally robust CCPs under Wasserstein ambiguity, Hota et al. [96] give an approximation based on a CVaR interpretation of the reformulation (see also [232] for this and two other approximations based on the scenario approximation and the VaR approximation).
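
As a small illustration of the RHS case, the following sketch evaluates the distance function (26) for each sample and then checks whether a fixed candidate $x$ admits some $(v, u)$ satisfying (24a)–(24c). It assumes the Euclidean norm (which is its own dual), a radius $\theta > 0$, and nonzero rows $B_j$; all function names are hypothetical and chosen only for this example. For fixed $x$, the smallest admissible $u_i$ is $\max\{0, v - \operatorname{dist}(\omega_i, S(x))\}$, so the left-over slack $\epsilon v - \frac{1}{N}\sum_i u_i$ is a concave piecewise-linear function of $v$ whose maximum is attained at one of the sample distances; the check only needs to scan those breakpoints.

```python
import math
from typing import Sequence

Vector = Sequence[float]
Matrix = Sequence[Sequence[float]]


def dot(a: Vector, b: Vector) -> float:
    return sum(ai * bi for ai, bi in zip(a, b))


def distance_to_unsafe_set(x: Vector, omega: Vector, T: Matrix, B: Matrix, e: Vector) -> float:
    """Equation (26), assuming the Euclidean norm (self-dual) and nonzero rows B_j."""
    margins = [
        (dot(T[j], x) - dot(B[j], omega) - e[j]) / math.sqrt(dot(B[j], B[j]))
        for j in range(len(T))
    ]
    return max(0.0, min(margins))


def satisfies_wasserstein_cc(x: Vector, samples: Sequence[Vector], T: Matrix, B: Matrix,
                             e: Vector, eps: float, theta: float) -> bool:
    """Check whether some (v, u) satisfies (24a)-(24c) for the fixed candidate x."""
    if theta <= 0.0:
        raise ValueError("this sketch assumes a positive Wasserstein radius theta")
    dists = [distance_to_unsafe_set(x, w, T, B, e) for w in samples]
    n = len(dists)

    def slack(v: float) -> float:
        # Left-hand side minus the sample term of (24c) with u_i = max(0, v - dist_i).
        return eps * v - sum(max(0.0, v - d) for d in dists) / n

    # The concave piecewise-linear slack is maximized at one of its breakpoints.
    return max(slack(v) for v in dists) >= theta - 1e-12
```

For instance, with the scalar constraint $x \ge \omega$, samples $\omega \in \{0, 1, 2, 3\}$, and $\epsilon = 0.25$, the candidate $x = 2.5$ satisfies the nominal SAA constraint (one violated sample out of four) but fails the check above for any $\theta > 0$, whereas $x = 4$ passes for $\theta = 0.2$. This is consistent with the observation that the SAA counterpart is a relaxation of the distributionally robust model.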

CCP is used to model risk-averse decision-making problems in a plethora of applications, ranging from chemical processes [89, 90] to water quality management [211]. In this section, we review a few recent and active application domains; this is not meant to be an exhaustive list.

Finance.

Chance constraints (or equivalently, VaR as defined in (3)) have been applied in finance to control risks. Linsmeier and Pearson [129] motivate the use of VaR as a risk measure in significantly volatile financial markets. VaR has been widely adopted (e.g., by the US Securities and Exchange Commission) as a method of quantifying risks. Lemus Rodriguez [122], El Ghaoui et al. [68], Natarajan et al. [159], Zymler et al. [255], Huang and Zhao [97], Yao et al. [242], Çetinkaya and Thiele [37], Barrieu and Scandolo [18], Lotfi and Zenios [138], Li et al. [126], and Ji and Lejeune [100] apply VaR and worst-case VaR (analogous to distributionally robust chance constraints) in finance via mathematical optimization. In addition, Rujeerapaiboon et al. [195] and Choi et al. [47] apply chance constraints in multi-period portfolio optimization.
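
To make the link between a chance constraint and VaR concrete, the sketch below works on an equally weighted sample of losses. It assumes the common convention that the VaR at level $1 - \epsilon$ is the smallest threshold $\eta$ with $\mathbb{P}(\text{loss} \le \eta) \ge 1 - \epsilon$, which may differ in detail from definition (3); the function names are hypothetical. Under this convention, the empirical chance constraint holds for a threshold $\eta$ exactly when the empirical VaR does not exceed $\eta$.

```python
import math
from typing import Sequence


def empirical_var(losses: Sequence[float], eps: float) -> float:
    """Smallest eta such that the empirical P(loss <= eta) is at least 1 - eps."""
    if not losses:
        raise ValueError("need at least one sample")
    if not 0.0 < eps < 1.0:
        raise ValueError("eps must lie strictly between 0 and 1")
    ordered = sorted(losses)
    # Number of samples that must lie at or below eta (tolerance guards float noise).
    k = max(1, math.ceil((1.0 - eps) * len(ordered) - 1e-9))
    return ordered[k - 1]


def chance_constraint_holds(losses: Sequence[float], eta: float, eps: float) -> bool:
    """Empirical version of P(loss <= eta) >= 1 - eps."""
    covered = sum(1 for loss in losses if loss <= eta)
    return covered >= (1.0 - eps) * len(losses) - 1e-9
```

With four sampled losses 1, 5, 2, 8 and $\epsilon = 0.25$, three samples must be covered, so the empirical VaR is 5, and the chance constraint holds for $\eta = 5$ but not for $\eta = 4.9$.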

Healthcare.

Chance constraints find applications in appointment scheduling (e.g., Deng and Shen [57]), surgery planning (e.g., Deng et al. [58], Wang et al. [223], and Zhang et al. [249]), operating room planning (e.g., Wang et al. [225], Wang et al. [224], and Najjarbashi and Lim [158]), vaccine allocation (e.g., Tanner and Ntaimo [212]), and social distancing during a pandemic (e.g., Duque et al. [67]), among others.

Power Systems.

Zhang and Li [244], Bienstock et al. [24], Zhang et al. [247], Duan et al. [66], Lubin et al. [141, 142], Dall’Anese et al. [52], Xie and Ahmed [234], Li et al. [123], and Li et al. [125] study chance-constrained variants of the optimal power flow problem. Ozturk et al. [171], Pozo and Contreras [181], and Wang et al. [222] consider chance constraints in the unit commitment problem. Vrakopoulou et al. [218], Pozo and Contreras [181], and Wu et al. [229] apply chance constraints to schedule electricity systems in the face of random outages and contingencies. Liu et al. [134], Liu et al. [135], Ravichandran et al. [191], and Zhang et al. [251] employ chance constraints to model an integrated system of power grid and electric vehicles. Other power system applications include coordinated load control (e.g., Zhang et al. [247] and Zhang et al. [250]), power grid topology control (e.g., Qiu and Wang [188] and Mazadi et al. [149]), and hydro power plant scheduling (e.g., Wu et al. [230] and Lodi et al. [137]). We refer the reader to a recent survey [214] and references therein for a more detailed review of CCP in energy management.

Transportation and Routing.

Dinh et al. [63], Moser et al. [155], Pelletier et al. [174], Du et al. [64], Wu et al. [231], Muraleedharan et al. [156], Ghosal and Wiesemann [79], and Florio et al. [74] study chance constraints in the optimal route design for vehicles (also see Cordeau et al. [49]). Blackmore et al. [26], Farrokhsiar and Najjaran [72], Banerjee et al. [16], Du Toit and Burdick [65], d. S. Arantes et al. [51], Castillo-Lopez et al. [36], and Oh et al. [170] study chance constraints to find paths for robots while avoiding obstacles.

Supply Chain, Logistics, and Scheduling.

Wang [220], Song and Luedtke [206], Hong et al. [95], Elçi and Noyan [69], Elçi et al. [70], and Noyan et al. [166] employ chance constraints in the design of networks for logistics and humanitarian relief. Lejeune and Ruszczyński [120], Murr and Prékopa [157], Zhang et al. [246], and Liu and Küçükyavuz [130] apply chance constraints in logistics. Gurvich et al. [84] study chance constraints in the staffing of call centers. Cohen et al. [48] apply chance constraints to cloud computing. Lu et al. [140] apply chance constraints in non-profit resource allocation.

Wireless Communication.

Li et al. [128], Soltani et al. [205], Mokari et al. [154], and Xu and Nallanathan [239] apply chance-constrained programming to accommodate the data rate requirement in orthogonal frequency division multiple access (OFDMA) systems. Ma and Sun [147] and Li et al. [127] apply chance constraints to the beamforming problem in communication networks.

In this survey, we reviewed mixed-integer conic formulations of CCPs under various distributional assumptions. We described the trade-offs between tractability and conservatism of the corresponding optimization models, as well as the trade-offs between the amount of distributional information used and over-fitting. There is some theoretical guidance on selecting sample sizes or other design parameters, such as the Wasserstein ball radius. However, this guidance is conservative; in practice, parameter choices are instead made and statistically verified using out-of-sample tests and cross-validation. There are many opportunities that arise from the recent developments in CCP models. As we outlined, these models often lead to mixed-integer conic formulations, which optimization software is now able to handle at modest sizes. The novel mixed-integer conic CCP models, when coupled with parallel developments in strengthening mixed-integer conic formulations [11–13, 107, 232, 248], are likely to enable the solution of large-scale problems before resorting to conservative approximations. Such strengthening approaches often exploit hidden submodularity, a recurring structure in many of the reformulations we discussed. Approximations continue to play an important role in applications where faster solution times are needed. In such cases, it is of interest to be able to provide some performance guarantees. In this regard, recent research in deriving strong relaxations and approximation algorithms for structured problems is promising.

We have primarily discussed single- or two-stage problems in this survey. Conceptually, one can also envision CCPs with multiple decision epochs. Zhang et al. [246] consider multi-stage CCPs and give valid inequalities for the SAA reformulation. Lulli and Sen [146] consider a multi-stage problem under a finite discrete demand distribution, and propose a model wherein non-anticipativity is enforced only for the scenarios that meet the desired service constraint.
The authors propose a branch-and-price algorithm for the resulting formulation. Andrieu et al. [8], González Grandón et al. [80], and references therein consider problems with dynamic chance constraints, and propose solution methods under certain continuous distributions. Meraklı and Küçükyavuz [151] consider the risk associated with parameter uncertainty in infinite-horizon Markov decision processes, and formulate this problem using a chance-constrained optimization framework. Models and methods for multi-stage CCPs are sparser due to their inherent difficulty, not only in modeling, which must take into account the time consistency of solutions, but also in designing scalable solution methods. This is an area of further research.

In closing, we believe that the developments in easy-to-implement reformulations will usher in new and exciting applications of CCPs, given the increasingly uncertain conditions of operations in various sectors (extreme weather, autonomous devices, renewable power, pandemics, political unrest, etc.).

Acknowledgments

Simge Küçükyavuz is supported, in part, by ONR grant N00014-19-1-2321 and NSF grant 2007814. Ruiwei Jiang is supported, in part, by NSF grant ECCS-1845980.

References

[1] A. Abdi and R. Fukasawa. On the mixing set with a knapsack constraint. Mathematical Programming, 157:191–217, 2016.
[2] S. Ahmed. Convex relaxations of chance constrained optimization problems. Optimization Letters, 8(1):1–12, 2014.
[3] S. Ahmed and D. J. Papageorgiou. Probabilistic set covering with correlations. Operations Research, 61(2):438–452, 2013.
[4] S. Ahmed and A. Shapiro. Solving chance-constrained stochastic programs via sampling and integer programming, 2008. TutORials in Operations Research, INFORMS 2008.
[5] S. Ahmed and W. Xie. Relaxations and approximations of chance constraints under finite distributions. Mathematical Programming, 170(1):43–65, 2018.
[6] S. Ahmed, J. Luedtke, Y. Song, and W. Xie. Nonanticipative duality, relaxations, and formulations for chance-constrained stochastic programs. Mathematical Programming, 162:51–81, 2017.
[7] G. J. Alexander and A. M. Baptista. A comparison of VaR and CVaR constraints on portfolio selection with the mean-variance model. Management Science, 50(9):1261–1273, 2004.
[8] L. Andrieu, R. Henrion, and W. Römisch. A model for dynamic chance constraints in hydro power reservoir management. European Journal of Operational Research, 207(2):579–589, 2010.
[9] P. Artzner, F. Delbaen, J. Eber, and D. Heath. Coherent measures of risk. Mathematical Finance, 9(3):203–228, 1999.
[10] A. Atamtürk. On the facets of the mixed-integer knapsack polyhedron. Mathematical Programming, 98(1-3):145–175, 2003.
[11] A. Atamtürk and A. Gómez. Submodularity in conic quadratic mixed 0-1 optimization. Operations Research, 68:609–630, 2020.
[12] A. Atamtürk and V. Narayanan. Polymatroids and mean-risk minimization in discrete optimization. Operations Research Letters, 36(5):618–622, 2008.
[13] A. Atamtürk and V. Narayanan. Conic mixed-integer rounding cuts. Mathematical Programming, 122:1–20, 2010.
[14] A. Atamtürk, G. L. Nemhauser, and M. W. Savelsbergh. The mixed vertex packing problem. Mathematical Programming, 89(1):35–53, 2000.
[15] E. Balas. Disjunctive programming. Annals of Discrete Mathematics, 5:3–51, 1979.
[16] A. G. Banerjee, M. Ono, N. Roy, and B. C. Williams. Regression-based LP solver for chance-constrained finite horizon optimal control with nonconvex constraints. In , pages 131–138, 2011.
[17] J. Barrera, T. Homem-de Mello, E. Moreno, B. K. Pagnoncelli, and G. Canessa. Chance-constrained problems and rare events: an importance sampling approach. Mathematical Programming, 157(1):153–189, 2016.
[18] P. Barrieu and G. Scandolo. Assessing financial model risk. European Journal of Operational Research, 242(2):546–556, 2015. ISSN 0377-2217.
[19] A. Ben-Tal and A. Nemirovski. Robust convex optimization. Mathematics of Operations Research, 23(4):769–805, 1998.
[20] J. F. Benders. Partitioning procedures for solving mixed-variables programming problems. Numerische Mathematik, 4(1):238–252, 1962.
[21] P. Beraldi and M. E. Bruni. An exact approach for solving integer problems under probabilistic constraints with random technology matrix. Annals of Operations Research, 177(1):127–137, 2009.
[22] P. Beraldi and A. Ruszczyński. A branch and bound method for stochastic integer programs under probabilistic constraints. Optimization Methods and Software, 17(3):359–382, 2002.
[23] P. Beraldi and A. Ruszczyński. The probabilistic set-covering problem. Operations Research, 50(6):956–967, 2002.
[24] D. Bienstock, M. Chertkov, and S. Harnett. Chance-constrained optimal power flow: Risk-aware network control under uncertainty.
SIAM Review, 56(3):461–495, 2014.
[25] J. R. Birge and F. Louveaux. Introduction to stochastic programming. Springer Verlag, New York, 1997.
[26] L. Blackmore, M. Ono, and B. C. Williams. Chance-constrained optimal path planning with obstacles. IEEE Transactions on Robotics, 27(6):1080–1094, 2011.
[27] J. Blanchet and K. Murthy. Quantifying distributional model risk via optimal transport. Mathematics of Operations Research, 44(2):565–600, 2019.
[28] C. E. Bonferroni. Teoria statistica delle classi e calcolo delle probabilità. Libreria internazionale Seeber, 1936.
[29] G. C. Calafiore and M. C. Campi. Uncertain convex programs: Randomized solutions and confidence levels. Mathematical Programming, 102(1):25–46, 2005.
[30] G. C. Calafiore and M. C. Campi. The scenario approach to robust control design. IEEE Transactions on Automatic Control, 51(5):742–753, 2006.
[31] G. C. Calafiore and L. El Ghaoui. On distributionally robust chance-constrained linear programs. Journal of Optimization Theory and Applications, 130(1):1–22, 2006.
[32] A. M. Campbell and B. W. Thomas. Probabilistic traveling salesman problem with deadlines. Transportation Science, 42(1):1–21, 2008.
[33] M. C. Campi and S. Garatti. The exact feasibility of randomized solutions of uncertain convex programs. SIAM Journal on Optimization, 19(3):1211–1230, 2008.
[34] M. C. Campi and S. Garatti. A sampling-and-discarding approach to chance-constrained optimization: feasibility and optimality. Journal of Optimization Theory and Applications, 148:257–280, 2011.
[35] C. C. Carøe and J. Tind. A cutting-plane approach to mixed 0-1 stochastic integer programs. European Journal of Operational Research, 101(2):306–316, 1997.
[36] M. Castillo-Lopez, P. Ludivig, S. A. Sajadi-Alamdari, J. L. Sanchez-Lopez, M. A. Olivares-Mendez, and H. Voos. A real-time approach for chance-constrained motion planning with dynamic obstacles. IEEE Robotics and Automation Letters, 5(2):3620–3625, 2020.
[37] E. Çetinkaya and A. Thiele. Data-driven portfolio management with quantile constraints. OR Spectrum, 37(3):761–786, 2015.
[38] A. Charnes and W. W. Cooper. Deterministic equivalents for optimizing and satisficing under chance constraints. Operations Research, 11(1):18–39, 1963.
[39] A. Charnes, W. Cooper, and G. Y. Symonds. Cost horizons and certainty equivalents: an approach to stochastic programming of heating oil. Management Science, 4(3):235–263, 1958.
[40] F. Chen and D. Krass. Inventory models with minimal service level constraints.
European Journal of Operational Research, 134:120–140, 2001.
[41] W. Chen, M. Sim, J. Sun, and C.-P. Teo. From CVaR to uncertainty set: Implications in joint chance-constrained optimization. Operations Research, 58(2):470–485, 2010.
[42] X. Chen, M. Sim, and P. Sun. A robust optimization perspective on stochastic programming. Operations Research, 55(6):1058–1071, 2007.
[43] Z. Chen, D. Kuhn, and W. Wiesemann. Data-driven chance constrained programs over Wasserstein balls. arXiv:1809.00210, 2018.
[44] Z. Chen, M. Sim, and H. Xu. Distributionally robust optimization with infinitely constrained ambiguity sets. Operations Research, 67(5):1328–1344, 2019.
[45] J. Cheng, E. Delage, and A. Lisser. Distributionally robust stochastic knapsack problem. SIAM Journal on Optimization, 24(3):1485–1506, 2014.
[46] M. Cheon, S. Ahmed, and F. Al-Khayyal. A branch-reduce-cut algorithm for the global optimization of probabilistically constrained linear programs. Mathematical Programming, 108(2–3):617–634, 2006.
[47] B.-G. Choi, N. Rujeerapaiboon, and R. Jiang. Multi-period portfolio optimization: Translation of autocorrelation risk to excess variance. Operations Research Letters, 44(6):801–807, 2016.
[48] M. C. Cohen, P. W. Keller, V. Mirrokni, and M. Zadimoghaddam. Overcommitment in cloud services: Bin packing with chance constraints. Management Science, 65(7):3255–3271, 2019.
[49] J.-F. Cordeau, G. Laporte, M. W. Savelsbergh, and D. Vigo. Vehicle routing. In C. Barnhart and G. Laporte, editors, Handbooks in operations research and management science, volume 14, chapter 6, pages 367–428. Elsevier, 2007.
[50] F. E. Curtis, A. Wächter, and V. M. Zavala. A sequential algorithm for solving nonlinear optimization problems with chance constraints. SIAM Journal on Optimization, 28(1):930–958, 2018.
[51] M. d. S. Arantes, C. F. M. Toledo, B. C. Williams, and M. Ono. Collision-free encoding for chance-constrained nonconvex path planning. IEEE Transactions on Robotics, 35(2):433–448, 2019.
[52] E. Dall’Anese, K. Baker, and T. Summers. Chance-constrained AC optimal power flow for distribution systems with renewables. IEEE Transactions on Power Systems, 32(5):3427–3438, 2017.
[53] J. Danielsson, B. N. Jorgensen, C. G. de Vries, and X. Yang. Optimal portfolio allocation under the probabilistic VaR constraint and incentives for financial innovation. Annals of Finance, 4(3):1614–2446, 2008.
[54] A. De. Boolean function analysis meets stochastic optimization: An approximation scheme for stochastic knapsack. In Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms, pages 1286–1305. SIAM, 2018.
[55] D. P. de Farias and B. Van Roy. On constraint sampling in the linear programming approach to approximate dynamic programming. Mathematics of Operations Research, 29(3):462–478, 2004.
[56] E. Delage and Y. Ye. Distributionally robust optimization under moment uncertainty with application to data-driven problems. Operations Research, 58(3):595–612, 2010.
[57] Y. Deng and S. Shen. Decomposition algorithms for optimizing multi-server appointment scheduling with chance constraints. Mathematical Programming, 157(1):245–276, 2016.
[58] Y. Deng, S. Shen, and B. Denton. Chance-constrained surgery planning under conditions of limited and ambiguous data. INFORMS Journal on Computing, 31(3):559–575, 2019.
[59] D. Dentcheva. Optimization models with probabilistic constraints. In G. Calafiore and F. Dabbene, editors, Probabilistic and Randomized Methods for Design under Uncertainty. London, 2006.
[60] D. Dentcheva and A. Ruszczyński. Optimization with stochastic dominance constraints. SIAM Journal on Optimization, 14(2):548–566, 2003.
[61] D. Dentcheva, A. Prékopa, and A. Ruszczyński. Concavity and efficient points of discrete distributions in probabilistic programming. Mathematical Programming, 89(1):55–77, 2000.
[62] S. Dharmadhikari and K. Joag-Dev. Unimodality, Convexity, and Applications. Elsevier, 1988.
[63] T. Dinh, R. Fukasawa, and J. Luedtke. Exact algorithms for the chance-constrained vehicle routing problem. Mathematical Programming, 172(1-2):105–138, 2018.
[64] B. Du, D. Sun, S. G. Manyam, and D. W. Casbeer. Cooperative air-ground vehicle routing using chance-constrained optimization. In , pages 392–397, 2020.
[65] N. E. Du Toit and J. W. Burdick. Probabilistic collision checking with chance constraints. IEEE Transactions on Robotics, 27(4):809–815, 2011.
[66] C. Duan, W. Fang, L. Jiang, L. Yao, and J. Liu. Distributionally robust chance-constrained approximate AC-OPF with Wasserstein metric. IEEE Transactions on Power Systems, 33(5):4924–4936, 2018.
[67] D. Duque, D. P. Morton, B. Singh, Z. Du, R. Pasco, and L. A. Meyers. Timing social distancing to avert unmanageable COVID-19 hospital surges. Proceedings of the National Academy of Sciences, 117(33):19873–19878, 2020.
[68] L. El Ghaoui, M. Oks, and F. Oustry. Worst-case value-at-risk and robust portfolio optimization: A conic programming approach. Operations Research, 51(4):543–556, 2003.
[69] Ö. Elçi and N. Noyan. A chance-constrained two-stage stochastic programming model for humanitarian relief network design. Transportation Research Part B: Methodological, 108:55–83, 2018.
[70] Ö. Elçi, N. Noyan, and K. Bülbül. Chance-constrained stochastic programming under variable reliability levels with an application to humanitarian relief network design.
Computers & Operations Research, 96:91–107, 2018.
[71] E. Erdoğan and G. Iyengar. Ambiguous chance constrained problems and robust optimization. Mathematical Programming, 107(1-2):37–61, 2005.
[72] M. Farrokhsiar and H. Najjaran. Unscented predictive motion planning of a nonholonomic system. In , pages 4480–4485, 2011.
[73] M. Fischetti and M. Monaci. Cutting plane versus compact formulations for uncertain (integer) linear programs. Mathematical Programming Computation, 4(3):239–273, 2012.
[74] A. M. Florio, R. F. Hartl, S. Minner, and J.-J. Salazar-González. A branch-and-price algorithm for the vehicle routing problem with stochastic demands and probabilistic duration constraints. Transportation Science, 55(1):122–138, 2021.
[75] D. Gade, S. Küçükyavuz, and S. Sen. Decomposition algorithms with parametric Gomory cuts for two-stage stochastic integer programs. Mathematical Programming, 144(1-2):39–64, 2014.
[76] R. Gao and A. J. Kleywegt. Distributionally robust stochastic optimization with Wasserstein distance. arXiv:1604.02199, 2016.
[77] S. Geetha and K. P. K. Nair. On stochastic spanning tree problem. Networks, 23(8):675–679, 1993.
[78] A. Geletu, A. Hoffmann, M. Klöppel, and P. Li. An inner-outer approximation approach to chance constrained optimization. SIAM Journal on Optimization, 27(3):1834–1857, 2017.
[79] S. Ghosal and W. Wiesemann. The distributionally robust chance-constrained vehicle routing problem. Operations Research, 68(3):716–732, 2020.
[80] T. González Grandón, R. Henrion, and P. Pérez-Aros. Dynamic probabilistic constraints under continuous random distributions. Mathematical Programming, 2020. doi: 10.1007/s10107-020-01593-z. Article in advance.
[81] V. Goyal and R. Ravi. Approximation algorithms for robust covering problems with chance constraints. Technical report, 2008. https://kilthub.cmu.edu/ndownloader/files/12232889.
[82] V. Goyal and R. Ravi. A PTAS for the chance-constrained knapsack problem with random item sizes. Operations Research Letters, 38(3):161–164, 2010.
[83] O. Günlük and Y. Pochet. Mixing mixed-integer inequalities. Mathematical Programming, 90(3):429–457, 2001.
[84] I. Gurvich, J. Luedtke, and T. Tezcan. Staffing Call Centers with Uncertain Demand Forecasts: A Chance-Constrained Optimization Approach. Management Science, 56(7):1093–1115, 2010.
[85] J. Han, K. Lee, C. Lee, K.-S. Choi, and S. Park. Robust optimization approach for a chance-constrained binary knapsack problem. Mathematical Programming, 157(1):277–296, 2016.
[86] G. A. Hanasusanto. Decision Making under Uncertainty: Robust and Data-Driven Approaches. PhD thesis, Imperial College London, 2015.
[87] G. A. Hanasusanto, V. Roitch, D. Kuhn, and W. Wiesemann. A distributionally robust perspective on uncertainty quantification and chance constrained programming. Mathematical Programming, 151(1):35–62, 2015.
[88] G. A. Hanasusanto, V. Roitch, D. Kuhn, and W. Wiesemann. Ambiguous joint chance constraints under mean and dispersion information. Operations Research, 65(3):751–767, 2017.
[89] R. Henrion and A. Möller. Optimization of a continuous distillation process under random inflow rate. Computers & Mathematics with Applications, 45(1):247–262, 2003.
[90] R. Henrion, P. Li, A. Möller, M. C. Steinbach, M. Wendt, and G. Wozny. Stochastic optimization for operating chemical processes under uncertainty. In M. Grötschel, S. O. Krumke, and J. Rambau, editors, Online Optimization of Large Scale Systems, pages 457–478. Springer Berlin Heidelberg, Berlin, Heidelberg, 2001.
[91] N. Ho-Nguyen, F. Kılınç-Karzan, S. Küçükyavuz, and D. Lee. Distributionally robust chance-constrained programs with right-hand side uncertainty under Wasserstein ambiguity. arXiv preprint arXiv:2003.12685, 2020. To appear in Mathematical Programming.
[92] N. Ho-Nguyen, F. Kılınç-Karzan, S. Küçükyavuz, and D. Lee. Strong formulations for distributionally robust chance-constrained programs with left-hand side uncertainty under Wasserstein ambiguity. arXiv preprint arXiv:2007.06750, 2020.
[93] B. Hobbs. Optimization methods for electric utility resource planning. European Journal of Operational Research, 83(1):1–20, 1995.
[94] L. J. Hong, Y. Yang, and L. Zhang. Sequential convex approximations to joint chance constrained programs: A Monte Carlo approach. Operations Research, 59(3):617–630, 2011.
[95] X. Hong, M. A. Lejeune, and N. Noyan. Stochastic network design for disaster preparedness. IIE Transactions, 47(4):329–357, 2015.
[96] A. R. Hota, A. Cherukuri, and J. Lygeros. Data-driven chance constrained optimization under Wasserstein ambiguity sets. In , pages 1501–1506, July 2019. doi: 10.23919/ACC.2019.8814677.
[97] X. Huang and T. Zhao. Mean-chance model for portfolio selection based on uncertain measure. Insurance: Mathematics and Economics, 59:243–250, 2014.
[98] H. Ishii, S. Shiode, T. Nishida, and Y. Namasuya. Stochastic spanning tree problem.
Discrete AppliedMathematics , 3(4):263–273, 1981. 3299] R. Ji and M. Lejeune. Data-driven distributionally robust chance-constrained optimization with Wassersteinmetric.

SSRN Electronic Journal , 2018. ISSN 1556-5068. doi: 10.2139/ssrn.3201356.[100] R. Ji and M. A. Lejeune. Risk-budgeting multi-portfolio optimization with portfolio and marginal riskconstraints.

Annals of Operations Research , 262(2):547–578, 2018.[101] N. Jiang and W. Xie. ALSO-X is better than CVaR: Convex approximations for chance constrained programsrevisited, 2020.[102] R. Jiang and Y. Guan. Data-driven chance constrained stochastic program.

Mathematical Programming , 158:291–327, 2016.[103] S. Joung and K. Lee. Robust optimization-based heuristic algorithm for the chance-constrained knapsackproblem using submodularity.

Optimization Letters , 14(1):101–113, 2020.[104] P. Kall and S. W. Wallace.

Stochastic Programming . Wiley John & Sons, Chichester, 1994.[105] S. Kataoka. A stochastic programming model.

Econometrica , 31(1/2):181–196, 1963.[106] F. Kılınç-Karzan, S. Küçükyavuz, and D. Lee. Joint chance-constrained programs and the intersection ofmixing sets through a submodularity lens. arXiv:1910.01353 , 2019.[107] F. Kılınç-Karzan, S. Küçükyavuz, and D. Lee. Conic mixed-binary sets: Convex hull characterizations andapplications. arXiv:2012.14698 , 2020.[108] W. K. Klein Haneveld.

On Integrated Chance Constraints , pages 113–138. Springer Berlin Heidelberg,Berlin, Heidelberg, 1986.[109] W. K. Klein Haneveld and M. H. van der Vlerk. Integrated chance constraints: Reduced forms and analgorithm.

Computational Management Science , 3(4):245–269, 2006.[110] O. Klopfenstein and D. Nace. A robust approach to the chance-constrained knapsack problem.

OperationsResearch Letters , 36(5):628–632, 2008.[111] A. Kogan and M. A. Lejeune. Threshold boolean form for joint probabilistic constraints with randomtechnology matrix.

Mathematical Programming , 147(1):391–427, 2014.[112] A. Kogan, M. A. Lejeune, and J. Luedtke. Erratum to: Threshold boolean form for joint probabilisticconstraints with random technology matrix.

Mathematical Programming , 155(1):617–620, 2016.[113] S. Küçükyavuz. On mixing sets arising in chance-constrained programming.

Mathematical Programming ,132(1-2):31–56, 2012.[114] S. Küçükyavuz and N. Noyan. Cut generation for optimization problems with multivariate risk constraints.

Mathematical Programming, 159(1-2):165–199, 2016.

[115] S. Küçükyavuz and S. Sen. An introduction to two-stage stochastic mixed-integer programming. In R. Batta and J. Peng, editors, TutORials in Operations Research: Leading Developments from INFORMS Communities, chapter 1, pages 1–27. INFORMS, 2017.

[116] C. M. Lagoa, X. Li, and M. Sznaier. Probabilistically constrained linear programs and risk-adjusted controller design. SIAM Journal on Optimization, 15(3):938–951, 2005.

[117] G. Laporte and F. V. Louveaux. The integer L-shaped method for stochastic integer programs with complete recourse. Operations Research Letters, 13(3):133–142, 1993.

[118] J. B. Lasserre and T. Weisser. Distributionally robust polynomial chance-constraints under mixture ambiguity sets. Mathematical Programming, 2019. doi: 10.1007/s10107-019-01434-8. Article in advance.

[119] M. Lejeune. Pattern-based modeling and solution of probabilistically constrained optimization problems. Operations Research, 60(6):1356–1372, 2012.

[120] M. A. Lejeune and A. Ruszczyński. An efficient trajectory method for probabilistic inventory-production-distribution problems. Operations Research, 55(2):378–394, 2007.

[121] M. A. Lejeune and S. Shen. Multi-objective probabilistically constrained programs with variable risk: Models for multi-portfolio financial optimization. European Journal of Operational Research, 252(2):522–539.

[122] G. J. Lemus Rodriguez. Portfolio optimization with quantile-based risk measures. PhD thesis, Massachusetts Institute of Technology, 1999.

[123] B. Li, R. Jiang, and J. L. Mathieu. Distributionally robust chance constrained optimal power flow assuming log-concave distributions. In , pages 1–7. IEEE, 2018.

[124] B. Li, R. Jiang, and J. L. Mathieu. Ambiguous risk constraints with moment and unimodality information. Mathematical Programming, 173(1):151–192, Jan 2019.

[125] B. Li, R. Jiang, and J. L. Mathieu. Distributionally robust chance-constrained optimal power flow assuming unimodal distributions with misspecified modes. IEEE Transactions on Control of Network Systems, 6(3):1223–1234, 2019.

[126] L. Li, H. Shao, R. Wang, and J. Yang. Worst-case range value-at-risk with partial information. SIAM Journal on Financial Mathematics, 9(1):190–218, 2018.

[127] Q. Li, A. M.-C. So, and W.-K. Ma. Distributionally robust chance-constrained transmit beamforming for multiuser MISO downlink. In , pages 3479–3483. IEEE, 2014.

[128] W. W. Li, Y. J. Zhang, A. M. So, and M. Z. Win. Slow adaptive OFDMA systems through chance constrained programming. IEEE Transactions on Signal Processing, 58(7):3858–3869, 2010.

[129] T. J. Linsmeier and N. D. Pearson. Value at risk.
Financial Analysts Journal, 56(2):47–67, 2000.

[130] X. Liu and S. Küçükyavuz. A polyhedral study of the static probabilistic lot-sizing problem. Annals of Operations Research, 261(1-2):233–254, 2018.

[131] X. Liu, S. Küçükyavuz, and J. Luedtke. Decomposition algorithms for two-stage chance-constrained programs. Mathematical Programming, 157(1):219–243, 2016.

[132] X. Liu, S. Küçükyavuz, and N. Noyan. Robust multicriteria risk-averse stochastic programming models. Annals of Operations Research, 259(1):259–294, 2017.

[133] X. Liu, F. Kılınç-Karzan, and S. Küçükyavuz. On intersection of two mixing sets with applications to joint chance-constrained programs. Mathematical Programming, 175:29–68, 2019.

[134] Z. Liu, F. Wen, and G. Ledwich. Optimal siting and sizing of distributed generators in distribution systems considering uncertainties. IEEE Transactions on Power Delivery, 26(4):2541–2551, 2011.

[135] Z. Liu, Q. Wu, S. S. Oren, S. Huang, R. Li, and L. Cheng. Distribution locational marginal pricing for optimal electric vehicle charging through chance constrained mixed-integer programming. IEEE Transactions on Smart Grid, 9(2):644–654, 2018.

[136] A. Lodi, E. Malaguti, G. Nannicini, and D. Thomopulos. Nonlinear chance-constrained problems with applications to hydro scheduling. Mathematical Programming, 2019. doi: 10.1007/s10107-019-01447-3. Article in advance.

[137] A. Lodi, E. Malaguti, G. Nannicini, and D. Thomopulos. Nonlinear chance-constrained problems with applications to hydro scheduling. Mathematical Programming, pages 1–40, 2019. doi: 10.1007/s10107-019-01447-3. Article in advance.

[138] S. Lotfi and S. A. Zenios. Robust VaR and CVaR optimization under joint ambiguity in distributions, means, and covariances. European Journal of Operational Research, 269(2):556–576, 2018.

[139] L. Lovász. Submodular functions and convexity. In Mathematical Programming The State of the Art: Bonn 1982, pages 235–257. Springer, Berlin, Heidelberg, 1983.

[140] M. Lu, H. Nakao, S. Shen, and L. Zhao. Non-profit resource allocation and service scheduling with cross-subsidization and uncertain resource consumptions. Omega, pages 102–191, 2020.

[141] M. Lubin, Y. Dvorkin, and S. Backhaus. A robust approach to chance constrained optimal power flow with renewable generation. IEEE Transactions on Power Systems, 31(5):3840–3849, 2016.

[142] M. Lubin, Y. Dvorkin, and L. Roald. Chance constraints for improving the security of AC optimal power flow. IEEE Transactions on Power Systems, 34(3):1908–1917, 2019.

[143] J. Luedtke. A branch-and-cut decomposition algorithm for solving chance-constrained mathematical programs with finite support. Mathematical Programming, 146:219–244, 2014.

[144] J. Luedtke and S. Ahmed. A sample approximation approach for optimization with probabilistic constraints.
SIAM Journal on Optimization, 19(2):674–699, 2008.

[145] J. Luedtke, S. Ahmed, and G. L. Nemhauser. An integer programming approach for linear programs with probabilistic constraints. Mathematical Programming, 122(2):247–272, 2010.

[146] G. Lulli and S. Sen. Branch-and-price algorithm for multistage stochastic integer programming with application to stochastic batch-sizing problems. 50(6):786–796, 2004.

[147] S. Ma and D. Sun. Chance constrained robust beamforming in cognitive radio networks. IEEE Communications Letters, 17(1):67–70, 2013.

[148] V. Marianov and M. Rios. A probabilistic quality of service constraint for a location model of switches in ATM communication networks. Annals of Operations Research, 96:237–243, 2000.

[149] M. Mazadi, W. D. Rosehart, O. P. Malik, and J. A. Aguado. Modified chance-constrained optimization applied to the generation expansion problem. IEEE Transactions on Power Systems, 24(3):1635–1636, 2009.

[150] M. Meraklı and S. Küçükyavuz. Vector-valued multivariate conditional value-at-risk. Operations Research Letters, 46(3):300–305, 2018.

[151] M. Meraklı and S. Küçükyavuz. Risk aversion to parameter uncertainty in Markov decision processes with an application to slow-onset disaster relief. IISE Transactions, 52:811–831, 2020.

[152] B. L. Miller and H. M. Wagner. Chance constrained programming with joint constraints. Operations Research, 13(6):930–965, 1965.

[153] P. Mohajerin Esfahani and D. Kuhn. Data-driven distributionally robust optimization using the Wasserstein metric: performance guarantees and tractable reformulations. Mathematical Programming, 171(1):115–166, Sep 2018.

[154] N. Mokari, S. Parsaeefard, P. Azmi, H. Saeedi, and E. Hossain. Robust ergodic uplink resource allocation in underlay OFDMA cognitive radio networks. IEEE Transactions on Mobile Computing, 15(2):419–431, 2016.

[155] D. Moser, R. Schmied, H. Waschl, and L. del Re. Flexible spacing adaptive cruise control using stochastic model predictive control. IEEE Transactions on Control Systems Technology, 26(1):114–127, 2018.

[156] A. Muraleedharan, A. T. Tran, H. Okuda, and T. Suzuki. Scenario-based model predictive speed controller considering probabilistic constraint for driving scene with pedestrian. In , pages 1–7, 2020.

[157] M. R. Murr and A. Prékopa. Solution of a product substitution problem using stochastic programming. In S. P. Uryasev, editor, Probabilistic Constrained Optimization: Methodology and Applications, pages 252–271. Springer US, Boston, MA, 2000.

[158] A. Najjarbashi and G. J. Lim. A decomposition algorithm for the two-stage chance-constrained operating room scheduling problem. IEEE Access, 8:80160–80172, 2020.

[159] K. Natarajan, D. Pachamanova, and M. Sim. Incorporating asymmetric distributional information in robust value-at-risk optimization.
Management Science, 54(3):573–585, 2008.

[160] A. Nemirovski. On safe tractable approximations of chance constraints. European Journal of Operational Research, 219(3):707–718, 2012.

[161] A. Nemirovski and A. Shapiro. Scenario approximation of chance constraints. In G. Calafiore and F. Dabbene, editors, Probabilistic and Randomized Methods for Design under Uncertainty, pages 3–48. Springer, 2005.

[162] A. Nemirovski and A. Shapiro. Convex approximations of chance constrained programs. SIAM Journal on Optimization, 17(4):969–996, 2007.

[163] T. Niimura, B. S. Kermanshahi, and R. Yokoyama. Multi-stage optimization of generation planning including power system reliability constraints. International Journal of Energy Systems, 10(3):144–148, 1990.

[164] E. Nikolova. Approximation algorithms for reliable stochastic combinatorial optimization. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, pages 338–351. Springer, 2010.

[165] N. Noyan and G. Rudolf. Optimization with multivariate conditional value-at-risk constraints. Operations Research, 61(4):990–1013, 2013.

[166] N. Noyan, M. Meraklı, and S. Küçükyavuz. Two-stage stochastic programming under multivariate risk constraints with an application to humanitarian relief network design. Mathematical Programming, pages 1–39, 2019. doi: 10.1007/s10107-019-01373-4. Article in advance.

[167] L. Ntaimo. Fenchel decomposition for stochastic mixed-integer programming. Journal of Global Optimization, 55(1):141–163, 2013.

[168] L. Ntaimo and S. Sen. The million variable “march” for stochastic combinatorial optimization. Journal of Global Optimization, 32:385–400, 2005.

[169] L. Ntaimo and S. Sen. A comparative study of decomposition algorithms for stochastic combinatorial optimization. Computational Optimization and Applications, 40:299–319, 2008.

[170] Y. Oh, K. Cho, Y. Choi, and S. Oh. Chance-constrained multi-layered sampling-based path planning for temporal logic-based missions. IEEE Transactions on Automatic Control, pages 1–15, 2020. doi: 10.1109/TAC.2020.3044273. Article in advance.

[171] U. A. Ozturk, M. Mazumdar, and B. A. Norman. A solution to the stochastic unit commitment problem using chance constrained programming. IEEE Transactions on Power Systems, 19(3):1589–1598, 2004.

[172] M. Padberg and G. Rinaldi. A branch-and-cut approach to a traveling salesman problem with side constraints. Management Science, 35(11):1393–1412, 1989.

[173] B. K. Pagnoncelli, S. Ahmed, and A. Shapiro. Sample average approximation method for chance constrained programming: Theory and applications.
Journal of Optimization Theory and Applications, 142(2):399–416, 2009.

[174] S. Pelletier, O. Jabali, and G. Laporte. The electric vehicle routing problem with energy consumption uncertainty. Transportation Research Part B: Methodological, 126:225–255, 2019.

[175] A. Peña-Ordieres, J. R. Luedtke, and A. Wächter. Solving chance-constrained problems via a smooth sample-based nonlinear approximation. SIAM Journal on Optimization, 30(3):2221–2250, 2020.

[176] G. C. Pflug. Some remarks on the value-at-risk and the conditional value-at-risk. In S. Uryasev, editor, Probabilistic Constrained Optimization: Methodology and Applications. Kluwer Academic Publishers, Dordrecht, 2000.

[177] G. C. Pflug and W. Römisch. Modelling, managing and measuring risk. World Scientific publishing, Singapore, 2007.

[178] J. Pintér. Deterministic approximations of probability inequalities. Zeitschrift für Operations-Research, 33(4):219–239, 1989.

[179] Y. Pochet and L. A. Wolsey. Polyhedra for lot-sizing with Wagner-Whitin costs. Mathematical Programming, 67:297–323, 1994.

[180] K. Postek, A. Ben-Tal, D. den Hertog, and B. Melenberg. Robust optimization with ambiguous stochastic constraints under mean and dispersion information. Operations Research, 66(3):814–833, 2018.

[181] D. Pozo and J. Contreras. A chance-constrained unit commitment with an n − k security criterion and significant wind generation. IEEE Transactions on Power Systems, 28(3):2842–2851, 2013.

[182] A. Prékopa. On probabilistic constrained programming. In Proceedings of the Princeton Symposium on Mathematical Programming, volume 113, page 138. Princeton, NJ, 1970.

[183] A. Prékopa. Contributions to the theory of stochastic programming. Mathematical Programming, 4(1):202–221, 1973.

[184] A. Prékopa. Dual method for the solution of a one-stage stochastic programming problem with random RHS obeying a discrete probability distribution. ZOR Zeitschrift für Operations Research Methods and Models of Operations Research, 34(6):441–461, 11 1990.

[185] A. Prékopa. Stochastic Programming. Kluwer Academic Publishers, Dordrecht/Boston/London, 1995.

[186] A. Prékopa. Probabilistic programming. In A. Ruszczyński and A. Shapiro, editors, Stochastic Programming, volume 10 of Handbooks in Operations Research and Management Science, pages 267–351. Elsevier, 2003.

[187] Y. Qi and S. Sen. The ancestral Benders’ cutting plane algorithm with multi-term disjunctions for mixed-integer recourse decisions in stochastic programming. Mathematical Programming, 161:193–235, 2017.

[188] F. Qiu and J. Wang. Chance-constrained transmission switching with guaranteed wind power utilization.
IEEE Transactions on Power Systems, 30(3):1270–1278, 2015.

[189] F. Qiu, S. Ahmed, S. S. Dey, and L. A. Wolsey. Covering linear programming with violations. INFORMS Journal on Computing, 26(3):531–546, 2014.

[190] H. Rahimian and S. Mehrotra. Distributionally robust optimization: A review. arXiv:1908.05659, 2019.

[191] A. Ravichandran, S. Sirouspour, P. Malysz, and A. Emadi. A chance-constraints-based control strategy for microgrids with energy storage and integrated electric vehicles. IEEE Transactions on Smart Grid, 9(1):346–359, 2018.

[192] R. Rockafellar. Coherent approaches to risk in optimization under uncertainty. In TutORials in Operations Research, pages 38–61. INFORMS, 2007.

[193] R. Rockafellar and S. Uryasev. Optimization of conditional value-at-risk. The Journal of Risk, 2(3):21–41, 2000.

[194] R. Rockafellar and S. Uryasev. Conditional value-at-risk for general loss distributions. Journal of Banking and Finance, 26(7):1443–1471, 2002.

[195] N. Rujeerapaiboon, D. Kuhn, and W. Wiesemann. Robust growth-optimal portfolios. Management Science, 62(7):2090–2109, 2016.

[196] A. Ruszczyński. Probabilistic programming with discrete distributions and precedence constrained knapsack polyhedra. Mathematical Programming, 93(2):195–215, 2002.

[197] A. Saxena, V. Goyal, and M. A. Lejeune. MIP reformulations of the probabilistic set covering problem. Mathematical Programming, 121(1):1–31, 2010.

[198] S. Sen. Relaxations for probabilistically constrained programs with discrete random variables. Operations Research Letters, 11(2):81–86, 1992.

[199] S. Sen. Stochastic Integer Programming Algorithms: Beyond Benders’ Decomposition. Wiley Handbook in OR/MS, World Wide Web, 2010.

[200] S. Sen and J. L. Higle. The C³ theorem and a D² algorithm for large scale stochastic mixed-integer programming: set convexification. Mathematical Programming, 104(1):1–20, 2005.

[201] S. Sen and H. D. Sherali. On the convergence of cutting plane algorithms for a class of nonconvex mathematical programs. Mathematical Programming, 106(2):203–223, 2006.

[202] A. Shapiro, D. Dentcheva, and A. Ruszczyński. Lectures on stochastic programming: modeling and theory. SIAM, 2009.

[203] H. Shen and R. Jiang. Chance-constrained set covering with Wasserstein ambiguity. Technical report, 2020. https://arxiv.org/abs/2010.05671.

[204] S. Shen. Using integer programming for balancing return and risk in problems with individual chance constraints.
Computers & Operations Research, 49:59–70, 2014.

[205] N. Y. Soltani, S. Kim, and G. B. Giannakis. Chance-constrained optimization of OFDMA cognitive radio uplinks. IEEE Transactions on Wireless Communications, 12(3):1098–1107, 2013.

[206] Y. Song and J. R. Luedtke. Branch-and-cut approaches for chance-constrained formulations of reliable network design problems. Mathematical Programming Computation, 5(4):397–432, 2013.

[207] Y. Song and S. Shen. Risk-averse shortest path interdiction. INFORMS Journal on Computing, 28(3):527–539, 2016.

[208] Y. Song, J. R. Luedtke, and S. Küçükyavuz. Chance-constrained binary packing problems. INFORMS Journal on Computing, 26(4):735–747, 2014.

[209] B. Stellato. Data-driven chance constrained optimization. Master’s thesis, ETH Zürich, 2014.

[210] C. Swamy. Risk-averse stochastic optimization: probabilistically-constrained models and algorithms for black-box distributions. In Proceedings of the twenty-second annual ACM-SIAM symposium on Discrete Algorithms, pages 1627–1646. SIAM, 2011.

[211] A. K. Takyi and B. J. Lence. Surface water quality management using a multiple-realization chance constraint method. Water Resources Research, 35(5):1657–1670, 1999.

[212] M. W. Tanner and L. Ntaimo. IIS branch-and-cut for joint chance-constrained stochastic programs and application to optimal vaccine allocation. European Journal of Operational Research, 207(1):290–296, 2010.

[213] S. R. Tayur, R. R. Thomas, and N. R. Natraj. An algebraic geometry algorithm for scheduling in presence of setups and correlated demands. Mathematical Programming, 69(1–3):369–401, 1995.

[214] W. Van Ackooij, R. Henrion, A. Moller, and R. Zorgati. Chance constrained programming and its applications to energy management. In I. Dritsas, editor, Stochastic Optimization - Seeing the Optimal for the Uncertain. IntechOpen, 2011.

[215] R. M. Van Slyke and R. Wets. L-shaped linear programs with applications to optimal control and stochastic programming. SIAM Journal on Applied Mathematics, 17(4):638–663, 1969.

[216] L. Vandenberghe, S. Boyd, and K. Comanor. Generalized Chebyshev bounds via semidefinite programming. SIAM Review, 49(1):52–64, 2007.

[217] J. Vielma, S. Ahmed, and G. Nemhauser. Mixed integer linear programming formulations for probabilistic constraints. Operations Research Letters, 40(3):153–158, 2012.

[218] M. Vrakopoulou, K. Margellos, J. Lygeros, and G. Andersson. A probabilistic framework for reserve scheduling and n − 1 security assessment of systems with high wind power penetration. IEEE Transactions on Power Systems, 28(4):3885–3896, 2013.

[219] M. R. Wagner. Stochastic 0–1 linear programming under limited distributional information.
Operations Research Letters, 36(2):150–156, 2008.

[220] J. Wang. The β-reliable median on a network with discrete probabilistic demand weights. Operations Research, 55(5):966–975, 2007.

[221] J. Wang and S. Shen. Risk and energy consumption tradeoffs in cloud computing service via stochastic optimization models. In Proceedings of the 5th IEEE/ACM International Conference on Utility and Cloud Computing (UCC 2012), Chicago, IL, 2012.

[222] Q. Wang, Y. Guan, and J. Wang. A chance-constrained two-stage stochastic program for unit commitment with uncertain wind power output. IEEE Transactions on Power Systems, 27(1):206–215, 2012.

[223] S. Wang, J. Li, and C. Peng. Distributionally robust chance-constrained program surgery planning with downstream resource. In , pages 1–6. IEEE, 2017.

[224] S. Wang, J. Li, and S. Mehrotra. A solution approach to distributionally robust chance-constrained assignment problems. Technical report, 2019.

[225] S. Wang, J. Li, and S. Mehrotra. Chance-constrained multiple bin packing problem with an application to operating room planning. Technical report, 2019.

[226] R. J.-B. Wets. Stochastic programming: Solution techniques and approximation schemes. In B. A., K. B., and G. M., editors, Mathematical Programming The State of the Art, Handbooks in Operations Research and Management Science, pages 566–603. Springer, 1983.

[227] R. J.-B. Wets. Stochastic programming. In Optimization, volume 1 of Handbooks in Operations Research and Management Science, chapter VIII, pages 573–629. Elsevier, 1989.

[228] H. Wu and S. Küçükyavuz. Probabilistic partial set covering with an oracle for chance constraints. SIAM Journal on Optimization, 29(1):690–718, 2019.

[229] H. Wu, M. Shahidehpour, Z. Li, and W. Tian. Chance-constrained day-ahead scheduling in stochastic power system operation. IEEE Transactions on Power Systems, 29(4):1583–1591, 2014.

[230] J. Wu, J. Zhu, G. Chen, and H. Zhang. A hybrid method for optimal scheduling of short-term electric power generation of cascaded hydroelectric plants based on particle swarm optimization and chance-constrained programming. IEEE Transactions on Power Systems, 23(4):1570–1579, 2008.

[231] P. Wu, J. Xie, and J. Chen. Safe path planning for unmanned aerial vehicle under location uncertainty. In , pages 342–347, 2020.

[232] W. Xie. On distributionally robust chance constrained programs with Wasserstein distance.
Mathematical Programming, 2019. doi: 10.1007/s10107-019-01445-5. Article in advance.

[233] W. Xie and S. Ahmed. On deterministic reformulations of distributionally robust joint chance constrained optimization problems. SIAM Journal on Optimization, 28(2):1151–1182, 2016.

[234] W. Xie and S. Ahmed. Distributionally robust chance constrained optimal power flow with renewables: A conic reformulation. IEEE Transactions on Power Systems, 33(2):1860–1867, 2018.

[235] W. Xie and S. Ahmed. On quantile cuts and their closure for chance constrained optimization problems. Mathematical Programming, 172:621–646, 2018.

[236] W. Xie and S. Ahmed. Bicriteria approximation of chance-constrained covering problems. Operations Research, 68(2):516–533, 2020.

[237] W. Xie, S. Ahmed, and R. Jiang. Optimized Bonferroni approximations of distributionally robust joint chance constraints. Mathematical Programming, 2019. doi: 10.1007/s10107-019-01442-8. Article in advance.

[238] H. Xu, C. Caramanis, and S. Mannor. Optimization under probabilistic envelope constraints. Operations Research, 60(3):682–699, 2012.

[239] L. Xu and A. Nallanathan. Energy-efficient chance-constrained resource allocation for multicast cognitive OFDM network. IEEE Journal on Selected Areas in Communications, 34(5):1298–1306, 2016.

[240] I. Yang. Wasserstein distributionally robust stochastic control: A data-driven approach. IEEE Transactions on Automatic Control, pages 1–8, 2020. doi: 10.1109/TAC.2020.3030884. Article in advance.

[241] W. Yang and H. Xu. Distributionally robust chance constraints for non-linear uncertainties. Mathematical Programming, 155(1-2):231–265, 2016.

[242] H. Yao, Y. Li, and K. Benson. A smooth non-parametric estimation framework for safety-first portfolio optimization. Quantitative Finance, 15(11):1865–1884, 2015.

[243] K. Yoda and A. Prékopa. Convexity and solutions of stochastic multidimensional 0-1 knapsack problems with probabilistic constraints. Mathematics of Operations Research, 41(2):715–731, 2016.

[244] H. Zhang and P. Li. Chance constrained programming for optimal power flow under uncertainty. IEEE Transactions on Power Systems, 26(4):2417–2424, 2011.

[245] M. Zhang and S. Küçükyavuz. Finitely convergent decomposition algorithms for two-stage stochastic pure integer programs. SIAM Journal on Optimization, 24(4):1933–1951, 2014.

[246] M. Zhang, S. Küçükyavuz, and S. Goel. A branch-and-cut method for dynamic decision making under joint chance constraints. Management Science, 60(5):1317–1333, 2014.

[247] Y. Zhang, S. Shen, and J. L. Mathieu. Distributionally robust chance-constrained optimal power flow with uncertain renewables and uncertain reserves provided by loads.
IEEE Transactions on Power Systems, 32(2):1378–1388, 2017.

[248] Y. Zhang, R. Jiang, and S. Shen. Ambiguous chance-constrained binary programs under mean-covariance information. SIAM Journal on Optimization, 28(4):2922–2944, 2018.

[249] Y. Zhang, S. Shen, and S. A. Erdogan. Solving 0–1 semidefinite programs for distributionally robust allocation of surgery blocks. Optimization Letters, 12(7):1503–1521, 2018.

[250] Y. Zhang, J. Dong, T. Kuruganti, S. Shen, and Y. Xue. Distributionally robust building load control to compensate fluctuations in solar power generation. In , pages 5857–5863. IEEE, 2019.

[251] Y. Zhang, M. Lu, and S. Shen. On the values of vehicle-to-grid electricity selling in electric vehicle sharing. Manufacturing & Service Operations Management, 2020. doi: 10.1287/msom.2019.0855. Article in advance.

[252] Z. Zhang, B. T. Denton, and X. Xie. Branch and price for chance-constrained bin packing. INFORMS Journal on Computing, 32(3):547–564, 2020.

[253] M. Zhao, K. Huang, and B. Zeng. A polyhedral study on chance constrained program with random right-hand side. Mathematical Programming, 166:19–64, 2017.

[254] S. Zymler, D. Kuhn, and B. Rustem. Distributionally robust joint chance constraints with second-order moment information. Mathematical Programming, 137(1-2):167–198, 2011.

[255] S. Zymler, D. Kuhn, and B. Rustem. Worst-case value at risk of nonlinear portfolios.