Two-stage linear decision rules for multi-stage stochastic programming

Bodur, Merve; Luedtke, James R.

doi:10.1007/s10107-018-1339-4

Two-stage linear decision rules for multi-stage stochastic programming

Full Length Paper
Series B
Published: 15 October 2018

Volume 191, pages 347–380, (2022)
Cite this article

Mathematical Programming Submit manuscript

2231 Accesses
11 Citations
3 Altmetric
Explore all metrics

Abstract

Multi-stage stochastic linear programs (MSLPs) are notoriously hard to solve in general. Linear decision rules (LDRs) yield an approximation of an MSLP by restricting the decisions at each stage to be an affine function of the observed uncertain parameters. Finding an optimal LDR is a static optimization problem that provides an upper bound on the optimal value of the MSLP, and, under certain assumptions, can be formulated as an explicit linear program. Similarly, as proposed by Kuhn et al. (Math Program 130(1):177–209, 2011) a lower bound for an MSLP can be obtained by restricting decisions in the dual of the MSLP to follow an LDR. We propose a new approximation approach for MSLPs, two-stage LDRs. The idea is to require only the state variables in an MSLP to follow an LDR, which is sufficient to obtain an approximation of an MSLP that is a two-stage stochastic linear program (2SLP). We similarly propose to apply LDR only to a subset of the variables in the dual of the MSLP, which yields a 2SLP approximation of the dual that provides a lower bound on the optimal value of the MSLP. Although solving the corresponding 2SLP approximations exactly is intractable in general, we investigate how approximate solution approaches that have been developed for solving 2SLP can be applied to solve these approximation problems, and derive statistical upper and lower bounds on the optimal value of the MSLP. In addition to potentially yielding better policies and bounds, this approach requires many fewer assumptions than are required to obtain an explicit reformulation when using the standard static LDR approach. A computational study on two example problems demonstrates that using a two-stage LDR can yield significantly better primal policies and modestly better dual policies than using policies based on a static LDR.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An exhaustive review of the metaheuristic algorithms for search and optimization: taxonomy, applications, and open challenges

Article 09 April 2023

A practical guide to multi-objective reinforcement learning and planning

Article Open access 13 April 2022

Survey on Lagrangian relaxation for MILP: importance, challenges, historical review, recent advancements, and opportunities

Article 11 July 2023

References

Ahmed, S.: Multistage stochastic optimization (2016). https://www.ima.umn.edu/materials/2015-2016/ND8.1-12.16/25386/mssp.pdf. New Directions Short Course on Mathematical Optimization
Bakir, I., Boland, N., Dandurand, B., Erera, A.: Scenario set partition dual bounds for multistage stochastic programming: a hierarchy of bounds and a partition sampling approach (2016). http://www.optimization-online.org/DB_FILE/2016/01/5311.pdf
Bampou, D., Kuhn, D.: Scenario-free stochastic programming with polynomial decision rules. In: 2011 50th IEEE Conference on Decision and Control and European Control Conference, pp. 7806–7812. IEEE (2011)
Ben-Tal, A., Goryashko, A., Guslitzer, E., Nemirovski, A.: Adjustable robust solutions of uncertain linear programs. Math. Program. 99(2), 351–376 (2004)
MathSciNet MATH Google Scholar
Ben-Tal, A., Nemirovski, A.: Robust optimization—methodology and applications. Math. Program. 92, 453–480 (2002)
MathSciNet MATH Google Scholar
Bertsimas, D., Caramanis, C.: Adaptability via sampling. In: 2007 46th IEEE Conference on Decision and Control, pp. 4717–4722. IEEE (2007)
Bertsimas, D., Georghiou, A.: Design of near optimal decision rules in multistage adaptive mixed-integer optimization. Oper. Res. 63(3), 610–627 (2015)
MathSciNet MATH Google Scholar
Birge, J.: Decomposition and partitioning methods for multistage stochastic linear programs. Oper. Res. 33(5), 989–1007 (1985)
MathSciNet MATH Google Scholar
Birge, J.R.: Aggregation bounds in stochastic linear programming. Math. Program. 31(1), 25–41 (1985)
MathSciNet MATH Google Scholar
Birge, J.R., Donohue, C.J., Holmes, D.F., Svintsitski, O.G.: A parallel implementation of the nested decomposition algorithm for multistage stochastic linear programs. Math. Program. 75(2), 327–352 (1996)
MathSciNet MATH Google Scholar
Calafiore, G., Campi, M.: The scenario approach to robust control design. IEEE Trans. Automat. Contr. 51, 742–753 (2006)
MathSciNet MATH Google Scholar
Casey, M.S., Sen, S.: The scenario generation algorithm for multistage stochastic linear programming. Math. Oper. Res. 30(3), 615–631 (2005)
MathSciNet MATH Google Scholar
Chen, X., Sim, M., Sun, P., Zhang, J.: A linear decision-based approximation approach to stochastic programming. Oper. Res. 56(2), 344–357 (2008)
MathSciNet MATH Google Scholar
Chen, Z.L., Powell, W.: Convergent cutting-plane and partial-sampling algorithm for multistage stochastic linear programs with recourse. J. Optim. Theory Appl. 102(3), 497–524 (1999)
MathSciNet MATH Google Scholar
de Maere d’Aertrycke, G., Shapiro, A., Smeers, Y.: Risk exposure and Lagrange multipliers of nonanticipativity constraints in multistage stochastic problems. Math. Methods Oper. Res. 77(3), 393–405 (2013)
MathSciNet MATH Google Scholar
Dowson, O., Kapelevich, L.: SDDP.jl: a Julia package for stochastic dual dynamic programming. Optimization Online (2017). http://www.optimization-online.org/DB_HTML/2017/12/6388.html
Dyer, M., Stougie, L.: Computational complexity of stochastic programming problems. Math. Program. 106(3), 423–432 (2006)
MathSciNet MATH Google Scholar
Eisner, M., Olsen, P.: Duality for stochastic programming interpreted as L.P. in $L_p$-space. SIAM J. Appl. Math 28(4), 779–792 (1975)
MathSciNet MATH Google Scholar
Fábián, C.I., Szőke, Z.: Solving two-stage stochastic programming problems with level decomposition. Comput. Manag. Sci. 4, 313–353 (2007)
MathSciNet MATH Google Scholar
Fu, M.C.: Feature article: optimization for simulation: theory vs. practice. INFORMS J. Comput. 14(3), 192–215 (2002)
MathSciNet MATH Google Scholar
Garstka, S., Wets, R.: On decision rules in stochastic programming. Math. Program. 7(1), 117–143 (1974)
MathSciNet MATH Google Scholar
Gassmann, H.I.: Mslip: a computer code for the multistage stochastic linear programming problem. Math. Program. 47(1), 407–423 (1990)
MathSciNet MATH Google Scholar
Georghiou, A., Wiesemann, W., Kuhn, D.: Generalized decision rule approximations for stochastic programming via liftings. Math. Program. 152(1–2), 301–338 (2015)
MathSciNet MATH Google Scholar
Girardeau, P., Leclere, V., Philpott, A.: On the convergence of decomposition methods for multistage stochastic convex programs. Math. Oper. Res. 40, 130–145 (2015)
MathSciNet MATH Google Scholar
Guigues, V.: SDDP for some interstage dependent risk-averse problems and application to hydro-thermal planning. Comput. Optim. Appl. 57, 167–203 (2014)
MathSciNet MATH Google Scholar
Guigues, V.: Convergence analysis of sampling-based decomposition methods for risk-averse multistage stochastic convex programs. SIAM J. Optim. 26, 2468–2494 (2016)
MathSciNet MATH Google Scholar
Hanasusanto, G.A., Kuhn, D., Wiesemann, W.: A comment on “computational complexity of stochastic programming problems”. Math. Program. 159(1), 557–569 (2016)
MathSciNet MATH Google Scholar
Heitsch, H., Römisch, W.: Scenario tree modeling for multistage stochastic programs. Math. Program. 118(2), 371–406 (2009)
MathSciNet MATH Google Scholar
Higle, J., Sen, S.: Multistage stochastic convex programs: duality and its implications. Ann. Oper. Res. 142(1), 129–146 (2006)
MathSciNet MATH Google Scholar
Homem de Mello, T.: On rates of convergence for stochastic optimization problems under non-independent and identically distributed sampling. SIAM J. Optim. 19, 524–551 (2008)
MathSciNet MATH Google Scholar
Hong, L.J., Nelson, B.L.: A brief introduction to optimization via simulation. In: Winter Simulation Conference, WSC ’09, Winter Simulation Conference, pp. 75–85 (2009)
Høyland, K., Wallace, S.: Generating scenario trees for multistage decision problems. Manag. Sci. 47(2), 295–307 (2001)
MATH Google Scholar
Infanger, G., Morton, D.: Cut sharing for multistage stochastic linear programs with interstage dependency. Math. Program. 75(2), 241–256 (1996)
MathSciNet MATH Google Scholar
Koivu, M.: Variance reduction in sample approximations of stochastic programs. Math. Program. 103, 463–485 (2005)
MathSciNet MATH Google Scholar
Kuhn, D., Wiesemann, W., Georghiou, A.: Primal and dual linear decision rules in stochastic and robust optimization. Math. Program. 130(1), 177–209 (2011)
MathSciNet MATH Google Scholar
Lemaréchal, C., Nemirovskii, A., Nesterov, Y.: New variants of bundle methods. Math. Program. 69, 111–147 (1995)
MathSciNet MATH Google Scholar
Lemaréchal, C., Nemirovskii, A., Nesterov, Y.: New variants of bundle methods. Math. Program. 69, 111–147 (1995)
MathSciNet MATH Google Scholar
Linderoth, J., Shapiro, A., Wright, S.: The empirical behavior of sampling methods for stochastic programming. Ann. Oper. Res. 142, 215–241 (2006)
MathSciNet MATH Google Scholar
Mak, W.K., Morton, D., Wood, R.: Monte Carlo bounding techniques for determining solution quality in stochastic programs. Oper. Res. Lett. 24, 47–56 (1999)
MathSciNet MATH Google Scholar
Nemirovski, A., Juditsky, A., Lan, G., Shapiro, A.: Robust stochastic approximation approach to stochastic programming. SIAM J. Optim. 19(4), 1574–1609 (2009)
MathSciNet MATH Google Scholar
Pennanen, T.: Epi-convergent discretizations of multistage stochastic programs via integration quadratures. Math. Program. 116, 461–479 (2009)
MathSciNet MATH Google Scholar
Pereira, M., Pinto, L.: Multi-stage stochastic optimization applied to energy planning. Math. Program. 52(1–3), 359–375 (1991)
MathSciNet MATH Google Scholar
Philpott, A., De Matos, V.: Dynamic sampling algorithms for multi-stage stochastic programs with risk aversion. Eur. J. Oper. Res. 218(2), 470–483 (2012)
MathSciNet MATH Google Scholar
Philpott, A., Guan, Z.: On the convergence of stochastic dual dynamic programming and related methods. Oper. Res. Lett. 36(4), 450–455 (2008)
MathSciNet MATH Google Scholar
Philpott, A., Pritchard, G.: Emi-doasa. Technical report, Electric Power Optimization Centre (2013). http://www.emi.ea.govt.nz/Content/Tools/Doasa/DOASA%20paper%20by%20SOL.pdf
Polyak, B., Juditsky, A.: Acceleration of stochastic approximation by averaging. SIAM J. Control Optim. 30, 838–855 (1992)
MathSciNet MATH Google Scholar
Powell, W.: Approximate Dynamic Programming: Solving the Curses of Dimensionality, vol. 703. Wiley, New York (2007)
MATH Google Scholar
Robbins, H., Monro, S.: A stochastic approximation method. Ann. Math. Stat. 22, 400–407 (1951)
MathSciNet MATH Google Scholar
Rockafellar, R., Wets, R.B.: Scenarios and policy aggregation in optimization under uncertainty. Math. Oper. Res. 16(1), 119–147 (1991)
MathSciNet MATH Google Scholar
Ruszczyński, A.: A regularized decomposition method for minimizing a sum of polyhedral functions. Math. Program. 35(3), 309–333 (1986)
MathSciNet MATH Google Scholar
Sen, S., Zhou, Z.: Multistage stochastic decomposition: a bridge between stochastic programming and approximate dynamic programming. SIAM J. Optim. 24(1), 127–153 (2014)
MathSciNet MATH Google Scholar
Shapiro, A.: Stochastic programming approach to optimization under uncertainty. Math. Program. 112, 183–220 (2008)
MathSciNet MATH Google Scholar
Shapiro, A.: Analysis of stochastic dual dynamic programming method. Eur. J. Oper. Res. 209(1), 63–72 (2011)
MathSciNet MATH Google Scholar
Shapiro, A.: Topics in stochastic programming. Universite Catholique de Louvain, CORE Lecture Series (2011)
Shapiro, A., Dentcheva, D., Ruszczyński, A.: Lectures on stochastic programming: modeling and theory, vol. 16. SIAM (2014)
Shapiro, A., Homem-de Mello, T.: On the rate of convergence of optimal solutions of monte carlo approximations of stochastic programs. SIAM J. Optim. 11(1), 70–86 (2000)
MathSciNet MATH Google Scholar
Shapiro, A., Nemirovski, A.: On complexity of stochastic programming problems. In: Jeyakumar, V., Rubinov, A. (eds.) Continuous Optimization, pp. 111–146. Springer, Heidelberg (2005)
Google Scholar
Shapiro, A., Tekaya, W., da Paulo Costa, J., Pereira Soares, M.: Risk neutral and risk averse stochastic dual dynamic programming method. Eur. J. Oper. Res. 224(2), 375–391 (2013)
MathSciNet MATH Google Scholar
Van Slyke, R.M., Wets, R.: L-shaped linear programs with applications to optimal control and stochastic programming. SIAM J. Appl. Math. 17(4), 638–663 (1969)
MathSciNet MATH Google Scholar
Vayanos, P., Kuhn, D., Rustem, B.: A constraint sampling approach for multi-stage robust optimization. Automatica 48(3), 459–471 (2012)
MathSciNet MATH Google Scholar
Wallace, S.W., Ziemba, W.T.: Applications of Stochastic Programming, vol. 5. SIAM, Philadelphia (2005)
MATH Google Scholar

Download references

Acknowledgements

The research of James Luedtke has been supported in part by the National Science Foundation under Grant CMMI-1634597, and by the U.S. Department of Energy, Office of Science, Office of Advanced Scientific Computing Research, Applied Mathematics program under Contract Number DE-AC02-06CH11347. The research of Merve Bodur has been supported in part by the Natural Sciences and Engineering Research Council of Canada under the Discovery Grant RGPIN-2018-04984. We would like to thank the reviewers for their careful and constructive comments that have significantly improved the paper. Also, we would like to thank Angelos Georghiou for providing data and clarifications on the inventory planning example, and Oscar Dowson for his assistance using the sddp.jl code.

Author information

Authors and Affiliations

Department of Mechanical and Industrial Engineering, University of Toronto, Toronto, Canada
Merve Bodur
Department of Industrial and Systems Engineering, University of Wisconsin-Madison, Madison, USA
James R. Luedtke

Authors

Merve Bodur
View author publications
You can also search for this author in PubMed Google Scholar
James R. Luedtke
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Merve Bodur.

Appendices

Data for the capacity expansion problem

See Tables 6, 7 and 8.

Table 6 Fixed annual cost and operation cost (Table 1 in [15])

Full size table

Table 7 Initial demand (Table 2 in [15])

Full size table

Table 8 Wind regimes (Table 3 in [15])

Full size table

Additional models for the capacity expansion example

1.1 Primal model and Benders decomposition

P-LDR-2S of the capacity expansion model is obtained by substituting

$$\begin{aligned} s_{ti}(\xi ^{t}) = \sum _{k \in [K_t]} {\Phi }_{tk}(\xi ^{t}) {\beta }_{tki} \end{aligned}$$

in (17). Dropping $\xi ^t$ dependences for variables to simplify the notation, we obtain

$$\begin{aligned} \min \,\,&\sum _{i \in [I]} \Big ( c_{1i}^{u^+} u^+_{1i} + c_{1i}^{u^-} u^-_{1i} + \sum _{j \in [J]} c_{1ij}^{x} x_{1ij}(\xi ^{t}) \Big ) + \sum _{j \in [J]} c_{1j}^zz_{1j} \nonumber \\&\qquad \qquad + \sum _{t \in [T]} \sum _{i \in [I]} c_{ti}^s\sum _{k \in [K_t]} \mathbb {E}\big [ {\Phi }_{tk}(\xi ^{t}) \big ] {\beta }_{tki} + \sum _{t \in [2,T]} \mathbb {E}[ Q_t({\beta },\xi ^{t}) ] \end{aligned}$$

(19a)

$$\begin{aligned} \text {s.t.}\,\,&{\beta }_{11i} - u^+_{1i} + u^-_{1i} = 0,&\quad i \in [I], \end{aligned}$$

(19b)

$$\begin{aligned}&{\beta }_{11i} - x_{1ij} \ge 0,&i \in [I], \ j \in [J], \end{aligned}$$

(19c)

$$\begin{aligned}&\sum _{i \in [I]} x_{1ij} + z_{1j} \ge d_{1j},&\quad j \in [J], \end{aligned}$$

(19d)

$$\begin{aligned}&0 \le z_{1j} \le d_{1j},&j \in [J], \end{aligned}$$

(19e)

$$\begin{aligned}&0 \le u^+_{1i} \le M_{1i}^{u^+}, \ \ 0 \le u^-_{1i} \le M_{1i}^{u^-}, \ \ 0 \le {\beta }_{11i} \le M_{1i}^{s},&i \in [I], \end{aligned}$$

(19f)

$$\begin{aligned}&x_{1ij} \ge 0,&i \in [I], \ j \in [J], \end{aligned}$$

(19g)

where, for $t \in [2,T]$, $Q_t({\beta },\xi ^{t})$ is defined as the optimal objective value of the following problem:

$$\begin{aligned} \min \,\,&\sum _{i \in [I]} \Big ( c_{ti}^{u^+} u^+_{i} + c_{ti}^{u^-} u^{-}_{i} + \sum _{j \in [J]} c_{tij}^{x} x_{ij} \Big ) + \sum _{j \in [J]} c_{tj}^zz_{j} \end{aligned}$$

(20a)

$$\begin{aligned} \text {s.t.} \,\,&u^+_{i} - u^-_{i} = \sum _{k \in [K_t]} {\Phi }_{tk}(\xi ^{t}) {\beta }_{tki} - \sum _{k \in [K_{t-1}]} {\Phi }_{tk}(\xi ^{t-1}) {\beta }_{t-1,k,i},&i \in [I], \end{aligned}$$

(20b)

$$\begin{aligned}&x_{ij} \le \sum _{k \in [K_t]} {\Phi }_{tk}(\xi ^{t}) {\beta }_{tki},&i \in [I], \ j \in [J], \end{aligned}$$

(20c)

$$\begin{aligned}&\sum _{i \in [I]} x_{ij} + z_{j} \ge d_{tj}(\xi ^{t}),&j \in [J], \end{aligned}$$

(20d)

$$\begin{aligned}&0 \le z_{j} \le d_{tj}(\xi ^{t}),&j \in [J], \end{aligned}$$

(20e)

$$\begin{aligned}&0 \le u^+_{i} \le M_{ti}^{u^+},&i \in [I], \end{aligned}$$

(20f)

$$\begin{aligned}&0 \le u^-_{i} \le M_{ti}^{u^-},&i \in [I], \end{aligned}$$

(20g)

$$\begin{aligned}&x_{ij} \ge 0,&i \in [I], \ j \in [J], \end{aligned}$$

(20h)

$$\begin{aligned}&{0\le M_{ti}^s - \sum _{k \in [K_t]} {\Phi }_{tk}(\xi ^{t}) {\beta }_{tki},}&i \in [I], \end{aligned}$$

(20i)

$$\begin{aligned}&{0 \le \sum _{k \in [K_t]} {\Phi }_{tk}(\xi ^{t}) {\beta }_{tki},}&i \in [I], \end{aligned}$$

(20j)

We note that P-LDR-2S of the capacity expansion model does not have relatively complete recourse since the recourse constraints (20i) and (20j) might be violated under some scenarios.

Let $\xi ^{T}_n, n \in [N]$, be an independent and identically distributed (i.i.d.) random sample of the random vector $\xi ^{T}$. We solve the obtained primal SAA problem with Benders decomposition, using a single aggregate cut per time-stage. That is, we have a master problem of the following form:

$$\begin{aligned} \min \,\,&\sum _{i \in [I]} \Big ( c_{1i}^{u^+} u^+_{1i} + c_{1i}^{u^-} u^-_{1i} + \sum _{j \in [J]} c_{1ij}^{x} x_{1ij}(\xi ^{t}) \Big ) + \sum _{j \in [J]} c_{1j}^zz_{1j} \nonumber \\&\qquad + \sum _{t \in [T]} \sum _{i \in [I]} c_{ti}^s\sum _{k \in [K_t]} \mathbb {E}\big [ {\Phi }_{tk}(\xi ^{t}) \big ] {\beta }_{tki} + \sum _{t \in [2,T]} \eta _t\end{aligned}$$

(21a)

$$\begin{aligned} \text {s.t.}\,\,&22b-22g, \end{aligned}$$

(21b)

$$\begin{aligned}&(\eta _t, {\beta }_{t11}, \cdots , {\beta }_{t K_tI} ) \in \mathcal {O}^t ,&\quad t \in [2,T] , \end{aligned}$$

(21c)

$$\begin{aligned}&({\beta }_{t11}, \cdots , {\beta }_{t K_tI} ) \in \mathcal {F}^t,&\quad t \in [2,T], \end{aligned}$$

(21d)

$$\begin{aligned}&{0 \le \sum _{k \in [K_t]} {\Phi }_{tk}(\xi ^{t}_n) {\beta }_{tki} \le M_{ti}^s,}&\quad t \in [2,T], i \in [I], \ n \in [N], \end{aligned}$$

(21e)

$$\begin{aligned}&\eta _t \ge 0,&\quad t \in [2,T]. \end{aligned}$$

(21f)

The variable $\eta _t$ represents the expected second-stage cost at period $t \in [2,T]$, i.e., $\frac{1}{N} \sum _{n \in [N]} [ Q_t({\beta },\xi ^{t}_n) ]$. Note that since all the original decision variables are defined to be nonnegative, and all the cost parameters are assumed to be nonnegative, $Q_t({\beta },\xi ^{t}) \ge 0$ for any given $\beta $, thus (21f) are valid. Constraints (21c) and (21d) correspond to the set of Benders optimality and feasibility cuts, respectively. As ${\beta }$ variables belong to the master problem, we add constraints (20i) and (20j) for each scenario in the sample to the master problem as (21e) which can be seen as an additional set of feasibility cuts.

The subproblem decomposes not only by scenario but also by stage. For $t \in [2,T]$ and $n \in [N]$, we have the corresponding subproblem (20a)–(20h), denoted by SP(t, n).

At every iteration of the Benders decomposition algorithm, we solve the master problem, get a candidate $\beta $ solution which is fixed in the subproblems, and solve all the subproblems. For $t \in [2,T]$, if there is at least one index $n \in [N]$ for which SP(t, n) is infeasible, then we generate a Benders feasibility cut and add it to the master problem. Otherwise, we generate a Benders optimality cut, but add it to the master problem only if it is violated at the current master problem solution. We repeat this procedure until all the subproblems are feasible and no violated optimality cuts are found.

1.2 Dual model and level method

Let $\lambda , \gamma , \theta ^+, \theta ^-, {\Gamma }^{u^+},{\Gamma }^{u^-}, {\Gamma }^{s}$ be the dual variables associated with the constraints (17b)–(17h) in (17), respectively. Then, the dual of (17) is:

$$\begin{aligned} \max \,\,&\mathbb {E}\sum _{t \in [T]} \Big [ \sum _{j \in [J]} d_{tj}(\xi ^{t}) \big ( \theta ^+_{tj}(\xi ^{t}) \, {- } \, \theta ^-_{tj}(\xi ^{t})\big ) \nonumber \\&\quad \ \ \, \, {- } \, \sum _{i \in [I]} \bigl (M_{ti}^{u^+} {\Gamma }^{u^+}_{ti}(\xi ^{t}) + M_{ti}^{u^-}{\Gamma }^{u^-}_{ti}(\xi ^{t}) + M_{ti}^{s} {\Gamma }^{s}_{ti}(\xi ^{t})\bigr ) \Big ] \end{aligned}$$

(22a)

$$\begin{aligned} \text {s.t.}\,\,&\lambda _{ti}(\xi ^{t}) - \mathbb {E}[ \lambda _{t+1,i}(\xi ^{t+1}) \mid \xi ^{t}] \nonumber \\&\ + \sum _{j \in [J]} \gamma _{tij}(\xi ^{t}) \, {- } \, {\Gamma }^{s}_{ti}(\xi ^{t}) \le c_{ti}^{s},&t \in [T], \ \mathbb {P}\text {-a.s.}, \ i \in [I], \end{aligned}$$

(22b)

$$\begin{aligned}&{- } \, {\Gamma }^{u^+}_{ti}(\xi ^{t})- \lambda _{ti}(\xi ^{t}) \le c_{ti}^{u^+},&t \in [T], \ \mathbb {P}\text {-a.s.}, \ i \in [I], \end{aligned}$$

(22c)

$$\begin{aligned}&{- } \, {\Gamma }^{u^-}_{ti}(\xi ^{t}) + \lambda _{ti}(\xi ^{t}) \le c_{ti}^{u^-},&t \in [T], \ \mathbb {P}\text {-a.s.}, \ i \in [I], \end{aligned}$$

(22d)

$$\begin{aligned}&\theta ^+_{tj}(\xi ^{t}){- } \theta ^-_{tj}(\xi ^{t}) \ \le c_{tj}^{z},&t \in [T], \ \mathbb {P}\text {-a.s.}, \ j \in [J], \end{aligned}$$

(22e)

$$\begin{aligned}&\theta ^+_{tj}(\xi ^{t}) - \gamma _{tij}(\xi ^{t}) \le c_{tij}^{x},&t \in [T], \ \mathbb {P}\text {-a.s.}, \ i \in [I], \ j \in [J], \end{aligned}$$

(22f)

$$\begin{aligned}&\gamma _{tij}(\xi ^{t}) \ge 0,&t \in [T], \ \mathbb {P}\text {-a.s.}, \ i \in [I], \ j \in [J], \end{aligned}$$

(22g)

$$\begin{aligned}&\theta ^+_{tj}(\xi ^{t}), {\theta ^-_{tj}(\xi ^{t})} \ge 0,&t \in [T], \ \mathbb {P}\text {-a.s.}, \ j \in [J], \end{aligned}$$

(22h)

$$\begin{aligned}&{\Gamma }^{u^+}_{ti}(\xi ^{t}), {\Gamma }^{u^-}_{ti}(\xi ^{t}), {\Gamma }^{s}_{ti}(\xi ^{t}) \, {\ge } \, 0&t \in [T], \ \mathbb {P}\text {-a.s.}, \ i \in [I]. \end{aligned}$$

(22i)

Observing that $\theta ^-$ variables are redundant, we remove them to simplify the model. D-LDR-2S of the capacity expansion model is obtained by substituting

$$\begin{aligned} \lambda _{ti}(\xi ^{t}) = \sum _{k \in [K_t]} {\Phi }_{tk}(\xi ^{t}) {\Lambda }_{tki} \end{aligned}$$

in (22). Dropping $\xi ^t$ dependences for variables to simplify the notation, we obtain

$$\begin{aligned} \max \,\,&\sum _{j \in [J]} d_{1j} \theta ^+_{1j} \, {-} \, \sum _{i \in [I]} \big ( M_{1i}^{u^+} {\Gamma }^{u^+}_{1i} + M_{1i}^{u^-} {\Gamma }^{u^-}_{1i} + M_{1i}^{s} {\Gamma }^{s}_{1i} \big ) + \sum _{t \in [2,T]} \mathbb {E}[G_t({\Lambda },\xi ^{t}) ] \end{aligned}$$

(23a)

$$\begin{aligned} \text {s.t.}\,\,&{\Lambda }_{11i} - \sum _{k \in [K_2]} \mathbb {E}\big [ {\Phi }_{2k}(\xi ^2) \big ] {\Lambda }_{2ki} + \sum _{j \in [J]} \gamma _{1ij} \, {-} \, {\Gamma }^{s}_{1i} \le c_{1i}^{s},&i \in [I], \end{aligned}$$

(23b)

$$\begin{aligned}&{-} \, {\Gamma }^{u^+}_{1i} -{\Lambda }_{11i} \le c_{1i}^{u^+},&i \in [I], \end{aligned}$$

(23c)

$$\begin{aligned}&{-} \, {\Gamma }^{u^-}_{1i} + {\Lambda }_{11i} \le c_{1i}^{u^-},&i \in [I], \end{aligned}$$

(23d)

$$\begin{aligned}&\theta ^+_{1j} \le c_{1j}^{z},&j \in [J], \end{aligned}$$

(23e)

$$\begin{aligned}&\theta ^+_{1j} - \gamma _{1ij} \le c_{1ij}^{x},&i \in [I], \ j \in [J] \end{aligned}$$

(23f)

$$\begin{aligned}&\gamma _{1ij} \ge 0,&i \in [I], \ j \in [J] \end{aligned}$$

(23g)

$$\begin{aligned}&\theta ^+_{1j} \ge 0,&j \in [J], \end{aligned}$$

(23h)

$$\begin{aligned}&{\Gamma }^{u^+}_{1i}, {\Gamma }^{u^-}_{1i}, {\Gamma }^{s}_{1i} \ {\ge } \ 0&i \in [I], \end{aligned}$$

(23i)

where for $t \in [2,T]$, $G_t({\Lambda },\xi ^{t})$ is defined as the optimal objective value of the following problem:

$$\begin{aligned} \max \,\,&\sum _{j \in [J]} d_{tj} \theta ^+_{j} \, {-} \, \sum _{i \in [I]} \big (M_{ti}^{u^+}{\Gamma }^{u^+}_{i}+ M_{ti}^{u^-} {\Gamma }^{u^-}_{i}+M_{ti}^{s} {\Gamma }^{s}_{i} \big ) \end{aligned}$$

(24a)

$$\begin{aligned} \text {s.t.}\,\,&\sum _{j \in [J]} \gamma _{ij} \, {-} \, {\Gamma }^{s}_{i} \le c_{ti}^{s} - \sum _{k \in [K_t]} {\Phi }_{tk}(\xi ^{t}) {\Lambda }_{tki} \nonumber \\&\qquad \quad + \sum _{k \in [K_{t+1}]} \mathbb {E}\Big [ {\Phi }_{t+1,k}(\xi ^{t+1}) \Big | \xi ^{t}\Big ] {\Lambda }_{t+1,k,i},&i \in [I], \end{aligned}$$

(24b)

$$\begin{aligned}&{-} \, {\Gamma }^{u^+}_{i} \le c_{ti}^{u^+} + \sum _{k \in [K_t]} {\Phi }_{tk}(\xi ^{t}) {\Lambda }_{tki},&i \in [I], \end{aligned}$$

(24c)

$$\begin{aligned}&{-} \, {\Gamma }^{u^-}_{i} \le c_{ti}^{u^-} - \sum _{k \in [K_t]} {\Phi }_{tk}(\xi ^{t}) {\Lambda }_{tki},&i \in [I], \end{aligned}$$

(24d)

$$\begin{aligned}&\theta ^+_{j} \le c_{tj}^{z},&j \in [J], \end{aligned}$$

(24e)

$$\begin{aligned}&\theta ^+_{j} - \gamma _{ij} \le c_{tij}^{x},&i \in [I], \ j \in [J], \end{aligned}$$

(24f)

$$\begin{aligned}&\gamma _{ij} \ge 0,&i \in [I], \ j \in [J], \end{aligned}$$

(24g)

$$\begin{aligned}&\theta ^+_{j} \ge 0,&j \in [J], \end{aligned}$$

(24h)

$$\begin{aligned}&{\Gamma }^{u^+}_{i}, {\Gamma }^{u^-}_{i}, {\Gamma }^{s}_{i} \ {\ge } \ 0&i \in [I]. \end{aligned}$$

(24i)

Let $\xi ^{T}_n, n \in [N]$, be an independent and identically distributed (i.i.d.) random sample of the random vector $\xi ^{T}$. We solve the obtained dual SAA problem with the bundle-level method, because the Benders decomposition method converged slowly for this problem. We use cuts aggregated over scenarios, thus introduce $\zeta _{t}$ variable to represent the expected second-stage cost value, i.e., $\frac{1}{N} \sum _{n \in [N]} G_t({\Lambda },\xi ^{t}_n)$, for $t \in [2,T]$.

We observe that the subproblem (24) can be further decomposed into two: one problem including only the $u^+$ and $u^-$ variables, and the other problem including the remaining set of variables.

$$\begin{aligned} (\text {DSP}^{upart}): \ \max&- \sum _{i \in [I]} \big (M_{ti}^{u^+}{\Gamma }^{u^+}_{i}+ M_{ti}^{u^-} {\Gamma }^{u^-}_{i} \big ) \\ \quad \,\, \,\text {s.t.} \,&27c,27d,27i \end{aligned}$$

$$\begin{aligned} (\text {DSP}^{rest}): \ \max&\sum _{j \in [J]} d_{tj} \theta ^+_{j} - \sum _{i \in [I]} M_{ti}^{s} {\Gamma }^{s}_{i} \\ \quad \text {s.t.}\,\,&27b,27e{-}27h \end{aligned}$$

We exploit this decomposition to disaggregate the optimality cuts in the master problem. Thus, we introduce additional variables $\zeta ^{upart}_{t}$ and $\zeta ^{rest}_{t}$ for $t \in [2,T]$ and obtain the following master problem:

$$\begin{aligned} \text {(MP)}: \ \max&\sum _{j \in [J]} d_{1j} \theta ^+_{1j} \, {-} \, \sum _{i \in [I]} \big ( M_{1i}^{u^+} {\Gamma }^{u^+}_{1i} + M_{1i}^{u^-} {\Gamma }^{u^-}_{1i} + M_{1i}^{s} {\Gamma }^{s}_{1i} \big )+ \sum _{t \in [2,T]} \zeta _{t} \end{aligned}$$

(25a)

$$\begin{aligned} \quad \quad \,\,\,\,\text {s.t.}\,\,&26b-26i, \end{aligned}$$

(25b)

$$\begin{aligned}&\zeta _{t} = \zeta ^{upart}_{t} + \zeta ^{rest}_{t},&t \in [2,T], \end{aligned}$$

(25c)

$$\begin{aligned}&(\zeta ^{upart}_{t}, {\Lambda }_{t11}, \ldots , {\Lambda }_{t K_tI} ) \in \mathcal {U}^{t} ,&t \in [2,T], \end{aligned}$$

(25d)

$$\begin{aligned}&(\zeta ^{rest}_{t}, {\Lambda }_{t11}, \ldots , {\Lambda }_{t K_tI} ) \in \mathcal {R}^{t} ,&t \in [2,T], \end{aligned}$$

(25e)

$$\begin{aligned}&\zeta ^{upart}_{t} \le 0,&t \in [2,T], \end{aligned}$$

(25f)

$$\begin{aligned}&\zeta ^{rest}_{t} \le \frac{1}{N} \sum _{n \in [N]} \sum _{j \in [J]} d_{tj}(\xi ^{t}_n) c_{tj}^{z},&t \in [2,T], \end{aligned}$$

(25g)

where $\mathcal {U}^{t}$ and $\mathcal {R}^{t}$ represent the optimality cuts derived from problems ($\text {DSP}^{upart}$) and ($\text {DSP}^{rest}$), respectively. Moreover, we introduce the upper bounds on the new auxiliary variables, which are derived from the subproblems $(\text {DSP}^{upart})$ and $(\text {DSP}^{rest})$.

The level method also uses a quadratic program for regularization which projects the previous iterate on the level set of the current approximation of the objective function. We use the following problem for this projection:

$$\begin{aligned} \text {(QP)}: \ \max&|| {\Lambda }- \hat{{\Lambda }} ||^2_2 \\ \quad \,\,\quad \,\,\,\,\text {s.t.}&28b{-}28g, \\&\sum _{j \in [J]} d_{1j} \theta ^+_{1j} \, {-} \, \sum _{i \in [I]} \big ( M_{1i}^{u^+} {\Gamma }^{u^+}_{1i} + M_{1i}^{u^-} {\Gamma }^{u^-}_{1i} + M_{1i}^{s} {\Gamma }^{s}_{1i} \big ) \ge L, \end{aligned}$$

where $\hat{\Lambda }$ and L denote the current ${\Lambda }$ solution (i.e., the previous iterate) and the level target, respectively. The optimal solution values of ${\Lambda }$ variables determine the next iterate.

The details of the level method are provided in Algorithm 1 where LB and UB denote lower bound and upper bound, respectively.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bodur, M., Luedtke, J.R. Two-stage linear decision rules for multi-stage stochastic programming. Math. Program. 191, 347–380 (2022). https://doi.org/10.1007/s10107-018-1339-4

Download citation

Received: 15 January 2017
Accepted: 28 September 2018
Published: 15 October 2018
Issue Date: January 2022
DOI: https://doi.org/10.1007/s10107-018-1339-4

Keywords

Mathematics Subject Classification

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Two-stage linear decision rules for multi-stage stochastic programming

Abstract

Access this article

Similar content being viewed by others

An exhaustive review of the metaheuristic algorithms for search and optimization: taxonomy, applications, and open challenges

A practical guide to multi-objective reinforcement learning and planning

Survey on Lagrangian relaxation for MILP: importance, challenges, historical review, recent advancements, and opportunities

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Appendices

Data for the capacity expansion problem

Additional models for the capacity expansion example

1.1 Primal model and Benders decomposition

1.2 Dual model and level method

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification

Navigation

Two-stage linear decision rules for multi-stage stochastic programming

Abstract

Access this article

Similar content being viewed by others

An exhaustive review of the metaheuristic algorithms for search and optimization: taxonomy, applications, and open challenges

A practical guide to multi-objective reinforcement learning and planning

Survey on Lagrangian relaxation for MILP: importance, challenges, historical review, recent advancements, and opportunities

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Appendices

Data for the capacity expansion problem

Additional models for the capacity expansion example

1.1 Primal model and Benders decomposition

1.2 Dual model and level method

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation