Bounds for Multistage Stochastic Programs Using Supervised Learning Strategies

Defourny, Boris; Ernst, Damien; Wehenkel, Louis

doi:10.1007/978-3-642-04944-6_6

Boris Defourny¹⁸,
Damien Ernst¹⁸ &
Louis Wehenkel¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 5792))

Included in the following conference series:

International Symposium on Stochastic Algorithms

3102 Accesses
2 Citations

Abstract

We propose a generic method for obtaining quickly good upper bounds on the minimal value of a multistage stochastic program. The method is based on the simulation of a feasible decision policy, synthesized by a strategy relying on any scenario tree approximation from stochastic programming and on supervised learning techniques from machine learning.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Frauendorfer, K.: Barycentric scenario trees in convex multistage stochastic programming. Mathematical Programming 75, 277–294 (1996)
MathSciNet MATH Google Scholar
Dempster, M.: Sequential importance sampling algorithms for dynamic stochastic programming. Annals of Operations Research 84, 153–184 (1998)
Google Scholar
Dupacova, J., Consigli, G., Wallace, S.: Scenarios for multistage stochastic programs. Annals of Operations Research 100, 25–53 (2000)
Article MathSciNet MATH Google Scholar
Høyland, K., Wallace, S.: Generating scenario trees for multistage decision problems. Management Science 47(2), 295–307 (2001)
Article MATH Google Scholar
Shapiro, A.: Monte Carlo sampling methods. In: Ruszczyński, A., Shapiro, A. (eds.) Stochastic Programming. Handbooks in Operations Research and Management Science, vol. 10, pp. 353–425. Elsevier, Amsterdam (2003)
Chapter Google Scholar
Casey, M., Sen, S.: The scenario generation algorithm for multistage stochastic linear programming. Mathematics of Operations Research 30, 615–631 (2005)
Article MathSciNet MATH Google Scholar
Hochreiter, R., Pflug, G.: Financial scenario generation for stochastic multi-stage decision processes as facility location problems. Annals of Operations Research 152, 257–272 (2007)
Article MathSciNet MATH Google Scholar
Pennanen, T.: Epi-convergent discretizations of multistage stochastic programs via integration quadratures. Mathematical Programming 116, 461–479 (2009)
Article MathSciNet MATH Google Scholar
Heitsch, H., Römisch, W.: Scenario tree modeling for multistage stochastic programs. Mathematical Programming 118(2), 371–406 (2009)
Article MathSciNet MATH Google Scholar
Shapiro, A.: On complexity of multistage stochastic programs. Operations Research Letters 34(1), 1–8 (2006)
Article MathSciNet MATH Google Scholar
Shapiro, A.: Inference of statistical bounds for multistage stochastic programming problems. Mathematical Methods of Operations Research 58(1), 57–68 (2003)
Article MathSciNet MATH Google Scholar
Golub, B., Holmer, M., McKendall, R., Pohlman, L., Zenios, S.: A stochastic programming model for money management. European Journal of Operational Research 85, 282–296 (1995)
Article MATH Google Scholar
Kouwenberg, R.: Scenario generation and stochastic programming models for asset liability management. European Journal of Operational Research 134, 279–292 (2001)
Article MathSciNet MATH Google Scholar
Hilli, P., Pennanen, T.: Numerical study of discretizations of multistage stochastic programs. Kybernetika 44, 185–204 (2008)
MathSciNet MATH Google Scholar
Billingsley, P.: Probability and Measure, 3rd edn. Wiley, Chichester (1995)
MATH Google Scholar
Vapnik, V.: Statistical Learning Theory. Wiley, Chichester (1998)
MATH Google Scholar
Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd edn. Springer, Heidelberg (2009)
Book MATH Google Scholar
Wahba, G., Golub, G., Heath, M.: Generalized cross-validation as a method for choosing a good ridge parameter. Technometrics 21, 215–223 (1979)
Article MathSciNet MATH Google Scholar
Efron, B., Tibshirani, R.: An introduction to the bootstrap. Chapman and Hall, London (1993)
Book MATH Google Scholar
Thénié, J., Vial, J.P.: Step decision rules for multistage stochastic programming: A heuristic approach. Automatica 44, 1569–1584 (2008)
Article MathSciNet MATH Google Scholar
Küchler, C., Vigerske, S.: Numerical evaluation of approximation methods in stochastic programming (2008) (submitted)
Google Scholar
Cover, T.: Estimation by the nearest neighbor rule. IEEE Transactions on Information Theory 14, 50–55 (1968)
Article MATH Google Scholar
Akaike, H.: Information theory and an extension of the maximum likelihood principle. In: Proceedings of the Second International Symposium on Information Theory, pp. 267–281 (1973)
Google Scholar
Schwartz, G.: Estimating the dimension of a model. Annals of Statistics 6, 461–464 (1978)
Article MathSciNet Google Scholar
Rissanen, J.: Stochastic complexity and modeling. Annals of Statistics 14, 1080–1100 (1986)
Article MathSciNet MATH Google Scholar
James, G., Radchenko, P., Lv, J.: DASSO: connections between the Dantzig selector and Lasso. Journal of the Royal Statistical Society: Series B 71, 127–142 (2009)
Article MathSciNet MATH Google Scholar
Chapelle, O., Vapnik, V., Bengio, Y.: Model selection for small sample regression. Machine Learning 48, 315–333 (2002)
Article MATH Google Scholar
Huber, P.: Projection pursuit. Annals of Statistics 13, 435–475 (1985)
Article MathSciNet MATH Google Scholar
Buja, A., Hastie, T., Tibshirani, R.: Linear smoothers and additive models. Annals of Statistics 17, 453–510 (1989)
Article MathSciNet MATH Google Scholar
Friedman, J.: Multivariate adaptive regression splines (with discussion). Annals of Statistics 19, 1–141 (1991)
Article MathSciNet MATH Google Scholar
Girosi, F., Jones, M., Poggio, T.: Regularization theory and neural networks architectures. Neural Computation 7, 219–269 (1995)
Article Google Scholar
Williams, C., Rasmussen, C.: Gaussian processes for regression. In: Advances in Neural Information Processing Systems 8 (NIPS 1995), pp. 514–520 (1996)
Google Scholar
Smola, A., Schölkopf, B., Müller, K.R.: The connection between regularization operators and support vector kernels. Neural Networks 11, 637–649 (1998)
Article Google Scholar
Puterman, M.: Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley, Chichester (1994)
Book MATH Google Scholar
Bertsekas, D., Tsitsiklis, J.: Neuro-Dynamic Programming. Athena Scientific, Belmont (1996)
MATH Google Scholar
Sutton, R., Barto, A.: Reinforcement Learning, an introduction. MIT Press, Cambridge (1998)
Google Scholar
Bagnell, D., Kakade, S., Ng, A., Schneider, J.: Policy search by dynamic programming. In: Advances in Neural Information Processing Systems 16 (NIPS 2003), pp. 831–838 (2004)
Google Scholar
Lagoudakis, M., Parr, R.: Reinforcement learning as classification: leveraging modern classifiers. In: Proceedings of the Twentieth International Conference on Machine Learning (ICML 2003), pp. 424–431 (2003)
Google Scholar
Ernst, D., Geurts, P., Wehenkel, L.: Tree-based batch mode reinforcement learning. Journal of Machine Learning Research 6, 503–556 (2005)
MathSciNet MATH Google Scholar
Langford, J., Zadrozny, B.: Relating reinforcement learning performance to classification performance. In: Proceedings of the Twenty-Second International Conference on Machine Learning (ICML 2005), pp. 473–480 (2005)
Google Scholar
Fern, A., Yoon, S., Givan, R.: Approximate policy iteration with a policy language bias: solving relational Markov Decision Processes. Journal of Artificial Intelligence Research 25, 85–118 (2006)
MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

University of Liège, Systems and Modeling, B28, B-4000, Liège, Belgium
Boris Defourny, Damien Ernst & Louis Wehenkel

Authors

Boris Defourny
View author publications
You can also search for this author in PubMed Google Scholar
Damien Ernst
View author publications
You can also search for this author in PubMed Google Scholar
Louis Wehenkel
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Mathematical and Computing Sciences, Tokyo Institute of Technology, W8-25, 152-8552, Tokyo, Japan
Osamu Watanabe
Division of Computer Science, N-14, W-9, Hokkaido University, 060-0814, Sapporo, Japan
Thomas Zeugmann

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Defourny, B., Ernst, D., Wehenkel, L. (2009). Bounds for Multistage Stochastic Programs Using Supervised Learning Strategies. In: Watanabe, O., Zeugmann, T. (eds) Stochastic Algorithms: Foundations and Applications. SAGA 2009. Lecture Notes in Computer Science, vol 5792. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04944-6_6

Download citation

DOI: https://doi.org/10.1007/978-3-642-04944-6_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04943-9
Online ISBN: 978-3-642-04944-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics