Skip to main content
Log in

Optimal Stopping with a Probabilistic Constraint

  • Published:
Journal of Optimization Theory and Applications Aims and scope Submit manuscript

Abstract

We present an efficient method for solving optimal stopping problems with a probabilistic constraint. The goal is to optimize the expected cumulative cost, but constrained by an upper bound on the probability that the cost exceeds a specified threshold. This probabilistic constraint causes optimal policies to be time-dependent and randomized, however, we show that an optimal policy can always be selected with “piecewise-monotonic” time-dependence and “nearly-deterministic” randomization. We prove these properties using the Bellman optimality equations for a Lagrangian relaxation of the original problem. We present an algorithm that exploits these properties for computational efficiency. Its performance and the structure of optimal policies are illustrated on two numerical examples.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

References

  1. Fleming, W.H., Soner, H.M.: Controlled Markov Processes and Viscosity Solutions. Applications of Mathematics. Springer, New York (1993)

    MATH  Google Scholar 

  2. Fan, Y., Nie, Y.: Optimal routing for maximizing the travel time reliability. Netw. Spat. Econ. 6(3), 333–344 (2006)

    Article  MathSciNet  MATH  Google Scholar 

  3. Browne, S.: Optimal investment policies for a firm with a random risk process: exponential utility and minimizing the probability of ruin. Math. Oper. Res. 20(4), 937–958 (1995). https://doi.org/10.1287/moor.20.4.937

    Article  MathSciNet  MATH  Google Scholar 

  4. Rockafellar, R.T., Uryasev, S.: Optimization of conditional value-at-risk. J. Risk 2, 21–41 (2000)

    Article  Google Scholar 

  5. Ermon, S., Gomes, C., Selman, B., Vladimirsky, A.: Probabilistic planning with non-linear utility functions and worst-case guarantees. In: Proceedings of the 11th International AAMAS Conference, Vol. 2, pp. 965–972 (2012)

  6. Bertsekas, D.P.: Nonlinear Programming. Athena Scientific, Belmont (1999)

    MATH  Google Scholar 

  7. White, D.: Dynamic programming and probabilistic constraints. Oper. Res. 22(3), 654–664 (1974)

    Article  MathSciNet  MATH  Google Scholar 

  8. Pfeiffer, L.: Two approaches to stochastic optimal control problems with a final-time expectation constraint. Appl. Math. Optim. (2016)

  9. Bertsekas, D.P.: Dynamic Programming and Optimal Control, vol. II, 3rd edn. Athena Scientific, Belmont (2007)

    MATH  Google Scholar 

  10. Bertsekas, D.P., Tsitsiklis, J.N.: An analysis of stochastic shortest path problems. Math. Oper. Res. 16, 580–595 (1991)

    Article  MathSciNet  MATH  Google Scholar 

  11. Crandall, M.G.: Viscosity solutions: a primer. In: Viscosity Solutions and applications: lectures given at the 2nd session of the centro internazionale matematico estivo (C.I.M.E.) held in montecatiniterme, Italy, June 12–20, 1995, pp. 1–43. Springer, Berlin (1997)

  12. Krylov, N.V., Balakrishnan, A.V.: Controlled diffusion processes/N. V. Krylov; translated by A. B. Aries. In: Balakrishnan A. V. (ed.) Springer, New York (1980)

  13. Palmer, A.Z.: a C++ implementation of algorithms for PCOS problem (2017). https://github.com/AaronZPalmer/PCOS

  14. Baxter, J., Chacon, R.: Compactness of stopping times. Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete 40(3), 169–181 (1977)

    Article  MathSciNet  MATH  Google Scholar 

  15. Karoui, N.E., Peng, S., Quenez, M.C.: A dynamic maximum principle for the optimization of recursive utilities under constraints. Ann. Appl. Probab. 11, 664–693 (2001)

    Article  MathSciNet  MATH  Google Scholar 

Download references

Acknowledgements

Aaron Zeff Palmer was supported in part as NSF GRFP Fellow 2011122749. Alexander Vladimirsky was supported in part by the NSF Grants DMS-1016150 and DMS-1738010. We would like to thank the associate editor and the reviewers for their carefully reading and suggestions that helped us greatly improve this paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Aaron Zeff Palmer.

Additional information

Communicated by Kyriakos G. Vamvoudakis.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Palmer, A.Z., Vladimirsky, A. Optimal Stopping with a Probabilistic Constraint. J Optim Theory Appl 175, 795–817 (2017). https://doi.org/10.1007/s10957-017-1183-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10957-017-1183-3

Keywords

Mathematics Subject Classification

Navigation