
Continuous-time Markov decision processes with risk-sensitive finite-horizon cost criterion

  • Original Article
  • Published in Mathematical Methods of Operations Research

Abstract

This paper studies continuous-time Markov decision processes with a denumerable state space, a Borel action space, bounded cost rates, and possibly unbounded transition rates under the risk-sensitive finite-horizon cost criterion. We give suitable optimality conditions and establish a Feynman–Kac formula, from which we obtain the existence and uniqueness of the solution to the optimality equation and the existence of an optimal deterministic Markov policy. Moreover, employing a finite-approximation technique together with the optimality equation, we present an iteration method for computing the optimal value and an optimal policy approximately, and we give the corresponding error estimates. Finally, a controlled birth-and-death system is used to illustrate the main results.
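To make the criterion and the finite-approximation idea concrete, the following is a minimal numerical sketch. It assumes the value function has the standard multiplicative form u(t, i) = inf over policies of E[ exp( theta * integral from t to T of c(x_s, a_s) ds ) | x_t = i ], and it steps the associated optimality equation backward in time on a truncated controlled birth-and-death chain. The birth-death parameters, cost rate, action set, and the explicit time-discretization below are illustrative assumptions and are not the construction used in the paper.

    import numpy as np

    # Hypothetical controlled birth-and-death chain on the truncated state
    # space {0, 1, ..., N}; actions scale the death (service) rate.
    # All parameters are illustrative, not taken from the paper.
    N = 30                      # truncation level of the finite approximation
    actions = [0.5, 1.0, 2.0]   # admissible service intensities
    lam, mu = 1.0, 1.2          # base birth and death rates
    theta = 0.1                 # risk-sensitivity parameter (theta > 0)
    T, dt = 1.0, 1e-3           # horizon and time step of the explicit scheme

    def cost(i, a):
        # bounded cost rate: holding cost plus control effort (illustrative)
        return min(i, N) / N + 0.2 * a

    def rates(i, a):
        # transition rates of the truncated birth-and-death chain
        up = lam if i < N else 0.0
        down = mu * a if i > 0 else 0.0
        return up, down

    # u[i] approximates inf_pi E[exp(theta * int_t^T c ds) | x_t = i],
    # with terminal condition u(T, i) = 1.
    u = np.ones(N + 1)
    policy = np.zeros(N + 1, dtype=int)

    for _ in range(int(T / dt)):
        new_u = np.empty_like(u)
        for i in range(N + 1):
            best, best_a = np.inf, 0
            for k, a in enumerate(actions):
                up, down = rates(i, a)
                # generator applied to u at state i plus the multiplicative
                # cost term of the risk-sensitive optimality equation
                drift = (theta * cost(i, a) * u[i]
                         + up * (u[min(i + 1, N)] - u[i])
                         + down * (u[max(i - 1, 0)] - u[i]))
                if drift < best:
                    best, best_a = drift, k
            new_u[i] = u[i] + dt * best
            policy[i] = best_a   # greedy action; after the loop, at time ~0
        u = new_u

    # (1/theta) * log u(0, i) approximates the risk-sensitive optimal cost
    print((np.log(u) / theta)[:5])

In this sketch, (1/theta) log u(0, i) approximates the risk-sensitive optimal cost over [0, T] from state i, and the recorded greedy actions give a candidate deterministic Markov policy; shrinking dt and enlarging the truncation level N plays the role of the paper's finite approximation, whose error estimates quantify the gap to the original unbounded-rate model.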

Acknowledgments

I am greatly indebted to the associate editor and the anonymous referees for many valuable comments and suggestions that have greatly improved the presentation. The research was supported by National Natural Science Foundation of China (Grant No. 11526092).

Author information

Correspondence to Qingda Wei.

About this article

Cite this article

Wei, Q. Continuous-time Markov decision processes with risk-sensitive finite-horizon cost criterion. Math Meth Oper Res 84, 461–487 (2016). https://doi.org/10.1007/s00186-016-0550-4
