Local Poisson Equations Associated with Discrete-Time Markov Control Processes

Hernández Hernández, Daniel; Hernández Bustos, Diego

doi:10.1007/s10957-017-1076-5

Published: 13 February 2017

Volume 173, pages 1–29, (2017)

Journal of Optimization Theory and Applications

Abstract

This paper characterizes the optimal average cost function under the long-run risk-sensitive average cost criterion. The Markov control model has a denumerable state space and a finite set of actions. The characterization is given in terms of a system of local Poisson equations and yields, as a by-product, the existence of an optimal stationary policy.
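
As a rough illustration of the kind of object the abstract refers to, the sketch below treats a much simpler setting than the paper's: a finite, uncontrolled Markov chain with strictly positive transition matrix, where the risk-sensitive average cost g and a function h satisfy the standard multiplicative Poisson equation exp(lam*(g + h(x))) = exp(lam*c(x)) * sum_y P(x,y) * exp(lam*h(y)). In that case g equals log(rho)/lam, with rho the Perron eigenvalue of Q(x,y) = exp(lam*c(x)) * P(x,y). The function names, the example chain, and the use of power iteration are assumptions made for this sketch; it is not the system of local Poisson equations developed in the article.

```python
import math


def risk_sensitive_average_cost(P, c, lam, iters=10_000, tol=1e-13):
    """Hypothetical helper: power iteration on Q[x][y] = exp(lam*c[x]) * P[x][y].

    Assumes a finite chain whose transition matrix P has strictly positive
    entries, so that the iteration converges. Returns (g, h) with
    g = log(rho) / lam and h = log(v) / lam, where rho is the Perron
    eigenvalue of Q and v its positive eigenvector normalised to v[0] = 1.
    """
    n = len(c)
    Q = [[math.exp(lam * c[x]) * P[x][y] for y in range(n)] for x in range(n)]
    v = [1.0] * n
    for _ in range(iters):
        w = [sum(Q[x][y] * v[y] for y in range(n)) for x in range(n)]
        w = [wi / w[0] for wi in w]  # keep the normalisation v[0] = 1
        converged = max(abs(a - b) for a, b in zip(w, v)) < tol
        v = w
        if converged:
            break
    rho = sum(Q[0][y] * v[y] for y in range(n))  # (Q v)[0], since v[0] = 1
    g = math.log(rho) / lam
    h = [math.log(vi) / lam for vi in v]
    return g, h


def poisson_residual(P, c, lam, g, h):
    """Largest absolute violation of the multiplicative Poisson equation."""
    n = len(c)
    return max(
        abs(
            math.exp(lam * (g + h[x]))
            - math.exp(lam * c[x])
            * sum(P[x][y] * math.exp(lam * h[y]) for y in range(n))
        )
        for x in range(n)
    )


if __name__ == "__main__":
    # Illustrative two-state chain; the numbers are made up for this sketch.
    P = [[0.9, 0.1], [0.5, 0.5]]
    c = [1.0, 3.0]
    g, h = risk_sensitive_average_cost(P, c, lam=0.5)
    print("risk-sensitive average cost:", g)
    print("Poisson equation residual:", poisson_residual(P, c, 0.5, g, h))
```

In this simplified setting a single pair (g, h) solves the equation on the whole state space. The article's setting is a controlled model with a denumerable state space, which this sketch does not attempt to cover.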

Acknowledgements

DHH was partially supported by CONACYT Grant 254166.

Author information

Authors and Affiliations

Centro de Investigación en Matemáticas, Guanajuato, Mexico
Daniel Hernández Hernández & Diego Hernández Bustos

Corresponding author

Correspondence to Daniel Hernández Hernández.

Additional information

Communicated by Alan Bensoussan.

About this article

Cite this article

Hernández Hernández, D., Hernández Bustos, D. Local Poisson Equations Associated with Discrete-Time Markov Control Processes. J Optim Theory Appl 173, 1–29 (2017). https://doi.org/10.1007/s10957-017-1076-5

Received: 18 August 2016
Accepted: 27 January 2017
Published: 13 February 2017
Issue Date: April 2017
DOI: https://doi.org/10.1007/s10957-017-1076-5
