Blackwell optimality in the class of all policies in Markov decision chains with a Borel state space and unbounded rewards

Mathematical Methods of Operations Research

Abstract.

This paper is the second part of our study of Blackwell optimal policies in Markov decision chains with a Borel state space and unbounded rewards. We prove that a stationary policy is Blackwell optimal in the class of all history-dependent policies if it is Blackwell optimal in the class of stationary policies.

We also develop recurrence and drift conditions which ensure ergodicity and integrability assumptions made in the previous paper, and which are more suitable for applications. As an example we study a cash-balance model.

Additional information

Manuscript received: October 1998

About this article

Cite this article

Hordijk, A., Yushkevich, A. Blackwell optimality in the class of all policies in Markov decision chains with a Borel state space and unbounded rewards. Mathematical Methods of OR 50, 421–448 (1999). https://doi.org/10.1007/s001860050079

  • DOI: https://doi.org/10.1007/s001860050079
