A new strong optimality criterion for nonstationary Markov decision processes

Guo, Xianping; Shi, Peng; Zhu, Weiping

doi:10.1007/s001860000076

A new strong optimality criterion for nonstationary Markov decision processes

Published: November 2000

Volume 52, pages 287–306, (2000)
Cite this article

Mathematical Methods of Operations Research Aims and scope Submit manuscript

Xianping Guo¹,
Peng Shi² &
Weiping Zhu³

63 Accesses
2 Citations
Explore all metrics

Abstract.

This paper deals with a new optimality criterion consisting of the usual three average criteria and the canonical triplet (totally so-called strong average-canonical optimality criterion) and introduces the concept of a strong average-canonical policy for nonstationary Markov decision processes, which is an extension of the canonical policies of Herna´ndez-Lerma and Lasserre [16] (pages: 77) for the stationary Markov controlled processes. For the case of possibly non-uniformly bounded rewards and denumerable state space, we first construct, under some conditions, a solution to the optimality equations (OEs), and then prove that the Markov policies obtained from the OEs are not only optimal for the three average criteria but also optimal for all finite horizon criteria with a sequence of additional functions as their terminal rewards (i.e. strong average-canonical optimal). Also, some properties of optimal policies and optimal average value convergence are discussed. Moreover, the error bound in average reward between a rolling horizon policy and a strong average-canonical optimal policy is provided, and then a rolling horizon algorithm for computing strong average ε(>0)-optimal Markov policies is given.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Author information

Authors and Affiliations

Department of Mathematics, Zhongshan University, Guangzhou 510275, P. R. China (e-mail: mcsgxp@zsu.edu.cn), , , , , , CN
Xianping Guo
Land Operations Division, Defence Science and Technology Organisation, PO Box 1500, Salisbury 5108 SA, Australia (e-mail: peng.shi@dsto.defence.gov.au), , , , , , AU
Peng Shi
Department of Computer Science and Electrical Engineering, The University of Queensland, St. Lucia 4072, QLD, Australia, , , , , , AU
Weiping Zhu

Authors

Xianping Guo
View author publications
You can also search for this author in PubMed Google Scholar
Peng Shi
View author publications
You can also search for this author in PubMed Google Scholar
Weiping Zhu
View author publications
You can also search for this author in PubMed Google Scholar

Additional information

Manuscript received: December 1999/Final version received: May 2000

Rights and permissions

Reprints and permissions

About this article

Cite this article

Guo, X., Shi, P. & Zhu, W. A new strong optimality criterion for nonstationary Markov decision processes. Mathematical Methods of OR 52, 287–306 (2000). https://doi.org/10.1007/s001860000076

Download citation

Issue Date: November 2000
DOI: https://doi.org/10.1007/s001860000076

Key words: Nonstationary Markov decision processes, optimality equations, strong average-canonical optimal policies

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A new strong optimality criterion for nonstationary Markov decision processes

Abstract.

Access this article

Similar content being viewed by others

Mean-Variance Problems for Finite Horizon Semi-Markov Decision Processes

Strong n-discount and finite-horizon optimality for continuous-time Markov decision processes

Constrained Continuous-Time Markov Decision Processes on the Finite Horizon

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Navigation

A new strong optimality criterion for nonstationary Markov decision processes

Abstract.

Access this article

Similar content being viewed by others

Mean-Variance Problems for Finite Horizon Semi-Markov Decision Processes

Strong n-discount and finite-horizon optimality for continuous-time Markov decision processes

Constrained Continuous-Time Markov Decision Processes on the Finite Horizon

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation