Average Optimality for Unbounded Rewards

Guo, Xianping; Hernández-Lerma, Onésimo

doi:10.1007/978-3-642-02547-1_7

Xianping Guo³ &
Onésimo Hernández-Lerma⁴

Part of the book series: Stochastic Modelling and Applied Probability ((SMAP,volume 62))

3452 Accesses

Abstract

In Chap. 7, we study the EAR criterion for the same MDP model as in Chap. 6. After briefly introducing some basic facts in Sect. 7.2, we establish the average reward optimality equation and the existence of EAR optimal policies in Sect. 7.3. In Sect. 7.4, we provide a policy iteration algorithm for computing or at least approximating an EAR optimal policy. Finally, we illustrate the results in this chapter with several examples in Sect. 7.5.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Hardcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Author information

Authors and Affiliations

School of Mathematics and Computational Science, Zhongshan University, Guangzhou, 510275, People’s Republic of China
Xianping Guo
Departamento de Matemáticas, Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (CINVESTAV-IPN), Apdo Postal 14-740, México, D.F., 07000, Mexico
Onésimo Hernández-Lerma

Authors

Xianping Guo
View author publications
You can also search for this author in PubMed Google Scholar
Onésimo Hernández-Lerma
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Xianping Guo or Onésimo Hernández-Lerma .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Guo, X., Hernández-Lerma, O. (2009). Average Optimality for Unbounded Rewards. In: Continuous-Time Markov Decision Processes. Stochastic Modelling and Applied Probability, vol 62. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02547-1_7

Download citation

DOI: https://doi.org/10.1007/978-3-642-02547-1_7
Published: 18 September 2009
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02546-4
Online ISBN: 978-3-642-02547-1
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics