Recent results on conditions for the existence of average optimal stationary policies

Cavazos-Cadena, Rolando

doi:10.1007/BF02055572

Published: December 1991

Volume 28, pages 3–27, (1991)

Annals of Operations Research


Abstract

This paper concerns countable state space Markov decision processes endowed with a (long-run expected) average reward criterion. For these models we summarize and, in some cases, extend some recent results on sufficient conditions to establish the existence of optimal stationary policies. The topics considered are the following: (i) the new assumptions introduced by Sennott in [20–23], (ii) necessary and sufficient conditions for the existence of a bounded solution to the optimality equation, and (iii) equivalence of average optimality criteria. Some problems are posed.
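
For orientation, the optimality equation mentioned in topic (ii) is commonly written in the form sketched below. The notation is an assumption made here for illustration and is not taken from the paper: S is the countable state space, A(x) the set of actions available at state x, r the one-step reward, p the transition law, g a constant and h a real-valued function on S.

```latex
g + h(x) \;=\; \sup_{a \in A(x)} \Big[\, r(x,a) + \sum_{y \in S} p(y \mid x, a)\, h(y) \Big],
\qquad x \in S .
```

In the standard theory, when a pair (g, h) with h bounded solves this equation and the supremum is attained at every state, a stationary policy that selects a maximizing action at each x is average optimal and g is the optimal average reward. The precise hypotheses used in the paper may differ from this sketch.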

References

R.B. Ash, Real Analysis and Probability (Academic Press, New York, 1972).
J.S. Baras, A.J. Dorsey and A.M. Makowski, Two competing queues with linear costs and geometric service requirements: The μc-rule is often optimal, Adv. Appl. Prob. 17 (1985) 186–209.
V.S. Borkar, Controlled Markov chains and stochastic networks, SIAM J. Control Optim. 21 (1983) 652–666.
V.S. Borkar, On minimum cost per unit of time control of Markov chains, SIAM J. Control Optim. 22 (1984) 965–978.
V.S. Borkar, Control of Markov chains with long-run average cost criterion: The dynamic programming equations, SIAM J. Control Optim. 27 (1989) 965–978.
R. Cavazos-Cadena, Necessary and sufficient conditions for a bounded solution to the optimality equation in average reward Markov decision chains, Syst. Control Lett. 10 (1988) 71–78.
R. Cavazos-Cadena, Necessary conditions for the optimality equation in average-reward Markov decision processes, J. Appl. Math. Optim. 19 (1989) 97–112.
R. Cavazos-Cadena, Weak conditions for the existence of optimal stationary policies in average Markov decision chains with unbounded costs, Kybernetika (Prague) 25 (1989) 145–156.
R. Cavazos-Cadena, Solution to the optimality equation in a class of Markov decision chains with the average cost criterion, Kybernetika (Prague) 27 (1991) 23–37.
R. Cavazos-Cadena and L.I. Sennott, Comparing recent assumptions for the existence of optimal stationary policies, submitted.
J. Dugundji, Topology (Allyn and Bacon, New York, 1960).
A. Federgruen, A. Hordijk and H.C. Tijms, A note on simultaneous recurrence conditions on a set of denumerable matrices, J. Appl. Prob. 15 (1978) 842–847.
A. Federgruen, P.J. Schweitzer and H.C. Tijms, Denumerable undiscounted semi-Markov decision processes with unbounded rewards, Math. Oper. Res. 8 (1983) 298–313.
O. Hernández-Lerma, Adaptive Markov Control Processes (Springer, New York, 1989).
D. Heyman and M. Sobel, Stochastic Models in Operations Research, vol. 2 (McGraw-Hill, New York, 1984).
K. Hinderer, Foundations of Non-Stationary Dynamic Programming with Discrete Time Parameter, Lecture Notes in Operations Research 33 (Springer, New York, 1970).
A. Hordijk, Dynamic Programming and Potential Theory, Mathematical Centre Tracts 51, Amsterdam, The Netherlands (1974).
M. Loève, Probability Theory, vols. I and II (Springer, New York, 1977).
P. Nain and K.W. Ross, Optimal priority assignment with hard constraints, IEEE Trans. Auto. Control AC-31 (1988) 883–888.
S.M. Ross, Applied Probability Models with Optimization Applications (Holden-Day, San Francisco, 1970).
L.I. Sennott, A new condition for the existence of optimum stationary policies in average cost Markov decision processes, Oper. Res. Lett. (1986) 17–23.
L.I. Sennott, A new condition for the existence of optimum stationary policies in average cost Markov decision processes-unbounded costs case, Proc. 25th IEEE Conf. on Decision and Control (1986) pp. 1719–1721.
L.I. Sennott, Average cost optimal stationary policies in infinite state Markov decision processes with unbounded costs, Oper. Res. 37 (1989) 626–633.
L.I. Sennott, Average cost semi-Markov decision processes and the control of queueing systems, Prob. Eng. Inf. Sci. 3 (1988) 247–272.
L.C. Thomas, Connectedness conditions for denumerable state Markov decision processes, in: Recent Developments in Markov Decision Processes, eds. R. Hartley, L.C. Thomas and D.J. White (Academic Press, New York, 1980) pp. 181–204.
R.R. Weber and S. Stidham Jr., Optimal control of service rates in network of queues, Adv. Appl. Prob. 19 (1987) 202–218.

Author information

Authors and Affiliations

Departamento de Estadística y Cálculo, Universidad Autónoma Agraria Antonio Narro, Buenavista, 25315, Saltillo COAH, Mexico
Rolando Cavazos-Cadena

Additional information

This research was partially supported by the Third World Academy of Sciences (TWAS) under Grant No. TWAS RG MP 898-152.

About this article

Cite this article

Cavazos-Cadena, R. Recent results on conditions for the existence of average optimal stationary policies. Ann Oper Res 28, 3–27 (1991). https://doi.org/10.1007/BF02055572

Issue Date: December 1991
DOI: https://doi.org/10.1007/BF02055572
