Singularly perturbed Markov control problem: Limiting average cost

Bielecki, Tomasz R.; Filar, Jerzy A.

doi:10.1007/BF02055579

Singularly perturbed Markov control problem: Limiting average cost

Research Contributions
Published: December 1991

Volume 28, pages 153–168, (1991)
Cite this article

Annals of Operations Research Aims and scope Submit manuscript

Tomasz R. Bielecki¹ &
Jerzy A. Filar²

89 Accesses
32 Citations
Explore all metrics

Abstract

In this paper we consider a singularly perturbed Markov decision process with the limiting average cost criterion. We assume that the underlying process is composed ofn separate irreducible processes, and that the small perturbation is such that it “unites” these processes into a single irreducible process. We formulate the underlying control problem for the singularly perturbed MDP, and call it the “limit Markov control problem” (limit MCP). We prove the validity of the “the limit control principle” which states that an optimal solution to the perturbed MDP can be approximated by an optimal solution of the limit MCP for any sufficiently small perturbation. We also demonstrate that the limit Markov control problem is equivalent to a suitably constructed nonlinear program in the space of long-run state-action frequencies. This approach combines the solutions of the original separated irreducible MDPs with the stationary distribution of a certain “aggregated MDP” and creates a framework for future algorithmic approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

R. Aldhaheri and h. Khalil, Aggregation and optimal control of nearly completely decomposable Markov chains, in:Proc. 28th CDC, (IEEE, 1989).
D. Blackwell, Discrete dynamic programming, Ann. Math. Stat. 33 (1962) 719–726.
Google Scholar
M. Cordech, A. Willsky, S. Sastry and D. Castanon, Hierarchical aggregation of linear systems with multiple time scales, IEEE Trans. Automatic Control AC-28 (1983) 1017–1029.
Google Scholar
F. Delebecque, A reduction process for perturbed Markov chains, SIAM J. Appl. Math. 48 (1983) 325–350.
Google Scholar
F. Delebecque and J. Quadrat, Optimal control of Markov chains admitting strong and weak interactions, Automatica 17 (1981) 281–296.
Google Scholar
N.V. Dijk, Perturbation theory for unbounded Markov reward processes with applications to queueing, Adv. Appl. Prob. 20 (1988) 99–111.
Google Scholar
N.V. Dijk and M. Puterman, Perturbation theory for Markov reward processes with applications to queueing systems, Adv. Appl. Prob. 20 (1988) 79–98.
Google Scholar
R.A. Howard,Dynamic Programming and Markov Processes (Wiley, New York, 1960).
Google Scholar
L.C.M. Kallenberg,Linear Programming and Finite Markovian Control Problems, Mathematical Center Tracts 148, Amsterdam (1983).
T. Kato,Perturbation Theory for Linear Operators (Springer, Berlin, 1980).
Google Scholar
P. Kokotovič, Application of singular perturbation techniques to control problems, SIAM Rev. 26 (1984) 501–550.
Google Scholar
R.G. Phillips and P. Kokotovič, A singular perurbation approach to modelling and control of Markov chains, IEEE Trans. Automatic Control AC-26 (1981) 1087–1094.
Google Scholar
J. Rohlicek and A. Willsky, Multiple time scale decomposition of discrete time Markov chains, Syst. Control Lett. 11 (1988) 309–314.
Google Scholar
P. Schweitzer, Perturbation series for nearly completely-decomposable Markov chains, in:Teletraffic Analysis and Computer Performance Evaluation, O. Boxma, J. Cohen and H. Tijms (eds.) (Elsevier Science, 1986).
P.J. Schweitzer, Perturbation theory and finite Markov chains, J. Appl. Prob. 5 (1968) 401–413.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Mathematics, University of Kansas, 66044, Lawrence, KS, USA
Tomasz R. Bielecki
Department of Mathematics and Statistics, University of Maryland at Baltimore County, 21228, Baltimore, MD, USA
Jerzy A. Filar

Authors

Tomasz R. Bielecki
View author publications
You can also search for this author in PubMed Google Scholar
Jerzy A. Filar
View author publications
You can also search for this author in PubMed Google Scholar

Additional information

On leave from Main College of Planning and Statistics, Warsaw, Poland.

Supported in part by the AFOSR and the NSF under the grant ECS-8704954.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bielecki, T.R., Filar, J.A. Singularly perturbed Markov control problem: Limiting average cost. Ann Oper Res 28, 153–168 (1991). https://doi.org/10.1007/BF02055579

Download citation

Issue Date: December 1991
DOI: https://doi.org/10.1007/BF02055579

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Singularly perturbed Markov control problem: Limiting average cost

Abstract

Access this article

Similar content being viewed by others

Average cost criterion induced by the regular utility function for continuous-time Markov decision processes

Asymptotics of Normalized Control with Markov Switchings

Stability Estimation of Transient Markov Decision Processes

References

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Singularly perturbed Markov control problem: Limiting average cost

Abstract

Access this article

Similar content being viewed by others

Average cost criterion induced by the regular utility function for continuous-time Markov decision processes

Asymptotics of Normalized Control with Markov Switchings

Stability Estimation of Transient Markov Decision Processes

References

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation