Abstract
The long-run average cost control problem for discrete time Markov chains is studied in an extremely general framework. Existence of stable stationary strategies which are optimal in the appropriate sense is established and these are characterized via the dynamic programming equations. The approach here differs from the conventional approach via the discounted cost problem and covers situations not covered by the latter.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
R.N. Bhattacharya, and M. Majumdar, preprint.
P. Billingsley, Convergence of Probability Measures, John Wiley, New York, 1968.
V. Borkar, Controlled Markov Chains and Stochastic Networks, SIAM J. Control and Opt., 21 (1983), pp. 652–666.
V. Borkar, On Minimum Cost Per Unit Time Control of Markov Chains, SIAM J. Control and Opt., 22 (1984), pp. 965–978.
V. Borkar and P. Varaiya, Identification and Adaptive Control of Markov Chains, SIAM J. Control and Opt., 20 (1982), pp. 470–489.
H. Deppe, Continuity of Mean Recurrence Times in Denumerable Semi-Markov Processes, Zeit. Wahr, 69 (1985), pp. 581–592.
M. Doxsxer, and S.R.S Varadhan, Asymptotic Evaluation of Certain Markov Process Expectations for Large Time III, Comm. in Pure and Appl. Math XXIX, 4 (1976), pp. 389–461.
A Federgruen, A. Horduk and H.C. Toms, A Note on Simultaneous Recurrence Conditions on a Set of Denumerable Stochastic Matrices, J. Appl. Prob., 15 (1978), pp. 842–847.
B. Hajek, Hitting-Time and Occupation-Time Bounds Implied by Drift Analysis with Applications, Adv. Appl. Prob., 14 (1982), pp. 502–525.
H.J. Kushner, Introduction to Stochastic Control, Holt, Rinehart and Winston Inc., New York, 1971.
M. Losve, Probability Theory II, 4th Ed., Springer-Verlag, New York, 1978.
Z. Kosberg, P. Varaiya and J. Walrand, Optimal Control of Service in Tandem Queues, IEEE Trans. on Automatic Control, AC-27 (1982), pp. 600–609.
S. Ross, Applied Probability Models with Optimization Applications, Holden Day, San Francisco (1970).
J. Wessels and J. Van De Wal, Markov Decision Processes, Statistics Neerlandica, 39 (1985), pp. 219–233.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1988 Springer-Verlag New York Inc.
About this paper
Cite this paper
Borkar, V.S. (1988). Control of Markov Chains with Long-Run Average Cost Criterion. In: Fleming, W., Lions, PL. (eds) Stochastic Differential Systems, Stochastic Control Theory and Applications. The IMA Volumes in Mathematics and Its Applications, vol 10. Springer, New York, NY. https://doi.org/10.1007/978-1-4613-8762-6_4
Download citation
DOI: https://doi.org/10.1007/978-1-4613-8762-6_4
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4613-8764-0
Online ISBN: 978-1-4613-8762-6
eBook Packages: Springer Book Archive