Abstract
In this paper we develop the theory of constrained Markov games. We consider the expected average cost as well as discounted cost. We allow different players to have different types of costs. We present sufficient conditions for the existence of stationary Nash equilibrium. Our results are based on the theory of sensitivity analysis of mathematical programs developed by Dantzig, Folkman, and Shapiro [9], which was applied to Markov Decision Processes (MDPs) in [3]. We further characterize all stationary Nash equilibria as fixed points of some coupled Linear Programs.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Altman, E. and A. Shwartz. Optimal Priority Assignment: A Time Sharing Approach. IEEE Transactions on Automatic Control, AC-34, No. 10, 1089–1102, 1989.
Altman, E. and A. Shwartz. Markov Decision Problems and State-Action Frequencies. SIAM Journal of Control and Optimization, 29, No. 4, 786–809, 1991.
Altman, E. and A. Shwartz. Sensitivity of Constrained Markov Decision Problems. Annals of Operations Research,32, 1–22, 1991.
Altman, E. Denumerable Constrained Markov Decision Processes and Finite Approximations. Mathematics of Operations Research, 19, No. 1, 169–191, 1994.
Altman, E. Asymptotic Properties of Constrained Markov Decision Processes. ZOR— Methods and Models in Operations Research, 37, No. 2, 151–170, 1993.
Beutler, F. J. and K. W. Ross. Optimal Policies for Controlled Markov Chains with a Constraint. Journal of Mathematical Analysis and Applications, 112,236–252,1985.
Beutler, F. J. and K. W. Ross. Time-Average Optimal Constrained Semi-Markov Decision Processes. Advances of Applied Probability, 18, No. 2, 341–359, 1986.
Borkar, V. S. Ergodic Control of Markov Chains with Constraints—The General Case. SIAM Journal of Control and Optimization, 32, No. 1, 176–186, 1994.
Dantzig, G. B., J. Folkman, and N. Shapiro. On the Continuity of the Minimum Set of a Continuous Function. Journal of Mathematical Analysis and Applications, 17, 519–548, 1967.
Derman, C and M. Klein. Some Remarks on Finite Horizon Markovian Decision Models. Operations Research, 13, 272–278, 1965.
Feinberg, E. A. Constrained Semi-Markov Decision Processes with Average Rewards. ZOR, 39, 257–288, 1993.
Feinberg, E. A. and M. I. Reiman. Optimality of Randomized Trunk Reservation. Probability in the Engineering and Informational Sciences, 8,463–489, 1994.
Feinberg, E. A. and A. Shwartz. Constrained Markov Decision Models with Weighted Discounted Rewards. Mathematics of Operations Research, 20, 302–320, 1995.
Feinberg, E. A. and A. Shwartz. Constrained Discounted Dynamic Programming. Mathematics of Operations Research, 21, 922–945, 1996.
Hordijk, A. and L. C. M. Kallenberg. Constrained Undiscounted Stochastic Dynamic Programming. Mathematics of Operations Research, 9, No. 2, 276–289, 1984.
Hordijk, A. and F. Spieksma. Constrained Admission Control to a Queuing System. Advances of Applied Probability, 21,409–431, 1989.
Hsiao, M. T. and A. A. Lazar. Optimal Decentralized Flow Control of Markovian queueing Networks with Multiple Controllers. Performance Evaluation, 13,181–204, 1991.
Kallenberg, L. C. M. Linear Programming and Finite Markovian Control Problems. Mathematical Centre Tracts 148, Amsterdam, 1983.
Korilis, Y. A. and A. Lazar. On the Existence of Equilibria in Noncooperative Optimal Flow Control. Journal of the Association for Computing Machinery, 42, No. 3, 584–613, 1995.
Lazar, A. Optimal Flow Control of a Class of Queuing Networks in Equilibrium. IEEE Transactions on Automatic Control, 28, No. 11, 1001–1007, 1983.
Levi, R. and A. Shwartz. A Theory of ApproachabiHty and Throughput-Cost Tradeoff in a Queue with Impatient Customers. EE Pub. 936, Technion, 1994.
Levi, R. and A. Shwartz. Throughput-Delay Tradeoff with Impatient Arrivals. Proceedings of the 23rd Allerton Conference on Communications, Control and Computing, Allerton, IL, 1994.
Nain, P. and K. W. Ross. Optimal Priority Assignment with hard Constraint. Transactions on Automatic Control, 31, No. 10, 883–888, IEEE 1986.
Rosen, J. B. Existence and Uniqueness of Equilibrium Points for Concave n-Person Games. Econometrica, 33, 520–534, 1965.
Ross, K. W. Randomized and Past-Dependent Policies for Markov Decision Processes with Multiple Constraints. Operations Research, 37, No. 3, 474–477, 1989.
Ross K. W. and B. Chen. Optimal Scheduling of Interactive and Nonlnteractive Traffic in Telecommunication Systems. IEEE Transactions on Automatic Control, 33, No. 3, 261–267, 1988.
Ross, K. W. and R. Varadarajan. Markov Decision Processes with Sample Path Constraints: The Communicating Case. Operations Research, 37, No. 5, 780–790, 1989.
Sennott, L. I. Constrained Discounted Markov Decision Chains. Probability in the Engineering and Informational Sciences, 5,463–475, 1991.
Sennott, L. I. Constrained Average Cost Markov Decision Chains, Probability in the Engineering and Informational Sciences 7, 69–83, 1993.
Shimkin, N. Stochastic Games with Average Cost Constraints. Annals of the International Society of Dynamic Games, Vol. 1: Advances in Dynamic Games and Applications (T. Basar and A. Haurie, eds.) Birkhauser, Boston, 1994.
Shimkin, N. and A. Shwartz. Guaranteed Performance Regions for Markovian Systems with Competing Decision Makers, IEEE Transactions on Automatic Control, 38, 84–95, 1993.
Spieksma, F. M. Geometrically Ergodic Markov Chains and the Optimal Control of Queues. Ph.D. thesis, University of Leiden, 1990.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer Science+Business Media New York
About this paper
Cite this paper
Altman, E., Shwartz, A. (2000). Constrained Markov Games: Nash Equilibria. In: Filar, J.A., Gaitsgory, V., Mizukami, K. (eds) Advances in Dynamic Games and Applications. Annals of the International Society of Dynamic Games, vol 5. Birkhäuser, Boston, MA. https://doi.org/10.1007/978-1-4612-1336-9_11
Download citation
DOI: https://doi.org/10.1007/978-1-4612-1336-9_11
Publisher Name: Birkhäuser, Boston, MA
Print ISBN: 978-1-4612-7100-0
Online ISBN: 978-1-4612-1336-9
eBook Packages: Springer Book Archive