Abstract
We consider zero-sum stochastic games. For every discount factor \(\lambda \), a time normalization allows to represent the discounted game as being played during the interval [0, 1]. We introduce the trajectories of cumulated expected payoff and of cumulated occupation measure on the state space up to time \(t\in [0,1]\), under \(\varepsilon \)-optimal strategies. A limit optimal trajectory is defined as an accumulation point as (\(\lambda , \varepsilon )\) tend to 0. We study existence, uniqueness and characterization of these limit optimal trajectories for compact absorbing games.
Similar content being viewed by others
References
Aumann RJ, Maschler M (1994) Repeated games with incomplete information. MIT Press, New York
Cardaliaguet P, Laraki R, Sorin S (2012) A continuous time approach for the asymptotic value in two-person zero-sum repeated games. SIAM J Control Optim 50:1573–1596
Kohlberg E (1974) Repeated games with absorbing states. Ann Stat 2:724–738
Laraki R (2010) Explicit formulas for repeated games with absorbing states. Int J Game Theor 39:53–69
Mertens J-F, Neyman A (1981) Stochastic games. Int J Game Theor 10:53–66
Mertens J-F, Neyman A, Rosenberg D (2009) Absorbing games with compact action spaces. Math Oper Res 34:257–262
Mertens J-F, Zamir S (1971) The value of two-person zero-sum repeated games with lack of information on both sides. Int J Game Theor 1:39–64
Oliu-Barton M, Ziliotto B (2018) Constant payoff in zero-sum stochastic games. arXiv:1811.04518v1
Rosenberg D, Sorin S (2001) An operator approach to zero-sum repeated games. Isr J Math 121:221–246
Shapley LS (1953) Stochastic games. Proc Natl Acad Sci USA 39:1095–1100
Sorin S (2002) A first course on zero-sum repeated games. Springer, New York
Sorin S, Vigeral G (2013) Existence of the limit value of two person zero-sum discounted repeated games via comparison theorems. J Optim Theor Appl 157:564–576
Sorin S, Vigeral G (2015) Reversibility and oscillations in zero-sum discounted stochastic games. J Dyn Games 2:103–115
Sorin S, Venel X, Vigeral G (2010) Asymptotic properties of optimal trajectories in dynamic programming. Sankhya A 72:237–245
Vigeral G (2013) A zero-sum stochastic game with compact action sets and no asymptotic value. Dyn Games Appl 3:172–186
Ziliotto B (2016) Zero-sum repeated games: counterexamples to the existence of the asymptotic value and the conjecture \(maxmin = {\rm lim}\, v_n\). Ann Probab 44:1107–1133
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Some of the results of this paper were presented in “Atelier Franco-Chilien: Dynamiques, optimisation et apprentissage” Valparaiso, November 2010, and a preliminary version of this paper was given at the Game Theory Conference in Stony Brook, July 2012. This research was supported by Grant PGMO 0294-01 (France)
Rights and permissions
About this article
Cite this article
Sorin, S., Vigeral, G. Limit Optimal Trajectories in Zero-Sum Stochastic Games. Dyn Games Appl 10, 555–572 (2020). https://doi.org/10.1007/s13235-019-00333-z
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13235-019-00333-z