Skip to main content
Log in

Stationary, completely mixed and symmetric optimal and equilibrium strategies in stochastic games

  • Original Paper
  • Published:
International Journal of Game Theory Aims and scope Submit manuscript

Abstract

In this paper, we address various types of two-person stochastic games—both zero-sum and nonzero-sum, discounted and undiscounted. In particular, we address different aspects of stochastic games, namely: (1) When is a two-person stochastic game completely mixed? (2) Can we identify classes of undiscounted zero-sum stochastic games that have stationary optimal strategies? (3) When does a two-person stochastic game possess symmetric optimal/equilibrium strategies? Firstly, we provide some necessary and some sufficient conditions under which certain classes of discounted and undiscounted stochastic games are completely mixed. In particular, we show that, if a discounted zero-sum switching control stochastic game with symmetric payoff matrices has a completely mixed stationary optimal strategy, then the stochastic game is completely mixed if and only if the matrix games restricted to states are all completely mixed. Secondly, we identify certain classes of undiscounted zero-sum stochastic games that have stationary optima under specific conditions for individual payoff matrices and transition probabilities. Thirdly, we provide sufficient conditions for discounted as well as certain classes of undiscounted stochastic games to have symmetric optimal/equilibrium strategies—namely, transitions are symmetric and the payoff matrices of one player are the transpose of those of the other. We also provide a sufficient condition for the stochastic game to have a symmetric pure strategy equilibrium. We also provide examples to show the sharpness of our results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

Notes

  1. The Pfaffian of a skew-symmetric matrix is the square root of the determinant.

References

  • Blackwell D, Ferguson TS (1968) The big match. Ann Math Stat 39:159–168

    Article  Google Scholar 

  • Dasgupta P, Maskin E (1986) The existence of equilibrium in discontinuous economic games, I: Theory. Rev Econ Stud 53–1:1–26

    Article  Google Scholar 

  • Duersch P, Oechssler J, Schipper B (2012) Pure strategy equilibria in symmetric two-player zero-sum games. Int J Game Theory 41(3):553–564

    Article  Google Scholar 

  • Filar J (1981) Orderfield property of stochastic games when the player who controls the transition changes from state to state. J Optim Theory Appl 34:505–517

    Article  Google Scholar 

  • Filar J (1984) On stationary equilibria of a single-controller stochastic game. Math Progr 30–3:313–325

    Article  Google Scholar 

  • Filar J (1985) The completely mixed single-controller stochastic game. Proc Am Math Soc 95–4:585–594

    Article  Google Scholar 

  • Fink AM (1964) Equilibrium in a stochastic n-person game. J Sci Hiroshima Uni 28–1:89–93

    Google Scholar 

  • Flesch J, Parthasarathy T, Thuijsman F, Uyttendaele P (2013) Evolutionary stochastic games. Dyn Games Appl 3:207–219

    Article  Google Scholar 

  • Gale D (1960) The theory of linear economic models. McGraw-Hill Book Company, New York

    Google Scholar 

  • Gillette D (1957) Stochastic games with zero-stop probabilities. Contributions to the theory of games. Ann Math Stud, Princeton University Press 39:179–188

  • Himmelberg CJ, Parthasarathy T, Raghavan TES, Van Vleck FS (1976) Existence of p-equilibrium and optimal stationary strategies in stochastic games. Proc Am Math Soc 60:245–251

    Google Scholar 

  • Hofbauer J, Sigmund K (2003) Evolutionary game dynamics. Bull Am Math Soc 40–4:479–519

    Article  Google Scholar 

  • Kakutani S (1941) A generalization of Brouwer’s fixed point theorem. Duke Math J 8–3:457–459

    Article  Google Scholar 

  • Kaplansky I (1945) A contribution to von Neumann’s theory of games. Ann Math 46–3:474–479

    Article  Google Scholar 

  • Kaplansky I (1995) A contribution to von Neumann’s theory of games II. Linear Algebra Appl 226–228:371–373

    Article  Google Scholar 

  • Krishnamurthy N, Parthasarathy T, Ravindran G (2009) Communication complexity of stochastic games. Proceedings of the International Conference on Game Theory for Networks (GameNets‘09), Istanbul, Turkey. IEEE Press, NJ, USA, p 411–417

  • Krishnamurthy N (2011) Equilibria in bimatrix games and stochastic games: theoretical and computational aspects. Ph.D. Thesis (Guided by: Parthasarathy T. Co-guided by: Samir Datta, Ravindran G, Subrahmanyam KV), Chennai Mathematical Institute, India

  • Maitra AP, Parthasarathy T (1970) On stochastic games. J Optim Theory Appl 5(4):289–300

    Article  Google Scholar 

  • Maynard Smith J, Price GR (1973) The logic of animal conflict. Nature 246:15–18

    Article  Google Scholar 

  • Mertens JF, Neyman A (1980) Stochastic games. Core Discussion Paper 8001, Université Catholique de Louvain-la-Neuve, Belgium

  • Mertens JF, Neyman A (1981) Stochastic games. Int J Game Theory 10:53–66

    Article  Google Scholar 

  • Nash J (1951) Non-cooperative games. Ann Math 54(2):286–295

    Article  Google Scholar 

  • Nowak AS (2003) On a new class of nonzero-sum discounted stochastic games having stationary Nash equilibrium points. Int J Game Theory 32:121–132

    Article  Google Scholar 

  • Oviedo JA (1996) Payoff matrices in completely mixed bimatrix games with zero-value. Union Math Argent 40:75–81

    Google Scholar 

  • Parthasarathy T (1982) Existence of equilibrium stationary strategies in discounted stochastic games. Sankhyā Indian J Stat 44(A–1):114–127

    Google Scholar 

  • Parthasarathy T, Raghavan TES (1981) An orderfield property for stochastic games when one player controls transition probabilities. J Optim Theory Appl 33–3:375–392

    Article  Google Scholar 

  • Parthasarathy T, Sinha S (1989) Existence of stationary equilibrium strategies in nonzero-sum discounted stochastic games with uncountable state space and state-independent transitions. Int J Game Theory 18:189–194

    Article  Google Scholar 

  • Parthasarathy T, Tijs SJ, Vrieze OJ (1984) Stochastic games with state independent transitions and separable rewards. In: Hammer G, Pallaschke D (eds) Selected topics in operations research and mathematical economics (Karlsruhe, 1983), Lecture notes in economics and mathematical systems, vol 226. Springer, Berlin, pp 262–271

  • Raghavan TES (1970) Completely mixed strategies in bimatrix games. J Lond Math Soc 2:709–712

    Article  Google Scholar 

  • Raghavan TES, Tijs SJ, Vrieze OJ (1986) Stochastic games with additive rewards and additive transitions. J Optim Theory Appl 47:451–464

    Article  Google Scholar 

  • Reny P (1999) On the existence of pure and mixed strategy Nash equilibria in discontinuous games. Econometrica 67–5:1029–1056

    Article  Google Scholar 

  • Rieder U (1979) Equilibrium plans for nonzero-sum Markov games. In: Moeshlin O, Pallaschke D (eds) Game theory and related topics. North-Holland Publishing Company, Amsterdam, pp 91–101

    Google Scholar 

  • Shapley L (1953) Stochastic games. Proc Natl Acad Sci 39:1095–1100

    Article  Google Scholar 

  • Sujatha B, Parthasarathy T, Saha A (2014) Strategies in certain types of skew symmetric games. In: Mishra RK, Shaheen, Raveendran J, Deman S (eds). Applications of game theory, Academic Foundation, p 43–50

  • Takahashi M (1964) Equilibrium points of stochastic noncooperative n-person games. J Sci Hiroshima Uni Ser AI Math 28:95–99

    Google Scholar 

  • Von Neumann J (1928) Zur Theorie der Gesellschaftsspiele. Math Ann 100:295–320 (in German)

Download references

Acknowledgments

We would like to thank the two anonymous Referees and the Associate Editor for their valuable and detailed comments that has helped structure this paper better.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sujatha Babu.

Additional information

Dedicated to the memory of Lloyd Shapley.

Appendix

Appendix

The proofs of Theorems 7 and 9 are published in the Proceedings of a Conference that are not available online, and is hence not easily accessible currently. For the sake of completeness, we provide these proofs below.

We first provide necessary notation and known results that are used in the proofs. For the bimatrix game (AB), let \(\varepsilon \) denote the set of equilibrium strategies. Let \( (x_0, y_0) \in \varepsilon \). Further let Cofactor\((A_{ij})\) and Cofactor\((B_{ij})\) denote the cofactors of \(a_{ij}\) and \(b_{ij}\) respectively. For showing the conditions for a stochastic game to be completely mixed using the concept of principal Pfaffians, we need the following results by Oviedo (1996).

  1. 1.

    (Theorem 1 of Oviedo 1996) If \(\varepsilon \) is completely mixed and \(x_0^tAy_0 = x_0^tBy_0 = 0\), then there exists an i \( (1 \le i \le n)\) such that Cofactor\((A_{i1})\), Cofactor\((A_{i2})\), ..., Cofactor\((A_{in})\) are different from zero and have the same sign. Similarly, there exists a j \( (1 \le j \le n)\) such that Cofactor\((B_{j1})\), Cofactor\((B_{j2})\), ..., Cofactor\((B_{jn})\) are different from zero and have the same sign.

  2. 2.

    (Corollary 1 of Oviedo 1996) If \(\varepsilon \) is completely mixed, then \(x_0^tAy_0 = \frac{det (A)}{\sum \limits _{i, j} A_{ij}}\) and \(x_0^tBy_0 = \frac{det (B)}{\sum \limits _{i, j} B_{ij}}\), where the denominators are always different from 0.

  3. 3.

    (Proposition 1 of Oviedo 1996) Suppose there exists constants \(v_1\) and \(v_2\) such that for any \( (x, y) \in \varepsilon \), \(Ay = v_1e\) and \(x^tB = v_2e^t\). Suppose, moreover, that both A and B are square matrices of rank \(n - 1\). Then \(\varepsilon \) is completely mixed.

Proof of Theorem 7

(Theorem 3 of Sujatha et al. 2014) Let the set of all equilibrium strategies \(\varepsilon \) be completely mixed. Let x and y be the strategies used by player-1 and player-2 respectively. Since the matrices are odd ordered skew symmetric, \(det (A) = det (B) = 0\). Then, \(x^tAy = x^tBy = 0\) by Oviedo (1996, Corollary 1).

Further by Oviedo (1996, Theorem 1) and without loss of generality, we can assume Cofactor\((A_{ij}) > 0\) for all i, for all j. That is, \( (-1)^{i+j} m_{ij} > 0\) where \(m_{ij}\) is the sub-determinant obtained by deleting the ith row and jth column of the matrix A. By Kaplansky (1995, Theorem 1), this implies that \( (-1)^{i+j} p_{i}p_{j} > 0\) where \(p_{i}\) and \(p_{j}\) are the i\(^\mathrm{th}\) and j\(^\mathrm{th}\) principal Pfaffians of A.

If i and j are both even or both odd, then the above equation implies that either both \(p_{i}\) and \(p_{j}\) are greater than 0, or both are lesser than 0. If i is even and j is odd or vice versa, then either (\(p_{i} > 0\), \(p_{j} < 0\)) or (\(p_{i} < 0\), \(p_{j} > 0\)). Thus all the principal Pfaffians of A are nonzero and alternate in sign. The same holds for all the principal Pfaffians of matrix B.

Conversely, let the principal Pfaffians of matrix A be nonzero and alternate in sign. Without loss of generality, we can assume that \(p_i > 0\) where i is odd and \(p_j < 0\) where j is even.

Then, \( (-1)^{i+j}p_{i}p_{j} > 0\). That is, \( (-1)^{i+j}m_{ij} > 0\), where \(m_{ij}\) is the sub-determinant obtained by deleting the ith row and jth column of matrix A. This implies that \(c_{ij} > 0\), where \(c_{ij}\) is the cofactor of \(a_{ij}\).

Hence all cofactors of A (and similarly of B) are nonzero and have the same sign. The rank of both these matrices is \(n-1\) since the minors of order \(n-1\) are nonzero. Consider a strategy \( (x_0, y_0) \in \varepsilon \). Then there exists \(v_1\) and \(v_2\) such that \(Ay_0 = v_1e\) and \( (x_0)^tB = v_2e^t\). By Oviedo (1996, Proposition 1), \(\varepsilon \) is completely mixed. \(\square \)

Proof of Theorem 9

(Theorem 5 of Sujatha et al. 2014) Let \(f^o (s)\) be the optimal strategy for player-1 for R(s). Gale (1960) showed that for a finite zero-sum game with a skew symmetric matrix, the value of the game is 0 and any strategy that is optimal for one player is also optimal for the other player. Hence, if we view R(s) as a finite zero sum game with skew symmetric payoff matrix, then the value of the stochastic game is 0. Let g be any strategy for player-2. We have \(r (s, f^o, g) \ge 0\).

Indicate r(fg) by \(\begin{pmatrix} r(1, \quad f(1), \quad g(1))\\ r(2,\quad f(2), \quad g(2))\\ \vdots \\ r(N, \quad f(N),\quad g(N)) \end{pmatrix}.\)

The total expected \(\beta \)-discounted income for player-1 is given by

where \(Q (f^o, g)\) is a \(N \times N\) matrix with \( (s, s')\)th element representing the transition probability \(q (s' | s, f^o, g)\). It is obvious that for all g, for all s

$$\begin{aligned} I_\beta (f^o, g) (s)\ge 0 \end{aligned}$$
(5)

Similarly, let \(g^o\) be an optimal stationary strategy for player-2. The total expected \(\beta \)-discounted income for player-2 is given by

(6)

By Gale’s result (1960), any strategy that is optimal for one player is also optimal for the other player, that is \(g^o = f^o\). Hence, \(I_\beta (f, f^o)(s) = [I - \beta Q (f, f^o)]^{-1} r (s, f, f^o)\) (from Eq. 6). Hence for all s, for all f,

$$\begin{aligned} I_\beta (f, f^o)(s) \le 0 \end{aligned}$$
(7)

Comparing Eqs. 5 and 7, we have \(I_\beta (f^o, g)(s) = I_\beta (f, f^o)(s) = 0\). That is, the value of the stochastic game starting in state s is 0. Note that the auxiliary game and the matrix game restricted to state s coincide for all discount factors. Thus the stochastic game has symmetric optimal stationary strategies independent of the discount factor \(\beta \) and the transition probability \(q (s' | s, f, g)\). \(\square \)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Babu, S., Krishnamurthy, N. & Parthasarathy, T. Stationary, completely mixed and symmetric optimal and equilibrium strategies in stochastic games. Int J Game Theory 46, 761–782 (2017). https://doi.org/10.1007/s00182-016-0555-5

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00182-016-0555-5

Keywords

Navigation