Stochastic Mean Payoff Games: Smoothed Analysis and Approximation Schemes

  • Endre Boros
  • Khaled Elbassioni
  • Mahmoud Fouz
  • Vladimir Gurvich
  • Kazuhisa Makino
  • Bodo Manthey
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6755)

Abstract

In this paper, we consider two-player zero-sum stochastic mean payoff games with perfect information modeled by a digraph with black, white, and random vertices. These BWR-games games are polynomially equivalent with the classical Gillette games, which include many well-known subclasses, such as cyclic games, simple stochastic games, stochastic parity games, and Markov decision processes. They can also be used to model parlor games such as Chess or Backgammon.

It is a long-standing open question if a polynomial algorithm exists that solves BWR-games. In fact, a pseudo-polynomial algorithm for these games with an arbitrary number of random nodes would already imply their polynomial solvability. Currently, only two classes are known to have such a pseudo-polynomial algorithm: BW-games (the case with no random nodes) and ergodic BWR-games (in which the game’s value does not depend on the initial position) with constant number of random nodes. In this paper, we show that the existence of a pseudo-polynomial algorithm for BWR-games with constant number of random vertices implies smoothed polynomial complexity and the existence of absolute and relative polynomial-time approximation schemes. In particular, we obtain smoothed polynomial complexity and derive absolute and relative approximation schemes for BW-games and ergodic BWR-games (assuming a technical requirement about the probabilities at the random nodes).

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Andersson, D., Miltersen, P.B.: The complexity of solving stochastic games on graphs. In: Dong, Y., Du, D.-Z., Ibarra, O. (eds.) ISAAC 2009. LNCS, vol. 5878, pp. 112–121. Springer, Heidelberg (2009)CrossRefGoogle Scholar
  2. 2.
    Beier, R., Vöcking, B.: Typical properties of winners and losers in discrete optimization. SIAM J. Comput. 35(4), 855–881 (2006)MathSciNetCrossRefMATHGoogle Scholar
  3. 3.
    Boros, E., Elbassioni, K.M., Gurvich, V., Makino, K.: A pumping algorithm for ergodic stochastic mean payoff games with perfect information. In: Eisenbrand, F., Shepherd, F.B. (eds.) IPCO 2010. LNCS, vol. 6080, pp. 341–354. Springer, Heidelberg (2010)CrossRefGoogle Scholar
  4. 4.
    Gillette, D.: Stochastic games with zero stop probabilities. In: Dresher, M., Tucker, A.W., Wolfe, P. (eds.) Contribution to the Theory of Games III. Annals of Mathematics Studies, vol. 39, pp. 179–187. Princeton University Press, Princeton (1957)Google Scholar
  5. 5.
    Gurvich, V., Karzanov, A., Khachiyan, L.: Cyclic games and an algorithm to find minimax cycle means in directed graphs. USSR Computational Mathematics and Mathematical Physics 28, 85–91 (1988)CrossRefMATHGoogle Scholar
  6. 6.
    Hoffman, A.J., Karp, R.M.: On nonterminating stochastic games. Management Science, Series A 12(5), 359–370 (1966)MathSciNetMATHGoogle Scholar
  7. 7.
    Karzanov, A.V., Lebedev, V.N.: Cyclical games with prohibition. Mathematical Programming 60, 277–293 (1993)MathSciNetCrossRefMATHGoogle Scholar
  8. 8.
    Liggett, T.M., Lippman, S.A.: Stochastic games with perfect information and time-average payoff. SIAM Review 4, 604–607 (1969)MathSciNetCrossRefMATHGoogle Scholar
  9. 9.
    Mine, H., Osaki, S.: Markovian decision process. Elsevier, Amsterdam (1970)MATHGoogle Scholar
  10. 10.
    Mulmuley, K., Vazirani, U.V., Vazirani, V.V.: Matching is as easy as matrix inversion. Combinatorica 7(1), 105–113 (1987)MathSciNetCrossRefMATHGoogle Scholar
  11. 11.
    Roth, A., Balcan, M.-F., Kalai, A., Mansour, Y.: On the equilibria of alternating move games. In: SODA, pp. 805–816 (2010)Google Scholar
  12. 12.
    Spielman, D.A., Teng, S.-H.: Smoothed analysis of algorithms: Why the simplex algorithm usually takes polynomial time. J. ACM 51(3), 385–463 (2004)MathSciNetCrossRefMATHGoogle Scholar
  13. 13.
    Spielman, D.A., Teng, S.-H.: Smoothed analysis: An attempt to explain the behavior of algorithms in practice. C. ACM 52(10), 76–84 (2009)CrossRefGoogle Scholar
  14. 14.
    Vorobyov, S.: Cyclic games and linear programming. Discrete Appl. Math. 156(11), 2195–2231 (2008)MathSciNetCrossRefMATHGoogle Scholar
  15. 15.
    Zwick, U., Paterson, M.: The complexity of mean payoff games on graphs. Theoret. Comput. Sci. 158(1-2), 343–359 (1996)MathSciNetCrossRefMATHGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Endre Boros
    • 1
  • Khaled Elbassioni
    • 2
  • Mahmoud Fouz
    • 3
  • Vladimir Gurvich
    • 1
  • Kazuhisa Makino
    • 4
  • Bodo Manthey
    • 5
  1. 1.RUTCOR, Rutgers UniversityUSA
  2. 2.Max-Planck-Institut für InformatikSaarbrückenGermany
  3. 3.Fachrichtung InformatikUniversität des SaarlandesGermany
  4. 4.Graduate School of Information Science and TechnologyUniversity of TokyoJapan
  5. 5.Department of Applied MathematicsUniversity of TwenteThe Netherlands

Personalised recommendations