Abstract.
We give a policy-improvement type algorithm to locate an optimal pure stationary strategy for discounted stochastic games with perfect information. A graph theoretic motivation for our algorithm is presented as well.
Similar content being viewed by others
Author information
Authors and Affiliations
Additional information
Received: January 1998 / Accepted: May 2002 Published online: February 14, 2003
Key words. stochastic games – MDP – perfect information – policy iteration
Partially Funded by NSF Grant DMS 930-1052 and DMS 970-4951
Rights and permissions
About this article
Cite this article
Raghavan, T., Syed, Z. A policy-improvement type algorithm for solving zero-sum two-person stochastic games of perfect information. Math. Program., Ser. A 95, 513–532 (2003). https://doi.org/10.1007/s10107-002-0312-3
Issue Date:
DOI: https://doi.org/10.1007/s10107-002-0312-3