Skip to main content
Log in

A policy-improvement type algorithm for solving zero-sum two-person stochastic games of perfect information

  • Published:
Mathematical Programming Submit manuscript

Abstract.

 We give a policy-improvement type algorithm to locate an optimal pure stationary strategy for discounted stochastic games with perfect information. A graph theoretic motivation for our algorithm is presented as well.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Author information

Authors and Affiliations

Authors

Additional information

Received: January 1998 / Accepted: May 2002 Published online: February 14, 2003

Key words. stochastic games – MDP – perfect information – policy iteration

Partially Funded by NSF Grant DMS 930-1052 and DMS 970-4951

Rights and permissions

Reprints and permissions

About this article

Cite this article

Raghavan, T., Syed, Z. A policy-improvement type algorithm for solving zero-sum two-person stochastic games of perfect information. Math. Program., Ser. A 95, 513–532 (2003). https://doi.org/10.1007/s10107-002-0312-3

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10107-002-0312-3

Keywords

Navigation