A policy-improvement type algorithm for solving zero-sum two-person stochastic games of perfect information

Raghavan, T.E.S.; Syed, Zamir

doi:10.1007/s10107-002-0312-3

A policy-improvement type algorithm for solving zero-sum two-person stochastic games of perfect information

Published: March 2003

Volume 95, pages 513–532, (2003)
Cite this article

Mathematical Programming Submit manuscript

T.E.S. Raghavan¹ &
Zamir Syed²

204 Accesses
21 Citations
Explore all metrics

Abstract.

We give a policy-improvement type algorithm to locate an optimal pure stationary strategy for discounted stochastic games with perfect information. A graph theoretic motivation for our algorithm is presented as well.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

Stackelberg risk preference design

Article 02 April 2024

On the Replication of the Pre-kernel and Related Solutions

Article 19 September 2023

Author information

Authors and Affiliations

Department of Mathematics, Statistics and Computer Science, University of Illinois at Chicago, e-mail: ter@uic.edu, , , , , , US
T.E.S. Raghavan
The Hull Group L.L.C, Chicago, IL 60606, e-mail: zsyed@hdc.com, , , , , , US
Zamir Syed

Authors

T.E.S. Raghavan
View author publications
You can also search for this author in PubMed Google Scholar
Zamir Syed
View author publications
You can also search for this author in PubMed Google Scholar

Additional information

Received: January 1998 / Accepted: May 2002 Published online: February 14, 2003

Key words. stochastic games – MDP – perfect information – policy iteration

Partially Funded by NSF Grant DMS 930-1052 and DMS 970-4951

Rights and permissions

Reprints and permissions

About this article

Cite this article

Raghavan, T., Syed, Z. A policy-improvement type algorithm for solving zero-sum two-person stochastic games of perfect information. Math. Program., Ser. A 95, 513–532 (2003). https://doi.org/10.1007/s10107-002-0312-3

Download citation

Issue Date: March 2003
DOI: https://doi.org/10.1007/s10107-002-0312-3

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A policy-improvement type algorithm for solving zero-sum two-person stochastic games of perfect information

Abstract.

Access this article

Similar content being viewed by others

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

Stackelberg risk preference design

On the Replication of the Pre-kernel and Related Solutions

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A policy-improvement type algorithm for solving zero-sum two-person stochastic games of perfect information

Abstract.

Access this article

Similar content being viewed by others

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

Stackelberg risk preference design

On the Replication of the Pre-kernel and Related Solutions

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation