The Linear Program approach in multi-chain Markov Decision Processes revisited

Altman, Eitan; Spieksma, Flos

doi:10.1007/BF01415752

The Linear Program approach in multi-chain Markov Decision Processes revisited

Articles
Published: June 1995

Volume 42, pages 169–188, (1995)
Cite this article

Zeitschrift für Operations Research Aims and scope Submit manuscript

Eitan Altman¹ &
Flos Spieksma²

344 Accesses
18 Citations
Explore all metrics

Abstract

Linear Programming is known to be an important and useful tool for solving Markov Decision Processes (MDP). Its derivation relies on the Dynamic Programming approach, which also serves to solve MDP. However, for Markov Decision Processes with several constraints the only available methods are based on Linear Programs. The aim of this paper is to investigate some aspects of such Linear Programs, related to multi-chain MDPs. We first present a stochastic interpretation of the decision variables that appear in the Linear Programs available in the literature. We then show for the multi-constrained Markov Decision Process that the Linear Program suggested in [9] can be obtained from an equivalent unconstrained Lagrange formulation of the control problem. This shows the connection between the Linear Program approach and the Lagrange approach, that was previously used only for the case of a single constraint [3, 14, 15].

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Markov Decision Processes with Discounted Costs: Improved Successive Over-Relaxation Method

Finite Markov Chains and Markov Decision Processes

Markov Decision Processes with Discounted Rewards: Improved Successive Over-Relaxation Method

References

Altman E, Schwartz A (1991) Markov decision problems and state-action frequencies. SIAM J Control and Optimization 29/4:786–809
Google Scholar
Altman E (1994) Denumerable constrained Markov decision problems and finite approximations. Math of OR 19:169–191
Google Scholar
Beutler FJ, Ross KW (1985) Optimal policies for controlled Markov chains with a constraint. Math Anal Appl 112:236–252
Google Scholar
Borkar VS (1988) A convex analytic approach to Markov decision processes. Probab Th Rel Fields 78:583–602
Google Scholar
Borkar VS (1991) Topics in controlled Markov chains. Pitman
Dembo A, Zeitouni O (1993) Large deviations techniques and applications. Jones and Bartlett
Derman C (1970) Finite state markovian decision processes. Academic Press
Hordijk A, Kallenberg LCM (1979) Linear programing and Markov decision chains. Management Science 25/4:352–362
Google Scholar
Hordijk A, Kallenberg LCM (1984) Constrained undiscounted stochastic dynamic programming. Math of OR 9:277–298
Google Scholar
Kallenberg LCM (1983) Linear programming and finite markovian control problems. Math Centre Tracts 148 Amsterdam
Luenberger DG (1968) Optimization by vector space methods. John Wiley
Ross K, Varadarajan R (1991) Multichain Markov Decision Processes with a Sample Path Constraint: A Decomposition Approach. MOR 16/1:195–207
Google Scholar
Seneta E (1981) Non-negative martices and markov chains. Springer-Verlag
Sennott LI (1991) Constrained discounted Markov decision chains. Probability in the Engineering and Informational Sciences 5:463–475
Google Scholar
Sennott LI (1993) Constrained average cost markov decision chains. Probability in the Engineering and Informational Sciences 7:69–83
Google Scholar
Spieksma F (1990) Geometrically ergodic markov chains and the optimal control of queues. Ph D thesis Leiden

Download references

Author information

Authors and Affiliations

Centre Sophia Antipolis, INRIA, 06565, Valbonne Cedex, France
Eitan Altman
Institute of Mathematics & Computer Science, University of Leiden, P.O. Box 9512, 2300, RA Leiden, The Netherlands
Flos Spieksma

Authors

Eitan Altman
View author publications
You can also search for this author in PubMed Google Scholar
Flos Spieksma
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Altman, E., Spieksma, F. The Linear Program approach in multi-chain Markov Decision Processes revisited. ZOR - Methods and Models of Operations Research 42, 169–188 (1995). https://doi.org/10.1007/BF01415752

Download citation

Received: 15 February 1994
Issue Date: June 1995
DOI: https://doi.org/10.1007/BF01415752

Key words

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

The Linear Program approach in multi-chain Markov Decision Processes revisited

Abstract

Access this article

Similar content being viewed by others

Markov Decision Processes with Discounted Costs: Improved Successive Over-Relaxation Method

Finite Markov Chains and Markov Decision Processes

Markov Decision Processes with Discounted Rewards: Improved Successive Over-Relaxation Method

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Key words

Navigation

The Linear Program approach in multi-chain Markov Decision Processes revisited

Abstract

Access this article

Similar content being viewed by others

Markov Decision Processes with Discounted Costs: Improved Successive Over-Relaxation Method

Finite Markov Chains and Markov Decision Processes

Markov Decision Processes with Discounted Rewards: Improved Successive Over-Relaxation Method

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Key words

Search

Navigation