Reinforcement Learning for MDPs with Constraints

Geibel, Peter

doi:10.1007/11871842_63

Reinforcement Learning for MDPs with Constraints

Peter Geibel²¹

Conference paper

6613 Accesses
41 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4212))

Abstract

In this article, I will consider Markov Decision Processes with two criteria, each defined as the expected value of an infinite horizon cumulative return. The second criterion is either itself subject to an inequality constraint, or there is maximum allowable probability that the single returns violate the constraint. I describe and discuss three new reinforcement learning approaches for solving such control problems.

Download to read the full chapter text

Chapter PDF

References

Altman, E.: Constrained Markov Decision Processes. Chapman and Hall/CRC, Boca Raton (1999)
MATH Google Scholar
Bertsekas, D.P.: Dynamic Programming and Optimal Control. vol. 1, 2. Athena Scientific, Belmont (1995)
Google Scholar
Dolgov, D.A., Durfee, E.H.: Constructing optimal policies for agents with constrained architectures. In: AAMAS, pp. 974–975 (2003)
Google Scholar
Dolgov, D.A., Durfee, E.H.: Approximating optimal policies for agents with limited execution resources. In: Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence, pp. 1107–1112. AAAI Press, Menlo Park (2004)
Google Scholar
Feinberg, E.A., Shwartz, A.: Constrained markov decision models with weighted discounted rewards. Math. of Operations Research 20(4), 302–320 (1995)
Article MATH MathSciNet Google Scholar
Gabor, Z., Kalmar, Z., Szepesvari, C.: Multi-criteria reinforcement learning. In: Proc. 15th International Conf. on Machine Learning, pp. 197–205. Morgan Kaufmann, San Francisco (1998)
Google Scholar
Geibel, P., Wysotzki, F.: Risk-sensitive reinforcement learning applied to chance constrained control. JAIR 24, 81–108 (2005)
MATH Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning – An Introduction. MIT Press, Cambridge (1998)
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Cognitive Science, AI Group, University of Osnabrück, Germany
Peter Geibel

Authors

Peter Geibel
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Knowledge Engineering Group, Technische Universität Darmstadt,
Johannes Fürnkranz
Max Planck Institute for Computer Science, Saarbrücken, Germany
Tobias Scheffer
Faculty of Computer Science, Otto-von-Guericke-University Magdeburg, Germany
Myra Spiliopoulou

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Geibel, P. (2006). Reinforcement Learning for MDPs with Constraints. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds) Machine Learning: ECML 2006. ECML 2006. Lecture Notes in Computer Science(), vol 4212. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11871842_63

Download citation

DOI: https://doi.org/10.1007/11871842_63
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45375-8
Online ISBN: 978-3-540-46056-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics