Abstract
Stochastic Shortest Path problems (SSPs), a subclass of Markov Decision Problems (MDPs), can be solved efficiently with value iteration (VI), policy iteration (PI), RTDP, LAO*, and related algorithms. In many practical problems, however, the transition probabilities cannot be estimated accurately. In this paper, we represent uncertain transition probabilities as closed real intervals. We also describe a general algorithm, called gLAO*, that solves uncertain MDPs efficiently. We show that Buffet and Aberdeen's approach, which searches for the best policy under the worst model, is a special case of our approach. Experiments show that gLAO* retains the strong performance of LAO* when solving uncertain MDPs.
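To make the interval representation concrete, the following is a minimal sketch of a single pessimistic Bellman backup over interval transition probabilities, i.e. the "best action under the worst model" idea mentioned above. It is not the paper's gLAO* algorithm: the function names, the dictionary layout, and the toy numbers are all assumptions chosen for illustration, and the greedy construction of the worst-case distribution is the standard one for interval models with cost minimization.

```python
def worst_case_distribution(intervals, values):
    """Return the distribution inside the intervals with the highest expected cost.

    intervals: dict successor -> (low, high); assumes sum(low) <= 1 <= sum(high)
    values:    dict successor -> current cost-to-go estimate
    (Both layouts are hypothetical, chosen for this sketch.)
    """
    # Start every successor at its lower bound.
    probs = {s: low for s, (low, _high) in intervals.items()}
    remaining = 1.0 - sum(probs.values())
    # Hand the leftover probability mass to the costliest successors first.
    for s in sorted(intervals, key=lambda s: values[s], reverse=True):
        low, high = intervals[s]
        extra = min(high - low, remaining)
        probs[s] += extra
        remaining -= extra
    return probs


def pessimistic_backup(actions, values):
    """One backup: choose the action minimizing cost under its worst model.

    actions: dict action -> (immediate_cost, intervals)
    Returns (worst_case_q_value, action).
    """
    best = None
    for action, (cost, intervals) in actions.items():
        probs = worst_case_distribution(intervals, values)
        q = cost + sum(p * values[s] for s, p in probs.items())
        if best is None or q < best[0]:
            best = (q, action)
    return best


# Toy example (made-up numbers): two actions from one state.
values = {"goal": 0.0, "s1": 10.0}
actions = {
    "safe": (2.0, {"goal": (0.6, 0.9), "s1": (0.1, 0.4)}),
    "risky": (1.0, {"goal": (0.2, 0.8), "s1": (0.2, 0.8)}),
}
q_value, chosen = pessimistic_backup(actions, values)
```

In this toy instance the worst model pushes as much mass as the intervals allow onto the costly state `s1`, so the backup prefers `safe` even though `risky` has the lower immediate cost. When every interval collapses to a single point, the backup reduces to the ordinary Bellman backup.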
References
Hansen, E.A., Zilberstein, S.: LAO*: A heuristic search algorithm that finds solutions with loops. Artificial Intelligence 129, 35–62 (2001)
Bagnell, J.A., Ng, A.Y., Schneider, J.: Solving uncertain Markov decision problems. Technical Report CMU-RI-TR-01-25, Robotics Institute, Carnegie Mellon University, Pittsburgh, PA (August 2001)
Daram, U.K., Chong, E.K.P., Shroff, N.B.: Markov Decision Processes with Uncertain Transition Rates: Sensitivity and Robust Control. In: Proceedings of the 41st IEEE Conference on Decision and Control, Las Vegas, Nevada, USA (December 2002)
Buffet, O., Aberdeen, D.: Robust planning with (L)RTDP. In: Proceedings of the 19th International Joint Conference on Artificial Intelligence (IJCAI 2005) (2005)
Givan, R., Leach, S., Dean, T.: Bounded parameter Markov decision processes. Artificial Intelligence 122(1-2), 71–109 (2000)
Bertsekas, D.P., Tsitsiklis, J.N.: Neuro-Dynamic Programming. Athena Scientific, Belmont (1996)
Bertsekas, D.: Dynamic Programming and Optimal Control. Athena Scientific, Belmont (1995)
Martelli, A., Montanari, U.: Optimizing decision trees through heuristically guided search. Comm. ACM 21(12), 1025–1039 (1978)
Barto, A.G., Bradtke, S., Singh, S.: Learning to act using real time dynamic programming. Artificial Intelligence 72 (1995)
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cui, S., Sun, J., Yin, M., Lu, S. (2006). Solving Uncertain Markov Decision Problems: An Interval-Based Method. In: Jiao, L., Wang, L., Gao, X., Liu, J., Wu, F. (eds) Advances in Natural Computation. ICNC 2006. Lecture Notes in Computer Science, vol 4222. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11881223_120
DOI: https://doi.org/10.1007/11881223_120
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45907-1
Online ISBN: 978-3-540-45909-5
eBook Packages: Computer Science, Computer Science (R0)