Analysis of Markov Decision Processes Under Parameter Uncertainty

Buchholz, Peter; Dohndorf, Iryna; Scheftelowitsch, Dimitri

doi:10.1007/978-3-319-66583-2_1

Part of the book series: Lecture Notes in Computer Science (LNPSE, volume 10497)

Included in the following conference series:

European Workshop on Performance Engineering

Abstract

Markov Decision Processes (MDPs) are a popular decision model for stochastic systems. Introducing uncertainty in the transition probability distribution by giving upper and lower bounds for the transition probabilities yields the model of Bounded Parameter MDPs (BMDPs), which captures many practical situations with limited knowledge about a system or its environment. In this paper, the class of BMDPs is extended to Bounded Parameter Semi Markov Decision Processes (BSMDPs). The main focus of the paper is on the introduction and numerical comparison of different algorithms to compute optimal policies for BMDPs and BSMDPs; specifically, we introduce and compare variants of value and policy iteration.

The paper delivers an empirical comparison between different numerical algorithms for BMDPs and BSMDPs, with an emphasis on the required solution time.
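
To make the idea of value iteration under interval-bounded transition probabilities concrete, the following is a minimal sketch, not the authors' algorithm. It assumes a discounted reward criterion, a pessimistic (worst-case) choice of the transition probabilities inside the bounds, and feasible intervals (the lower bounds of each row sum to at most 1, the upper bounds to at least 1). The function names, the array layout, the discount factor `gamma`, and the simple stopping test are all illustrative assumptions.

```python
import numpy as np


def worst_case_distribution(lower, upper, values):
    """Return p with lower <= p <= upper and sum(p) = 1 that minimises p @ values.

    Assumes the intervals are feasible: sum(lower) <= 1 <= sum(upper).
    """
    p = np.asarray(lower, dtype=float).copy()
    upper = np.asarray(upper, dtype=float)
    slack = max(0.0, 1.0 - p.sum())  # probability mass still to distribute
    # Give the free mass to the lowest-valued successor states first.
    for s in np.argsort(values):
        add = min(upper[s] - p[s], slack)
        p[s] += add
        slack -= add
    return p


def pessimistic_value_iteration(lower, upper, reward, gamma=0.9,
                                eps=1e-8, max_iter=10_000):
    """Worst-case value iteration for a bounded-parameter MDP (sketch).

    lower, upper: arrays of shape (A, S, S) with the probability bounds.
    reward:       array of shape (S, A).
    Returns the value vector and a greedy policy (one action per state).
    """
    n_actions, n_states, _ = lower.shape
    v = np.zeros(n_states)
    q = np.zeros((n_states, n_actions))
    for _ in range(max_iter):
        for a in range(n_actions):
            for s in range(n_states):
                # The adversary picks the least favourable distribution.
                p = worst_case_distribution(lower[a, s], upper[a, s], v)
                q[s, a] = reward[s, a] + gamma * (p @ v)
        v_new = q.max(axis=1)
        # Simple stopping test on the change between successive iterates.
        if np.max(np.abs(v_new - v)) < eps:
            return v_new, q.argmax(axis=1)
        v = v_new
    return v, q.argmax(axis=1)
```

When the lower and upper bounds coincide, the inner minimisation has a single feasible point and the sketch reduces to ordinary value iteration for an MDP. An optimistic variant would sort the successor states in descending order of value instead.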

Notes

1. In this and the subsequent equations, we consider continuous random variables for the sojourn times in the states and assume that the integrals are well-defined. For discrete random variables, the integrals have to be replaced by sums and the densities by probabilities.
2. \(\epsilon \)-optimality means that the optimal value is reached up to \(\epsilon \).
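
One common way to write the condition in note 2 is shown below. The notes do not fix a norm or notation, so the maximum norm and the symbols \(v^{\pi}\) (value of the computed policy \(\pi\)) and \(v^{*}\) (optimal value) are assumptions made for illustration:

\[
\left\| v^{\pi} - v^{*} \right\|_{\infty} \le \epsilon .
\]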

Author information

Authors and Affiliations

Department of Computer Science, TU Dortmund, Dortmund, Germany
Peter Buchholz, Iryna Dohndorf & Dimitri Scheftelowitsch

Corresponding author

Correspondence to Iryna Dohndorf.

Editor information

Editors and Affiliations

Bristol, Germany
Philipp Reinecke
University of L’Aquila, L’Aquila, Italy
Antinisca Di Marco

About this paper

Cite this paper

Buchholz, P., Dohndorf, I., Scheftelowitsch, D. (2017). Analysis of Markov Decision Processes Under Parameter Uncertainty. In: Reinecke, P., Di Marco, A. (eds) Computer Performance Engineering. EPEW 2017. Lecture Notes in Computer Science, vol 10497. Springer, Cham. https://doi.org/10.1007/978-3-319-66583-2_1

DOI: https://doi.org/10.1007/978-3-319-66583-2_1
Published: 13 August 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-66582-5
Online ISBN: 978-3-319-66583-2
eBook Packages: Computer Science, Computer Science (R0)