Abstract
In this chapter we solve finite horizon Markov decision problems. We describe a policy evaluation algorithm and the Bellman equations, which are necessary and sufficient optimality conditions for Markov decision problems. We then construct optimal policies from the solution of the Bellman equations. We will see that the class of Markov deterministic policies, which are easier to handle, contains optimal policies under assumptions that are often satisfied in practice. Finally, we describe how optimal policies can be computed by a backward induction algorithm. This chapter is based on [Put94], [Whi93], and [Der70].
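To make the backward induction idea concrete, the following is a minimal sketch and not the chapter's own formulation. It assumes finite state and action sets, a terminal reward, and maximization of expected total reward. All names (`backward_induction`, `reward`, `transition`, `terminal_reward`) and the two-state toy problem are hypothetical and chosen only for illustration.

```python
def backward_induction(states, actions, reward, transition, terminal_reward, horizon):
    """Sketch of backward induction for a finite horizon MDP (assumed setup).

    values[t][s]: optimal expected total reward from state s at epoch t.
    policy[t][s]: one maximizing action at epoch t in state s, so the
                  result is a Markov deterministic policy.
    """
    values = [dict() for _ in range(horizon + 1)]
    policy = [dict() for _ in range(horizon)]

    # Boundary condition: value at the final epoch is the terminal reward.
    for s in states:
        values[horizon][s] = terminal_reward(s)

    # Bellman recursion, stepping backward from the last decision epoch.
    for t in range(horizon - 1, -1, -1):
        for s in states:
            best_action, best_value = None, float("-inf")
            for a in actions(s):
                # Immediate reward plus expected value of the next state.
                q = reward(t, s, a) + sum(
                    p * values[t + 1][s2]
                    for s2, p in transition(t, s, a).items()
                )
                if q > best_value:
                    best_action, best_value = a, q
            values[t][s] = best_value
            policy[t][s] = best_action
    return values, policy


# Hypothetical toy problem: two states, deterministic transitions,
# reward 1 only for staying in state 1, horizon of two decision epochs.
states = [0, 1]
actions = lambda s: ["stay", "move"]
reward = lambda t, s, a: 1.0 if (s == 1 and a == "stay") else 0.0
transition = lambda t, s, a: {s: 1.0} if a == "stay" else {1 - s: 1.0}
terminal_reward = lambda s: 0.0

values, policy = backward_induction(
    states, actions, reward, transition, terminal_reward, horizon=2
)
```

In this toy problem the computed policy moves from state 0 to state 1 at the first epoch and stays in state 1 afterwards. Ties between actions are broken in favor of the first action listed, which is one arbitrary choice among several valid ones.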
Copyright information
© 2015 Springer Fachmedien Wiesbaden
About this chapter
Cite this chapter
Ondra, T. (2015). Finite Horizon Markov Decision Problems. In: Optimized Response-Adaptive Clinical Trials. BestMasters. Springer Spektrum, Wiesbaden. https://doi.org/10.1007/978-3-658-08344-1_2
DOI: https://doi.org/10.1007/978-3-658-08344-1_2
Publisher Name: Springer Spektrum, Wiesbaden
Print ISBN: 978-3-658-08343-4
Online ISBN: 978-3-658-08344-1
eBook Packages: Mathematics and Statistics, Mathematics and Statistics (R0)