Abstract
We consider the problem of discounted dynamic programming with a continuous time parameter (CDP) when the Markov policies are used. We give an axiomatization of such discounted CDP. We also give necessary and sufficient conditions for the existence of an optimal policy. Analogously to the discrete case we formulate improvement’s theorems and a theorem on the existence of a (p, ε)-optimal policy in a class of semi-Markov policies.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
D. Blackwell: On the functional equation of dynamic programming. J. Math. Anal. Appl. 2 (1961), 273–276.
D. Blackwell: Discrete dynamic programming. The Annals of Math. Statist. 33 (1962), 719–726.
D. Blackwell: Discounted dynamic programming. The Annals of Math. Statist. 36 (1965), 226–235.
K. Hinderer: Foundations of Non-stationary Dynamic Programming. Springer-Verlag, Berlin—Heidelberg 1970.
R. A. Howard: Dynamic Programming and Markov Processes. Wiley, New York 1960.
P. Kakumanu: Continuously discounted Markov decision model with countable state and action space. The Annals of Math. Statist. 42 (1971), 919–926.
A. Maitra: Dynamic programming for countable state systems. Sankhyá Ser. A 27 (1965), 241–248.
A. Maitra: Discounted dynamic programming on compact metric spaces. Sankhyá Ser. A 30 (1968), 211–216.
P. A. Meyer: Probability and Potentials. Blaisdell Publishing Company: Waltham, Massachusetts—Toronto—London 1966.
B. L. Miller: Finite state continuous time Markov decision processes with a finite planning horizon. SI AM J. Control 6 (1968), 266–280.
R. E. Strauch: Negative dynamic programming. The Annals of Math. Statist. 37 (1966), 871–890.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1977 ACADEMIA, Publishing House of the Czechoslovak Academy of Sciences, Prague
About this chapter
Cite this chapter
Idzik, A. (1977). On Markov Policies in Continuous Time Discounted Dynamic Programming. In: Kožešnik, J. (eds) Transactions of the Seventh Prague Conference on Information Theory, Statistical Decision Functions, Random Processes and of the 1974 European Meeting of Statisticians. Transactions of the Seventh Prague Conference on Information Theory, Statistical Decision Functions, Random Processes and of the 1974 European Meeting of Statisticians, vol 7A. Springer, Dordrecht. https://doi.org/10.1007/978-94-010-9910-3_28
Download citation
DOI: https://doi.org/10.1007/978-94-010-9910-3_28
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-010-9912-7
Online ISBN: 978-94-010-9910-3
eBook Packages: Springer Book Archive