Abstract
In this paper a continuous-time discounted dynamic programming problem in a Markov decision model is investigated. In many cases it is difficult to search directly for an optimal solution for such a programming problem. We introduce a Lagrangian-type programming problem associated with the original programming problem and show that, under some assumptions, a weak optimal solution exists for the Lagrangian problem. Moreover, we consider the original programming problem in the perturbed programming one and develop the Lagrangian duality.
Similar content being viewed by others
References
Aubin JP (1979) Mathematical Methods of Game and Economic Theory. North-Holland, Amsterdam
Blackwell D (1965) Discounted dynamic programming. Ann Math Statist 36:226–235
Doshi BT (1976) Continuous time control of Markov processes on an arbitrary state space; discounted rewards. Ann Statist 4:1219–1235
Dynkin EB (1965) Markov Processes—I. Springer-Verlag, Berlin
Fleming WH (1969) Optimal continuous-parameter stochastic control. SIAM Rev 11:470–509
Gihman II, Skorohod AV (1979) Controlled Stochastic Processes. Springer-Verlag, Berlin
Kakumanu PK (1971) Continuously discounted Markov decision model with countable state and action spaces. Ann Math Statist 42:919–926
Lai HC, Tanaka K (1986) On aD-solution of a cooperativem-person discounted Markov game. J Math Anal Appl 115:578–591
Lai HC, Tanaka K (1987) AnN-person noncooperative discounted vector-valued dynamic game with a metric space. Appl Math Optim 16:135–146
Luenberger DG (1969) Optimization by Vector Space Methods. Wiley—Interscience, New York
Miller BL (1968) Finite state continuous time Markov decision processes with finite planning horizon. SIAM J Control 6:266–280
Miller BL (1968) Finite state continuous time Markov decision processes with an infinite planning horizon. J Math Anal Appl 22:552–569
Pliska SR (1975) Controlled jump processes. Stochastic Process Appl 3:259–282
Pliska SR (1978) On a functional differential equation that arises in a Markov control problem. J Differential Equations 28:390–405
Ponstein J (1980) Approaches to the Theory of Optimization. Cambridge University Press, London.
Tanaka K (to appear) On a discounted dynamic programming with the constraints. J Math Anal Appl
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Lai, HC., Tanaka, K. On continuous-time discounted stochastic dynamic programming. Appl Math Optim 23, 155–169 (1991). https://doi.org/10.1007/BF01442395
Accepted:
Issue Date:
DOI: https://doi.org/10.1007/BF01442395