On the convergence of policy iteration for controlled diffusions

Puterman, M. L.

doi:10.1007/BF00935182

Contributed Papers
Published: January 1981

Journal of Optimization Theory and Applications, Volume 33, pages 137–144 (1981)

Abstract

The convergence of an approximation scheme known as policy iteration has been demonstrated for controlled diffusions by Fleming, Puterman, and Bismut. In this paper, we show that this approximation scheme is equivalent to the Newton-Kantorovich iteration for solving the optimality equation and exploit this equivalence to obtain a new proof of convergence. Estimates of the rate of convergence of this procedure are also obtained.
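
The equivalence stated in the abstract can be illustrated in a much simpler setting than the one the paper treats. The sketch below is not the paper's construction: it assumes a finite-state, discounted, reward-maximizing Markov decision problem (the setting of Puterman and Brumelle, 1979) in place of a controlled diffusion, and every name in it (`greedy_policy`, `evaluate_policy`, `newton_step`, `random_problem`) is hypothetical. Writing the optimality equation as B(v) = max_d {r_d + beta P_d v} - v = 0, one Newton step from v uses the slope beta P_d - I of the affine piece attaining the maximum, and the result coincides with the value of the policy that is greedy with respect to v.

```python
import numpy as np


def greedy_policy(P, r, beta, v):
    """Policy improvement: pick, in each state, the action maximizing r + beta * P v."""
    return (r + beta * (P @ v)).argmax(axis=0)


def evaluate_policy(P, r, beta, policy):
    """Policy evaluation: solve (I - beta * P_d) v = r_d for the fixed policy d."""
    n = P.shape[1]
    idx = np.arange(n)
    P_d = P[policy, idx, :]   # row s is the transition law of action policy[s]
    r_d = r[policy, idx]
    return np.linalg.solve(np.eye(n) - beta * P_d, r_d)


def newton_step(P, r, beta, v):
    """One Newton step for B(v) = max_d {r_d + beta * P_d v} - v = 0.

    The slope used is that of the affine piece attaining the maximum at v,
    which stands in for a derivative (B is only piecewise affine here).
    """
    n = P.shape[1]
    idx = np.arange(n)
    d = greedy_policy(P, r, beta, v)
    P_d = P[d, idx, :]
    r_d = r[d, idx]
    residual = r_d + beta * (P_d @ v) - v   # B(v)
    slope = beta * P_d - np.eye(n)          # slope of the active affine piece
    return v - np.linalg.solve(slope, residual)


def random_problem(n_actions=3, n_states=5, seed=0):
    """Hypothetical test problem: random transition matrices and rewards."""
    rng = np.random.default_rng(seed)
    P = rng.random((n_actions, n_states, n_states))
    P /= P.sum(axis=2, keepdims=True)       # normalize rows to probabilities
    r = rng.random((n_actions, n_states))
    return P, r


if __name__ == "__main__":
    P, r = random_problem()
    beta = 0.9
    v = np.zeros(P.shape[1])
    for n in range(50):
        v_pi = evaluate_policy(P, r, beta, greedy_policy(P, r, beta, v))
        v_newton = newton_step(P, r, beta, v)
        print(n, np.max(np.abs(v_pi - v_newton)))  # gap between the two updates
        if np.allclose(v_pi, v):
            break
        v = v_pi
```

Expanding the Newton update gives v - (beta P_d - I)^{-1} (r_d + beta P_d v - v) = (I - beta P_d)^{-1} r_d, which is exactly the evaluation step, so the printed gap should be at the level of rounding error. The diffusion case in the paper replaces the matrices by differential operators and requires the functional-analytic setting of the Newton-Kantorovich theorem, which this sketch does not attempt to reproduce.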

References

Fleming, W. H., Some Markovian Optimization Problems, Journal of Mathematics and Mechanics, Vol. 12, pp. 131–140, 1963.
Puterman, M. L., Optimal Control of Diffusion Processes with Reflection, Journal of Optimization Theory and Applications, Vol. 22, pp. 103–116, 1977.
Bismut, J., An Approximation Method in Optimal Stochastic Control, SIAM Journal on Control and Optimization, Vol. 16, pp. 122–130, 1978.
Puterman, M. L., and Brumelle, S. L., On the Convergence of Policy Iteration in Stationary Dynamic Programming, Mathematics of Operations Research, Vol. 4, pp. 60–69, 1979.
Stroock, D. W., and Varadhan, S. R. S., Diffusion Processes with Boundary Conditions, Communications on Pure and Applied Mathematics, Vol. 26, pp. 147–226, 1971.
Kantorovich, L. V., and Akilov, G. P., Functional Analysis in Normed Spaces, The Macmillan Company, New York, New York, 1964.
Stroock, D. W., and Varadhan, S. R. S., Diffusion Processes with Continuous Coefficients, II, Communications on Pure and Applied Mathematics, Vol. 22, pp. 479–530, 1969.
Fleming, W. H., and Rishel, R., Deterministic and Stochastic Optimal Control, Springer-Verlag, New York, New York, 1975.
Mandl, P., Analytic Treatment of One-Dimensional Markov Processes, Springer-Verlag, New York, New York, 1968.

Author information

Authors and Affiliations

Faculty of Commerce and Business Administration, University of British Columbia, Vancouver, British Columbia, Canada
M. L. Puterman (Associate Professor)

Additional information

Communicated by R. Rishel

This research was partially supported by NRC Grant No. A-3609.

About this article

Puterman, M.L. On the convergence of policy iteration for controlled diffusions. J Optim Theory Appl 33, 137–144 (1981). https://doi.org/10.1007/BF00935182

Issue Date: January 1981
DOI: https://doi.org/10.1007/BF00935182
