Abstract
A predictor-corrector, Crank-Nicolson computer algorithm is examined for the Bellman equation of stochastic optimal control with quadratic costs and constrained control. A linearized comparison equation is heuristically derived for the nonlinear and discontinuous Bellman equation. Convergence of the method is studied using von Neumann's Fourier stability method. A mesh-ratio-type condition for the convergence is derived for the comparison equation. This condition is uniform for both parabolic and hyperbolic versions of the nonlinear equation. The results are valid for Gaussian stochastic noise and Poisson noise.
Similar content being viewed by others
References
W.H. Fleming, chairman,Future Directions in Control Theory: A Mathematical Prospective. Society for Industrial and Applied Mathematics: Philadelphia, PA, 1988.
D. Ludwig, “Optimal harvesting of a randomly fluctuating resource I: Application of perturbation methods,”SIAM J. Appl. Math., vol. 37, pp. 166–184, August 1979.
D. Ludwig and J.M. Varah, “Optimal harvesting of a randomly fluctuating resource II: Numerical methods and results,”SIAM J. Appl. Math., vol. 37, pp. 185–205, August 1979.
D. Ryan and F.B. Hanson, “Optimal harvesting of a logistic population in an environment with stochastic jumps,”J. Math. Biol., vol. 24, pp. 259–277, 1986.
F.B. Hanson, “Bioeconomic model of the Lake Michigan alewife fishery,”Can. J. Fish. Aquat. Sci., vol. 44 (suppl. II), pp. 298–305, 1987.
F.B. Hanson and D. Ryan, “Optimal harvesting in a stochastic environment with density dependent jumps,” inMathematical Models of Renewable Resources, vol. 3, edited by R. Lamberson. Association of Resource Modelers: Arcata, CA, pp. 117–123, 1984.
F.B. Hanson and D. Ryan, “Optimal harvesting with density dependent random effects,”Natural Res. Modeling, vol. 2, pp. 439–455, Winter 1987.
M. Athans, D. Castanon, K.P. Dunn, C.S. Greene, W.H. Lee, N.R. Sandell, Jr., and A.S. Willsky, “The stochastic control of the F-8C aircraft using a multiple model adaptive control (MMAC) method—Part I: Equilibrium flight,”IEEE Trans. Autom. Control, vol. AC-22, pp. 768–780, October 1977.
A.E. Bryson and Y.C. Ho,Applied Optimal Control. Hemisphere: New York, 1975.
R.E. Larson,State Increment Dynamic Programming. American Elsevier: New York, 1968.
D.H. Jacobson and D.Q. Mayne,Differential Dynamic Programming. American Elsevier: New York, 1970.
P. Dyer and S.R. McReynolds,The Computation and Theory of Optimal Control. Academic Press: New York, 1970.
I.H. Mufti,Computational Methods in Optimal Control Problems. Lecture Notes in Operations Research and Mathematical Systems, vol. 27. Springer-Verlag: Berlin, 1970.
E. Polak,Computational Methods in Optimization. Academic Press: New York, 1971.
E. Polak, “An historical survey of computational methods in optimal control,”SIAM Rev., vol. 15, pp. 553–584, 1973.
H.J. Kushner, “A survey of some applications of probability and stochastic control theory to finite difference methods for degenerate elliptic and parabolic equations,”SIAM Rev., vol. 18, pp. 545–577, 1976.
H.J. Kushner,Probability Methods for Approximations in Stochastic Control and for Elliptic Equations. Academic Press: New York, 1978.
H.J. Kusher and P.G. Dupuis,Numerical Methods for Stochastic Control in Continuous Time. Springer-Verlag: New York, 1992.
M.G. Crandall and P.-L. Lions, “Viscosity solutions of Hamilton-Jacobi equations,”Trans. Am. Math. Soc., vol. 277, pp. 1–42, 1983.
P.E. Souganidis, “Approximation schemes for viscosity solutions of Hamilton-Jacobi equations,”J. Diff. Eqns., vol. 59, pp. 1–43, 1985.
R. Jensen, “The maximum principle for viscosity solutions of fully nonlinear second order partial differential equations,”Arch. Rat. Mech. Anal., vol. 101, pp. 1–27, 1988.
M.G. Crandall, H. Ishii, and P.-L. Lions, “User's guide to viscosity solutions of second order partial differential equations,”Bull. Am. Math. Soc., vol. 27(1), pp. 1–67, July 1992.
P.-L. Lions and B. Perthame, “Remarks on Hamilton-Jacobi-Equations with measurable time-dependent Hamiltonians,”Nonlin. Anal. Th. Meth. Appl., vol. 11, pp. 613–622, 1987.
I.I. Gihman and A.V. Skorohod,Stochastic Differential Equations. Springer-Verlag: New York, 1972.
I.I. Gihman and A.V. Skorohod,Controlled Stochastic Processes. Springer-Verlag: New York, 1979.
R.D. Richtmyer and K.W. Morton,Difference Methods for Initial-Value Problems. Wiley: New York, 1967.
A.R. Mitchell and D.F. Griffiths,The Finite Difference Method in Partial Differential Equations. Wiley: New York, 1980.
L. Arnold,Stochastic Differential Equations: Theory and Applications. Wiley: New York, 1974.
W.H. Fleming and R.W. Rishel,Deterministic and Stochastic Optimal Control. Springer-Verlag: New York, 1975.
Z. Schuss,Theory and Applications of Stochastic Differential Equations. Wiley: New York, 1980.
R.E. Bellman,Adaptive Control Processes: A Guided Tour. Princeton University Press: Princeton, NJ, 1961.
C.C. Holt, F. Modigliani, J. Muth, and H. Simon,Planning Production Inventories, and Work Force. Prentice-Hall: Englewood Cliffs, NJ, 1960.
M. Athans, “The role and use of the stochastic linear-quadratic-Gaussian problem in control system design,”IEEE Trans. Autom. Control, vol. AC-16, pp. 529–552, 1971.
D.J. Bell and D.H. Jacobson,Singular Optimal Control. Academic Press: New York, 1975.
A. Jameson and R.E. O'Malley, Jr., “Cheap control of time-invariant regulator,”Appl. Math. Optim., vol. 1(4), pp. 337–354, 1975.
R.E. O'Malley, Jr., and A. Jameson, “Singular perturbation and singular arcs—Part I,”IEEE Trans. Autom. Control, vol. AC-20, pp. 218–226, 1975.
P. Kokotovic, H.K. Khalil, and J. O'Reilly,Singular Perturbation Methods in Control: Analysis and Design. Academic Press: New York, 1986.
J.E. Flaherty and R.E. O'Malley, Jr., “On computation of singular controls,”IEEE Trans. Autom. Control, vol. AC-22, pp. 640–648, 1977.
J. Douglas, Jr., and T. DuPont, “Galerkin methods for parabolic equations,”SIAM J. Num. Anal., vol. 7, pp. 575–626, 1970.
J. Douglas, Jr., “Effective time-stepping methods for the numerical solution of nonlinear parabolic problems,” inThe Mathematics of Finite Elements and Applications V: MAFELAP 1978, edited by J.R. Whiteman. Academic Press: New York, pp. 289–304, 1979.
F.B. Hanson, “Computational dynamic programming on a vector multiprocessor,”IEEE Trans. Autom. Control, vol. 36(4), pp. 507–511, 1991.
S.-L. Chung, F.B. Hanson, and H.H. Xu, “Parallel stochastic dynamic programming: Finite element methods,”Lin. Alg. Appl., vol. 172, pp. 197–218, July 1992.
S.-L. Chung, F.B. Hanson, and H. Xu, “Supercomputer optimizations for stochastic optimal control applications,”Proc. 4th NASA Workshop on Computational Control of Flexible Aerospace Systems, NASA Conf. Publ. 10065, Part 1, edited by L.W. Taylor, NASA Langley Research Center, pp. 57–70, March 1991.
H.H. Xu, F.B. Hanson, and S.-L. Chung, “Optimal data parallel methods for stochastic dynamic programming,”Proc. 1991 Int. Conf. Parallel Processing, vol. III Algorithms and Applications, CRC Press: Boca Raton, FL, pp. 142–146, August 1991.
W. Feller,An Introduction to Probability Theory and Its Applications, vol. 2, Wiley, New York, 1971.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Naimipour, K., Hanson, F.B. Numerical convergence for the Bellman equation of stochastic optimal control with quadratic costs and constraints. Dynamics and Control 3, 237–259 (1993). https://doi.org/10.1007/BF01972698
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF01972698