Abstract
The centerpiece of the theory of dynamic programming is the Hamilton-Jacobi-Bellman (HJB) equation, which can be solved for the optimal cost functional Vo of a nonlinear optimal control problem, while a second partial differential equation yields the corresponding optimal control law ko. Although the direct solution of the HJB equation is computationally untenable, the HJB equation and the relationship between Vo and ko serve as the basis for the adaptive dynamic programming algorithm. Here, one starts with an initial cost functional and stabilizing control law pair (V0, k0) and constructs a sequence of cost functional/control law pairs (Vi, ki) in real time, which are stepwise stable and converge to the optimal pair (Vo, ko), for a prescribed nonlinear optimal control problem with unknown input-affine state dynamics.
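In the linear-quadratic special case, the iteration sketched above reduces to Kleinman's classical algorithm: evaluate the cost of the current stabilizing gain by solving a Lyapunov equation, then improve the gain from the resulting cost matrix. The following is a minimal illustrative sketch of that special case with known dynamics, not the chapter's nonlinear, model-free algorithm; all function and variable names are the author's choices for illustration.

```python
import numpy as np
from scipy.linalg import solve_continuous_lyapunov, solve_continuous_are

def lq_policy_iteration(A, B, Q, R, K0, iters=25):
    """Kleinman-style policy iteration for the LQR problem.

    Starting from a stabilizing gain K0, alternate between:
      1. policy evaluation:  solve the Lyapunov equation for P_i,
         so V_i(x) = x^T P_i x is the cost of the current policy k_i;
      2. policy improvement: K_{i+1} = R^{-1} B^T P_i.
    The pairs (P_i, K_i) converge to the optimal Riccati solution
    and optimal gain, mirroring the (Vi, ki) sequence in the abstract.
    """
    K = K0
    for _ in range(iters):
        A_cl = A - B @ K                  # closed-loop dynamics under k_i
        M = Q + K.T @ R @ K               # running cost under k_i
        # Solve A_cl^T P + P A_cl = -M for the policy's cost matrix.
        P = solve_continuous_lyapunov(A_cl.T, -M)
        K = np.linalg.solve(R, B.T @ P)   # improved gain k_{i+1}
    return P, K

# Example: a stable 2-state system, so K0 = 0 is already stabilizing.
A = np.array([[0.0, 1.0], [-1.0, -2.0]])
B = np.array([[0.0], [1.0]])
Q, R = np.eye(2), np.eye(1)
P, K = lq_policy_iteration(A, B, Q, R, np.zeros((1, 2)))

# For comparison: the optimal cost matrix from the algebraic Riccati equation.
P_opt = solve_continuous_are(A, B, Q, R)
```

In this linear setting the iterates converge rapidly to the Riccati solution; the chapter's contribution is establishing stepwise stability and convergence of the analogous iteration for nonlinear, unknown input-affine dynamics.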
Copyright information
© 2003 Springer Science+Business Media New York
Cite this chapter
Murray, J.J., Cox, C.J., Saeks, R.E. (2003). The Adaptive Dynamic Programming Theorem. In: Liu, D., Antsaklis, P.J. (eds) Stability and Control of Dynamical Systems with Applications. Control Engineering. Birkhäuser, Boston, MA. https://doi.org/10.1007/978-1-4612-0037-6_19
Print ISBN: 978-1-4612-6583-2
Online ISBN: 978-1-4612-0037-6