Abstract
This paper considers the problem of minimizing a functionalI which depends on the statex(t), the controlu(t), and the parameter π. Here,I is a scalar,x ann-vector,u anm-vector, and π ap-vector. At the initial point, the state is prescribed. At the final point, the statex and the parameter π are required to satisfyq scalar relations. Along the interval of integration, the state, the control, and the parameter are required to satisfyn scalar differential equations. Asequential algorithm composed of the alternate succession of gradient phases and restoration phases is presented. This sequential algorithm is contructed in such a way that the differential equations and boundary conditions are satisfied at the end of each iteration, that is, at the end of a complete gradient-restoration phase; hence, the value of the functional at the end of one iteration is comparable with the value of the functional at the end of any other iteration.
In thegradient phase, nominal functionsx(t),u(t), π satisfying all the differential equations and boundary conditions are assumed. Variations Δx(t), Δu(t), Δπ leading to varied functions\(\tilde x\)(t),ũ(t),\(\tilde \pi\) are determined so that the value of the functional is decreased. These variations are obtained by minimizing the first-order change of the functional subject to the linearized differential equations, the linearized boundary conditions, and a quadratic constraint on the variations of the control and the parameter.
Since the constraints are satisfied only to first order during the gradient phase, the functions\(\tilde x\)(t),ũ(t),\(\tilde \pi\) may violate the differential equations and/or the boundary conditions. This being the case, a restoration phase is needed prior to starting the next gradient phase. In thisrestoration phase, the functions\(\tilde x\)(t),ũ(t),\(\tilde \pi\) are assumed to be the nominal functions. Variations Δ\(\tilde x\)(t), Δũ(t), Δ\(\tilde \pi\) leading to varied functions\(\hat x\)(t),û(t),\(\hat \pi\) consistent with all the differential equations and boundary conditions are determined. These variations are obtained by requiring the least-square change of the control and the parameter subject to the linearized differential equations and the linearized boundary conditions. Of course, the restoration phase must be performed iteratively until the cumulative error in the differential equations and boundary conditions becomes smaller than some preselected value.
If the gradient stepsize is α, an order-of-magnitude analysis shows that the gradient corrections are Δx=O(α), Δu=O(α), Δπ=O(α), while the restoration corrections are\(\Delta \tilde x = O(\alpha ^2 ), \Delta \tilde u = O(\alpha ^2 ), \Delta \hat \pi = O(\alpha ^2 )\). Hence, for α sufficiently small, the restoration phase preserves the descent property of the gradient phase: the functionalI decreases between any two successive iterations.
Methods to determine the gradient stepsize in an optimal fashion are discussed. Examples are presented for both the fixed-final-time case and the free-final-time case. The numerical results show the rapid convergence characteristics of the sequential gradient-restoration algorithm.
Similar content being viewed by others
References
Miele, A.,Gradient Methods in Control Theory, Part 1, Ordinary Gradient Method, Rice University, Aero-Astronautics Report No. 60, 1969.
Miele, A., andPritchard, R. E.,Gradient Methods in Control Theory, Part 2, Sequential Gradient-Restoration Algorithm, Rice University, Aero-Astronautics Report No. 62, 1969.
Damoulakis, J. N.,Gradient Methods in Control Theory, Part 3, Sequential Gradient-Restoration Algorithm: Numerical Examples, Rice University, Aero-Astronautics Report No. 65, 1969.
Damoulakis, J. N.,Gradient Methods in Control Theory, Part 4, Sequential Gradient-Restoration Algorithm: Further Numerical Examples, Rice University, Aero-Astronautics Report No. 67, 1970.
Damoulakis, J. N.,Gradient Methods in Control Theory, Part 5, Sequential Gradient-Restoration Algorithm: Additional Numerical Examples, Rice University, Aero-Astronautics Report No. 73, 1970.
Bryson, A. E., andDenham, W. F.,A Steepest-Ascent Method for Solving Optimum Programming Problems, Journal of Applied Mechanics, Vol. 84, No. 2, 1962.
Kelley, H. J.,Gradient Theory of Optimal Flight Paths, ARS Journal, Vol. 30, No. 10, 1960.
Miele, A., Heideman, J. C., andDamoulakis, J. N.,The Restoration of Constraints in Holonomic and Nonholonomic Problems, Journal of Optimization Theory and Applications, Vol. 3, No. 5, 1969.
Long, R. S.,Newton-Raphson Operator; Problems with Undetermined End Points, AIAA Journal, Vol. 3, No. 7, 1965.
Miele,Method of Particular Solutions for Linear, Two-Point Boundary-Value Problems, Journal of Optimization Theory and Applications, Vol. 2, No. 4, 1968.
Heideman, J. C.,Use of the Method of Particular Solutions in Nonlinear, Two-Point Boundary-Value Problems, Journal of Optimization Theory and Applications, Vol. 2, No. 6, 1968.
Miele, A., andIyer, R.,General Technique for Solving Nonlinear, Two-Point Boundary-Value Problems via the Method of Particular Solutions, Rice University, Aero-Astronautics Report No. 63, 1969.
Miele, A., Editor,Theory of Optimum Aerodynamic Shapes, Academic Press, New York, 1965.
Hestenes, M. R.,Multiplier and Gradient Methods, Journal of Optimization Theory and Applications, Vol. 4, No. 5, 1969.
Ralston, A.,Numerical Integration Methods for the Solution of Ordinary Differential Equations, Mathematical Methods for Digital Computers, Vol. 1, Edited by A. Ralston and H. S. Wilf, John Wiley and Sons, New York, 1960.
Lasdon, L. S., Mitter, S. K., andWaren, A. D.,The Conjugate Gradient Method for Optimal Control Problems, IEEE Transactions on Automatic Control, Vol. AC-12, No. 2, 1967.
Hestenes, M. R.,Calculus of Variations and Optimal Control Theory, McGraw-Hill Book Company, New York, 1967.
Author information
Authors and Affiliations
Additional information
The portions of this paper dealing with the fixed-final-time case were presented by the senior author at the 2nd Hawaii International Conference on System Sciences, Honolulu, Hawaii, 1969. The portions of this paper dealing with the free-final-time case were presented by the senior author at the 20th International Astronautical Congress, Mar del Plata, Argentina, 1969. This research, supported by the NASA-Manned Spacecraft Center, Grant No. NGR-44-006-089, Supplement No. 1, is a condensation of the investigations presented in Refs. 1–5. The authors are indebted to Professor H. Y. Huang for helpful discussions.
Rights and permissions
About this article
Cite this article
Miele, A., Pritchard, R.E. & Damoulakis, J.N. Sequential gradient-restoration algorithm for optimal control problems. J Optim Theory Appl 5, 235–282 (1970). https://doi.org/10.1007/BF00927913
Received:
Issue Date:
DOI: https://doi.org/10.1007/BF00927913