Ein funktionalanalytischer Beweis des Maximumprinzips von Pontrjagin und dessen Verwendung zur Herleitung der Politikiteration von Howard

Spremann, K.

doi:10.1007/BF02241608

A proof of Pontrjagin's maximal principle by methods of functional analysis and its application to deduce Howard's policy iteration

Published: December 1972

Computing, Volume 9, pages 343–353 (1972)

Abstract (translated from the German Zusammenfassung)

The lengthy proof of Pontrjagin's maximum principle, which rests on geometric considerations, can be replaced entirely by functional-analytic derivations: instead of the total derivative of the process and the objective functional used in the direct method, only the partial derivatives in the direction of the state variables are needed here, while the difference in the direction of the controls is not linearized. The costate variables are auxiliary quantities that serve to transform an inner product. (They arise as the solution of a linear equation whose operator is the adjoint of the partial derivative of the process operator and whose right-hand side is the partially linearized objective functional.) This yields the well-known inequality of the Hamiltonians, whose domain of validity is globalized in a proof by contradiction.

This functional-analytic proof is shorter, more constructive, and more general: for instance, Howard's policy iteration results from applying the maximum principle to stationary Markov processes with rewards.

Summary

The tedious proof of Pontrjagin's maximum principle, based on geometric considerations, can be fully replaced by methods of functional analysis: instead of the total derivative of the process and the objective functional used in the direct method, only the partial derivatives in the direction of the state variables are used, while the difference in the direction of the control is not linearized. The costate variables furnish a means to transform an inner product. (They are the solution of a linear equation whose operator is the adjoint of the partial derivative of the process operator and whose right-hand side is formed by the partially linearized objective functional.) As a result we obtain the well-known inequality of the Hamiltonians, whose domain of validity is globalized in a proof by contradiction.
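The role of the costates can be illustrated on a small discrete-time example. The sketch below is not taken from the paper: the dynamics, the cost terms, and all names (`A`, `b`, `r`, `c`, `c_N`, `costates`, `hamiltonian`) are assumptions, chosen so that the process is linear in the state but nonlinear in the control. In that special case the costates solve a backward linear recursion with the transposed (adjoint) state matrix, and the change of the objective between two controls equals the sum of Hamiltonian differences exactly, with no linearization in the control.

```python
import numpy as np

# Hypothetical toy problem (an assumption, not the paper's setting):
#   x[k+1] = A x[k] + b(u[k]),   J(u) = sum_k (c.x[k] + r(u[k])) + c_N.x[N]
A = np.array([[0.9, 0.2], [-0.1, 0.8]])
c = np.array([1.0, -0.5])
c_N = np.array([0.3, 2.0])
x0 = np.array([1.0, 0.0])
N = 6

def b(u):
    # Control enters the dynamics nonlinearly.
    return np.array([np.sin(u), u ** 2])

def r(u):
    # Control enters the running cost nonlinearly.
    return -np.cos(u)

def trajectory(u):
    x = [x0]
    for k in range(N):
        x.append(A @ x[k] + b(u[k]))
    return x

def objective(u):
    x = trajectory(u)
    return sum(c @ x[k] + r(u[k]) for k in range(N)) + c_N @ x[N]

def costates():
    # Adjoint linear recursion, solved backward: p[N] = c_N, p[k] = c + A^T p[k+1].
    p = [None] * (N + 1)
    p[N] = c_N
    for k in range(N - 1, -1, -1):
        p[k] = c + A.T @ p[k + 1]
    return p

def hamiltonian(x, u, p_next):
    return c @ x + r(u) + p_next @ (A @ x + b(u))

def hamiltonian_increment(u, v):
    # Both Hamiltonians are evaluated on the trajectory of the reference control u.
    x = trajectory(u)
    p = costates()
    return sum(
        hamiltonian(x[k], v[k], p[k + 1]) - hamiltonian(x[k], u[k], p[k + 1])
        for k in range(N)
    )

rng = np.random.default_rng(0)
u = rng.normal(size=N)  # reference control
v = rng.normal(size=N)  # arbitrary comparison control, not a small perturbation
lhs = objective(v) - objective(u)
rhs = hamiltonian_increment(u, v)
print(abs(lhs - rhs))  # agreement up to floating-point rounding
```

For dynamics that are nonlinear in the state, the identity generally holds only up to a higher-order remainder in the state perturbation.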

This proof by methods of functional analysis is more concise, more constructive, and more general: applying the maximum principle to ergodic Markov processes with rewards yields Howard's method of policy iteration.
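For readers unfamiliar with policy iteration, the following is a minimal sketch of the average-reward variant for an ergodic Markov chain with rewards. It shows the standard two-step scheme (value determination, then policy improvement), not the derivation given in the paper. The transition probabilities `P`, the expected one-step rewards `q`, and the function names are illustrative assumptions.

```python
import numpy as np

# Assumed example data: P[i, k] is the transition row of state i under
# action k; q[i, k] is the expected one-step reward.
P = np.array([[[0.5, 0.5], [0.8, 0.2]],
              [[0.4, 0.6], [0.7, 0.3]]])
q = np.array([[6.0, 4.0],
              [-3.0, -5.0]])

def evaluate(policy):
    """Value determination: solve g + v_i = q_i + sum_j P_ij v_j with v_last = 0."""
    n = len(policy)
    P_pi = np.array([P[i, policy[i]] for i in range(n)])
    q_pi = np.array([q[i, policy[i]] for i in range(n)])
    M = np.eye(n) - P_pi
    M[:, n - 1] = 1.0  # the column of the pinned v_last now carries the gain g
    sol = np.linalg.solve(M, q_pi)
    g = sol[n - 1]
    v = sol.copy()
    v[n - 1] = 0.0
    return g, v

def improve(v, policy):
    """Policy improvement: per state, pick the action maximizing q + P v."""
    scores = q + P @ v  # shape (states, actions)
    new_policy = list(policy)
    for i in range(len(policy)):
        best = int(np.argmax(scores[i]))
        # Switch only on strict improvement to avoid cycling between ties.
        if scores[i, best] > scores[i, policy[i]] + 1e-12:
            new_policy[i] = best
    return new_policy

def policy_iteration(policy=None, max_iter=100):
    policy = [0] * P.shape[0] if policy is None else list(policy)
    for _ in range(max_iter):
        g, v = evaluate(policy)
        new_policy = improve(v, policy)
        if new_policy == policy:
            return policy, g, v
        policy = new_policy
    raise RuntimeError("policy iteration did not converge")

policy, gain, values = policy_iteration()
print(policy, gain, values)
```

With these numbers the iteration starts from action 0 in both states (gain 1) and stops at action 1 in both states (gain 2, relative value 10 for the first state).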

Author information

Authors and Affiliations

Institut für Angewandte Mathematik, Technische Universität München, Arcisstraße 21, D-8000 München 2, Bundesrepublik Deutschland
K. Spremann

About this article

Cite this article

Spremann, K. Ein funktionalanalytischer Beweis des Maximumprinzips von Pontrjagin und dessen Verwendung zur Herleitung der Politikiteration von Howard. Computing 9, 343–353 (1972). https://doi.org/10.1007/BF02241608

Received: 08 July 1972
Issue Date: December 1972
DOI: https://doi.org/10.1007/BF02241608
