Journal of Optimization Theory and Applications

, Volume 35, Issue 3, pp 443–464 | Cite as

Stackelberg strategies in linear-quadratic stochastic differential games

  • A. Bagchi
  • T. Başar
Contributed Papers

Abstract

This paper obtains the Stackelberg solution to a class of two-player stochastic differential games described by linear state dynamics and quadratic objective functionals. The information structure of the problem is such that the players make independent noisy measurements of the initial state and are permitted to utilize only this information in constructing their controls. Furthermore, by the very nature of the Stackelberg solution concept, one of the players is assumed to know, in advance, the strategy of the other player (the leader). For this class of problems, we first establish existence and uniqueness of the Stackelberg solution and then relate the derivation of the leader's Stackelberg solution to the optimal solution of a nonstandard stochastic control problem. This stochastic control problem is solved in a more general context, and its solution is utilized in constructing the Stackelberg strategy of the leader. For the special case Gaussian statistics, it is shown that this optimal strategy is affine in observation of the leader. The paper also discusses numerical aspects of the Stackelberg solution under general statistics and develops algorithms which converge to the unique Stackelberg solution.

Key Words

Stochastic differential games Stackelberg solution linear-quadratic games nonzero-sum games 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Von Stackelberg, H.,The Theory of Market Economy, Oxford University Press, Oxford, England, 1952.Google Scholar
  2. 2.
    Chen, C. I., andCruz, J. B., Jr.,Stackelberg Solution for Two-Person Games with Biased Information Patterns, IEEE Transactions on Automatic Control, Vol. AC-17, No. 5, 1972.Google Scholar
  3. 3.
    Simaan, M., andCruz, J. B., Jr.,On the Stackelberg Strategy in Nonzero-sum Games, Journal of Optimization Theory and Applications, Vol. 11, No. 5, 1973.Google Scholar
  4. 4.
    Simaan, M. andCruz, J. B., Jr.,Additional Aspect of the Stackelberg Strategy in Nonzero-Sum Games, Journal of Optimization Theory and Applications, Vol. 11, No. 6, 1973.Google Scholar
  5. 5.
    Başar, T.,Information Structures and Equilibria in Dynamic Games, New Trends in Dynamic Systems Theory and Economics, Edited by M. Aoki and A. Marzollo, Academic Press, New York, New York, 1979.Google Scholar
  6. 6.
    Cruz, J. B., Jr.,Leader-Follower Strategies For Multilevel Systems, IEEE Transactions on Automatic Control, Vol. AC-23, No. 2, 1978.Google Scholar
  7. 7.
    Başar, T., andSelbuz, H.,Closed-Loop Stackelberg Strategies with Applictions in the Optimal Control of Multilevel Systems, IEEE Transactions on Automatic Control, Vol. AC-24, No. 2, 1979.Google Scholar
  8. 8.
    Castanon, D., andAthans, M.,On the Stochastic Dynamic Stackelberg Strategies, Automatica, Vol. 12, No. 2, 1976.Google Scholar
  9. 9.
    Başar, T.,Hierarchical Decision Making under Uncertainty, Dynamic Optimization and Mathematical Economics, Edited by P. T. Liu, Plenum Press, New York, New York, 1979.Google Scholar
  10. 10.
    Başar, T.,Stochastic Stagewise Stackelberg Strategies For Linear Quadratic Systems, Stochastic Control Theory and Stochastic Differential Systems, Edited by M. Kohlmann and W. Vogel, Springer-Verlag, Berlin, Germany, 1979.Google Scholar
  11. 11.
    Fleming, W. H., andNisio, M.,On the Existence of Optimal Stochastic Controls, Journal of Mathematics and Mechanics, Vol. 15, pp. 777–794, 1966.Google Scholar
  12. 12.
    Baghi, A., andBaşar, T.,Team Decision Theory for Linear Continuous-Time Systems, Twente University of Technology, Enschede, Holland, Department of Applied Mathematics, Memorandum No. 274, 1979.Google Scholar
  13. 13.
    Allwright, H. C.,Contraction Mapping Algorithm with Guaranteed Convergence, Proceedings of the 5th IFAC Congress, Paris, France, 1972.Google Scholar

Copyright information

© Plenum Publishing Corporation 1981

Authors and Affiliations

  • A. Bagchi
    • 1
  • T. Başar
    • 2
  1. 1.Department of Applied MathematicsTwente University of TechnologyEnschedeHolland
  2. 2.Applied Mathematics DivisionMarmara Scientific and Industrial Research InstituteGebze, KocaeliTurkey

Personalised recommendations