Advertisement

Min-max and min-min stackelberg strategies with closed-loop information structure

  • M. JungersEmail author
  • E. Trelat
  • H. Abou-kandil
Article

Abstract

This paper deals with the min-max and min-min Stackelberg strategies in the case of a closed-loop information structure. Two-player differential one-single stage games are considered with one leader and one follower. We first derive necessary conditions for the existence of the follower to characterize the best response set of the follower and to recast it, under weak assumptions, to an equivalent and more convenient form for expressing the constraints of the leader’s optimization problem. Under a standard strict Legendre condition, we then derive optimality necessary conditions for the leader of both min-max and min-min Stackelberg strategies in the general case of nonlinear criteria for finite time horizon games. This leads to an expression of the optimal controls along the associated trajectory. Then, using focal point theory, the necessary conditions are also shown to be sufficient and lead to cheap control. The set of initial states allowing the existence of an optimal trajectory is emphasized. The linear-quadratic case is detailed to illustrate these results.

Key words and phrases

Stackelberg strategy game theory multi-criteria optimization closed-loop information structure bilevel optimization problem 

2000 Mathematics Subject Classification

91A65 49N70 49N90 

References

  1. 1.
    H. Abou-Kandil and P. Bertrand, Analytical solution for an open-loop Stackelberg game. IEEE Trans. Automat. Control AC-30 (1985), 1222–1224.MathSciNetCrossRefGoogle Scholar
  2. 2.
    H. Abou-Kandil, G. Freiling, V. Ionescu, and G. Jank, Matrix Riccati equations in control and systems theory. Birkhäuser (2003).Google Scholar
  3. 3.
    A. Aboussoror and P. Loridan, Strong–weak Stackelberg problems in finite-dimensional spaces. Serdica Math. J. 21 (1995), 151–170.MathSciNetzbMATHGoogle Scholar
  4. 4.
    _____, Existence of solutions of two-level optimization problems with nonunique lower-level solutions. J. Math. Anal. Appl. 254 (2001), 348–357.MathSciNetzbMATHCrossRefGoogle Scholar
  5. 5.
    A. Agrachev and J.-P. Gauthier, On subanalyticity of Carnot–Carathéodory distances. Ann. Inst. H. Poincaré Anal. Non Linéaire 18 (2001), 359–382.MathSciNetzbMATHCrossRefGoogle Scholar
  6. 6.
    A. Agrachev and Yu. Sachkov, Control theory from the geometric viewpoint. Encycl. Math. Sci. 87, Springer-Verlag, Berlin–New York (2004).zbMATHGoogle Scholar
  7. 7.
    A. Agrachev and A. Sarychev, Abnormal sub-Riemannian geodesics: Morse index and rigidity. Ann. Inst. H. Poincaré 13 (1996), 635–690.MathSciNetzbMATHGoogle Scholar
  8. 8.
    R. Axelrod, The evolution of cooperation. New York Basic Books (1984).Google Scholar
  9. 9.
    A. Bagchi, Stackelberg differential games in economic models. Lect. Notes Control Inform. Sci. Springer-Verlag (1984).Google Scholar
  10. 10.
    T. Ba,sar and A. Haurie, Feedback equilibria in differential games with structural and modal uncertainties. JAE Press Inc. Connecticut (1984).Google Scholar
  11. 11.
    T. Başar and G. J. Olsder, Team-optimal closed-loop Stackelberg strategies in hierarchical control problems. Automatica 16 (1980), 409–414.zbMATHCrossRefGoogle Scholar
  12. 12.
    _____, Dynamic noncooperative game theory. SIAM (1995).Google Scholar
  13. 13.
    T. Başar and H. Selbuz, Closed-loop Stackelberg strategies with applications in the optimal control of multilevel systems. IEEE Trans. Automat. Control AC-24 (1979), 166–179.Google Scholar
  14. 14.
    T. Ba,sar and R. Srikant, A Stackelberg network game with a large number of followers. J. Optim. Theory Appl. 115 (2002), 479–490.MathSciNetCrossRefGoogle Scholar
  15. 15.
    B. Bonnard and M. Chyba, The role of singular trajectories in control theory. Math. Appl. 40, Springer-Verlag (2003).Google Scholar
  16. 16.
    B. Bonnard, L. Faubourg, and E. Trélat, Mécanique céleste et contrôle de systèèmes spatiaux. Math. Appl. 51, Springer-Verlag (2006).Google Scholar
  17. 17.
    M. Breton, A. Alj, and A. Haurie, Sequential Stackelberg equilibria in two-person games. J. Optim. Theory Appl. 59 (1988), 71–97.MathSciNetzbMATHCrossRefGoogle Scholar
  18. 18.
    A. Buratto and G. Zaccour, Coordination of advertising strategies in a fashion licensing contract. J. Optim. Theory Appl. 142 (2009), 31–53.MathSciNetzbMATHCrossRefGoogle Scholar
  19. 19.
    L. Cesari, Optimization. Theory and applications. Problems with ordinary differential equations. Springer-Verlag, New York (1983).zbMATHGoogle Scholar
  20. 20.
    B. Chen and P.-A. Zadrozny, An anticipative feedback solution for the infinite-horizon, linear-quadratic, dynamic, Stackelberg game. J. Econ. Dynam. Control, 26 (2002), 1397–1416.MathSciNetzbMATHCrossRefGoogle Scholar
  21. 21.
    C. I. Chen and J. B. Cruz, Stackelberg solution for two-person games with biased information patterns. IEEE Trans. Automat. Control AC-17 (1972), 791–797.CrossRefGoogle Scholar
  22. 22.
    Y. Chitour, F. Jean, and E. Trélat, Propiétés génériques des trajectoires singulières. C. R. Math. Acad. Sci. Paris 337 (2003), 49–52.MathSciNetzbMATHGoogle Scholar
  23. 23.
    ______, Genericity results for singular curves. J. Differ. Geom. 73 (2006), 45–73.zbMATHGoogle Scholar
  24. 24.
    ______, Singular trajectories of control-affine systems. SIAM J. Control Optim. 47 (2008), 1078–1095.MathSciNetzbMATHCrossRefGoogle Scholar
  25. 25.
    J. B. Cruz, Survey of Nash and Stackelberg equilibrium strategies in dynamic games. Ann. Econ. Social Measurement 4 (1975), 339–344.Google Scholar
  26. 26.
    S. Dempe, Essays and surveys in global optimization. Springer-Verlag (2005), pp. 165–193.Google Scholar
  27. 27.
    S. Dempe and N. Gadhi, Second-order optimality conditions for bilevel set optimization problems. J. Global Optim. 47 (2010), 233–245.MathSciNetzbMATHCrossRefGoogle Scholar
  28. 28.
    E. Dockner, S. Jorgensen, N. V. Long, and G. Sorger, Differential games in economics and management science. Cambridge Univ. Press (2000).Google Scholar
  29. 29.
    G. Freiling, G. Jank, and S. R. Lee, Existence and uniqueness of openloop stackelberg equilibria in linear-quadratic differential games. J. Optim. Theory Appl. 110 (2001), 515–544.MathSciNetzbMATHCrossRefGoogle Scholar
  30. 30.
    J. W. Friedman, A non-cooperative equilibrium for supergames. Review Econ. Stud. 38 (1971), 1–12.zbMATHCrossRefGoogle Scholar
  31. 31.
    X. He, A. Prasad, S. P. Sethi, and G. J. Gutierrez, A survey of Stackelberg differential game models in supply and marketing channels. J. Syst. Sci. Syst. Eng. 16 (2007), 385–413.CrossRefGoogle Scholar
  32. 32.
    M. Jungers, Matrix block formulation of closed-loop memoryless Stackelberg strategy for discrete-time games. In: Proc. 47th IEEE Conf. on Decision and Control, Cancun, Mexico, December 2008.Google Scholar
  33. 33.
    M. Jungers and C. Oară, Anti-palindromic pencil formulation for openloop Stackelberg strategy in discrete-time. In: Proc. 19th Int. Symp. on Mathematical Theory of Networks and Systems (MTNS), Budapest, Hungary, July 2010, pp. 2265–2268.Google Scholar
  34. 34.
    S. Lasaulce, Y. Hayel, R. E. Azouzi, and M. Debbah, Introducing hierarchy in energy games. IEEE Trans. Wireless Commun. 8 (2009), 3833–3843.CrossRefGoogle Scholar
  35. 35.
    E. Lee and L. Markus, Foundations of optimal control theory. Wiley, New York (1967).zbMATHGoogle Scholar
  36. 36.
    G. Leitmann, On generalized Stackelberg strategies. J. Optim. Theory Appl. 26 (1978).Google Scholar
  37. 37.
    D. Limebeer, B. Anderson, and H. Hendel, A Nash game approach to mixed H 2/H control. IEEE Trans. Automat. Control 39 (1994), 69– 82.MathSciNetzbMATHCrossRefGoogle Scholar
  38. 38.
    P. Loridan and J. Morgan, A theoretical approximation scheme for Stackelberg problems. J. Optim. Theory Appl. 61 (1989), 95–110.MathSciNetzbMATHCrossRefGoogle Scholar
  39. 39.
    ______, Weak via strong Stackelberg problem: New results. J. Global Optim. 8 (1996), 263–287.MathSciNetzbMATHCrossRefGoogle Scholar
  40. 40.
    L. Mallozzi and J. Morgan, Existence of a feedback equilibrium for twostage Stackelberg games. IEEE Trans. Automat. Control 42 (1997), 1612–1614.MathSciNetzbMATHCrossRefGoogle Scholar
  41. 41.
    J. Medanic, Closed-loop Stackelberg strategies in linear-quadratic problems. IEEE Trans. Automat. Control 23 (1978), 632–637.zbMATHCrossRefGoogle Scholar
  42. 42.
    P.-Y. Nie, Dynamic Stackelberg games under open-loop complete information. J. Franklin Inst., 342 (2005), 737–748.MathSciNetzbMATHCrossRefGoogle Scholar
  43. 43.
    P.-Y. Nie, L.-H. Chen, and M. Fukushima, Dynamic programming approach to discrete time dynamic feedback Stackelberg games with independent and dependent followers. Eur. J. Oper. Res. 169 (2006), 310–328.MathSciNetzbMATHCrossRefGoogle Scholar
  44. 44.
    P.-Y. Nie, M.-Y. Lai, and S.-J. Zhu, Dynamic feedback Stackelberg games with nonunique solutions. Nonlin. Anal. 69 (2008), 1904–1913.MathSciNetzbMATHCrossRefGoogle Scholar
  45. 45.
    A.-J. Novak, G. Feichtinger, and G. Leitmann, A differential game related to terrorism: Nash and Stackelberg strategies. J. Optim. Theory Appl., 144 (2010), 533–555.MathSciNetzbMATHCrossRefGoogle Scholar
  46. 46.
    G. P. Papavassilopoulos and J. B. Cruz, Nonclassical control problems and Stackelberg games. IEEE Trans. Automat. Control 24 (1979), 155–166.MathSciNetzbMATHCrossRefGoogle Scholar
  47. 47.
    M. Simaan and J. B. Cruz, Additional aspects of the Stackelberg strategy in nonzero-sum games. J. Optim. Theory Appl. 11 (1973), 613–626.MathSciNetzbMATHCrossRefGoogle Scholar
  48. 48.
    ______, On the Stackelberg strategy in nonzero-sum games. J. Optim. Theory Appl. 11 (1973), 533–555.MathSciNetzbMATHCrossRefGoogle Scholar
  49. 49.
    A. W. Starr and Y. C. Ho, Further properties of nonzero-sum differential games. J. Optim. Theory Appl. 3 (1969), 207–219.MathSciNetzbMATHCrossRefGoogle Scholar
  50. 50.
    ______, Nonzero-sum differential games. J. Optim. Theory Appl. 3 (1969), 184–206.MathSciNetzbMATHCrossRefGoogle Scholar
  51. 51.
    B. Tolwinski, Closed-loop Stackelberg solution to a multistage linearquadratic game. J. Optim. Theory Appl., 34 (1981), 485–501.MathSciNetzbMATHCrossRefGoogle Scholar
  52. 52.
    ______, A Stackelberg solution of dynamic games. IEEE Trans. Automat. Control 28 (1983), 85–93.MathSciNetzbMATHCrossRefGoogle Scholar
  53. 53.
    E. Trélat, Asymptotics of accessibility sets along an abnormal trajectory. ESAIM Control Optim. Calc. Var. 6 (2001), 387–414.MathSciNetzbMATHCrossRefGoogle Scholar
  54. 54.
    ______, Contrôle optimal: théorie et applications. Vuibert (2005).Google Scholar
  55. 55.
    L. N. Vicente and P. H. Calamai, Bilevel and multilevel programming: a bibliography review. J. Global Optim. 5 (1994), 291–306.MathSciNetzbMATHCrossRefGoogle Scholar
  56. 56.
    H. von Stackelberg, Marktform und Gleichgewicht. Springer-Verlag, Berlin (1934).Google Scholar
  57. 57.
    X. Yashan, Stackelberg equilibirums of open-loop differential games. In: Proc. 26th Chinese Control Conf., Zhangjiajie, Hunan, China, July 2007, pp. 446–450.Google Scholar
  58. 58.
    J. J. Ye, Optimal strategies for bilevel dynamic problems. SIAM J. Control Optim. 35 (1997), 512–531.MathSciNetzbMATHCrossRefGoogle Scholar
  59. 59.
    K. Zhou, J. C. Doyle, and K. Glover, Robust and optimal control. Prentice Hall, New Jersey (1996).zbMATHGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC 2011

Authors and Affiliations

  1. 1.CRAN UMR CNRS 7039Vandoeuvre cedexFrance
  2. 2.Universite d’Orleans, UFR Sciences Federation Denis Poisson Mathematiques, Laboratoire MAPMOOrleans Cedex 2France
  3. 3.SATIE ENS CACHANCachan CedexFrance

Personalised recommendations