Skip to main content
Log in

Stackelberg Strategy of Two-player Stochastic Difference Game With Time Delay

  • Regular Papers
  • Control Theory and Applications
  • Published:
International Journal of Control, Automation and Systems Aims and scope Submit manuscript

Abstract

In this paper, we study the open-loop Stackelberg strategy of stochastic Stackelberg game with time delay. The main contribution is to give the explicit Stackelberg strategy in terms of Riccati equations. The key to solving the problem is the explicit solvability of the forward and backward stochastic difference equations (FBSDEs). Moreover, the optimal costs are indicated by using the initial value of the state.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. C. C. Yang and T. H. Cheng, “Leader-follower cooperative swinging by UAVs without interagent communication,” International Journal of Control, vol. 95, no. 1, pp. 104–113, January 2022.

    Article  MathSciNet  MATH  Google Scholar 

  2. B. He and F. Gao, “Influence analysis of leader information with application to formation control of multi-agent systems,” International Journal of Control, Automation, and Systems, vol. 18, no. 12, pp. 3062–3072, March 2020.

    Article  Google Scholar 

  3. K. K. Oh and H. S. Ahn, “Leader-follower type distance-based formation control of a group of autonomous agents,” International Journal of Control, Automation, and Systems, vol. 15, no. 4, pp. 1738–1745, June 2017.

    Article  Google Scholar 

  4. H. J. Liang, Z. X. Du, T. W. Huang, and Y. N. Pan, “Neuroadaptive performance guaranteed control for multiagent systems with power integrators and unknown measurement sensitivity,” IEEE Transaction on Neural Networks and Learning Systems, pp. 1–12, 2022. DOI:https://doi.org/10.1109/TNNLS.2022.3160532

  5. H. J. Liang, G. L. Liu, H. G. Zhang, and T. W. Huang, “Neural-network-based event-triggered adaptive control of nonaffine nonlinear multiagent systems with dynamic uncertainties,” IEEE Transaction on Neural Networks and Learning Systems, vol. 32, no. 5, pp. 2239–2250, May 2021.

    Article  MathSciNet  Google Scholar 

  6. Q. Lu, S. K. Lu, and Y. J. Leng, “A Nash-Stackelberg game approach in regional energy market considering users’ integrated demand response,” Energy, vol. 175, pp. 456–470, May 2019.

    Article  Google Scholar 

  7. W. Q. Hua, D. Li, H. J. Sun, and P. Matthews, “Stackelberg game-theoretic model for low carbon energy market scheduling,” IET Smart Grid, vol. 3, no. 1, pp. 31–41, February 2020.

    Article  Google Scholar 

  8. M. G. Lizarraga, O. Garcia, R. C. Garcia, and E. G. Rodriguez, “Differential game-based formation flight for quadrotors,” International Journal of Control, Automation, and Systems, vol. 16, no. 4, pp. 1854–1865, July 2018.

    Article  Google Scholar 

  9. L. Dong, J. H. Wang, S. S. Gu, Y. B. Shi, and F. M. Zhao, “Adaptive synchronization of leader-follower networked systems against communication attenuation and actuators faults,” International Journal of Control, Automation, and Systems, vol. 14, no. 6, pp. 1484–1492, October 2016.

    Article  Google Scholar 

  10. H. Mukaidani, and H. Xu, “Incentive Stackelberg games for stochastic linear systems with H constraint,” IEEE Transactions on Cybernetics, vol. 49, no. 4, pp. 1463–1474, April 2019.

    Article  Google Scholar 

  11. Y. D. Zhao, Y. Y. Zhang, and J. Y. Lee, “Lyapunov and sliding mode based leader-follower formation control for multiple mobile robots with an augmented distance-angle strategy,” International Journal of Control, Automation, and Systems, vol. 17, no. 5, pp. 1314–1321, May 2019.

    Article  Google Scholar 

  12. J. J. Xu, H. S. Zhang, and T. Y. Chai, “Necessary and sufficient condition for two-player Stackelberg strategy,” IEEE Transactions on Automatic Control, vol. 60, no. 5, pp. 1356–1361, March 2015.

    Article  MathSciNet  MATH  Google Scholar 

  13. G. Freiling, G. Jank, and H. Abou-Kandil, “Discrete-time riccati equations in open-loop Nash and Stackelberg games,” European Journal of Control, vol. 5, no. 1, pp. 56–66, 1999.

    Article  MATH  Google Scholar 

  14. M. Jungers, “Feedback strategies for discrete-time linear-quadratic two-player descriptor games,” Linear Algebra & Its Applications, vol. 440, no. 1, pp. 1–23, January 2014.

    Article  MathSciNet  MATH  Google Scholar 

  15. Y. P. Zheng, T. Basar, and J. B. Cruz, “Stackelberg strategies and incentives in multiperson deterministic decision problems,” IEEE Transactions on Systems Man & Cybernetics, vol. SMC-14, no. 1, pp. 10–24, January 1984.

    Article  MathSciNet  MATH  Google Scholar 

  16. M. Jungers, E. Trelat, and H. Abou-Kandil, “Min-max and min-min Stackelberg strategies with closed-loop information structure,” Journal of Dynamical & Control Systems, vol. 17, no. 3, pp. 387–425, July 2011.

    Article  MathSciNet  MATH  Google Scholar 

  17. P. Y. Nie, “Discrete time dynamic Stackelberg games with the leaders in turn,” Nonlinear Analysis: Real World Applications, vol. 11, no. 3, pp. 1685–1691, June 2010.

    MathSciNet  MATH  Google Scholar 

  18. M. Li, J. B. Cruz, and M. A. Simaan, “An approach to discrete-time incentive feedback Stackelberg games,” IEEE Transactions on Systems, Man, and Cybernetics-Part A: Systems and Humans, vol. 32, no. 4, pp. 472–481, July 2002.

    Article  Google Scholar 

  19. J. J. Xu, H. S. Zhang, and T. Basar, “Stackelberg solution for a two-agent rational expectations model,” Automatica, vol. 129, 109601, 2021.

    Article  MathSciNet  MATH  Google Scholar 

  20. H. Mukaidani, “Stackelberg strategy for discrete-time stochastic system and its application to H2/H control,” Proc. of American Control Conference, pp. 4488–4493, June 2014.

  21. H. Xu, “An MPCC approach for stochastic Stackel-berg-Nash-Cournot equilibrium,” Optimization, vol. 54, no. 1, pp. 27–57, February 2005.

    Article  MathSciNet  MATH  Google Scholar 

  22. Y. N. Lin, “Feedback Stackelberg strategies for the discrete-time mean-field stochastic systems in infinite horizon,” Journal of the Franklin Institute, vol. 356, no. 10, pp. 5222–5239, July 2019.

    Article  MathSciNet  MATH  Google Scholar 

  23. Y. L. Li, H. T. Li, X. Y. Ding, and G. D. Zhao, “Leader-follower consensus of multi-agent systems with time delays over finite fields,” IEEE Transactions on Cybernetics, vol. 49, no. 8, pp. 3203–3208, August 2019.

    Article  Google Scholar 

  24. X. K. Liu, R. R. Liu, and Y. Li, “Infinite time linear quadratic Sftackelberg game problem for unknown stochastic discrete-time systems via adaptive dynamic programming approach,” Asian Journal of Control, vol. 23, no. 2, pp. 937–948, March 2021.

    Article  MathSciNet  Google Scholar 

  25. Y. N. Lin, “Linear quadratic open-loop Stackelberg game for stochastic systems with Poisson jumps,” Journal of the Franklin Institute, vol. 358, no. 10, pp. 5262–5280, July 2021.

    Article  MathSciNet  MATH  Google Scholar 

  26. F. Vélez, M. Lizárraga, and C. R. Carreon, “Open loop robust equilibria in uncertain discrete time games,” International Journal of Control, Automation, and Systems, vol. 19, no. 2, pp. 587–595, October 2021.

    Article  Google Scholar 

  27. W. J. Meng and J. T. Shi, “A linear quadratic stochastic Stackelberg differential game with time delay,” Mathematical Control and Related Fields, vol. 12, no. 3, pp. 581–609, September 2022.

    Article  MathSciNet  MATH  Google Scholar 

  28. J. J. Xu and H. S. Zhang, “Sufficient and necessary open-loop Stackelberg strategy for two-player game with time delay,” IEEE Transactions on Cybernetics, vol. 46, no. 2, pp. 438–449, February 2016.

    Article  MathSciNet  Google Scholar 

  29. A. Bensoussan, M. H. M. Chau, Y. Lai, and S. C. P. Yam, “Linear-quadratic mean field Stackelberg games with state and control delays,” SIAM Journal on Control and Optimization, vol. 55, no. 4, pp. 2748–2781, August 2017.

    Article  MathSciNet  MATH  Google Scholar 

  30. X. S. Cai, B. L. Nikolaos, and K. Miroslav, “Input-to-state stability and inverse optimality of linear time-varying-delay predictor feedbacks,” IEEE Transactions on Automatic Control, vol. 63, no. 1, pp. 233–240, January 2018.

    Article  MathSciNet  MATH  Google Scholar 

  31. X. Q. Li, J. J. Xu, W. Wang, and H. S. Zhang, “Mixed H2/H control for discrete-time systems with input delay,” IET Control Theory & Applications, vol. 12, no. 16, pp. 2221–2231, July 2018.

    Article  MathSciNet  Google Scholar 

  32. J. J. Xu, J. T. Shi, and H. S. Zhang, “A leader-follower stochastic linear quadratic differential game with time delay,” Science China Information Sciences, vol. 61, pp. 1–13, November 2018.

    Article  MathSciNet  Google Scholar 

  33. H. S. Zhang, L. Li, J. J. Xu, and M. Y. Fu, “Linear quadratic regulation and stabilization of discrete-time systems with delay and multiplicative noise,” IEEE Transactions on Automatic Control, vol. 60, no. 10, pp. 2599–2613, October 2015.

    Article  MathSciNet  MATH  Google Scholar 

  34. Y. L. Chan and L. Kogan, “Catching up with the Joneses: Heterogeneous preferences and the dynamics of asset prices,” Journal of Political Economy, vol. 110, no. 6, pp. 1255–1285, December 2002.

    Article  Google Scholar 

  35. M. Y. Huang, “Large-population LQG games involving a major player: the Nash certainty equivalence principle,” SIAM Journal on Control and Optimization, vol. 48, no. 5, pp. 3318–3353, January 2010.

    Article  MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wei Wang.

Ethics declarations

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This work was supported by the National Natural Science Foundation of China under Grants 61821004, 62250056, and the Natural Science Foundation of Shandong Province (ZR2021ZD14, ZR2021JQ24), and Science and Technology Project of Qingdao West Coast New Area (2019-32, 2020-20, 2020-1-4), High-level Talent Team Project of Qingdao West Coast New Area (RCTD-JC-2019-05), Key Research and Development Program of Shandong Province (2020CXGC01208).

Jingmei Liu received her B.S. and M.Sc. degrees in mathematics from Qufu Normal University, in 2015 and 2018, respectively. She is currently a doctoral student with the School of Control Science and Engineering, Shandong University, China. Her research interests include optimal control, stochastic systems, and time-delay systems.

Wei Wang received his Ph.D. degree in control science and engineering from Shenzhen Graduate School, Harbin Institute of Technology in 2010. He is currently a Professor with the School of Control Science and Engineering, Shandong University, China. His research interests include optimal control and estimation for delayed systems, distributed control and estimation.

Juanjuan Xu received her B.S. degree in mathematics from Qufu Normal University in 2006, an M.Sc. degree in mathematics from Shandong University in 2009, and a Ph.D. degree in control theory from Shandong University in 2013. She is currently a Professor with the School of Control Science and Engineering, Shandong University, China. Her research interests include distributed consensus, optimal control, game theory, stochastic systems, and time-delay systems.

Huanshui Zhang received his B.S. degree in mathematics from Qufu Normal University in 1986 and his M.Sc. and Ph.D. degrees in control theory from Heilongjiang University, China, and Northeastern University, China, in 1991 and 1997, respectively. He worked as a postdoctoral fellow at Nanyang Technological University from 1998 to 2001 and Research Fellow at Chinese University of Hong Kong and Hong Kong Polytechnic University from 2001 to 2003. From 1986 to 1999, he served as an Assistant and Associate Professor at Taishan College, China. He held a Professor at Shandong University, China, from 1999 to 2003. From 2003 to 2006, he served as a Professor at Harbin Institute of Technology, Shenzhen, China. From 2006 to 2019, he was a Taishan Professor and Changjiang Professor at Shandong University, China. He is currently a Distinguished Professor at Shandong University of Science and Technology, China. He also held visiting appointments as Research Scientist and Fellow with Nanyang Technological University, Curtin University of Technology and Hong Kong City University from 2003 to 2006. His research interests include optimal and distributed control, time-delay systems, stochastic systems, and networked control systems. He was an Associate Editor of IEEE Transactions on Automatic Control, and IEEE Transactions on Circuits and Systems I.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, J., Wang, W., Xu, J. et al. Stackelberg Strategy of Two-player Stochastic Difference Game With Time Delay. Int. J. Control Autom. Syst. 21, 2904–2915 (2023). https://doi.org/10.1007/s12555-022-0534-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12555-022-0534-5

Keywords

Navigation