Distributed Optimal Flocking Design for Multi-agent Two-Player Zero-Sum Games with Unknown System Dynamics and Disturbance

  • Conference paper
Advances in Visual Computing (ISVC 2016)

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 10072)

Abstract

In this paper, distributed flocking strategies are developed for multi-agent two-player zero-sum games. Two main challenges are addressed: (a) handling system uncertainties and disturbances, and (b) achieving optimality. Adopting the emerging Approximate Dynamic Programming (ADP) technique, a novel distributed adaptive flocking design is proposed to optimize multi-agent two-player zero-sum games even when the system dynamics and disturbances are unknown. First, to evaluate the multi-agent flocking performance and the effects of disturbances, a novel flocking cost function is developed. Next, an online neural network (NN) based identifier is proposed to approximate the multi-agent zero-sum game system dynamics. Subsequently, a second NN is proposed to approximate the optimal flocking cost function by using the Hamilton-Jacobi-Isaacs (HJI) equation in a forward-in-time manner. Moreover, a novel additional term is designed and included in the NN update law to relax the stringent requirement of an initial admissible control. Finally, the distributed adaptive optimal flocking design is obtained by using the learned multi-agent zero-sum game system dynamics and the approximated optimal flocking cost function. Simulation results demonstrate the effectiveness of the proposed scheme.
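
For orientation, the generic textbook form of a two-player zero-sum game and its HJI equation can be sketched as below. This is only an illustrative continuous-time formulation under assumed notation: the abstract does not state the paper's flocking cost function or whether the design is continuous- or discrete-time, and the symbols f, g, k, Q, R and \gamma are assumptions introduced here, not taken from the paper.

```latex
% Assumed generic dynamics: state x, control u (minimizing player),
% disturbance d (maximizing player).
\begin{align}
  \dot{x} &= f(x) + g(x)\,u + k(x)\,d \\
% Zero-sum value function: u minimizes and d maximizes the same cost.
  V^{*}(x_0) &= \min_{u}\,\max_{d} \int_{0}^{\infty}
      \left( Q(x) + u^{\top} R\,u - \gamma^{2}\, d^{\top} d \right) dt \\
% HJI equation satisfied by the optimal value function.
  0 &= Q(x) + \nabla V^{*\top}\!\left( f + g\,u^{*} + k\,d^{*} \right)
      + u^{*\top} R\,u^{*} - \gamma^{2}\, d^{*\top} d^{*} \\
% Saddle-point policies obtained from the stationarity conditions.
  u^{*} &= -\tfrac{1}{2}\, R^{-1} g^{\top} \nabla V^{*}, \qquad
  d^{*} = \tfrac{1}{2\gamma^{2}}\, k^{\top} \nabla V^{*}
\end{align}
```

In this generic setting, an identifier NN would stand in for the unknown f, g and k, and a second NN would approximate V^{*}, which is the role the abstract assigns to its two networks.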

Author information

Corresponding author

Correspondence to Hao Xu.

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Xu, H., Carrillo, L.R.G. (2016). Distributed Optimal Flocking Design for Multi-agent Two-Player Zero-Sum Games with Unknown System Dynamics and Disturbance. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2016. Lecture Notes in Computer Science, vol 10072. Springer, Cham. https://doi.org/10.1007/978-3-319-50835-1_54

  • DOI: https://doi.org/10.1007/978-3-319-50835-1_54

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-50834-4

  • Online ISBN: 978-3-319-50835-1

  • eBook Packages: Computer Science, Computer Science (R0)
