Distributed Optimal Flocking Design for Multi-agent Two-Player Zero-Sum Games with Unknown System Dynamics and Disturbance

Xu, Hao; Carrillo, Luis Rodolfo Garcia

doi:10.1007/978-3-319-50835-1_54

Hao Xu²⁵ &
Luis Rodolfo Garcia Carrillo²⁵

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 10072))

Included in the following conference series:

International Symposium on Visual Computing

4215 Accesses

Abstract

In this paper, distributed flocking strategies have been exploited for multi-agent two-player zero-sum games. Two main challenges are addressed, i.e. (a) handling system uncertainties and disturbances, and (b) achieving optimality. Adopting the emerging Approximate Dynamic Programming (ADP) technology, a novel distributed adaptive flocking design is proposed to optimize the multi-agent two-player zero-sum games even when the system dynamics and disturbances are unknown. First, to evaluate the multi-agent flocking performance and effects from disturbances, a novel flocking cost function is developed. Next, an innovative type of online neural network (NN) based identifier is proposed to approximate the multi-agent zero-sum game system dynamics effectively. Subsequently, another novel neural network (NN) is proposed to approximate the optimal flocking cost function by using the Hamilton-Jacobi-Isaacs (HJI) equation in a forward in time manner. Moreover, a novel additional term is designed and included into the NN update law to relax the stringent requirement of initial admissible control. Eventually, the distributed adaptive optimal flocking design is obtained by using the learnt Multi-agent zero-sum games system dynamics and approximated optimal flocking cost function. Simulation results demonstrate the effectiveness of proposed scheme.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Reynolds, C.W.: Flocks, herds, and schools: a distributed behavioral model. Comput. Graph. 21, 25–34 (1986)
Article Google Scholar
Saber, R.O.: Flocking for multi-agent dynamic systems: algorithms and theory. IEEE Trans. Autom. Control 51, 401–420 (2006)
Article MathSciNet Google Scholar
Wang, Q., Fang, H., Chen, J., Mao, Y., Dou, L.: Flocking with obstacle avoidance and connectivity maintenance in multi-agent systems. In: Proceedings of IEEE Control and Decision Conference, pp. 4009–4014 (2012)
Google Scholar
Dragan, V., Morozan, R.: Global solution to game-theoretic riccati equation of stochastic control. J. Diff. Equ. 138, 328–350 (1997)
Article MathSciNet MATH Google Scholar
Wang, J., Xin, M.: Integrated optimal formation control of multiple unmanned aerial vehicles. IEEE Trans. Control Syst. Tech. 21, 1731–1744 (2013)
Article Google Scholar
Lewis, F.L., Vrabie, D., Syrmos, V.L.: Optimal Control, 3rd edn. Wiley, New York (2012)
Book MATH Google Scholar
Bertsekas, D.P., Tsitsiklis, J.: Neuro-Dynamic Programming. Athena Scientific, CA (1996)
MATH Google Scholar
Al-Tamimi, A., Lewis, F.L., Abu-Khalaf, M.: Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control. Automatica 3, 471–481 (2007)
MathSciNet MATH Google Scholar
Dierks, T., Jagannathan, S.: Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using time-based policy update. IEEE Trans. Neural Netw. Learn. Syst. 23, 1118–1129 (2012)
Article Google Scholar
Diestel, R.: Graph Theory. Graduate Texts in Mathematics, vol. 184. Springer, Heidelberg (2000)
MATH Google Scholar
Jagannathan, S.: Neural Network Control of Nonlinear Discrete-Time Systems. CRC Press, FL (2006)
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electrical and Biomedical Engineering, University of Nevada, Reno, Nevada, USA
Hao Xu & Luis Rodolfo Garcia Carrillo

Authors

Hao Xu
View author publications
You can also search for this author in PubMed Google Scholar
Luis Rodolfo Garcia Carrillo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hao Xu .

Editor information

Editors and Affiliations

University of Nevada , Reno, Nevada, USA
George Bebis
NASA Ames Research Center , Moffett Field, California, USA
Richard Boyle
Lawrence Berkeley National Laboratory , Berkeley, California, USA
Bahram Parvin
Desert Research Institute , Reno, Nevada, USA
Darko Koracin
The Australian National University , O'Malley, Aust Capital Terr, Australia
Fatih Porikli
Pilot AI Labs , Redwood City, California, USA
Sandra Skaff
University of Florida , Gainesville, Florida, USA
Alireza Entezari
Google Inc. , Mountain View, California, USA
Jianyuan Min
Osaka University , Osaka, Japan
Daisuke Iwai
The MOVES Institute , Monterey, California, USA
Amela Sadagic
University of Arizona , Tucson, Arizona, USA
Carlos Scheidegger
Université Paris-Sud , Orsay, France
Tobias Isenberg

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Xu, H., Carrillo, L.R.G. (2016). Distributed Optimal Flocking Design for Multi-agent Two-Player Zero-Sum Games with Unknown System Dynamics and Disturbance. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2016. Lecture Notes in Computer Science(), vol 10072. Springer, Cham. https://doi.org/10.1007/978-3-319-50835-1_54

Download citation

DOI: https://doi.org/10.1007/978-3-319-50835-1_54
Published: 10 December 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-50834-4
Online ISBN: 978-3-319-50835-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics