Cooperative Multi-agent Learning in a Large Dynamic Environment

Zemzem, Wiem; Tagina, Moncef

doi:10.1007/978-3-319-23240-9_13

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 9321)

Included in the following conference series:

International Conference on Modeling Decisions for Artificial Intelligence

Abstract

In this work, we address the problem of cooperative multi-agent learning for distributed decision making in non-stationary environments. Our principal focus is to improve learning by exchanging information between local neighbors (agents) and to ensure adaptation to the new form of the environment without discarding knowledge already acquired. First, the distributed dynamic correlation matrix based multi-Q learning method presented in [1] is evaluated. To overcome the shortcomings of this method, a new multi-agent reinforcement learning approach and a new cooperative action selection strategy are developed. Several simulation tests, conducted on a cooperative foraging task with a single moving target, show the efficiency of the proposed methods in large, unknown and temporarily dynamic environments.
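As background for the terms used in the abstract, the following is a minimal sketch of one-step tabular Q-learning together with a naive way for an agent to take over estimates from a neighbor. It is not the method proposed in the paper or the method of [1]. The action names, the values of `ALPHA` and `GAMMA`, and the `merge_from_neighbor` rule are all assumptions made for illustration.

```python
from collections import defaultdict

ALPHA = 0.1   # learning rate (assumed value)
GAMMA = 0.9   # discount factor (assumed value)
ACTIONS = ("north", "south", "east", "west")  # hypothetical grid actions


def make_q_table():
    """Return a Q-table mapping (state, action) to a value, default 0.0."""
    return defaultdict(float)


def q_update(q, state, action, reward, next_state):
    """Apply the standard one-step tabular Q-learning update in place."""
    best_next = max(q[(next_state, a)] for a in ACTIONS)
    td_error = reward + GAMMA * best_next - q[(state, action)]
    q[(state, action)] += ALPHA * td_error
    return q[(state, action)]


def merge_from_neighbor(q, neighbor_q):
    """Hypothetical sharing rule: keep the larger estimate for each entry."""
    for key, value in neighbor_q.items():
        if value > q[key]:
            q[key] = value


# One agent learns from a single rewarded step, then merges a neighbor's table.
agent_q = make_q_table()
updated = q_update(agent_q, (0, 0), "east", 1.0, (0, 1))

neighbor_q = make_q_table()
neighbor_q[((0, 0), "east")] = 0.5
merge_from_neighbor(agent_q, neighbor_q)
```

Taking the maximum is only one possible sharing rule. It can keep outdated estimates after the environment changes, so it should not be read as the cooperative strategy evaluated in the paper.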

References

1. Hongliang, G., Yan, M.: Distributed reinforcement learning for coordinate multi-robot foraging. J. Intell. Rob. Syst. 60(3–4), 531–551 (2010)
2. Yifan, C.: Intelligent Multi-robot Cooperation for Target Searching and Foraging Tasks in Completely Unknown Environments. Ph.D. dissertation, University of Guelph, Guelph (2013)
3. Yogeswaran, M., Ponnambalam, S.G.: Reinforcement learning: exploration-exploitation dilemma in multi-agent foraging task. OPSEARCH 49(3), 223–236 (2012)
4. Chen, K., Lin, F., Tan, Q., Shi, Z.: Adaptive action selection using utility-based reinforcement learning. In: IEEE International Conference on Granular Computing GRC 2009, pp. 67–72. IEEE, August 2009
5. Jaradat, M.A.K., Al-Rousan, M., Quadan, L.: Reinforcement based mobile robot navigation in dynamic environment. Robot. Comput. Integr. Manuf. 27(1), 135–149 (2011)
6. Cunningham, B., Cao, Y.: Non-reciprocating sharing methods in cooperative Q-learning environments. In: Proceedings of the 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology, vol. 02, pp. 212–219. IEEE Computer Society, December 2012
7. Torrey, L., Taylor, M.E.: Help an agent out: student/teacher learning in sequential decision tasks. In: Proceedings of the Adaptive and Learning Agents workshop (at AAMAS-120) (2012)
8. Coggan, M.: Exploration and exploitation in reinforcement learning. In: Proceedings of the Fourth International Conference on Computational Intelligence and Multimedia Applications (ICCIMA 2001), Shonan International Village, Yokosuka City (2001)
9. Mataric, M.J.: Learning in multi-robot systems. In: Weiss, G., Sen, S. (eds.) IJCAI-WS 1995. LNCS, vol. 1042, pp. 152–163. Springer, Heidelberg (1996)
10. Stone, P., Sutton, R.S., Kuhlmann, G.: Reinforcement learning for RoboCup soccer keepaway. Adaptive Behav. 13(3), 165–188 (2005)
11. Panait, L., Luke, S.: A pheromone-based utility model for collaborative foraging. In: Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, pp. 36–43, Washington, DC (2004)
12. Hrolenok, B., Luke, S., Sullivan, K., Vo, C.: Collaborative foraging using beacons. In: van der Hoek et al. (eds.) Proceedings of the Ninth International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), pp. 1197–1204 (2010)
13. Tan, M.: Multi-agent reinforcement learning: independent vs. cooperative agents. In: Proceedings of the Tenth International Conference on Machine Learning, pp. 330–337 (1993)
14. Guo, H., Meng, Y.: Dynamic correlation matrix based multi-Q learning for a multi-robot system. In: 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems IROS 2008, pp. 840–845. IEEE, September 2008
15. Zemzem, W., Tagina, M.: A novel exploration/exploitation policy accelerating learning in both stationary and non-stationary environment navigation tasks. Int. J. Comput. Electr. Eng. 7(3), 149–158 (2015)
16. Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. The MIT Press, Cambridge (1998)
17. http://simbad.sourceforge.net/

Author information

Authors and Affiliations

National School of Computer Science, University of Manouba, Manouba, Tunisia
Wiem Zemzem & Moncef Tagina

Corresponding author

Correspondence to Wiem Zemzem.

Editor information

Editors and Affiliations

University of Skövde, Skövde, Sweden
Vicenç Torra
Toho Gakuen, Tokyo, Japan
Yasuo Narukawa

About this paper

Cite this paper

Zemzem, W., Tagina, M. (2015). Cooperative Multi-agent Learning in a Large Dynamic Environment. In: Torra, V., Narukawa, Y. (eds) Modeling Decisions for Artificial Intelligence. MDAI 2015. Lecture Notes in Computer Science, vol 9321. Springer, Cham. https://doi.org/10.1007/978-3-319-23240-9_13

DOI: https://doi.org/10.1007/978-3-319-23240-9_13
Published: 01 September 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23239-3
Online ISBN: 978-3-319-23240-9
eBook Packages: Computer Science, Computer Science (R0)