Skip to main content

Cooperative Multi-agent Learning in a Large Dynamic Environment

  • Conference paper
  • First Online:
Modeling Decisions for Artificial Intelligence (MDAI 2015)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9321))

Abstract

In this work, we are addressing the problem of cooperative multi-agent learning for distributed decision making in non stationary environments. Our principal focus is to improve learning by exchanging information between local neighbors (agents) and to ensure the adaption to the new environmental form without ignoring knowledge already acquired. First, a distributed dynamic correlation matrix based on multi-Q learning method, presented in [1], is evaluated. To overcome the shortcomings of this method, a new multi-agent reinforcement learning approach and a new cooperative action selection strategy are developed. Several simulation tests are conducted using a cooperative foraging task with a single moving target and show the efficiency of the proposed methods in the case of large, unknown and temporary dynamic environments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Hongliang, G., Yan, M.: Distributed reinforcement learning for coordinate multi-robot foraging. J. Intell. Rob. Syst. 60(3–4), 531–551 (2010)

    MATH  Google Scholar 

  2. Yifan, C.: Intelligent Multi-robot Cooperation for Target Searching and Foraging Tasks in completely Unknown Environments. Ph.D. dissertation, University of Guelph, Guelph (2013)

    Google Scholar 

  3. Yogeswaran, M., Ponnambalam, S.G.: Reinforcement learning: explorationexploitation dilemma in multi-agent foraging task. OPSEARCH 49(3), 223–236 (2012)

    Article  Google Scholar 

  4. Chen, K., Lin, F., Tan, Q., Shi, Z.: Adaptive action selection using utility-based reinforcement learning. In: IEEE International Conference on Granular Computing GRC 2009, pp. 67–72. IEEE, August 2009

    Google Scholar 

  5. Jaradat, M.A.K., Al-Rousan, M., Quadan, L.: Reinforcement based mobile robot navigation in dynamic environment. Robot. Comput. Integr. Manuf. 27(1), 135–149 (2011)

    Article  Google Scholar 

  6. Cunningham, B., Cao, Y.: Non-reciprocating sharing methods in cooperative Q-learning environments. In: Proceedings of the the 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology, Vol. 02, pp. 212–219. IEEE Computer Society, December 2012

    Google Scholar 

  7. Torrey, L., Taylor, M.E.: Help an agent out: student/teacher learning in sequential decision tasks. In: Proceedings of the Adaptive and Learning Agents workshop (at AAMAS-120) (2012)

    Google Scholar 

  8. Coggan, M.: Exploration and exploitation in reinforcement learning. In: Proceedings of the Fourth International Conference on Computational Intelligence and Multimedia Applications (ICCIMA 2001), Shonan International Village Yokosuka City (2001)

    Google Scholar 

  9. Mataric, M.J.: Learning in multi-robot systems. In: Weiss, G., Sen, S. (eds.) IJCAI-WS 1995. LNCS, vol. 1042, pp. 152–163. Springer, Heidelberg (1996)

    Chapter  Google Scholar 

  10. Stone, P., Sutton, R.S., Kuhlmann, G.: Reinforcement learning for robocup soccer keepaway. Adaptive Behav. 13(3), 165–188 (2005)

    Article  Google Scholar 

  11. Panait, L., Luke., S.: A pheromone-based utility model for collaborative foraging. In: Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, pp. 36–43, Washington, DC (2004)

    Google Scholar 

  12. Hrolenok, B., Luke, S., Sullivan, K., Vo, C.: Collaborative Foraging using Beacons. In: van der Hoek., et al. (ed.) Proceedings of the Ninth International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), pp. 1197–1204 (2010)

    Google Scholar 

  13. Tan, M.: Multi-agent reinforcement learning: independent vs. cooperative agents. In: Proceedings of the Tenth International Conference on Machine Learning, pp. 330–337 (1993)

    Google Scholar 

  14. Guo, H., Meng, Y.: Dynamic correlation matrix based multi-q learning for a multi-robot system. In: 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems IROS 2008, pp. 840–845. IEEE, September 2008

    Google Scholar 

  15. Zemzem, W., Tagina, M.: A Novel exploration/exploitation policy accelerating learning in both stationary and non-stationary environment navigation tasks. Int. J. Comput. Electr. Eng. 7(3), 149–158 (2015)

    Google Scholar 

  16. Sutton, S., Barto, G.: Reinforcement Learning: An Introduction. The MIT Press, Cambridge (1998)

    Google Scholar 

  17. http://simbad.sourceforge.net/

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wiem Zemzem .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Zemzem, W., Tagina, M. (2015). Cooperative Multi-agent Learning in a Large Dynamic Environment. In: Torra, V., Narukawa, T. (eds) Modeling Decisions for Artificial Intelligence. MDAI 2015. Lecture Notes in Computer Science(), vol 9321. Springer, Cham. https://doi.org/10.1007/978-3-319-23240-9_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-23240-9_13

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-23239-3

  • Online ISBN: 978-3-319-23240-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics