Abstract
In this work, we are addressing the problem of cooperative multi-agent learning for distributed decision making in non stationary environments. Our principal focus is to improve learning by exchanging information between local neighbors (agents) and to ensure the adaption to the new environmental form without ignoring knowledge already acquired. First, a distributed dynamic correlation matrix based on multi-Q learning method, presented in [1], is evaluated. To overcome the shortcomings of this method, a new multi-agent reinforcement learning approach and a new cooperative action selection strategy are developed. Several simulation tests are conducted using a cooperative foraging task with a single moving target and show the efficiency of the proposed methods in the case of large, unknown and temporary dynamic environments.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Hongliang, G., Yan, M.: Distributed reinforcement learning for coordinate multi-robot foraging. J. Intell. Rob. Syst. 60(3–4), 531–551 (2010)
Yifan, C.: Intelligent Multi-robot Cooperation for Target Searching and Foraging Tasks in completely Unknown Environments. Ph.D. dissertation, University of Guelph, Guelph (2013)
Yogeswaran, M., Ponnambalam, S.G.: Reinforcement learning: explorationexploitation dilemma in multi-agent foraging task. OPSEARCH 49(3), 223–236 (2012)
Chen, K., Lin, F., Tan, Q., Shi, Z.: Adaptive action selection using utility-based reinforcement learning. In: IEEE International Conference on Granular Computing GRC 2009, pp. 67–72. IEEE, August 2009
Jaradat, M.A.K., Al-Rousan, M., Quadan, L.: Reinforcement based mobile robot navigation in dynamic environment. Robot. Comput. Integr. Manuf. 27(1), 135–149 (2011)
Cunningham, B., Cao, Y.: Non-reciprocating sharing methods in cooperative Q-learning environments. In: Proceedings of the the 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology, Vol. 02, pp. 212–219. IEEE Computer Society, December 2012
Torrey, L., Taylor, M.E.: Help an agent out: student/teacher learning in sequential decision tasks. In: Proceedings of the Adaptive and Learning Agents workshop (at AAMAS-120) (2012)
Coggan, M.: Exploration and exploitation in reinforcement learning. In: Proceedings of the Fourth International Conference on Computational Intelligence and Multimedia Applications (ICCIMA 2001), Shonan International Village Yokosuka City (2001)
Mataric, M.J.: Learning in multi-robot systems. In: Weiss, G., Sen, S. (eds.) IJCAI-WS 1995. LNCS, vol. 1042, pp. 152–163. Springer, Heidelberg (1996)
Stone, P., Sutton, R.S., Kuhlmann, G.: Reinforcement learning for robocup soccer keepaway. Adaptive Behav. 13(3), 165–188 (2005)
Panait, L., Luke., S.: A pheromone-based utility model for collaborative foraging. In: Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, pp. 36–43, Washington, DC (2004)
Hrolenok, B., Luke, S., Sullivan, K., Vo, C.: Collaborative Foraging using Beacons. In: van der Hoek., et al. (ed.) Proceedings of the Ninth International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), pp. 1197–1204 (2010)
Tan, M.: Multi-agent reinforcement learning: independent vs. cooperative agents. In: Proceedings of the Tenth International Conference on Machine Learning, pp. 330–337 (1993)
Guo, H., Meng, Y.: Dynamic correlation matrix based multi-q learning for a multi-robot system. In: 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems IROS 2008, pp. 840–845. IEEE, September 2008
Zemzem, W., Tagina, M.: A Novel exploration/exploitation policy accelerating learning in both stationary and non-stationary environment navigation tasks. Int. J. Comput. Electr. Eng. 7(3), 149–158 (2015)
Sutton, S., Barto, G.: Reinforcement Learning: An Introduction. The MIT Press, Cambridge (1998)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Zemzem, W., Tagina, M. (2015). Cooperative Multi-agent Learning in a Large Dynamic Environment. In: Torra, V., Narukawa, T. (eds) Modeling Decisions for Artificial Intelligence. MDAI 2015. Lecture Notes in Computer Science(), vol 9321. Springer, Cham. https://doi.org/10.1007/978-3-319-23240-9_13
Download citation
DOI: https://doi.org/10.1007/978-3-319-23240-9_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23239-3
Online ISBN: 978-3-319-23240-9
eBook Packages: Computer ScienceComputer Science (R0)