Abstract
We present a method of supervised learning from demonstration for real-time, online training of complex heterogenous multiagent behaviors which scale to large numbers of agents in operation. Our learning method is applicable in domains where coordinated behaviors must be created quickly in unexplored environments. Examples of such problem domains includes disaster relief, search and rescue, and gaming environments. We demonstrate this training method in an adversarial mining scenario which coordinates four types of individual agents to perform six distinct roles in a mining task.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Barca, J.C., Sekercioglu, Y.A.: Swarm robotics reviewed. Robotica 31(3), 345–359 (2013)
Blokzijl-Zanker, M., Demiris, Y.: Multi robot learning by demonstration. In: Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems-Volume 3, pp. 1207–1208. International Foundation for Autonomous Agents and Multiagent Systems (2012)
Brys, T., Harutyunyan, A., Suay, H.B., Chernova, S., Taylor, M.E., Nowé, A.: Reinforcement learning from demonstration through shaping. In: Twenty-Fourth International Joint Conference on Artificial Intelligence (2015)
Chernova, S., Veloso, M.: Confidence-based multi-robot learning from demonstration. Int. J. Soc. Robot. 2(2), 195–215 (2010). https://doi.org/10.1007/s12369-010-0060-0
Elston, J., Frew, E.W.: Hierarchical distributed control for search and tracking by heterogeneous aerial robot networks. In: 2008 IEEE International Conference on Robotics and Automation. ICRA 2008, pp. 170–175. IEEE (2008)
Freelan, D., Wicke, D., Sullivan, K., Luke, S.: Towards rapid multi-robot learning from demonstration at the robocup competition. In: Bianchi, R.A.C., Akin, H.L., Ramamoorthy, S., Sugiura, K. (eds.) RoboCup 2014. LNCS (LNAI), vol. 8992, pp. 369–382. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-18615-3_30
Horling, B., Lesser, V.: A survey of multi-agent organizational paradigms. Knowl. Eng. Rev. 19(4), 281–316 (2004)
Le, H.M., Yue, Y., Carr, P., Lucey, P.: Coordinated multi-agent imitation learning. In: Proceedings of the 34th International Conference on Machine Learning-Volume 70, pp. 1995–2003. JMLR. org (2017)
Luke, S., Ziparo, V.A.: Learn to behave! rapid training of behavior automata. In: Proceedings of Adaptive and Learning Agents Workshop at AAMAS 2010 (2010)
Martins, M.F., Demiris, Y.: Learning multirobot joint action plans from simultaneous task execution demonstrations. In: Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: Volume 1-Volume 1, pp. 931–938. International Foundation for Autonomous Agents and Multiagent Systems (2010)
Parker, L.E.: Multiple mobile robot systems. In: Siciliano, B., Khatib, O. (eds.) Springer Handbook of Robotics, pp. 921–941. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-30301-5_41
Pinciroli, C., O’Grady, R., Christensen, A.L., Dorigo, M.: Coordinating heterogeneous swarms through minimal communication among homogeneous sub-swarms. In: Dorigo, M., et al. (eds.) ANTS 2010. LNCS, vol. 6234, pp. 558–559. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-15461-4_59
Soule, T., Heckendorn, R.B.: A developmental approach to evolving scalable hierarchies for multi-agent swarms. In: Proceedings of the 12th Annual Conference Companion on Genetic and Evolutionary Computation, pp. 1769–1776. ACM (2010)
Squires, W.G., Luke, S.: LfD training of heterogeneous formation behaviors. In: AAAI Spring Symposia (2018)
Sullivan, K., Luke, S.: Learning from demonstration with swarm hierarchies. In: Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems-Volume 1, pp. 197–204. International Foundation for Autonomous Agents and Multiagent Systems (2012)
Sullivan, K., Luke, S.: Real-time training of team soccer behaviors. In: Chen, X., Stone, P., Sucar, L.E., van der Zant, T. (eds.) RoboCup 2012. LNCS (LNAI), vol. 7500, pp. 356–367. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-39250-4_32
Sullivan, K., Wei, E., Squires, B., Wicke, D., Luke, S.: Training heterogeneous teams of robots. In: Autonomous Robots and Multirobot Systems (ARMS) (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Squires, W., Luke, S. (2020). Scalable Heterogeneous Multiagent Learning from Demonstration. In: Demazeau, Y., Holvoet, T., Corchado, J., Costantini, S. (eds) Advances in Practical Applications of Agents, Multi-Agent Systems, and Trustworthiness. The PAAMS Collection. PAAMS 2020. Lecture Notes in Computer Science(), vol 12092. Springer, Cham. https://doi.org/10.1007/978-3-030-49778-1_21
Download citation
DOI: https://doi.org/10.1007/978-3-030-49778-1_21
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-49777-4
Online ISBN: 978-3-030-49778-1
eBook Packages: Computer ScienceComputer Science (R0)