Coordination in Collaborative Work by Deep Reinforcement Learning with Various State Descriptions

Miyashita, Yuki; Sugawara, Toshiharu

doi:10.1007/978-3-030-33792-6_40

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11873))

Included in the following conference series:

International Conference on Principles and Practice of Multi-Agent Systems

842 Accesses
3 Citations

Abstract

Cooperation and coordination are sophisticated behaviors and are still major issues in studies on multi-agent systems because how to cooperate and coordinate depends on not only environmental characteristics but also the behaviors/strategies that closely affect each other. On the other hand, recently using the multi-agent deep reinforcement learning (MADRL) has received much attention because of the possibility of learning and facilitating their coordinated behaviors. However, the characteristics of socially learned coordination structures have been not sufficiently clarified. In this paper, by focusing on the MADRL in which each agent has its own deep Q-networks (DQNs), we show that the different types of input to the network lead to various coordination structures, using the pickup and floor laying problem, which is an abstract form related to our target problem. We also indicate that the generated coordination structures affect the entire performance of multi-agent systems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Agmon, N., Kraus, S., Kaminka, G.A.: Multi-robot perimeter patrol in adversarial settings. In: 2008 IEEE International Conference on Robotics and Automation, pp. 2339–2345. IEEE (2008)
Google Scholar
Foerster, J., Nardelli, N., Farquhar, G., Torr, P., Kohli, P., Whiteson, S., et al.: Stabilising experience replay for deep multi-agent reinforcement learning. arXiv preprint arXiv:1702.08887 (2017)
Giuggioli, L., Arye, I., Heiblum Robles, A., Kaminka, G.A.: From ants to birds: a novel bio-inspired approach to online area coverage. In: Groß, R., et al. (eds.) Distributed Autonomous Robotic Systems. SPAR, vol. 6, pp. 31–43. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-73008-0_3
Chapter Google Scholar
Gu, S., Holly, E., Lillicrap, T., Levine, S.: Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates. In: 2017 IEEE International Conference on Robotics and Automation (ICRA), pp. 3389–3396. IEEE (2017)
Google Scholar
Lample, G., Chaplot, D.S.: Playing FPS games with deep reinforcement learning. In: AAAI, pp. 2140–2146 (2017)
Google Scholar
Liu, M., Ma, H., Li, J., Koenig, S.: Task and path planning for multi-agent pickup and delivery. In: Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, pp. 1152–1160. IFAAMAS (2019)
Google Scholar
Palmer, G., Tuyls, K., Bloembergen, D., Savani, R.: Lenient multi-agent deep reinforcement learning. In: Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, pp. 443–451. IFAAMAS (2018)
Google Scholar
Van Hasselt, H., Guez, A., Silver, D.: Deep reinforcement learning with double q-learning. In: AAAI, Phoenix, AZ, vol. 2, p. 5 (2016)
Google Scholar

Download references

Acknowledgement

This work was partly supported by JSPS KAKENHI Grant Number 17KT0044.

Author information

Authors and Affiliations

Computer Science and Engineering, Waseda University, Tokyo, 1698555, Japan
Yuki Miyashita & Toshiharu Sugawara
Shimizu Corporation, Tokyo, 1040031, Japan
Yuki Miyashita

Authors

Yuki Miyashita
View author publications
You can also search for this author in PubMed Google Scholar
Toshiharu Sugawara
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Yuki Miyashita or Toshiharu Sugawara .

Editor information

Editors and Affiliations

Università degli Studi di Torino, Turin, Italy
Matteo Baldoni
Utrecht University, Utrecht, The Netherlands
Mehdi Dastani
Zhejiang University, Hangzhou, China
Beishui Liao
National Institute of Advanced Industrial Science and Technology, Tokyo, Japan
Yuko Sakurai
University of Texas at Dallas, Richardson, TX, USA
Rym Zalila Wenkstern

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Miyashita, Y., Sugawara, T. (2019). Coordination in Collaborative Work by Deep Reinforcement Learning with Various State Descriptions. In: Baldoni, M., Dastani, M., Liao, B., Sakurai, Y., Zalila Wenkstern, R. (eds) PRIMA 2019: Principles and Practice of Multi-Agent Systems. PRIMA 2019. Lecture Notes in Computer Science(), vol 11873. Springer, Cham. https://doi.org/10.1007/978-3-030-33792-6_40

Download citation

DOI: https://doi.org/10.1007/978-3-030-33792-6_40
Published: 21 October 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-33791-9
Online ISBN: 978-3-030-33792-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics