Generating collective wall-jumping behavior for a robotic swarm with self-teaching automatic curriculum learning

Nie, Xiaotong; Liang, Yupeng; Han, Ziyao; Ohkura, Kazuhiro

doi:10.1007/s10015-022-00833-z

Generating collective wall-jumping behavior for a robotic swarm with self-teaching automatic curriculum learning

Original Article
Published: 29 November 2022

Volume 28, pages 67–75, (2023)
Cite this article

Artificial Life and Robotics Aims and scope Submit manuscript

Xiaotong Nie¹,
Yupeng Liang¹,
Ziyao Han¹ &
…
Kazuhiro Ohkura¹

162 Accesses
Explore all metrics

Abstract

Swarm robotics (SR) is a research field about how to design a large number of robots so that they can generate meaningful collective behaviors. One of the promising approaches in designing a control policy is reinforcement learning (RL). However, it is well known that the sparse reward problem may arise, especially in cases of solving highly complex problems. Curriculum learning (CL) can be one of the effective approaches to overcoming this difficulty. In this paper, we propose a novel method called Self-Teaching Automatic Curriculum Learning (STACL). The training progress of different lessons is compared by agents to determine which lesson should be trained in the next episode. The collective wall-jumping task, in which the robots have to generate collective wall-jumping behavior to jump over the high wall and reach the goal as soon as possible, is employed to illustrate the effects. Simulation results show that the proposed approach has the fastest convergence speed and the most stable performance. In addition, we also conducted experiments to examine the flexibility of the developed controllers.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multi-agent deep reinforcement learning: a survey

Article Open access 15 April 2021

Economical Quadrupedal Multi-Gait Locomotion via Gait-Heuristic Reinforcement Learning

Article 18 May 2024

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

References

Şahin E (2004) Swarm robotics: from sources of inspiration to domains of application. In: International workshop on swarm robotics. Springer, pp 10–20
Seeley TD, Camazine S, Sneyd J (1991) Collective decision-making in honey bees: how colonies choose among nectar sources. Behav Ecol Sociobiol 28(4):277–290
Article Google Scholar
Bayındır L (2016) A review of swarm robotics tasks. Neurocomputing 172:292–321
Article Google Scholar
Francesca G, Brambilla M, Trianni V, Dorigo M, Birattari M (2012) Analysing an evolved robotic behaviour using a biological model of collegial decision making. In: Ziemke T, Balkenius C, Hallam J (eds) From animals to animats 12. Springer Berlin Heidelberg, Berlin, Heidelberg, pp 381–390
Chapter Google Scholar
Groß R, Dorigo M (2009) Towards group transport by swarms of robots. Int J Bio-Inspir Comput 1:01
Article Google Scholar
Hiraga M, Yasuda T, Ohkura K (2018) Evolutionary acquisition of autonomous specialization in a path-formation task of a robotic swarm. J Adv Comput Intell Intell Inform 22(5):621–628
Article Google Scholar
Gu S, Holly E, Lillicrap T, Levine S (2017) Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates. In: 2017 IEEE international conference on robotics and automation (ICRA), pp 3389–3396. IEEE
Kober J, Bagnell JA, Peters J (2013) Reinforcement learning in robotics: a survey. Int J Rob Res 32(11):1238–1274
Article Google Scholar
Hüttenrauch M, Adrian S, Neumann G et al (2019) Deep reinforcement learning for swarm systems. J Mach Learn Res 20(54):1–31
MATH Google Scholar
Bengio Y, Louradour J, Collobert R, Weston J (2009) Curriculum learning. In: Proceedings of the 26th annual international conference on machine learning, pp 41–48
Wang X, Chen Y, Zhu W (2021) A survey on curriculum learning. IEEE Trans Pattern Anal Mach Intell 44:4555–4576
Google Scholar
Chen D, Chen K, Zhang Z, Zhang B (2015) Mechanism of locust air posture adjustment. J Bionic Eng 12(3):418–431
Article Google Scholar
Noh M, Kim S-W, An S, Koh J-S, Cho K-J (2012) Flea-inspired catapult mechanism for miniature jumping robots. IEEE Trans Rob 28(5):1007–1018
Article Google Scholar
Romanishin JW, Gilpin K, Rus D (2013) M-blocks: momentum-driven, magnetic modular robots. In: 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp 4288–4295. IEEE
Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG, Graves A, Riedmiller M, Fidjeland AK, Ostrovski G et al (2015) Human-level control through deep reinforcement learning. Nature 518(7540):529–533
Article Google Scholar
Kulkarni TD, Narasimhan K, Saeedi A, Tenenbaum J (2016) Hierarchical deep reinforcement learning: integrating temporal abstraction and intrinsic motivation. Adv Neural Inform Process Syst 29
Duan Y, Chen X, Houthooft R, Schulman J, Abbeel P (2016) Benchmarking deep reinforcement learning for continuous control. In: International conference on machine learning, pp 1329–1338. PMLR
Matiisen T, Oliver A, Cohen T, Schulman J (2017) Teacher-student curriculum learning
Portelas R, Colas C, Weng L, Hofmann K, Oudeyer P-Y (2020) Automatic curriculum learning for deep RL: a short survey. CoRR, abs/2003.04664
Ivanovic B, Harrison J, Sharma A, Chen M, Pavone M (2018) Backward reachability curriculum for robotic reinforcement learning, Barc
Salimans T, Chen R (2018) Learning Montezuma’s revenge from a single demonstration. CoRR
Schulman J, Wolski F, Dhariwal P, Radford A, Klimov O (2017) Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347

Download references

Acknowledgements

This work was supported by Initiative for Realizing Diversity in the Research Environment (Specific Correspondence Type).

Author information

Authors and Affiliations

Graduate School of Advanced Science and Engineering, Hiroshima University, Hiroshima, Japan
Xiaotong Nie, Yupeng Liang, Ziyao Han & Kazuhiro Ohkura

Authors

Xiaotong Nie
View author publications
You can also search for this author in PubMed Google Scholar
Yupeng Liang
View author publications
You can also search for this author in PubMed Google Scholar
Ziyao Han
View author publications
You can also search for this author in PubMed Google Scholar
Kazuhiro Ohkura
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xiaotong Nie.

Additional information

Publisher's Note

Publisher's Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

SWARM Special Issue: This work was presented in part at the joint symposium of the 27th International Symposium on Artificial Life and Robotics, the 7th International Symposium on BioComplexity, and the 5th International Symposium on Swarm Behavior and Bio-Inspired Robotics (Online, January 25–27, 2022).

About this article

Cite this article

Nie, X., Liang, Y., Han, Z. et al. Generating collective wall-jumping behavior for a robotic swarm with self-teaching automatic curriculum learning. Artif Life Robotics 28, 67–75 (2023). https://doi.org/10.1007/s10015-022-00833-z

Download citation

Received: 10 June 2022
Accepted: 16 November 2022
Published: 29 November 2022
Issue Date: February 2023
DOI: https://doi.org/10.1007/s10015-022-00833-z

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Generating collective wall-jumping behavior for a robotic swarm with self-teaching automatic curriculum learning

Abstract

Access this article

Similar content being viewed by others

Multi-agent deep reinforcement learning: a survey

Economical Quadrupedal Multi-Gait Locomotion via Gait-Heuristic Reinforcement Learning

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

About this article

Cite this article

Keywords

Navigation

Generating collective wall-jumping behavior for a robotic swarm with self-teaching automatic curriculum learning

Abstract

Access this article

Similar content being viewed by others

Multi-agent deep reinforcement learning: a survey

Economical Quadrupedal Multi-Gait Locomotion via Gait-Heuristic Reinforcement Learning

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

About this article

Cite this article

Share this article

Keywords

Search

Navigation