Abstract
To address the difficulties of control-law design, poor portability, and poor stability in traditional multi-agent formation obstacle avoidance algorithms, a multi-agent formation obstacle avoidance method based on deep reinforcement learning (DRL) is proposed. The method combines the perception ability of convolutional neural networks (CNNs) with the decision-making ability of reinforcement learning, mapping the visual perception of the environment directly to control actions through end-to-end learning. A multi-agent system (MAS) model using the leader-follower formation method is designed, with the unicycle as the control object. An improved deep Q-network (DQN) algorithm is then designed: its discount factor and learning rate are adjusted, and a reward function is constructed that accounts for both the distance between each agent and the obstacles and a coordination factor among the agents, so that the formation can converge to the desired shape while avoiding obstacles and inter-agent collisions. Simulation results show that the proposed method achieves the expected goal of multi-agent formation obstacle avoidance and offers stronger portability than traditional algorithms.
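The two ingredients named in the abstract, a unicycle kinematic model for each agent and a shaped reward combining obstacle distance with an inter-agent coordination term, can be sketched as follows. This is an illustrative sketch only: the paper's exact reward function, weights (`w_goal`, `w_obs`, `w_coord`), safety radius, and desired spacing are not given in the abstract, so all of these names and values are assumptions.

```python
import math

def unicycle_step(x, y, theta, v, omega, dt=0.1):
    """One Euler step of the unicycle kinematic model:
    the agent moves with linear speed v along heading theta
    and turns with angular speed omega."""
    return (x + v * math.cos(theta) * dt,
            y + v * math.sin(theta) * dt,
            theta + omega * dt)

def shaped_reward(dist_to_goal, dist_to_obstacle, dist_to_neighbor,
                  safe_radius=0.5, desired_spacing=1.0,
                  w_goal=1.0, w_obs=2.0, w_coord=0.5):
    """Illustrative shaped reward with three terms:
    - progress: penalize remaining distance to the formation goal,
    - safety: penalize entering the safety radius around an obstacle,
    - coordination: penalize deviation from the desired inter-agent spacing."""
    r_goal = -w_goal * dist_to_goal
    r_obs = -w_obs * max(0.0, safe_radius - dist_to_obstacle)
    r_coord = -w_coord * abs(dist_to_neighbor - desired_spacing)
    return r_goal + r_obs + r_coord
```

In a DQN setting, `shaped_reward` would be evaluated after each `unicycle_step` to produce the scalar reward for the transition; the hyperparameters above would be tuned per environment.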
Foundation item
the National Natural Science Foundation of China (No. 61963006), and the Natural Science Foundation of Guangxi Province (Nos. 2020GXNSFDA238011, 2018GXNSFAA050029, and 2018GXNSFAA294085)
Cite this article
Ji, X., Hai, J., Luo, W. et al. Obstacle Avoidance in Multi-Agent Formation Process Based on Deep Reinforcement Learning. J. Shanghai Jiaotong Univ. (Sci.) 26, 680–685 (2021). https://doi.org/10.1007/s12204-021-2357-6