Connected Autonomous Vehicle Platoon Control Through Multi-agent Deep Reinforcement Learning

Xu, Guangfei; Chen, Bing; Li, Guangxian; He, Xiangkun

doi:10.1007/978-3-030-93479-8_16

Guangfei Xu^18,19,
Bing Chen²⁰,
Guangxian Li²¹ &
…
Xiangkun He²²

Part of the book series: Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering ((LNICST,volume 413))

Included in the following conference series:

International Conference on Broadband Communications, Networks and Systems

888 Accesses

Abstract

The rise of the artificial intelligence (AI) brings golden opportunity to accelerate the development of the intelligent transportation system (ITS). The platoon control of connected autonomous vehicle (CAV) as the key technology exhibits superior for improving traffic system. However, there still exist some challenges in multi-objective platoon control and multi-agent interaction. Therefore, this paper proposed a connected autonomous vehicle latoon control approach with multi-agent deep reinforcement learning (MADRL). Finally, the results in stochastic mixed traffic flow based on SUMO (simulation of urban mobility) platform demonstrate that the proposed method is feasible, effective and advanced.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Liu, D., Wang, Y., Shen, Y.: Electric vehicle charging and discharging coordination on distribution network using multi-objective particle swarm optimization and fuzzy decision making. Energies 9(3), 186 (2016)
Article Google Scholar
Delgarm, N., Sajadi, B., Kowsary, F., et al.: Multi-objective optimization of the building energy performance: a simulation-based approach by means of particle swarm optimization (PSO). Appl. Energy 170, 293–303 (2016)
Article Google Scholar
Zhang, Y., Guo, L., Gao, B., Qu, T., Chen, H.: Deterministic promotion reinforcement learning applied to longitudinal velocity control for automated vehicles. IEEE Trans. Veh. Technol. 69(1), 338–348 (2020). https://doi.org/10.1109/TVT.2019.2955959
Article Google Scholar
Xu, G., et al.: Hierarchical speed control for autonomous electric vehicle through deep reinforcement learning and robust control. IET Control Theory Appl. 1–13 (2021). https://doi.org/10.1049/cth2.12211
Jardine, P.T.: A reinforcement learning approach to predictive control design: autonomous vehicle applications. Queen’s University (Canada) (2018)
Google Scholar
Hang, P., Lv, C., Huang, C., Cai, J., Hu, Z., Xing, Y.: An integrated framework of decision making and motion planning for autonomous vehicles considering social behaviors. Electr. Eng. Syst. Sci. 1–11 (2020)
Google Scholar
Xu, J., Shu, H., Shao, Y.: Modeling of driver behavior on trajectory-speed decision making in minor traffic roadways with complex features. IEEE Trans. Intell. Transp. Syst. 20(1), 41–53 (2019). https://doi.org/10.1109/TITS.2018.2800086
Article Google Scholar
Liu, T., Wang, B., Cao, D., Tang, X., Yang, Y.: Integrated longitudinal speed decision-making and energy efficiency control for connected electrified vehicles. Electr. Eng. Syst. Sci. 1–11 (2020)
Google Scholar
He, X., Fei, C., Liu, Y., Yang, K., Ji, X.: Multi-objective longitudinal decision-making for autonomous electric vehicle: a entropy-constrained reinforcement learning approach. In: 2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC), Rhodes, Greece, pp. 1–6 (2020). https://doi.org/10.1109/ITSC45102.2020.9294736
Kreidieh, A.R., Wu, C., Bayen, A.M.: Dissipating stop-and-go waves in closed and open networks via deep reinforcement learning. In: 2018 21st International Conference on Intelligent Transportation Systems (ITSC), pp. 1475–1480. IEEE, November 2018
Google Scholar
Vinitsky, E., et al.: Benchmarks for reinforcement learning in mixed-autonomy traffic. In: Conference on Robot Learning, pp. 399–409. PMLR, October 2018
Google Scholar
Achiam, J., Held, D., Tamar, A., Abbeel, P.: Constrained policy optimization. In: International Conference on Machine Learning, pp. 22–31. PMLR, July 2017
Google Scholar
Bhatnagar, S., Sutton, R.S., Ghavamzadeh, M., Lee, M.: Natural actor-critic algorithms. Automatica 45(11), 2471–2482 (2009)
Article MathSciNet Google Scholar
Cao, X.R.: A basic formula for online policy gradient algorithms. IEEE Trans. Autom. Control 50(5), 696–699 (2005)
Article MathSciNet Google Scholar
Schulman, J., Wolski, F., Dhariwal, P., et al.: Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347 (2017)
Wu, C., Kreidieh, A.R., Parvate, K., Vinitsky, E., Bayen, A.M.: Flow: a modular learning framework for mixed autonomy traffic. IEEE Trans. Robot. (2021)
Google Scholar
Krajzewicz, D., Erdmann, J., Behrisch, M., Bieker, L.: Recent development and applications of sumo-simulation of urban mobility. Int. J. Adv. Syst. Meas. 5(3&4) (2012)
Google Scholar
Duan, Y., Chen, X., Houthooft, R., Schulman, J., Abbeel, P.: Benchmarking deep reinforcement learning for continuous control. CoRR, vol. abs/1604.06778 (2016). http://arxiv.org/abs/1604.06778
Liang, E., et al.: Ray RLlib: a composable and scalable reinforcement learning library. arXiv preprint arXiv:1712.09381 (2017)
Brockman, G., et al.: OpenAI Gym. arXiv preprint arXiv:1606.01540 (2016)
Treiber, M., Kesting, A.: Trajectory and floating-car data. In: Treiber, M., Kesting, A. (eds.) Traffic Flow Dynamics, pp. 7–12. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-32460-4_2.
Xu, G., et al.: Hierarchical speed control for autonomous electric vehicle through deep reinforcement learning and robust control. IET Control Theory Appl. (2021)
Google Scholar

Download references

Author information

Authors and Affiliations

Shandong University of Technology, Zibo, 255000, China
Guangfei Xu
Liaocheng Academy of Agricultural Sciences, Liaocheng, 252000, China
Guangfei Xu
Bentron Information Technology Co. Ltd., Shenzhen, 518000, China
Bing Chen
Guangxi University, Nanning, 530000, China
Guangxian Li
School of Mechanical and Aerospace Engineering, Nanyang Technological University, Singapore, 639798, Singapore
Xiangkun He

Authors

Guangfei Xu
View author publications
You can also search for this author in PubMed Google Scholar
Bing Chen
View author publications
You can also search for this author in PubMed Google Scholar
Guangxian Li
View author publications
You can also search for this author in PubMed Google Scholar
Xiangkun He
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

La Trobe University, Bundoora, VIC, Australia
Wei Xiang
RMIT University, Melbourne, VIC, Australia
Fengling Han
La Trobe University, Melbourne, VIC, Australia
Tran Khoa Phan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Xu, G., Chen, B., Li, G., He, X. (2022). Connected Autonomous Vehicle Platoon Control Through Multi-agent Deep Reinforcement Learning. In: Xiang, W., Han, F., Phan, T.K. (eds) Broadband Communications, Networks, and Systems. BROADNETS 2021. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 413. Springer, Cham. https://doi.org/10.1007/978-3-030-93479-8_16

Download citation

DOI: https://doi.org/10.1007/978-3-030-93479-8_16
Published: 01 January 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-93478-1
Online ISBN: 978-3-030-93479-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics