Deep Reinforcement Learning Enabled Decision-Making for Autonomous Driving at Intersections

Li, Guofa; Li, Shenglong; Li, Shen; Qin, Yechen; Cao, Dongpu; Qu, Xingda; Cheng, Bo

doi:10.1007/s42154-020-00113-1

Deep Reinforcement Learning Enabled Decision-Making for Autonomous Driving at Intersections

Published: 13 November 2020

Volume 3, pages 374–385, (2020)
Cite this article

Automotive Innovation Aims and scope Submit manuscript

Guofa Li ORCID: orcid.org/0000-0002-7889-4695^1,2,
Shenglong Li¹,
Shen Li ORCID: orcid.org/0000-0002-7111-8861³,
Yechen Qin⁴,
Dongpu Cao²,
Xingda Qu¹ &
…
Bo Cheng⁵

2739 Accesses
52 Citations
Explore all metrics

Abstract

Road intersection is one of the most complex and accident-prone traffic scenarios, so it’s challenging for autonomous vehicles (AVs) to make safe and efficient decisions at the intersections. Most of the related studies focus on the solution to a single scenario or only guarantee safety without considering driving efficiency. To address these problems, this study proposed a deep reinforcement learning enabled decision-making framework for AVs to drive through intersections automatically, safely and efficiently. The mapping relationship between traffic images and vehicle operations was obtained by an end-to-end decision-making framework established by convolutional neural networks. Traffic images collected at two timesteps were used to calculate the relative velocity between vehicles. Markov decision process was employed to model the interaction between AVs and other vehicles, and the deep Q-network algorithm was utilized to obtain the optimal driving policy regarding safety and efficiency. To verify the effectiveness of the proposed decision-making framework, the top three accident-prone crossing path crash scenarios at intersections were simulated, when different initial vehicle states were adopted for better generalization capability. The results showed that the developed method could make AVs drive safely and efficiently through intersections in all of the tested scenarios.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A performance comparison of YOLOv8 models for traffic sign detection in the Robotaxi-full scale autonomous vehicle competition

Article 12 August 2023

Connected and Autonomous Vehicles and Infrastructures: A Literature Review

Article 24 November 2021

Traffic sign recognition based on deep learning

Article Open access 07 March 2022

Abbreviations

AV:: Autonomous vehicle
DQN:: Deep Q-network
DRL:: Deep reinforcement learning
LTAP/LD:: Left turn across path-lateral direction
LTAP/OD:: Left turn across path-opposite direction
MDP:: Markov decision process
OV:: Other vehicle
SCP:: Straight crossing path
V2I:: Vehicle-to-infrastructure
V2V:: Vehicle-to-vehicle

References

Tay, R.: A random parameters probit model of urban and rural intersection crashes. Accid. Anal. Prev. 84, 38–40 (2015)
Article Google Scholar
Li, G.F., Wang, Y., Zhu, F.P., et al.: Drivers’ visual scanning behavior at signalized and unsignalized intersections: a naturalistic driving study in China. J. Saf. Res. 71, 219–229 (2019)
Article Google Scholar
Werneke, J., Vollrath, M.: How do environmental characteristics at intersections change in their relevance for drivers before entering an intersection: analysis of drivers’ gaze and driving behavior in a driving simulator study. Cogn. Technol. Work 16(2), 157–169 (2014)
Article Google Scholar
Lemonnier, S., Brémond, R., Baccino, T.: Gaze behavior when approaching an intersection: dwell time distribution and comparison with a quantitative prediction. Transp. Res. Part F Traffic Psychol. Behav. 35, 60–74 (2015)
Article Google Scholar
NHTSA: Traffic Safety Facts 2017 Data (DOT HS 812 806, updated September 2019). National Highway Traffic Safety Administration, Washington, DC (2019). https://crashstats.nhtsa.dot.gov/Api/Public/ViewPublication/812806. Accessed 14 Feb 2020
Zhang, G.G., Yau, K.K.W., Chen, G.H.: Risk factors associated with traffic violations and accident severity in China. Accid. Anal. Prev. 59, 18–25 (2013)
Article Google Scholar
Ronald, J.: V2V/V2I Communications for Improved Road Safety and Efficiency. SAE, San Diego (2012)
Book Google Scholar
Li, G.F., Yang, Y.F., Qu, X.D.: Deep learning approaches on pedestrian detection in hazy weather. IEEE Trans. Ind. Electron. (2019). https://doi.org/10.1109/TIE.2019.2945295
Article Google Scholar
Dooley, D., McGinley, B., Hughes, C., et al.: A blind-zone detection method using a rear-mounted fisheye camera with combination of vehicle detection methods. IEEE Trans. Intell. Transp. Syst. 17(1), 264–278 (2015)
Article Google Scholar
Li, S.E.B., Li, G.F., Yu, J.Y., et al.: Kalman filter-based tracking of moving objects using linear ultrasonic sensor array for road vehicles. Mech. Syst. Signal Proc. 98, 173–189 (2018)
Article Google Scholar
Li, G.F., Li, S.E.B., Zou, R.B., et al.: Detection of road traffic participants using cost-effective arrayed ultrasonic sensors in low-speed traffic situations. Mech. Syst. Signal Proc. 132, 535–545 (2019)
Article Google Scholar
Noh, S.: Decision-making framework for autonomous driving at road intersections: safeguarding against collision, overly conservative behavior, and violation vehicles. IEEE Trans. Ind. Electron. 66(4), 3275–3286 (2018)
Article Google Scholar
Najm, W., Smith, J.D., Smith, D.L.: Analysis of crossing path crashes. John A. Volpe National Transportation Systems Center (US). No. DOT-VNTSC-NHTSA-01-03 (2001)
Brannstrom, M., Coelingh, E., Sjoberg, J.: Model-based threat assessment for avoiding arbitrary vehicle collisions. IEEE Trans. Intell. Transp. Syst. 11(3), 658–669 (2010)
Article Google Scholar
Li, G.F., Li, S.E., Cheng, B., et al.: Estimation of driving style in naturalistic highway traffic using maneuver transition probabilities. Transp. Res. Part C Emerg. Technol. 74, 113–125 (2017)
Article Google Scholar
Polychronopoulos, A., Tsogas, M., Amditis, A., et al.: Sensor fusion for predicting vehicles’ path for collision avoidance systems. IEEE Trans. Intell. Transp. Syst. 8(3), 549–562 (2007)
Article Google Scholar
Campos, G.R., Runarsson, A.H., Granum, F., et al.: Collision avoidance at intersections: a probabilistic threat-assessment and decision-making system for safety interventions. Paper presented at the 17th International IEEE Conference on Intelligent Transportation Systems, Qingdao, China, 8–11 October 2014
Jansson, J.: Collision avoidance theory with application to automotive collision mitigation. Dissertation, Linköping University (2005)
Noh, S., Han, W.Y.: Collision avoidance in on-road environment for autonomous driving. Paper presented at the 14th International Conference on Control, Automation and Systems, Seoul, South Korea, 22–25 October 2014
Naranjo, J.E., Gonzalez, C., Garcia, R., et al.: Lane-change fuzzy control in autonomous vehicles for the overtaking maneuver. IEEE Trans. Intell. Transp. Syst. 9(3), 438–450 (2008)
Article Google Scholar
Hubmann, C., Schulz, J., Becker, M., et al.: Automated driving in uncertain environments: planning with interaction and uncertain maneuver prediction. IEEE Trans. Intell. Veh. 3(1), 5–17 (2018)
Article Google Scholar
Schubert, R.: Evaluating the utility of driving: toward automated decision making under uncertainty. IEEE Trans. Intell. Transp. Syst. 9(3), 354–364 (2011)
Google Scholar
Jansson, J., Gustafsson, F.: A framework and automotive application of collision avoidance decision making. Automatica 44(9), 2347–2351 (2008)
Article MathSciNet MATH Google Scholar
Armand, A., Filliat, D., Ibanez-Guzman, J.: Modelling stop intersection approaches using Gaussian processes. Paper presented at the 16th International IEEE Conference on Intelligent Transportation Systems, The Hague, Netherlands, 6–9 October 2013
Huang, R., Liang, H., Zhao, P., et al.: Intent-estimation-and motion-model-based collision avoidance method for autonomous vehicles in urban environments. Appl. Sci. 7(5), 457 (2017)
Article Google Scholar
Notomista, G., Botsch, M.: A machine learning approach for the segmentation of driving maneuvers and its application in autonomous parking. J. Artif. Intell. Soft Comput. Res. 7(4), 243–255 (2017)
Article Google Scholar
Hussein, A., Gaber, M.M., Elyan, E., et al.: Imitation learning: a survey of learning methods. ACM Comput. Surv. 50(2), 1–35 (2017)
Article Google Scholar
Bojarski, M., Yeres, P., Choromanska, A., et al.: Explaining how a deep neural network trained with end-to-end learning steers a car (2017). arXiv preprint arXiv:1704.07911
Mo, C.M., Li, Y.N., Zheng, L.: Simulation and analysis on overtaking safety assistance system based on vehicle-to-vehicle communication. Automot. Innov. 1(2), 158–166 (2018)
Article Google Scholar
Hafner, M.R., Cunningham, D., Caminiti, L., et al.: Cooperative collision avoidance at intersections: algorithms and experiments. IEEE Trans. Intell. Transp. Syst. 14(3), 1162–1175 (2013)
Article Google Scholar
Rios-Torres, J., Malikopoulos, A.A.: A survey on the coordination of connected and automated vehicles at intersections and merging at highway on-ramps. IEEE Intell. Transp. Syst. Mag. 18(5), 1066–1077 (2017)
Article Google Scholar
Lee, J., Park, B.: Development and evaluation of a cooperative vehicle intersection control algorithm under the connected vehicles environment. IEEE Trans. Intell. Transp. Syst. 13(1), 81–90 (2012)
Article Google Scholar
Luo, Y.G., Yang, G., Xu, M.C., et al.: Cooperative lane-change maneuver for multiple automated vehicles on a highway. Automot. Innov. 2(3), 157–168 (2019)
Article Google Scholar
Zong, X.P., Xu, G.Y., Yu, G.Z., et al.: Obstacle avoidance for self-driving vehicle with reinforcement learning. SAE Int. J. Passeng. Cars Electron. Electr. Syst. 11(1), 30–39 (2017)
Article Google Scholar
Kendall, A., Hawke, J., Janz, D., et al.: Learning to drive in a day. Paper presented at the 2019 International Conference on Robotics and Automation, Montreal, QC, Canada, 20–24 May (2019)
Zhu, M.X., Wang, X.S., Wang, Y.H.: Human-like autonomous car-following model with deep reinforcement learning. Transp. Res. Part C Emerg. Technol. 97, 348–368 (2018)
Article Google Scholar
Ye, Y.J., Zhang, X.H., Sun, J.: Automated vehicle’s behavior decision making using deep reinforcement learning and high-fidelity simulation environment. Transp. Res. Part C Emerg. Technol. 107, 155–170 (2019)
Article Google Scholar
Zhou, M.F., Yu, Y., Qu, X.B.: Development of an efficient driving strategy for connected and automated vehicles at signalized intersections: a reinforcement learning approach. IEEE Trans. Intell. Transp. Syst. 21(1), 433–443 (2020)
Article Google Scholar
Qi, X.W., Luo, Y.D., Wu, G.Y., et al.: Deep reinforcement learning enabled self-learning control for energy efficient driving. Transp. Res. Part C Emerg. Technol. 99, 67–81 (2019)
Article Google Scholar
Bellman, R.: A Markovian decision process. J. Math. Mech. 6(5), 679–684 (1957)
MathSciNet MATH Google Scholar
Arulkumaran, K., Deisenroth, M.P., Brundage, M., et al.: Deep reinforcement learning: a brief survey. IEEE Signal Process. Mag. 34(6), 26–38 (2017)
Article Google Scholar
Dolcetta, I.C., Ishii, H.: Approximate solutions of the Bellman equation of deterministic control theory. Appl. Math. Optim. 11(1), 161–181 (1984)
Article MathSciNet MATH Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1998)
MATH Google Scholar
Mnih, V., Kavukcuoglu, K., Silver, D., et al.: Human-level control through deep reinforcement learning. Nature 518(7540), 529–533 (2015)
Article Google Scholar
Tokic, M., Palm, G.: Value-difference based exploration: adaptive control between epsilon-greedy and softmax. Paper presented at the 34th Annual German conference on Advances in artificial intelligence, Heidelberg, Berlin, 4–7 October (2011)
Wisch, M., Hellmann, A., Lerner, M., et al.: Car-to-car accidents at intersections in Europe and identification of use cases for the test and assessment of respective active vehicle safety systems. Paper presented at 26th International Technical Conference on the Enhanced Safety of Vehicles, Eindhoven, Netherlands, 10–13 June (2019)
Kusano, K.D., Gabler, H.C.: Target population for intersection advanced driver assistance systems in the U.S. SAE Int. J. Transp. Saf. 3(1), 1–16 (2015)
Article Google Scholar
Arikere, A., Yang, D.R., Klomp, M.: Optimal motion control for collision avoidance at left turn across path/opposite direction intersection scenarios using electric propulsion. Veh. Syst. Dyn. 57(5), 637–664 (2019)
Article Google Scholar
Sander, U., Lubbe, N., Pietzsch, S.: Intersection AEB implementation strategies for left turn across path crashes. Traffic Inj. Prev. 20(sup1), S119–S125 (2019)
Article Google Scholar
Ulrich, S., Nils, L.: The potential of clustering methods to define intersection test scenarios: assessing real-life performance of AEB. Accid. Anal. Prev. 113, 1–11 (2018)
Article Google Scholar
Dosovitskiy, A., Ros, G., Codevilla, F., et al.: CARLA: an open urban driving simulator (2017). arXiv preprint arXiv:1711.03938
Eidehall, A., Petersson, L.: Statistical threat assessment for general road scenes using Monte Carlo sampling. IEEE Trans. Intell. Transp. Syst. 9(1), 137–147 (2008)
Article Google Scholar

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China (Grant No. 51805332), the Young Elite Scientists Sponsorship Program funded by the China Society of Automotive Engineers, the Natural Science Foundation of Guangdong Province (Grant No. 2018A030310532), and the Shenzhen Fundamental Research Fund (Grant No. JCYJ20190808142613246).

Author information

Authors and Affiliations

Institute of Human Factors and Ergonomics, College of Mechatronics and Control Engineering, Shenzhen University, Shenzhen, 518060, China
Guofa Li, Shenglong Li & Xingda Qu
Department of Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, ON, N2L 3G1, Canada
Guofa Li & Dongpu Cao
Department of Civil and Environmental Engineering, University of Wisconsin-Madison, Madison, WI, 53706, USA
Shen Li
School of Mechanical Engineering, Beijing Institute of Technology, Beijing, 100081, China
Yechen Qin
State Key Laboratory of Automotive Safety and Energy, School of Vehicle and Mobility, Tsinghua University, Beijing, 100084, China
Bo Cheng

Authors

Guofa Li
View author publications
You can also search for this author in PubMed Google Scholar
Shenglong Li
View author publications
You can also search for this author in PubMed Google Scholar
Shen Li
View author publications
You can also search for this author in PubMed Google Scholar
Yechen Qin
View author publications
You can also search for this author in PubMed Google Scholar
Dongpu Cao
View author publications
You can also search for this author in PubMed Google Scholar
Xingda Qu
View author publications
You can also search for this author in PubMed Google Scholar
Bo Cheng
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shen Li.

Ethics declarations

Conflict of interest

On behalf of all authors, the corresponding authors state that there is no conflict of interest.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Li, G., Li, S., Li, S. et al. Deep Reinforcement Learning Enabled Decision-Making for Autonomous Driving at Intersections. Automot. Innov. 3, 374–385 (2020). https://doi.org/10.1007/s42154-020-00113-1

Download citation

Received: 14 January 2020
Accepted: 21 July 2020
Published: 13 November 2020
Issue Date: December 2020
DOI: https://doi.org/10.1007/s42154-020-00113-1

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Deep Reinforcement Learning Enabled Decision-Making for Autonomous Driving at Intersections

Abstract

Access this article

Similar content being viewed by others

A performance comparison of YOLOv8 models for traffic sign detection in the Robotaxi-full scale autonomous vehicle competition

Connected and Autonomous Vehicles and Infrastructures: A Literature Review

Traffic sign recognition based on deep learning

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Deep Reinforcement Learning Enabled Decision-Making for Autonomous Driving at Intersections

Abstract

Access this article

Similar content being viewed by others

A performance comparison of YOLOv8 models for traffic sign detection in the Robotaxi-full scale autonomous vehicle competition

Connected and Autonomous Vehicles and Infrastructures: A Literature Review

Traffic sign recognition based on deep learning

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation