Planning Maneuvers for Autonomous Driving Based on Offline Reinforcement Learning: Comparative Study

Melkumov, Mikhail; Panov, Aleksandr I.

doi:10.1007/978-3-031-43789-2_6

Mikhail Melkumov¹² &
Aleksandr I. Panov¹³

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 776))

Included in the following conference series:

International Conference on Intelligent Information Technologies for Industry

305 Accesses

Abstract

One of the key challenges in developing autonomous vehicles is planning safe and efficient trajectories in complex environments, such as intersections. This paper proposes an offline RL approach for planning trajectories for autonomous vehicles at crossroads with other actors. It enables the possibility of using pre-recorded expert trajectories for algorithm tuning. We study the influence of the quality of collected trajectories on various offline reinforcement learning methods. Our approach has the potential to overcome the limitations of online RL and provide an effective planning solution for autonomous vehicles in dynamic environments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 149.00; Price excludes VAT (USA)

Softcover Book: USD 199.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Carla challenge. https://leaderboard.carla.org/challenge/
Vinyals, O., et al.: Grandmaster level in StarCraft II using multi-agent reinforcement learning. Nature 575, 350–354 (2019). https://doi.org/10.1038/s41586-019-1724-z
Article Google Scholar
Althoff, M., Koschi, M., Manzinger, S.: Commonroad: composable benchmarks for motion planning on roads (2017). https://doi.org/10.1109/IVS.2017.7995802
Wang, B., Gong, J., Chen, H.: Motion primitives representation, extraction and connection for automated vehicle motion planning applications. IEEE Trans. Intell. Transp. Syst. 21, 3931–3945 (2020)
Article Google Scholar
Badue, C., et al.: Self-driving cars: a survey. Expert Syst. Appl. 165, 113816 (2021). https://doi.org/10.1016/j.eswa.2020.113816
Article Google Scholar
Cheng, J., Chen, Y., Zhang, Q., Gan, L., Liu, C., Liu, M.: Real-time trajectory planning for autonomous driving with gaussian process and incremental refinement. In: 2022 International Conference on Robotics and Automation (ICRA), pp. 8999–9005. IEEE (2022)
Google Scholar
Dixit, S.: Trajectory planning for autonomous high-speed overtaking in structured environments using robust MPC. IEEE Trans. Intell. Transp. Syst. 21, 2310–2323 (2020)
Article Google Scholar
Esterle, K., Kessler, T., Knoll, A.: Optimal behavior planning for autonomous driving: a generic mixed-integer formulation. In: 2020 IEEE Intelligent Vehicles Symposium (IV), pp. 1914–1921. IEEE (2020)
Google Scholar
Fujimoto, S., Gu, S.S.: A minimalist approach to offline reinforcement learning (2021)
Google Scholar
Fujimoto, S., Meger, D., Precup, D.: Off-policy deep reinforcement learning without exploration (2019)
Google Scholar
Mouhagir, H., Talj, R., Cherfaoui, V., Aioun, F., Guillemard, F.: Evidential-based approach for trajectory planning with tentacles, for autonomous vehicles. IEEE Trans. Intell. Transp. Syst. 21, 3485–3496 (2020)
Article Google Scholar
Haarnoja, T., et al.: Soft actor-critic algorithms and applications (2018). http://arxiv.org/abs/1812.05905
He, R., et al.: TDR-OBCA: a reliable planner for autonomous driving in free-space environment (2020). http://arxiv.org/abs/2009.11345
Hoel, C.J., Tram, T., Sjöberg, J.: Reinforcement learning with uncertainty estimation for tactical decision-making in intersections (2020). http://arxiv.org/abs/2006.09786
Isele, D., Rahimi, R., Cosgun, A., Subramanian, K., Fujimura, K.: Navigating occluded intersections with autonomous vehicles using deep reinforcement learning (2017). http://arxiv.org/abs/1705.01196
Janner, M., Li, Q., Levine, S.: Offline reinforcement learning as one big sequence modeling problem. In: Ranzato, M., Beygelzimer, A., Dauphin, Y., Liang, P., Vaughan, J.W. (eds.) Advances in Neural Information Processing Systems, vol. 34, pp. 1273–1286. Curran Associates, Inc. (2021)
Google Scholar
Kessler, T., Esterle, K., Knoll, A.: Linear differential games for cooperative behavior planning of autonomous vehicles using mixed-integer programming. In: 2020 59th IEEE Conference on Decision and Control (CDC), pp. 4060–4066. IEEE (2020)
Google Scholar
Kessler, T., Esterle, K., Knoll, A.: Mixed-integer motion planning on German roads within the Apollo driving stack. IEEE Trans. Intell. Veh. 8(1), 851–867 (2022)
Article Google Scholar
Khaitan, S., Dolan, J.M.: State dropout-based curriculum reinforcement learning for self-driving at unsignalized intersections (2022). http://arxiv.org/abs/2207.04361
Kumar, A., Zhou, A., Tucker, G., Levine, S.: Conservative q-learning for offline reinforcement learning (2020)
Google Scholar
Lee, D.H., Liu, J.L.: End-to-end deep learning of lane detection and path prediction for real-time autonomous driving (2021). http://arxiv.org/abs/2102.04738
Ljungqvist, O., Evestedt, N., Cirillo, M., Axehill, D., Holmer, O.: Lattice-based motion planning for a general 2-trailer system. In: 2017 IEEE Intelligent Vehicles Symposium (IV), pp. 819–824. IEEE (2017)
Google Scholar
Martinson, M., Skrynnik, A., Panov, A.I.: Navigating autonomous vehicle at the road intersection simulator with reinforcement learning. In: Kuznetsov, S.O., Panov, A.I., Yakovlev, K.S. (eds.) RCAI 2020. LNCS (LNAI), vol. 12412, pp. 71–84. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-59535-7_6
Chapter Google Scholar
Nair, A., Gupta, A., Dalal, M., Levine, S.: AWAC: accelerating online reinforcement learning with offline datasets (2021)
Google Scholar
Berner, C., et al.: Dota 2 with large scale deep reinforcement learning (2019)
Google Scholar
Paden, B., Čáp, M., Yong, S.Z., Yershov, D., Frazzoli, E.: A survey of motion planning and control techniques for self-driving urban vehicles. IEEE Trans. Intell. Veh. 1(1), 33–55 (2016). https://doi.org/10.1109/TIV.2016.2578706
Article Google Scholar
Prakash, A., Chitta, K., Geiger, A.: Multi-modal fusion transformer for end-to-end autonomous driving (2021). http://arxiv.org/abs/2104.09224
Prudencio, R.F., Maximo, M.R.O.A., Colombini, E.L.: A survey on offline reinforcement learning: taxonomy, review, and open problems (2022). http://arxiv.org/abs/2203.01387
Seno, T., Imai, M.: d3rlpy: an offline deep reinforcement learning library. J. Mach. Learn. Res. 23(315), 1–20 (2022). http://jmlr.org/papers/v23/22-0017.html
Shikunov, M., Panov, A.I.: Hierarchical reinforcement learning approach for the road intersection task. In: Samsonovich, A.V. (ed.) BICA 2019. AISC, vol. 948, pp. 495–506. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-25719-4_64
Chapter Google Scholar
Skrynnik, A., Staroverov, A., Aitygulov, E., Aksenov, K., Davydov, V., Panov, A.I.: Forgetful experience replay in hierarchical reinforcement learning from expert demonstrations. Knowl.-Based Syst. 218, 106844 (2021)
Google Scholar
Skrynnik, A., Staroverov, A., Aitygulov, E., Aksenov, K., Davydov, V., Panov, A.I.: Hierarchical deep Q-network from imperfect demonstrations in minecraft. Cogn. Syst. Res. 65, 74–78 (2021). https://doi.org/10.1016/j.cogsys.2020.08.012
Spanogiannopoulos, S., Zweiri, Y., Seneviratne, L.: Sampling-based non-holonomic path generation for self-driving cars. J. Intell. Robot. Syst. 104(1), 1–17 (2022)
Article Google Scholar
Zhang, T., Fu, M., Song, W., Yang, Y., Wang, M.: Trajectory planning based on spatio-temporal map with collision avoidance guaranteed by safety strip. IEEE Trans. Intell. Transp. Syst. 23, 1030–1043 (2022)
Article Google Scholar
The Autoware Foundation: Autoware. https://www.autoware.org/
Lim, W., Lee, S., Sunwoo, M., Jo, K.: Hybrid trajectory planning for autonomous driving in on-road dynamic scenarios. IEEE Trans. Intell. Transp. Syst. 22, 341–355 (2021)
Article Google Scholar
Wang, X., Krasowski, H., Althoff, M.: CommonRoad-RL: a configurable reinforcement learning environment for motion planning of autonomous vehicles. In: IEEE International Conference on Intelligent Transportation Systems (ITSC) (2021). https://doi.org/10.1109/ITSC48978.2021.9564898
Wang, Z., et al.: Critic regularized regression (2021)
Google Scholar
Wu, Y., Tucker, G., Nachum, O.: Behavior regularized offline reinforcement learning (2019)
Google Scholar
Yudin, D.A., Skrynnik, A., Krishtopik, A., Belkin, I., Panov, A.I.: Object detection with deep neural networks for reinforcement learning in the task of autonomous vehicles path planning at the intersection. Opt. Mem. Neural Netw. 28(4), 283–295 (2019). https://doi.org/10.3103/S1060992X19040118
Article Google Scholar
Zhou, J., et al.: DL-IAPS and PJSO: a path/speed decoupled trajectory optimization and its application in autonomous driving (2020). http://arxiv.org/abs/2009.11135

Download references

Author information

Authors and Affiliations

Moscow Institute of Physics and Technology, Moscow, Russia
Mikhail Melkumov
Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences, Moscow, Russia
Aleksandr I. Panov

Authors

Mikhail Melkumov
View author publications
You can also search for this author in PubMed Google Scholar
Aleksandr I. Panov
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mikhail Melkumov .

Editor information

Editors and Affiliations

Rostov State Transport University, Rostov-on-Don, Russia
Sergey Kovalev
St. Petersburg Federal Research Center of the Russian Academy of Sciences, St. Petersburg, Russia
Igor Kotenko
Rostov State Transport University, Rostov-on-Don, Russia
Andrey Sukhanov

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Melkumov, M., Panov, A.I. (2023). Planning Maneuvers for Autonomous Driving Based on Offline Reinforcement Learning: Comparative Study. In: Kovalev, S., Kotenko, I., Sukhanov, A. (eds) Proceedings of the Seventh International Scientific Conference “Intelligent Information Technologies for Industry” (IITI’23). IITI 2023. Lecture Notes in Networks and Systems, vol 776. Springer, Cham. https://doi.org/10.1007/978-3-031-43789-2_6

Download citation

DOI: https://doi.org/10.1007/978-3-031-43789-2_6
Published: 21 September 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-43788-5
Online ISBN: 978-3-031-43789-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

Planning Maneuvers for Autonomous Driving Based on Offline Reinforcement Learning: Comparative Study