Robust Deep Reinforcement Learning for Traffic Signal Control

Tan, Kai Liang; Sharma, Anuj; Sarkar, Soumik

doi:10.1007/s42421-020-00029-6

Robust Deep Reinforcement Learning for Traffic Signal Control

Original Paper
Published: 01 December 2020

Volume 2, pages 263–274, (2020)
Cite this article

Journal of Big Data Analytics in Transportation Aims and scope Submit manuscript

Kai Liang Tan¹,
Anuj Sharma² &
Soumik Sarkar¹

828 Accesses
11 Citations
Explore all metrics

Abstract

A traffic signal is a fundamental part of the traffic control system to reduce congestion and enhance safety. Since the inception of motorized vehicles, traffic signal controllers are put in place to coordinate and maintain traffic flow. With the number of vehicles on the road increasing exponentially, it is imperative to innovate new traffic control frameworks to cope with the high-density traffic demand. In this regard, recent advances in machine/deep learning have enabled significant progress towards reducing congestion using reinforcement learning for traffic signal control. However, most of these works are still not ready for deployment due to assumptions of perfect knowledge of the traffic environment. In reality, congestion detection or prediction systems are at best able to approximate the traffic state with significant noise. In this work, we propose a robust training framework for reinforcement learning agents that can handle such noisy approximation of the traffic states. Specifically, we show that by carefully adding synthetic perturbations to the state space, such as the queue length during training, the reinforcement learning agents can be robustified. Conceptually, our approach is similar to adversarial training schemes and can lead to successful deployment of reinforcement learning agent-based traffic signal controllers.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Deep learning: systematic review, models, challenges, and research directions

Article Open access 07 September 2023

Multi-agent deep reinforcement learning: a survey

Article Open access 15 April 2021

A performance comparison of YOLOv8 models for traffic sign detection in the Robotaxi-full scale autonomous vehicle competition

Article 12 August 2023

References

Arel I, Liu C, Urbanik T, Kohls AG (2010) Reinforcement learning-based multi-agent system for network traffic signal control. IET Intell Trans Syst 4(2):128–135
Article Google Scholar
Barceló J, Casas J (2005) Dynamic network simulation with aimsun. In: Simulation approaches in transportation analysis, pp. 57–98. Springer
Bellman R (1966) Dynamic programming. Science 153(3731):34–37
Article Google Scholar
Chakraborty P, Adu-Gyamfi YO, Poddar S, Ahsani V, Sharma A, Sarkar S (2018) Traffic congestion detection from camera images using deep convolution neural networks. Trans Res Record 2672(45):222–231
Article Google Scholar
Chu T, Wang J, Codecà L, Li Z (2019) Multi-agent deep reinforcement learning for large-scale traffic signal control. IEEE Trans Intell Trans Syst 21(3):1086–1095
Article Google Scholar
Cohen JM, Rosenfeld E, Kolter JZ (2019) Certified adversarial robustness via randomized smoothing. CoRR abs/1902.02918. arXiv:abs/1902.02918
Esfandiari Y, Ebrahimi K, Balu A, Elia N, Vaidya U, Sarkar S (2019) A saddle-point dynamical system approach for robust deep learning. arXiv preprint arXiv:1910.08623
Group P (2019) Vissim. http://vision-traffic.ptvgroup.com/en-us/products/ptv-vissim/
Havens A, Jiang Z, Sarkar S (2018) Online robust policy learning in the presence of unknown adversaries. In: Advances in neural information processing systems 9916–9926
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, 770–778
Joshi A, Mukherjee A, Sarkar S, Hegde C (2019) Semantic adversarial attacks: Parametric transformations that fool deep classifiers. In: Proceedings of the IEEE international conference on computer vision 4773–4783
Koonce P, Rodegerdts L, Lee K, Quayle S, Beaird S, Braud C, Bonneson J, Tarnoff P, Urbanik T (2010) Traffic Signal Timing Manual: Chapter 5 - Office of Operations http://ops.fhwa.dot.gov/publications/fhwahop08024/chapter5.htm
Koonce P, Rodegerdts L, Lee K, Quayle S, Beaird S, Braud C, Bonneson J, Tarnoff P, Urbanik T (2010) Traffic signal timing manual: Chapter 6 - Detectors. http://ops.fhwa.dot.gov/publications/fhwahop08024/chapter5.htm
Krajzewicz D, Hertkorn G, Rössel C, Wagner P (2002) Sumo (simulation of urban mobility)-an open-source traffic simulation. In: Proceedings of the 4th middle East Symposium on Simulation and Modelling (MESM20002), 183–187
Kurakin A, Goodfellow I, Bengio S (2016) Adversarial machine learning at scale. arXiv preprint arXiv:1611.01236
Lasley P (2019) 2019 urban mobility report
Lee XY, Ghadai S, Tan KL, Hegde C, Sarkar S (2020) Spatiotemporally constrained action space attacks on deep reinforcement learning agents. In: AAAI, 4577–4584
Li B, Cheng W, Li L (2018) Real-time prediction of lane-based queue lengths for signalized intersections. J Adv Trans 2018:
Li L, Lv Y, Wang FY (2016) Traffic signal timing via deep reinforcement learning. IEEE/CAA J Automatica Sinica 3(3):247–254
Article MathSciNet Google Scholar
Li X, Ouyang Y (2012) Reliable traffic sensor deployment under probabilistic disruptions and generalized surveillance effectiveness measures. Oper Res 60(5):1183–1198
Article MathSciNet Google Scholar
Liang X, Du X, Wang G, Han Z (2019) A deep reinforcement learning network for traffic light cycle control. IEEE Trans Vehicular Technol 68(2):1243–1253
Article Google Scholar
Lin Y, Dai X, Li L, Wang FY (2018) An efficient deep reinforcement learning model for urban traffic control. arXiv preprint arXiv:1808.01876
Liu C, Zhao M, Sharma A, Sarkar S (2019) Traffic dynamics exploration and incident detection using spatiotemporal graphical modeling. J Big Data Anal Trans 1(1):37–55
Article Google Scholar
Liu HX, Wu X, Ma W, Hu H (2009) Real-time queue length estimation for congested signalized intersections. Trans Res Part C 17(4):412–427
Article Google Scholar
Liu M, Deng J, Xu M, Zhang X, Wang W (2017) Cooperative deep reinforcement learning for tra ic signal control. In: The 7th international workshop on Urban computing (UrbComp 2018)
Madry A, Makelov A, Schmidt L, Tsipras D, Vladu A (2017) Towards deep learning models resistant to adversarial attacks. arXiv preprint arXiv:1706.06083
Miller AJ (1963) Settings for fixed-cycle traffic signals. J Oper Res Soc 14(4):373–386
Article Google Scholar
Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG, Graves A, Riedmiller M, Fidjeland AK, Ostrovski G et al (2015) Human-level control through deep reinforcement learning. Nature 518(7540):529–533
Article Google Scholar
Morimoto J, Doya K (2005) Robust reinforcement learning. Neural Comput 17(2):335–359
Article MathSciNet Google Scholar
Mukherjee A, Joshi A, Hegde C, Sarkar S. Semantic domain adaptation for deep classifiers via gan-based data augmentation
Van der Pol E, Oliehoek FA (2016) Coordinated deep reinforcement learners for traffic light control. In: Proceedings of learning, inference and control of multi-agent systems (at NIPS 2016)
Rajagopal R, Varaiya PP (2007) Health of California’s loop detector system. University of., California PATH Program, Institute of Transportation Studies
Rodrigues F, Azevedo CL (2019) Towards robust deep reinforcement learning for traffic signal control: Demand surges, incidents and sensor failures. In: 2019 IEEE intelligent transportation systems conference (ITSC), pp. 3559–3566. IEEE
Roess RP, Prassas ES, McShane WR (2004) Traffic engineering. Pearson/Prentice Hall
Sharma A, Bullock DM, Bonneson JA (2007) Input-output and hybrid techniques for real-time prediction of delay and maximum queue length at signalized intersections. Trans Res Record 2035(1):69–80
Article Google Scholar
Tan KL, Esfandiari Y, Lee XY, Sarkar S, et al (2020) Robustifying reinforcement learning agents via action space adversarial training. In: 2020 American control conference (ACC), pp. 3959–3964. IEEE
Tan KL, Poddar S, Sarkar S, Sharma A (2019) Deep reinforcement learning for adaptive traffic signal control. In: ASME 2019 dynamic systems and control conference. American Society of Mechanical Engineers Digital Collection
Van Hasselt H, Guez A, Silver D (2016) Deep reinforcement learning with double q-learning. In: Thirtieth AAAI conference on artificial intelligence
Varaiya P (2013) Max pressure control of a network of signalized intersections. Trans Res Part C 36:177–195
Article MathSciNet Google Scholar
Webster F (1958) Traffic signal settings, road research technical paper no. 39. Road Research Laboratory
Wei H, Chen C, Wu K, Zheng G, Yu Z, Gayah V, Li Z (2019) Deep reinforcement learning for traffic signal control along arterials
Wei H, Zheng G, Yao H, Li Z (2018) Intellilight: A reinforcement learning approach for intelligent traffic light control. In: Proceedings of the 24th ACM SIGKDD International conference on knowledge discovery & data mining, 2496–2505
Wiering M (2000) Multi-agent reinforcement learning for traffic light control. In: Machine learning: proceedings of the seventeenth international conference (ICML’2000), 1151–1158
Wu L, Liu C, Huang T, Sharma A, Sarkar S (2017) Traffic sensor health monitoring using spatiotemporal graphical modeling. In: Proceedings of the 2nd ACM SIGKDD workshop on machine learning for prognostics and health management, 13–17

Download references

Acknowledgements

This work was supported in part by NSF grant CNS-1845969. We would like to thank Subhadipto Poddar for his insights in general traffic intersection knowledge.

Author information

Authors and Affiliations

Department of Mechanical Engineering, Iowa State University, Ames, Iowa, 50011, USA
Kai Liang Tan & Soumik Sarkar
Department of Civil Engineering, Iowa State University, Ames, Iowa, 50011, USA
Anuj Sharma

Authors

Kai Liang Tan
View author publications
You can also search for this author in PubMed Google Scholar
Anuj Sharma
View author publications
You can also search for this author in PubMed Google Scholar
Soumik Sarkar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Soumik Sarkar.

Ethics declarations

Conflicts of interest

Anuj Sharma works for ETALYC Inc. that works in the area of adaptive signal control.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Tan, K.L., Sharma, A. & Sarkar, S. Robust Deep Reinforcement Learning for Traffic Signal Control. J. Big Data Anal. Transp. 2, 263–274 (2020). https://doi.org/10.1007/s42421-020-00029-6

Download citation

Received: 13 August 2020
Revised: 29 October 2020
Accepted: 23 November 2020
Published: 01 December 2020
Issue Date: December 2020
DOI: https://doi.org/10.1007/s42421-020-00029-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Robust Deep Reinforcement Learning for Traffic Signal Control

Abstract

Access this article

Similar content being viewed by others

Deep learning: systematic review, models, challenges, and research directions

Multi-agent deep reinforcement learning: a survey

A performance comparison of YOLOv8 models for traffic sign detection in the Robotaxi-full scale autonomous vehicle competition

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflicts of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Robust Deep Reinforcement Learning for Traffic Signal Control

Abstract

Access this article

Similar content being viewed by others

Deep learning: systematic review, models, challenges, and research directions

Multi-agent deep reinforcement learning: a survey

A performance comparison of YOLOv8 models for traffic sign detection in the Robotaxi-full scale autonomous vehicle competition

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflicts of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation