Skip to main content
Log in

Spatiotemporal grid-based crash prediction—application of a transparent deep hybrid modeling framework

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

Traffic crashes are one of the significant causes of death worldwide, and the prediction of this event is complicated due to many contributing factors. This study used spatial, temporal, and spatiotemporal information to predict crashes in Chicago at 1 km grid levels. A Deep Hybrid Network (DHN) was developed by exploiting inherent unique characteristics of Convolution Neural Network (CNN), Long Short-term Memory (LSTM), and Deep Neural Network (DNN). The hyperparameters of the models were obtained through the Bayesian optimization algorithm. The proposed modeling framework investigated the feature importance, the spatial heterogeneity of predictions, the worst-performing spatial grids, and the spatial distribution of features pertinent to model performance. These analyses transform the proposed DHN into an interpretable and transparent model. The DHN model was compared with Logistic Regression (LR), DNN, CNN, LSTM, and bidirectional LSTM, and it outperformed the baseline models with an accuracy of 0.72, recall of 0.70, false alarm rate of 0.28, and AUC of 0.79. The top three essential features were time, weather, and taxi trips, consecutively. The grid-level distribution of prediction performance investigations revealed a consistent performance of all deep learning models in terms of failed grids (i.e., AUC is 0.5 or less). It was revealed that DHN has the fewest failed grids (i.e., 18 failed from 710 grids) among the experimented models. According to the district level analysis, the O’hare airport area and the central district had the fewest number of failed grids for all methods, while the far south district had the highest number of failed grids. In addition, it was observed that the passed grid had a higher average feature density than the failed grid.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Data availability

The data of this study are public and open sourced.

Abbreviations

AUC:

Area under curve

Bi-LSTM:

Bidirectional long short-term memory

CART:

Classification and regression tree

CNN:

Convolution neural network

DHN:

Deep hybrid network

DNN:

Deep neural network

DT:

Decision tree

GPS:

Global positioning system

ITS:

Intelligent transportation system

LSTM:

Long short-term memory

LR:

Logistic regression

PNN:

Probabilistic neural network

RNN:

Recurrent neural network

SVM:

Support vector machine

VMS:

Variable message signs

XGBOOST:

EXtreme gradient boosting

oC:

Centigrade

\({g}_{n}\) :

Grid (n)

k:

Timestamp (k)

K:

Total number of timestamps

L:

Vector of model prediction for each spatial grids

m:

Number of timesteps backward

N:

Number of spatial grids

\({u}_{n}\) :

Spatial features located in grid (n)

U:

Vector of spatial features

V:

Vector of input features

\({w}_{k}\) :

Temporal features for timestamp (k)

W:

Vector of temporal features

\({x}_{t,g}\) :

Spatiotemporal feature for grid \({g}_{n}\) and a timestamp \({t}_{k}\)

X:

Vector of spatiotemporal features

References

  1. WHO (2018) WHO | Global status report on road safety 2018. World Health Organization

  2. WISQARS (2020) WISQARS (Web-based Injury Statistics Query and Reporting System)|Injury Center|CDC. https://www.cdc.gov/injury/wisqars/. Accessed 3 Nov 2020

  3. WHO (2004) World report on road traffic injury prevention. https://www.who.int/publications/i/item/world-report-on-road-traffic-injury-prevention. Accessed 3 Nov 2020

  4. Alam A, Besselink B, Turri V et al (2015) Heavy-duty vehicle platooning for sustainable freight transportation: a cooperative method to enhance safety and efficiency. IEEE Control Syst Mag 35:34–56

    Article  Google Scholar 

  5. Parkhun S, Kim SM, Ha YG (2016) Highway traffic accident prediction using VDS big data analysis. J Supercomput 72:2815–2831. https://doi.org/10.1007/s11227-016-1624-z

    Article  Google Scholar 

  6. Tamim Kashifi M, Ahmad I (2022) Efficient histogram-based gradient boosting approach for accident severity prediction with multisource data. Transp Res Record: J Transp Res Board. https://doi.org/10.1177/03611981221074370

    Article  Google Scholar 

  7. Yan X, Wu J (2014) Effectiveness of variable message signs on driving behavior based on a driving simulation experiment. Discret Dyn Nat Soc. https://doi.org/10.1155/2014/206805

    Article  Google Scholar 

  8. Ren H, Song Y, Wang J, et al (2018) A Deep Learning Approach to the Citywide Traffic Accident Risk Prediction. IEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC 2018-Novem:3346–3351. https://doi.org/10.1109/ITSC.2018.8569437

  9. Bao J, Liu P, Qin X, Zhou H (2018) Understanding the effects of trip patterns on spatially aggregated crashes with large-scale taxi GPS data. Accid Anal Prev 120:281–294. https://doi.org/10.1016/j.aap.2018.08.014

    Article  Google Scholar 

  10. Chen Q, Song X, Yamada H, Shibasaki R (2016) Learning deep representation from big and heterogeneous data for traffic accident inference. 30th AAAI Conference on Artificial Intelligence, AAAI 2016 338–344

  11. Chen M, Mao S, Liu Y (2014) Big data: a survey. Mobile Netw Appl 19:171–209. https://doi.org/10.1007/s11036-013-0489-0

    Article  Google Scholar 

  12. Huang H, Abdel-Aty MA, Darwiche AL (2010) County-level crash risk analysis in Florida: Bayesian spatial modeling. Transp Res Rec. https://doi.org/10.3141/2148-04

    Article  Google Scholar 

  13. Rhee KA, Kim JK, Lee YI, Ulfarsson GF (2016) Spatial regression analysis of traffic crashes in Seoul. Accid Anal Prev 91:190–199. https://doi.org/10.1016/j.aap.2016.02.023

    Article  Google Scholar 

  14. Bao J, Liu P, Ukkusuri S (2019) A spatiotemporal deep learning approach for citywide short-term crash risk prediction with multi-source data. Accid Anal Prev 122:239–254. https://doi.org/10.1016/j.aap.2018.10.015

    Article  Google Scholar 

  15. Yuan J, Abdel-Aty M, Gong Y, Cai Q (2019) Real-time crash risk prediction using long short-term memory recurrent neural network. Transp Res Rec 2673:314–326. https://doi.org/10.1177/0361198119840611

    Article  Google Scholar 

  16. Abdel-Aty M, Uddin N, Pande A et al (2004) Predicting freeway crashes from loop detector data by matched case-control logistic regression. Transp Res Rec. https://doi.org/10.3141/1897-12

    Article  Google Scholar 

  17. Zheng Z, Ahn S, Monsere CM (2010) Impact of traffic oscillations on freeway crash occurrences. Accid Anal Prev 42:626–636. https://doi.org/10.1016/j.aap.2009.10.009

    Article  Google Scholar 

  18. Xu C, Liu P, Wang W, Li Z (2012) Evaluation of the impacts of traffic states on crash risks on freeways. Accid Anal Prev 47:162–171. https://doi.org/10.1016/j.aap.2012.01.020

    Article  Google Scholar 

  19. Abdel-Aty MA, Hassan HM, Ahmed M, Al-Ghamdi AS (2012) Real-time prediction of visibility related crashes. Transp Res Part C: Emerg Technol 24:288–298. https://doi.org/10.1016/j.trc.2012.04.001

    Article  Google Scholar 

  20. Shi Q, Abdel-Aty M (2015) Big Data applications in real-time traffic operation and safety monitoring and improvement on urban expressways. Transp Res Part C: Emerg Technol 58:380–394. https://doi.org/10.1016/j.trc.2015.02.022

    Article  Google Scholar 

  21. Wang L, Abdel-Aty M, Shi Q, Park J (2015) Real-time crash prediction for expressway weaving segments. Transp Res Part C: Emerg Technol 61:1–10. https://doi.org/10.1016/j.trc.2015.10.008

    Article  Google Scholar 

  22. Michie D, Spiegelhalter DJ, Taylor CC et al (1994) Machine learning. Neural Stat Class 13:1–298

    MATH  Google Scholar 

  23. Murphy KP (2012) Machine learning: a probabilistic perspective. MIT press

    MATH  Google Scholar 

  24. Jordan MI (1979) Mitchell TM (2015) Machine learning: trends, perspectives, and prospects. Science 349:255–260

    Article  Google Scholar 

  25. Marsland S (2015) Machine learning: an algorithmic perspective. CRC Press

    Google Scholar 

  26. Alpaydin E (2020) Introduction to machine learning. MIT press

    MATH  Google Scholar 

  27. Scholkopf B, Sung K-K, Burges CJC et al (1997) Comparing support vector machines with Gaussian kernels to radial basis function classifiers. IEEE Trans Signal Process 45:2758–2765

    Article  Google Scholar 

  28. Yu R, Abdel-Aty M (2013) Utilizing support vector machine in real-time crash risk evaluation. Accid Anal Prev. https://doi.org/10.1016/j.aap.2012.11.027

    Article  Google Scholar 

  29. Parsa AB, Taghipour H, Derrible S, Mohammadian A (2019) Real-time accident detection: Coping with imbalanced data. Accid Anal Prev 129:202–210. https://doi.org/10.1016/j.aap.2019.05.014

    Article  Google Scholar 

  30. Guo M, Zhao X, Yao Y et al (2021) A study of freeway crash risk prediction and interpretation based on risky driving behavior and traffic flow data. Accident Anal Prevention 160:106328

    Article  Google Scholar 

  31. Schmidhuber J (2015) Deep learning in neural networks: an overview. Neural Netw 61:85–117

    Article  Google Scholar 

  32. LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436–444

    Article  Google Scholar 

  33. Zhao Z, Chen W, Wu X et al (2017) LSTM network: a deep learning approach for short-term traffic forecast. IET Image Proc. https://doi.org/10.1049/iet-its.2016.0208

    Article  Google Scholar 

  34. Tian C, Ma J, Zhang C, Zhan P (2018) A deep neural network model for short-term load forecast based on long short-term memory network and convolutional neural network. Energies (Basel). https://doi.org/10.3390/en11123493

    Article  Google Scholar 

  35. Zheng Z, Yang Y, Liu J et al (2019) Deep and embedded learning approach for traffic flow prediction in urban informatics. IEEE Trans Intell Transp Syst. https://doi.org/10.1109/TITS.2019.2909904

    Article  Google Scholar 

  36. Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput. https://doi.org/10.1162/neco.1997.9.8.1735

    Article  Google Scholar 

  37. Bengio Y, Simard P, Frasconi P (1994) Learning long-term dependencies with gradient descent is difficult. IEEE Trans Neural Netw. 5(2):157

    Article  Google Scholar 

  38. Skrobek D, Krzywanski J, Sosnowski M et al (2020) Prediction of sorption processes using the deep learning methods (long short-term memory). Energies (Basel) 13:6601

    Article  Google Scholar 

  39. Alahi A, Goel K, Ramanathan V, et al (2016) Social lstm: Human trajectory prediction in crowded spaces. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 961–971

  40. Cui Z, Ke R, Pu Z, Wang Y (2018) Deep Bidirectional and Unidirectional {LSTM} Recurrent Neural Network for Network-wide Traffic Speed Prediction. CoRR abs/1801.0:1–11

  41. Li P, Abdel-Aty M, Yuan J (2020) Real-time crash risk prediction on arterials based on LSTM-CNN. Accid Anal Prev 135:105371. https://doi.org/10.1016/j.aap.2019.105371

    Article  Google Scholar 

  42. Zhang Z, He Q, Gao J, Ni M (2018) A deep learning approach for detecting traffic accidents from social media data. Transp Res Part C: Emerg Technol 86:580–596. https://doi.org/10.1016/j.trc.2017.11.027

    Article  Google Scholar 

  43. Chen C, Fan X, Zheng C, et al (2018) SDCAE: Stack Denoising Convolutional Autoencoder Model for Accident Risk Prediction Via Traffic Big Data 2018 Sixth International Conference on Advanced Cloud and Big Data SDCAE: Stack Denoising Convolutional Autoencoder Model for Accident Risk Prediction via Traffic Big Data. https://doi.org/10.1109/CBD.2018.00065

  44. Yuan Z, Zhou X, Yang T (2018) Hetero-convlstm: A deep learning approach to traffic accident prediction on heterogeneous spatio-temporal data. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery \& Data Mining. pp 984–992

  45. Data Commons (2020) Chicago - Place Explorer - Data Commons. https://datacommons.org/place/geoId/1714000. Accessed 27 Oct 2020

  46. Continental Motors (2020) How many cars are in Chicago? Commuter Vehicle Numbers. https://www.continentalmotors.com/blog/how-many-cars-are-in-chicago/. Accessed 3 Nov 2020

  47. Arguez A, Durre I, Applequist S, et al (2020) NOAA’s U.S. Climate Normals (1981–2010). Hourly Weather. NOAA National Centers for Environmental Information. https://www.ncdc.noaa.gov/cdo-web/datatools/lcd. Accessed 27 Oct 2020

  48. Chicago Data Portal (2020) Traffic Crashes - Crashes | City of Chicago | Data Portal. https://data.cityofchicago.org/Transportation/Traffic-Crashes-Crashes/85ca-t3if. Accessed 24 Oct 2020

  49. Worldwide Public Holidays (2021) Worldwide Public Holidays Listed by Country - qppstudio.net. https://www.qppstudio.net/worldwide-public-holidays/country-portal.htm. Accessed 31 May 2021

  50. Train KE (2003) Discrete choice methods with simulation. Cambridge University Press

    Book  MATH  Google Scholar 

  51. Vapnik V (1999) The nature of statistical learning theory. Springer science and business media

    MATH  Google Scholar 

  52. Wang S-C (2003) Artificial neural network. interdisciplinary computing in java programming. Springer

    Book  Google Scholar 

  53. Gers FA, Schmidhuber J, Cummins F (2000) Learning to forget: continual prediction with LSTM. Neural Comput 12:2451–2471. https://doi.org/10.1162/089976600300015015

    Article  Google Scholar 

  54. Xu D, Shi Y, Tsang IW et al (2020) Survey on multi-output learning. IEEE Trans Neural Netw Learn Syst 31:2409–2429. https://doi.org/10.1109/TNNLS.2019.2945133

    Article  MathSciNet  Google Scholar 

  55. Núñez JC, Cabido R, Pantrigo JJ et al (2018) Convolutional neural networks and long short-term memory for skeleton-based human activity and hand gesture recognition. Pattern Recogn. https://doi.org/10.1016/j.patcog.2017.10.033

    Article  Google Scholar 

  56. Zheng X, Chen W (2021) An Attention-based Bi-LSTM method for visual object classification via EEG. Biomed Signal Process Control 63:102174. https://doi.org/10.1016/j.bspc.2020.102174

    Article  Google Scholar 

  57. LeCun Y, Boser B, Denker JS et al (1989) Backpropagation applied to handwritten zip code recognition. Neural Comput. https://doi.org/10.1162/neco.1989.1.4.541

    Article  Google Scholar 

  58. Lecun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436–444. https://doi.org/10.1038/nature14539

    Article  Google Scholar 

  59. Rengasamy D, Jafari M, Rothwell B et al (2020) Deep learning with dynamically weighted loss function for sensor-based prognostics and health management. Sensors (Switzerland) 20:723. https://doi.org/10.3390/s20030723

    Article  Google Scholar 

  60. Snoek J, Larochelle H, Adams RP (2012) Practical Bayesian optimization of machine learning algorithms. Adv Neural Inf Process Syst 4:2951–2959

    Google Scholar 

  61. Ozenne B, Subtil F, Maucort-Boulch D (2015) The precision-recall curve overcame the optimism of the receiver operating characteristic curve in rare diseases. J Clin Epidemiol 68:855–859. https://doi.org/10.1016/j.jclinepi.2015.02.010

    Article  Google Scholar 

  62. Keselman HJ, Rogan JC (1977) The Tukey multiple comparison test: 1953–1976. Psychol Bull 84:1050

    Article  Google Scholar 

  63. Moosavi S, Samavatian MH, Parthasarathy S, et al (2019) Accident risk prediction based on heterogeneous sparse data: New dataset and insights. GIS: Proceedings of the ACM International Symposium on Advances in Geographic Information Systems 33–42. https://doi.org/10.1145/3347146.3359078

Download references

Acknowledgements

The authors would like to acknowledge the support of King Fahd University of Petroleum & Minerals in conducting this research.

Funding

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hassan Musaed Al-Ahmadi.

Ethics declarations

Conflict of interest

The authors acknowledge that there is no conflict of interest.

Consent for publication

All authors agreed to the submission.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kashifi, M.T., Al-Sghan, I.Y., Rahman, S.M. et al. Spatiotemporal grid-based crash prediction—application of a transparent deep hybrid modeling framework. Neural Comput & Applic 34, 20655–20669 (2022). https://doi.org/10.1007/s00521-022-07511-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-022-07511-y

Keywords

Navigation