LSTM training set analysis and clustering model development for short-term traffic flow prediction

Doğan, Erdem

doi:10.1007/s00521-020-05564-5

LSTM training set analysis and clustering model development for short-term traffic flow prediction

Original Article
Published: 11 January 2021

Volume 33, pages 11175–11188, (2021)
Cite this article

Neural Computing and Applications Aims and scope Submit manuscript

Erdem Doğan ORCID: orcid.org/0000-0001-7802-641X¹

941 Accesses
13 Citations
Explore all metrics

Abstract

Long short-term memory (LSTM) is becoming increasingly popular in the short-term flow. In order to develop high-quality prediction models, it is worth investigating the LSTM potential deeply for traffic flow prediction. This study has two objectives: first, to observe the effect of using different sized training sets in LSTM training for various and numerous databases; second, to develop a clustering model that contributes to adjusting the training set size. For this purpose, 83 datasets were divided into certain sizes and LSTM model performances were examined depending on these training set sizes. As a result, enlargement of the training set size reduced LSTM errors monotonic for certain datasets. This phenomenon was modeled with the state-of-the-art clustering algorithms, such as K-nearest neighbor, support vector machine (SVM), logistic regression and pattern recognition networks (PRNet). In these models, statistical properties of datasets were utilized as input. The best results were obtained by PRNet, and SVM model performance was closest to PRNet. This study indicates that enlarging the training set size in traffic flow prediction increases the LSTM performance monotonically for specific datasets. In addition, a high-precision clustering model is presented to assist researchers in short-term traffic forecasting to adjust the size of the training set.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Robust-LSTM: a novel approach to short-traffic flow prediction based on signal decomposition

Article 05 April 2022

Short-Term Traffic Flow Intensity Prediction Based on CHS-LSTM

Article 31 August 2020

Short-Time Prediction of Subway Inbound Passenger Flow Based on K-means Clustering Combination Model

References

Zhao Z, Chen W, Wu X et al (2017) LSTM network: a deep learning approach for short-term traffic forecast. IET Intel Transport Syst 11:68–75. https://doi.org/10.1049/iet-its.2016.0208
Article Google Scholar
Ahmed MS, Cook AR (1979) Analysis of freeway traffic time-series data by using box-jenkins techniques. Transp Res Rec. https://doi.org/10.3141/2024-03
Article Google Scholar
van Lint JWC, van Hinsbergen C (2012) Short term traffic and travel time prediction models, in artificial intelligence applications to critical transportation issues Transportation Research Circular National. Academies Press, Washington DC, Number E-C168
Kamarianakis Y, Prastacos P (2003) forecasting traffic flow conditions in an urban network: comparison of multivariate and univariate approaches. Transp Res Rec 1857:74–84. https://doi.org/10.3141/1857-09
Article Google Scholar
Park D, Rilett LR (1998) Forecasting multiple-period freeway link travel times using modular neural networks. Transp Res Rec: J Transp Res Board 1617:163–170
Article Google Scholar
Wu CH, Ho JM, Lee DT (2004) Travel-time prediction with support vector regression. IEEE Trans Intell Transp Sys 5(4):276–281
Article Google Scholar
Nikovski D, Nishiuma N, Goto Y, Kumazawa H (2005) Univariate short-term prediction of road travel times. In: IEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC. pp 1074–1079
Nagel K, Schreckenberg M (1992) A cellular automaton model for freeway traffic. J de Phys I 2:2221. https://doi.org/10.1051/jp1:1992277
Article Google Scholar
Cremer M (1995) On the calculation of individual travel times by macroscopic models. In: Pacific Rim TransTech Conference. 1995 Vehicle Navigation and Information Systems Conference Proceedings. 6th International VNIS. A Ride into the Future. IEEE, pp 187–193
Williams BM (2001) Multivariate vehicular traffic flow prediction: Evaluation of ARIMAX modeling. Transp Res Rec. https://doi.org/10.3141/1776-25
Article Google Scholar
Wu C-H, Ho J-M, Lee DT (2004) Travel-Time Prediction With Support Vector Regression. IEEE Trans Intell Transp Syst 5:276–281. https://doi.org/10.1109/TITS.2004.837813
Article Google Scholar
Lingras P, Sharma S, Zhong M (2002) Prediction of recreational travel using genetically designed regression and time-delay neural network models. Transp Res Rec 1805(1):16–24
Article Google Scholar
Smith BL, Williams BM, Keith Oswald R (2002) Comparison of parametric and nonparametric models for traffic flow forecasting. Transp Res Part C: Emerg Technol 10:303–321. https://doi.org/10.1016/S0968-090X(02)00009-8
Article Google Scholar
Kamarianakis Y, Prastacos P (2003) Forecasting Traffic Flow Conditions in an Urban Network: Comparison of Multivariate and Univariate Approaches. Transp Res Rec: J Transp Res Board 1857:74–84. https://doi.org/10.3141/1857-09
Article Google Scholar
Sun B, Cheng W, Goswami P, Bai G (2018) Short-term traffic forecasting using self-adjusting k-nearest neighbours. IET Intel Transport Syst 12:41–48. https://doi.org/10.1049/iet-its.2016.0263
Article Google Scholar
Doğan E (2020) Short-term Traffic Flow Prediction Using Artificial Intelligence with Periodic Clustering and Elected Set. Promet-Traffic Transp 32:65–78
Article Google Scholar
Xie Y, Zhang Y, Ye Z (2007) Short-term traffic volume forecasting using Kalman filter with discrete wavelet decomposition. Computer-Aided Civil and Infrastructure Engineering 22:326–334. https://doi.org/10.1111/j.1467-8667.2007.00489.x
Article Google Scholar
Tsekeris T, Stathopoulos A (2010) Short-Term Prediction of Urban Traffic Variability: Stochastic Volatility Modeling Approach. Journal of Transportation Engineering 136:606–613. https://doi.org/10.1061/(ASCE)TE.1943-5436.0000112
Article Google Scholar
McCrea J, Moutari S (2010) A hybrid macroscopic-based model for traffic flow in road networks. Eur J Operational Res 207:676–684. https://doi.org/10.1016/j.ejor.2010.05.018
Article MathSciNet MATH Google Scholar
Vlahogianni EI, Karlaftis MG, Golias JC (2014) Short-term traffic forecasting: Where we are and where we’re going. Transportation Research Part C: Emerging Technologies 43:3–19. https://doi.org/10.1016/j.trc.2014.01.005
Article Google Scholar
Hochreiter S, Schmidhuber J (1997) Long Short-Term Memory. Neural Comput 9:1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735
Article Google Scholar
Yang B, Sun S, Li J et al (2019) Traffic flow prediction using LSTM with feature enhancement. Neurocomputing 332:320–327. https://doi.org/10.1016/j.neucom.2018.12.016
Article Google Scholar
Liu J (2017) LSTM network: a deep learning approach for short-term traffic forecast. IET Intel Transport Sys 11:68–75
Article Google Scholar
Lv Y, Duan Y, Kang W et al (2015) Traffic Flow Prediction with Big Data: A Deep Learning Approach. IEEE Trans Intell Transp Syst 16:865–873. https://doi.org/10.1109/TITS.2014.2345663
Article Google Scholar
Wu Y, Tan H, Qin L et al (2018) A hybrid deep learning based traffic flow prediction method and its understanding. Transp Res Part C: Emerg Technol 90:166–180. https://doi.org/10.1016/j.trc.2018.03.001
Article Google Scholar
Tian Y, Zhang K, Li J et al (2018) LSTM-based traffic flow prediction with missing data. Neurocomputing 318:297–305. https://doi.org/10.1016/j.neucom.2018.08.067
Article Google Scholar
Zhao Z, Chen W, Wu X et al (2017) LSTM network: a deep learning approach for short-term traffic forecast. IET Intel Transport Syst. https://doi.org/10.1049/iet-its.2016.0208
Article Google Scholar
Luo X, Li D, Yang Y (2019) Zhang S (2019) Spatiotemporal traffic flow prediction with KNN and LSTM. Journal of Advanced Transportation 5:1–10. https://doi.org/10.1155/2019/4145353
Article Google Scholar
Yu R, Li Y, Shahabi C, et al (2017) Deep learning: A generic approach for extreme condition traffic forecasting. In: Proceedings of the 2017 SIAM International Conference on Data Mining. SIAM, pp 777–785
Hochreiter S (1998) The vanishing gradient problem during learning recurrent neural nets and problem solutions. Int J Uncertain, Fuzziness Knowl-Based Sys 6:107–116
Article Google Scholar
Polson NG, Sokolov VO (2017) Deep learning for short-term traffic flow prediction. Transp Res Part C: Emerg Technol 79:1–17. https://doi.org/10.1016/j.trc.2017.02.024
Article Google Scholar
Çodur MY, Tortum A (2015) An Artificial Neural Network Model for Highway Accident Prediction: A Case Study of Erzurum, Turkey. PROMET-Traffic Transp 27:217–225. https://doi.org/10.7307/ptt.v27i3.1551
Article Google Scholar
Celikoglu HB (2013) An Approach to Dynamic Classification of Traffic Flow Patterns. Comp-Aided Civil Infrastruct Eng 28:273–288. https://doi.org/10.1111/j.1467-8667.2012.00792.x
Article Google Scholar
Akgüngör AP, Doğan E (2009) An artificial intelligent approach to traffic accident estimation: Model development and application. Transport 24:135–142. https://doi.org/10.3846/1648-4142.2009.24.135-142
Article Google Scholar
PeMS PeMS Data Clearinghouse. http://pems.dot.ca.gov/?dnode=Clearinghouse. Accessed 12 Aug 2019
Mann HB (1945) Nonparametric tests against trend. Econometrica 13(3):245–259. https://doi.org/10.2307/1907187
Article MathSciNet MATH Google Scholar
Kendall MG (1948) Rank correlation methods. Griffin
Yu Y-S, Zou S, Whittemore D (1993) Non-parametric trend analysis of water quality data of rivers in Kansas. J Hydrol 150:61–80
Article Google Scholar
Bishop CM (2006) Pattern recognition and machine learning. Springer-Verlag, New York
MATH Google Scholar
Mitchell TM (1997) Artificial neural networks. Machine Learn 45:81–127
Google Scholar
Kingma DP, Ba JL (2015) Adam: A method for stochastic optimization. 3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings 1–15
Reimers N, Gurevych I (2017) Optimal hyperparameters for deep LSTM-networks for sequence labeling tasks. ArXiv, abs/1707.06799
Møller MF (1993) A scaled conjugate gradient algorithm for fast supervised learning. Neural Networks 6:525–533
Article Google Scholar
Jolliffe IT, Cadima J (2016) Principal component analysis: a review and recent developments. Philosl Trans R Soc A: Math, Phys Eng Sci 374:20150202
Article MathSciNet Google Scholar

Download references

Funding

Not applicable.

Author information

Authors and Affiliations

Civil Engineering Department, Kırıkkale University, Yahşihan, Kırıkkale, 71451, Turkey
Erdem Doğan

Authors

Erdem Doğan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Erdem Doğan.

Ethics declarations

Conflict of interest

The author declares that they have no conflict of interest.

Avaliability of data and material

The datasets generated during and/or analyzed during the current study are available in the [Caltrans Performance Measurement System] repository, [http://pems.dot.ca.gov/?dnode=Clearinghouse&type=station_hour&district_id=4&submit=Submit/

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

See Table

Table 2 Experimental dataset statistics

Full size table

2

Rights and permissions

Reprints and permissions

About this article

Cite this article

Doğan, E. LSTM training set analysis and clustering model development for short-term traffic flow prediction. Neural Comput & Applic 33, 11175–11188 (2021). https://doi.org/10.1007/s00521-020-05564-5

Download citation

Received: 27 April 2020
Accepted: 01 December 2020
Published: 11 January 2021
Issue Date: September 2021
DOI: https://doi.org/10.1007/s00521-020-05564-5

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

LSTM training set analysis and clustering model development for short-term traffic flow prediction

Abstract

Access this article

Similar content being viewed by others

Robust-LSTM: a novel approach to short-traffic flow prediction based on signal decomposition

Short-Term Traffic Flow Intensity Prediction Based on CHS-LSTM

Short-Time Prediction of Subway Inbound Passenger Flow Based on K-means Clustering Combination Model

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Avaliability of data and material

Additional information

Publisher's Note

Appendix

Appendix

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation