Abstract
Accurately forecasting short-term traffic flow is essential for intelligent transportation systems. However, current methods often struggle to fully exploit implicit variation patterns and heterogeneous correlations in traffic flow data, and can be sensitive to non-Gaussian noise. In this paper, we propose a novel noise-immune and attention-based multi-modal model (NIAMNet) for short-term traffic flow forecasting. Inspired by the success of computer vision techniques, NIAMNet transforms one-dimensional traffic flow into images and embeds residual dual-attention blocks (RDB) to extract in-deep features. Besides, we introduce a dynamic noise-immune loss to address the impact of noise and outliers on model performance. Experimental results on four real-world benchmark datasets demonstrate the superiority of NIAMNet over existing methods, achieving the lowest MAPE (10.43, 9.79, 10.51, and 11.01) and RMSE (247.13, 192.36, 208.40, and 150.01). Additional ablation experiments are carried out to provide insight into the significance of each component. Our approach contributes to the development of more accurate and robust short-term traffic flow forecasting models.
Similar content being viewed by others
Data availability
The data that support the findings of this study are available from the corresponding author upon reasonable request.
References
Cai L, Zhang Z, Yang J, Yidan Y, Zhou T, Qin J (2019) A noise-immune Kalman filter for short-term traffic flow forecasting. Physica A Stat Mech Appl 536:122601. https://doi.org/10.1016/j.physa.2019.122601
Cai L, Yidan Yu, Zhang S, Song Y, Xiong Z, Zhou T (2020a) A sample-rebalanced outlier-rejected k-nearest neighbour regression model for short-term traffic flow forecasting. IEEE Access 8:22686–22696. https://doi.org/10.1109/ACCESS.2020.2970250
Cai W, Yang J, Yidan Yu, Song Y, Zhou T, Qin J (2020b) PSO-ELM: a hybrid learning model for short-term traffic flow forecasting. IEEE Access 8:6505–6514. https://doi.org/10.1109/ACCESS.2019.2963784
Cai L, Lei M, Zhang S, Yidan Yu, Zhou T, Qin J (2020c) A noise-immune LSTM network for short-term traffic flow forecasting. Chaos 30(2):023135. https://doi.org/10.1063/1.5120502
Chai W, Zheng Y, Tian L, Qin J, Zhou T (2023) GA-KELM: genetic-algorithm-improved kernel extreme learning machine for traffic flow forecasting. Mathematics 11(16):3574
Chan KY, Dillon TS, Singh J, Chang E (2011) Neural-network-based models for short-term traffic flow forecasting using a hybrid exponential smoothing and Levenberg-Marquardt algorithm. IEEE Trans Intell Transp Syst 13(2):644–654. https://doi.org/10.1109/TITS.2011.2174051
Chen B, Xing L, Zhao H, Zheng N, Prı JC et al (2016) Generalized correntropy for robust adaptive filtering. IEEE Trans Signal Process 64(13):3376–3387. https://doi.org/10.1109/TSP.2016.2539127
Chen L-C, Papandreou G, Kokkinos I, Murphy K, Yuille AL (2017) DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. IEEE Trans Pattern Anal Mach Intell 40(4):834–848. https://doi.org/10.1109/TPAMI.2017.2699184
Chen W, An J, Li R, Li F, Guoqi Xie Md, Bhuiyan ZA, Li K (2018) A novel fuzzy deep-learning approach to traffic flow prediction with uncertain spatial-temporal data features. Future Gener Comput Syst 89:78–88. https://doi.org/10.1016/j.future.2018.06.021
Chen B, Zheng Y, Ren P (2021) Error loss networks. arXiv preprint arXiv:2106.03722
Chen J, Kao S, He H, Zhuo W, Wen S, Lee C-H, Chan S-HG (2023) Run, don’t walk: chasing higher flops for faster neural networks. arXiv preprint arXiv:2303.03667
Cheng S, Feng L, Peng P (2020) Short-term traffic forecasting by mining the non-stationarity of spatiotemporal patterns. IEEE Trans Intell Transp Syst. https://doi.org/10.1109/TITS.2020.2991781
Cui Z, Huang B, Dou H, Cheng Y, Guan J, Zhou T (2022a) A two stages hybrid extreme learning model for short-term traffic flow forecasting. Mathematics. https://doi.org/10.3390/math10122087
Cui Z, Huang B, Dou H, Tan G, Zheng S, Zhou T (2022b) GSA-ELM: a hybrid learning model for short-term traffic flow forecasting. IET Intell Transp Syst 16(1):41–52. https://doi.org/10.1049/itr2.12127
Dosovitskiy A, Beyer L, Kolesnikov A, Weissenborn D, Zhai X, Unterthiner T, Dehghani M, Minderer M, Heigold G, Gelly S et al (2020) An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929
Dou H, Tan J, Wei H, Wang F, Yang J, Ma X-G, Wang J, Zhou T (2022) Transfer inhibitory potency prediction to binary classification: a model only needs a small training set. Comput Methods Programs Biomed. https://doi.org/10.1016/j.cmpb.2022.106633
Fang W, Zhuo W, Yan J, Song Y, Jiang D, Zhou T (2022) Attention meets long short-term memory: a deep learning network for traffic flow forecasting. Physica A Stat Mech Appl 587:126485. https://doi.org/10.1016/j.physa.2021.126485
Feng X, Ling X, Zheng H, Chen Z, Yiwen X (2018) Adaptive multi-kernel SVM with spatial-temporal correlation for short-term traffic flow prediction. IEEE Trans Intell Transp Syst 20(6):2001–2013. https://doi.org/10.1109/TITS.2018.2854913
Haibo X, Jiang C (2020) Deep belief network-based support vector regression method for traffic flow forecasting. Neural Comput Appl 32:2027–2036. https://doi.org/10.1007/s00521-019-04339-x
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Howard AG, Zhu M, Chen B, Kalenichenko D, Wang W, Weyand T, Andreetto M, Adam H (2017) MobileNets: efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861
Hu W, Li Yan, Liu K, Wang H (2016) A short-term traffic flow forecasting method based on the hybrid PSO-SVR. Neural Process Lett 43(1):155–172. https://doi.org/10.1007/s11063-015-9409-6
Huakang L, Huang D, Youyi S, Jiang D, Zhou T, Qin J (2020) ST-TrafficNet: a spatial-temporal deep learning network for traffic forecasting. Electronics 9(9):1–17. https://doi.org/10.3390/electronics9091474
Huakang L, Ge Z, Song Y, Jiang D, Zhou T, Qin J (2021) A temporal-aware LSTM enhanced by loss-switch mechanism for traffic flow forecasting. Neurocomputing 427:169–178. https://doi.org/10.1016/j.neucom.2020.11.026
Huang G, Liu Z, Van Der ML, Weinberger KQ (2017) Densely connected convolutional networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4700–4708
Huang H, Zheng S, Yang Z, Wu Y, Li Y, Qiu J, Cheng Y, Lin P, Lin Y, Ji Guan, Mikulis DJ, Zhou T, Wu R (2022) Voxel-based morphometry and a deep learning model for the diagnosis of early Alzheimer’s disease based on cerebral gray matter changes. Cereb Cortex. https://doi.org/10.1093/cercor/bhac099
Li X, Kang Y, Li F (2020) Forecasting with time series imaging. Expert Syst Appl 160:113680. https://doi.org/10.1016/j.eswa.2020.113680
Lippi M, Bertini M, Frasconi P (2013) Short-term traffic flow forecasting: an experimental comparison of time-series analysis and supervised learning. IEEE Trans Intell Transp Syst 14(2):871–882. https://doi.org/10.1109/TITS.2013.2247040
Liu W, Pokharel PP, Principe JC (2007) Correntropy: properties and applications in non-gaussian signal processing. IEEE Trans Signal Process 55(11):5286–5298. https://doi.org/10.1109/TSP.2007.896065
Luo Y, Wei M, Li S, Ling J, Xie G, Yao S (2023a) An effective co-support guided analysis model for multi-contrast MRI reconstruction. IEEE J Biomed Health Inform 27:2477–2488
Luo Y, Huang Q, Ling J, Lin K, Zhou T (2023b) Local and global knowledge distillation with direction-enhanced contrastive learning for single-image deraining. Knowledge-Based Systems, pp 1–10
Miglani A, Kumar N (2019) Deep learning models for traffic flow prediction in autonomous vehicles: a review, solutions, and challenges. Vehic Commun 20:100184. https://doi.org/10.1016/j.vehcom.2019.100184
Qiu J, Tan G, Lin Y, Guan J, Dai Z, Wang F, Zhuang C, Wilman AH, Huang H, Cao Z et al (2022) Automated detection of intracranial artery stenosis and occlusion in magnetic resonance angiography: a preliminary study based on deep learning. Magn Reson Imaging 94:105–111. https://doi.org/10.1016/j.mri.2022.09.006
Shahriari S, Ghasri M, Sisson SA, Rashidi T (2020) Ensemble of ARIMA: combining parametric and bootstrapping technique for traffic flow prediction. Transp A Transp Sci 16(3):1552–1573. https://doi.org/10.1080/23249935.2020.1764662
Song Z, Guo Y, Yao W, Ma J (2019) Short-term traffic speed prediction under different data collection time intervals using a SARIMA-SDGM hybrid prediction model. PLoS One 14(6):e0218626. https://doi.org/10.1371/journal.pone.0218626
Tan M, Le Q (2019) EfficientNet: rethinking model scaling for convolutional neural networks. In: International conference on machine learning. PMLR, pp 6105–6114
Tan G, Zheng S, Huang B, Cui Z, Dou H, Yang X, Zhou T (2021) Hybrid GA-SVR: an effective way to predict short-term traffic flow. In: 21st International conference on algorithms and architectures for parallel processing (ICA3PP 2021), pp 1–11. https://doi.org/10.1007/978-3-030-95388-1_4
Tan G, Huang B, Cui Z, Dou H, Zheng S, Zhou T (2022) A noise-immune reinforcement learning method for early diagnosis of neuropsychiatric systemic lupus erythematosus. Math Biosci Eng 19(3):2219–2239. https://doi.org/10.3934/mbe.2022104
Wang Z, Oates T (2015) Encoding time series as images for visual inspection and classification using tiled convolutional neural networks. In: Workshops at the twenty-ninth AAAI conference on artificial intelligence
Wang Y, Van Schuppen JH, Vrancken J (2013) Prediction of traffic flow at the boundary of a motorway network. IEEE Trans Intell Transp Syst 15(1):214–227. https://doi.org/10.1109/TITS.2013.2278192
Woo S, Park J, Lee J-Y, Kweon IS (2018) Cbam: convolutional block attention module. In: Proceedings of the European conference on computer vision (ECCV), pp 3–19
Xia D, Zhang M, Xiaobo Yan Yu, Bai YZ, Li Y, Li H (2021) A distributed WND-LSTM model on MapReduce for short-term traffic flow prediction. Neural Comput Appl 33(7):2393–2410. https://doi.org/10.1007/s00521-020-05076-2
Xie Y, Zhang Y, Ye Z (2007) Short-term traffic volume forecasting using Kalman filter with discrete wavelet decomposition. Comput Aided Civ Infrastruct Eng 22(5):326–334. https://doi.org/10.1111/j.1467-8667.2007.00489.x
Xu Y, Kong Q-J, Liu Y (2013) Short-term traffic volume prediction using classification and regression trees. In: 2013 IEEE intelligent vehicles symposium (IV). IEEE, pp 493–498 https://doi.org/10.1109/IVS.2013.6629516
Xu S, Chan HK, Zhang T (2019) Forecasting the demand of the aviation industry using hybrid time series SARIMA-SVR approach. Transp Res Part E Logist Transp Rev 122:169–180. https://doi.org/10.1016/j.tre.2018.12.005
Yang H-F, Dillon TS, Chang E, Chen Y-PP (2018) Optimized configuration of exponential smoothing and extreme learning machine for traffic flow forecasting. IEEE Trans Ind Inform 15(1):23–34. https://doi.org/10.1109/TII.2018.2876907
Yang S, Li H, Luo Y, Li J, Song Y, Zhou T (2022) Spatiotemporal adaptive fusion graph network for short-term traffic flow forecasting. Mathematics 10(9):1594. https://doi.org/10.3390/math10091594
Yao R, Zhang W, Zhang L (2020) Hybrid methods for short-term traffic flow prediction based on ARIMA-GARCH model and wavelet neural network. J Transp Eng Part A Syst 146(8):04020086. https://doi.org/10.1061/JTEPBS.0000388
Yuan Y, Quan T, Song Y, Guan J, Zhou T, Wu R (2022) Noise-immune extreme ensemble learning for early diagnosis of neuropsychiatric systemic lupus erythematosus. IEEE J Biomed Health Inform. https://doi.org/10.1109/JBHI.2022.3164937
Yuanli G, Wenqi L, Xinyue X, Qin L, Shao Z, Zhang H (2019) An improved bayesian combination model for short-term traffic prediction with deep learning. IEEE Trans Intell Transp Syst 21(3):1332–1342. https://doi.org/10.1109/TITS.2019.2939290
Zeng B, Li C (2018) Improved multi-variable grey forecasting model with a dynamic background-value coefficient and its application. Comput Ind Eng 118:278–290. https://doi.org/10.1016/j.cie.2018.02.042
Zeng J, Zhang M, Lin S-B (2022) Fully corrective gradient boosting with squared hinge: fast learning rates and early stopping. Neural Netw 147:136–151. https://doi.org/10.1016/j.neunet.2021.12.016
Zhang Y, Zhang Y, Haghani A (2014) A hybrid short-term traffic flow forecasting method based on spectral analysis and statistical volatility model. Transp Res Part C Emerg Technol 43:65–78. https://doi.org/10.1016/j.trc.2013.11.011
Zheng H, Lin F, Feng X, Chen Y (2021) A hybrid deep learning model with attention-based Conv-LSTM networks for short-term traffic flow prediction. IEEE Trans Intell Transp Syst 22(11):6910–6920. https://doi.org/10.1109/TITS.2020.2997352
Zhou T, Han G, Xuemiao X, Lin Z, Han C, Huang Y, Qin J (2017) \(\delta \)-agree adaboost stacked autoencoder for short-term traffic flow forecasting. Neurocomputing 247(4):31–38. https://doi.org/10.1016/j.neucom.2017.03.049
Zhou T, Jiang D, Lin Z, Han G, Xuemiao X, Qin J (2019) Hybrid dual Kalman filtering model for short-term traffic flow forecasting. IET Intell Transp Syst 13(6):1023–1032. https://doi.org/10.1049/iet-its.2018.5385
Zhou T, Dou H, Tan J, Song Y, Wang F, Wang J (2022) Small dataset solves big problem: an outlier-insensitive binary classifier for inhibitory potency prediction. Knowl Based Syst. https://doi.org/10.1016/j.knosys.2022.109242
Zhu JZ, Cao JX, Zhu Y (2014) Traffic volume forecasting based on radial basis function neural network with the consideration of traffic flows at the adjacent intersections. Transp Res Part C Emerg Technol 47:139–154. https://doi.org/10.1016/j.trc.2014.06.011
Funding
This work was supported by the 2022 Guangdong Basic and Applied Basic Research Foundation (Nos. 2022A1515011590, 2022A1515011978, 2021A1515012302), the Key Scientific Research Project of Universities in Guangdong Province (Nos. 2020ZDZX3028, 2022ZDZX1007), Guangdong Province Key Field R &D Program Project (No. 2021B0101220006), the Guangdong Provincial Science and Technology Plan Project (No. STKJ202209003), the National Natural Science Foundation of China (No. 61902232), STU Incubation Project for the Research of Digital Humanities and New Liberal Arts (No. 2021DH-3), 2020 Li Ka Shing Foundation Cross-Disciplinary Research Grant (No. 2020LKSFG05D), Open Fund of Guangdong Provincial Key Laboratory of Infectious Diseases and Molecular Immunopathology (No. GDKL202212).
Author information
Authors and Affiliations
Contributions
GT: data curation, software, writing—original draft preparation. TZ: conceptualization, methodology, funding acquisition. BH, and HD: visualization, investigation. ZC and SZ: data curation, validation. ZL: writing—reviewing and editing, supervision, project administration.
Corresponding authors
Ethics declarations
Conflict of interest
The authors have no conflicts of interest to declare that are relevant to the content of this article.
Ethical approval
Not applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Tan, G., Zhou, T., Huang, B. et al. A noise-immune and attention-based multi-modal framework for short-term traffic flow forecasting. Soft Comput 28, 4775–4790 (2024). https://doi.org/10.1007/s00500-023-09173-x
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-023-09173-x