Abstract
Accurate traffic data collection is essential for supporting advanced traffic management system operations. This study investigated a large-scale data-driven sequential traffic sensor health monitoring (TSHM) module that can be used to monitor sensor health conditions over large traffic networks. Our proposed module consists of three sequential steps for detecting different types of abnormal sensor issues. The first step detects sensors with abnormally high missing data rates, while the second step uses clustering anomaly detection to detect sensors reporting abnormal records. The final step introduces a novel Bayesian changepoint modeling technique to detect sensors reporting abnormal traffic data fluctuations by assuming a constant vehicle length distribution based on average effective vehicle length (AEVL). Our proposed method is then compared with two benchmark algorithms to show its efficacy. Results obtained by applying our method to the statewide traffic sensor data of Iowa show it can successfully detect different classes of sensor issues. This demonstrates that sequential TSHM modules can help transportation agencies determine traffic sensors’ exact problems, thereby enabling them to take the required corrective steps.
Similar content being viewed by others
References
Al-Deek HM, Venkata C, Chandra SR (2004) New algorithms for filtering and imputation of real-time and archived dual-loop detector data in i–4 data warehouse. Transportation Research Record 1867(1), 116–126
Apache Pig (2018) https://pig.apache.org/. Accessed 20 July 2018
Berkhin P (2006) A survey of clustering data mining techniques. In Grouping multidimensional data (pp 25–71). Springer, Berlin
Bouwmans T, Zahzah EH (2014) Robust pca via principal component pursuit: A review for a comparative evaluation in video surveillance. Computer Vision and Image Understanding 122:22–34
Candès EJ, Li X, Ma Y, Wright J (2011) Robust principal component analysis? Journal of the ACM (JACM) 58(3):11
Chakraborty P, Adu-Gyamfi YO, Poddar S, Ahsani V, Sharma A, Sarkar S (2018) Traffic congestion detection from camera images using deep convolution neural networks. Transportation Research Record: Journal of the Transportation Research Board 2672(45), 222–231. doi: https://doi.org/10.1177/0361198118777631
Chakraborty P, Sharma A, Hegde C (2018) Freeway traffic incident detection from cameras: a semi-supervised learning approach. In 2018 21st International Conference on Intelligent Transportation Systems (ITSC) 2018 21st international conference on intelligent transportation systems (itsc), pp 1840–1845
Chandola V, Banerjee A, Kumar V (2009) Anomaly detection: A survey. ACM Computing Surveys (CSUR) 41(3):15
Chen C, Kwon J, Rice J, Skabardonis A, Varaiya P (2003) Detecting errors and imputing missing data for single-loop surveillance systems. Transportation Research Record 1855(1), 160–167
Erman J, Arlitt M, Mahanti A (2006) Traffic classification using clustering algorithms. In Proceedings of the 2006 SIGCOMM workshop on Mining network data, pp 281–286
Ester M, Kriegel H-P, Sander J, Xu X et al (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: Kdd, Vol 96, pp 226–231
Fearnhead P (2006) Exact and efficient bayesian inference for multiple changepoint problems. Statistics and Computing 16(2), 203–213
Fearnhead P, Liu Z (2007) On-line inference for multiple changepoint problems. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 69(4):589–605
Jacobson LN, Nihan NL, Bender JD (1990) Detecting erroneous loop detector data in a freeway traffic management system (No. 1287)
Klein LA, Mills MK, Gibson DR et al (2006) Traffic detector handbook: Volume i (Tech. Rep.) Turner-Fairbank Highway Research Center
Kodinariya TM, Makwana PR (2013) Review on determining number of cluster in k-means clustering. International Journal of Advance Research in Computer Science and Management Studies 1(6), 90–95
Lee H, Coifman B (2011) Quantifying loop detector sensitivity and correcting detection problems on freeways. Journal of Transportation Engineering 138(7), 871–881
Lu Y, Yang X, Chang G-L (2014) Algorithm for detector-error screening on basis of temporal and spatial information. Transportation Research Record: Journal of the Transportation Research Board 2443:40–48
Ma D, Luo X, Li W, Jin S, Guo W, Wang D (2017) Traffic demand estimation for lane groups at signal-controlled intersections using travel times from video-imaging detectors. IET Intelligent Transport Systems 11(4), 222–229
MacQueen J et al (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the fifth berkeley symposium on mathematical statistics and probability, vol 1, pp 281–297
Mori U, Mendiburu A, Álvarez M, Lozano JA (2015) A review of travel time estimation and forecasting for advanced traveller information systems. Transportmetrica A: Transport Science 11(2), 119–157
Nisa KK, Andrianto HA, Mardhiyyah R (2014) Hotspot clustering using dbscan algorithm and shiny web framework. In: 2014 international conference on advanced computer science and information system, pp 129–132
Payne H, Thompson S (1997) Malfunction detection and data repair for induction-loop sensors using i–880 data base. Transportation Research Record: Journal of the Transportation Research Board 1570:191–201
Schubert E, Sander J, Ester M, Kriegel HP, Xu X (2017) Dbscan revisited, revisited: why and how you should (still) use dbscan. ACM Transactions on Database Systems (TODS) 42(3):19
Shi Q, Abdel-Aty M (2015) Big data applications in real-time traffic operation and safety monitoring and improvement on urban expressways. Transportation Research Part C: Emerging Technologies 58:380–394
Sun Z, Jin W-L, Ng M (2016) Network sensor health problem. Transportation Research Part C: Emerging Technologies 68:300–310
Turner S, Albert L, Gajewski B, Eisele W (2000) Archived intelligent transportation system data quality: Preliminary analyses of san antonio transguide data. Transportation Research Record: Journal of the Transportation Research Board 1719:77–84
Turochy RE, Smith BL (2000) New procedure for detector data screening in traffic management systems. Transportation Research Record 1727(1), 127–131
Vanajakshi L, Rilett L (2004) Loop detector data diagnostics based on conservation-of-vehicles principle. Transportation Research Record 1870(1), 162–169
Wells TJ, Smaglik EJ, Bullock DM (2008) Implementation of station health monitoring procedures for its sensors, volume 1
Wu L, Liu C, Huang T, Sharma A, Sarkar S (2017) Traffic sensor health monitoring using spatiotemporal graphical modeling. In: Proceedings of the 2nd acm sigkdd workshop on machine learning for prognostics and health management, pp 13–17
Yao B, Chen C, Cao Q, Jin L, Zhang M, Zhu H, Yu B (2017) Short-term traffic speed prediction for an urban corridor. Computer-Aided Civil and Infrastructure Engineering 32(2), 154–169
Acknowledgements
Our research results are based upon work jointly supported by the National Science Foundation Partnerships for Innovation: Building Innovation Capacity (PFI: BIC) program under Grant No. 1632116, National Science Foundation under Grants Nos. CNS-1464279, CCF-1566281, CCF-1750920, Award ATD-1830254 supported by NSF and the National Geospatial-Intelligence Agency, and Iowa DOT Office of Traffic Operations Support Grant. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Huang, T., Chakraborty, P., Sharma, A. et al. Large-Scale Data-Driven Traffic Sensor Health Monitoring. J. Big Data Anal. Transp. 3, 229–245 (2021). https://doi.org/10.1007/s42421-021-00049-w
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s42421-021-00049-w