Abstract
Dynamic Transportation Information Service has penetrated into residents’ travels. The current problems that transportation information services face are variable such as real-time traffic forecasting, traffic managing and traffic induction. The above problems are related to the quality of historical traffic condition data. Due to a limited of GPS data collecting, the collected GPS data which scarcely covers the whole road network leads to incomplete and error traffic condition data. In consequence, two serious problems of traffic condition data quality manifest in incompleteness and low accuracy. This paper extends RD-PCA method which preliminarily focuses on the accuracy of imputing to prevent the estimating results from being impacted by outliers and aims at guaranteeing the completeness of imputing. The method excludes error data taking data quality measurement criterions. By adopting a measure factor, this method detects outliers and standardizes them, then constructs a robust feature space and imputes the missing data. The experimental results show that the proposed method can guarantee a high completeness and high accuracy under the condition of different missing rates.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Oki. What is VICS, http://www.oki.com/jp/SSC/ITS/eng/vics.html
Wen, Y.H., Lee, T.T., Cho, H.J.: Missing Data Treatment and Data Fusion Toward Travel Time Estimation For ATIS. Journal of the Eastern Asia Society for Transportation Studies 6, 2546–2560 (2005)
Graham, J.W., Cumsille, P.E., Elek-Fisk, E.: Methods for Handling Missing Data. Handbook of Psychology 2, 87–114 (2003)
Du, B., Xu, L., Ma, D., Lv, W.: Missing data compensation model in real-time traffic information service system. In: Fifth International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2008, vol. 5, pp. 371–378. IEEE (2008)
Guo, J., Huang, W., Williams, B.M.: Adaptive Kalman filter approach for stochastic short-term traffic flow rate prediction and uncertainty quantification. Transportation Research Part C: Emerging Technologies 43(1), 50–64 (2014)
Li, L., Li, Y., Li, Z.: Efficient missing data imputing for traffic flow by considering temporal and spatial dependence. Transportation Research Part C:Emerging Technologies 34, 108–120 (2013)
Qu, L., Li, L., Zhang, Y., et al.: PPCA-based missing data imputation for traffic flow volume: A systematical approach. IEEE Transactions on Intelligent Transportation Systems 10(3), 512–522 (2009)
Turner, S.: Dening and measuring traffic data quality: White paper on recommended approaches. Transportation Research Record: Journal of the Transportation Research Board 1870(1), 62–69 (2004)
Hubert, M., Rousseeuw, P.J., Branden, K.V.: ROBPCA: A new approach to robust principal component analysis. Technometrics 47 (2005)
Hubert, M., Van Driessen, K.: Fast and robust discriminant analysis. Computational Statistics & Data Analysis 45(2), 301–320 (2005)
Kumagai, M., Fushiki, T., Kimita, K., et al.: Spatial Interpolation of Real-time Floating Car Data Based on Multiple Link Correlation in feature space. In: Proc. of 13th World Congress of ITS, London, CDROM (2006)
Kramer, M.A.: Nonlinear principal component analysis using autoassociative neural networks. AIChE Journal 37(2), 233–243 (1991)
Wasito, I.: Least Squares Algorithms with Nearest Neighbor Techniques for Imputing Missing Data Values. Birkbeck College (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Wan, X., Du, Y., Wang, J. (2014). RD-PCA: A Traffic Condition Data Imputation Method Based on Robust Distance. In: Sun, Xh., et al. Algorithms and Architectures for Parallel Processing. ICA3PP 2014. Lecture Notes in Computer Science, vol 8630. Springer, Cham. https://doi.org/10.1007/978-3-319-11197-1_19
Download citation
DOI: https://doi.org/10.1007/978-3-319-11197-1_19
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11196-4
Online ISBN: 978-3-319-11197-1
eBook Packages: Computer ScienceComputer Science (R0)