Skip to main content

RD-PCA: A Traffic Condition Data Imputation Method Based on Robust Distance

  • Conference paper
Book cover Algorithms and Architectures for Parallel Processing (ICA3PP 2014)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8630))

  • 2554 Accesses

Abstract

Dynamic Transportation Information Service has penetrated into residents’ travels. The current problems that transportation information services face are variable such as real-time traffic forecasting, traffic managing and traffic induction. The above problems are related to the quality of historical traffic condition data. Due to a limited of GPS data collecting, the collected GPS data which scarcely covers the whole road network leads to incomplete and error traffic condition data. In consequence, two serious problems of traffic condition data quality manifest in incompleteness and low accuracy. This paper extends RD-PCA method which preliminarily focuses on the accuracy of imputing to prevent the estimating results from being impacted by outliers and aims at guaranteeing the completeness of imputing. The method excludes error data taking data quality measurement criterions. By adopting a measure factor, this method detects outliers and standardizes them, then constructs a robust feature space and imputes the missing data. The experimental results show that the proposed method can guarantee a high completeness and high accuracy under the condition of different missing rates.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Oki. What is VICS, http://www.oki.com/jp/SSC/ITS/eng/vics.html

  2. Wen, Y.H., Lee, T.T., Cho, H.J.: Missing Data Treatment and Data Fusion Toward Travel Time Estimation For ATIS. Journal of the Eastern Asia Society for Transportation Studies 6, 2546–2560 (2005)

    Google Scholar 

  3. Graham, J.W., Cumsille, P.E., Elek-Fisk, E.: Methods for Handling Missing Data. Handbook of Psychology 2, 87–114 (2003)

    Google Scholar 

  4. Du, B., Xu, L., Ma, D., Lv, W.: Missing data compensation model in real-time traffic information service system. In: Fifth International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2008, vol. 5, pp. 371–378. IEEE (2008)

    Google Scholar 

  5. Guo, J., Huang, W., Williams, B.M.: Adaptive Kalman filter approach for stochastic short-term traffic flow rate prediction and uncertainty quantification. Transportation Research Part C: Emerging Technologies 43(1), 50–64 (2014)

    Article  Google Scholar 

  6. Li, L., Li, Y., Li, Z.: Efficient missing data imputing for traffic flow by considering temporal and spatial dependence. Transportation Research Part C:Emerging Technologies 34, 108–120 (2013)

    Article  Google Scholar 

  7. Qu, L., Li, L., Zhang, Y., et al.: PPCA-based missing data imputation for traffic flow volume: A systematical approach. IEEE Transactions on Intelligent Transportation Systems 10(3), 512–522 (2009)

    Article  Google Scholar 

  8. Turner, S.: Dening and measuring traffic data quality: White paper on recommended approaches. Transportation Research Record: Journal of the Transportation Research Board 1870(1), 62–69 (2004)

    Article  Google Scholar 

  9. Hubert, M., Rousseeuw, P.J., Branden, K.V.: ROBPCA: A new approach to robust principal component analysis. Technometrics 47 (2005)

    Google Scholar 

  10. Hubert, M., Van Driessen, K.: Fast and robust discriminant analysis. Computational Statistics & Data Analysis 45(2), 301–320 (2005)

    Article  Google Scholar 

  11. Kumagai, M., Fushiki, T., Kimita, K., et al.: Spatial Interpolation of Real-time Floating Car Data Based on Multiple Link Correlation in feature space. In: Proc. of 13th World Congress of ITS, London, CDROM (2006)

    Google Scholar 

  12. Kramer, M.A.: Nonlinear principal component analysis using autoassociative neural networks. AIChE Journal 37(2), 233–243 (1991)

    Article  Google Scholar 

  13. Wasito, I.: Least Squares Algorithms with Nearest Neighbor Techniques for Imputing Missing Data Values. Birkbeck College (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Wan, X., Du, Y., Wang, J. (2014). RD-PCA: A Traffic Condition Data Imputation Method Based on Robust Distance. In: Sun, Xh., et al. Algorithms and Architectures for Parallel Processing. ICA3PP 2014. Lecture Notes in Computer Science, vol 8630. Springer, Cham. https://doi.org/10.1007/978-3-319-11197-1_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-11197-1_19

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-11196-4

  • Online ISBN: 978-3-319-11197-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics