Skip to main content
Log in

A Machine Learning Based Approach for Smart and Automated Data Collection: Applications in Transportation

  • Original Article
  • Published:
Transportation in Developing Economies Aims and scope Submit manuscript

Abstract

This paper presents a streamlined machine learning-assisted integrated approach for smart surveys and data collection with applications to multiple domains such as transportation, aviation, healthcare, and education. We exploit several machine learning methods to enhance the survey’s effectiveness and increase the trustworthiness of the collected data in real time. The methodology is demonstrated in a case study involving the daytime seat belt surveys mandated by the U.S. National Highway Traffic Safety Administration (NHTSA). To implement and test the approach, we also built an integrated tool. This cloud-based software tool is designed to aid in intelligent data collection, quality control, survey management, data analysis, insight extraction, and a feedback structure. This tool automates and standardizes the complete process of transportation-related data collection, quality control, and better decision-making. The framework presented is adaptable, underscoring its potential for widespread application and utility in other domains as well.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  1. Kahane CJ (2000) Fatality reduction by safety belts for front-seat occupants of cars and light trucks: updated and expanded estimates based on 1986-99 fars data. Technical report

  2. Agarwal S, Sancheti A, Khaddar R, Kachroo P (2013) Geospatial framework for integration of transportation data using voronoi diagrams. In: Transportation Research Board 92nd Annual Meeting

  3. Vasudevan V, Kachroo P, Bandaru N (2015) Nighttime seatbelt usage data collection: when and how long? IATSS Res 38(2):149–156

    Article  Google Scholar 

  4. Krishen AS, Kachroo P, Agarwal S, Sastry SS, Wilson M (2015) Safety culture from an interdisciplinary perspective: conceptualizing a hierarchical feedback-based transportation framework. Transp J 54(4):516–534

    Article  Google Scholar 

  5. Vivoda JM, Eby DW (2006) Using personal digital assistants (pdas) for the collection of safety belt use data in the field. Behav Res Methods 38(1):158–164

    Article  Google Scholar 

  6. Kachroo P, Lakhanpal P, Wilson M (2011) An optimal data entry system using pdas for seat-belt usage studies. In: Transportation research board 90th annual meeting

  7. Lakhanpal P (2011) Traffic safety: modeling, analysis and visualization

  8. Sancheti A, Lakhanpal P, Contreras S, Kachroo P, Wilson M (2012) Comparative study of various data collection software used for seat-belt observational surveys. Technical report

  9. Nambisan SS, Vasudevan V (2007) Is seat belt usage by front seat passengers related to seat belt usage by their drivers? J Saf Res 38(5):545–555

    Article  Google Scholar 

  10. De Maesschalck R, Jouan-Rimbaud D, Massart DL (2000) The mahalanobis distance. Chemom Intell Lab Syst 50(1):1–18

    Article  Google Scholar 

  11. Filzmoser P, Maronna R, Werner M (2008) Outlier identification in high dimensions. Comput Stat Data Anal 52(3):1694–1711

    Article  MathSciNet  Google Scholar 

  12. Locantore N, Marron J, Simpson D, Tripoli N, Zhang J, Cohen K, Boente G, Fraiman R, Brumback B, Croux C (1999) Robust principal component analysis for functional data. Test 8(1):1–73

    Article  MathSciNet  Google Scholar 

  13. Agarwal S, Kachroo P, Regentova E (2016) A hybrid model using logistic regression and wavelet transformation to detect traffic incidents. IATSS Res 40(1):56–63

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shaurya Agarwal.

Ethics declarations

Conflict of interest

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Agarwal, S., Gupta, S., Kachroo, P. et al. A Machine Learning Based Approach for Smart and Automated Data Collection: Applications in Transportation. Transp. in Dev. Econ. 10, 15 (2024). https://doi.org/10.1007/s40890-024-00199-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s40890-024-00199-w

Keywords

Navigation