Using Data Mining and Vehicular Networks to Estimate the Severity of Traffic Accidents

  • Manuel FogueEmail author
  • Piedad Garrido
  • Francisco J. Martinez
  • Juan-Carlos Cano
  • Carlos T. Calafate
  • Pietro Manzoni
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 171)


New communication technologies integrated into modern vehicles offer an opportunity for better assistance to people injured in traffic accidents. To improve the overall rescue process, a fast and accurate estimation of the severity of the accident represents a key point to help the emergency services to better determine the amount of required resources. This paper proposes a novel intelligent system which is able to automatically estimate the severity of traffic accidents based on the concept of datamining and knowledge inference.Our system considers the most relevant variables that can characterize the severity of the accidents (variables such as the vehicle speed, the type of vehicles involved, and the airbag status). Results show that data mining classification algorithms, combined with an adequate selection of relevant features and a prior division of collisions based on the impact direction, allows generating estimation models able to predict the severity of new accidents.


Bayesian Network Road Accident Vehicular Network Accident Severity Accident Data 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Beshah, T., Hill, S.: Mining Road Traffic Accident Data to Improve Safety: Role of Road-Related Factors on Accident Severity in Ethiopia. In: Proceedings of AAAI Artificial Intelligence for Development (AI-D 2010), Stanford, CA, USA (March 2010)Google Scholar
  2. 2.
    Bradley, A.P.: The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition 30, 1145–1159 (1997)CrossRefGoogle Scholar
  3. 3.
    Chong, M., Abraham, A., Paprzycki, M.: Traffic accident analysis using machine learning paradigms. Informatica 29, 89–98 (2005)Google Scholar
  4. 4.
    Cooper, G.F., Herskovits, E.: A bayesian method for the induction of probabilistic networks from data. Machine Learning 9, 309–347 (1992)zbMATHGoogle Scholar
  5. 5.
    Dirección General de Tráfico (DGT). The main statistics of road accidents. Spain (2010),
  6. 6.
    Fawcett, T.: ROC Graphs: Notes and Practical Considerations for Researchers. Technical report, HP Labs (2004)Google Scholar
  7. 7.
    Fayyad, U., Piatetsky-Shapiro, G., Smyth, P.: The KDD process for extracting useful knowledge from volumes of data. Communications of the ACM 39, 27–34 (1996)CrossRefGoogle Scholar
  8. 8.
    Fogue, M., Garrido, P., Martinez, F.J., Cano, J.-C., Calafate, C.T., Manzoni, P., Sanchez, M.: Prototyping an automatic notification scheme for traffic accidents in vehicular networks. In: 2011 IFIP Wireless Days (WD), pp. 1–5 (2011)Google Scholar
  9. 9.
    Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: an update. SIGKDD Explorations 11, 10–18 (2009)CrossRefGoogle Scholar
  10. 10.
    Kohavi, R., John, G.H.: Wrappers for feature subset selection. Artificial Intelligence - Special Issue on Relevance 97, 273–324 (1997)zbMATHGoogle Scholar
  11. 11.
    Hall, M.: Correlation-based feature selection for machine learning. PhD thesis, Department of Computer Science, University of Waikato, Hamilton, New Zealand (2008)Google Scholar
  12. 12.
    Martinez, F.J., Cano, J.-C., Calafate, C.T., Manzoni, P., Barrios, J.M.: Assessing the feasibility of a VANET driver warning system. In: Proceedings of the 4th ACM Workshop on Performance Monitoring and Measurement of Heterogeneous Wireless and Wired Networks, PM2HW2N 2009, Tenerife, Spain, pp. 39–45. ACM (2009)Google Scholar
  13. 13.
    National Highway Traffic Safety Administration (NHTSA). FTP Site for the General Estimates System, GES (2012),
  14. 14.
    Platt, J.C.: Fast training of support vector machines using sequential minimal optimization, pp. 185–208. MIT Press, Cambridge (1999)Google Scholar
  15. 15.
    Ross Quinlan, J.: C4.5: programs for machine learning. Morgan Kaufmann Publishers Inc., San Francisco (1993)Google Scholar
  16. 16.
    Sohn, S.Y., Lee, S.H.: Data fusion, ensemble and clustering to improve the classification accuracy for the severity of road traffic accidents in Korea. Safety Science 41(1), 1–14 (2003)MathSciNetCrossRefGoogle Scholar
  17. 17.
    Sohn, S.Y., Shin, H.: Pattern recognition for road traffic accident severity in Korea. Ergonomics 44(1), 107–117 (2001)Google Scholar
  18. 18.
    Tesema, T., Abraham, A., Grosan, C.: Rule Mining and Classification of Road Accidents Using Adaptive Regression Trees. International Journal of Simulation Systems, Science & Technology 6(10-11), 80–94 (2005)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Manuel Fogue
    • 1
    Email author
  • Piedad Garrido
    • 1
  • Francisco J. Martinez
    • 1
  • Juan-Carlos Cano
    • 2
  • Carlos T. Calafate
    • 2
  • Pietro Manzoni
    • 2
  1. 1.University of ZaragozaSaragossaSpain
  2. 2.Universitat Politècnica de ValènciaSaragossaSpain

Personalised recommendations