Skip to main content
Log in

Mining Social Networks to Detect Traffic Incidents

  • Published:
Information Systems Frontiers Aims and scope Submit manuscript

Abstract

Social networks are usually used by citizens to report or complain about traffic incidents that affect their daily mobility. Automatically finding traffic-related reports and extracting useful information from them is not a trivial task, due to the informal language used in social networks, to the lack of geographic metadata, and to the large amount of non traffic-related publications. In this article, we address this problem by combining Machine Learning and Natural Language Processing techniques. Our approach (a) filters publications that report traffic incidents in social networks, (b) extracts geographic information from the textual content of the publications, and (c) provides a broadcasting service that clusters all the reports of the same incident. We compared the performance of our approach with state of the art approaches and with a popular traffic-specific social network, obtaining promising results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Notes

  1. https://www.waze.com/

  2. https://www.twitter.com/

  3. We decided to keep two repeated letters at maximum because some words and names have repetitions of letters (e.g. ‘Saavedra’ and ‘calle’ in Spanish or ‘street’ and ‘traffic’ in English).

  4. Stopwords are irrelevant words such as articles or prepositions that have a high probability of occurrence in the publications, regardless of the topic discussed.

  5. https://developers.google.com/maps/documentation/geocoding/

  6. https://fasttext.cc/docs/en/pretrained-vectors.html

  7. http://data.buenosaires.gob.ar/dataset/calles

  8. http://www.rae.es/diccionario-panhispanico-de-dudas/terminos-linguisticos

  9. http://twitter4j.org/en/index.html

  10. https://developer.twitter.com/en/docs/tweets/filter-realtime/guides/basic-stream-parameters

  11. https://www.cs.waikato.ac.nz/ml/weka/

  12. https://www.postgresql.org/

  13. https://docs.oracle.com/javase/7/docs/api/index.html

  14. https://developers.google.com/maps/documentation/geocoding/start

  15. https://twitter.com/Manwebsas

  16. http://si.isistan.unicen.edu.ar/Manwe/research-demo/

  17. https://eclipse-ee4j.github.io/jersey/

  18. http://tomcat.apache.org/

References

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sebastián Vallejos.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Vallejos, S., Alonso, D.G., Caimmi, B. et al. Mining Social Networks to Detect Traffic Incidents. Inf Syst Front 23, 115–134 (2021). https://doi.org/10.1007/s10796-020-09994-3

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10796-020-09994-3

Keywords

Navigation