Advertisement

Fine-Scale Prediction of People’s Home Location Using Social Media Footprints

  • Hamdi KavakEmail author
  • Daniele Vernon-Bido
  • Jose J. Padilla
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10899)

Abstract

In this study, we develop a machine learning classifier that determines Twitter users’ home location with 100 m resolution. Our results suggest up to 0.87 overall accuracy in predicting home location for the City of Chicago. We explore the influence of time span of data collection and location-sharing habits of a user. The classifier accuracy changes by data collection time but larger than one-month time spans do not significantly increase prediction accuracy. An individual’s home location can be ascertained with as few as 0.6 to 1.4 tweets/day or 75 to 225 tweets with an accuracy of over 0.8. Our results shed light on how home location information can be predicted with high accuracy and how long data needs to be collected. On the flip side, our results imply potential privacy issues on publicly available social media data.

Keywords

Human mobility Social media Home location inference 

Notes

Acknowledgments and Disclaimer

This material is based on research sponsored by the Office of the Assistant Secretary of Defense for Research and Engineering (OASD(R&E)) under agreement number FAB750-15-2-0120. The U.S. Government is authorized to reproduce and distribute reprints for Governmental purposes notwithstanding any copyright notation thereon. The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of the OASD(R&E) or the U.S. Government.

References

  1. 1.
    Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)zbMATHGoogle Scholar
  2. 2.
    Eagle, N., Pentland, A.S.: Reality mining: sensing complex social systems. Pers. Ubiquit. Comput. 10(4), 255–268 (2006)CrossRefGoogle Scholar
  3. 3.
    Ester, M., Kriegel, H.P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters a density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the 2nd KDD. AAAI Press (1996)Google Scholar
  4. 4.
    Hu, T., Luo, J., Kautz, H., Sadilek, A.: Home location inference from sparse and noisy data: models and applications. In: Proceedings - 15th IEEE International Conference on Data Mining Workshop, ICDMW 2015, pp. 1382–1387 (2016)Google Scholar
  5. 5.
    Mahmud, J., Nichols, J., Drews, C.: Where is this tweet from? Inferring home locations of Twitter users. In: ICWSM, pp. 511–514 (2012)Google Scholar
  6. 6.
    Pontes, T., Magno, G., Vasconcelos, M., Gupta, A., Almeida, J., Kumaraguru, P., Almeida, V.: Beware of what you share: inferring home location in social networks. In: ICDMW 2012, pp. 571–578 (2012)Google Scholar
  7. 7.
    Ryoo, K., Moon, S.: Inferring Twitter user locations with 10 km accuracy. In: Proceedings of the 23rd International Conference on WWW (2014)Google Scholar
  8. 8.
    Schneider, C.M., Belik, V., Couronné, T., Smoreda, Z., González, M.C.: Unravelling daily human mobility motifs. J. R. Soc. Interface 10(84), 20130246 (2013)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  1. 1.George Mason UniversityFairfaxUSA
  2. 2.Virginia Modeling Analysis and Simulation CenterSuffolkUSA

Personalised recommendations