Users key locations in online social networks: identification and applications

  • Hariton Efstathiades
  • Demetris Antoniades
  • George Pallis
  • Marios D. Dikaiakos
Original Article

Abstract

Ubiquitous Internet connectivity enables users to update their Online Social Network profiles from any location and at any point in time. These data, which are often geo-tagged, can be used to provide valuable information to nearby users, both in real time and in aggregated form. However, although users publish geo-tagged information, only a small number implicitly report their base location in their Online Social Network profile. In this paper, we present a simple yet effective methodology for identifying a user’s Key locations, namely her Home and Work places. We evaluate our methodology with Twitter datasets collected from the Netherlands, the city of London, and Los Angeles County. Furthermore, we combine Twitter and LinkedIn information to construct a Work location dataset and evaluate our methodology on it. Results show that our proposed methodology not only outperforms state-of-the-art methods by at least 30% in terms of accuracy, but also cuts the detection radius to at most half that of other methods. To illustrate the applicability of our methodology and motivate further research in location-based social network analysis, we provide an initial evaluation of three such approaches, namely (1) Twitter user mobility patterns, (2) Ego network formulation, and (3) Key location tweet sentiment analysis.

Keywords

  • Online social networks
  • Key location identification
  • Mobility patterns

Copyright information

© Springer-Verlag Wien 2016

Authors and Affiliations

  • Hariton Efstathiades (1)
  • Demetris Antoniades (1)
  • George Pallis (1)
  • Marios D. Dikaiakos (1)
  1. Computer Science Department, University of Cyprus, Nicosia, Cyprus
