Reliable Online Social Network Data Collection

  • Fehmi Ben Abdesslem
  • Iain Parris
  • Tristan Henderson
Chapter

Abstract

Large quantities of information are shared through online social networks, making them attractive sources of data for social network research. When studying the usage of online social networks, these data may not describe properly users’ behaviours. For instance, the data collected often include content shared by the users only, or content accessible to the researchers, hence obfuscating a large amount of data that would help to understand users’ behaviours and privacy concerns. Moreover, the data collection methods employed in experiments may also have an effect on data reliability when participants self-report inaccurate information or are observed while using a simulated application. Understanding the effects of these collection methods on data reliability is paramount for the study of social networks; for understanding user behaviour; for designing socially aware applications and services; and for mining data collected from such social networks and applications. This chapter reviews previous research which has looked at social network data collection and user behaviour in these networks. We highlight shortcomings in the methods used in these studies and introduce our own methodology and user study based on the experience sampling method; we claim that our methodology leads to the collection of more reliable data by capturing both those data which are shared and not shared. We conclude with suggestions for collecting and mining data from online social networks.

References

  1. 1.
    Amichai-Hamburger, Y., Vinitzky, G.: Social network use and personality. Comput. Hum. Behav. 26(6), 1289–1295 (2010). doi:10. 1016/j.chb.2010.03.018CrossRefGoogle Scholar
  2. 2.
    Anthony, D., Henderson, T., Kotz, D.: Privacy in location-aware computing environments. IEEE Pervasive Comput. 6(4), 64–72 (2007). doi:10.1109/MPRV.2007.83CrossRefGoogle Scholar
  3. 3.
    Ben Abdesslem, F., Phillips, A., Henderson, T.: Less is more: energy-efficient mobile sensing with SenseLess. In: ACM MobiHeld’09, Barcelona, pp. 61–62 (2009). doi:10.1145/1592606.1592621Google Scholar
  4. 4.
    Benevenuto, F., Rodrigues, T., Cha, M., Almeida, V.: Characterizing user behavior in online social networks. In: IMC ’09: Proceedings of the 9th ACM Internet Measurement Conference, Chicago, pp. 49–62 (2009). doi:10.1145/1644893.1644900Google Scholar
  5. 5.
    Besmer, A., Lipford, H.R.: Moving beyond untagging: photo privacy in a tagged world. In: CHI ’10: Proceedings of the 28th International Conference on Human Factors in Computing Systems, Atlanta, pp. 1563–1572 (2010). doi:10.1145/1753326.1753560Google Scholar
  6. 6.
    Brandtzæg, P.B., Heim, J.: Why people use social networking sites. In Hutchison, D., Kanade, T., Kittler, J., Kleinberg, J.M., Mattern, F., Mitchell, J.C., Naor, M., Nierstrasz, O., Rangan, C.P., Steffen, B., Sudan, M., Terzopoulos, D., Tygar, D., Vardi, M.Y., Weikum, G., Ozok, A.A., Zaphiris, P. (eds.) Online Communities and Social Computing, vol. 5621, chapter 16, pp. 143–152. Springer, Berlin/Heidelberg (2009). doi:10.1007/978-3-642-02774-1_16Google Scholar
  7. 7.
    Cha, M., Haddadi, H., Benevenuto, F., Gummadi, K.P.: Measuring user influence in Twitter: the million follower fallacy. In: Proceedings of the 4th International AAAI Conference on Weblogs and Social Media (ICWSM), Washington (2010). Online at http://aaai.org/ocs/index.php/ICWSM/ICWSM10/paper/view/1538/0
  8. 8.
    Consolvo, S., Walker, M.: Using the experience sampling method to evaluate ubicomp applications. IEEE Pervasive Comput. 2(2), 24–31 (2003). doi:10.1109/MPRV.2003.1203750CrossRefGoogle Scholar
  9. 9.
    Consolvo, S., Smith, I.E., Matthews, T., Lamarca, A., Tabert, J., Powledge, P.: Location disclosure to social relations: why, when, and what people want to share. In: CHI ’05: Proceedings of the SIGCHI conference on Human factors in computing systems, Portland, pp. 81–90 (2005). doi:10.1145/1054972.1054985Google Scholar
  10. 10.
    Eagle, N., Pentland, A.S., Lazer, D.: Inferring friendship network structure by using mobile phone data. Proc. Natl. Acad. Sci. 106(36), 15274–15278 (2009). doi:10.1073/pnas.0900282106CrossRefGoogle Scholar
  11. 11.
    Ellison, N.B., Steinfield, C., Lampe, C.: The benefits of Facebook “friends:” social capital and college students use of online social network sites. J. Comput. Mediat. Commun. 12(4), 1143–1168 (2007). doi:10. 1111/j.1083-6101.2007.00367.xCrossRefGoogle Scholar
  12. 12.
    Froehlich, J., Chen, M.Y., Consolvo, S., Harrison, B., Landay, J.A.: MyExperience: a system for in situ tracing and capturing of user feedback on mobile phones. In: MobiSys ’07: Proceedings of the 5th International Conference on Mobile Systems, Applications and Services, San Juan, pp. 57–70 (2007). doi:10.1145/1247660.1247670Google Scholar
  13. 13.
    Garg, S., Gupta, T., Carlsson, N., Mahanti, A.: Evolution of an online social aggregation network: an empirical study. In: IMC ’09: Proceedings of the 9th ACM Internet Measurement Conference, Chicago, pp. 315–321 (2009). doi:10.1145/1644893.1644931Google Scholar
  14. 14.
    Ghosh, S., Korlam, G., Ganguly, N.: The effects of restrictions on number of connections in OSNs: a case-study on Twitter. In: Proceedings of the 3rd Workshop on Online Social Networks (WOSN 2010), Boston (2010). Online at http://www.usenix.org/events/wosn10/tech/full_papers/Ghosh.pdf
  15. 15.
    Gjoka, M., Sirivianos, M., Markopoulou, A., Yang, X.: Poking Facebook: characterization of OSN applications. In: WOSN ’08: Proceedings of the First Workshop on Online Social Networks, Seattle, pp. 31–36 (2008). doi:10.1145/1397735.1397743Google Scholar
  16. 16.
    Gjoka, M., Kurant, M., Butts, C.T., Markopoulou, A.: Walking in Facebook: a case study of unbiased sampling of OSNs. In: Proceedings of IEEE INFOCOM 2010, San Diego, pp. 1–9 (2010). doi:10.1109/ INFCOM.2010.5462078Google Scholar
  17. 17.
    Guy, I., Jacovi, M., Meshulam, N., Ronen, I., Shahar, E.: Public vs. private: comparing public social network information with email. In: CSCW ’08: Proceedings of the ACM 2008 Conference on Computer Supported Cooperative Work, San Diego, pp. 393–402 (2008). doi:10. 1145/1460563.1460627Google Scholar
  18. 18.
    Gyarmati, L., Trinh, T.: Measuring user behavior in online social networks. IEEE Netw. 24(5), 26–31 (2010). doi:10.1109/MNET.2010. 5578915CrossRefGoogle Scholar
  19. 19.
    Hoser, B., Nitschke, T.: Questions on ethics for research in the virtually connected world. Soc. Netw. 32(3), 180–186 (2010). doi:10.1016/j.socnet. 2009.11.003CrossRefGoogle Scholar
  20. 20.
    Iachello, G., Smith, I., Consolvo, S., Chen, M., Abowd, G.D.: Developing privacy guidelines for social location disclosure applications and services. In: SOUPS ’05: Proceedings of the 2005 Symposium on Usable Privacy and Security, Philadelphia, pp. 65–76 (2005). doi:10.1145/1073001. 1073008Google Scholar
  21. 21.
    Java, A., Song, X., Finin, T., Tseng, B.: Why we Twitter: an analysis of a microblogging community. In: Zhang, H., Spiliopoulou, M., Mobasher, B., Giles, C.L., McCallum, A., Nasraoui, O., Srivastava, J., Yen, J. (eds.) Advances in Web Mining and Web Usage Analysis. Lecture Notes in Computer Science, vol. 5439, chapter 7, pp. 118–138. Springer, Berlin/Heidelberg (2007). doi: 10.1007/978-3-642-00528-2_7
  22. 22.
    Jiang, J., Wilson, C., Wang, X., Huang, P., Sha, W., Dai, Y., Zhao, B.Y.: Understanding latent interactions in online social networks. In: IMC ’10: Proceedings of the 10th Annual Conference on Internet Measurement, Melbourne, pp. 369–382 (2010). doi:10.1145/1879141.1879190Google Scholar
  23. 23.
    Kofod-Petersen, A., Gransaether, P.A., Krogstie, J.: An empirical investigation of attitude towards location-aware social network service. Int. J. Mobile Commun. 8(1), 53–70 (2010). doi:10.1504/IJMC.2010. 030520CrossRefGoogle Scholar
  24. 24.
    Krasnova, H., Günther, O., Spiekermann, S., Koroleva, K.: Privacy concerns and identity in online social networks. Identit. Inf. Soc. 2(1), 39–63 (2009). doi:10.1007/s12394-009-0019-1CrossRefGoogle Scholar
  25. 25.
    Kwon, O., Wen, Y.: An empirical study of the factors affecting social network service use. Comput. Hum. Behav. 26(2), 254–263 (2010). doi:10.1016/j.chb.2009.04.011CrossRefGoogle Scholar
  26. 26.
    Lampe, C., Ellison, N.B., Steinfield, C.: Changes in use and perception of Facebook. In: CSCW ’08: Proceedings of the ACM 2008 Conference on Computer Supported Cooperative Work, San Diego,, pp. 721–730 (2008). doi:10.1145/1460563.1460675Google Scholar
  27. 27.
    Larson, R., Csikszentmihalyi, M.: The experience sampling method. New Dir. Methodol. Soc. Behav. Sci. 15, 41–56 (1983)Google Scholar
  28. 28.
    Lewis, K., Kaufman, J., Christakis, N.: The Taste for privacy: an analysis of college student privacy settings in an online social network. J. Comput. Mediat. Commun. 14(1), 79–100 (2008). doi:10.1111/j. 1083-6101.2008.01432.xCrossRefGoogle Scholar
  29. 29.
    Lindamood, J., Heatherly, R., Kantarcioglu, M., Thuraisingham, B.: Inferring private information using social network data. In: WWW ’09: Proceedings of the 18th International World Wide Web Conference, Madrid, pp. 1145–1146 (2009). doi:10.1145/1526709.1526899Google Scholar
  30. 30.
    Mancini, C., Thomas, K., Rogers, Y., Price, B.A., Jedrzejczyk, L., Bandara, A.K., Joinson, A.N., Nuseibeh, B.: From spaces to places: emerging contexts in mobile privacy. In: Ubicomp ’09: Proceedings of the 11th International Conference on Ubiquitous Computing, Orlando, pp. 1–10 (2009) doi:10.1145/1620545.1620547Google Scholar
  31. 31.
    Nagle, F., Singh, L.: Can friends be trusted? Exploring privacy in online social networks. In: 2009 International Conference on Advances in Social Network Analysis and Mining (ASONAM), Athens, pp. 312–315 (2009). doi:10.1109/ASONAM.2009.61Google Scholar
  32. 32.
    Nazir, A., Raza, S., Chuah, C.N.: Unveiling Facebook: a measurement study of social network based applications. In: IMC ’08: Proceedings of the 8th ACM SIGCOMM Conference on Internet Measurement, Vouliagmeni, pp. 43–56 (2008). doi:10.1145/1452520.1452527Google Scholar
  33. 33.
    Pempek, T.A., Yermolayeva, Y.A., Calvert, S.L.: College students’ social networking experiences on Facebook. J. Appl. Dev. Psychol. 30(3), 227–238 (2009). doi:10.1016/j.appdev.2008.12.010CrossRefGoogle Scholar
  34. 34.
    Peterson, K., Siek, K.A.: Analysis of information disclosure on a social networking site. In Hutchison, D., Kanade, T., Kittler, J., Kleinberg, J.M., Mattern, F., Mitchell, J.C., Naor, M., Nierstrasz, O., Rangan, C.P., Steffen, B., Sudan, M., Terzopoulos, D., Tygar, D., Vardi, M.Y., Weikum, G., Ozok, A.A., Zaphiris, P. (eds.) Online Communities and Social Computing, vol. 5621, chapter 28, pp. 256–264. Springer, Berlin/Heidelberg (2009). doi:10.1007/978-3-642-02774-1_28 Google Scholar
  35. 35.
    Qiu, T., Feng, J., Ge, Z., Wang, J., Xu, J., Yates, J.: Listen to me if you can: tracking user experience of mobile network on social media. In: IMC ’10: Proceedings of the 10th Annual Conference on Internet Measurement, Melbourne, pp. 288–293 (2010). doi:10.1145/ 1879141.1879178Google Scholar
  36. 36.
    Rejaie, R., Torkjazi, M., Valafar, M., Willinger, W.: Sizing up online social networks. IEEE Netw. 24(5), 32–37 (2010). doi:10.1109/MNET. 2010.5578916CrossRefGoogle Scholar
  37. 37.
    Roblyer, M., McDaniel, M., Webb, M., Herman, J., Witty, J.V.: Findings on Facebook in higher education: a comparison of college faculty and student uses and perceptions of social networking sites. Int. High. Educ. 13(3), 134–140 (2010). doi:10.1016/j.iheduc.2010.03.002CrossRefGoogle Scholar
  38. 38.
    Sadeh, N., Hong, J., Cranor, L., Fette, I., Kelley, P., Prabaker, M., Rao, J.: Understanding and capturing people’s privacy policies in a mobile social networking application. Pers. Ubiquitous Comput. 13, 401–412 (2009). doi:10.1007/s00779-008-0214-3CrossRefGoogle Scholar
  39. 39.
    Schneider, F., Feldmann, A., Krishnamurthy, B., Willinger, W.: Understanding online social network usage from a network perspective. In: IMC ’09: Proceedings of the 9th ACM Internet Measurement Conference, Chicago, pp. 35–48 (2009). doi:10.1145/1644893.1644899Google Scholar
  40. 40.
    Stutzman, F., Duffield, J.K.: Friends only: examining a privacy-enhancing behavior in facebook. In: CHI ’10: Proceedings of the 28th International Conference on Human Factors in Computing Systems, Atlanta, pp. 1553–1562 (2010). doi:10.1145/1753326.1753559Google Scholar
  41. 41.
    Tsai, J.Y., Kelley, P., Drielsma, P., Cranor, L.F., Hong, J., Sadeh, N.: Who’s viewed you?: the impact of feedback in a mobile location-sharing application. In: CHI ’09: Proceedings of the 27th International Conference on Human Factors in Computing Systems, Boston, pp. 2003–2012 (2009). doi:10.1145/1518701.1519005Google Scholar
  42. 42.
    Valafar, M., Rejaie, R., Willinger. W.: Beyond friendship graphs: a study of user interactions in Flickr. In: WOSN ’09: Proceedings of the 2nd ACM Workshop on Online Social Networks, Barcelona, pp. 25–30 (2009). doi:10.1145/1592665.1592672Google Scholar
  43. 43.
    Viswanath, B., Mislove, A., Cha, M., Gummadi, K.P.: On the evolution of user interaction in Facebook. In: WOSN ’09: Proceedings of the 2nd ACM Workshop on Online Social Networks, Barcelona, pp. 37–42 (2009). doi:10.1145/1592665.1592675Google Scholar
  44. 44.
    Westin, A., Harris, L. & Associates: Equifax-Harris Consumer Privacy Survey. Conducted for Equifax Inc. (1991)Google Scholar
  45. 45.
    Wilson, C., Boe, B., Sala, A., Puttaswamy, K.P., Zhao, B.Y.: User interactions in social networks and their implications. In: Proceedings of the Fourth ACM European Conference on Computer Systems (EuroSys), Nuremberg, pp. 205–218 (2009). doi:10.1145/1519065.1519089Google Scholar
  46. 46.
    Ye, S., Wu, F.: Estimating the size of online social networks. In: Proceedings of the IEEE Second International Conference on Social Computing (SocialCom), Minneapolis, pp. 169–176 (2010). doi:10.1109/ SocialCom.2010.32Google Scholar
  47. 47.
    Young, A.L., Quan-Haase, A.: Information revelation and internet privacy concerns on social network sites: a case study of Facebook. In: C&T ’09: Proceedings of the Fourth International Conference on Communities and Technologies, University Park, pp. 265–274 (2009). doi:10.1145/1556460.1556499Google Scholar

Copyright information

© Springer-Verlag London 2012

Authors and Affiliations

  • Fehmi Ben Abdesslem
    • 1
  • Iain Parris
    • 1
  • Tristan Henderson
    • 1
  1. 1.School of Computer ScienceUniversity of St AndrewsSt AndrewsUK

Personalised recommendations