Skip to main content

How Much Is Too Much? Leveraging Ads Audience Estimation to Evaluate Public Profile Uniqueness

  • Conference paper
Privacy Enhancing Technologies (PETS 2013)

Part of the book series: Lecture Notes in Computer Science ((LNSC,volume 7981))

Included in the following conference series:

Abstract

This paper addresses the important goal of quantifying the threat of linking external records to public Online Social Networks (OSN) user profiles, by providing a method to estimate the uniqueness of such profiles and by studying the amount of information carried by public profile attributes. Our first contribution is to leverage the Ads audience estimation platform of a major OSN to compute the information surprisal (IS) based uniqueness of public profiles, independently from the used profiles dataset. Then, we measure the quantity of information carried by the revealed attributes and evaluate the impact of the public release of selected combinations of these attributes on the potential to identify user profiles. Our measurement results, based on an unbiased sample of more than 400 thousand Facebook public profiles, show that, when disclosed in such profiles, current city has the highest individual attribute potential for unique identification and the combination of gender, current city and age can identify close to 55% of users to within a group of 20 and uniquely identify around 18% of users. We envisage the use of our methodology to assist both OSNs in designing better anonymization strategies when releasing user records and users to evaluate the potential for external parties to uniquely identify their public profiles and hence make it easier to link them with other data sources.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 49.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. How microsoft and yahoo are selling politicians access to you, http://goo.gl/90d6j

  2. Acquisti, A., Gross, R.: Predicting Social Security Numbers from Public Data. Proceedings of the National Academy of Sciences 106 (July 2009)

    Google Scholar 

  3. Blum, A., Dwork, C., McSherry, F., Nissim, K.: Practical Privacy: the SuLQ Framework. In: PODS (2005)

    Google Scholar 

  4. Calandrino, J., Kilzer, A., Narayanan, A., Felten, E., Shmatikov, V.: You might also like: Privacy risks of collaborative filtering. In: S&P (2011)

    Google Scholar 

  5. Chaabane, A., Acs, G., Kaafar, M.-A.: You Are What You Like! Information Leakage Through Users’ Interests. In: NDSS (2012)

    Google Scholar 

  6. Chen, T., Kaafar, M.A., Boreli, R.: Is More Always Merrier? A Deep Dive Into Online Social Footprints. In: SIGCOMM WOSN (2012)

    Google Scholar 

  7. Dwork, C.: Differential Privacy: a Survey of Results. In: Agrawal, M., Du, D.-Z., Duan, Z., Li, A. (eds.) TAMC 2008. LNCS, vol. 4978, pp. 1–19. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  8. Eckersley, P.: How Unique is Your Web Browser? In: Atallah, M.J., Hopper, N.J. (eds.) PETS 2010. LNCS, vol. 6205, pp. 1–18. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  9. Gale, W.A., Sampson, G.: Good-turning Frequency Estimation without Tears. Journal of Quantitative Linguistics (1995)

    Google Scholar 

  10. Gjoka, M., Kurant, M., Butts, C.T., Markopoulou, A.: Walking in facebook: A case study of unbiased sampling of osns. In: INFOCOM (2010)

    Google Scholar 

  11. Golle, P.: Revisiting the Uniqueness of Simple Demographics in the US Population. In: WPES (2006)

    Google Scholar 

  12. Irani, D., Webb, S., Li, K., Pu, C.: Large Online Social Footprints–An Emerging Threat. In: SIAM CS&E (2009)

    Google Scholar 

  13. Korolova, A.: Privacy Violations Using Microtargeted Ads: A Case Study. In: CDMW (2010)

    Google Scholar 

  14. Krishnamurthy, B., Wills, C.: On the Leakage of Personally Identifiable Information via Online Social Networks. In: ACM SIGCOMM CCR (2010)

    Google Scholar 

  15. Krishnamurthy, B., Wills, C.E.: Privacy Leakage in Mobile Online Social Networks. In: SIGCOMM WOSN (2010)

    Google Scholar 

  16. Lindamood, J., Heatherly, R., Kantarcioglu, M., Thuraisingham, B.: Inferring private information using social network data. In: WWW (2009)

    Google Scholar 

  17. McSherry, F., Mironov, I.: Differentially Private Recommender Systems: Building Privacy into the Netflix Prize Contenders. In: KDD (2009)

    Google Scholar 

  18. Mislove, A., Viswanath, B., Gummadi, K.P., Druschel, P.: You are Who You Know: Inferring User Profiles in Online Social Networks. In: WSDM (2010)

    Google Scholar 

  19. Narayanan, A., Shmatikov, V.: Robust De-anonymization of Large Sparse Datasets. In: S&P (2008)

    Google Scholar 

  20. Narayanan, A., Shmatikov, V.: De-anonymizing social networks. In: S&P (2009)

    Google Scholar 

  21. Perito, D., Castelluccia, C., Kaafar, M.A., Manils, P.: How Unique and Traceable Are Usernames? In: Fischer-Hübner, S., Hopper, N. (eds.) PETS 2011. LNCS, vol. 6794, pp. 1–17. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  22. Dey, R., Jelveh, Z., Ross, K.W.: Facebook Users Have Become Much More Private: A Large-Scale Study. In: WSSN (2012)

    Google Scholar 

  23. Sweeney, L.: Uniqueness of Simple Demographics in the U.S. Population. LIDAP-WP4 Carnegie Mellon University (2000)

    Google Scholar 

  24. Sweeney, L.: k-anonymity: a Model for Protecting Privacy. Int. J. Uncertain. Fuzziness Knowl.-Based Syst. (October 2002)

    Google Scholar 

  25. Yen, T.-F., Xie, Y., Yu, F., Yu, R.P., Abadi, M.: Host Fingerprinting and Tracking on the Web: Privacy and Security Implications. In: NDSS (2012)

    Google Scholar 

  26. Zheleva, E., Getoor, L.: To Join or not to Join: the Illusion of Privacy in Social Networks with Mixed Public and Private user Profiles. In: WWW (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Chen, T., Chaabane, A., Tournoux, P.U., Kaafar, MA., Boreli, R. (2013). How Much Is Too Much? Leveraging Ads Audience Estimation to Evaluate Public Profile Uniqueness. In: De Cristofaro, E., Wright, M. (eds) Privacy Enhancing Technologies. PETS 2013. Lecture Notes in Computer Science, vol 7981. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39077-7_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-39077-7_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-39076-0

  • Online ISBN: 978-3-642-39077-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics