Semi-supervised policy recommendation for online social networks

Original Article

Abstract

Fine-grained policy settings in social networking sites are becoming increasingly important for managing user privacy. Incorrect privacy policy settings can easily leak private and personal information, while overly restrictive settings reduce the benefits of online social networks. The problem is further complicated by the growing adoption of social networks and the rapid growth in information uploading and sharing. Facilitating policy settings has attracted the attention of numerous access control and human–computer interaction researchers, and the proposed solutions range from usable interfaces for policy settings to automated policy settings. We propose a fine-grained policy recommendation system based on an iterative semi-supervised learning approach that leverages the propagation properties of the social graph. Active learning and social graph properties are used to select the most informative instances to be labeled as the training set. We implemented and tested our approach using both a participant-labeled Facebook dataset and the participants' real policy dataset extracted using the Facebook API. We compared our approach to supervised learning and random walk-based approaches. Our approach provided higher accuracy and precision on both datasets. Collaborative active learning further improved its performance. Moreover, the accuracy and precision of our approach were maintained when new friends were added to the social graph.
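
To make the general idea concrete, the following is a minimal sketch of graph-based label propagation combined with an uncertainty-driven choice of the next friend to label. It is not the algorithm evaluated in the article: the function names, the toy friend graph, the 0/1 "deny/allow" encoding, and the use of distance from 0.5 as the informativeness measure are all assumptions made for illustration.

```python
def propagate_labels(adjacency, seed_labels, iterations=100):
    """Spread allow/deny scores over a friend graph (illustrative only).

    adjacency   : dict mapping each friend to a list of connected friends
    seed_labels : dict mapping user-labeled friends to 1.0 (allow) or 0.0 (deny)
    Unlabeled friends start at 0.5 and repeatedly take the mean score of
    their neighbors; labeled friends stay clamped to the user's choice.
    """
    scores = {node: seed_labels.get(node, 0.5) for node in adjacency}
    for _ in range(iterations):
        updated = {}
        for node, neighbors in adjacency.items():
            if node in seed_labels:
                updated[node] = seed_labels[node]      # keep user labels fixed
            elif neighbors:
                updated[node] = sum(scores[n] for n in neighbors) / len(neighbors)
            else:
                updated[node] = scores[node]           # isolated node: unchanged
        scores = updated
    return scores


def most_uncertain(scores, seed_labels):
    """Pick the unlabeled friend whose score is closest to 0.5.

    This stands in for the 'most informative instance' to ask the user about;
    it is an assumed criterion, not the one used in the article.
    """
    candidates = [node for node in scores if node not in seed_labels]
    return min(candidates, key=lambda node: (abs(scores[node] - 0.5), node))


# Hypothetical friend graph: two tight groups joined by the edge c-d.
graph = {
    "a": ["b", "c"],
    "b": ["a", "c"],
    "c": ["a", "b", "d"],
    "d": ["c", "e", "f"],
    "e": ["d", "f"],
    "f": ["d", "e"],
}
seeds = {"a": 1.0, "f": 0.0}   # the user allows "a" and denies "f"

scores = propagate_labels(graph, seeds)
recommended = {node: ("allow" if s >= 0.5 else "deny") for node, s in scores.items()}
next_query = most_uncertain(scores, seeds)
print(recommended)
print("ask the user about:", next_query)
```

In this sketch, friends inside each group inherit the label of their labeled neighbor, while the friends on the bridge between the two groups end up with the least decisive scores and are therefore the ones proposed for manual labeling. After the user answers, the new label would be added to `seeds` and propagation rerun, which is the iterative loop the abstract describes in general terms.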

Copyright information

© Springer-Verlag Wien 2016

Authors and Affiliations

  1. College of Computing and Informatics, University of North Carolina at Charlotte, Charlotte, USA
