Encyclopedia of Social Network Analysis and Mining

2014 Edition
Editors: Reda Alhajj, Jon Rokne

Similarity Metrics on Social Networks

  • Cuneyt Gurcan Akcora
  • Elena Ferrari
Reference work entry
DOI: https://doi.org/10.1007/978-1-4614-6170-8_252




Homophily

Tendency to create friendships with similar people

Undirected Network

Network where relationships are created by mutual consent of the two involved users

Profile Data

User-uploaded text-based personal information on social networks


In the last decade, online social networks have gained millions of users who create terabytes of personal data every day (Ellison et al. 2007). With this volume of data, it quickly becomes impractical to analyze the whole network to solve user-specific problems, such as finding communities of users or classifying them according to specific criteria. Most of these computations rest on computing the similarity between social network users. In this entry, we show how similarity can be computed with a family of metrics that provide fast, local, and efficient solutions to the question of computing user...
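
As a rough illustration of what a "local" similarity computation can look like, the following Python sketch scores two users from their immediate friend sets only, without traversing the rest of the network. The choice of measures (common neighbors, the Jaccard coefficient, and an Adamic-Adar style score in the spirit of Adamic and Adar 2003) is an assumption made for illustration and is not necessarily the family of metrics the full entry presents. The toy graph `friends` and all function names are hypothetical.

```python
import math

# Hypothetical toy undirected network: each user maps to the set of their friends.
# Friendship is mutual, so every edge appears in both users' sets.
friends = {
    "ann": {"bob", "cat", "dan"},
    "bob": {"ann", "cat"},
    "cat": {"ann", "bob", "dan", "eve"},
    "dan": {"ann", "cat"},
    "eve": {"cat"},
}


def common_neighbors(graph, u, v):
    """Number of friends that users u and v share."""
    return len(graph[u] & graph[v])


def jaccard(graph, u, v):
    """Shared friends divided by all distinct friends of u and v."""
    union = graph[u] | graph[v]
    return len(graph[u] & graph[v]) / len(union) if union else 0.0


def adamic_adar(graph, u, v):
    """Sum over shared friends z of 1 / log(degree of z).

    Shared friends with few connections contribute more than hubs.
    Degree-1 neighbors are skipped to avoid dividing by log(1) = 0.
    """
    return sum(
        1.0 / math.log(len(graph[z]))
        for z in graph[u] & graph[v]
        if len(graph[z]) > 1
    )


if __name__ == "__main__":
    for pair in [("bob", "dan"), ("ann", "eve")]:
        print(pair,
              common_neighbors(friends, *pair),
              round(jaccard(friends, *pair), 3),
              round(adamic_adar(friends, *pair), 3))
```

Each score reads only the two friend sets involved (plus the degrees of shared friends), which is why measures of this kind stay cheap even when the full network is too large to analyze.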


  1. Adamic L, Adar E (2003) Friends and neighbors on the web. Soc Netw 25(3):211–230
  2. Akcora C, Carminati B, Ferrari E (2011) Network and profile based measures for user similarities on social networks. In: 2011 IEEE international conference on information reuse and integration (IRI), Las Vegas. IEEE, pp 292–298
  3. Akcora C, Carminati B, Ferrari E (2012) Privacy in social networks: how risky is your social graph? In: 2012 IEEE 28th international conference on data engineering (ICDE), Washington, DC. IEEE, pp 9–19
  4. Anderson A, Huttenlocher D, Kleinberg J, Leskovec J (2012) Effects of user similarity in social media. In: Proceedings of the fifth ACM international conference on web search and data mining, Seattle. ACM, pp 703–712
  5. Barabási A, Albert R (1999) Emergence of scaling in random networks. Science 286(5439):509–512
  6. Bhattacharyya P, Garg A, Wu S (2010) Analysis of user keyword similarity in online social networks. Soc Netw Anal Min 1:1–16
  7. Boriah S, Chandola V, Kumar V (2008) Similarity measures for categorical data: a comparative evaluation. SIAM 30(2):243–254
  8. Bouma G (2009) Normalized (pointwise) mutual information in collocation extraction. In: Proceedings of GSCL conference, Potsdam, Germany, pp 31–40
  9. Cristani M, Cuel R (2005) A survey on ontology creation methodologies. Int J Semant Web Inf Syst 1(2):49–69
  10. Cukierski W, Hamner B, Yang B (2011) Graph-based features for supervised link prediction. In: The 2011 international joint conference on neural networks (IJCNN), San Jose. IEEE, pp 1237–1244
  11. De Meo P, Ferrara E, Fiumara G (2011) Finding similar users in facebook. In: Social networking and community behavior modeling: qualitative and quantitative measurement. IGI Publishing, Hershey, Pennsylvania (USA), vol 4, pp 1–26
  12. Ellison N et al (2007) Social network sites: definition, history, and scholarship. J Comput-Mediat Commun 13(1):210–230
  13. Ha V, Haddawy P (2003) Similarity of personal preferences: theoretical foundations and empirical analysis. Artif Intell 146(2):149–173
  14. Han J, Kamber M, Pei J (2006) Data mining: concepts and techniques. Morgan Kaufmann, San Francisco
  15. Huang Z, Li X, Chen H (2005) Link prediction approach to collaborative filtering. In: Proceedings of the 5th ACM/IEEE-CS joint conference on digital libraries, Denver. ACM, pp 141–142
  16. Jin L, Takabi H, Joshi J (2011) Towards active detection of identity clone attacks on online social networks. In: Proceedings of the first ACM conference on data and application security and privacy, San Antonio. ACM, pp 27–38
  17. Jung J, Euzenat J (2007) Towards semantic social networks. Semant Web: Res Appl 1:267–280
  18. Katz L (1953) A new status index derived from sociometric analysis. Psychometrika 18(1):39–43
  19. Leroy V, Cambazoglu B, Bonchi F (2010) Cold start link prediction. In: Proceedings of the 16th ACM SIGKDD international conference on knowledge discovery and data mining, Washington, DC. ACM, pp 393–402
  20. Liben-Nowell D, Kleinberg J (2007) The link-prediction problem for social networks. J Am Soc Inf Sci Technol 58(7):1019–1031
  21. Lin D (1998) An information-theoretic definition of similarity. In: Proceedings of the 15th international conference on machine learning, San Francisco, vol 1, pp 296–304
  22. McGill M (1979) An evaluation of factors affecting document ranking by information retrieval systems. Syracuse Univ., NY. School of Information Studies
  23. McPherson M, Smith-Lovin L, Cook J (2001) Birds of a feather: homophily in social networks. Ann Rev Sociol 27:415–444
  24. Melville P, Sindhwani V (2010) Recommender systems. In: Encyclopedia of machine learning, Springer Science+Business Media. LLC 2011. New York, vol 1, pp 829–837
  25. Mika P (2005) Ontologies are us: a unified model of social networks and semantics. In: The semantic web-ISWC 2005, Galway, vol 4, pp 522–536
  26. Richter M (2007) Foundations of similarity and utility. In: The 20th international FLAIRS conference, Key West
  27. Spertus E, Sahami M, Buyukkokten O (2005) Evaluating similarity measures: a large-scale study in the orkut social network. In: Proceedings of the 11th SIGKDD, Chicago. ACM, pp 678–684
  28. Tan P, Steinbach M, Kumar V et al (2006) Introduction to data mining. Pearson/Addison Wesley, Boston
  29. Zheleva E, Getoor L, Golbeck J, Kuter U (2010) Using friendship ties and family circles for link prediction. Advances in social network mining and analysis, Springer Berlin Heidelberg. pp 97–113

Recommended Reading

  1. Birds of a feather: Homophily in social networks (McPherson et al. 2001)
  2. Effects of user similarity in social media (Anderson et al. 2012)
  3. Evaluating similarity measures: a large-scale study in the orkut social network (Spertus et al. 2005)
  4. Finding similar users in facebook (De Meo et al. 2011)
  5. Link prediction approach to collaborative filtering (Huang et al. 2005)

Copyright information

© Springer Science+Business Media New York 2014

Authors and Affiliations

  • Cuneyt Gurcan Akcora (1)
  • Elena Ferrari (1)
  1. DISTA, Università degli Studi dell'Insubria, Varese, Italy