Encyclopedia of Social Network Analysis and Mining

2018 Edition
Editors: Reda Alhajj, Jon Rokne

Data Mining Techniques for Social Networks Analysis

  • Karan Aggarwal
  • Komal Kapoor
  • Jaideep Srivastava
Reference work entry
DOI: https://doi.org/10.1007/978-1-4939-7131-2_56




Community

Groups of individuals in a network such that the nodes in a group are more densely connected to each other than to nodes outside the group

Data mining

Extraction of knowledge from data


Homophily

Tendency of individuals to form connections with others who are similar to them

Social influence

The influence of a node in a network on its direct and indirect neighbors

Social media

Web and mobile technologies used to facilitate interactions among individuals

Social network

A set of individuals related to each other based on a relationship of interest
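The glossary's notion of a densely connected group can be made concrete with a small calculation. The sketch below is only an illustration under stated assumptions: the toy friendship network, the names in it, and the helper functions `internal_density` and `external_density` are hypothetical and do not come from the entry. It compares the fraction of linked pairs inside a candidate group with the fraction of linked pairs between the group and everyone else.

```python
from itertools import combinations

# Hypothetical toy friendship network (assumption, not data from the entry):
# undirected ties between named individuals.
edges = {
    ("ann", "bob"), ("ann", "cat"), ("bob", "cat"),  # tightly knit trio
    ("dan", "eve"), ("dan", "fay"), ("eve", "fay"),  # second trio
    ("cat", "dan"),                                  # single bridge between them
}


def connected(u, v):
    """Return True if u and v share a tie, in either stored order."""
    return (u, v) in edges or (v, u) in edges


def internal_density(group):
    """Fraction of possible within-group pairs that are actually linked."""
    pairs = list(combinations(sorted(group), 2))
    return sum(connected(u, v) for u, v in pairs) / len(pairs)


def external_density(group, nodes):
    """Fraction of possible group-to-outsider pairs that are actually linked."""
    outside = nodes - group
    pairs = [(u, v) for u in group for v in outside]
    return sum(connected(u, v) for u, v in pairs) / len(pairs)


nodes = {n for edge in edges for n in edge}
group = {"ann", "bob", "cat"}

print(internal_density(group))         # 3 of 3 within-group pairs are linked
print(external_density(group, nodes))  # 1 of 9 group-to-outsider pairs is linked
```

A group whose internal density clearly exceeds its external density, as here, matches the informal definition; real community detection methods search for such groups rather than checking a given one.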


A social network is defined as a set of individuals related to each other based on a relationship of interest, such as friendship, advisory, co-location, and trust. Social network analysis is the study of behaviors and properties of these networked individuals. The interest of the data mining community in social network analysis...




We hereby acknowledge all the past and present members of the Data Mining Research Lab at the University of Minnesota, Twin Cities, namely, Aarti Sathyanarayana, Ankit Sharma, Bhavtosh Rath, Kartik Singhal, Kyong Jin Shim, Muhammad Ahmad, Nishith Pathak, Colin DeLong, Amogh Mahapatra, Zoheb Borbora, Atanu Roy, and Chandrima Sarkar.



Recommended Reading

  1. Elsner U (1997) Graph partitioning: a survey. Technical report 97–27. Technische Universität Chemnitz, Chemnitz
  2. Fortunato S (2010) Community detection in graphs. Phys Rep 486:75–174
  3. Kleinberg J (2007) Cascading behavior in networks: algorithmic and economic issues. In: Algorithmic game theory. Cambridge University Press, Cambridge, pp 613–632
  4. Wortman J (2008) Viral marketing and the diffusion of trends on social networks. Technical report MS-CIS-08-19. Department of Computer and Information Science, University of Pennsylvania

Copyright information

© Springer Science+Business Media LLC, part of Springer Nature 2018

Authors and Affiliations

  • Karan Aggarwal (1)
  • Komal Kapoor (1)
  • Jaideep Srivastava (1)
  1. Department of Computer Science and Engineering, University of Minnesota, Minneapolis, USA

Section editors and affiliations

  • Talel Abdessalem (1)
  • Rokia Missaoui (2)
  1. Telecom ParisTech, Paris, France
  2. Department of Computer Science and Engineering, Université du Québec en Outaouais (UQO), Gatineau, Canada