Abstract
Nowadays there is a boom in social network data streaming from various fields of interest related to finance, engineering, medicine, and general sciences. All these data are modeled as graphs for better analysis. Community detection is one such mechanism for the analysis of such massive data. Many community detection algorithms exist in literature. The existing algorithms are compared by using either real-world or artificial networks (modeled as graphs) but not both. This paper aims to make a comparative study of two popular existing community detection algorithms both on real-world and synthetic data and verify their performance. The approach in this paper makes good use of recent advances in graphical modeling of different social networks. We generated a random graph that represents most of the observed properties of a real-world dataset. The experimental results are tabulated and the computed metrics help in inferring the suitability or scalability of an algorithm for small or massive datasets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Girvan M, Newman MEJ (2002) Community structure in social and biological networks. Proc Nat Acad Sci 99(12):7821–7826
Fortunato S (2010) Community detection in graphs. Phys Rep 486(3):75–174
Sharma R, Oliveira S (2017) Community detection algorithm for big social networks using hybrid architecture. Big Data Res 10(2):44–522
Blondel VD, Guillaume JL, Lambiotte R, Lefebvre E (2008) Fast unfolding of communities in large networks. J Statistical Mech Theory Experim 2008(10):10008
Raghavan UN, Albert R, Kumara S (2007) Near linear time algorithm to detect community structures in large-scale networks. Phys Rev E 76(3):036106
Chakrabarti D, Kumar R, Tomkins A (2006) Evolutionary clustering. In: Proceedings of the 12th ACM SIGKDD international conference on knowledge discovery and data mining, Aug 2006, pp 554–560
Li X, Tan Y, Zhang Z, Tong Q (2016) Community detection in large social networks based on relationship density. In: IEEE 40th Annual computer software and applications conference (COMPSAC 2016), pp 524–533
Alamuri M, Surampudi BR, Negi A (2014) A survey of distance/similarity measures for categorical data. In: International joint conference on neural networks (IJCNN), vol 9(3), pp 1907–1914
Leskovec J, Kleinberg J, Faloutsos C (2005) Graphs over time: densification laws, shrinking diameters and possible explanations. In: Proceedings of the 11th ACM SIGKDD international conference on knowledge discovery in data mining, pp 177–187, 21 Aug 2005
Sumith N, Annappa B, Bhattacharya SA (2018) Holistic approach to influence maximization in social networks: STORIE. Appl Soft Comput 66(2):533–547
Clauset A, Newman ME, Moore C (2004) Finding community structure in very large networks. Phys Rev E 70(6):066111
Lancichinetti A, Kivelä M, Saramäki J, Fortunato S (2010) Characterizing the community structure of complex networks. PloS One 5(8):e11976
Leskovec J, Lang KJ, Mahoney M (2010) Empirical comparison of algorithms for network community detection. In: Proceedings of the 19th international conference on world wide web, pp 631–640
Tyler JR, Wilkinson DM, Huberman BA (2005) E-mail as spectroscopy: automated discovery of community structure within organizations. Inform Soc 21(2):143–153
Palla G, Derényi I, Farkas I, Vicsek T (2005) Uncovering the overlapping community structure of complex networks in nature and society. Nature 435(70):793–814
Pons P, Latapy M (2005) Computing communities in large networks using random walks. In: International symposium on computer and information sciences, pp 284–293, Oct 2005
Wagenseller P, Wang F, Wu W (2018) Size matters: a comparative analysis of community detection algorithms. IEEE Trans Comput Soc Syst 5(4):951–960
Zhao Z, Zheng S, Li C, Sun J, Chang L, Chiclana F (2018) A comparative study on community detection methods in complex networks. J Intell Fuzzy Syst 35(1):1077–1086
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Rajita, B.S.A.S., Kumari, D., Panda, S. (2020). A Comparative Analysis of Community Detection Methods in Massive Datasets. In: Goel, N., Hasan, S., Kalaichelvi, V. (eds) Modelling, Simulation and Intelligent Computing. MoSICom 2020. Lecture Notes in Electrical Engineering, vol 659. Springer, Singapore. https://doi.org/10.1007/978-981-15-4775-1_19
Download citation
DOI: https://doi.org/10.1007/978-981-15-4775-1_19
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-4774-4
Online ISBN: 978-981-15-4775-1
eBook Packages: Computer ScienceComputer Science (R0)