Skip to main content

A Comparative Analysis of Community Detection Methods in Massive Datasets

  • Conference paper
  • First Online:
Modelling, Simulation and Intelligent Computing (MoSICom 2020)

Abstract

Nowadays there is a boom in social network data streaming from various fields of interest related to finance, engineering, medicine, and general sciences. All these data are modeled as graphs for better analysis. Community detection is one such mechanism for the analysis of such massive data. Many community detection algorithms exist in literature. The existing algorithms are compared by using either real-world or artificial networks (modeled as graphs) but not both. This paper aims to make a comparative study of two popular existing community detection algorithms both on real-world and synthetic data and verify their performance. The approach in this paper makes good use of recent advances in graphical modeling of different social networks. We generated a random graph that represents most of the observed properties of a real-world dataset. The experimental results are tabulated and the computed metrics help in inferring the suitability or scalability of an algorithm for small or massive datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Girvan M, Newman MEJ (2002) Community structure in social and biological networks. Proc Nat Acad Sci 99(12):7821–7826

    Google Scholar 

  2. Fortunato S (2010) Community detection in graphs. Phys Rep 486(3):75–174

    Article  MathSciNet  Google Scholar 

  3. Sharma R, Oliveira S (2017) Community detection algorithm for big social networks using hybrid architecture. Big Data Res 10(2):44–522

    Article  Google Scholar 

  4. Blondel VD, Guillaume JL, Lambiotte R, Lefebvre E (2008) Fast unfolding of communities in large networks. J Statistical Mech Theory Experim 2008(10):10008

    Google Scholar 

  5. Raghavan UN, Albert R, Kumara S (2007) Near linear time algorithm to detect community structures in large-scale networks. Phys Rev E 76(3):036106

    Google Scholar 

  6. Chakrabarti D, Kumar R, Tomkins A (2006) Evolutionary clustering. In: Proceedings of the 12th ACM SIGKDD international conference on knowledge discovery and data mining, Aug 2006, pp 554–560

    Google Scholar 

  7. Li X, Tan Y, Zhang Z, Tong Q (2016) Community detection in large social networks based on relationship density. In: IEEE 40th Annual computer software and applications conference (COMPSAC 2016), pp 524–533

    Google Scholar 

  8. Alamuri M, Surampudi BR, Negi A (2014) A survey of distance/similarity measures for categorical data. In: International joint conference on neural networks (IJCNN), vol 9(3), pp 1907–1914

    Google Scholar 

  9. Leskovec J, Kleinberg J, Faloutsos C (2005) Graphs over time: densification laws, shrinking diameters and possible explanations. In: Proceedings of the 11th ACM SIGKDD international conference on knowledge discovery in data mining, pp 177–187, 21 Aug 2005

    Google Scholar 

  10. Sumith N, Annappa B, Bhattacharya SA (2018) Holistic approach to influence maximization in social networks: STORIE. Appl Soft Comput 66(2):533–547

    Google Scholar 

  11. Clauset A, Newman ME, Moore C (2004) Finding community structure in very large networks. Phys Rev E 70(6):066111

    Google Scholar 

  12. Lancichinetti A, Kivelä M, Saramäki J, Fortunato S (2010) Characterizing the community structure of complex networks. PloS One 5(8):e11976

    Google Scholar 

  13. Leskovec J, Lang KJ, Mahoney M (2010) Empirical comparison of algorithms for network community detection. In: Proceedings of the 19th international conference on world wide web, pp 631–640

    Google Scholar 

  14. Tyler JR, Wilkinson DM, Huberman BA (2005) E-mail as spectroscopy: automated discovery of community structure within organizations. Inform Soc 21(2):143–153

    Article  Google Scholar 

  15. Palla G, Derényi I, Farkas I, Vicsek T (2005) Uncovering the overlapping community structure of complex networks in nature and society. Nature 435(70):793–814

    Google Scholar 

  16. Pons P, Latapy M (2005) Computing communities in large networks using random walks. In: International symposium on computer and information sciences, pp 284–293, Oct 2005

    Google Scholar 

  17. Wagenseller P, Wang F, Wu W (2018) Size matters: a comparative analysis of community detection algorithms. IEEE Trans Comput Soc Syst 5(4):951–960

    Article  Google Scholar 

  18. Zhao Z, Zheng S, Li C, Sun J, Chang L, Chiclana F (2018) A comparative study on community detection methods in complex networks. J Intell Fuzzy Syst 35(1):1077–1086

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to B. S. A. S. Rajita .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Rajita, B.S.A.S., Kumari, D., Panda, S. (2020). A Comparative Analysis of Community Detection Methods in Massive Datasets. In: Goel, N., Hasan, S., Kalaichelvi, V. (eds) Modelling, Simulation and Intelligent Computing. MoSICom 2020. Lecture Notes in Electrical Engineering, vol 659. Springer, Singapore. https://doi.org/10.1007/978-981-15-4775-1_19

Download citation

  • DOI: https://doi.org/10.1007/978-981-15-4775-1_19

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-15-4774-4

  • Online ISBN: 978-981-15-4775-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics