Skip to main content

Evaluating the Stability of Communities Found by Clustering Algorithms

  • Conference paper
  • 1253 Accesses

Part of the book series: Studies in Computational Intelligence ((SCI,volume 476))

Abstract

Since clustering algorithms identify communities even in networks that are not believed to exhibit clustering behavior, we are interested in evaluating the significance of the communities returned by these algorithms. As a proxy for significance, prior work has investigated stability: by how much do the clusters change when the network is perturbed? We describe a cheap and simple method for evaluating stability and test it on a variety of real-world and synthetic graphs. Rather than evaluating our method on a single clustering algorithm, we give results using three popular clustering algorithms and measuring the distance between computed clusterings via three different metrics. We consider the results in the context of known strengths and weaknesses of the three clustering algorithms and also compare our method against that of measuring modularity. Finally we give evidence that our method provides more information than does the modularity.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Barabasi, A.L., Albert, R.: Emergence of scaling in random networks. Science 286(5439), 509–512 (1999), doi:10.1126/science.286.5439.509

    Article  MathSciNet  Google Scholar 

  2. Ben-Hur, A., Elisseeff, A., Guyon, I.: A stability based method for discovering structure in clustered data. In: Pac. Symp. on Biocomput., pp. 6–17 (2002), http://view.ncbi.nlm.nih.gov/pubmed/11928511

  3. Blondel, V.D., Guillaume, J.L., Lambiotte, R., Lefebvre, E.: Fast unfolding of communities in large networks. J. of Stat. Mech.: Theory and Exp. 10, 8 (2008)

    Google Scholar 

  4. Blondel, V.D., Guillaume, J.L., Lambiotte, R., Lefebvre, E.: Louvain method: finding communities in large networks, https://sites.google.com/site/findcommunities/ (accessed: September 13, 2012)

  5. Clauset, A.: “Fast Modularity” community structure inference algorithm, http://cs.unm.edu/aaron/research/fastmodularity.htm (accessed: August 15 , 2012)

  6. Clauset, A., Newman, M.E.J., Moore, C.: Finding community structure in very large networks. Phys. Rev. E 70, 066111 (2004), doi:10.1103/PhysRevE.70.066111

    Google Scholar 

  7. Csardi, G., Nepusz, T.: The igraph software package for complex network research. Inter Journal Complex Systems 1695 (2006), http://igraph.sf.net

  8. Danon, L., Díaz-Guilera, A., Duch, J., Arenas, A.: Comparing community structure identification. J. of Stat. Mech.: Theory and Exp. 2005 09,008 (2005)

    Google Scholar 

  9. Davis, T.: University of Florida sparse matrix collection. NA Digest, v.92, n.42, Oct. 16, 1994 and NA Digest, v.96, n.28, Jul. 23, 1996, and NA Digest, v.97, n.23 (June 7, 1997)

    Google Scholar 

  10. Dongen, S.V.: Graph clustering by flow simulation. Ph.D. thesis, University of Utrecht (2000)

    Google Scholar 

  11. Dongen, S.V.: Performance criteria for graph clustering and markov cluster experiments. Tech. Rep. INS-R0012, CWI (Centre for Mathematics and Computer Science) (2000)

    Google Scholar 

  12. Dongen, S.V.: MCL-edge: analyzing networks with millions of nodes, http://micans.org/mcl (accessed: August 20, 2012)

  13. Erdos, P., Renyi, A.: On random graphs. Publ. Math (Debrecen) 6, 290 (1959)

    MathSciNet  Google Scholar 

  14. Fortunato, S.: Community detection in graphs. Phys. Rep. 486, 75–174 (2010), doi:10.1016/j.physrep.2009.11.002

    Article  MathSciNet  Google Scholar 

  15. Gfeller, D., Chappelier, J.C., De Los Rios, P.: Finding instabilities in the community structure of complex networks. Phys. Rev. E 72, 056135 (2005), doi:10.1103/PhysRevE.72.056135

    Google Scholar 

  16. Good, B.H., de Montjoye, Y.A., Clauset, A.: Performance of modularity maximization in practical contexts. Phys. Rev. E 81, 046,106 (2010)

    Google Scholar 

  17. Guimera, R., Sales-Pardo, M., Amaral, L.: Modularity from fluctuations in random graphs and complex networks. Phys. Rev. E 70(2), 025,101 (2004)

    Google Scholar 

  18. Hagberg, A.A., Schult, D.A., Swart, P.J.: Exploring network structure, dynamics, and function using NetworkX. In: Proc. of the 7th Python in Sci. Conf. (SciPy 2008), Pasadena, CA USA, pp. 11–15 (2008)

    Google Scholar 

  19. Hu, Y., Nie, Y., Yang, H., Cheng, J., Fan, Y., Di, Z.: Measuring the significance of community structure in complex networks. Phys. Rev. E 82, 066106 (2010), doi:10.1103/PhysRevE.82.066106

    Google Scholar 

  20. Jaccard, P.: Étude comparative de la distribution florale dans une portion des Alpes et des Jura. Bull. del la Soc. Vaud. des Sci. Nat. 37, 547–579 (1901)

    Google Scholar 

  21. Karrer, B., Levina, E., Newman, M.E.J.: Robustness of community structure in networks. Phys. Rev. E 77, 046,119 (2008), doi:10.1103/PhysRevE.77.046119

    Google Scholar 

  22. Kleinberg, J.M.: An impossibility theorem for clustering. In: Becker, S., Thrun, S., Obermayer, K. (eds.) NIPS, pp. 446–453. MIT Press (2002)

    Google Scholar 

  23. Leskovec, J., Lang, K.J., Mahoney, M.: Empirical comparison of algorithms for network community detection. In: Proc. of the 19th Intl. Conf. on World Wide Web (WWW 2010), pp. 631–640. ACM, New York (2010), doi:10.1145/1772690.1772755

    Chapter  Google Scholar 

  24. Levine, E., Domany, E.: Resampling method for unsupervised estimation of cluster validity. Neural Comput. 13(11), 2573–2593 (2001), doi:10.1162/089976601753196030

    Article  MATH  Google Scholar 

  25. Liben-Nowell, D., Kleinberg, J.: The link prediction problem for social networks. In: Proc. of the Twelfth Intl. Conf. on Inf. and Knowl. Manag. (CIKM 2003), pp. 556–559. ACM, New York (2003), doi:10.1145/956863.956972

    Google Scholar 

  26. Meilă, M.: Comparing clusterings — an information based distance. J. of Multivar. Anal. 98(5), 873–895 (2007), doi:10.1016/j.jmva.2006.11.013

    Article  MATH  Google Scholar 

  27. Mirshahvalad, A., Lindholm, J., Derlén, M., Rosvall, M.: Significant communities in large sparse networks. PLoS ONE 7(3), e33721 (2012), doi:10.1371/journal.pone.0033721

    Google Scholar 

  28. Moradi, F., Olovsson, T., Tsigas, P.: An Evaluation of Community Detection Algorithms on Large-Scale Email Traffic. In: Klasing, R. (ed.) SEA 2012. LNCS, vol. 7276, pp. 283–294. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  29. Newman, M.E.J.: Fast algorithm for detecting community structure in networks. Phys. Rev. E 69, 066133 (2004), doi:10.1103/PhysRevE.69.066133

    Google Scholar 

  30. Newman, M.E.J.: Modularity and community structure in networks. Proc. of the Natl. Acad. of Sci. 103(23), 8577–8582 (2006), doi:10.1073/pnas.0601602103

    Article  Google Scholar 

  31. Newman, M.E.J., Girvan, M.: Finding and evaluating community structure in networks. Phys. Rev. E 69, 026113 (2004)

    Google Scholar 

  32. Rosvall, M., Bergstrom, C.T.: Maps of random walks on complex networks reveal community structure. Proc. of the Natl. Acad. of Sci. 105(4), 1118–1123 (2008), doi:10.1073/pnas.0706851105

    Article  Google Scholar 

  33. Rosvall, M., Bergstrom, C.T.: Mapping change in large networks. PLoS ONE 5(1), e8694 (2010), doi:10.1371/journal.pone.0008694

    Google Scholar 

  34. Vieira, V.: A Comparison of Methods for Community Detection in Large Scale Networks. In: Menezes, R., Evsukoff, A., González, M.C. (eds.) Complex Networks. SCI, vol. 424, pp. 75–86. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  35. Watts, D.J., Strogatz, S.H.: Collective dynamics of ‘small-world’ networks. Nature 393(6684), 440–442 (1998), doi:10.1038/30918

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tzu-Yi Chen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Chen, TY., Fields, E. (2013). Evaluating the Stability of Communities Found by Clustering Algorithms. In: Ghoshal, G., Poncela-Casasnovas, J., Tolksdorf, R. (eds) Complex Networks IV. Studies in Computational Intelligence, vol 476. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-36844-8_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-36844-8_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-36843-1

  • Online ISBN: 978-3-642-36844-8

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics