Detection of Communities and Bridges in Weighted Networks

  • Tanwistha Saha
  • Carlotta Domeniconi
  • Huzefa Rangwala
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6871)

Abstract

Traditional graph-based clustering methods group vertices into non-intersecting clusters under the assumption that each vertex can belong to only a single cluster. On the other hand, recent research on graph-based clustering methods, applied to real world networks (e.g., protein-protein interaction networks and social networks), shows overlapping patterns among the underlying clusters. For example, in social networks, an individual is expected to belong to multiple clusters (or communities), rather than strictly confining himself/herself to just one. As such, overlapping clusters enable better models of real-life phenomena. Soft clustering (e.g., fuzzy c-means) has been used with success for network data as well as non-graph data, when the objects are allowed to belong to multiple clusters with a certain degree of membership. In this paper, we propose a fuzzy clustering based approach for community detection in a weighted graphical representation of social and biological networks, for which the ground truth associated to the nodes is available. We compare our results with a baseline method for both multi-labeled and single-labeled datasets.

Keywords

Ground Truth Fuzzy Cluster Community Detection Multiple Cluster Bridge Node 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Asur, S., Ucar, D., Parthasarathy, S.: An ensemble framework for clustering protein–protein interaction networks. Bioinformatics 23(13), i29 (2007)CrossRefGoogle Scholar
  2. 2.
    Bezdek, J.: Fuzzy mathematics in pattern classification. Unpublished Ph. D. dissertation, Cornell University, Ithaca, NY (1973)Google Scholar
  3. 3.
    Chen, J., Zaiane, O., Goebel, R.: Detecting communities in social networks using max-min modularity. In: SDM 2009, pp. 978–989 (2009)Google Scholar
  4. 4.
    Clauset, A., Newman, M., Moore, C.: Finding community structure in very large networks. Physical Review E 70(6), 66111 (2004)CrossRefGoogle Scholar
  5. 5.
    Dhillon, I., Guan, Y., Kulis, B.: Weighted graph cuts without eigenvectors a multilevel approach. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1944–1957 (2007)Google Scholar
  6. 6.
    Duggal, G., Navlakha, S., Girvan, M., Kingsford, C.: Uncovering Many Views of Biological Networks Using Ensembles of Near-Optimal Partitions. In: Proceedings of MultiClust: 1st International Workshop on Discovering, Summarizing and Using Multiple Clusterings, KDD (2010)Google Scholar
  7. 7.
    Girvan, M., Newman, M.: Community structure in social and biological networks. Proceedings of the National Academy of Sciences of the United States of America 99(12), 7821 (2002)MathSciNetCrossRefMATHGoogle Scholar
  8. 8.
    Gregory, S.: An algorithm to find overlapping community structure in networks. In: Kok, J.N., Koronacki, J., Lopez de Mantaras, R., Matwin, S., Mladenič, D., Skowron, A. (eds.) PKDD 2007. LNCS (LNAI), vol. 4702, pp. 91–102. Springer, Heidelberg (2007)CrossRefGoogle Scholar
  9. 9.
    Gunnemann, S., Seidl, T.: Subgraph Mining on Directed and Weighted Graphs. In: Advances in Knowledge Discovery and Data Mining, pp. 133–146 (2010)Google Scholar
  10. 10.
    Heller, K., Williamson, S., Ghahramani, Z.: Statistical models for partial membership. In: Proceedings of the 25th International Conference on Machine learning, pp. 392–399. ACM, New York (2008)Google Scholar
  11. 11.
    Henderson, K., Eliassi-Rad, T., Papdimitriou, S., Faloutsos, C.: HCDF: A hybrid community discovery framework. In: Proceedings of the 2010 SIAM Conference on Data Mining (SDM 2010), Columbus, OH (2010)Google Scholar
  12. 12.
    Hoeppner, F.: Fuzzy cluster analysis: methods for classification, data analysis, and image recognition. Wiley, Chichester (1999)Google Scholar
  13. 13.
    Hoff, P.: Random effects models for network data. In: Dynamic social network modeling and analysis: Workshop summary and papers, pp. 303–312 (2003)Google Scholar
  14. 14.
    Hong, T., Lin, K., Wang, S.: Fuzzy data mining for interesting generalized association rules* 1. Fuzzy sets and systems 138(2), 255–269 (2003)MathSciNetCrossRefGoogle Scholar
  15. 15.
    Karypis, G., Kumar, V.: Parallel multilevel k-way partitioning scheme for irregular graphs. Proceedings of the 1996 ACM/IEEE Conference on Supercomputing, 35–35 (1996)Google Scholar
  16. 16.
    Long, B., Wu, X., Zhang, Z., Yu, P.: Unsupervised learning on k-partite graphs. In: Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 317–326. ACM, New York (2006)CrossRefGoogle Scholar
  17. 17.
    Ma, X., Gao, L., Yong, X., Fu, L.: Semi-supervised clustering algorithm for community structure detection in complex networks. Physica A: Statistical Mechanics and its Applications 389(1), 187–197 (2010)CrossRefGoogle Scholar
  18. 18.
    Nepusz, T., Petróczi, A., Bazsó, F.: Fuzzy Clustering and the Concept of Bridgedness in Social Networks. In: Proceedings of the International Workshop and Conference on Network Science, NetSci (2007)Google Scholar
  19. 19.
    Nepusz, T., Petróczi, A., Négyessy, L., Bazsó, F.: Fuzzy communities and the concept of bridgeness in complex networks. Physical Review E 77(1), 16107 (2008)MathSciNetCrossRefGoogle Scholar
  20. 20.
    Newman, M.: Scientific collaboration networks. I. Network construction and fundamental results. Physical Review E 64(1), 16131 (2001)CrossRefGoogle Scholar
  21. 21.
    Newman, M.: Analysis of weighted networks. Physical Review E 70(5), 56131 (2004)CrossRefGoogle Scholar
  22. 22.
    Newman, M., Girvan, M.: Finding and evaluating community structure in networks. Physical review E 69(2), 26113 (2004)CrossRefGoogle Scholar
  23. 23.
    Palla, G., Derényi, I., Farkas, I., Vicsek, T.: Uncovering the overlapping community structure of complex networks in nature and society. Nature 435(7043), 814–818 (2005)CrossRefGoogle Scholar
  24. 24.
    Ravasz, E., Somera, A., Mongru, D., Oltvai, Z., Barabási, A.: Hierarchical organization of modularity in metabolic networks. Science 297(5586), 1551 (2002)CrossRefGoogle Scholar
  25. 25.
    Reichardt, J., Bornholdt, S.: Detecting fuzzy community structures in complex networks with a Potts model. Physical Review Letters 93(21), 218701 (2004)CrossRefGoogle Scholar
  26. 26.
    Ruan, J., Zhang, W.: An efficient spectral algorithm for network community discovery and its applications to biological and social networks. In: Seventh IEEE International Conference on Data Mining, ICDM 2007. pp. 643–648. IEEE, Los Alamitos (2008)Google Scholar
  27. 27.
    Sawardecker, E., Sales-Pardo, M., Amaral, L.: Detection of node group membership in networks with group overlap. The European Physical Journal B 67(3), 277–284 (2008)CrossRefMATHGoogle Scholar
  28. 28.
    Stark, C., Breitkreutz, B., Reguly, T., Boucher, L., Breitkreutz, A., Tyers, M.: BioGRID: a general repository for interaction datasets. Nucleic acids research 34(suppl 1), D535 (2006)CrossRefGoogle Scholar
  29. 29.
    Thurman, B.: In the office: Networks and coalitions* 1. Social Networks 2(1), 47–63 (1980)CrossRefGoogle Scholar
  30. 30.
    Ucar, D., Asur, S., Catalyurek, U., Parthasarathy, S.: Improving functional modularity in protein-protein interactions graphs using hub-induced subgraphs. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) PKDD 2006. LNCS (LNAI), vol. 4213, pp. 371–382. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  31. 31.
    White, S., Smyth, P.: A spectral clustering approach to finding communities in graphs. In: Proceedings of the Fifth SIAM International Conference on Data Mining. p. 274. Society for Industrial Mathematics (2005)Google Scholar
  32. 32.
    Zachary, W.: An information flow model for conflict and fission in small groups. Journal of Anthropological Research 33(4), 452–473 (1977)CrossRefGoogle Scholar
  33. 33.
    Zadeh, L.: Fuzzy sets*. Information and control 8(3), 338–353 (1965)MathSciNetCrossRefMATHGoogle Scholar
  34. 34.
    Zhang, S., Wang, R., Zhang, X.: Identification of overlapping community structure in complex networks using fuzzy c-means clustering. Physica A: Statistical Mechanics and its Applications 374(1), 483–490 (2007)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Tanwistha Saha
    • 1
  • Carlotta Domeniconi
    • 1
  • Huzefa Rangwala
    • 1
  1. 1.Department of Computer ScienceGeorge Mason UniversityFairfaxUSA

Personalised recommendations