Density-Based Subspace Clustering in Heterogeneous Networks

  • Brigitte Boden
  • Martin Ester
  • Thomas Seidl
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8724)

Abstract

Many real-world data sets, like data from social media or bibliographic data, can be represented as heterogeneous networks with several vertex types. Often additional attributes are available for the vertices, such as keywords for a paper. Clustering vertices in such networks, and analyzing the complex interactions between clusters of different types, can provide useful insights into the structure of the data. To exploit the full information content of the data, clustering approaches should consider the connections in the network as well as the vertex attributes. We propose the density-based clustering model TCSC for the detection of clusters in heterogeneous networks that are densely connected in the network as well as in the attribute space. Unlike previous approaches for clustering heterogeneous networks, TCSC enables the detection of clusters that show similarity only in a subset of the attributes, which is more effective in the presence of a large number of attributes.
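
The idea of a cluster that is dense in the network and similar in only a subset of the attributes can be illustrated with a toy example. The following is a minimal sketch and not the TCSC model itself: the data, the function names, the `width` threshold, and the `min_degree` threshold are all hypothetical, and for brevity it covers a single vertex type only. It shows a vertex set that is well connected and similar in two of three attributes, which a full-space similarity check would reject.

```python
# Illustrative sketch only; these are NOT the definitions used by TCSC.
# Toy data: papers with three numeric attributes (hypothetical values).
attributes = {
    "p1": (0.10, 0.90, 0.20),
    "p2": (0.12, 0.10, 0.22),
    "p3": (0.11, 0.50, 0.19),
    "p4": (0.80, 0.52, 0.95),
}
# Undirected edges between vertices of the same type (hypothetical).
edges = {("p1", "p2"), ("p2", "p3"), ("p1", "p3"), ("p3", "p4")}


def neighbors(v):
    """Return all vertices that share an edge with v."""
    return {b if a == v else a for a, b in edges if v in (a, b)}


def similar_in_subspace(vertices, dims, width):
    """True if the value spread is at most `width` in every selected dimension."""
    return all(
        max(attributes[v][d] for v in vertices)
        - min(attributes[v][d] for v in vertices) <= width
        for d in dims
    )


def densely_connected(vertices, min_degree):
    """True if every vertex has at least `min_degree` neighbors inside the set."""
    return all(len(neighbors(v) & vertices) >= min_degree for v in vertices)


def relevant_subspace(vertices, width):
    """Return the dimensions in which the vertex set is similar."""
    n_dims = len(next(iter(attributes.values())))
    return tuple(
        d for d in range(n_dims) if similar_in_subspace(vertices, (d,), width)
    )


candidate = {"p1", "p2", "p3"}
subspace = relevant_subspace(candidate, width=0.05)
print("relevant dimensions:", subspace)
print("dense in the network:", densely_connected(candidate, min_degree=2))
print("similar in all dimensions:",
      similar_in_subspace(candidate, (0, 1, 2), width=0.05))
```

In this toy data the candidate set agrees closely in the first and third attribute but differs widely in the second, so only a subspace-aware check accepts it as a cluster.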

Copyright information

© Springer-Verlag Berlin Heidelberg 2014

Authors and Affiliations

  • Brigitte Boden (1)
  • Martin Ester (2)
  • Thomas Seidl (1)
  1. RWTH Aachen University, Aachen, Germany
  2. Simon Fraser University, Burnaby, Canada
