Abstract
Community detection algorithms are the basic tools for discovering the internal structure and organizational principles of a community. Ranging from model-based and optimization-based methods, existing efforts typically consider two sources of information, i.e. network structure and node attributes, to obtain communities with both denser network structure and similar attribute information. We argue that an inherent drawback of such methods is that, different impacts of different sources, is ignored during the clustering process. Besides, some existing community detection algorithms typically consider two sources of information but they cannot automatically determine the relative importance between them to reveal subspaces. As such, the detected communities may be unsatisfactory.
In this work, we propose to integrate subspace into a new overlapping community detection framework, an Overlapping Community Detection with Subspaces on Double-Views (CDDV), which exploits the relative importance between structures and attributes. This leads to a better detection result, effectively injecting subspaces to show the diversity of communities into the detection process in an explicit manner. We conduct extensive experiments on four public benchmarks, demonstrating significant improvements over several state-of-the-art models. Further analysis verifies the importance of subspace finding for capturing better communities, justifying the rationality and effectiveness of CDDV.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Rezvani, M., Liang, W., Liu, C., et al.: Efficient detection of overlapping communities using asymmetric triangle cuts. IEEE Trans. Knowl. Data Eng. 30(11), 2093–2105 (2018)
Coscia, M., Rossetti, G., Giannotti, F., et al.: DEMON: a local-first discovery method for overlapping communities. In: KDD, pp. 615–623 (2012)
Yang, J., Leskovec, J.: Overlapping community detection at scale: a nonnegative matrix factorization approach. In: ACM International Conference on Web Search & Data Mining, pp. 587–596. ACM (2013)
Yamaguchi, Y., Hayashi, K.: When does label propagation fail? a view from a network generative model. In: IJCAI, pp. 3224–3230 (2017)
Whang, J.J., Dhillon, I.S., Gleich, D.F.: Non-exhaustive, overlapping K-means. In: Proceedings of the 2015 SIAM International Conference on Data Mining, pp. 936–944 (2015)
Jing, L., Ng, M.K., Huang, J.Z.: An entropy weighting k-means algorithm for subspace clustering of high-dimensional sparse data. IEEE Trans. Knowl. Data Eng. 19(8), 1026–1041 (2007)
Whang, J.J., Dhillon, I.S., Gleich, D.F.: Non-exhaustive, overlapping K-means. In: Proceedings of the 2015 SIAM International Conference on Data Mining, pp. 936–944 (2015)
Frank, M., Streich, A.P., Basin, D., et al.: Multi-assignment clustering for boolean data. J. Mach. Learn. Res. 13, 459–489 (2012)
Li, Y., Sha, C., Huang, X., et al.: Community detection in attributed graphs: an embedding approach. In: Thirty-Second AAAI Conference on Artificial Intelligence (2018)
Chen, X., Xu, X., Huang, J.Z., et al.: TW-K-means: automated two-level variable weighting clustering algorithm for multiview data. IEEE Trans. Knowl. Data Eng. 25(4), 932–944 (2013)
Ruan, Y., Fuhry, D., Parthasarathy, S.: Efficient community detection in large networks using content and links. In: WWW, pp. 1089–1098. ACM (2013)
Cohn, D., Hofmann, T.: The missing link-a probabilistic model of document content and hypertext connectivity. In: International Conference on Neural Information Processing Systems. MIT Press, pp. 430–436 (2000)
Tang, J., Qu, M., Wang, M., et al.: LINE: large-scale information network embedding. In: International Conference on World Wide Web. International World Wide Web Conferences Steering Committee, pp. 1067–1077 (2015)
Chen, L.: Privacy preserving adjacency spectral embedding on stochastic blockmodels (2019)
Berberidis, D., Nikolakopoulos, A.N., Giannakis, G.B.: Adaptive diffusions for scalable learning over graphs. IEEE Trans. Signal Process. 67(5), 1307–1321 (2019)
Lancichinetti, A., Fortunato, S.: Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. Phys. Rev. E 80(1), 016118 (2009)
Yang, Z., Li, Q., Liu, W., et al.: Dual graph regularized NMF model for social event detection from Flickr data. World Wide Web 20(5), 995–1015 (2017)
Acknowledgment
This work is supported by the National Natural Science Foundation of China (61762078, 61363058, 61966004, 61862058), Research Fund of Guangxi Key Lab of Multi-source Information Mining and Security (MIMS18-08), Northwest Normal University young teachers research capacity promotion plan (NWNU-LKQN2019-2) and Research Fund of Guangxi Key Laboratory of Trusted Software (kx202003).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Chang, Y., Ma, H., Su, Y., Li, Z. (2020). An Overlapping Community Detection with Subspaces on Double-Views. In: Yang, H., Pasupa, K., Leung, A.CS., Kwok, J.T., Chan, J.H., King, I. (eds) Neural Information Processing. ICONIP 2020. Lecture Notes in Computer Science(), vol 12533. Springer, Cham. https://doi.org/10.1007/978-3-030-63833-7_14
Download citation
DOI: https://doi.org/10.1007/978-3-030-63833-7_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-63832-0
Online ISBN: 978-3-030-63833-7
eBook Packages: Computer ScienceComputer Science (R0)