Abstract
The last decade has witnessed the advance of overlapping community detection based on local expansion. In this paper, we propose a novel local expanding-based overlapping community detection algorithm, denoted by Core and Bridge Seeds Extension, that aims to improve the quality of communities. Instead of the traditional approaches to select the cores of communities as seeds, a new Core-Bridge triplet strategy is suggested to select seeds to generate the initial backbone and framework of the community. In the optimization stage, a stepwise refinement approach is adopted to solve the issue of unreasonable division and unassigned node allocation. A merge index is designed to merge communities reasonably. The comparisons about the methods to improve accuracy of community numbers based on the known algorithms are also presented. Experimental results on synthetic networks and real networks show that our strategy outperforms the state-of-art algorithms in stability and effectiveness.
Similar content being viewed by others
Data availability
No datasets were generated or analysed during the current study.
References
Chen J, Yuan B (2006) Detecting functional modules in the yeast protein–protein interaction network. Bioinformatics 22(18):2283–2290
Girvan M, Newman MEJ (2002) Community structure in social and biological networks. Proc Natl Acad Sci 99(12):7821–7826
Dourisboure Y, Geraci F, Pellegrini M (2007) Extraction and classification of dense communities in the web. In: Proceedings of the 16th international conference on World Wide Web, pp 461–470
Li X, Wang Z, Hu R, Zhu Q, Wang L (2019) Recommendation algorithm based on improved spectral clustering and transfer learning. Pattern Anal Appl 22(2):633–647
Coscia M, Giannotti F, Pedreschi D (2011) A classification for community discovery methods in complex networks. Stat Anal Data Min ASA Data Sci J 4(5):512–546
Lyzinski V, Tang M, Athreya A, Park Y, Priebe CE (2016) Community detection and classification in hierarchical stochastic blockmodels. IEEE Trans Netw Sci Eng 4(1):13–26
Hou Chin J, Ratnavelu K (2017) A semi-synchronous label propagation algorithm with constraints for community detection in complex networks. Sci Rep 7(1):45836
Fortunato S (2010) Community detection in graphs. Phys Rep 486(3–5):75–174
Chen Y, Mo D (2022) Community detection for multilayer weighted networks. Inf Sci 595:119–141
Tang Z, Tang Y, Li C, Cao J, Chen G, Lin R (2021) A fast local community detection algorithm in complex networks. World Wide Web 24(6):1929–1955
Palla G, Derényi I, Farkas I, Vicsek T (2005) Uncovering the overlapping community structure of complex networks in nature and society. Nature 435(7043):814–818
Gregory S (2010) Finding overlapping communities in networks by label propagation. New J Phys 12(10):103018
Xie J, Szymanski BK, Liu X (2011) Slpa: uncovering overlapping communities in social networks via a speaker-listener interaction dynamic process. In: IEEE 11th international conference on data mining workshops, pp 344–349
Lancichinetti A, Fortunato S, Kertész J (2009) Detecting the overlapping and hierarchical community structure in complex networks. New J Phys 11(3):033015
Wu Z, Lin Y, Gregory S, Wan H, Tian S (2012) Balanced multi-label propagation for overlapping community detection in social networks. J Comput Sci Technol 27(3):468–479
Ahn YY, Bagrow JP, Lehmann S (2010) Link communities reveal multiscale complexity in networks. Nature 466(7307):761–764
Whang JJ, Gleich DF, Dhillon IS (2016) Overlapping community detection using neighborhood-inflated seed expansion. IEEE Trans Knowl Data Eng 28(5):1272–1284
Jokar E, Mosleh M, Kheyrandish M (2022) Overlapping community detection in complex networks using fuzzy theory, balanced link density, and label propagation. Expert Syst 39(5):e12921
Shen H, Cheng X, Guo J (2009) Quantifying and identifying the overlapping community structure in networks. J Stat Mech Theory Exp 07:P07042
Evans TS (2010) Clique graphs and overlapping communities. J Stat Mech Theory Exp 12:P12037
Luo M, Xu Y (2022) Community detection via network node vector label propagation. Phys A 593:126931
Yang J, Leskovec J (2013), Overlapping community detection at scale: a nonnegative matrix factorization approach. In: Proceedings of the 6th ACM international conference on Web search and data mining, pp 587–596
Coscia M, Rossetti G, Giannotti F, Pedreschi D (2012) Demon: a local-first discovery method for overlapping communities. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining, pp 615–623
Lancichinetti A, Radicchi F, Ramasco JJ, Fortunato S (2011) Finding statistically significant communities in networks. PLoS ONE 6(4):e18961
Baumes J, Goldberg MK, Krishnamoorthy MS, Magdon Ismail M, Preston N (2005) Finding communities by clustering a graph into overlapping subgraphs. In: Proceedings of the IADIS international conference on applied computing, pp 97–104
Yang J, Zhang X (2017) Finding overlapping communities using seed set. Phys A 467:96–106
Yin H, Benson AR, Leskovec J, Gleich DF (2017) Local higher-order graph clustering. In: Proceedings of the 23rd ACM SIGKDD international conference on knowledge discovery and data mining, pp 555–564
Wang X, Liu G, Li J, Nees JP (2017) Locating structural centers: a density-based clustering method for community detection. PLoS ONE 12(1):e0169355
Gao Y, Zhang H, Zhang Y (2019) Overlapping community detection based on conductance optimization in large-scale networks. Phys A 522:69–79
Gao Y, Yu X, Zhang H (2021) Overlapping community detection by constrained personalized PageRank. Expert Syst Appl 173:114682
Basuchowdhuri P, Sikdar S, Nagarajan V, Mishra K, Gupta S, Majumder S (2019) Fast detection of community structures using graph traversal in social networks. Knowl Inf Syst 59(1):1–31
Ding X, Zhang J, Yang J (2020) Node-community membership diversifies community structures: an overlapping community detection algorithm based on local expansion and boundary re-checking. Knowl Based Syst 198:105935
Ding X, Yang H, Zhang J, Yang J, Xiang X (2022) Ceo: identifying overlapping communities via construction, expansion and optimization. Inf Sci 596:93–118
Cheng F, Wang C, Zhang X, Yang Y (2020) A local-neighborhood information based overlapping community detection algorithm for large-scale complex networks. IEEE ACM Trans Netw 29(2):543–556
Jiang H, Liu Z, Liu C, Su Y, Zhang X (2020) Community detection in complex networks with an ambiguous structure using central node based link prediction. Knowl Based Syst 195:105626
Padrol Sureda A, Perarnau Llobet G, Pfeifle J, Muntés Mulero V (2010) Overlapping community search for social networks. In: IEEE 26th international conference on data engineering (ICDE 2010), pp 992–995
Whang JJ, Gleich DF, Dhillon IS (2013) Overlapping community detection using seed set expansion. In: Proceedings of the 22nd ACM international conference on information & knowledge management, pp 2099–2108
Andersen R, Chung F, Lang K (2006) Local graph partitioning using pagerank vectors. In: 47th annual IEEE symposium on foundations of computer science (FOCS’06), pp 475–486
Wang X, Wang Y, Qin X, Li R, Eustace J (2018) Detecting overlapping communities based on vital nodes in complex networks. Chin Phys B 27(10):100504
Shen H, Cheng X, Cai K, Hu MB (2009) Detect overlapping and hierarchical community structure in networks. Phys A 388(8):1706–1712
Su Y, Wang B, Zhang X (2017) A seed-expanding method based on random walks for community detection in networks with ambiguous community structures. Sci Rep 7(1):1–10
Berahmand K, Bouyer A, Samadi N (2019) A new local and multidimensional ranking measure to detect spreaders in social networks. Computing 101(11):1711–1733
Meghanathan N (2017) A computationally lightweight and localized centrality metric in lieu of betweenness centrality for complex network analysis. Vietnam J Comput Sci 4(1):23–38
Meghanathan M (2021) Neighborhood-based bridge node centrality tuple for complex network analysis. Appl Netw Sci 6(1):1–36
Şimşek A (2021) Lexical sorting centrality to distinguish spreading abilities of nodes in complex networks under the susceptible-infectious-recovered (sir) model. J King Saud Univ Comput Inf Sci 34(8):4810–4820
Kloumann IM, Kleinberg JM (2014) Community membership identification from small seed sets. In: Proceedings of the 20th ACM SIGKDD international conference on knowledge discovery and data mining, pp 1366–1375
Fortunato S, Hric D (2016) Community detection in networks: a user guide. Phys Rep 659:1–44
Radicchi F, Castellano C, Cecconi F, Loreto V, Parisi D (2004) Defining and identifying communities in networks. Proc Natl Acad Sci 101(9):2658–2663
Zhou T, Lü L, Zhang YC (2009) Predicting missing links via local information. Eur Phys J B 71(4):623–630
Chakraborty T, Dalmia A, Mukherjee A, Ganguly N (2017) Metrics for community analysis: a survey. ACM Comput Surv: CSUR 50(4):1–37
McDaid AF, Greene D, Hurley N (2011) Normalized mutual information to evaluate overlapping community finding algorithms. arXiv:1110.2515
Leskovec J, Lang KJ, Mahoney M (2010) Empirical comparison of algorithms for network community detection. In: Proceedings of the 19th international conference on World Wide Web, pp 631–640
Lancichinetti A, Fortunato S, Radicchi F (2008) Benchmark graphs for testing community detection algorithms. Phys Rev E 78(4):046110
Lancichinetti A, Fortunato S (2009) Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. Phys Rev E 80(1):016118
Zachary WW (1977) An information flow model for conflict and fission in small groups. J Anthropol Res 33(4):452–473
Lusseau D, Schneider K, Boisseau OJ, Haase P, Slooten E, Dawson SM (2003) The bottlenose dolphin community of doubtful sound features a large proportion of long-lasting associations. Behav Ecol Sociobiol 54(4):396–405
Kunegis J (2013) KONECT—the Koblenz network collection. In: Proceedings of the international conference on World Wide Web companion, pp 1343–1350
Guimera R, Danon L, Diaz Guilera A, Giralt F, Arenas A (2003) Self-similar community structure in a network of human interactions. Phys Rev E 68(6):065103
Boguná M, Pastor Satorras R, Díaz Guilera A, Arenas A (2004) Models of social networks based on social distance attachment. Phys Rev E 70(5):056122
Acknowledgements
The authors would like to express their sincere gratitude to all reviewers for valuable suggestions, which are helpful in improving and clarifying the original manuscript. We thank the National Institute of Education, Nanyang Technological University, where part of this research was performed. This work was partly supported by the National Natural Science Foundation of China (Nos. 61977016 and 61572010), Natural Science Foundation of Fujian Province (Nos. 2020J01164, 2017J01738). This work was also partly supported by Fujian Alliance of Mathematics (No. 2023SXLMMS04) and China Scholarship Council (CSC No. 202108350054).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Chen, G., Zhou, S. A novel overlapping community detection strategy based on Core-Bridge seeds. Int. J. Mach. Learn. & Cyber. 15, 2131–2147 (2024). https://doi.org/10.1007/s13042-023-02020-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13042-023-02020-3