Abstract
The scale of communities in real-world networks is often imbalanced, which has a significant impact on community detection performance. Existing approaches exhibit a trade-off between accuracy and computational cost, with global methods offering higher accuracy but requiring intensive computations, and local methods accelerating the detection at the expense of accuracy. Despite these challenges, few works concentrate on how to effectively handle community detection with imbalanced community scales. To address this gap, first, a hybrid method that combines global and local information in the network is proposed to identify core nodes. This involves incorporating hierarchical structural information used to measure the global influence of the node, together with the effective local boundaries ensuring even distribution of core nodes in the network, to alleviate the impact of community scale imbalance. Second, we propose a two-phase expansion strategy to handle the imbalance scale of communities and prevent over-expansion of a single structure. In the first phase of the strategy, the belonging function is proposed to better measure the strength of connections between the current node and the other nodes for local community expansion. In the second phase of the strategy, we present a weighted label propagation method to efficiently expand the unlabeled boundary nodes and the nodes with overlapping attributes. Extensive experiments were conducted over twenty networks in comparison with eight state-of-the-art baseline methods, demonstrating that CONTEX is very competitive to these methods in achieving higher accuracy of community detection, while maintaining a relevantly lower computational time.
Similar content being viewed by others
Data Availability and Access
The datasets are publicly available online through their reference sources in the manuscript.
References
Dey A, Kumar BR, Das B, Ghoshal AK (2023) Outlier detection in social networks leveraging community structure. Inform Sci 634:578–586. https://doi.org/10.1016/j.ins.2023.03.120
Calero Valdez A, Schaar AK, Ziefle M, Holzinger A, Jeschke S, Brecher C (2012) Using mixed node publication network graphs for analyzing success in interdisciplinary teams. In: Active Media Technology: 8th International Conference, AMT 2012, Macau, China, December 4-7, 2012. Proceedings 8, Springer, pp 606–617. https://doi.org/10.1007/978-3-642-35236-2_61
Li Z, Liu A, Miao J, Yang Y (2023) A three-phase method for spherical fuzzy environment and application to community epidemic prevention management. Expert Syst Appl 211:118601. https://doi.org/10.1016/j.eswa.2022.118601
Choudhary C, Singh I, Kumar M (2023) Community detection algorithms for recommendation systems: techniques and metrics. Computing 105(2):417–453. https://doi.org/10.1007/s00607-022-01131-z
Li C, Guo X, Lin W, Tang Z, Cao J, Zhang Y (2023) Multiplex network community detection algorithm based on motif awareness. Knowl-Based Syst 260:110136. https://doi.org/10.1016/j.knosys.2022.110136
Ding X, Zhang J, Yang J (2018) A robust two-stage algorithm for local community detection. Knowl-Based Syst 152:188–199. https://doi.org/10.1016/j.knosys.2018.04.018
El Kouni I.B, Karoui W, Romdhane LB (2020) Node importance based label propagation algorithm for overlapping community detection in networks. Expert Syst Appl 162:113020. https://doi.org/10.1016/j.eswa.2019.113020
Bouyer A, Roghani H (2020) Lsmd: A fast and robust local community detection starting from low degree nodes in social networks. Future Generation Computer Syst 113:41–57. https://doi.org/10.1016/j.future.2020.07.011
Roghani H, Bouyer A (2022) A fast local balanced label diffusion algorithm for community detection in social networks. IEEE Trans Knowl Data Eng. https://doi.org/10.1109/TKDE.2022.3162161
You X, Ma Y, Liu Z (2020) A three-stage algorithm on community detection in social networks. Knowl-Based Syst 187:104822. https://doi.org/10.1016/j.knosys.2019.06.030
Aghaalizadeh S, Afshord ST, Bouyer A, Anari B (2021) A three-stage algorithm for local community detection based on the high node importance ranking in social networks. Physica A: Statistical Mechanics Appl 563:125420. https://doi.org/10.1016/j.physa.2020.125420
Berahmand K, Bouyer A, Vasighi M (2018) Community detection in complex networks by detecting and expanding core nodes through extended local similarity of nodes. IEEE Trans Computational Social Syst 5(4):1021–1033. https://doi.org/10.1109/TCSS.2018.2879494
Zhang W, Shang R, Jiao L (2023) Large-scale community detection based on core node and layer-by-layer label propagation. InformSci 632:1–18. https://doi.org/10.1016/j.ins.2023.02.090
Gao Y, Zhang H, Yu X (2022) Higher-order community detection: On information degeneration and its elimination. IEEE/ACM Trans Netw. https://doi.org/10.1109/TNET.2022.3201668
Boroujeni R.J, Soleimani S (2022) The role of influential nodes and their influence domain in community detection: An approximate method for maximizing modularity. Expert Syst Appl 202:117452. https://doi.org/10.1016/j.eswa.2022.117452
Yan Y, Chen W (2023) New components and combinations: The perspective of the internal collaboration networks of scientific teams. J Informet 17(2). https://doi.org/10.1016/j.joi.2023.101407
Arenas A, Danon L, Diaz-Guilera A, Gleiser PM, Guimera R (2004) Community analysis in social networks. European Phys J B 38:373–380. https://doi.org/10.1140/epjb/e2004-00130-1
Goekoop R et al (2023) Hierarchical network structure as the source of hierarchical dynamics (power-law frequency spectra) in living and non-living systems: how state-trait continua (body plans, personalities) emerge from first principles in biophysics. Neuroscience & Biobehavioral Rev 105402. https://doi.org/10.1016/j.neubiorev.2023.105402
Zhao Z, Li D, Sun Y, Zhang R, Liu J (2023) Ranking influential spreaders based on both node k-shell and structural hole. Knowl-Based Syst 260:110163. https://doi.org/10.1016/j.knosys.2022.110163
Blondel VD, Guillaume J-L (2008) Lambiotte R, Lefebvre E: Fast unfolding of communities in large networks. J Statistical Mechanics: Theory Exper 2008(10):10008. https://doi.org/10.1088/1742-5468/2008/10/P10008
Traag V.A, Waltman L, Van Eck NJ (2019) From louvain to leiden: guaranteeing well-connected communities. Scientific Reports 9(1):5233. https://doi.org/10.1038/s41598-019-41695-z
Guo K, Huang X, Wu L, Chen Y (2022) Local community detection algorithm based on local modularity density. Appl Intell 52(2):1238–1253. https://doi.org/10.1007/s10489-020-02052-0
Wang P-W, Kolter JZ (2020) Community detection using fast low-cardinality semidefinite programming. In: Advances in neural information processing systems 33:3374–3385. https://doi.org/10.48550/arXiv.2012.02676
Kang Y, Lee J-S, Shin W-Y, Kim S-W (2022) Community reinforcement: An effective and efficient preprocessing method for accurate community detection. Knowl-Based Syst 237:107741. https://doi.org/10.1016/j.knosys.2021.107741
Yuan C, Rong C, Yao Q (2020) Boundary-connection deletion strategy based method for community detection in complex networks. Appl Intell 50(11):3570–3589. https://doi.org/10.1007/s10489-020-01762-9
Raghavan UN (2007) Albert R, Kumara S: Near linear time algorithm to detect community structures in large-scale networks. Phys Rev E 76(3). https://doi.org/10.1103/PhysRevE.76.036106
Li C, Chen H, Li T, Yang X (2022) A stable community detection approach for complex network based on density peak clustering and label propagation. Appl Intell 52(2):1188–1208. https://doi.org/10.1007/s10489-021-02287-5
Laassem B, Idarrou A, Boujlaleb L et al (2022) Label propagation algorithm for community detection based on coulomb’s law. Physica A: Statistical Mechanics and its Applications 593:126881. https://doi.org/10.1016/j.physa.2022.126881
Fang W, Wang X, Liu L, Wu Z, Tang S, Zheng Z (2022) Community detection through vector-label propagation algorithms. Chaos, Solitons & Fractals 158:112066. https://doi.org/10.1016/j.chaos.2022.112066
Ma T, Liu Q, Cao J, Tian Y, Al-Dhelaan A, Al-Rodhaan M (2020) Lgiem: Global and local node influence based community detection. Future Generation Comput Syst 105:533–546. https://doi.org/10.1016/j.future.2019.12.022
Niu Y, Kong D, Liu L, Wen R, Xiao J (2023) Overlapping community detection with adaptive density peaks clustering and iterative partition strategy. Expert Syst Appl 213:119213. https://doi.org/10.1016/j.eswa.2022.119213
Ni L, Ge J, Zhang Y, Luo W, Sheng VS (2023) Semi-supervised local community detection. IEEE Trans Knowl Data Eng. https://doi.org/10.1109/TKDE.2023.3290095
Rosvall M, Axelsson D, Bergstrom CT (2009) The map equation. European Phys J Special Topics 178(1):13–23. https://doi.org/10.1140/epjst/e2010-01179-1
Ma Z, Nandy S (2023) Community detection with contextual multilayer networks. IEEE Transactions on Information Theory 69(5):3203–3239. https://doi.org/10.1109/TIT.2023.3238352
Jiang H, Liu Z, Liu C, Su Y, Zhang X (2020) Community detection in complex networks with an ambiguous structure using central node based link prediction. Knowl-Based Syst 195:105626. https://doi.org/10.1016/j.knosys.2020.105626
Zhou J, Chen Z, Du M, Chen L, Yu S, Chen G, Xuan Q (2021) Robustecd: Enhancement of network structure for robust community detection. IEEE Trans Knowl Data Eng 35(1):842–856. https://doi.org/10.1109/TKDE.2021.3088844
Yang X-H, Ma G-F, Zeng X-Y, Pang Y, Zhou Y, Zhang Y-D, Ye L (2023) Community detection based on markov similarity enhancement. IEEE Trans Circuits Syst II: Express Briefs. https://doi.org/10.1109/TCSII.2023.3275153
Yin T, Chen H, Yuan Z, Wan J, Liu K, Horng S-J (2023) Li T A robust multilabel feature selection approach based on graph structure considering fuzzy dependency and feature interaction. IEEE Trans Fuzzy Syst. https://doi.org/10.1109/TFUZZ.2023.3287193
Shang R, Zhao K, Zhang W, Feng J, Li Y, Jiao L (2022) Evolutionary multiobjective overlapping community detection based on similarity matrix and node correction. Appl Soft Comput 127:109397. https://doi.org/10.1016/j.asoc.2022.109397
Fei R, Wan Y, Hu B, Li A, Li Q (2023) A novel network core structure extraction algorithm utilized variational autoencoder for community detection. Expert Syst Appl 222:119775. https://doi.org/10.1016/j.eswa.2023.119775
Zhang Z, Li X, Gan C (2021) Identifying influential nodes in social networks via community structure and influence distribution difference. Digital Commun Netw 7(1):131–139. https://doi.org/10.1016/j.dcan.2020.04.011
Haque MN, Mathieson L, Moscato P (2017) A memetic algorithm for community detection by maximising the connected cohesion. In: 2017 IEEE Symposium Series on computational intelligence (SSCI), IEEE, pp 1–8. https://doi.org/10.1109/SSCI.2017.8285404
Pan R, Wang Y, Sun J, Liu H, Zhao Y, Xia J, Chen W (2023) Simplifying social networks via triangle-based cohesive subgraphs. Visual Inform 7(4):84–94. https://doi.org/10.1016/j.visinf.2023.07.003
Shang R, Zhang W, Zhang J, Jiao L, Li Y, Stolkin R (2022) Local community detection algorithm based on alternating strategy of strong fusion and weak fusion. IEEE Trans Cybernet 53(2):818–831. https://doi.org/10.1109/TCYB.2022.3159584
Shao Z, Ma L, Lin Q, Li J, Gong M, Nandi AK (2022) Pmcdm: Privacy-preserving multiresolution community detection in multiplex networks. Knowl-Based Syst 244:108542. https://doi.org/10.1016/j.knosys.2022.108542
Paul A, Dutta A (2022) Community detection using local group assimilation. Expert Syst Appl 206:117794. https://doi.org/10.1016/j.eswa.2022.117794
Kar AK, Mishra AC, Mohanty SK (2023) An efficient entropy based dissimilarity measure to cluster categorical data. Eng Appl Artif Intell 119:105795. https://doi.org/10.1016/j.engappai.2022.105795
Xia C, Luo Y, Wang L, Li H-J (2021) A fast community detection algorithm based on reconstructing signed networks. IEEE Syst J 16(1):614–625. https://doi.org/10.1109/JSYST.2021.3065378
Author information
Authors and Affiliations
Contributions
All authors contributed to the study conception and design. Shiliang Liu: Writing original draft preparation, Conceptualization, Software, Validation. Xinyao Zhang: Investigation, Software, Validation. Yinglong Ma: Conceptualization, Methodology, Writing-Reviewing and Editing.
Corresponding author
Ethics declarations
Completing Interests
The authors have no relevant financial or non-financial interests to disclose.
Ethical standard
This work described in this manuscript is original and has not been under consideration for publication elsewhere. All authors read and approved the final manuscript. The research in this manuscript does not involve human participants and animals.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Liu, S., Zhang, X. & Ma, Y. A hybrid information-based two-phase expansion algorithm for community detection with imbalanced scales. Appl Intell 54, 4814–4833 (2024). https://doi.org/10.1007/s10489-024-05424-y
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-024-05424-y