Skip to main content
Log in

A hybrid information-based two-phase expansion algorithm for community detection with imbalanced scales

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

The scale of communities in real-world networks is often imbalanced, which has a significant impact on community detection performance. Existing approaches exhibit a trade-off between accuracy and computational cost, with global methods offering higher accuracy but requiring intensive computations, and local methods accelerating the detection at the expense of accuracy. Despite these challenges, few works concentrate on how to effectively handle community detection with imbalanced community scales. To address this gap, first, a hybrid method that combines global and local information in the network is proposed to identify core nodes. This involves incorporating hierarchical structural information used to measure the global influence of the node, together with the effective local boundaries ensuring even distribution of core nodes in the network, to alleviate the impact of community scale imbalance. Second, we propose a two-phase expansion strategy to handle the imbalance scale of communities and prevent over-expansion of a single structure. In the first phase of the strategy, the belonging function is proposed to better measure the strength of connections between the current node and the other nodes for local community expansion. In the second phase of the strategy, we present a weighted label propagation method to efficiently expand the unlabeled boundary nodes and the nodes with overlapping attributes. Extensive experiments were conducted over twenty networks in comparison with eight state-of-the-art baseline methods, demonstrating that CONTEX is very competitive to these methods in achieving higher accuracy of community detection, while maintaining a relevantly lower computational time.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Algorithm 1
Algorithm 2
Algorithm 3
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

Data Availability and Access

The datasets are publicly available online through their reference sources in the manuscript.

References

  1. Dey A, Kumar BR, Das B, Ghoshal AK (2023) Outlier detection in social networks leveraging community structure. Inform Sci 634:578–586. https://doi.org/10.1016/j.ins.2023.03.120

  2. Calero Valdez A, Schaar AK, Ziefle M, Holzinger A, Jeschke S, Brecher C (2012) Using mixed node publication network graphs for analyzing success in interdisciplinary teams. In: Active Media Technology: 8th International Conference, AMT 2012, Macau, China, December 4-7, 2012. Proceedings 8, Springer, pp 606–617. https://doi.org/10.1007/978-3-642-35236-2_61

  3. Li Z, Liu A, Miao J, Yang Y (2023) A three-phase method for spherical fuzzy environment and application to community epidemic prevention management. Expert Syst Appl 211:118601. https://doi.org/10.1016/j.eswa.2022.118601

  4. Choudhary C, Singh I, Kumar M (2023) Community detection algorithms for recommendation systems: techniques and metrics. Computing 105(2):417–453. https://doi.org/10.1007/s00607-022-01131-z

    Article  Google Scholar 

  5. Li C, Guo X, Lin W, Tang Z, Cao J, Zhang Y (2023) Multiplex network community detection algorithm based on motif awareness. Knowl-Based Syst 260:110136. https://doi.org/10.1016/j.knosys.2022.110136

  6. Ding X, Zhang J, Yang J (2018) A robust two-stage algorithm for local community detection. Knowl-Based Syst 152:188–199. https://doi.org/10.1016/j.knosys.2018.04.018

  7. El Kouni I.B, Karoui W, Romdhane LB (2020) Node importance based label propagation algorithm for overlapping community detection in networks. Expert Syst Appl 162:113020. https://doi.org/10.1016/j.eswa.2019.113020

  8. Bouyer A, Roghani H (2020) Lsmd: A fast and robust local community detection starting from low degree nodes in social networks. Future Generation Computer Syst 113:41–57. https://doi.org/10.1016/j.future.2020.07.011

  9. Roghani H, Bouyer A (2022) A fast local balanced label diffusion algorithm for community detection in social networks. IEEE Trans Knowl Data Eng. https://doi.org/10.1109/TKDE.2022.3162161

    Article  Google Scholar 

  10. You X, Ma Y, Liu Z (2020) A three-stage algorithm on community detection in social networks. Knowl-Based Syst 187:104822. https://doi.org/10.1016/j.knosys.2019.06.030

  11. Aghaalizadeh S, Afshord ST, Bouyer A, Anari B (2021) A three-stage algorithm for local community detection based on the high node importance ranking in social networks. Physica A: Statistical Mechanics Appl 563:125420. https://doi.org/10.1016/j.physa.2020.125420

  12. Berahmand K, Bouyer A, Vasighi M (2018) Community detection in complex networks by detecting and expanding core nodes through extended local similarity of nodes. IEEE Trans Computational Social Syst 5(4):1021–1033. https://doi.org/10.1109/TCSS.2018.2879494

    Article  Google Scholar 

  13. Zhang W, Shang R, Jiao L (2023) Large-scale community detection based on core node and layer-by-layer label propagation. InformSci 632:1–18. https://doi.org/10.1016/j.ins.2023.02.090

  14. Gao Y, Zhang H, Yu X (2022) Higher-order community detection: On information degeneration and its elimination. IEEE/ACM Trans Netw. https://doi.org/10.1109/TNET.2022.3201668

    Article  Google Scholar 

  15. Boroujeni R.J, Soleimani S (2022) The role of influential nodes and their influence domain in community detection: An approximate method for maximizing modularity. Expert Syst Appl 202:117452. https://doi.org/10.1016/j.eswa.2022.117452

  16. Yan Y, Chen W (2023) New components and combinations: The perspective of the internal collaboration networks of scientific teams. J Informet 17(2). https://doi.org/10.1016/j.joi.2023.101407

  17. Arenas A, Danon L, Diaz-Guilera A, Gleiser PM, Guimera R (2004) Community analysis in social networks. European Phys J B 38:373–380. https://doi.org/10.1140/epjb/e2004-00130-1

  18. Goekoop R et al (2023) Hierarchical network structure as the source of hierarchical dynamics (power-law frequency spectra) in living and non-living systems: how state-trait continua (body plans, personalities) emerge from first principles in biophysics. Neuroscience & Biobehavioral Rev 105402. https://doi.org/10.1016/j.neubiorev.2023.105402

  19. Zhao Z, Li D, Sun Y, Zhang R, Liu J (2023) Ranking influential spreaders based on both node k-shell and structural hole. Knowl-Based Syst 260:110163. https://doi.org/10.1016/j.knosys.2022.110163

  20. Blondel VD, Guillaume J-L (2008) Lambiotte R, Lefebvre E: Fast unfolding of communities in large networks. J Statistical Mechanics: Theory Exper 2008(10):10008. https://doi.org/10.1088/1742-5468/2008/10/P10008

  21. Traag V.A, Waltman L, Van Eck NJ (2019) From louvain to leiden: guaranteeing well-connected communities. Scientific Reports 9(1):5233. https://doi.org/10.1038/s41598-019-41695-z

  22. Guo K, Huang X, Wu L, Chen Y (2022) Local community detection algorithm based on local modularity density. Appl Intell 52(2):1238–1253. https://doi.org/10.1007/s10489-020-02052-0

    Article  Google Scholar 

  23. Wang P-W, Kolter JZ (2020) Community detection using fast low-cardinality semidefinite programming. In: Advances in neural information processing systems 33:3374–3385. https://doi.org/10.48550/arXiv.2012.02676

  24. Kang Y, Lee J-S, Shin W-Y, Kim S-W (2022) Community reinforcement: An effective and efficient preprocessing method for accurate community detection. Knowl-Based Syst 237:107741. https://doi.org/10.1016/j.knosys.2021.107741

  25. Yuan C, Rong C, Yao Q (2020) Boundary-connection deletion strategy based method for community detection in complex networks. Appl Intell 50(11):3570–3589. https://doi.org/10.1007/s10489-020-01762-9

    Article  Google Scholar 

  26. Raghavan UN (2007) Albert R, Kumara S: Near linear time algorithm to detect community structures in large-scale networks. Phys Rev E 76(3). https://doi.org/10.1103/PhysRevE.76.036106

  27. Li C, Chen H, Li T, Yang X (2022) A stable community detection approach for complex network based on density peak clustering and label propagation. Appl Intell 52(2):1188–1208. https://doi.org/10.1007/s10489-021-02287-5

    Article  Google Scholar 

  28. Laassem B, Idarrou A, Boujlaleb L et al (2022) Label propagation algorithm for community detection based on coulomb’s law. Physica A: Statistical Mechanics and its Applications 593:126881. https://doi.org/10.1016/j.physa.2022.126881

  29. Fang W, Wang X, Liu L, Wu Z, Tang S, Zheng Z (2022) Community detection through vector-label propagation algorithms. Chaos, Solitons & Fractals 158:112066. https://doi.org/10.1016/j.chaos.2022.112066

  30. Ma T, Liu Q, Cao J, Tian Y, Al-Dhelaan A, Al-Rodhaan M (2020) Lgiem: Global and local node influence based community detection. Future Generation Comput Syst 105:533–546. https://doi.org/10.1016/j.future.2019.12.022

  31. Niu Y, Kong D, Liu L, Wen R, Xiao J (2023) Overlapping community detection with adaptive density peaks clustering and iterative partition strategy. Expert Syst Appl 213:119213. https://doi.org/10.1016/j.eswa.2022.119213

  32. Ni L, Ge J, Zhang Y, Luo W, Sheng VS (2023) Semi-supervised local community detection. IEEE Trans Knowl Data Eng. https://doi.org/10.1109/TKDE.2023.3290095

  33. Rosvall M, Axelsson D, Bergstrom CT (2009) The map equation. European Phys J Special Topics 178(1):13–23. https://doi.org/10.1140/epjst/e2010-01179-1

  34. Ma Z, Nandy S (2023) Community detection with contextual multilayer networks. IEEE Transactions on Information Theory 69(5):3203–3239. https://doi.org/10.1109/TIT.2023.3238352

    Article  MathSciNet  Google Scholar 

  35. Jiang H, Liu Z, Liu C, Su Y, Zhang X (2020) Community detection in complex networks with an ambiguous structure using central node based link prediction. Knowl-Based Syst 195:105626. https://doi.org/10.1016/j.knosys.2020.105626

  36. Zhou J, Chen Z, Du M, Chen L, Yu S, Chen G, Xuan Q (2021) Robustecd: Enhancement of network structure for robust community detection. IEEE Trans Knowl Data Eng 35(1):842–856. https://doi.org/10.1109/TKDE.2021.3088844

  37. Yang X-H, Ma G-F, Zeng X-Y, Pang Y, Zhou Y, Zhang Y-D, Ye L (2023) Community detection based on markov similarity enhancement. IEEE Trans Circuits Syst II: Express Briefs. https://doi.org/10.1109/TCSII.2023.3275153

    Article  Google Scholar 

  38. Yin T, Chen H, Yuan Z, Wan J, Liu K, Horng S-J (2023) Li T A robust multilabel feature selection approach based on graph structure considering fuzzy dependency and feature interaction. IEEE Trans Fuzzy Syst. https://doi.org/10.1109/TFUZZ.2023.3287193

  39. Shang R, Zhao K, Zhang W, Feng J, Li Y, Jiao L (2022) Evolutionary multiobjective overlapping community detection based on similarity matrix and node correction. Appl Soft Comput 127:109397. https://doi.org/10.1016/j.asoc.2022.109397

  40. Fei R, Wan Y, Hu B, Li A, Li Q (2023) A novel network core structure extraction algorithm utilized variational autoencoder for community detection. Expert Syst Appl 222:119775. https://doi.org/10.1016/j.eswa.2023.119775

  41. Zhang Z, Li X, Gan C (2021) Identifying influential nodes in social networks via community structure and influence distribution difference. Digital Commun Netw 7(1):131–139. https://doi.org/10.1016/j.dcan.2020.04.011

    Article  Google Scholar 

  42. Haque MN, Mathieson L, Moscato P (2017) A memetic algorithm for community detection by maximising the connected cohesion. In: 2017 IEEE Symposium Series on computational intelligence (SSCI), IEEE, pp 1–8. https://doi.org/10.1109/SSCI.2017.8285404

  43. Pan R, Wang Y, Sun J, Liu H, Zhao Y, Xia J, Chen W (2023) Simplifying social networks via triangle-based cohesive subgraphs. Visual Inform 7(4):84–94. https://doi.org/10.1016/j.visinf.2023.07.003

    Article  Google Scholar 

  44. Shang R, Zhang W, Zhang J, Jiao L, Li Y, Stolkin R (2022) Local community detection algorithm based on alternating strategy of strong fusion and weak fusion. IEEE Trans Cybernet 53(2):818–831. https://doi.org/10.1109/TCYB.2022.3159584

    Article  Google Scholar 

  45. Shao Z, Ma L, Lin Q, Li J, Gong M, Nandi AK (2022) Pmcdm: Privacy-preserving multiresolution community detection in multiplex networks. Knowl-Based Syst 244:108542. https://doi.org/10.1016/j.knosys.2022.108542

  46. Paul A, Dutta A (2022) Community detection using local group assimilation. Expert Syst Appl 206:117794. https://doi.org/10.1016/j.eswa.2022.117794

  47. Kar AK, Mishra AC, Mohanty SK (2023) An efficient entropy based dissimilarity measure to cluster categorical data. Eng Appl Artif Intell 119:105795. https://doi.org/10.1016/j.engappai.2022.105795

  48. Xia C, Luo Y, Wang L, Li H-J (2021) A fast community detection algorithm based on reconstructing signed networks. IEEE Syst J 16(1):614–625. https://doi.org/10.1109/JSYST.2021.3065378

Download references

Author information

Authors and Affiliations

Authors

Contributions

All authors contributed to the study conception and design. Shiliang Liu: Writing original draft preparation, Conceptualization, Software, Validation. Xinyao Zhang: Investigation, Software, Validation. Yinglong Ma: Conceptualization, Methodology, Writing-Reviewing and Editing.

Corresponding author

Correspondence to Yinglong Ma.

Ethics declarations

Completing Interests

The authors have no relevant financial or non-financial interests to disclose.

Ethical standard

This work described in this manuscript is original and has not been under consideration for publication elsewhere. All authors read and approved the final manuscript. The research in this manuscript does not involve human participants and animals.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, S., Zhang, X. & Ma, Y. A hybrid information-based two-phase expansion algorithm for community detection with imbalanced scales. Appl Intell 54, 4814–4833 (2024). https://doi.org/10.1007/s10489-024-05424-y

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-024-05424-y

Keywords

Navigation