Skip to main content
Log in

An improved limited random walk approach for identification of overlapping communities in complex networks

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Detection of community structures in complex networks provides an effective tool for studying the relationships between nodes and revealing the hidden structures. In many real applications, the communities overlap, which means that the nodes can belong to different communities. Recently, random walk methods have been successfully applied for identification of overlapping communities in social networks. However, most of the existing random-walk-based methods require information on the entire network, which is difficult to obtain. Moreover, most networks have gained very large scales with the rapid development of Internet technologies, and it is impractical to scale the existing works for online social networks. Another issue concerning the existing methods is the need for information on the number of communities before the algorithm begins, which is impossible to meet for most real-world networks. To resolve the above issues, a random walk method is proposed in this paper for detection of overlapping community structures in complex networks. The proposed method employs the Markov transition matrix to calculate the transferability of the agent from one node to the others. These probabilities are then used as an attribute vector and feature set for each node, and the feature sets are used for identification of initial communities. Nodes that are available in the feature sets of more than one community, or exhibit high ratios of neighborhood with the other communities are identified as overlapping nodes. The proposed method is examined on various real-world and synthetic datasets. The results reported in terms of various evaluation metrics demonstrate its high efficiency as compared to the existing works.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  1. Eliacik AB, Erdogan N (2018) Influential user weighted sentiment analysis on topic based microblogging community. Expert Syst Appl 92:403–418

    Google Scholar 

  2. Rezaeimehr F, Moradi P, Ahmadian S, Qader NN, Jalili M (2018) TCARS: time- and community-aware recommendation system. Futur Gener Comput Syst 78:419–429

    Google Scholar 

  3. Rad AA, Hasler M, Moradi P, Automatic skill acquisition in Reinforcement Learning using connection graph stability centrality, in: Proceedings of 2010 IEEE International Symposium on Circuits and Systems, 2010, pp. 697–700

  4. Moradi P, Rad AA, Khadivi A, Hasler M (2012) Automatic skill acquisition in reinforcement learning using graph centrality measures. Intelligent Data Analysis 16:113–135

    Google Scholar 

  5. Jain L, Katarya R (2019) Discover opinion leader in online social network using firefly algorithm. Expert Syst Appl 122:1–15

    Google Scholar 

  6. Huang H, Shen H, Meng Z, Chang H, He H (2019) Community-based influence maximization for viral marketing. Appl Intell 49(6):2137–2150

    Google Scholar 

  7. Zare H, Nikooie Pour MA, Moradi P (2019) Enhanced recommender system using predictive network approach. Physica A: Statistical Mechanics and its Applications 520:322–337

    Google Scholar 

  8. Airoldi EM, Blei DM, Fienberg SE, Xing EP (2008) Mixed membership stochastic Blockmodels. J Mach Learn Res 9:1981–2014

    MATH  Google Scholar 

  9. Hajiabadi M, Zare H, Bobarshad H (2017) IEDC: an integrated approach for overlapping and non-overlapping community detection. Knowl-Based Syst 123:188–199

    Google Scholar 

  10. Lancichinetti A, Radicchi F, Ramasco JJ, Fortunato S (2011) Finding statistically significant communities in networks. PLoS One 6(4):e18961–e18961

    Google Scholar 

  11. Fan H, Zhong Y, Zeng G (2018) Overlapping community detection based on discrete biogeography optimization. Appl Intell 48(5):1314–1326

    Google Scholar 

  12. Tabarzad MA, Hamzeh A (2017) A heuristic local community detection method (HLCD). Appl Intell 46(1):62–78

    Google Scholar 

  13. Gao Y, Zhang H, Zhang Y (2019) Overlapping community detection based on conductance optimization in large-scale networks. Physica A: Statistical Mechanics and its Applications 522:69–79

    MathSciNet  Google Scholar 

  14. Le B-D, Shen H, Nguyen H, Falkner N (2019) Improved network community detection using meta-heuristic based label propagation. Appl Intell 49(4):1451–1466

    Google Scholar 

  15. Palla G, Derényi I, Farkas I, Vicsek T (2005) Uncovering the overlapping community structure of complex networks in nature and society. Nature 435:814–818

    Google Scholar 

  16. T. Nepusz, A. Petróczi, L. Négyessy, F. Bazsó, Fuzzy communities and the concept of bridgeness in complex networks, Physical review. E, Statistical, nonlinear, and soft matter physics, 77 1 Pt 2 (2008) 016107

  17. Psorakis I, Roberts S, Ebden M, Sheldon B (2011) Overlapping community detection using Bayesian non-negative matrix factorization. Phys Rev E 83(6):066114

    Google Scholar 

  18. J. Yang, J. Leskovec, Overlapping community detection at scale: a nonnegative matrix factorization approach, in: proceedings proceedings of the sixth ACM international conference on web search and data mining, Rome, Italy, 2013, pp. 587–596

  19. Sun PG, Gao L, Shan Han S (2011) Identification of overlapping and non-overlapping community structure by fuzzy clustering in complex networks. Inform Sci 181(6):1060–1071

    MATH  Google Scholar 

  20. Messaoudi I, Kamel N (2019) A multi-objective bat algorithm for community detection on dynamic social networks. Appl Intell 49(6):2119–2136

    Google Scholar 

  21. Zare H, Hajiabadi M, Jalili M (2019) Detection of community structures in networks with nodal features based on generative probabilistic approach. IEEE Trans Knowl Data Eng. https://doi.org/10.1109/TKDE.2019.2960222

  22. T. Bühler, S.S. Rangapuram, S. Setzer, M. Hein, Constrained fractional set programs and their application in local clustering and community detection, in: Proceedings Proceedings of the 30th International Conference on Machine Learning, Proceedings of Machine Learning Research, 2013, pp. 624--632

  23. Su Y, Wang B, Zhang X, A seed-expanding method based on random walks for community detection in networks with ambiguous community structures, in: Scientific reports, 2017, Volume 7

  24. Deng X, Li G, Dong M, Ota K (2017) Finding overlapping communities based on Markov chain and link clustering. Peer-to-Peer Networking and Applications 10(2):411–420

    Google Scholar 

  25. Macropol K, Can T, Singh A (2009) RRW: repeated random walks on genome-scale protein networks for local cluster discovery. BMC bioinformatics 10:283

    Google Scholar 

  26. Wang W, Liu D, Liu X, Pan L (2013) Fuzzy overlapping community detection based on local random walk and multidimensional scaling. Physica A: Statistical Mechanics and its Applications 392(24):6578–6586

    Google Scholar 

  27. Wen X, Chen W-N, Lin Y, Gu T, Zhang H, Li Y, Yin Y, Zhang J (2017) A maximal clique based multiobjective evolutionary algorithm for overlapping community detection. IEEE Trans Evol Comput 21(3):363–377

    Google Scholar 

  28. Lu X, Kuzmin K, Chen M, Szymanski BK (2018) Adaptive modularity maximization via edge weighting scheme. Inform Sci 424:55–68

    MathSciNet  Google Scholar 

  29. Wu W, Kwong S, Zhou Y, Jia Y, Gao W (2018) Nonnegative matrix factorization with mixed hypergraph regularization for community detection. Inform Sci 435:263–281

    MathSciNet  MATH  Google Scholar 

  30. Mohammadi M, Moradi P, Jalili M (2019) SCE: subspace-based core expansion method for community detection in complex networks. Physica A: Statistical Mechanics and its Applications 527:121084

    Google Scholar 

  31. Whang JJ, Gleich DF, Dhillon IS (2016) Overlapping community detection using neighborhood-inflated seed expansion. IEEE Trans Knowl Data Eng 28(5):1272–1284

    Google Scholar 

  32. Dongen SV (2000) Graph clustering by flow simulation, Utrecht University Repository

  33. Pons P, Latapy M (2005) Computing Communities in Large Networks Using Random Walks, in: p. Yolum, T. Güngör, F. Gürgen, C. Özturan (Eds.) Computer and Information Sciences - ISCIS 2005, Springer Berlin Heidelberg, Berlin, Heidelberg, pp. 284–293

  34. Zhou H, Lipowsky R (2004) Network Brownian motion: a new method to measure vertex-vertex proximity and to identify communities and subcommunities, in: M. Bubak, G.D. van Albada, P.M.A. Sloot, J. Dongarra (Eds.) Computational science - ICCS 2004, Springer Berlin Heidelberg, Berlin, Heidelberg, pp. 1062–1069

  35. Brin S, Page L (1998) The anatomy of a large-scale Hypertextual web search engine, in: proceedings seventh international world-wide web conference (WWW 1998). Brisbane, Australia, pp 107–117

    Google Scholar 

  36. Cai B, Wang H, Zheng H, Wang H, An improved random walk based clustering algorithm for community detection in complex networks, in: 2011 IEEE International Conference on Systems, Man, and Cybernetics, 2011, pp. 2162–2167

  37. Rosvall M, Bergstrom CT (2008) Maps of random walks on complex networks reveal community structure. Proc Natl Acad Sci 105(4):1118–1123

    Google Scholar 

  38. Chen Q, Wu T-T, Fang M (2013) Detecting local community structures in complex networks based on local degree central nodes. Physica A: Statistical Mechanics and its Applications 392(3):529–537

    Google Scholar 

  39. Yu Z, Chen J, Quo K, Chen Y, Xu Q (2017) Overlapping community detection based on random walk and seeds extension, in: proceedings proceedings of the 12th Chinese conference on computer supported cooperative work and social computing. Chongqing, China, pp 18–24

    Google Scholar 

  40. de Guzzi Bagnato G, Ronqui JRF, Travieso G (2018) Community detection in networks using self-avoiding random walks. Physica A: Statistical Mechanics and its Applications 505:1046–1055

    Google Scholar 

  41. Li W, Xie J, Xin M, Mo J (2018) An overlapping network community partition algorithm based on semi-supervised matrix factorization and random walk. Expert Syst Appl 91:277–285

    Google Scholar 

  42. Xin Y, Xie Z-Q, Yang J (2016) An adaptive random walk sampling method on dynamic community detection. Expert Syst Appl 58:10–19

    Google Scholar 

  43. Liu X, Zhou Y, Hu C, Guan X (2016) MIRACLE: a multiple independent random walks community parallel detection algorithm for big graphs. J Netw Comput Appl 70:89–101

    Google Scholar 

  44. Li X, Xu G, Tang M (2018) Community detection for multi-layer social network based on local random walk. J Vis Commun Image Represent 57:91–98

    Google Scholar 

  45. Macropol K, Can T, Singh AK (2009) RRW: repeated random walks on genome-scale protein networks for local cluster discovery. BMC Bioinformatics 10(1):283

    Google Scholar 

  46. Norris JR (1997) Markov Chains. Cambridge Series on Statistical and Probabilistic Mathematics, 1st ed. Cambridge University Press, Cambridge, UK, New York

  47. Zhang H, Raitoharju J, Kiranyaz S, Gabbouj M (2016) Limited random walk algorithm for big graph data clustering. Journal of Big Data 3(1):26

    Google Scholar 

  48. Lancichinetti A, Fortunato S, Kertész J (2009) Detecting the overlapping and hierarchical community structure in complex networks. New J Phys 11(3):033015

    Google Scholar 

  49. Blondel VD, Guillaume J-L, Lambiotte R, Lefebvre E (2008) Fast unfolding of communities in large networks. Journal of Statistical Mechanics: Theory and Experiment 2008(10):P10008

    MATH  Google Scholar 

  50. Yang J, Leskovec J, Community-Affiliation Graph Model for Overlapping Network Community Detection, in: 2012 IEEE 12th International Conference on Data Mining, 2012, pp. 1170–1175

  51. Chakraborty T, Ghosh S, Park N (2019) Ensemble-based overlapping community detection using disjoint community structures. Knowl-Based Syst 163:241–251

    Google Scholar 

  52. Chang Z, Jia C, Yin X, Zheng Y (2019) A generative model for exploring structure regularities in attributed networks. Inform Sci 505:252–264

    MathSciNet  MATH  Google Scholar 

  53. Yang J, Leskovec J (2015) Defining and evaluating network communities based on ground-truth. Knowl Inf Syst 42(1):181–213

    Google Scholar 

  54. Lancichinetti A, Fortunato S, Radicchi F (2008) Benchmark graphs for testing community detection algorithms. Phys Rev E 78(4):046110

    Google Scholar 

Download references

Acknowledgements

The authors would like to thank the anonymous reviewers for their valuable comments and suggestions, which have greatly improved the paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Parham Moradi.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bahadori, S., Moradi, P. & Zare, H. An improved limited random walk approach for identification of overlapping communities in complex networks. Appl Intell 51, 3561–3580 (2021). https://doi.org/10.1007/s10489-020-01999-4

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-020-01999-4

Keywords

Navigation