Skip to main content
Log in

A high-performance algorithm for finding influential nodes in large-scale social networks

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

As one of the significant issues in social networks analysis, the influence maximization problem aims to fetch a minimal set of the most influential individuals in the network to maximize the number of influenced nodes under a diffusion model. Several approaches have been proposed to tackle this NP-hard problem. The traditional approaches failed to develop an efficient and effective solution due to the exponential growth of the size of social networks (due to massive computational overhead). In this paper, a three-stage framework based on the community detection approach is devised, namely LGFIM. In the first stage, the search space was controlled by partitioning the network into communities. Simultaneously, three heuristic methods were presented for modifying the community detection algorithm to extract the optimal communities: core nodes selection, capacity constraint on communities, and communities combination. These extracted communities were highly compatible with the information propagation mechanism. The next stages apply a scalable and robust algorithm at two different levels of the network: 1. Exploring the local scope of communities to select the most influential nodes of each community and construct the potential influential nodes set 2. Exploring the global scope of the network to select the target influential nodes among potential influential nodes set. Experimental results on various real datasets proved that LGFIM could achieve remarkable results compared with the state-of-the-art algorithms, especially acceptable influence spread, much better running time, and more applicable to massive social networks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Availability of data and materials

All of the mentioned datasets are available at the SNAP library on the Stanford University website https://snap.stanford.edu/data/ or  http://konect.cc/networks/ website.

Code availability

These codes are available at https://github.com/mohsentaherinia/LGFIM.

References

  1. Bohra N, Bhatnagar V (2021) Group level social media popularity prediction by MRGB and Adam optimization. J Comb Optim 41(2):328–347

    Article  MathSciNet  MATH  Google Scholar 

  2. Sangwan N, Bhatnagar V (2021) Video popularity prediction using stacked bilstm layers. Malays J Comput Sci 34(3):242–254

    Article  Google Scholar 

  3. Malik S, Bansal P (2022) Multimodal semantic analysis with regularized semantic autoencoder. J Intell Fuzzy Syst 42:909–917

    Article  Google Scholar 

  4. Banerjee S, Jenamani M, Pratihar DK (2020) A survey on influence maximization in a social network. Knowl Inf Syst 62:3417–3455

    Article  Google Scholar 

  5. Li Y, Fan J, Wang Y, Tan K-LL (2018) Influence maximization on social graphs: a survey. IEEE Trans Knowl Data Eng 30:1852–1872

    Article  Google Scholar 

  6. Domingos P, Richardson M (2001) Mining the network value of customers. In: Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp 57–66. Association for Computing Machinery (ACM)

  7. Kempe D, Kleinberg J, Tardose (2003) Maximizing the spread of influence through a social network. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

  8. Sumith N, Annappa B, Bhattacharya S (2018) Influence maximization in large social networks: Heuristics, models and parameters. Future Gener Comput Syst 89:777–790

    Article  Google Scholar 

  9. Peng S, Yu S, Mueller P (2018) Social networking big data: opportunities, solutions, and challenges. Future Gener Comput Syst 86:1456–1458

    Article  Google Scholar 

  10. Han M, Li Y (2018) Influence analysis: a survey of the state-of-the-art. Math Found Comput 1(3):201–253

    Article  MathSciNet  MATH  Google Scholar 

  11. Wang Z, Sun C, Xi J, Li X (2021) Influence maximization in social graphs based on community structure and node coverage gain. Future Gener Comput Syst 118:327–338

    Article  Google Scholar 

  12. He Q, Wang X, Mao F, Lv J, Cai Y, Huang M, Xu Q (2020) CAOM: a community-based approach to tackle opinion maximization for social networks. Inf Sci 513:252–269

    Article  MathSciNet  Google Scholar 

  13. Huang H, Shen H, Meng Z (2020) Community-based influence maximization in attributed networks. Appl Intell 50:354–364

    Article  Google Scholar 

  14. Beni HA, Bouyer A (2020) TI-SC: top-k influential nodes selection based on community detection and scoring criteria in social networks. J Ambient Intell Humaniz Comput 1:3

    Google Scholar 

  15. Zhang Z, Li X, Gan C (2020) Identifying influential nodes in social networks via community structure and influence distribution difference. Digit Commun Netw

  16. Li J, Cai T, Deng K, Wang X, Sellis T, Xia F (2020) Community-diversified influence maximization in social networks. Inf Syst 92:101522

    Article  Google Scholar 

  17. Atif Y, Al-Falahi K, Wangchuk T, Lindström B (2020) A fuzzy logic approach to influence maximization in social networks. J Ambient Intell Hum Comput 11:2435–2451

    Article  Google Scholar 

  18. Banerjee S, Jenamani M, Pratihar DK (2019) ComBIM: a community-based solution approach for the Budgeted Influence Maximization Problem. Expert Syst Appl 125:1–13

    Article  Google Scholar 

  19. Huang H, Shen H, Meng Z, Chang H, He H (2019) Community-based influence maximization for viral marketing. Appl Intell 49:2137–2150

    Article  Google Scholar 

  20. Singh SS, Kumar A, Singh K, Biswas B (2019) C2IM: community based context-aware influence maximization in social networks. Physica A 514:796–818

    Article  MathSciNet  MATH  Google Scholar 

  21. Liqing Q, Wei J, Jinfeng Y, Xin F, Shuang Z (2019) PHG: a three-phase algorithm for influence maximization based on community structure. IEEE Access 1

  22. Li X, Cheng X, Su S, Sun C (2018) Community-based seeds selection algorithm for location aware influence maximization. Neurocomputing 275:1601–1613

    Article  Google Scholar 

  23. Ko YY, Cho KJ, Kim SW (2018) Efficient and effective influence maximization in social networks: a hybrid-approach. Inf Sci 465:144–161

    Article  MathSciNet  MATH  Google Scholar 

  24. Ye F, Liu J, Chen C, Ling G, Zheng Z, Zhou Y (2018) Identifying influential individuals on large-scale social networks: a community based approach. IEEE Access 6:47240–47257

    Article  Google Scholar 

  25. Jalayer M, Azheian M, Kermani Agha Mohammad Ali M (2018) A hybrid algorithm based on community detection and multi attribute decision making for influence maximization. Comput Ind Eng 120:234–250

  26. Shang J, Zhou S, Li X, Liu L, Wu H (2017) CoFIM: a community-based framework for influence maximization on large-scale networks. Knowl-Based Syst 117:88–100

    Article  Google Scholar 

  27. Bozorgi A, Samet S, Kwisthout J, Wareham T (2017) Community-based influence maximization in social networks under a competitive linear threshold model. Knowl-Based Syst 134:149–158

    Article  Google Scholar 

  28. Hosseini-Pozveh M, Zamanifar K, Naghsh-Nilchi AR (2017) A community-based approach to identify the most influential nodes in social networks. J Inf Sci 43:204–220

    Article  Google Scholar 

  29. Halappanavar M, Sathanur AV, Nandi AK (2016) Accelerating the mining of influential nodes in complex networks through community detection. In: 2016 ACM International Conference on Computing Frontiers - Proceedings, New York, New York, USA, pp 64–71. Association for Computing Machinery, Inc

  30. Bozorgi A, Haghighi H, Sadegh Zahedi M, Rezvani M (2016) INCIM: a community-based algorithm for influence maximization problem under the linear threshold model. Inf Process Manag 52:1188–1199

    Article  Google Scholar 

  31. Jaouadi M, Ben Romdhane L (2016) DIN: an efficient algorithm for detecting influential nodes in social graphs using network structure and attributes. In: Proceedings of IEEE/ACS International Conference on Computer Systems and Applications, AICCSA. IEEE Computer Society

  32. Zhao Y, Li S, Jin F (2016) Identification of influential nodes in social networks with community structure based on label propagation. Neurocomputing 210:34–44

    Article  Google Scholar 

  33. Rahimkhani K, Aleahmad A, Rahgozar M, Moeini A (2015) A fast algorithm for finding most influential people based on the linear threshold model. Expert Syst Appl 42:1353–1361

    Article  Google Scholar 

  34. Li H, Bhowmick SS, Sun A, Cui J (2015) Conformity-aware influence maximization in online social networks. VLDB J 24:117–141

    Article  Google Scholar 

  35. Chen YC, Zhu WY, Peng WC, Lee WC, Lee SY (2014) CIM: community-based influence maximization in social networks. ACM Trans Intell Syst Technol 5:1–31

    Article  Google Scholar 

  36. Wang Y, Cong G, Song G, Xie K (2010) Community-based greedy algorithm for mining top-K influential nodes in mobile social networks. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - KDD ’10, New York, New York, USA, p 1039. ACM Press

  37. Fortunato S, Barthelemy M (2006) Resolution limit in community detection. Proc Natl Acad Sci USA 104:36–41

    Article  Google Scholar 

  38. Raghavan UN, Albert R, Kumara S (2007) Near linear time algorithm to detect community structures in large-scale networks. Phys Rev E Stat Nonlinear Soft Matter Phys 76:036106

    Article  Google Scholar 

  39. Leung IXY, Hui P, Lio P, Crowcroft J (2008) Towards real-time community detection in large networks. Phys Rev Stat Nonlinear Soft Matter Phys 79:066107

  40. Barber MJ, Clark JW (2009) Detecting network communities by propagating labels under constraints. Phys Rev Stat Nonlinear Soft Matter Phys 80:026129

    Article  Google Scholar 

  41. Šubelj L, Bajec M (2011) Unfolding communities in large complex networks: combining defensive and offensive label propagation for core extraction. Phys Rev E Stat Nonlinear Soft Matter Phys 83:036103

    Article  MathSciNet  Google Scholar 

  42. Rezaei A, Far SM, Soleymani M (2015) Near linear-time community detection in networks with hardly detectable community structure. In: Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015, pp 65–72. Association for Computing Machinery, Inc

  43. Clauset A, Newman MEJ, Moore C (2004) Finding community structure in very large networks. Phys Rev E Stat Phys Plasmas Fluids Relat Interdiscip Top 70:6

    Google Scholar 

  44. Blondel VD, Guillaume JL, Lambiotte R, Lefebvre E (2008) Fast unfolding of communities in large networks. J Stat Mech Theory Exp 2008:P10008

    Article  MATH  Google Scholar 

  45. Rosvall M, Bergstrom CT (2007) Maps of random walks on complex networks reveal community structure. Proc Natl Acad Sci USA 105:1118–1123

    Article  Google Scholar 

  46. Xavier CR , Da Fonseca Vieira V, Evsukoff AG (2019) Studying the effect of community structure for seed selection in an influence model. Int J Inf Decis Sci 11(4):300–319

  47. Leskovec J, Krause A, Guestrin C, Faloutsos C, Vanbriesen J, Glance N (2007) Cost-effective outbreak detection in networks. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, New York, USA, pp 420–429. ACM Press

  48. Goyal A, Lu W, Lakshmanan L. V. (2011) CELF++: optimizing the greedy algorithm for influence maximization in social networks. In: Proceedings of the 20th International Conference Companion on World Wide Web, WWW 2011, pp 47–48

  49. Tang Y, Xiao X, Shi Y (2014) Influence maximization: Near-optimal time complexity meets practical efficiency. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, New York, New York, USA, pp 75–86. Association for Computing Machinery

  50. Tang Y, Shi Y, Xiao X (2015) Influence maximization in near-linear time: a martingale approach. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, vol. 2015-May, New York, New York, USA, pp 1539–1554. Association for Computing Machinery

  51. Chen W, Wang Y, Yang S (2009) Efficient influence maximization in social networks. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, New York, USA, pp 199–207. ACM Press

  52. Cheng S, Shen H, Huang J, Zhang G, Cheng X (2013) StaticGreedy: solving the scalability-accuracy dilemma in influence maximization. In: International Conference on Information and Knowledge Management, Proceedings, New York, New York, USA, pp 509–518. ACM Press

  53. Peng S, Zhou Y, Cao L, Yu S, Niu J, Jia W (2018) Influence analysis in social networks: a survey. J Netw Comput Appl 106:17–32

    Article  Google Scholar 

  54. Page L, Page L, Brin S, Motwani R, Winograd T (1998) The PageRank citation ranking: bringing order to the web

  55. Kleinberg JM (1999) Authoritative sources in a hyperlinked environment. J ACM 46(5):604–632

    Article  MathSciNet  MATH  Google Scholar 

  56. Kimura M, Saito K, Nakano R, Motoda H (2010) Extracting influential nodes on a social network for information diffusion. Data Min Knowl Discov 20:70–97

    Article  MathSciNet  Google Scholar 

  57. Wang C, Chen W, Wang, Y (2012) Scalable influence maximization for independent cascade model in large-scale social networks. In: Data Mining and Knowledge Discovery, vol. 25, pp 545–576. Springer

  58. Chen W, Wang C, Wang Y (2010) Scalable influence maximization for prevalent viral marketing in large-scale social networks. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, New York, USA, pp 1029–1038. ACM Press

  59. Kim J, Kim SK, Yu H (2013) Scalable and parallelizable processing of influence maximization for large-scale social networks? In: Proceedings of International Conference on Data Engineering, pp 266–277

  60. Galhotra S, Arora A, Roy S (2016) Holistic influence maximization: combining scalability and efficiency with opinion-aware models. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, vol. 26-June-20, pp 743–758

  61. Goyal A, Lu W, Lakshmanan LV (2011) SIMPATH: an efficient algorithm for influence maximization under the linear threshold model. In: 2011 IEEE 11th International Conference on Data Mining, pp 211–220. IEEE

  62. Chen W, Yuan Y, Zhang L (2010) Scalable influence maximization in social networks under the linear threshold model. In: Proceedings of IEEE International Conference on Data Mining, ICDM, pp 88–97

  63. Jung K, Heo W, Chen W (2012) IRIE: scalable and robust influence maximization in social networks. In: Proceedings of IEEE International Conference on Data Mining, ICDM, pp 918–923

  64. Liu Q, Xiang B, Chen E, Xiong H, Tang F, Yu JX (2014) Influence maximization over large-scale social networks: a bounded linear approach. In: CIKM 2014—Proceedings of the 2014 ACM International Conference on Information and Knowledge Management, New York, New York, USA, pp 171–180.Association for Computing Machinery, Inc

  65. Cheng S, Shen H-W, Huang J, Chen W, Cheng X-Q (2014) IMRank: influence maximization via finding self-consistent ranking. In: SIGIR 2014 - Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp 475–484

  66. Bahutair M, Al Aghbari Z, Kamel I (2021) NodeRank: finding influential nodes in social networks based on interests. J Supercomput 1–27

  67. Khan BS, Niazi MA (2017) Network community detection: a review and visual survey. arXiv

  68. Xie J, Szymanski BK, Liu X (2011) SLPA: uncovering overlapping communities in social networks via a speaker-listener interaction dynamic process.’ In: Proceedings - IEEE International Conference on Data Mining, ICDM, pp 344–349

  69. Khomami MMD, Rezvanian A, Meybodi MR, Bagheri A (2020) CFIN: a community-based algorithm for finding influential nodes in complex social networks. J Supercomput 1–30

  70. Samir AM, Rady S, Gharib TF (2021) LKG: a fast scalable community-based approach for influence maximization problem in social networks. Physica A 582:126258

    Article  Google Scholar 

  71. Xie X, Li J, Sheng Y, Wang W, Yang W (2021) Competitive influence maximization considering inactive nodes and community homophily. Knowl-Based Syst 233:107497

    Article  Google Scholar 

  72. Ahmad A, Ahmad T, Bhatt A (2020) HWSMCB: a community-based hybrid approach for identifying influential nodes in the social network. Physica A 545:123590

    Article  Google Scholar 

  73. Amiri B, Fathian M, Asaadi E (2021) Influence maximization in complex social networks based on community structure. J Ind Syst Eng 13(3):16–40

    Google Scholar 

  74. Newman MEJ (2006) Modularity and community structure in networks. Proc Natl Acad Sci USA 103:8577–8582

    Article  Google Scholar 

  75. Douglas JF (1997) Random walks and random environments, vol. 2, random environments. J Stat Phys 87:961–962

    Article  Google Scholar 

  76. Ahn Y-Y, Bagrow JP, Lehmann S (2009) Link communities reveal multiscale complexity in networks. Nature 466:761–764

    Article  Google Scholar 

  77. Palla G, Derenyi I, Farkas I, Vicsek T (2005) Uncovering the overlapping community structure of complex networks in nature and society

  78. Scripps J, Tan PN, Esfahanian AH (2007) Exploration of link structure and community-based node roles in network analysis. In: Proceedings of IEEE International Conference on Data Mining, ICDM, pp 649–654. IEEE

  79. Panagopoulosécole G, Malliaros FD, Vazirgiannisécole M (2020) Influence maximization using influence and susceptibility embeddings. Technical Report

  80. Leskovec J, Sosič R (2016) SNAP: a general-purpose network analysis and graph-mining library. ACM Trans Intell Syst Technol (TIST) 8(1):1

    Article  Google Scholar 

  81. Kunegis J (2013) The Koblenz Network Collection (KONECT)

  82. Leskovec J, Krevl A (2014) Snap datasets, s.n.a.p. and stanford . Acessed Oct 25 2020

  83. Haque M, Banerjee DS (2019) Accelerating influence maximization using heterogeneous algorithms. J Supercomput 76:4747–4769

    Article  Google Scholar 

Download references

Funding

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors. The authors would like to thank the anonymous reviewers for their valuable comments that helped to improve the quality of this manuscript.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mahdi Esmaeili.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Taherinia, M., Esmaeili, M. & Minaei-Bidgoli, B. A high-performance algorithm for finding influential nodes in large-scale social networks. J Supercomput 78, 15905–15952 (2022). https://doi.org/10.1007/s11227-022-04418-2

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11227-022-04418-2

Keywords

Navigation