Identifying influential spreaders in complex networks by an improved gravity model

Li, Zhe; Huang, Xinyu

doi:10.1038/s41598-021-01218-1

Identifying influential spreaders in complex networks by an improved gravity model

Article
Open access
Published: 12 November 2021

Volume 11, article number 22194, (2021)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Identifying influential spreaders in complex networks by an improved gravity model

Download PDF

Zhe Li¹ &
Xinyu Huang²

2170 Accesses
17 Citations
Explore all metrics

Abstract

Identification of influential spreaders is still a challenging issue in network science. Therefore, it attracts increasing attention from both computer science and physical societies, and many algorithms to identify influential spreaders have been proposed so far. Degree centrality, as the most widely used neighborhood-based centrality, was introduced into the network world to evaluate the spreading ability of nodes. However, degree centrality always assigns too many nodes with the same value, so it leads to the problem of resolution limitation in distinguishing the real influences of these nodes, which further affects the ranking efficiency of the algorithm. The k-shell decomposition method also faces the same problem. In order to solve the resolution limit problem, we propose a high-resolution index combining both degree centrality and the k-shell decomposition method. Furthermore, based on the proposed index and the well-known gravity law, we propose an improved gravity model to measure the importance of nodes in propagation dynamics. Experiments on ten real networks show that our model outperforms most of the state-of-the-art methods. It has a better performance in terms of ranking performance as measured by the Kendall’s rank correlation, and in terms of ranking efficiency as measured by the monotonicity value.

Enhancement of Gravity Centrality Measure Based on Local Clustering Method by Identifying Influential Nodes in Social Networks

Identifying influential nodes in complex networks using a gravity model based on the H-index method

Article Open access 29 September 2023

Effectively identifying the influential spreaders in large-scale social networks

Article 10 September 2014

Introduction

Network science plays an extremely key role in many fields¹. The heterogeneity of real networks² puts forward a vital question: How to measure the importance of nodes quantitatively? An effective algorithm to identify influential spreaders may be a good answer. Identification of influential spreaders can be widely used in epidemic analysis^3,4, rumor analysis⁵, power grid protection⁶, knowledge graph⁷, social computing^8,9, information propagation¹⁰, community detection^11,12, discovery of candidate drug targets and essential proteins¹³, discovery of important species^14,15, and so on.

So far, most known methods merely use structural information¹⁶, which can be classified into neighborhood-based centralities and path-based centralities roughly. Typical representatives of neighborhood-based centralities are degree centrality¹⁷ (DC), k-shell decomposition method¹⁸ (KS) and H-index¹⁹ while typical representatives of path-based centralities are betweenness centrality²⁰ (BC) and closeness centrality²¹ (CC).

Although the above methods are very classic, it is difficult to identify the vital nodes in complex networks accurately and efficiently. In order to solve this problem, many effective node ranking algorithms^{22,23,24,25,26,27,28,29} have been proposed in recent years, among which the algorithms based on gravity law seem very promising. Hence, a series of algorithms^{28,29,30,31,32,33,34,35,36,37,38,39,40} based on the gravity law have been proposed, and their performance is much better than the above classic methods. Typical representatives are gravity centrality²⁸ (GC) and local gravity model²⁹ (LGM). GC regards the k-shell value of a node as its mass, the shortest distance between two nodes in the network as its distance, while LGM regards the degree value of a node as its mass, and the shortest distance between two nodes as its distance. However, whether the degree or k-shell value is regarded as the mass, there is a shortcoming, i.e., DC and KS both assign too many nodes with the same value. So it leads to the problem of resolution limitation in distinguishing the real influences of these nodes, which further affects the ranking efficiency of the algorithm.

In this paper, in order to solve the above problem, we propose a high-resolution index combining both DC and KS. Furthermore, based on the proposed index and the well-known gravity law, we propose an improved gravity model to measure the importance of nodes in propagation dynamics. Experiments on ten real networks show that our model performs best in comparison with the above well-known state-of-the-art methods both in terms of ranking performance as measured by the Kendall’s rank correlation, and in terms of ranking efficiency as measured by the monotonicity value.

Results

Algorithms

Firstly, we take a toy network shown in Fig. 1 to illustrate the resolution limit problem for DC and KS. The degree and k-shell values of each node in the toy network are shown in Table 1. Obviously, $k(1)=k(8)=k(9)=1$, $k(2)=k(3)=3$, $k(4)=k(5)=k(6)=4$, $k_s(1)=k_s(8)=k_s(9)=1$, $k_s(2)=k_s(3)=2$, $k_s(4)=k_s(5)=k_s(6)=k_s(7)=3$, where k(i) and $k_s(i)$ are the degree and k-shell value of node i, respectively. DC and KS always assigns too many nodes with the same value, which leads to the problem of resolution limitation in distinguishing the real influences of these nodes.

Table 1 The degree and k-shell values of each node in the toy network.

Full size table

A simple solution is to consider both DC and KS, that is, to estimate the influence of node i by $k(i)+k_s(i)$. However, the problem has not been completely solved. Take node 2 and node 3 as an example, compared with node 2, node 3 is closer to the center of the network, so node 3 may be more conducive to propagation. However, we cannot distinguish the two nodes by the above proposed method. Although both node 2 and node 3 are in the 2-shell, node 3 is removed later than node 2, that is, the 2-shell decomposition process includes two stages, node 2 is removed in the first stage and node 3 is removed in the second stage. So we introduce the stage number at which the node is removed from the network while performing the k-shell decomposition.

Given a network G, during the process of k-shell decomposition for the k-degree iteration, the total number of stages is q(k), and node i is removed in the p(i) stage. The improved k-shell index of node i , denoted by $k_{s}^*(i)$, can be calculated by

$$\begin{aligned} k_{s}^*(i)=k_{s}(i)+\frac{p(i)}{\max \limits _{k}q(k)+1}. \end{aligned}$$

(1)

The process of k-shell decomposition and the $k_{s}^*$ value of each node in the toy network are shown in Table 2 and Table 3, respectively. Take node 3 as an example, $q(1)=1$, $q(2)=2$, $q(3)=1$, and then $\max \limits _{k}q(k)=2$, so $k_{s}^*(3)=k_{s}(3)+p(3)/(\max \limits _{k}q(k)+1)=2+2/(2+1)\approx 2.667$.

Table 2 The process of k-shell decomposition in the toy network.

Full size table

Table 3 The $k_{s}^*$ value of each node in the toy network.

Full size table

The index combining degree and k-shell of node i, denoted by DK(i), can be defined by

$$\begin{aligned} DK(i)=k(i)+k_{s}^*(i). \end{aligned}$$

(2)

Such index is named as degree k-shell (DK) index. The DK value of each node in the toy network are shown in Table 4. As shown in Table 4, node 2 and node 3 can be distinguished (DC, KS, DC+KS failed), node 7 can be distinguished from nodes 4–6 (KS failed), so DK index is a high-resolution index. Furthermore, DK carries both the local and global information of nodes.

Table 4 The DK value of each node in the toy network.

Full size table

Inspired by the gravity law, we regard DK value of a node as its mass and the shortest distance between two nodes in the network as their distance. Hence the influence of node i can be estimated as follows

$$\begin{aligned} DKGM(i)=\sum _{j\ne {i},d(i,j)\le R}\frac{DK(i)DK(j)}{d^{2}(i,j)}, \end{aligned}$$

(3)

where d(i, j) is the shortest distance from node i to node j and R is the truncation radius²⁹. Such method is named as DK-based gravity model (DKGM). The algorithmic description of the DKGM is provided in Algorithm 1.

The result of DKGM with $R=2$ of the toy network is shown in Table 5. Take node 3 as an example, the 1-order neighbors of node 3 are node 2, node 4 and node 7, the 2-order neighbors of node 3 are node 1, node 5 and node 6, so $DKGM(3)=DK(3)*DK(2)+DK(3)*DK(4)+DK(3)*DK(7)+DK(3)*DK(1)/4+DK(3)*DK(5)/4+DK(3)*DK(6)/4\!\approx143.08$.

Table 5 The result of DKGM with $R=2$ of the toy network.

Full size table

By Algorithm 1, we can find that calculating the improved k-shell index needs the following times operations, $N_{ks1}\left\langle k \right\rangle + N_{ks2}\left\langle k \right\rangle + \cdot + N_{ksmax}\left\langle k \right\rangle $ = $(N_{ks1} + N_{ks2} + \cdot + N_{ksmax})\left\langle k \right\rangle $ = $N \left\langle k \right\rangle $ = M, so the computational complexity of this part is O(M), where $N_{ks1}$ is the number of 1-shell nodes, ksmax is the max k-shell value and $\left\langle k \right\rangle $ is the average degree. The part with the highest computational complexity in our model is computing the R-order neighbors of each node, it needs $N \left\langle k \right\rangle ^{R}$ times operations, so the computational complexity of this part is $O(N \left\langle k \right\rangle ^{R})$. Therefore, the computational complexity of our model is $O(N \left\langle k \right\rangle ^{R})$. Fortunately, since most real networks are of small-world property, R is usually set to 2 or 3 to obtain the optimal result. So the computational complexity of our model in real-life applications is generally not more than $O(N \left\langle k \right\rangle ^3)$, where $\left\langle k \right\rangle \ll N$.

Data description

In this paper, we use ten real networks from different fields to test the performance of DKGM, including four social networks (PB⁴¹, Facebook⁴², WV⁴³ and Sex⁴⁴), two collaboration networks (Jazz⁴⁵ and NS⁴⁶), one transportation network (USAir⁴⁷), one communication network (Email⁴⁸), one infrastructure network (Power⁴⁹) and one technological network (Router⁵⁰). These networks’ topological features are shown in Table 6, including the number of nodes, denoted by N, the number of links, denoted by M, the average degree, denoted by $\langle k\rangle $, the average distance, denoted by $\langle d\rangle $, the clustering coefficient⁴⁹, denoted by C, the assortative coefficient⁵¹, denoted by r, the degree heterogeneity⁵², denoted by H, and the epidemic threshold⁵³ of the SIR model⁵⁴, denoted by $\beta _c$.

Table 6 The basic topological features of the ten real networks.

Full size table

Empirical results

In this paper, we apply the famous SIR model⁵⁴ to compare the influential rankings produced by algorithms and simulations. Given the network and infection rate $\beta $, 1000 independent implementations are performed and averaged in order to obtain the standard ranking of the influences of nodes (see details about SIR model in Methods). In each implementation every node is selected once as the seed once. The accuracy of an algorithm is measured by Kendall’s Tau ($\tau $)⁵⁵ (see details about the Kendall’s Tau in Methods) between the standard ranking and the ranking produced by the algorithm. The larger the value of $\tau $, the better the performance. The accuracies of DKGM and the seven benchmark algorithms (see details about the benchmark centralities in Methods) for $\beta =\beta _c$ are compared in Table 7, and the accuracies of different $\beta $ values are shown in Fig. 2.

Table 7 The algorithms’ accuracies measured by Kendall’s Tau for $\beta =\beta _c$. The parameters in the related algorithms (i.e., LGM and DKGM) are adjusted to their optimal values according to the largest $\tau $. The best algorithm for each network is emphasized by bold.

Full size table

As shown in Table 7, compared with the five classic methods (DC, KS, H-index, BC, CC), GC, LGM and DKGM are very competitive. Especially in the NS, Power and Router networks, the advantage of the gravity-based methods are extremely obvious. It can be seen from Table 6 that NS, Power and Router are extremely sparse (with very few links). In this tree-like networks, there are very few cycles, that is, most paths have no alternative paths, so propagation is very difficult. In this case, neither the neighborhood-based methods (DC, KS and H-index) nor the path-based methods (BC and CC) can work well. Furthermore, compared with GC and LGM, DKGM always performs best. As shown in Figure 2, DKGM also performs very competitive compared with the seven benchmark algorithms for different $\beta $ not too far from $\beta _c$.

The optimal truncation radius $R^*$ of LGM can be estimated by

$$\begin{aligned} R^*\approx \frac{1}{2}\left\langle d \right\rangle \end{aligned}$$

(4)

at $\beta =\beta _c$²⁹. As shown in Figure 3, DKGM still keeps this property.

Furthermore, the accuracies of GC, LGM with $R = \left\langle d \right\rangle /2$ and DKGM with $R = \left\langle d \right\rangle /2$ for $\beta =\beta _c$ are compared in Table 8. As shown in Table 8, although the truncation radius is set heuristically, DKGM still performs best among the three algorithms.

Table 8 The accuracies of GC, LGM ($R = \left\langle d \right\rangle /2$) and DKGM ($R = \left\langle d \right\rangle /2$) for $\beta =\beta _c$. The best algorithm for each network is emphasized by bold.

Full size table

Finally, we apply the monotonicity⁵⁶, denoted by $M_{r}$, to measure the ranking efficiency of algorithms. This metric is used to measure the uniqueness of the elements in a ranking list and it can be computed by

$$ M_{r} (L) = \left[ {1 - \frac{{\sum\nolimits_{{r \in L}} {N_{t} } (r)(N_{t} (r) - 1)}}{{N(N - 1))}}} \right]^{2} $$

(5)

where L is the ranking list, and $N_{t}(r)$ is the number of ties with the same rank r.

The monotonicity of node ranking list produced by different algorithms is shown in Table 9. As shown in Table 9, except the PB network, DKGM always performs best among the eight algorithms. In the PB network, the reason why GC narrowly defeated DKGM is that DKGM just considers 1-order neighbors while GC considers 3-order neighbors. The results reported in Table 9 demonstrate DKGM is a remarkably high-resolution algorithm.

Table 9 The monotonicity of node ranking list produced by different algorithms, the best algorithm for each network is emphasized by bold.

Full size table

Discussion

Degree centrality and the k-shell decomposition method, as the most widely used neighborhood-based centralities, were introduced to the network world to evaluate the spreading ability of the nodes. However, the two methods always assign too many nodes with the same value, which leads to the problem of resolution limitation in distinguishing the real influences of these nodes. To solve the above problem, combining the two methods (i.e., DC and KS), we propose a high-resolution index (DK) that can simultaneously reflect the local and global information of nodes. Furthermore, we propose an improved gravity model (DKGM) that combining DK index and the gravity law to evaluate the spreading ability of nodes. The empirical results show that DKGM performs best in comparison with seven well-known benchmark methods and DKGM is a remarkably high-resolution algorithm.

A potential disadvantage of DKGM is how to set truncation radius R. Fortunately, as shown in Fig. 3, we find an empirical relation between $R^*$ and the average distance $\left\langle d \right\rangle $, so we can use the relation (see Eq. 4) to approximate $R^*$. In addition, since most real networks are of small-world property^49,57, $R^*$ should be small, it can be set to 2 or 3 generally.

There are still some potential problems in the future. First of all, the original law of gravity is symmetrical, but due to the different effects of different nodes or the inherent asymmetry of dynamics^58,59, the influence of node i on node j may be different from that of node j on node i, in which the asymmetric form of gravity law may be involved. Secondly, as the heterogeneity of the links greatly change their importance⁶⁰, how to use gravity model in the weighted networks is still an open issue. We will also develop some other better methods based on the gravity law to identify influential spreaders.

Methods

Benchmark centralities

We denote an undirected and unweighted network as $G=<V,E>$, where V and E are the sets of nodes and links, respectively, denote $|V|=N$ and $|E|=M$, so the network has N nodes and M links. The adjacent matrix of G is represented by $A=(a_{ij})_{N\times N}$, if there is a link from node i to node j, $a_{ij}=1$, otherwise, $a_{ij}=0$.

DC¹⁷ of node i can be calculated by

$$\begin{aligned} DC(i)=k(i), \end{aligned}$$

(6)

where $k(i)=\sum _j a_{ij}$.

KS¹⁸ works by iterative decomposition of the network into different shells. The first step of KS is to remove all the nodes in the network whose degree $k=1$. Then it remove nodes whose degree $k \le 1$ after one round removal because this step may lead to the reduction of the degree values during the process of removal. Until there are no nodes in the network with degree $k \le 1$, all the nodes which have been removed in this step create 1-shell and their k-shell values are equal to one. Then repeat this process to obtain 2-shell, 3-shell, ... , and so on. Finally all nodes are divided into different shells and the k-shell value of each node can be obtained.

The H-index¹⁹ of node i, represented by H(i), is defined as the maximal integer value satisfying that there are at least H(i) neighbors of node i and degrees of these neighbors are all no less than H(i).

BC²⁰ of node i can be calculated by

$$\begin{aligned} BC(i)=\sum _{s\ne {i},s\ne {t},i\ne {t}}\frac{g_{st}(i)}{g_{st}}, \end{aligned}$$

(7)

where $g_{st}$ is the number of shortest paths from node s to node t, and $g_{st}(i)$ is the number of shortest paths from node s to node t that pass through node i.

CC²¹ of node i can be calculated by

$$\begin{aligned} CC(i)=\frac{N-1}{\sum \limits _{j\ne i} d(i,j)}. \end{aligned}$$

(8)

GC²⁸ of node i can be calculated by

$$\begin{aligned} GC(i)=\sum _{j\in \psi _i}\frac{k_s(i)k_s(j)}{d^{2}(i,j)}, \end{aligned}$$

(9)

where $\psi _i$ is the neighborhood set whose distance to node i is less than or equal to 3.

LGM²⁹ of node i can be calculated by

$$\begin{aligned} LGM(i)=\sum _{d_{ij}\le R,j\ne {i}}\frac{k(i)k(j)}{d^{2}(i,j)}. \end{aligned}$$

(10)

SIR model

The SIR model⁵⁴ initially considers all nodes as susceptible (S) except the source node in the infected (I) state. Each infected node can infect its susceptible neighbors with probability $\beta $. In each subsequent step, all infected nodes change their own states to recovered (R). A node in the recovered state will never participate in the propagation dynamic process with the probability $\lambda $. The propagation process continues until there are no nodes in the infected state. The influence of node i can be estimated by

$$\begin{aligned} F(i) = N_r/N, \end{aligned}$$

(11)

where $N_r$ is the number of recovered nodes when dynamic process achieving steady state. $\lambda $ is set to 1 for simplicity, and the corresponding epidemic threshold⁵³ is

$$\begin{aligned} \beta _c\approx \frac{\left\langle k \right\rangle }{\left\langle k^{2} \right\rangle -\left\langle k \right\rangle }, \end{aligned}$$

(12)

where $\left\langle k^{2} \right\rangle $ is the second-order moment of the degree distribution.

The Kendall’s Tau

The Kendall’s Tau⁵⁵ is a measure of the strength of correlation between two sequences. $X=(x_1, x_2, ... ,x_N)$ and $Y=(y_1, y_2, ..., y_N)$ are two sequences with N elements. For any pair of two-tuples $(x_i,y_i)$ and $(x_j,y_j)$ $(i\ne j)$, if $x_i>x_j$ and $y_i>y_j$ or $x_i<x_j $ and $y_i<y_j$, the pair is concordant. If $x_i>x_j$ and $y_i<y_j$ or $x_i<x_j$ and $y_i>y_j$, the pair is inconsistent. If $x_i=x_j$ or $y_i=y_j$, the pair is neither concordant nor inconsistent. Kendall’s Tau of X and Y can be defined as

$$\begin{aligned} \tau =\frac{2(n_+-n_-)}{N(N-1)}, \end{aligned}$$

(13)

where $n_+$ is the number of concordant pairs and $n_-$ is the number of discordant pairs.

Data availability

All relevant data are available at https://github.com/MLIF/Network-Data.

References

Newman, M. E. J. Networks (Oxford University Press, Oxford, 2018).
Book MATH Google Scholar
Caldarelli, G. Scale-Free Networks: Complex Webs in Nature and Technology (Oxford University Press, Oxford, 2007).
Book MATH Google Scholar
Barabási, A. L., Gulbahce, N. & Loscalzo, J. Network medicine: A network-based approach to human disease. Nat. Rev. Genet. 12, 56–68 (2011).
Article PubMed PubMed Central Google Scholar
Zhu, P., Zhi, Q., Guo, Y. & Wang, Z. Analysis of epidemic spreading process in adaptive networks. IEEE Trans Circuits Syst. II Express Briefs 66, 1252–1256 (2018).
Google Scholar
Borge-Holthoefer, J. & Moreno, Y. Absence of influential spreaders in rumor dynamics. Phys. Rev. E 85, 026116 (2012).
Article ADS Google Scholar
Albert, R., Albert, I. & Nakarado, G. L. Structural vulnerability of the North American power grid. Phys. Rev. E 69, 025103 (2004).
Article ADS Google Scholar
Park, N., Kan, A., Dong, X.L., Zhao, T. & Faloutsos, C. Estimating node importance in knowledge graphs using graph neural networks. In Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining. 596–606 (ACM Press, 2019).
Yu, Z., Shao, J., Yang, Q. & Sun, Z. ProfitLeader: Identifying leaders in networks with profit capacity. World Wide Web 22, 533–553 (2019).
Article Google Scholar
Fan, C., Zeng, L., Sun, Y. & Liu, Y. Finding key players in complex networks through deep reinforcement learning. Nat. Mach. Intell. 2, 317–324 (2020).
Article PubMed PubMed Central Google Scholar
Xu, W. et al. Identifying structural hole spanners to maximally block information propagation. Inf. Sci. 505, 100–126 (2019).
Article MathSciNet MATH Google Scholar
Zheng, Z., Ye, F., Li, R. H., Ling, G. & Jin, T. Finding weighted k-truss communities in large networks. Inf. Sci. 417, 344–360 (2017).
Article MATH Google Scholar
He, K., Li, Y., Soundarajan, S. & Hopcroft, J. E. Hidden community detection in social network. Inf. Sci. 425, 92–106 (2018).
Article MathSciNet Google Scholar
Csermely, P., Korcsmáros, T., Kiss, H. J. M., London, G. & Nussinov, R. Structure and dynamics of molecular networks: A novel paradigm of drug discovery: A comprehensive review. Pharmacol. Ther. 138, 333–408 (2013).
Article CAS PubMed PubMed Central Google Scholar
Bellingeri, M. & Bodini, A. Food web's backbones and energy delivery in ecosystems. Sci. Rep. 125, 586–594 (2016).
Google Scholar
Bellingeri, M., Cassi, D. & Vincenzi, S. Increasing the extinction risk of highly connected species causes a sharp robust-to-fragile transition in empirical food webs. Ecol. Model 251, 1–8 (2013).
Article Google Scholar
Lü, L. et al. Vital nodes identification in complex networks. Phys. Rep. 650, 1–63 (2016).
Article MathSciNet ADS Google Scholar
Bonacich, P. Factoring and weighting approaches to status scores and clique identification. Math. Sociol. 2, 113–120 (1972).
Article Google Scholar
Kitsak, M. et al. Identification of influential spreaders in complex networks. Nat. Phys. 6, 888–893 (2010).
Article CAS Google Scholar
Lü, L., Zhou, T., Zhang, Q. M. & Stanley, H. E. The H-index of a network node and its relation to degree and coreness. Nat. Commun. 7, 10168 (2016).
Article PubMed PubMed Central ADS Google Scholar
Freeman, L. C. A set of measures of centrality based on betweenness. Sociometry 40, 35–41 (1977).
Article Google Scholar
Freeman, L. C. Centrality in social networks conceptual clarification. Soc. Netw. 1, 215–239 (1979).
Article Google Scholar
Kumar, A., Srinivasan, K., Cheng, W. H. & Zomaya, A. Y. Hybrid context enriched deep learning model for fine-grained sentiment analysis in textual and visual semiotic modality social data. Inform. Process. Manag. 57, 102141 (2020).
Article Google Scholar
Yanez-Sierra, J., Diaz-Perez, A. & Sosa-Sosa, V. An efficient partition-based approach to identify and scatter multiple relevant spreaders in complex networks. Entropy 23, 1216 (2021).
Article MathSciNet PubMed PubMed Central ADS Google Scholar
Iwendi, C., Ponnan, S., Munirathinam, R., Srinivasan, K. & Chang, C. Y. An efficient and unique TF/IDF algorithmic model-based data analysis for handling applications with big data streaming. Electronics 8, 1331 (2019).
Article Google Scholar
Zhu, J. & Wang, L. Identifying influential nodes in complex networks based on node itself and neighbor layer information. Symmetry 13, 1570 (2021).
Article Google Scholar
Li, C., Wang, L., Sun, S. & Xia, C. Identification of influential spreaders based on classified neighbors in real-world complex networks. Appl. Math. Comput. 320, 512–523 (2018).
Article MathSciNet MATH Google Scholar
Wang, J., Li, C. & Xia, C. Improved centrality indicators to characterize the nodal spreading capability in complex networks. Appl. Math. Comput. 334, 388–400 (2018).
MathSciNet Google Scholar
Ma, L. L., Ma, C., Zhang, H. F. & Wang, B. H. Identifying influential spreaders in complex networks based on gravity formula. Physica A 451, 205–212 (2015).
Article MATH ADS Google Scholar
Li, Z. et al. Identifying influential spreaders by gravity model. Sci. Rep. 9, 8387 (2019).
Article PubMed PubMed Central ADS Google Scholar
Liu, F., Wang, Z. & Deng, Y. GMM: A generalized mechanics model for identifying the importance of nodes in complex networks. Knowl. Based Syst. 193, 105464 (2020).
Article Google Scholar
Yang, X. & Xiao, F. An improved gravity model to identify influential nodes in complex networks based on k-shell method. Knowl. Based Syst. 227, 107198 (2021).
Article Google Scholar
Shang, Q., Deng, Y. & Cheong, K. H. Identifying influential nodes in complex networks: Effective distance gravity model. Inf. Sci. 577, 162–179 (2021).
Article MathSciNet Google Scholar
Maji, G., Namtirtha, A., Dutta, A. & Malta, M. C. Influential spreaders identification in complex networks with improved k-shell hybrid method. Expert Syst. Appl. 144, 113092 (2020).
Article Google Scholar
Maji, G., Dutta, A., Malta, M. C. & Sen, S. Identifying and ranking super spreaders in real world complex networks without influence overlap. Expert Syst. Appl. 179, 115061 (2021).
Article Google Scholar
Ullah, A. et al. Identification of nodes influence based on global structure model in complex networks. Sci. Rep. 11, 6173 (2021).
Article PubMed PubMed Central ADS Google Scholar
Bi, J. et al. Temporal gravity model for important node identification in temporal networks. Chaos Solitons Fractals 147, 110934 (2021).
Article MathSciNet Google Scholar
Li, H., Shang, Q. & Deng, Y. A generalized gravity model for influential spreaders identification in complex networks. Chaos Solitons Fractals 143, 110456 (2021).
Article MathSciNet Google Scholar
Wang, X., Yang, Q., Liu, M. & Ma, X. Comprehensive influence of topological location and neighbor information on identifying influential nodes in complex networks. PLoS ONE 16, e0251208 (2021).
Article CAS PubMed PubMed Central Google Scholar
Huang, X., Chen, D., Wang, D. & Ren, T. Identifying influencers in social networks. Entropy 22, 450 (2020).
Article MathSciNet PubMed Central Google Scholar
Yan, X., Cui, Y. & Ni, S. Identifying influential spreaders in complex networks based on entropy weight method and gravity law. Chin. Phys. B 29, 048902 (2020).
Article ADS Google Scholar
Adamic, L. A. & Glance, N. The political blogosphere and the 2004 U.S. election: Divided they blog. In Proceedings of the 3rd International Workshop on Link Discovery. 36–43 (ACM Press, 2005).
Mcauley, J. J. & Leskovec, J. Learning to discover social circles in ego networks. Adv. Neural Inf. Process. Syst.
Leskovec, J., Huttenlocher, D. & Kleinberg, J. Predicting positive and negative links in online social networks. In Proceedings of the 19th International Conference on World Wide Web. 641–650 (ACM Press, 2010).
Rocha, L. E., Liljeros, F. & Holme, P. Simulated epidemics in an empirical spatiotemporal network of 50,185 sexual contacts. PLoS Comput. Biol. 7, e1001109 (2011).
Article CAS PubMed PubMed Central ADS Google Scholar
Gleiser, P. & Danon, L. Community structure in Jazz. Adv. Complex Syst. 6, 565 (2003).
Article Google Scholar
Newman, M. E. J. Finding community structure in networks using the eigenvectors of matrices. Phys. Rev. E 74, 036104 (2006).
Article MathSciNet CAS ADS Google Scholar
Batageli, V. & Mrvar, A. Pajek Datasets. http://vlado.fmf.uni-lj.si/pub/networks/data/ (2007).
Guimerȧ, R., Danon, L., Díaz-Guilera, A., Giralt, F. & Arenas, A. Self-similar community structure in a network of human interactions. Phys. Rev. E 68, 065103 (2003).
Article ADS Google Scholar
Watts, D. J. & Strogatz, S. H. Collective dynamics of small-world networks. Nature 393, 440–442 (1998).
Article CAS MATH ADS Google Scholar
Spring, N., Mahajan, R., Wetherall, D. & Anderson, T. Measuring ISP topologies with rocketfuel. IEEE/ACM Trans. Netw. 12, 2–16 (2004).
Article Google Scholar
Newman, M. E. J. Assortative mixing in networks. Phys. Rev. Lett. 89, 208701 (2002).
Article CAS PubMed ADS Google Scholar
Hu, H. B. & Wang, X. F. Unified index to quantifying heterogeneity of complex networks. Physica A 387, 3769–3780 (2008).
Article ADS Google Scholar
Castellano, C. & Pastor-Satorras, R. Thresholds for epidemic spreading in networks. Phys. Rev. Lett. 105, 218701 (2010).
Article PubMed ADS Google Scholar
Hethcote, H. W. The mathematics of infectious diseases. SIAM Rev. 42, 599–653 (2009).
Article MathSciNet MATH ADS Google Scholar
Kendall, M. A new measure of rank correlation. Biometrika 30, 81–89 (1938).
Article MATH Google Scholar
Bae, J. & Kim, S. Identifying and ranking influential spreaders in complex networks by neighborhood coreness. Physica A 395, 549–559 (2014).
Article MathSciNet MATH ADS Google Scholar
Amaral, L. A. N., Scala, A., Barthelemy, M. & Stanley, H. E. Classes of small-world networks. PNAS 97, 11149–11152 (2000).
Article CAS PubMed PubMed Central ADS Google Scholar
Yan, G., Fu, Z. Q. & Chen, G. Epidemic threshold and phase transition in scale-free networks with asymmetric infection. Eur. Phys. J. B 65, 591–594 (2008).
Article CAS MATH ADS Google Scholar
Wang, W. et al. Asymmetrically interacting spreading dynamics on complex layered networks. Sci. Rep. 4, 5097 (2014).
Article CAS PubMed PubMed Central Google Scholar
Bellingeri, M., Bevacqua, D., Scotognella, F. & Cassi, D. The heterogeneity in link weights may decrease the robustness of real-world complex weighted networks. Sci. Rep. 9, 10692 (2019).
Article CAS PubMed PubMed Central ADS Google Scholar

Download references

Acknowledgements

The authors greatly appreciate the reviews’ suggestions and the editor’s encouragement.

Author information

Authors and Affiliations

Software College, Shenyang University of Technology of China, Shenyang, 110870, People’s Republic of China
Zhe Li
Software College, Northeastern University of China, Shenyang, 110819, People’s Republic of China
Xinyu Huang

Authors

Zhe Li
View author publications
You can also search for this author in PubMed Google Scholar
Xinyu Huang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Z.L. devised the research project. Z.L. performed the research. Z.L. and X.H. analyzed the data. Z.L. and X.H. wrote the paper.

Corresponding authors

Correspondence to Zhe Li or Xinyu Huang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Li, Z., Huang, X. Identifying influential spreaders in complex networks by an improved gravity model. Sci Rep 11, 22194 (2021). https://doi.org/10.1038/s41598-021-01218-1

Download citation

Received: 04 September 2021
Accepted: 25 October 2021
Published: 12 November 2021
DOI: https://doi.org/10.1038/s41598-021-01218-1
Springer Nature Limited

This article is cited by

Identifying influential nodes in complex networks using a gravity model based on the H-index method
- Siqi Zhu
- Jie Zhan
- Xing Li
Scientific Reports (2023)
Identifying important nodes in complex networks based on extended degree and E-shell hierarchy decomposition
- Jun Liu
- Jiming Zheng
Scientific Reports (2023)
Identifying vital nodes for influence maximization in attributed networks
- Ying Wang
- Yunan Zheng
- Yiguang Liu
Scientific Reports (2022)
Identifying influential spreaders by gravity model considering multi-characteristics of nodes
- Zhe Li
- Xinyu Huang
Scientific Reports (2022)

Identifying influential spreaders in complex networks by an improved gravity model

Abstract

Similar content being viewed by others

Enhancement of Gravity Centrality Measure Based on Local Clustering Method by Identifying Influential Nodes in Social Networks

Identifying influential nodes in complex networks using a gravity model based on the H-index method

Effectively identifying the influential spreaders in large-scale social networks

Introduction