Identify hidden spreaders of pandemic over contact tracing networks

Huang, Shuhong; Sun, Jiachen; Feng, Ling; Xie, Jiarong; Wang, Dashun; Hu, Yanqing

doi:10.1038/s41598-023-32542-3

Identify hidden spreaders of pandemic over contact tracing networks

Article
Open access
Published: 19 July 2023

Volume 13, article number 11621, (2023)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Identify hidden spreaders of pandemic over contact tracing networks

Download PDF

Shuhong Huang¹^na1^nAff7,
Jiachen Sun²^na1,
Ling Feng^3,4,
Jiarong Xie¹,
Dashun Wang⁵ &
…
Yanqing Hu⁶

1506 Accesses
1 Altmetric
Explore all metrics

Abstract

The COVID-19 infection cases have surged globally, causing devastations to both the society and economy. A key factor contributing to the sustained spreading is the presence of a large number of asymptomatic or hidden spreaders, who mix among the susceptible population without being detected or quarantined. Due to the continuous emergence of new virus variants, even if vaccines have been widely used, the detection of asymptomatic infected persons is still important in the epidemic control. Based on the unique characteristics of COVID-19 spreading dynamics, here we propose a theoretical framework capturing the transition probabilities among different infectious states in a network, and extend it to an efficient algorithm to identify asymptotic individuals. We find that using pure physical spreading equations, the hidden spreaders of COVID-19 can be identified with remarkable accuracy, even with incomplete information of the contract-tracing networks. Furthermore, our framework can be useful for other epidemic diseases that also feature asymptomatic spreading.

Network-based prediction of COVID-19 epidemic spreading in Italy

Article Open access 17 November 2020

Trading contact tracing efficiency for finding patient zero

Article Open access 30 December 2022

Modeling the Spread of COVID-19 Over Varied Contact Networks

As the COVID-19 pandemic continues to spread at rapid rates^1,2,3, and the development of effective pharmacological treatments is still uncertain according to WHO, non-pharmacological interventions like isolation of the infectious through quarantines^4,5 are the most effective and possibly the only means of containing the continued outbreaks, as it effectively reduces the person to person transmissions ⁶. Yet, unlike other infectious diseases like SARS and Ebola, COVID-19 is unique in that a large portion of its infected population is mild or asymptotic⁷. Even some of the asymptotic infections do not exhibit any clinic symptoms until self-recovery^8,9. Without being detected and subsequently quarantined, the asymptomatic population (i.e. hidden spreaders) sustains the ongoing spreading of the disease to the susceptible population unknowingly^10,11. This poses a major challenge in the effective mitigation of the pandemic spreading. Furthermore, empirical studies have shown that such asymptomatic infections accounts for a large proportion of the population^{12,13,14,15,16,17,18}, as much as up to 80%¹⁸. Currently, estimation of the asymptomatic cases is done through exhaustive screening of close contacts of the known infected cases in the contact tracing networks¹⁷. This untargeted method requires large amount of resources and is time consuming, that in turn leads to ineffective or delayed interventions to quarantine the asymptomatic cases. On the other hand, the combination of a mobile app-based contact tracing network¹⁹ and a statistical framework²⁰ shows the potential to accurately localize high-risk spreaders^21,22. Hence, a targeted screening in the contact tracing network is pertinent, such that asymptomatic individuals can be estimated with high precision for intervention and spreading mitigation.

Here we incorporate the empirical characteristics of the COVID-19 spreading dynamics into a Markovian process, i.e. vectors that represent the different infection stages and their associated transition probabilities. By embedding the transition process into a contact tracing network that includes the known infected nodes (individuals), we develop a method that predicts the infectious states of the rest of the network with high precision. By combining such predictions with the network structure, we then derive the spreading power of every node taking into account of both its infectious state and its specific location in the network, such that screening of the asymptomatic can be prioritised accordingly. The effectiveness of our method is validated by empirical data from two COVID-19 transmission networks in Singapore. Moreover, in the simulated COVID-19 transmission experiment of contact-tracing network, we find that a screening scheme designed by the proposed computational framework outperforms several machine-learning baselines designed in this work and the random screening of infection neighbors. The latter was widely used in early COVID-19 outbreaks in China. Furthermore, even in the realistic situation of incomplete information on the contact tracing network, with missing links or sub networks consisting of only contacts of the infected cases, our method retains high accuracy. Thus our method is highly effective in asymptomatic case estimation and can be implemented to any contact-tracing networks either constructed manually^23,24 or through technological means²⁵ such as Bluetooth^26,27, GPS²⁸ and digital check-in check-out technologies (e.g. health QR codes²⁹ widely used in China).

Given the spreading of the COVID-19 occurring over the contact network, the challenge is to identify asymptomatic nodes with the information of infected symptomatic individuals (nodes) that have been identified from a certain time T. We approach this by estimating the probability of each node being in the infected state as illustrated in Fig. 1. Specifically, we first construct the transition dynamical equations among different infection stages and states based on the empirically observation of COVID-19 disease progression. The set of transition equations is then combined with the contact network topology and data on the observed infection history to deduce the state of each node in the network.

As observed in many clinical studies, the hidden spreaders of COVID-19 fall into two different categories. One is the presymptomatic infections who are asymptomatic and infected, but will later develop clinical symptoms (e.g. fever, cough, dyspnea, etc.); The other type corresponds to the asymptomatic patients who carry the virus but have never exhibited any symptoms until recovery. As a result, an individual can have a total of 5 different states in the process of COVID-19 spreading (see Fig. 1a), namely, Susceptible (S), Presymptomatic (P), Asymptomatic (A), Symptomatic Infectious (I) and Recovered (R). Since the infectious duration in the states of P, I and A follows a specific probability distribution, here we further break down the P, I and A states into finer states representing the progression in each of the 3 states, i.e., the number of days passed since the beginning of the states. For better clarity, we denote t as the number of days of the COVID-19 evolution on the entire network and d as the number of days in a particular infected state for a particular individual. Since an individual i can be at any stage in the process, we can use $\textbf{Z}_i(t)$ to represent the state probabilities at time t:

$$\begin{aligned} \begin{aligned} \textbf{Z}_i(t) =&(S_i(t),P^1_i(t),...,P^d_i(t), ..., I^1_i(t),...,\\&I^d_i(t),...,A^1_i(t),...,A^d_i(t),...,R_i(t)) \end{aligned} \end{aligned}$$

(1)

where $S_i(t)$ and $R_i(t)$ is the probability that the individual i is susceptible and recovered at day t, respectively. $P^d_i(t), I^d_i(t)$ and $A^d_i(t)$ are the probabilities that i is in the state of P, I and A for d days at the time of t. Since all of asymptomatic, presymptomatic and symptomatic states are infected states, their total probability corresponds to that of a node is infectious, and we use $C_i(t)$ to represent it:

$$\begin{aligned} C_i(t) = P_i(t)+I_i(t)+A_i(t) \end{aligned}$$

(2)

where $P_i(t) = \sum _{d=1}^\infty P^d_i(t)$, $I_i(t) = \sum _{d=1}^\infty I^d_i(t)$, $A_i(t) = \sum _{d=1}^\infty A^d_i(t)$. Throughout this work we use $C_i(t)$ as a key indicator to infer whether an individual is infected .

From here, we can extract the probability transition dynamics among the 5 different states as follows. First, for a node who is in the susceptible state S at t, its next state at $t+1$ will be jointly determined by the state of its neighbors in the network at t. Specifically, the probability of a node i in S state remains in S on day $t+1$ (i.e., not infected by any of its infected neighbor on the next day) is:

$$\begin{aligned} S_i(t+1) = S_i(t)\cdot \prod _{j\in \partial i}(1-\mathcal {F}(t,j,\beta )) \end{aligned}$$

(3)

where $\partial {i}$ represents the set of neighbors (contacts) of i in the network, $\mathcal {F}(t,j,\beta )$ represents the probability that i is infected by j. Since Eq. (3) origins of belief propagation, the infection possibility is strictly accurate only if the network has a tree structure³⁰. However, Eq. (3) can still provide reasonable results on many networks with loops^31,32,33. Threfore, the infection of node i can only happen if j is in the infected state on day t (probability $C_j(t)$), and happens to transmit it to i (probability $\beta$). Then we have:

$$\begin{aligned} \mathcal {F}(t,j,\beta )=C_j(t)\cdot \beta \end{aligned}$$

(4)

Here $\beta$ can be estimated from the empirically observed disease reproduction number $R_0$ for COVID-19 and the average number of neighbors in the contact tracing network $\langle k \rangle$. Specificaly, $\beta = \frac{R_0}{\lambda \langle k \rangle }$, where $\lambda$ is the average time a susceptible person carries the virus, which can be expressed as $\lambda = p\cdot \mu _{A}+(1-p)\cdot \bigg (\exp \bigg (\mu _P+\frac{\sigma _P^2}{2}\bigg )+\mu _I\bigg )$, where p is the proportion of asymptomatic infected cases, $\mu _{A},\exp \bigg (\mu _P+\frac{\sigma _P^2}{2}\bigg ),\mu _{I}$ are the average time of the virus carried by infected individuals in A, P and I states^34,35 respectively.

Next, for an individual under S state at time t, the probability of becoming presymptomatic state P at $t+1$ is:

$$\begin{aligned} P_i^1(t+1) =S_i(t)\cdot (1-p)\cdot (1-\prod _{j\in \partial i}\bigg (1-\mathcal {F}(t,j,\beta )\bigg ) \end{aligned}$$

(5)

Accordingly, we can calculate the probability that the state of i become A at $t+1$ as:

$$\begin{aligned} A_i^1(t+1) =S_i(t)\cdot p\cdot \bigg (1-\prod _{j\in \partial i}(1-\mathcal {F}(t,j,\beta )\bigg ) \end{aligned}$$

(6)

In the third case where a node i is in the infected state (i.e. E, I or A, $d \ge 1$) on day t, the transition probabilities that they will stay in the same state on day $t+1$ are:

$$\begin{aligned} \begin{aligned}{}&P^{d+1}_i(t+1) = P^d_i(t)\cdot \bigg (\frac{1-F_P(d)}{1-F_P(d-1)}\bigg )\\&I^{d+1}_i(t+1) = I^d_i(t)\cdot \bigg (\frac{1-F_{I}(d)}{1-F_{I}(d-1)}\bigg )\\&A^{d+1}_i(t+1) = A^d_i(t)\cdot \bigg (\frac{1-F_{A}(d)}{1-F_{A}(d-1)}\bigg ) \end{aligned} \end{aligned}$$

(7)

where $F_P(d)=\int _{-\infty }^{d}{f_P(t)dt}$, $F_{I}(d)=\int _{-\infty }^{d}f_I(t)dt$, $F_{A}(d)$ $=\int _{-\infty }^{d}f_A(t)dt$ are the cumulative distribution functions of duration length d for P, I, A states, respectively. For mathematical convince, we simply set $F_P(0)=F_{I}(0)=F_{A}(0)=0$. The fourth case is that individual in the presymptomatic state P turns into the symptomatic infectious state I at the next day, and can be described with the following transition probability:

$$\begin{aligned} I^1_i = \sum _{d=1}^{\infty } P^d_i\cdot \frac{F_P(d)-F_P(d-1)}{1-F_P(d-1)} \end{aligned}$$

(8)

In the fifth case, an individual in the state I or the state A has a certain probability of being recovered i.e, turning into the R state on the next day. From the above equation, we obtain the probability that the individual i is in the state of R at the time $t+1$ is:

$$\begin{aligned} R_{i} (t + 1) = & R_{i} (t) + \sum\limits_{{d = 1}}^{\infty } {A_{i}^{d} } \cdot \frac{{F_{A} (d) - F_{A} (d - 1)}}{{1 - F_{A} (d - 1)}} \\ & + \sum\limits_{{d = 1}}^{\infty } {I_{i}^{d} } \cdot \frac{{F_{I} (d) - F_{I} (d - 1)}}{{1 - F_{I} (d - 1)}} \\ \end{aligned}$$

(9)

To validate our mathematical framework, we test it on a real contact-tracing network in the Infectious Stay Away exhibition³⁶ (ISA network, see Sect. SI 1 for data detailed description) with 410 individuals and average degree $\langle k \rangle$ of 13 (more experiments on another social network are illustrated in Sect. SI 5). We simulate the spreading with the empirically observed parameters on COVID-19 spreading mechanisms³⁴ (see Methods for the simulation details). All parameters used in simulation are listed in Table 1. From repeated simulations, we then obtain the probability of every possible state of a node, and compare this baseline with the theoretical results from Eqs. (3–9). Here we set the dimension of Z to 77 according to the empirical temporal distributions of the infected states^34,35 (see Methods for detail). From Fig. 2a and b, we can see that our theoretical result on the temporal evolutions of the disease in the whole network is well validated by the simulations. These show that our transition probability framework is accurate in producing the real spreading dynamics.

Now we extend the proposed transition probability equations to identify nodes with high risk of being asymptomatic, assuming the infection history on symptomatic nodes is already known. The underlying principle is to update every node’s state by incorporating the information of known infection into Eqs.(2–9) in the subsequent days, and then deduce the infection probability $C_i(T)$ for each node i in the network (see the details in the Methods). The nodes with higher $C_i(T)$ are identified as having high risk of being infected at day T. We test the effectiveness by applying it on two sets of real COVID-19 spreading data on the contact-tracing network in Singapore^23,24 (see Fig. 2c and d). The details of network is provided in Method and in Sect. SI 1. We find that the ranking our $C_i(T)$ values are highly correlated with the date of infection t of nodes (Fig. 2e and f), meaning nodes with higher infection probabilities indeed have higher risk of being infected in the real COVID-19 spreading data.

The Singapore empirical datasets have the constraint of merely including the symptomatic individuals’ identities in the network. Therefore, to further evaluate our method, we simulate a realistic COVID-19 spreading process on the ISA network for T days to obtain the detailed infection history of every node in the network, such that the exact infection history on the asymptomatic nodes can be obtained. Assuming only the symptomatic nodes with state I are observed, i.e. infection histories of these nodes are known, we use our above method to identify those infected individuals among the rest of the nodes. Specifically, we select the nodes with the highest $C_i(T)$ values as the mostly likely infected nodes. In practice, the historical information of patients may be inaccurate based on personal statement. The infection time also needs to be inferenced from the dynamic structure of the contact tracking network. Inaccurate historical information may affect the perfomance. To our best knowledge, there is few prior works for estimating asymptomatic nodes in the network. Therefore, we also design several screening baselines based on the popular graph neural networks methods including Node2Vec³⁷, graph convolutional network ³⁸ and graph attention networks³⁹ to further compare our results. Here the neural network models are utilized for an infected-uninfected binary classification task based on the network structure. A better perfomance can be achieved with more complex models. (Detailed methodologies for those methods in Sect. SI3).

The simulations results show that our transition probabilistic method (i.e. static screening) significantly outperforms the other methods in terms of the accuracy and recall on the local network where one can only observe the nearest neighbors of the known nodes in states I (see SI Fig. 1). Such advantage is still evident when we consider the alternative scenario that one can observe the full network structure^26,27 (see SI Fig. 3), and the intermediate scenario when only nearest and second nearest neighbors are known in the network (see SI Fig. 2). In a more realistic setting, the screening of the contact tracing network happens continuously in time. Here one can update the set of known infected nodes after every screening, and subsequently update the infected risk for the rest of the network from time to time. Therefore, we develop a dynamic screening method by updating the evaluation of $C_i(t)$ every time a new infected node is found through selective screening of the network (see the details in the Methods Section). This dynamic screening method outperforms (see Fig. 3a–d) other screening methods and even our previous static screening method (see Fig. 3d inset), implying that such dynamic screening method is highly effective in identifying infected nodes by screening less people.

Very often, the contact tracing network collected through either manual survey or digital tracking is at best incomplete, such that it is important to have a screening method that is still robust when there is missing information on the network structure. To test such robustness of our method, we randomly remove up to 80% of the edges in the ISA network, and test the accuracy of the method based on the remaining network (see the results on another network in SI Figs. 12–17). We find that the dynamic screening method on the various scenarios can still reliably identify the infected nodes in terms of accuracy and recall rate, as shown in Fig. 4 (see the robustness result on the static screening method in SI Fig. 16).

Lastly, we study the effectiveness of our method in containing the overall spread of COVID-19. In the widespread of COVID-19, limited resource on screening constrains the number of individuals the government can screen in a given day. Hence, targeted screening and mitigation can have significant impact on ‘flattening the curve’ of daily infected cases. To study such effect, we again simulate the COVID-19 spreading on the ISA network³⁶, and start screening/testing from day 10 using our method (i.e. ‘neighbor containment’). Each day 2% (4% in SI Figs. 18, 19) of the whole network are tested for the disease, and the positive ones are immediately quarantined, corresponding a transition to the state R (see details of the containment strategy in Sect. SI 4). As shown in Fig. 5a–h, our method is highly effective in suppressing the daily infection cases and total infection cases, outperforming both the baseline strategy of only quarantining the infected ones (labeled as ‘infection containment’) and the strategy randomly screening $2\%N$ among the neighbors of the known infections (labelled as ‘neighbor containment’), where N is the network size. In addition, we find that even with up to 80% missing links, our method is still robust enough to effectively suppress the spreading, close to that of knowing the full network structure. It shows that our method is expected to be highly effective in containing COVID-19 spread in practice.

In this paper, based on the transmission rule of COVID-19 and the underlying physical spreading equations, we for the first time studied the estimation of asymptomatic infections in the contact-tracing network, which is a current major concern in the prevention and containment of COVID-19 worldwide. We provided a complete computational framework of inferring latent infection on contact network. Based on this, we proposed a feasible method for optimal detection of latent infection in combination with nodal transmission ability in the network. We show that the COVID-19 transmission can be broken in a timely and efficient manner by the proposed method, which outperforms the direct contact screening, a typical method widely used in China. In addition, our simulation on a real contact network demonstrated that, this method is robust even with incomplete network information, demonstrating its effectiveness in practical scenarios.

With the theoretical model presenting in this paper, it is the first time to focus on the inference task on a network from a spreading perspective. However, there are two main realistic concerns of the proposed framework. On the one hand, The virus transmission constant $\beta$ may be varied for different virus mutant. As shown in the supplementary information, the model is robust among different transmission constant $\beta$. As the epidemic progress, updating the transmission constant dynamically could address this issue. On the other hand, the transmission ability has individual variances. For this complex situation, the proposed model can be improved to satisfy the practice. For example, the nodes on the network can be grouped based on their personal information such as age, past medical history, etc. For different individual groups, different transmission dynamics can be established.

Therefore, we believe that the theory and the corresponding methods in identifying COVID-19 hidden spreaders are of great practical significance. In principle, it provides policymakers and front-line workers in COVID-19 with important and effective guidance and tools that could be deployed swiftly to fight COVID-19, and save billions of people around the world who are still suffering as the epidemic continues to spread throughout the world.

Methods

Singapore COVID-19 datasets

The data was collected by the Singapore government^23,24, and contains comprehensive records on the dates of showing symptoms and confirming the disease, as well as their contact networks. We pick the infected nodes from the first two different time points, and set T to January 26, 2020 and February 19, 2020 for Singapore A and Singapore B, respectively. Then based on the known infection history of the nodes, our transition probability method estimates the infection probabilities $C_i(T)$ of every other node in the networks.

The dimension of the state vector

The dimension of the state vector $\textbf{Z}_i(t)$. The dimension of the state vector $\textbf{Z}_i(t)$ corresponds to the total number of sub states possible during the various disease progression paths, i.e., the number of days that an individual can be in each of the 3 different infected states. From the empirical temporal distributions of the infected states^34,35 (Table 1), we use 3 standard deviations⁴⁰ as cut off on the max number of days in states P, I, A, which are 20, 20, 35 days, yielding a dimension of 77 for Z. (S and R states have no sub states).

COVID-19 spreading simulation

COVID-19 spreading simulation. At the starting time $T=0$, we select the 3 nodes with the largest degree in the network as initial infected nodes, whose infected states are determined as either Asymptomatic or Presymptomatic according to the parameter p of Table 1. Then we apply the empirically observed parameters on COVID-19 spreading mechanisms³⁴ including reproductive number $R_0=3.50$ and asymptotic infection ratio $p=15\%$ on our equations to simulate the spreading. The set of values are listed in Table. 1(see Sect. SI 1 for the detail description of parameters and Sect. SI 3 for the discussion of the parameter sensitivity). Each simulation corresponds to one realization of the actual spreading based on the realistic dynamics, and the actual states of each node at every time step can be captured. More details of the simulation of COVID-19 are provided in Sect. SI 2.

Identifying infection probability

Identifying infection probability $C_i(T)$. The goal is to identify nodes with high risk of being asymptomatic with infection history on known symptomatic nodes, and we extend our transition probability equations to study this problem. At a certain time T, given the set I of infected individuals, the first day of infection $s_j$ and the day of recovery $r_j$ for each individual $j\in I$, we aim to develop a method from Eqs. (2–9) to deduce the infection probability $C_i(T)$ for each node i in the network. Note that the day of recovery can also be the day of death or quarantine. The initial condition at $t=0$ is that every node in the network is in susceptible state, i.e. $\textbf{Z}_i(0)=\{1,0,\cdots ,0\}$. The day of first infection in the network is set to 1, i.e. $\min _{j\in I}s_j=1$, and we update every node’s state in the subsequent days depending on whether their infection history is known at time T. For the known nodes $j\in I$, we artificially assign their infection states according to the known information, meaning that j is assigned state S when $t<s_j$, state R when $t>r_j$, and infectious state when $s_j<t<r_j$. For the other nodes, we evaluate their state vector $Z_i(t)$ at every time step t according to the transition probabilities in Eqs. (2–9), until the final day T, such that their probabilities $C_i(T)$ of being infected can be evaluated from $Z_i(T)$.

Dynamic screening method

Dynamic screening method Every time we screen only node k that is of highest risk according to the algorithm; if node k is COVID-19 positive, it is added to the known infected nodes set I, and its neighbors are added to the unknown set, and we repeat the transition probability calculations according to Eqs. (2–9) from time $0<t\le T$; if k is negative, its probability state vector is set to be $\textbf{Z}_k(t)=\{1,0,\cdots ,0\}$ in the calculation of Eqs. (2–9). Next, the revised estimations of infection probabilities for each unknown node from Eqs. (2–9) tells us which node is the most risky and to be tested.

Table 1 COVID-19’s clinical parameters and infectious characteristics used in this work.

Full size table

Data availability

All of network structures used in this paper and the code to simulate disease spreading can be found in https://github.com/Timbrer/HiddenSpreader.

References

Hui, D. S. et al. The continuing 2019-nCoV epidemic threat of novel coronaviruses to global health-the latest 2019 novel coronavirus outbreak in Wuhan. China Int. J. Infect. Dis. 91, 264 (2020).
Article CAS PubMed Google Scholar
Megan, S. How the pandemic might play out in 2021 and beyond. Nature( 2020).
World Health Organization et al. WHO Director-general’s opening remarks at the media briefing on COVID-19-11 March 2020“Who director-general’s opening remarks at the media briefing on Covid-19-11 march 2020”, (2020).
Hellewell, J. et al. Feasibility of controlling COVID-19 outbreaks by isolation of cases and contacts. Lancet Glob. Health 8(4), e488–e496 (2020).
Article PubMed PubMed Central Google Scholar
Maier, B. F. & Brockmann, D. Effective containment explains subexponential growth in recent confirmed COVID-19 cases in China. Science 368, 742 (2020).
Article ADS MathSciNet CAS PubMed PubMed Central MATH Google Scholar
Chan, J.F.-W. et al. A familial cluster of pneumonia associated with the 2019 novel coronavirus indicating person-to-person transmission: A study of a family cluster. Lancet 395, 514 (2020).
Article CAS PubMed PubMed Central Google Scholar
Gao, Z., Xu, Y., Sun, C., Wang, X., Guo, Y., Qiu, S. & Ma, K. A systematic review of asymptomatic infections with COVID-19 immunology and infection, J. Microbiol. (2020)
Kimball, A. et al. Asymptomatic and presymptomatic SARS-CoV 2 infections in residents of a long-term care skilled nursing facility-King County Washington. Morb. Mort. Wkly. Rep. 69, 377 (2020).
Article CAS Google Scholar
Long, Q.-X. et al. Clinical and immunological assessment of SARS-CoV-2 asymptomatic infections. Nat. Med. 26(8), 1200–1204 (2020).
Article CAS PubMed Google Scholar
Liu, Y.-C., Liao, C.-H., Chang, C.-F., Chou, C.-C. & Lin, Y.-R. A locally transmitted case of SARS-CoV-2 infection in TaiwanNew England. J. Med. 382, 1070 (2020).
Google Scholar
Rothe, C. et al. Transmission of 2019-nCoV infection from an asymptomatic contact in Germany. N. Eng. J. Med. 382, 970 (2020).
Article Google Scholar
Quilty, B. J. et al. Effectiveness of airport screening at detecting travellers infected with novel coronavirus (2019-nCoV). Eurosurveillance 25, 2000080 (2020).
Article PubMed PubMed Central Google Scholar
Byambasuren, O., Cardona, M., Bell, K., Clark, J., McLaws, M.-L. & Glasziou, P. Estimating the extent of true asymptomatic COVID-19 and its potential for community transmission: Systematic review and meta-analysis, Available at SSRN 3586675 ( 2020).
Nishiura, H. et al. Estimation of the asymptomatic ratio of novel coronavirus infections (Covid-19). Int. J. Infect. Dis. 94, 154 (2020).
Article CAS PubMed PubMed Central Google Scholar
Day, M. Covid-19 identifying and isolating asymptomatic people helped eliminate virus in Italian village. Br. Med. J.https://doi.org/10.1136/bmj.m1165 (2020).
Article Google Scholar
Yu, Y., Liu, Y.-R., Luo, F.-M., Tu, W.-W., Zhan, D.-C., Yu, G. & Zhou, Z.-H. medRxiv COVID-19 Asymptomatic Infection Estimation ( 2020)
Mizumoto, K., Kagaya, K., Zarebski, A. & Chowell, G. Estimating the asymptomatic proportion of coronavirus disease 2019 (COVID-19) cases on board the Diamond Princess cruise ship Yokohama, Japan, 2020. Eurosurveillance 25, 2000180 (2020).
Article PubMed PubMed Central Google Scholar
Heneghan, C., Brassey, J., Jefferson, T. Centre for evidence-based medicine COVID-19: What proportion are asymptomatic? ( 2020).
Schneider, T. et al. Epidemic management and control through risk-dependent individual contact interventions. PLOS Comput Biol. 18, e1010171 (2022).
Article CAS PubMed PubMed Central Google Scholar
Altarelli, F., Braunstein, A., Dall’Asta, L., Lage-Castellanos, A. & Zecchina, R. Bayesian inference of epidemics on networks via belief propagation. Phys. Rev. Lett. 112, 118701 (2014).
Article ADS PubMed Google Scholar
Pei, S., Liljeros, F. & Shaman, J. Identifying asymptomatic spreaders of antimicrobial-resistant pathogens in hospital settings. Proc. Natl. Acad. Sci. 118, e2111190118 (2021).
Article CAS PubMed PubMed Central Google Scholar
Baker, A. et al. Epidemic mitigation by statistical inference from contact tracing data. Proc. Natl. Acad. Sci. 118, e2106548118 (2021).
Article CAS PubMed PubMed Central Google Scholar
LINKS ESTABLISHED BETWEEN CHURCH CLUSTERS AND WUHAN TRAVELLERS“Links established between church clusters and Wuhan travellers”, ( 2020a),https://www.moh.gov.sg/news-highlights/details/links-established-between-church-clusters-and-wuhan-travellers,Lastaccessed4Apr2020.
“12 more cases discharged, 52 new cases of Covid-19 infection confirmed”, ( 2020b),https://www.moh.gov.sg/news-highlights/details/12-more-cases-discharged-52-new-cases-of-covid-19-infection-confirmed,Lastaccessed4Apr2020.
Kondylakis, H. et al. COVID-19 mobile apps: A systematic review of the literature. J. Med. Internet Res. 22(12), e23170 (2020).
Article PubMed PubMed Central Google Scholar
Drew, D. A. et al. Rapid implementation of mobile technology for real-time epidemiology of COVID-19. Science 368(6497), 1362–1367 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Ferretti, L. et al. Quantifying SARS-CoV-2 transmission suggests epidemic control with digital contact tracing. Science 368, eabb6936 (2020).
Article CAS PubMed PubMed Central Google Scholar
Chang, S. et al. Mobility network models of COVID-19 explain inequities and inform reopening. Nature 589(7840), 82–87 (2020).
Article ADS PubMed Google Scholar
Mozur, P., Zhong, R., Krolik, A. In coronavirus fight, China gives citizens a color code, with red flags, New York Times 1 ( 2020).
Mezard, M. & Montanari, A. Information, physics, and computation (Oxford University Press, Oxford, 2009).
Book MATH Google Scholar
Cantwell, G. T. & Newman, M. E. Message passing on networks with loops. Proc. Natl. Acad. Sci. 116, 23398 (2019).
Article ADS MathSciNet PubMed PubMed Central MATH Google Scholar
Frey, B.J., MacKay, D. A revolution: Belief propagation in graphs with cycles, Adv. Neural Inf. Process. Syst. 10 ( 1997).
Murphy, K., Weiss, Y., Jordan, M.I. Loopy belief propagation for approximate inference: An empirical study arXiv preprint arXiv:1301.6725 ( 2013).
Li, Q. et al. Early transmission dynamics in Wuhan, China, of novel coronavirus-infected pneumonia. N. Engl. J. Med. 382(1199), 1207 (2020).
Google Scholar
Zhou, F. et al. Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: A retrospective cohort study. Lancet 395(10229), 1054–1062 (2020).
Article CAS PubMed PubMed Central Google Scholar
Isella, L. et al. What’s in a crowd? Analysis of face-to-face behavioral networks. J. Theor. Biol. 271, 166 (2011).
Article ADS MathSciNet PubMed MATH Google Scholar
Grover, A., Leskovec, J. Node2vec: Scalable feature learning for networks, In booktitle Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining ( 2016) pp. 855–864.
Kipf, T.N., Welling, M. Semi-supervised classification with graph convolutional networks, In: booktitle 5th International Conference on Learning Representations (ICLR) ( 2016).
Veličković, P., Cucurull, G., Casanova, A., Romero, A., Lio, P. & Bengio, Y. Graph attention networks, In booktitle 6th International Conference on Learning Representations (ICLR) ( 2017).
Wheeler, D.J. & Chambers, D.S. Understanding statistical process control, USPC ( 1992).
DiCiccio, T. J. & Efron, B. Bootstrap confidence intervals. Stat. Sci. 11, 189–212 (1996).
Article MathSciNet MATH Google Scholar
Tang, B. et al. Estimation of the transmission risk of the 2019-nCoV and its implication for public health interventions. J. Clin. Med. 9, 462 (2020).
Article PubMed PubMed Central Google Scholar
Riou, J. & Althaus, C. L. Pattern of early human-to-human transmission of Wuhan 2019 novel coronavirus(2019-nCoV), December 2019 to January 2020. Eurosurveillance 25, 2000058 (2020).
Article PubMed PubMed Central Google Scholar
Zhao, S. et al. Preliminary estimation of the basic reproduction number of novel coronavirus (2019-nCoV) in china, from 2019 to 2020: A data-driven analysis in the early phase of theoutbreak. Int. J. Infect. Dis. 92, 214 (2020).
Article CAS PubMed PubMed Central Google Scholar
Liu, T., Hu, J., Kang, M., Lin, L., Zhong, H., Xiao, J., He, G., Song, T., Huang, Q. & Rong, Z. et al.Transmission dynamics of 2019 novel coronavirus (2019-nCoV) BioRxiv ( 2020b).
Imai, N., Dorigatti, I., Cori, A., Donnelly, C., Riley, C. & Ferguson, N. “Report 2: Estimating the potential total number of novel coronavirus cases in Wuhan city, China. 22 january 2020-imperial college London. who collaborating centre for infectious disease modelling. mrc centre for global infectious disease analysis, j-idea, imperial college london, uk”, ( 2020).
Cao, Z., Zhang, Q., Lu, X., Pfeiffer, D., Jia, Z., Song, H. & Zeng, D.D.Estimating the effective reproduction number of the 2019-nCoV in China, MedRxiv ( 2020).
Shen, M., Peng, Z., Xiao, Y. & Zhang, L. Modelling the epidemic trend of the 2019 novel coronavirus outbreak in China, BioRxiv ( 2020).
Wu, J. T., Leung, K. & Leung, G. M. Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan China: A modelling study. Lancet 395, 689 (2020).
Article CAS PubMed PubMed Central Google Scholar
Read, J.M., Bridgen, J.R., Cummings, D.A., Ho, A. & Jewell, C.P. Novel coronavirus 2019-nCoV: early estimation of epidemiological parameters and epidemic predictions MedRxiv ( 2020).
Majumder, M. & Mandl, K. D. Early transmissibility assessment of a novel coronavirus in Wuhan, China, China ( 2020).

Download references

Acknowledgments

This work is partially supported by National Natural Science Foundation of China under Grants No. 12275118, Natural Science Foundation of Guangdong for Distinguished Youth Scholar, Guangdong Provincial Department of Science and Technology (grant no. 2020B1515020052), Guangdong High-Level Personnel of Special Support Program, Young TopNotch Talents in Technological Innovation (grant no. 2019TQ05X138) and NUS AcRF Grant A-0004550-00-00.

Author information

Shuhong Huang
Present address: Institute of Neuroscience, Technical University of Munich, Munich, 80802, Germany
These authors have contributed equally: Shuhong Huang and Jiachen Sun.

Authors and Affiliations

School of Data and Computer Science, Sun Yat-sen University, Guangzhou, 510006, China
Shuhong Huang & Jiarong Xie
Tencent, Shenzhen, 518057, China
Jiachen Sun
Institute of High Performance Computing, Agency for Science, Technology and Research (A*STAR), Singapore, 138632, Singapore
Ling Feng
Department of Physics, National University of Singapore, Singapore, 117551, Singapore
Ling Feng
Kellogg School of Management, Northwestern University, Evanston, IL, USA
Dashun Wang
Department of Statistics and Data Science, College of Science, Southern University of Science and Technology, 518055, Shenzhen, China
Yanqing Hu

Authors

Shuhong Huang
View author publications
You can also search for this author in PubMed Google Scholar
Jiachen Sun
View author publications
You can also search for this author in PubMed Google Scholar
Ling Feng
View author publications
You can also search for this author in PubMed Google Scholar
Jiarong Xie
View author publications
You can also search for this author in PubMed Google Scholar
Dashun Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yanqing Hu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.H. conceived the project. S.H., Y.H., J.S., L.F., J.X., D.W. designed the experiments. S.H., performed the experiments and numerical modelling. Y.H., S.H., J.S., L.F., J.X., D.W., wrote the manuscript.

Corresponding author

Correspondence to Yanqing Hu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Huang, S., Sun, J., Feng, L. et al. Identify hidden spreaders of pandemic over contact tracing networks. Sci Rep 13, 11621 (2023). https://doi.org/10.1038/s41598-023-32542-3

Download citation

Received: 11 October 2022
Accepted: 29 March 2023
Published: 19 July 2023
DOI: https://doi.org/10.1038/s41598-023-32542-3
Springer Nature Limited

Identify hidden spreaders of pandemic over contact tracing networks

Abstract

Similar content being viewed by others

Network-based prediction of COVID-19 epidemic spreading in Italy

Trading contact tracing efficiency for finding patient zero

Modeling the Spread of COVID-19 Over Varied Contact Networks