A novel indicator in epidemic monitoring through a case study of Ebola in West Africa (2014–2016)

Kwak, Minkyu; Sun, Xiuxiu; Wi, Yunju; Nah, Kyeongah; Kim, Yongkuk; Jin, Hongsung

doi:10.1038/s41598-024-62719-3

Scientific Reports, volume 14, article number 12147 (2024)

Open access article, published 27 May 2024


Abstract

The E/S (exposed/susceptible) ratio is analyzed in the SEIR model. The ratio plays a key role in understanding epidemic dynamics during the 2014–2016 Ebola outbreak in Sierra Leone and Guinea. The maximum value of the ratio occurs immediately before or after the time-dependent reproduction number (R_t) equals 1, depending on the initial susceptible population (S(0)). It is demonstrated that transmission rate curves corresponding to various incubation periods intersect at a single point referred to as the Cross Point (CP). At this point, the E/S ratio reaches an extremum, signifying a critical shift in transmission dynamics and aligning with the time when R_t approaches 1. By plotting transmission rate curves, β(t), for any two arbitrary incubation periods and tracking their intersections, we can trace CP over time. CP serves as an indicator of epidemic status, especially when R_t is close to 1. It provides a practical means of monitoring epidemics without prior knowledge of the incubation period. Through a case study, we estimate the transmission rate and reproduction number, identifying CP and R_t = 1 while examining the E/S ratio across various values of S(0).

Introduction

The time-dependent reproduction number serves as a measure indicating the effectiveness of disease control^1,2,3,4,5,6. Estimating the time-dependent reproduction number, ${\text{R}}_{t}$, depends on how the model structure is defined, which epidemiological features it should include, and what kind of data are available^7,8. A challenge in assessing the time-dependent reproduction number with SIR (susceptible–infectious–recovered)-type models is the estimation of the transmission rate, especially when it varies over the course of the epidemic due to the implementation of control policies^3,4,8. Some approaches assume an explicit functional form for the transmission rate on each of several time intervals, which are determined by the timing of control policies and the epidemic situation⁹. Pollicott et al.¹⁰ estimated the time-dependent transmission rate (β(t)) of an SIR model by solving a linear ordinary differential equation (ODE) for β(t) when the removal rate (γ) is given. Applying this method, Wang et al.¹¹ developed an approach that uses a machine learning algorithm to estimate the transmission rate from time-dependent variables reflecting the intensity of non-pharmaceutical policies. Nadler et al.¹² incorporated these types of time-dependent data through variational data assimilation. Grimm et al.¹³ partitioned the time interval into several shorter intervals, on each of which they assumed a constant transmission rate. They estimated these rates using a machine learning approach involving physics-informed neural networks.

In this study, we introduce a point in the epidemic curve called CP, which has the potential to serve as an indicator signifying that the disease is nearing control. This point can be identified without prior knowledge of the incubation period, making it a useful measure of the epidemic status when the time-dependent reproduction number is nearing one. The CP may occur before or after ${\text{R}}_{t}=1$, but it indicates proximity to this critical value. To obtain CP, the parameters of the SEIR system are estimated by solving the inverse problem^{10,14,15,16,17,18}. Then, the system is rearranged to construct the time-dependent transmission rate and the time-dependent reproduction number. We prove that for any incubation period, the transmission rate curves pass through CP, where $\frac{{d}}{{{dt}}}\left( {E/S} \right)=0$. The intersection point of the time-dependent transmission rate curves for two arbitrary incubation periods is therefore CP. We also investigate the role of the $E/S$ ratio in determining which of CP and ${\text{R}}_{t}=1$ appears first. As S(0) increases, the extreme value of the $E/S$ ratio converges to a constant and CP appears before ${\text{R}}_{t}=1$.

To simulate the process, we utilize data from the Ebola outbreak in Sierra Leone and Guinea. This outbreak began in Guinea in December 2013 and later spread to other West African countries, including Liberia and Sierra Leone, resulting in nearly 30,000 infections from 2014 to 2016^19,20,21,22. In this case study, we estimate the dates of occurrence for CP and ${\text{R}}_{t}=1$ and calculate the time difference between them. The time-dependent reproduction number reaches one a few days after CP, and its value is still greater than one at the time of CP. Therefore, CP holds the potential to be used as a precautionary indicator signifying that the disease is nearing control.

Materials and methods

A population ${N}$ is partitioned into compartments labeled S, E, I, and R, representing susceptible, exposed, infectious, and removed individuals, respectively. The model includes four parameters: time-dependent transmission rate $\left(\beta \left(t\right)\right)$, rate of progression from exposure to infection $\left(\sigma \right)$, removal rate ($\gamma$), and case fatality rate ($f$)²³. The SEIR model analyzed in this paper rests on several assumptions. Firstly, it assumes that the size of the total population remains constant: this assumption corresponds to the net input to the susceptible by births being equal to the net mortality²⁴. This simplification is often adopted to focus solely on the dynamics of disease transmission without considering demographic changes. Secondly, the population is assumed to be homogeneous, implying that individuals within each compartment are considered identical in terms of susceptibility, exposure, infectiousness, and recovery²⁵. Thirdly, the population is assumed to be well-mixed, with individuals having an equal chance of coming into contact with any other individual^25,26,27. This assumption facilitates modeling the spread of the disease in a population where interactions are random and frequent. Fourthly, once individuals recover from the infectious stage, they gain immunity and cannot be infected again, at least for some period²⁸.

The transmission dynamics are described as follows:

$$\left\{ {\begin{array}{*{20}l} {\frac{{dS}}{{dt}} = - {\frac{{\beta \left( t \right)SI}}{N}}} \hfill \\ {\frac{{dE}}{{dt}} = {\frac{{\beta \left( t \right)SI}}{N}} - \sigma E} \hfill \\ {\frac{{dI}}{{dt}} = \sigma E - \gamma I} \hfill \\ {\frac{{dR}}{{dt}} = \left( {1 - f} \right)\gamma I} \hfill \\ {N = S + E + I + R} \hfill \\ \end{array} } \right.$$

(1)
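As an illustration of Eq. (1), the following is a minimal forward-simulation sketch, not the authors' code. The decaying form of `beta(t)`, the population size, the initial state, and the step size are assumptions chosen only for illustration; the values 11.4 days, 10.46 days and 65.86% are the incubation period, mean infectious time and fatality quoted elsewhere in this article. Deaths are tracked with $dD/dt=f\gamma I$ so that the total $S+E+I+R+D$ is conserved.

```python
import math

def beta(t):
    """Hypothetical transmission rate (per day) that decays as controls tighten."""
    return 0.35 * math.exp(-0.01 * t)

def seir_rhs(t, y, N, sigma, gamma, f):
    """Right-hand side of Eq. (1), with deaths D tracked via dD/dt = f*gamma*I."""
    S, E, I, R, D = y
    new_exposed = beta(t) * S * I / N
    return [
        -new_exposed,
        new_exposed - sigma * E,
        sigma * E - gamma * I,
        (1.0 - f) * gamma * I,
        f * gamma * I,
    ]

def rk4_step(t, y, h, *params):
    """One classical fourth-order Runge-Kutta step of size h."""
    k1 = seir_rhs(t, y, *params)
    k2 = seir_rhs(t + h / 2, [yi + h / 2 * k for yi, k in zip(y, k1)], *params)
    k3 = seir_rhs(t + h / 2, [yi + h / 2 * k for yi, k in zip(y, k2)], *params)
    k4 = seir_rhs(t + h, [yi + h * k for yi, k in zip(y, k3)], *params)
    return [yi + h / 6 * (a + 2 * b + 2 * c + d)
            for yi, a, b, c, d in zip(y, k1, k2, k3, k4)]

def simulate(days, N=1_000_000.0, sigma=1 / 11.4, gamma=1 / 10.46, f=0.6586, h=0.1):
    """Integrate the system from a small hypothetical seed of exposed and infectious."""
    y = [N - 10.0, 5.0, 5.0, 0.0, 0.0]  # S, E, I, R, D at t = 0 (assumed)
    t = 0.0
    trajectory = [(t, list(y))]
    for _ in range(int(round(days / h))):
        y = rk4_step(t, y, h, N, sigma, gamma, f)
        t += h
        trajectory.append((t, list(y)))
    return trajectory

trajectory = simulate(300)
t_end, (S, E, I, R, D) = trajectory[-1]
print(f"day {t_end:.0f}: S={S:.0f} E={E:.0f} I={I:.0f} R={R:.0f} D={D:.0f}")
```

The inverse problem studied in this article runs in the opposite direction: it starts from observed cumulative cases and deaths and recovers $\beta(t)$.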

The point of $\frac{{d}}{{{dt}}}\left( {E/S} \right)=0$ and incubation periods

For various incubation periods, the transmission rate curves pass through a single point, as depicted in Fig. 3a,c. This point of intersection is denoted as CP. It is proven that the point where $\frac{{d}}{{{dt}}}\left( {E/S} \right)=0$ coincides with the cross point. CP is independent of the incubation period $(1/\sigma )$, and it can be easily estimated by plotting two transmission rate curves for any two incubation periods.

Theorem 1

The transmission rate curves $\beta \left(t\right)$ obtained for different values of $\upsigma$ share a single common point, located where $\frac{{d}}{{{dt}}}\left( {E/S} \right)=0$. At this point, the value of $\beta \left(t\right)$ is independent of $\upsigma$²⁹.

Proof

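Let ${f}_{1}(t)\equiv \sigma E$ and ${f}_{2}\left(t\right)\equiv S+E$. In the inverse problem, neither function depends on $\sigma$: ${f}_{1}$ equals the incidence $dC/dt$ obtained from the case data (Eq. 6), and adding the first two equations of Eq. (1) gives ${{f}_{2}}^{\prime}=-{f}_{1}$, so ${f}_{2}$ is fixed by ${f}_{1}$ and its initial value.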

Then,

$$S=-{\frac{{f}_{1}(t)}{\sigma }}+{f}_{2}\left(t\right)$$

From Eq. (1)

$${\frac{dS}{dt}}=-{\frac{\beta \left(t\right)SI}{N}}$$

$${\frac{dE}{dt}}={\frac{\beta \left(t\right)SI}{N}}-\sigma E$$

one has

$$\beta \left( t \right) = - \frac{{\frac{dS}{{dt}}}}{\frac{SI}{N}} = \frac{{\frac{{f_{1}}^{\prime } (t)}{\sigma } - {f_{2}}^{\prime } (t)}}{{ - \frac{{f_{1} (t)}}{\sigma } + f_{2} \left( t \right)}}\frac{N}{I} = \frac{{f_{1}}^{\prime } (t) - \sigma {f_{2}}^{\prime } (t)}{{ - f_{1} \left( t \right) + \sigma f_{2} \left( t \right)}}\frac{N}{I}$$

and

$$\frac{\partial \beta }{\partial \sigma }=\frac{{f}_{1}\left(t\right){{f}_{2}}^{\prime}(t)-{f}_{2}(t){{f}_{1}}^{\prime}(t)}{{(-{f}_{1}\left(t\right)+\sigma {f}_{2}\left(t\right))}^{2}}\frac{N}{I}=0$$

Thus, $\beta \left(t\right)$ is independent of $\upsigma$ when ${f}_{1}\left(t\right){{f}_{2}}^{\prime}(t)={f}_{2}(t){{f}_{1}}^{\prime}(t)$, that is, when $E{S}^{\prime}={E}^{\prime}S$, or equivalently $\frac{{d}}{{{dt}}}\left( {E/S} \right)=0.$
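As a worked step that is not part of the original proof, the common value of $\beta$ at CP can be written out explicitly. Adding the first two equations of Eq. (1) gives ${f_{2}}^{\prime}=-f_{1}$, and at CP the condition $f_{1}{f_{2}}^{\prime}=f_{2}{f_{1}}^{\prime}$ gives ${f_{1}}^{\prime}=f_{1}{f_{2}}^{\prime}/f_{2}$. Substituting into the expression for $\beta(t)$:

$$\beta_{\text{CP}}=\frac{\frac{f_{1}{f_{2}}^{\prime}}{f_{2}}-\sigma {f_{2}}^{\prime}}{-f_{1}+\sigma f_{2}}\frac{N}{I}=\frac{-\frac{{f_{2}}^{\prime}}{f_{2}}\left(-f_{1}+\sigma f_{2}\right)}{-f_{1}+\sigma f_{2}}\frac{N}{I}=-\frac{{f_{2}}^{\prime}}{f_{2}}\frac{N}{I}=\frac{f_{1}}{f_{2}}\frac{N}{I}=\frac{\sigma E}{S+E}\frac{N}{I}$$

Here $\sigma E$ stands for the incidence $dC/dt$ of Eq. (6), which is taken from the data, so no dependence on $\sigma$ remains. Multiplying by $S/(\gamma N)$ reproduces the value of ${\text{R}}_{t}$ at CP given in Eq. (5).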

The value of ${\text{R}}_{t}$ at CP

To compare the times of $\frac{{d}}{{{dt}}}\left( {E/S} \right)=0$ and ${\text{R}}_{t}=1$, the value of ${\text{R}}_{t}$ is investigated when $\frac{{d}}{{{dt}}}\left( {E/S} \right)=0$. The time-dependent reproduction number can be expressed as²³,

$$\begin{array}{c}{\text{R}}_{t}= \frac{\beta \left(t\right)S}{\gamma N}\end{array}$$

(2)

By rearranging the second equation in Eq. (1), the transmission rate can be written as:

$$\begin{array}{c}\beta \left(t\right)=\frac{N}{SI}\left(\frac{dE}{dt}+\sigma E\right)\end{array}$$

(3)

Then, ${\text{R}}_{t}$ can be written as:

$$\begin{array}{c}{\text{R}}_{t}= \frac{SE^{\prime}-ES^{\prime}}{\upgamma I\left(S+E\right)}+\frac{\sigma E}{\upgamma I }\frac{1}{\left(1+E/S\right)}\end{array}$$

(4)
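The step from Eqs. (2) and (3) to Eq. (4) can be spelled out as follows; this is a verification sketch rather than text from the original derivation. Substituting Eq. (3) into Eq. (2) gives ${\text{R}}_{t}=\left(E^{\prime}+\sigma E\right)/\left(\gamma I\right)$, and the first two equations of Eq. (1) give $S^{\prime}=-\left(E^{\prime}+\sigma E\right)$. Hence

$$SE^{\prime}-ES^{\prime}=SE^{\prime}+E\left(E^{\prime}+\sigma E\right)=\left(S+E\right)E^{\prime}+\sigma E^{2}$$

and therefore

$$\frac{SE^{\prime}-ES^{\prime}}{\gamma I\left(S+E\right)}+\frac{\sigma E}{\gamma I}\frac{S}{S+E}=\frac{E^{\prime}}{\gamma I}+\frac{\sigma E^{2}+\sigma ES}{\gamma I\left(S+E\right)}=\frac{E^{\prime}+\sigma E}{\gamma I}={\text{R}}_{t}$$

where $\frac{S}{S+E}=\frac{1}{1+E/S}$.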

Since $S{E}^{\prime}-E{S}^{\prime}=0$ at the point where $\frac{{d}}{{{dt}}}\left( {E/S} \right)=0$, the time-dependent reproduction number at CP can be written as:

$$\begin{array}{c}{\text{R}}_{t}= \frac{\sigma E}{\upgamma I }\frac{1}{\left(1+E/S\right)}\end{array}$$

(5)

The value of ${\text{R}}_{t}$ at CP can be used to estimate which event occurs first: CP or ${\text{R}}_{t}=1$.

There are 3 cases of ${\text{R}}_{t}$ values at CP:

(i) If $\frac{\sigma E}{\upgamma I }\frac{1}{\left(1+E/S\right)}>1$, then ${\text{R}}_{t}>1$ and CP appears earlier than ${\text{R}}_{t}=1$.
(ii) If $\frac{\sigma E}{\upgamma I }\frac{1}{\left(1+E/S\right)}=1$, then CP and ${\text{R}}_{t}=1$ appear at the same time.
(iii) If $\frac{\sigma E}{\upgamma I }\frac{1}{\left(1+E/S\right)}<1$, then ${\text{R}}_{t}<1$ and CP appears later than ${\text{R}}_{t}=1$.

In the SEIR model, $\sigma E$ represents the number of newly infected people entering compartment ${I}$ per unit time, and $\upgamma I$ represents the number of infected people leaving compartment ${I}$ per unit time. The ratio $\frac{\sigma E}{\upgamma I }$ is determined entirely by the net change in compartment ${I}$: it is greater than one whenever $dI/dt>0$, as indicated by Eq. (1). The ratio $E/S$ also plays a crucial role in determining the value of ${\text{R}}_{t}$ at $\frac{{d}}{{{dt}}}\left( {E/S} \right)=0.$ At this point, $E/S$ reaches a maximum while $\frac{1}{\left(1+E/S\right)}$ attains a minimum value. When S(0) is sufficiently large so that ${S}\gg {E}$, CP always appears earlier than ${\text{R}}_{t}=1$. Therefore, CP can be a precautionary indicator that ${\text{R}}_{t}=1$ is imminent. Conversely, when the initial susceptible population $S(0)$ is very small, the effect of $E/S$ cannot be ignored in Eq. (5), and CP can appear after ${\text{R}}_{t}=1$. Although the order in which CP and ${\text{R}}_{t}=1$ appear depends on the value of $E/S$, the two times are very close in the usual case where both fall between $dE/dt=0$ and $dI/dt=0$, where $\frac{\sigma E}{\upgamma I }\approx 1$ and $\frac{1}{\left(1+E/S\right)}\approx 1$. Therefore, CP can be an alternative indicator to ${\text{R}}_{t}=1$, suggesting that the epidemic is almost under control when $\frac{{d}}{{{dt}}}\left( {E/S} \right)=0$.
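The three cases can be expressed as a small helper. This is an illustrative sketch: the function names and the numerical inputs are hypothetical and are not taken from Table 2.

```python
def rt_at_cp(sigma_e, gamma_i, e_over_s):
    """Eq. (5): value of R_t at the point where d(E/S)/dt = 0.

    sigma_e  -- inflow into compartment I (sigma * E)
    gamma_i  -- outflow from compartment I (gamma * I)
    e_over_s -- ratio E/S at that point
    """
    return (sigma_e / gamma_i) / (1.0 + e_over_s)

def cp_order(rt_cp, tol=1e-9):
    """Classify which event comes first, following cases (i)-(iii)."""
    if rt_cp > 1.0 + tol:
        return "CP before R_t = 1"
    if rt_cp < 1.0 - tol:
        return "CP after R_t = 1"
    return "CP and R_t = 1 coincide"

# Hypothetical numbers: I is still growing (inflow 105/day, outflow 100/day).
large_pool = rt_at_cp(105.0, 100.0, e_over_s=0.001)  # S much larger than E
small_pool = rt_at_cp(105.0, 100.0, e_over_s=0.1)    # E is 10% of S
print(cp_order(large_pool), "|", cp_order(small_pool))
```

With the same inflow and outflow, a larger $E/S$ alone is enough to push the value below one, which is the small-$S(0)$ situation described above.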

Case study

In this study, we utilized the data of cumulative cases and deaths from the Ebola outbreak in Guinea and Sierra Leone, sourced from the World Health Organization³⁰. The dataset covered a period of 2 years: for Guinea, we included 262 data points reported from March 25, 2014, to March 26, 2016, and for Sierra Leone, 196 data points reported from May 27, 2014, to October 30, 2015.

Cumulative case and death data

The time-dependent transmission rate and reproduction number in the SEIR model are estimated using only two regression functions, one for the cumulative cases and one for the cumulative deaths. Two equations, for $\frac{dC}{dt}$ and $\frac{dD}{dt}$ (Eq. 6), are added to the SEIR system. Here, C represents cumulative cases and D represents disease-induced deaths. The inclusion of $C$ and $D$ in the SEIR system does not alter its dynamics; it simply tallies the flow out of the exposed compartment and the disease-induced deaths. Data fitting is conducted using the logistic equation as the base function²¹.

$$\begin{array}{c}\left\{\begin{array}{c}\frac{dC}{dt}=\sigma E\\ \frac{dD}{dt}=f\gamma I\end{array}\right.\end{array}$$

(6)

Procedure to construct the transmission rate

We obtain regression functions by curve fitting the cumulative data of cases and deaths. Then, the values of $\gamma$ and $f$ are calculated using the linear least squares method in Eqs. (8) and (9). The variables ${I}$, ${R}$, ${E}$ and ${S}$ are then determined, and the time-dependent transmission rate ($\beta \left(t\right)$) is constructed for various incubation periods, after which the time-dependent reproduction number $\left({\text{R}}_{t}\right)$ is obtained. Figure 1 summarizes the overall procedure for obtaining $\beta \left(t\right)$.

$C$ and $D$ represent curve-fitted data of the cumulative cases and deaths. By solving the inverse problem consisting of Eqs. (1) and (6), we estimate $\gamma$, $f$, ${I}$, ${R}$, ${E}$ and ${S}$, and then $\beta \left(t\right)$.

Curve fitting of $C$ and $D$

To estimate the values of parameters for the transmission dynamics Eq. (1), we fit the model to the cumulative data of cases $\left(C\right)$ and deaths $\left(D\right)$. Coefficients a, b, c and d are obtained from fitting the solutions of Eq. (7). We used a logistic function for regression with the Levenberg–Marquardt method^31,32 in MATLAB³³, as it is convenient and suitable for describing the proposed method and approximates very well. For Guinea, the R-squared values are 0.9999 for cases and 0.9999 for deaths in Fig. 2a. For Sierra Leone, the R-squared values are 0.9478 for cases and 0.9998 for deaths, as shown in Fig. 2b.

$$\begin{array}{c}\left\{\begin{array}{c}\frac{dC}{dt}=aC\left(1-bC\right)\\ \frac{dD}{dt}=cD\left(1-dD\right)\end{array}\right.\end{array}$$

(7)
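The logistic equations in Eq. (7) have a closed-form solution, which makes the coefficients easy to recover from noise-free data. The sketch below does not use the Levenberg–Marquardt fit described above. It substitutes a simpler linearization: since $\frac{1}{C}\frac{dC}{dt}=a-abC$, an ordinary linear regression of the per-capita growth rate on $C$ yields $a$ and $b$. The function names and the synthetic coefficients are assumptions for illustration, and real, noisy surveillance data would call for the nonlinear fit.

```python
import math

def logistic(t, a, b, c0):
    """Closed-form solution of dC/dt = a*C*(1 - b*C) with C(0) = c0."""
    return 1.0 / (b + (1.0 / c0 - b) * math.exp(-a * t))

def fit_logistic_linearized(times, values):
    """Regress (dC/dt)/C on C; the intercept estimates a, the slope estimates -a*b."""
    xs, ys = [], []
    for i in range(1, len(values) - 1):
        # Central difference for dC/dt at the interior points.
        slope_i = (values[i + 1] - values[i - 1]) / (times[i + 1] - times[i - 1])
        xs.append(values[i])
        ys.append(slope_i / values[i])
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    a = mean_y - slope * mean_x
    b = -slope / a
    return a, b

# Synthetic, noise-free cumulative cases sampled every two days (assumed values).
times = [float(day) for day in range(0, 400, 2)]
true_a, true_b = 0.05, 1.0 / 3000.0
values = [logistic(t, true_a, true_b, 10.0) for t in times]

a_hat, b_hat = fit_logistic_linearized(times, values)
print(f"a = {a_hat:.4f} (true {true_a}), 1/b = {1.0 / b_hat:.0f} (true {1.0 / true_b:.0f})")
```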

Estimation of removal rate $\left(\gamma \right)$ and fatality $\left(f\right)$

We assume that the total population size ${N}={S}+{E}+{I}+{R}+{D}$ in each country is constant and that the initial value of ${R}(0)=0.$

From Eqs. (1) and (6), we have

$$\left\{ \begin{array}{c}\frac{1}{f\gamma }\frac{dD}{dt}=I \\ \frac{1}{f\gamma }\frac{{d}^{2}D}{d{t}^{2}}=\frac{dI}{dt}=\sigma E-\gamma I=\frac{dC}{dt}-\frac{1}{f}\frac{dD}{dt}\\ \frac{{d}^{2}D}{d{t}^{2}}= f\gamma \frac{dC}{dt}-\gamma \frac{dD}{dt}\end{array}\right.$$

(8)

It is rewritten as

$$\begin{array}{c}\left(\begin{array}{cc}\frac{dC}{dt}& -\frac{dD}{dt}\end{array}\right)\left(\begin{array}{c}f\gamma \\ \gamma \end{array}\right)=\frac{{d}^{2}D}{d{t}^{2}},\end{array}$$

(9)

The parameters $f$ and $\upgamma$ are estimated using the linear least squares method, or equivalently the pseudoinverse.
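A minimal sketch of this least squares step is given below, assuming the derivatives are sampled on a daily grid. Stacking Eq. (9) over all sample times gives an overdetermined system in the two unknowns $f\gamma$ and $\gamma$, solved here through the 2×2 normal equations. The data are synthetic: the deaths curve is a logistic with assumed coefficients, and $dC/dt$ is constructed so that Eq. (8) holds exactly for $f=0.6586$ and $1/\gamma=10.46$ days (the Guinea values quoted in this article), so the example only checks that the solver recovers them.

```python
import math

def logistic_derivs(t, c, d, d0):
    """D, dD/dt and d2D/dt2 for the logistic dD/dt = c*D*(1 - d*D), D(0) = d0."""
    D = 1.0 / (d + (1.0 / d0 - d) * math.exp(-c * t))
    dD = c * D * (1.0 - d * D)
    d2D = c * dD * (1.0 - 2.0 * d * D)
    return D, dD, d2D

def estimate_f_gamma(dC, dD, d2D):
    """Least squares solution of [dC, -dD] (f*gamma, gamma)^T = d2D, as in Eq. (9)."""
    a11 = sum(x * x for x in dC)
    a12 = -sum(x * y for x, y in zip(dC, dD))
    a22 = sum(y * y for y in dD)
    b1 = sum(x * z for x, z in zip(dC, d2D))
    b2 = -sum(y * z for y, z in zip(dD, d2D))
    det = a11 * a22 - a12 * a12
    f_gamma = (b1 * a22 - a12 * b2) / det   # Cramer's rule, first unknown
    gamma = (a11 * b2 - a12 * b1) / det     # Cramer's rule, second unknown
    return f_gamma / gamma, gamma

# Synthetic inputs (assumed): daily samples of the fitted deaths curve.
true_f, true_gamma = 0.6586, 1.0 / 10.46
samples = [logistic_derivs(float(day), 0.05, 1.0 / 2000.0, 5.0) for day in range(400)]
dD = [s[1] for s in samples]
d2D = [s[2] for s in samples]
# Third line of Eq. (8) solved for dC/dt, so the system is exactly consistent.
dC = [(second + true_gamma * first) / (true_f * true_gamma)
      for first, second in zip(dD, d2D)]

f_hat, gamma_hat = estimate_f_gamma(dC, dD, d2D)
print(f"f = {f_hat:.4f}, mean infectious time = {1.0 / gamma_hat:.2f} days")
```

With independently fitted curves for $C$ and $D$, Eq. (8) holds only approximately, and the same solver returns the best-fitting pair in the least squares sense.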

For the simulation, we choose data sets running from the beginning of the Ebola outbreak to various later days³⁴. The mean infectious time for Guinea is listed in Table 1: it is 10.46 days using data points 1–200 and 9.90 days using data points 1–240, corresponding to October 16, 2015 and December 18, 2015, respectively. The simulation uses 10.46 days as the mean infectious time. For Sierra Leone, we take the mean infectious time as 10.14 days, estimated from the data up to August 7, 2015. The fatality rate is 65.86% in Guinea and 30.50% in Sierra Leone, and there is no significant change in either country from beginning to end.

Table 1 Mean infectious times and fatality for various sets of data.

Construction of $\beta \left(t\right)$

The values of S, E, I and R are determined according to Eqs. (10)–(15). The parameters $f$ and $\upgamma$ are estimated using the linear least squares method, as detailed in Table 1.

From Eq. (6),

$$\begin{array}{c}I(t)=\frac{1}{f\gamma }\frac{dD}{dt}\end{array}$$

(10)

Integrating $\frac{dR}{dt}$ in Eq. (1), we obtain

$$\begin{array}{c}R\left(t\right)=\left(1-f\right)\gamma\int_0^{t} {I }(t)dt,\end{array}$$

(11)

E(t) is calculated directly from Eq. (6)

$$\begin{array}{c}E(t)=\frac{1}{\sigma }\frac{dC}{dt}\end{array}$$

(12)

when $\upsigma$ is given.

Defining

$$\upphi \left(t\right)\stackrel{\scriptscriptstyle\text{def}}{=}\frac{\beta \left(t\right){SI}}{{N}}$$

we have

$$\begin{array}{c}\upphi \left(t\right)=\frac{dE}{dt}+\sigma E\end{array}$$

(13)

Integrating $\frac{dS}{dt}$ in Eq. (1), we obtain

$$\begin{array}{c}S\left(t\right)=S(0)-\int_0^{t}{\upphi \left(t\right)}dt\end{array}$$

(14)

The initial value

$$\begin{array}{c}S(0)=N-\left(E\left(0\right)+I\left(0\right)+R\left(0\right)+{D}\left(0\right)\right)\end{array}$$

(15)

Putting ${S}$ into $\upphi (t)$, we have

$$\beta \left(t\right) = \frac{{N}}{{{SI}}}\left( {\frac{dE}{dt}} + \sigma {E} \right)$$
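The construction in Eqs. (10)–(15) can be sketched as follows. This is an assumption-laden illustration rather than the authors' implementation: $C$ and $D$ are taken as logistic curves with hypothetical coefficients, $N$ is hypothetical, and $f$, $\gamma$ use the Guinea values quoted above. Because the curves are logistic, the integrals in Eqs. (11) and (14) have closed forms: $\int_0^t I\,dt=(D(t)-D(0))/(f\gamma)$ and $\int_0^t \upphi\,dt=E(t)-E(0)+C(t)-C(0)$.

```python
import math

def logistic_derivs(t, rate, inv_cap, x0):
    """Value, first and second derivative of dx/dt = rate*x*(1 - inv_cap*x), x(0) = x0."""
    x = 1.0 / (inv_cap + (1.0 / x0 - inv_cap) * math.exp(-rate * t))
    dx = rate * x * (1.0 - inv_cap * x)
    d2x = rate * dx * (1.0 - 2.0 * inv_cap * x)
    return x, dx, d2x

# Hypothetical regression coefficients and population size.
N = 1_000_000.0
CASES = (0.05, 1.0 / 3000.0, 10.0)    # a, b, C(0) in Eq. (7)
DEATHS = (0.05, 1.0 / 2000.0, 5.0)    # c, d, D(0) in Eq. (7)
F, GAMMA = 0.6586, 1.0 / 10.46        # fatality and removal rate (Guinea values)

def reconstruct(t, sigma):
    """Return S, E, I, R, phi, beta and R_t at time t for a given sigma."""
    C, dC, d2C = logistic_derivs(t, *CASES)
    D, dD, _ = logistic_derivs(t, *DEATHS)
    C0, dC0, _ = logistic_derivs(0.0, *CASES)
    D0, dD0, _ = logistic_derivs(0.0, *DEATHS)

    I = dD / (F * GAMMA)                       # Eq. (10)
    R = (1.0 - F) / F * (D - D0)               # Eq. (11), integral in closed form
    E = dC / sigma                             # Eq. (12)
    phi = d2C / sigma + dC                     # Eq. (13): dE/dt + sigma*E
    E0, I0 = dC0 / sigma, dD0 / (F * GAMMA)
    S0 = N - (E0 + I0 + 0.0 + D0)              # Eq. (15) with R(0) = 0
    S = S0 - ((E - E0) + (C - C0))             # Eq. (14), integral in closed form
    beta = N * phi / (S * I)                   # transmission rate
    Rt = beta * S / (GAMMA * N)                # Eq. (2)
    return {"S": S, "E": E, "I": I, "R": R, "phi": phi, "beta": beta, "Rt": Rt}

for day in (50.0, 100.0, 150.0):
    out = reconstruct(day, sigma=1.0 / 11.4)
    print(f"day {day:.0f}: beta = {out['beta']:.4f}, R_t = {out['Rt']:.3f}")
```

Note that ${\text{R}}_{t}$ reduces to $\upphi/(\gamma I)$, so it does not depend on $N$ or $S(0)$, whereas $\beta(t)$ does.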

Transmission rate curves and the date of CP

The incubation period is defined as the interval between exposure to a pathogen and the initial occurrence of symptoms and signs³⁵. The mean incubation period of Ebola virus disease ranges from 2 to 21 days depending on simulation methods, data and country^20,23,35. The time-dependent transmission rates are calculated for three different incubation periods. The first is 5.3 days, as reported for the Ebola virus in Congo²³. The second is 11.4 days, based on data from the WHO Ebola Response Team²⁰, and the third is the maximum incubation period³⁵. Figure 3a,c shows the estimated transmission rates for these incubation periods ($1/\sigma$). The longer the incubation period, the greater the maximum value of the transmission rate and the faster its decay. The transmission rate curves for the three incubation periods also intersect at a single point, as stated in Theorem 1.
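Locating CP from two transmission rate curves can be sketched with a bisection on their difference. The example below is illustrative only: the logistic coefficients, $N$, and the search bracket are assumptions, and the 8.0-day incubation period is a hypothetical third value used to check that a different pair of curves gives the same crossing time. It relies on the fact that $S+E=N-I(0)-R(0)-D(0)-(C(t)-C(0))$ does not depend on $\sigma$, which follows from Eqs. (12), (14) and (15).

```python
import math

def logistic_derivs(t, rate, inv_cap, x0):
    """Value, first and second derivative of dx/dt = rate*x*(1 - inv_cap*x), x(0) = x0."""
    x = 1.0 / (inv_cap + (1.0 / x0 - inv_cap) * math.exp(-rate * t))
    dx = rate * x * (1.0 - inv_cap * x)
    d2x = rate * dx * (1.0 - 2.0 * inv_cap * x)
    return x, dx, d2x

# Hypothetical regression coefficients and population size.
N = 1_000_000.0
CASES = (0.05, 1.0 / 3000.0, 10.0)
DEATHS = (0.05, 1.0 / 2000.0, 5.0)
F, GAMMA = 0.6586, 1.0 / 10.46

def beta(t, incubation_days):
    """Transmission rate reconstructed for a given incubation period 1/sigma."""
    sigma = 1.0 / incubation_days
    C, dC, d2C = logistic_derivs(t, *CASES)
    _, dD, _ = logistic_derivs(t, *DEATHS)
    _, dD0, _ = logistic_derivs(0.0, *DEATHS)
    I = dD / (F * GAMMA)
    E = dC / sigma
    # S + E is the same for every sigma; only its split into S and E changes.
    s_plus_e = N - dD0 / (F * GAMMA) - DEATHS[2] - (C - CASES[2])
    S = s_plus_e - E
    return N * (d2C / sigma + dC) / (S * I)

def cross_point(inc_a, inc_b, lo, hi, tol=1e-8):
    """Bisection for the time at which the two beta curves intersect."""
    def diff(t):
        return beta(t, inc_a) - beta(t, inc_b)
    if diff(lo) * diff(hi) > 0:
        raise ValueError("bracket does not contain a sign change")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if diff(lo) * diff(mid) <= 0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)

cp_first_pair = cross_point(5.3, 11.4, 60.0, 250.0)
cp_second_pair = cross_point(5.3, 8.0, 60.0, 250.0)
print(f"CP from (5.3, 11.4) days: {cp_first_pair:.4f}")
print(f"CP from (5.3, 8.0) days:  {cp_second_pair:.4f}")
```

In this idealized setting both pairs return the same day, as Theorem 1 predicts, so the choice of incubation periods does not matter for locating CP.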

Time comparison of CP and ${\text{R}}_{t}=1$ points

Table 2 shows the values of $E/S$, $\frac{\sigma E}{\upgamma I}$, and ${\text{R}}_{t}$ in Eq. (5) at CP. In both countries, ${\text{R}}_{t}>1$ at CP, indicating that CP is expected to appear earlier than ${\text{R}}_{t}=1$. For Guinea, ${\text{R}}_{t}=1$ appears on the 241st day and CP on the 237th day from March 25, 2014. For Sierra Leone, ${\text{R}}_{t}=1$ appears on the 189th day and CP on the 185th day from May 27, 2014 (see Table 4). The dates on which ${\text{R}}_{t}=1$ in Fig. 3 are approximately November 20, 2014 for Guinea and December 2, 2014 for Sierra Leone for the incubation period of 11.4 days (see Table 3). CP is very close to the date of ${\text{R}}_{t}=1$ and is independent of the incubation period. Therefore, CP can be an alternative indicator suggesting that the disease is very close to being under control, ${\text{R}}_{t}\approx 1$, even when the incubation period is unknown or estimated with high uncertainty.

Table 2 Information on $E/S$, $\frac{\sigma E}{\upgamma I}$, and ${\text{R}}_{t}$ at CP for Guinea and Sierra Leone.

Table 3 The dates of CP and ${\text{R}}_{t}=1$.

Table 3 presents the time sequence in which four events occur, including CP, ${\text{R}}_{t}=1$, and the maximum values of E(t) and I(t). Using Eqs. (1) and (2), ${\text{R}}_{t}=1$ can be expressed as

$$\begin{array}{c}dE/dt+dI/dt=0\end{array}$$

(16)

${\text{R}}_{t}=1$ occurs only a few days after passing the time point of CP, as shown in Fig. 3b,d.
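The equivalence between ${\text{R}}_{t}=1$ and Eq. (16) can be written out in one line, as a worked step added for clarity. Combining Eqs. (2) and (3) and using $\frac{dI}{dt}=\sigma E-\gamma I$ from Eq. (1):

$${\text{R}}_{t}-1=\frac{\frac{dE}{dt}+\sigma E}{\gamma I}-1=\frac{\frac{dE}{dt}+\sigma E-\gamma I}{\gamma I}=\frac{\frac{dE}{dt}+\frac{dI}{dt}}{\gamma I}$$

so ${\text{R}}_{t}=1$ exactly when $dE/dt+dI/dt=0$. Provided $E$ peaks before $I$, the expression is positive at $dE/dt=0$ (where $dI/dt>0$) and negative at $dI/dt=0$ (where $dE/dt<0$), so ${\text{R}}_{t}=1$ falls between these two times.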

Information about the reported data and the corresponding dates

Table 4 shows the reported data points that bracket the inflection point obtained from the regression curve. In Guinea, starting on November 12, 2014, the cumulative number of confirmed cases rose from 1878 to 1919 in two days, which corresponds to indices 63 and 64 in Table 4. The case count at the inflection point of the regression curve lies between 1894 and 1901, which means that $dE/dt=0$ occurs between November 12 and 14, 2014. In Sierra Leone, the inflection point from the regression lies between 6329 and 6667 cases. Hence, $dE/dt=0$ occurs between November 21 and 27, 2014. The calendar date that corresponds to CP is November 15–16 for Guinea and November 26–27 for Sierra Leone. The days of $dI/dt=0$ are November 24–25 and December 6–7, respectively. The time-dependent reproduction number is nearly 1 on November 20–22 for Guinea and December 2–4 for Sierra Leone. The time-dependent reproduction number (${\text{R}}_{t}$) at CP was 1.011 for Guinea and 1.028 for Sierra Leone, as shown in Table 2. The index refers to the order in which the number of cases was reported, counted from March 25, 2014 for Guinea and May 27, 2014 for Sierra Leone.

Table 4 Reference dates for incidents.

The ratio of $E/S$ for various $S(0)$

Figure 4a,c illustrates the estimated $E/S$ ratios for four values of S(0) ranging from 20,000 to 80,000 for Guinea and Sierra Leone. In Guinea, when S(0) is 20,000, the maximum value of $E/S$ occurs around the 243rd day. When S(0) is 60,000, it occurs around the 239th day, and for 80,000, it occurs approximately on the 238th day, as depicted in Fig. 4a. For Sierra Leone, when S(0) is 20,000, the maximum value of $E/S$ occurs around the 210th day. When S(0) is 80,000, the maximum value of $E/S$ occurs on the 189th day, as shown in Fig. 4c. The times of the extreme values of the $E/S$ ratio and of CP are traced in Fig. 4b,d. The extreme values are estimated pointwise from the $E/S$ ratio curves, while the Cross Points are calculated from two transmission rate curves. Since $dE/dt=0$ and $dI/dt=0$ can be estimated through Eqs. (10)–(12), the time of ${\text{R}}_{t}=1$ estimated from Eq. (16) is not affected by S(0). When S(0) is 20,000, ${\text{R}}_{t}=1$ appears before the maximum value of $E/S$. This ordering persists until S(0) reaches 30,000 for Guinea and 60,000 for Sierra Leone. For S(0) over 100,000, the time of the maximum value of $E/S$ converges to the 237th day for Guinea and the 185th day for Sierra Leone, as shown in Fig. 4b,d. As S(0) increases beyond 80,000, the maximum value of $E/S$ appears before ${\text{R}}_{t}=1$ for both countries in Fig. 4b,d.
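The dependence of the $E/S$ peak on the initial susceptible pool can be explored with a short sweep. This sketch uses a hypothetical logistic case curve and hypothetical pool sizes, so its numbers do not reproduce Fig. 4. It uses the identity $E^{\prime}S-ES^{\prime}=\left({f_{1}}^{\prime}f_{2}+{f_{1}}^{2}\right)/\sigma$ with $f_{1}=dC/dt$ and $f_{2}=S+E$, which follows from the definitions in the proof of Theorem 1 and shows that the peak time does not depend on $\sigma$.

```python
import math

A, B, C0 = 0.05, 1.0 / 3000.0, 10.0   # hypothetical logistic coefficients for C(t)

def cases(t):
    """C, dC/dt and d2C/dt2 for the logistic dC/dt = A*C*(1 - B*C), C(0) = C0."""
    C = 1.0 / (B + (1.0 / C0 - B) * math.exp(-A * t))
    dC = A * C * (1.0 - B * C)
    d2C = A * dC * (1.0 - 2.0 * B * C)
    return C, dC, d2C

def es_stationarity(t, s_plus_e0):
    """Proportional to E'S - ES'; zero exactly where d(E/S)/dt = 0."""
    C, f1, df1 = cases(t)
    f2 = s_plus_e0 - (C - C0)          # S + E shrinks by the cases that left E
    return df1 * f2 + f1 * f1

def es_peak_time(s_plus_e0, lo=60.0, hi=300.0, tol=1e-8):
    """Bisection for the time of the E/S maximum given S(0) + E(0)."""
    if es_stationarity(lo, s_plus_e0) * es_stationarity(hi, s_plus_e0) > 0:
        raise ValueError("bracket does not contain a sign change")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if es_stationarity(lo, s_plus_e0) * es_stationarity(mid, s_plus_e0) <= 0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)

pools = [5_000.0, 20_000.0, 80_000.0, 1_000_000.0]   # hypothetical S(0) + E(0)
peaks = [es_peak_time(p) for p in pools]
inflection = math.log(1.0 / (B * C0) - 1.0) / A       # time at which dE/dt = 0
for pool, peak in zip(pools, peaks):
    print(f"S(0)+E(0) = {pool:>9.0f}: E/S peaks on day {peak:.2f}")
print(f"inflection of C (dE/dt = 0): day {inflection:.2f}")
```

In this idealized setting the peak moves earlier as the pool grows and approaches the inflection time of $C$, which mirrors the convergence with increasing S(0) described above.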

Discussion

By solving the inverse problem, the time-dependent transmission rate is estimated using the cumulative data of Ebola outbreaks in Sierra Leone and Guinea between 2014 and 2016, and by rearranging the differential equation system of the SEIR model. The logistic equation (Eq. 7) fits very well to the data, as shown in Fig. 2, and it is very useful to explain the proposed algorithm. However, other statistical methods such as the adaptive Metropolis–Hastings (M–H) algorithm for the Bayesian Markov Chain Monte Carlo (MCMC) procedure can also be used³⁶. After obtaining the appropriate regression function, the variables ($S$, $E$, $I$ and $R$) and parameters ($f,\gamma$) of the system can be found easily by the inverse method. Although the mean infectious time can be selected from other references^19,36, the mean infectious time is estimated using cumulative data by its pseudoinverse (Eqs. 8, 9). If parameters ($f,\gamma$) are given, then Eqs. (1) and (2) are enough to estimate ${\text{R}}_{t}$, $E/S$, and CP.

We can track the E/S ratio by tracking the distance between the transmission rate curves for two incubation periods. If this distance does not decrease, the infectious disease is still spreading, so quarantine measures must be further strengthened. At CP the distance becomes 0, which indicates that quarantine measures are being implemented appropriately.

The existence of CP is inherent in the SEIR model and, therefore, does not depend on data regression methods. From CP, it is observed that the transmission rate for a longer incubation period is lower than that for a shorter incubation period. The point at which $\frac{{d}}{{{dt}}}\left( {E/S} \right)=0$, or CP, represents the moment when the transmission rate shifts to a less transmissible rate for a longer incubation period.

The length of the incubation period varies greatly depending on the initial infection dose, the rate of pathogen replication, and the defense mechanisms within the host^37,38. The possibility that the characteristics of the pathogen changed before and after CP cannot be ruled out. At CP or near ${\text{R}}_{t}=1$, the amount or pattern of replication of the pathogen in the host immune system may change. It is necessary to study changes in the incubation period depending on the characteristics of the pathogen within the host.

The accuracy of estimating the date of ${\text{R}}_{t}=1$ depends entirely on the precision of creating the regression function, which relies on different datasets, such as those up to August or September 2014. In this case study, the date of ${\text{R}}_{t}=1$ is estimated using the equation $dE/dt+dI/dt=0$, derived from Eqs. (1) and (2). The inflection points of cumulative data, such as when $dE/dt=0$ and $dI/dt=0$, appeared only a few days apart, as shown in Table 3. CP and ${\text{R}}_{t}=1$ occur between $dE/dt=0$ and $dI/dt=0$, which means that the occurrence times of CP and ${\text{R}}_{t}=1$ are very close. Although the timing of CP and ${\text{R}}_{t}=1$ depends on the value of $E/S$ in Eq. (5), they are very close to each other. The time-dependent reproduction number reaches one a few days after passing CP, indicating that the reproduction number is still greater than one at the time of CP. However, we can infer from CP that the epidemic will begin to decline within a few days. Thus, CP can be considered a new indicator that the epidemic is nearly under control. Moreover, since CP is not affected by the incubation period, it has the potential to serve as a criterion that can replace ${\text{R}}_{t}=1$ when there is uncertainty about the length of the incubation period.

The value of S(0) is a crucial factor that determines the temporal relationship between ${\text{R}}_{t}=1$ and CP. When S(0) is set to over 80,000, CP consistently appears earlier than ${\text{R}}_{t}=1$ for both countries, serving as a precautionary indicator that ${\text{R}}_{t}=1$ is imminent. However, assuming S(0) is small, such as 20,000, CP may appear after ${\text{R}}_{t}=1$, as shown in Fig. 4. In this case study, S(0) is assumed to be close to the total population, exceeding millions for both countries. Consequently, CP is expected to appear earlier than ${\text{R}}_{t}=1$, and this is also confirmed in the simulation.

Conclusion

In solving the inverse problem of the SEIR model, we prove that transmission rate curves for various incubation periods intersect at a single point, denoted as CP (Cross Point), where $\frac{{d}}{{{dt}}}\left( {E/S} \right)=0$. The extreme value of the ratio $E/S$ occurs immediately before or immediately after ${\text{R}}_{t}=1$, depending on S(0). Therefore, at CP, ${\text{R}}_{t}$ is very close to 1, so we can expect the epidemic to stabilize soon. The $E/S$ value can be estimated from incidence data or cumulative data with the inverse method when the mean generation time and S(0) are given, and the extreme value of $E/S$ can then be traced. However, tracing CP is more convenient. By plotting transmission rate curves, β(t), for any two arbitrary incubation periods and tracking where they intersect, we can trace CP in time. Since CP is obtained using arbitrary incubation periods, accurate incubation period information is not required to find the extreme point of the $E/S$ ratio. Tracking the $E/S$ ratio through other approaches, such as stochastic or artificial intelligence methods, can be useful for predicting and estimating the state of the epidemic. If S(t) is controlled by an effective vaccine or appropriate interventions, CP can be reached very quickly. This would be one way to reach ${\text{R}}_{t}=1$ quickly.

Data availability

All data generated or analysed during this study are included in this published article. GitHub Reference: https://github.com/wuj2293/The-role-of-the-ES-ratio-in-the-SEIR-model.

References

Cauchemez, S. et al. Real-time estimates in early detection of SARS. Emerg. Infect. Dis. 12, 110. https://doi.org/10.3201/eid1201.050593 (2006).
Nishiura, H. & Chowell, G. The effective reproduction number as a prelude to statistical estimation of time-dependent epidemic trends. Math. Stat. Estimation Approaches Epidemiol. https://doi.org/10.1007/978-90-481-2313-1_5 (2009).
Cori, A., Ferguson, N. M., Fraser, C. & Cauchemez, S. A new framework and software to estimate time-varying reproduction numbers during epidemics. Am. J. Epidemiol. 178, 1505–1512. https://doi.org/10.1093/aje/kwt133 (2013).
Thompson, R. N. et al. Improved inference of time-varying reproduction numbers during infectious disease outbreaks. Epidemics 29, 100356. https://doi.org/10.1016/j.epidem.2019.100356 (2019).
Dehning, J. et al. Inferring change points in the spread of COVID-19 reveals the effectiveness of interventions. Science 369, eabb9789. https://doi.org/10.1126/science.abb9789 (2020).
Huisman, J. S. et al. Estimation and worldwide monitoring of the effective reproductive number of SARS-CoV-2. Elife 11, e71345. https://doi.org/10.7554/eLife.71345 (2022).
Annunziato, A. & Asikainen, T. Effective reproduction number estimation from data series. JRC121343. https://doi.org/10.2760/036156 (2020).
Gostic, K. M. et al. Practical considerations for measuring the effective reproductive number, R_t. PLoS Comput. Biol. 16, e1008409. https://doi.org/10.1371/journal.pcbi.1009679 (2020).
McCarthy, Z. et al. Quantifying the shift in social contact patterns in response to non-pharmaceutical interventions. J. Math. Ind. 10, 1–25. https://doi.org/10.1186/s13362-020-00096-y (2020).
Pollicott, M., Wang, H. & Weiss, H. Extracting the time-dependent transmission rate from infection data via solution of an inverse ODE problem. J. Biol. Dyn. 6, 509–523. https://doi.org/10.1080/17513758.2011.645510 (2012).
Wang, X., Wang, H., Ramazi, P., Nah, K. & Lewis, M. A hypothesis-free bridging of disease dynamics and non-pharmaceutical policies. Bull. Math. Biol. 84, 57. https://doi.org/10.1007/s11538-022-01012-8 (2022).
Nadler, P., Wang, S., Arcucci, R., Yang, X. & Guo, Y. An epidemiological modelling approach for COVID-19 via data assimilation. Eur. J. Epidemiol. 35, 749–761. https://doi.org/10.1007/s10654-020-00676-7 (2020).
Grimm, V., Heinlein, A., Klawonn, A., Lanser, M. & Weber, J. Estimating the time-dependent contact rate of SIR and SEIR models in mathematical epidemiology using physics-informed neural networks. Electron. Trans. Numer. Anal. 56, 1–27. https://doi.org/10.1553/etna_vol56s1 (2022).
Hadeler, K. Parameter identification in epidemic models. Math. Biosci. 229, 185–189. https://doi.org/10.1016/j.mbs.2010.12.004 (2011).
Kong, J. D., Jin, C. & Wang, H. The inverse method for a childhood infectious disease model with its application to pre-vaccination and post-vaccination measles data. Bull. Math. Biol. 77, 2231–2263. https://doi.org/10.1007/s11538-015-0121-5 (2015).
Smirnova, A., deCamp, L. & Chowell, G. Forecasting epidemics through nonparametric estimation of time-dependent transmission rates using the SEIR model. Bull. Math. Biol. 81, 4343–4365. https://doi.org/10.1007/s11538-017-0284-3 (2019).
Mubayi, A. et al. Analytical estimation of data-motivated time-dependent disease transmission rate: An application to Ebola and selected public health problems. Trop. Med. Infect. Dis. 6, 141. https://doi.org/10.3390/tropicalmed6030141 (2021).
Wang, X., Wang, H., Ramazi, P., Nah, K. & Lewis, M. From policy to prediction: Forecasting COVID-19 dynamics under imperfect vaccination. Bull. Math. Biol. 84, 90. https://doi.org/10.1007/s11538-022-01047-x (2022).
Chowell, G. & Nishiura, H. Transmission dynamics and control of Ebola virus disease (EVD): A review. BMC Med. 12, 1–17. https://doi.org/10.1186/s12916-014-0196-0 (2014).
WHO Ebola Response Team. Ebola virus disease in West Africa—The first 9 months of the epidemic and forward projections. N. Engl. J. Med. 371, 1481–1495. https://doi.org/10.1056/NEJMoa1411100 (2014).
Burghardt, K. et al. Testing modeling assumptions in the West Africa Ebola outbreak. Sci. Rep. 6, 34598. https://doi.org/10.1038/srep34598 (2016).
Abah, R. T., Zhiri, A. B., Oshinubi, K. & Adeniji, A. Mathematical analysis and simulation of Ebola virus disease spread incorporating mitigation measures. Franklin Open 6, 100066. https://doi.org/10.1016/j.fraope.2023.100066 (2024).
Althaus, C. L. Estimating the reproduction number of Ebola virus (EBOV) during the 2014 outbreak in West Africa. PLoS Curr. https://doi.org/10.1371/currents.outbreaks.91afb5e0f279e7f29e7056095255b288 (2014).
Anderson, R. M. & May, R. M. Directly transmitted infectious diseases: Control by vaccination. Science 215, 1053–1060. https://doi.org/10.1126/science.7063839 (1982).
Anderson, R. M. & May, R. M. Infectious Diseases of Humans: Dynamics and Control (Oxford University Press, 1991).
Murray, J. D. Mathematical Biology I: An Introduction (Springer, 2002).
Sturniolo, S., Waites, W., Colbourn, T., Manheim, D. & Panovska-Griffiths, J. Testing, tracing and isolation in compartmental models. PLoS Comput. Biol. 17, e1008633. https://doi.org/10.1371/journal.pcbi.1008633 (2021).
Le, A., King, A. A., Magpantay, F. M. G., Mesbahi, A. & Rohani, P. The impact of infection-derived immunity on disease dynamics. J. Math. Biol. 83, 1–23. https://doi.org/10.1007/s00285-021-01681-4 (2021).
Wi, Y. Analysis of transmission rate of COVID-19 using SEIR model: M.Sc. Thesis (Korean), Chonnam National University, http://www.riss.kr/link?id=T16494961 (2022).
WHO. Ebola (Ebola Virus Disease) 2020, https://www.cdc.gov/vhf/ebola/history/2014-2016-outbreak/case-counts.html (2020).
Weitz, J. S. & Dushoff, J. Modeling post-death transmission of Ebola: Challenges for inference and opportunities for control. Sci. Rep. 5, 8751. https://doi.org/10.1038/srep08751 (2015).
Gavin, H. P. The Levenberg-Marquardt algorithm for nonlinear least squares curve-fitting problems. Department of Civil and Environmental Engineering, Duke University 19. https://people.duke.edu/~hpgavin/ExperimentalSystems/lm.pdf (2019).
The MathWorks Inc. MATLAB version: 9.7.0 (R2019b) (The MathWorks Inc., Natick, Massachusetts, United States) https://www.mathworks.com/ (2019).
Browne, C., Gulbudak, H. & Webb, G. Modeling contact tracing in outbreaks with application to Ebola. J. Theor. Biol. 384, 33–49. https://doi.org/10.1016/j.jtbi.2015.08.004 (2015).
Van Kerkhove, M. D., Bento, A. I., Mills, H. L., Ferguson, N. M. & Donnelly, C. A. A review of epidemiological parameters from Ebola outbreaks to inform early public health decision-making. Sci. Data 2, 1–10. https://doi.org/10.1038/sdata.2015.19 (2015).
Shen, M., Xiao, Y. & Rong, L. Modeling the effect of comprehensive interventions on Ebola virus transmission. Sci. Rep. 5, 15818. https://doi.org/10.1038/srep15818 (2015).
Nishiura, H. Early efforts in modeling the incubation period of infectious diseases with an acute course of illness. Emerg. Themes Epidemiol. 4, 1–12. https://doi.org/10.1186/1742-7622-4-2 (2007).
Virlogeux, V. et al. Brief report: Incubation period duration and severity of clinical disease following severe acute respiratory syndrome coronavirus infection. Epidemiology 26, 666–669. https://doi.org/10.1097/EDE.0000000000000339 (2015).

Acknowledgements

The authors would like to acknowledge the Ministry of Education, Republic of Korea, for funding this research through the National Research Foundation of Korea. Special thanks are extended to the reviewers and editors for their valuable feedback.

Funding

This research was supported by the basic research program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF-2020R1I1A3071769, NRF-2017R1D1A3B06032544, NRF-2022R1F1A1063007), Republic of Korea. KN gratefully acknowledges support from the National Institute for Mathematical Sciences (NIMS) Grant funded by the Korean Government (NIMS-B24730000).

Author information

Authors and Affiliations

Department of Mathematics and Statistics, Chonnam National University, Gwangju, South Korea
Minkyu Kwak, Yunju Wi & Hongsung Jin
Department of Mathematics and Physics, Luoyang Institute of Science and Technology, Henan, China
Xiuxiu Sun
Busan Center for Medical Mathematics, National Institute of Mathematical Sciences, Busan, South Korea
Kyeongah Nah
Department of Mathematics, Kyungpook National University, Daegu, South Korea
Yongkuk Kim

Contributions

Conceptualization and methodology, M.K. and H.J.; software and validation, X.S., Y.W. and H.J.; formal analysis, investigation and resources, H.J., Y.K. and Y.W.; writing—original draft preparation, M.K., K.N. and H.J.; writing—review and editing, M.K., Y.K. and H.J.; visualization, X.S., Y.W. and H.J.; supervision, M.K. and H.J.; funding acquisition, M.K., Y.K., K.N. and H.J.

Corresponding author

Correspondence to Hongsung Jin.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Kwak, M., Sun, X., Wi, Y. et al. A novel indicator in epidemic monitoring through a case study of Ebola in West Africa (2014–2016). Sci Rep 14, 12147 (2024). https://doi.org/10.1038/s41598-024-62719-3

Received: 05 February 2024
Accepted: 21 May 2024
Published: 27 May 2024
DOI: https://doi.org/10.1038/s41598-024-62719-3
Springer Nature Limited