Model-consistent estimation of the basic reproduction number from the incidence of an emerging infection

Roberts, M. G.; Heesterbeek, J. A. P.

doi:10.1007/s00285-007-0112-8

Model-consistent estimation of the basic reproduction number from the incidence of an emerging infection

Open access
Published: 08 August 2007

Volume 55, pages 803–816, (2007)
Cite this article

Download PDF

You have full access to this open access article

Journal of Mathematical Biology Aims and scope Submit manuscript

Model-consistent estimation of the basic reproduction number from the incidence of an emerging infection

Download PDF

M. G. Roberts¹ &
J. A. P. Heesterbeek²

4148 Accesses
89 Citations
1 Altmetric
Explore all metrics

Abstract

We investigate the merit of deriving an estimate of the basic reproduction number $ \mathcal{R}_0 $ early in an outbreak of an (emerging) infection from estimates of the incidence and generation interval only. We compare such estimates of $ \mathcal{R}_0 $ with estimates incorporating additional model assumptions, and determine the circumstances under which the different estimates are consistent. We show that one has to be careful when using observed exponential growth rates to derive an estimate of $ \mathcal{R}_0 $ , and we quantify the discrepancies that arise.

On the Reproduction Number of Epidemics with Sub-exponential Growth

A Note on Observation Processes in Epidemic Models

Article 07 March 2020

Population Dynamics of Infectious Diseases

1 1 Introduction

The basic reproduction number R ₀ of an infectious agent is defined as the expected number of secondary cases caused by one typical infected individual in a population consisting of susceptibles only [3,6,7]. When an outbreak has started and the approximation that the population is fully susceptible no longer holds, one generally refers to the effective reproduction number R. The value of R ₀ is, as a rule, different for different infectious agents and depends among other things on the characteristics of the population that the agent invades. Given this, it is not immediate that one can adopt previously determined values or size ranges for a new outbreak, unless many of the complicated characteristics of, for example, population composition and contact structure are comparable. For various reasons one can be interested in the value of R ₀ or R early in an outbreak and during the outbreak. Notably, under a homogeneous mixing assumption, the values give insight into the extent of the control problem and a means of calculating how much control effort is needed.

In recent years several new methods for estimating R ₀ from outbreak data have been published, either as a general tool or for specific applications [4,9,11,17,21,24]. Some, like Wallinga and Teunis [24] do not need much data, but require knowledge of the generation time distribution. Most of these methods, however, are data-hungry: they either need contact information, use the whole outbreak time series (so are effectively retrospective measures), or increase in accuracy as the time series becomes longer. While several methods are promising many problems remain, and for various reasons. For example, we do not observe infections we observe detections, i.e. individuals (people, animals, farms, plants) exhibiting symptoms. There is then a possibly unknown incubation period distribution that convolutes the infection process into the observed process. Detections are not necessarily in the order of infection. Moreover, R ₀ is a generation-based concept [12], but generations are not observed—a daily number of new detections is observed from possibly mixed generations. Early in the outbreak stochastic influences play a large role. Also, heterogeneity between individual infectivity and susceptibility and in contact pattern may cause the distribution of which R ₀ is the mean to be highly skewed (e.g. [16]). To make matters worse, we almost never have data from an uncontrolled situation — some control measures, effective or not, often operate from the moment of detection of the index case.

Despite advances, but in light of the problems encountered, many publications in which R ₀ is estimated from outbreak data still depend on cumulative incidence and generation interval only (see for example [5,8,14,18,26]). As a rule, the cumulative incidence in outbreaks of an infectious disease is observed to initially grow approximately exponentially with time (and hence the incidence grows exponentially too). A frequently used approach is to fit an exponential function to the (cumulative) incidence and to use the approximate relationship ${R_0} \approx {e^{r{T_G}}}$ to estimate R ₀, where r is the exponential growth rate and T _G is the observed mean generation interval of the epidemic. For rT _G small the further approximation ${R_0} \approx 1 + r{T_G}$ is sometimes used. Many of the problems mentioned above apply to these estimates. For example, the ‘real’ value of r is not observed, not only because of control measures in operation but also due to the stochasticity in the early phase. In addition, the definition of the generation interval is not always consistently used and the method presupposes that the population is homogeneously mixing. Still, the method is easy and intuitive and one can wonder in which circumstances it would be a ‘good’ approximation, and how large discrepancies can be when these circumstances are not met. In this paper we investigate these questions.

2 2 Model-consistent estimation of R ₀

We now derive estimates of R ₀ based on specific models and compare these with the previously mentioned approximations, which we denote $R_0^ + = {e^{r{T_G}}}$ and $R_0^ - = 1 + r{T_G}$. We need to emphasise that R ₀ is independent of timescale, whereas T _G has dimension time and r has dimension time⁻¹. We also need to emphasise that we do not know r or T _G, we assume that they have been estimated from data in some way, for example by estimating the doubling time of the incidence, D, and writing r = log(2)/D. We do not write ^r or ^T _G for these estimates, as this would result in far too many hats in one paper.

Assume for simplicity that the population is mixing homogeneously. The incidence of an emerging infection may be calculated from

$$i(t) = \delta (t) + {{S(t)} \over N}\int\limits_0^\infty {A(\tau )i(t - \tau )d\tau } $$

where δ(t), a unit spike, is the incidence of infection at time zero, the kernel A(τ) is the expected infectivity of an infected as a function of τ, the time since exposure to infection [1,6,19]. The number in the population susceptible at time t is

$$S(t) = N - \int\limits_0^t {i(u){\rm{d}}u} $$

For an emerging infection we assume the entire population to be susceptible at time zero. If this is not the case, we take N to be the size of the susceptible population prior to infection. As a first step towards developing a model, we specify the general form of the kernel A(τ). We write A(τ) = R ₀ f(τ), where R ₀ is the basic reproduction number that we wish to estimate, and f(τ) is the infectivity kernel, which is also the probability distribution of the generation interval.

For an emerging infection we have little information about f. We may have observations of the latent period (the time from exposure to infection to becoming infectious, T _E); the incubation period (the time from exposure to infection to the onset of symptoms) which we may in some cases assume to equal T _E; or the infectious period T _I. Given these we may wish to impose a particular form on the kernel, and use our limited knowledge to estimate parameter values for the distribution. These estimates may be revised as more information becomes available.

One quantity of interest is the mean generation interval of the epidemic, which is taken here to be the mean time from an individual’s exposure to infection to exposing others to infection (see [10], for an insightful exposition). We refer not to the time to the first occurrence of a secondary infection, but to the average time to all secondary infections. Alternatively, and equivalently, it can be defined as the expected duration of the primary infection at the time that a secondary infection occurs (see [22]). The mean generation interval may be determined from the formula

$${T_G} = \int\limits_0^\infty {tf(t){\rm{d}}t} $$

Given a probability distribution for the generation interval, f(τ), and an estimated initial rate of exponential increase for the epidemic, r, we approximate the initial stages of the epidemic by i(t) = e ^rt with S(t) ≃ N. Equation (1) then leads to a model-consistent estimate of the basic reproduction number via the formula

$${R_0}\int\limits_0^\infty {{e^{ - rt}}f(t){\rm{d}}t} = 1$$

(see [6]). If f(t) were a delta function, then Eqs. (2, 3) would lead to the estimate R ₀ = R ⁺₀ . For the SIR model, where f(t) = γe ^−γt, Eqs. (2, 3) lead to the estimate R ₀ = R ⁻₀ . We now computeR ₀ for three distribution functions which may be used as kernels: those with a fixed, exponentially or trapezoidally distributed infectious period (see Fig. 1). We refer to these as R ^fix₀ , R ^exp₀ and R ^trap₀ , respectively. We also compute R ₀ for the model with latent and infectious periods that each have gamma distributions, referred to as R ^(m,n)₀ . We have R ^exp₀ = R ^(1,1₀ )} and R ^fix₀ = lim_m,n→∞ R ^(m,n)₀ .

2.1 2.1 Fixed infectious period

Given fixed latent and infectious periods, T _E and T _I respectively, and assuming f constant when non-zero, we have f(τ) = 1/T _I for T _E < τ < T _E+ T _I and f(τ) = 0 otherwise. For this distribution T _G = T _E + T _I/2 and

$${R_0} = R_0^{{\rm{fix}}} = {{r\left( {{T_G} - {T_E}} \right)} \over {\sinh r\left( {{T_G} - {T_E}} \right)}}{e^{r{T_G}}}$$

R ⁺₀ is useful as an estimator for R ^fix₀ when the latent period may be regarded as fixed and the infectious period is short relative to the timescale 1/r (rT _I is small). As sinh x > x whenever x > 0, and ${\lim _{x \to 0}}{{\sinh x} \over x} = 1$ we have R ^fix₀ ≤ R ⁺₀ , and ${\lim _{{T_E} \to {T_G}}}R_0^{{\rm{fix}}} = R_0^ + $. Wallinga and Lipsitch [23] showed that R ⁺₀ is an upper bound on estimates of R ₀ for any distribution f(t).

2.2 2.2 Trapezoidal infection kernel

Consider the kernel

$$f(\tau ) = \left\{ {\matrix{ {{1 \over {{T_I}}}{{\tau - {\tau _a}} \over {{\tau _b} - {\tau _a}}}} & : & {\tau \in \left( {{\tau _a},{\tau _b}} \right)} \cr {{1 \over {{T_I}}}} & : & {\tau \in \left( {{\tau _b},{\tau _c}} \right)} \cr {{1 \over {{T_I}}}{{{\tau _d} - \tau } \over {{\tau _d} - {\tau _c}}}} & : & {\tau \in \left( {{\tau _a},{\tau _b}} \right)} \cr 0 & : & {{\rm{otherwise}}} \cr } } \right.$$

This is a suitable approximation to an infectivity function where nobody is infectious before τ _a time units or after τ _d time units post-exposure, maximum infectivity occurs between τ _b and τ _c time units after exposure, and contact rates are constant. The distribution is consistent with a mean latent period of ${T_E} = {{{\tau _a} + {\tau _b}} \over 2}$, a mean infectious period of ${T_I} = \left( {{\tau _d} + {\tau _c} - {\tau _b} - {\tau _a}} \right)/2$ and a mean generation interval of ${T_G} = {T_E} + {{{T_I}} \over 2} + {{{{\left( {{\tau _d} - {\tau _c}} \right)}^2} - {{\left( {{\tau _b} - {\tau _a}} \right)}^2}} \over {12\left( {{\tau _d} + {\tau _c} - {\tau _b} - {\tau _a}} \right)}}$ Hence, if the trapezium is symmetric ${T_G} = {T_E} + {{{T_I}} \over 2}$, which is the same relationship as that for the fixed infectious period. The basic reproduction number solves R ^trap₀ ¯f(r) = 1, where ¯f(s) is the Laplace transform of f(t) (see Appendix 1)

2.3 2.3 SEIR differential equation models

In an extended SEIR differential equation model the population of size N is made up of S susceptibles, E that have been exposed to infection but are not yet infectious, I infectious and R that have been infected and recovered. If the epidemic processes have a much faster timescale than the demographic processes, we obtain the equations

$$\eqalign{ & {{d{E_1}} \over {dt}} = \beta {S \over N}\sum\limits_{j = 1}^n {{I_j} - mv{E_1}} \cr & {\rm{for }}i = 2,...,m{\rm{ }}{{d{E_i}} \over {dt}} = mv{E_{i - 1}} - mv{E_i} \cr & {{d{I_1}} \over {dt}} = mv{E_m} - n\gamma {I_1} \cr & {\rm{for }}j = 2,...,n{\rm{ }}{{d{I_j}} \over {dt}} = n\gamma {I_{j - 1}} - n\gamma {I_j} \cr & {{dR} \over {dt}} = n\gamma {I_n} \cr} $$

The exposed and infectious classes have been subdivided E = Σ ^m_i=1 E _i and I = Σ ⁿ_j=1 I _j, respectively. The times spent in the exposed and infectious classes are gamma distributed with means T _E = 1/ν and T _I = 1/γ, respectively, andR ₀ = β/γ. The mean generation interval is ${T_G} = {T_E} + {{n + 1} \over {2n}}{T_I}$ (see Appendix 2). If the initial rate of exponential increase of the epidemic is r, then

$$R_0^{(m,n)} = {{{{2nr} \over {n + 1}}\left( {{T_G} - {T_E}} \right){{\left( {1 + {r \over m}{T_E}} \right)}^m}} \over {1 - {{\left( {1 + {{2r} \over {n + 1}}\left( {{T_G} - {T_E}} \right)} \right)}^{ - n}}}}$$

This result is derived in Appendix 2, where it is also shown that given values of r, T _E and T _G, R ^(m,n)₀ is an increasing function of both m and n.

2.4 2.4 Exponentially distributed infectious period

The well-known SEIR differential equation model is the special case of Eqs. (6) with m = n = 1. For this model the times spent in the exposed and infectious classes are exponentially distributed with means T _E = 1/ν and T _I = 1/γ respectively, and the appropriate kernel function in Eqs. (2, and 3) is $f(\tau ) = {{\gamma v} \over {\gamma - v}}\left( {{e^{ - v\tau }} - {e^{ - \gamma \tau }}} \right)$ (see [6]). The mean generation interval is ${T_G} = {T_E} + {T_I}$, and given r we have

$$R_0^{\exp } = R_0^{(1,1)} = 1 + r\left( {{1 \over v} + {1 \over \gamma }} \right) + {{{r^2}} \over {v\gamma }} = 1 + r{T_G} + {r^2}{T_E}\left( {{T_G} - {T_E}} \right)$$

The approximation $R_0^ - = 1 + r{T_G} \le R_0^{\exp }$ is appropriate for the SIR model, for which ν → ∞, T _E → 0 and T _G → T _I = 1/γ. Hence R ⁻₀ performs best as an estimate when either the latent period T _E or the infectious period T _I is small compared to T _G, and performs worst when they are equal.

3 3 Method and results

We assumed that we had estimated values of the initial rate of exponential increase of infection incidence, r, the mean latent period, T _E, and the mean generation interval, T _G. We then used Eqs. (4, and 7) to calculate model-based estimates of the basic reproduction number using the assumptions of a fixed or exponentially distributed infectious period, leading to R ^fix₀ and R ^exp₀ , respectively. We did this for values of the ratio of the latent period to the generation interval, T _E/T _G, in the range zero to 0.99. The values of R ^fix₀ and R ^exp₀ are plotted as functions of T _E/T _G for rT _G = 0.5, 1.0, 1.5, 2.0 in Fig. 2, and compared with the values of the estimators $R_0^ - = 1 + r{T_G}$ and $R_0^ + = {e^{r{T_G}}}$ in those cases. When rT _G = 0.5, 1.0, 1.5 or 2.0 we have R ⁻₀ = 1.5, 2.0, 2.5 or 3.0 and R ⁺₀ = 1.65, 2.72, 4.48 or 7.39, respectively.

The results shown in Fig. 2 illustrate that for fixed values of r, T _E and T _G, the values of R ⁻₀ and R ⁺₀ are lower and upper bounds, respectively for both R ^exp₀ and R ^fix₀ . In Sects. (2.1 and 2.4) it was shown that R ^fix₀ ≤ R ⁺₀ and R ⁻₀ ≤ R ^exp₀ , respectively. It is proved in Appendix 2 that R ^(m,n)₀ 0 is an increasing function of both m and n. Putting these results together we obtain the inequality

$$R_0^ - \le R_0^{\exp } \le R_0^{(m,n)} < R_0^{{\rm{fix}}} \le R_0^ + $$

where m and n are any finite positive integers.

Table 1 was constructed to illustrate the results that may be obtained for some specific infections. Parameters were chosen from the literature to be representative of influenza [20], severe acute respiratory syndrome (SARS) [19], smallpox [1] and foot and mouth disease (FMD) [11]. For each infection a trapezium distribution was constructed for f(τ), and used together with an estimate of R ₀ to calculate an estimate of the initial exponential increase, r. The function f(τ) was also used to calculate values for the mean generation interval T _G, the mean latent period T _E and the mean infectious period T _I.

Table 1 Estimates of R ₀ that could be made for emerging infections

Full size table

If these values of r, T _G and T _E had been estimated from data, and then R ₀ had been estimated by R ⁻₀ , R ⁺₀ , R ^fix₀ , R ^exp₀ or R ^trap₀ , the estimates presented in Table 1 would have resulted. By construction, the estimated value of R ^trap₀ then corresponds to the assumed value of R ₀. Hence Table 1 must be regarded as a comparison of estimates that may be made; the relative values are important rather than the absolute values.

4 4 Conclusions and discussion

We have derived and discussed model-consistent methods for estimating the basic reproduction number (R ₀) for an infectious disease from the initial rate of exponential growth of incidence of infection (r) at the beginning of an epidemic. These methods can only be applied to incidence data from the period where it is reasonable to assume that the whole population may be regarded as susceptible (S(t) }~ N).

Among the first pieces of information obtained for an emerging infection are observations of the latent and infectious periods. These may be used as estimates to construct a rectangular kernel for an integral equation model, or to derive rate parameters for a differential equation model. Examples of infectivity kernels with the same latent and infectious periods are shown in Fig. 1. These kernels appear to be quite different, with the exponential kernel allowing some transmission of infection from time zero, and exhibiting a long infection tail. These features can lead to disparities in the results from modelling exercises. The transmission at early times mitigates against the success of control methods based on contact tracing, or any other method with inherent delays. The tail can lead to transmission appearing to continue in the model long after control measures should have eliminated the infection. The trapezoidal kernels also shown in Fig. 1 allow for some variability in the fixed and latent periods to be incorporated in the model. For example, the kernel shown as suitable for modelling influenza (Fig. 1a) is consistent with a latent period uniformly distributed between 1.2 and 2.0 days, and an infectious period of 4.0 days. The kernel shown as suitable for modelling SARS (Fig. 1b) is consistent with nobody being infectious before 4 days, everybody infectious by 7 days, everybody still infectious at 11 days and nobody infectious after 14 days; with the proportion infectious at intermediate times determined by linear interpolation. As well as allowing for some variability, the trapezoidal kernel has the advantage over the rectangular one that it is a continuous function, and this avoids problems with numerical schemes that do not allow discontinuities. Of course, if further information is available, then other distributions may be more appropriate.

Figure 2 compares estimates of the basic reproduction number based on fixed (Fig. 2a, 2b) and exponentially distributed (Fig. 2c, 2d) infectious periods, R ^fix₀ andR ^exp₀ , respectively, with the estimates based only on mean generation interval, $R_0^ - = 1 + r{T_G}$ and $R_0^ + = {e^{r{T_G}}}$. The estimate R ⁻₀ is inaccurate whenever rT _G is not small. Using the fixed infectious period model, R ⁺₀ approximates R ^fix₀ when T _E/T _G is near to one, that is when the infection has a long latent period and a short infectious period. The estimate R ⁻₀ is an approximation to R ^exp₀ if either the latent period or infectious period are very short, but R ⁺₀ is never a good estimator for R ^exp₀ and it’s use is therefore inconsistent with an SEIR model.

The estimates of R ₀ derived in this paper have the ordering R ⁻₀ ≤ R ^exp₀ ≤ R ^(m,n)₀ < R ^fix₀ ≤ R ⁺₀ . This inequality establishes that, given the same values of r, T _E and T _I, R ⁻₀ provides a closer estimate for R ^exp₀ than for R ^fix₀ , and R ⁺₀ provides a closer estimate for R ^fix₀ than for R ^exp₀ . Even though results derived from the gamma distributed kernel, R ^(m,n)₀ , are not displayed in Fig. 2, we have established that R ^fix₀ and R ^exp₀ are upper and lower bounds respectively for R ^(m,n)₀ . Note that for the model with a fixed infectious period, ${T_G} = {T_E} + {T_I}/2$, but for the model with an exponentially distributed infectious period, ${T_G} = {T_E} + {T_I}$. The inequality, and the results presented in Fig. 2 are derived on the assumption that T _E and T _G are the same in both models; T _I is defined consistently with the model, and hence differs between models. This is in contrast to the distributions presented in Fig. 1, which have the same latent and infectious periods, T _E and T _I, and hence different mean generation intervals T _G.

Table 1 shows results that could be obtained when estimating R ₀ for emerging infections, with parameters suitable for these infections. The parameter values indicated for each infection should be regarded as sensible values rather than exact estimates, and the results are presented to indicate the relevance of Fig. 2. For all four examples there is close agreement between R ^fix₀ and R ^trap₀ . This is not surprising for the influenza example where the trapezium kernel has steep sides (Fig. 1a), but also applies for examples such as SARS (Fig. 1b). The results in Table 1 confirm that if rT _G is small, hence R ⁻₀ is close to one, then R ⁻₀ may be used as an estimator for R ^exp₀ when an exponential model is appropriate. If a model with a fixed infectious period is more appropriate then R ⁺₀ is a better estimate for R ^fix₀ , especially when T _E /T _G is closer to one: compare for example the relative values of R ⁺₀ and R ^fix₀ for smallpox where T _E /T _G = 0.73 and FMD where T _E /T _G = 0.33.

Recently other estimation methods have been suggested as improvements on R ⁻₀ and R ⁺₀ . Wearing et al. [25] compared estimates based on R ^exp₀ with results obtained using gamma-distributed infection kernels. Lloyd [15] found similar results for models of within-host virus dynamics. Heffernan and Wahl [13] also examined the problem, and provided correction factors for estimates of R ₀ based on both the mean and variance of observed transition times. Wallinga and Lipsitch [23] used a similar approach to ours, and derived estimates of R ₀ for a selection of infectivity kernels f(t), including those derived from a gamma-distributed infectious period but only with T _E = 0. They also considered the case where f(t) is a Normal distribution; if employed though this distribution should be truncated to avoid the possibility of negative generation intervals.

Even though we selected a number of particular kernels for our study, and these cover a reasonable range of first choices, our method is applicable to all biologically sensible kernels. When estimating kernels from data, one should be careful. Estimates made early in an epidemic are likely to be based on household studies, and may be truncated due to local saturation of contacts. In addition, it is unclear how valid such estimates are when extrapolated to the wider community with multiple levels of mixing.

We must be careful in attempting to draw conclusions from our analysis. As a new infection emerges the appropriate model is speculative, and in any situation there is no such thing as the correct model. For example, in the context of pandemic influenza, Ferguson et al. [8] had T _G = 2.6 and T _E = 1.48 days, and estimated R ₀ ≈ R ⁻₀ , obtaining values in the range 1 – 2. Their model was more complex than those discussed here, but we have seen that for low values of rT _G, R ⁻₀ provides a reasonable estimate. Mills et al. [18] used an SEIR model, and estimated R from R ^exp₀ which is consistent with their model. Our approach is to advocate using an estimate of R ₀ that is consistent with the model used to evaluate control strategies.

References

Aldis G.K. and Roberts M.G. (2005). An integral equation model for the control of a smallpox outbreak. Math. Biosci. 195: 1–2
Article MATH MathSciNet Google Scholar
Abramowitz M. and Stegun I.A. (1964). Handbook of Mathematical Functions. Dover, New York
MATH Google Scholar
Anderson R.M. and May R.M. (1991). Infectious Diseases of Humans: Dynamics and Control. Oxford Press, New York
Google Scholar
Cauchemez S., Boële P.-Y., Donnelly C.L., Ferguson N.M., Thomas G., Lueng G.M., Hedley A.J., Anderson R.M. and Valeron A.-J. (2006). Real-time estimates in early detection of SARS. Emerg. Infect. Dis. 12: 110–13
Google Scholar
Choi B.C.K. and Pak A.W.P. (2003). A simple approximate mathematical model to predict the number of severe acute respiratory syndrome cases and deaths. J. Epidemiol. Commun. Health 57: 831–35
Article Google Scholar
Diekmann O. and Heesterbeek J.A.P. (2000). Mathematical Epidemiology of Infectious Diseases: Model Analysis and Interpretation. Wiley, New York
Google Scholar
Diekmann O., Heesterbeek J.A.P. and Metz J.A.J. (1990). On the definition and computation of the basic reproduction ratio R ₀ in models for infectious diseases in heterogeneous populations. J. Math. Biol. 28: 365–82
Article MATH MathSciNet Google Scholar
Ferguson N., Cummings D.A.T., Cauchemez S., Fraser C., Riley S., Meeyai A., Iamsirithaworn S. and Burke D.S. (2005). Strategies for containing an emerging influenza pandemic in Southeast Asia. Nature 437: 209–14
Article Google Scholar
Ferrari M.J., Bjørnstad O.N. and Dobson A.P. (2005). Estimation and inference of $ \mathcal{R}_0 $ of an infectious pathogen by a removal method Math. Biosci. 198: 14–6
Article MATH MathSciNet Google Scholar
Fine P.E.M. (2003). The interval between successive cases of an infectious disease. Am. J. Epidemiol. 158: 1039–047
Article Google Scholar
Haydon D., Chase-Topping M., Shaw D.J., Matthews L., Friar J., Wilesmith J. and Woolhouse M.E.J. (2003). The construction and analysis of epidemic trees with reference to the 2001 UK foot-and-mouth outbreak. Proc. R. Soc. Lond B 270: 121–27
Article Google Scholar
Heesterbeek J.A.P. (2002). A brief history of $ \mathcal{R}_0 $ and a recipe for its calculation Acta. Biotheor. 50: 189–04
Article Google Scholar
Heffernan J.M. and Wahl L.M. (2006). Improving Estimates of the basic reproductive ratio: using both the mean and dispersal of transition times. Theor. Popul. Biol. 70: 135–45
Article MATH Google Scholar
Lipsitch M., Cohen E., Cooper B., Robins J.M., Ma S., James L., Gopalakrishna G., Chew S.K., Tan C., Samore M.H., Fisman D. and Murray M. (2003). Transmission dynamics and control of severe acute respiratory syndrome. Science 300: 1966–970
Article Google Scholar
Lloyd A.L. (2001). The dependence of viral parameter estimates on the assumed viral life cycle: limitations of studies of viral load data. Proc. R. Soc. B 268: 847–54
Article Google Scholar
Lloyd-Smith J.O., Schreiber S.J., Kopp P.E. and Getz W.M. (2005). Superspreading and the effects of individual variation on disease emergence. Nature 438: 355–59
Article Google Scholar
Meester R., Diekmann O., Koning J. and Jong M.C.M. (2002). Modeling and real-time prediction of classical swine fever epidemics. Biometrics 58: 178–84
Article MathSciNet Google Scholar
Mills C.E., Robins J.M. and Lipsitch M. (2004). Transmissibility of 1918 pandemic influenza. Nature 432: 904–06
Article Google Scholar
Roberts M.G. (2004). Modelling strategies for minimizing the impact of an imported exotic infection. Proc. R. Soc. B 271: 2411–415
Article Google Scholar
Roberts M.G., Baker M., Jennings L.C., Sertsou G. and Wilson N. (2007). A model for the spread and control of pandemic influenza in an isolated geographical region. J. R. Soc. Interface 4: 325–30
Article Google Scholar
Stegeman J.A., Elbers A.R.W., Smak J. and Jong M.C.M. (1999). Quantification of the transmission of classical swine fever virus between herds during the 1997–998 epidemic in the Netherlands. Prevent. Veterinary Med. 42: 219–34
Article Google Scholar
Svensson A. (2007). A note on generation intervals in epidemic models. Math. Biosci. 208: 300–11
Article MATH MathSciNet Google Scholar
Wallinga J. and Lipsitch M. (2007). How generation intervals shape the relationship between growth rates and reproductive numbers. Proc. R. Soc. Ser. B 274: 599–04
Article Google Scholar
Wallinga J. and Teunis P. (2004). Different epidemic curves for severe acute respiratory syndrome reveal similar impacts of control measures. Am. J. Epidemiol. 160: 509–16
Article Google Scholar
Wearing H.J., Rohani P. and Keeling M.J. (2005). Appropriate models for the management of infectious diseases. PLoS Med. 7: 621–27
Google Scholar
Zhou G. and Yan G. (2003). Severe acute respiratory syndrome epidemic in Asia. Emerg. Infect. Dis. 9: 1608–610
Google Scholar

Download references

Acknowledgment

This work was supported by the EU Sixth Framework Programme for research for policy support (contract SP22-CT-2004-511066). The authors wish to thank three anonymous referees whose comments greatly improved the manuscript.

Author information

Authors and Affiliations

Centre for Mathematical Biology, Institute of Information and Mathematical Sciences, Massey University, Private Bag 102 904, North Shore Mail Centre, Auckland, New Zealand
M. G. Roberts
Faculty of Veterinary Medicine, University of Utrecht, Yalelaan 7, 3584 CL, Utrecht, The Netherlands
J. A. P. Heesterbeek

Authors

M. G. Roberts
View author publications
You can also search for this author in PubMed Google Scholar
J. A. P. Heesterbeek
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to J. A. P. Heesterbeek.

Appendices

Appendix 1: The derivation of R ^trap₀

To find the Laplace transform of the trapezoidal distribution (5), define the functions

$$\phi ({\tau _a},{\tau _b},s) = \int\limits_{{\tau _a}}^{{\tau _b}} {{e^{ - s\tau }}} d\tau = {{ - {e^{ - s{\tau _a}}} - {e^{ - s{\tau _b}}}} \over s}$$

and

$$\psi ({\tau _a},{\tau _b},s) = \int\limits_{{\tau _a}}^{{\tau _b}} {\tau {e^{ - s\tau }}} d\tau = {{{e^{ - s{\tau _a}}} - {e^{ - s{\tau _b}}}} \over {{s^2}}} + {{{\tau _a}{e^{ - s{\tau _a}}} - {\tau _b}{e^{ - r{\tau _b}}}} \over s}$$

Then, given an estimated value of r, the basic reproduction number solves

$${{R_0^{trap}} \over {{T_I}}}\left( {{{\psi ({\tau _a},{\tau _b},r) - {\tau _a}\phi ({\tau _a},{\tau _b},r)} \over {{\tau _b} - {\tau _a}}} + \phi ({\tau _b},{\tau _c},r) + {{{\tau _d}\phi ({\tau _c},{\tau _d},r) - \psi ({\tau _c},{\tau _d},r)} \over {{\tau _d} - {\tau _c}}}} \right) = 1$$

Appendix 2: The proof of inequality (8)

For the extended SEIR model (6)the probability that an individual infected at time zero is in one of the infected classes (E _i or I _j) at time t is found by solving the differential equations with β = 0, E ₁(0) = 1, E _i(0) = 0 for i = 2,…, m and I _j(0) = 0 for j = 1,…, n. In the Laplace transform domain the solutions are

$$\eqalign{ & {{{\rm{\bar E}}}_i}(s) = {{{{(m\nu )}^{i - 1}}} \over {{{(s + m\nu )}^i}}} \cr & {{\bar I}_j}(s) = {{{{(m\nu )}^m}} \over {{{(s + m\nu )}^m}}}{{{{(n\gamma )}^{j - 1}}} \over {{{(s + n\gamma )}^j}}} \cr}$$

The probability that an individual infected at time zero has become infectious or has ceased to be infectious by time t is h(t) or g(t) respectively, where

$$\eqalign{ & \bar h(s) = {{{{(m\nu )}^m}} \over {s{{(s + m\nu )}^m}}} \cr & \bar g(s) = {{{{(m\nu )}^m}{{(n\gamma )}^n}} \over {s{{(s + m\nu )}^m}{{(s + n\gamma )}^n}}} \cr} $$

Back-transforming,

$$h(t) = {F_{m,\nu }}(t){\rm{ }}g(t) = \int\limits_0^t = {{d{F_{m,\nu }}(x)} \over {dx}}{F_{n,\gamma }}(t - x)dx$$

where

$${F_{m,\nu }}(t) = P(m,m\nu t) = {1 \over {(m - 1)!}}\int\limits_0^{mvt} {{x^{m - 1}}{e^{ - x}}dx} $$

is a regularized incomplete gamma function (see 6.5.1, [2]). The infectivity kernel has Laplace transform

$$\bar f(s) = \gamma \left( {\bar h(s) - \bar g(s)} \right) = {{\gamma {{(m\nu )}^m}} \over {s{{(s + m\nu )}^m}}}\left( {1 - {{{{(n\gamma )}^n}} \over {{{\left( {s + n\gamma } \right)}^n}}}} \right)$$

The generation interval is given by the formula

$${T_G} = \int\limits_0^\infty {tf(t)dt = - \mathop {\lim }\limits_{s \to 0} {{d\bar f} \over { ds }} = {1 \over \nu } + {{n + 1} \over {2n\gamma }}} $$

If the initial rate of exponential increase of the epidemic is r, then R ^(m,n)₀ ¯f(r) = 1, hence

$$R_0^{(m,n)} = {{{r \over \gamma }{{\left( {1 + {r \over {m\nu }}} \right)}^m}} \over {1 - {{\left( {1 + {r \over {n\gamma }}} \right)}^{ - n}}}}$$

This result may be found in Wearing et al. [25]. The function (1 + x/m)^m is positive for positive x, and increases monotonically from 1 + x to e ^x as m increases from 1 to ∞. Hence, given values of r, ν and γ, R ^(m,n)₀ is an increasing function of m, but a decreasing function of n. However, substituting ${1 \over v} = {T_E}$ and ${1 \over \gamma } = {{2n} \over {n + 1}}({T_G} - {T_E})$ we obtain

$$R_0^{(m,n)} = {{{{2nr} \over {n + 1}}\left( {{T_G} - {T_E}} \right){{\left( {1 + {r \over m}{T_E}} \right)}^m}} \over {1 - {{\left( {1 + {{2r} \over {n + 1}}\left( {{T_G} - {T_E}} \right)} \right)}^{ - n}}}}$$

The function

$${f_n}(x) = {{{{nx} \over {n + 1}}} \over {1 - {{\left( {1 + {x \over {n + 1}}} \right)}^{ - n}}}}$$

is positive for positive x, and increases monotonically from 1+ x/2 to ${{x{e^x}} \over {{e^x} - 1}}$ as n increases from 1 to ∞. The proof is a straightforward but tedious manipulation of expressions: we multiply the numerator and denominator of the expression for f _n(x) by (1+x/(n+1))ⁿ and then show that ${{{f_{n + 1}}(x)} \over {{f_n}(x)}} > 1$ for n≥ 1 and x > 0. Hence, given values of r, T _E and T _G, R ^(m,n)₀ is an increasing function of both m and n. In the limit as m and n tend to infinity, R ^(m,n)₀ tends to R ^fix₀ . Hence for all positive finite integers m and n, $R_0^{\exp } \le R_0^{(m,n)} < R_0^{fix}$, completing the proof of inequality (8).

Rights and permissions

Open Access This is an open access article distributed under the terms of the Creative Commons Attribution Noncommercial License ( https://creativecommons.org/licenses/by-nc/2.0 ), which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Reprints and permissions

About this article

Cite this article

Roberts, M.G., Heesterbeek, J.A.P. Model-consistent estimation of the basic reproduction number from the incidence of an emerging infection. J. Math. Biol. 55, 803–816 (2007). https://doi.org/10.1007/s00285-007-0112-8

Download citation

Received: 17 August 2006
Revised: 23 May 2007
Published: 08 August 2007
Issue Date: November 2007
DOI: https://doi.org/10.1007/s00285-007-0112-8

Keywords

Mathematics Subject Classification (2000)

92B05

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Model-consistent estimation of the basic reproduction number from the incidence of an emerging infection

Abstract

Similar content being viewed by others

On the Reproduction Number of Epidemics with Sub-exponential Growth

A Note on Observation Processes in Epidemic Models

Population Dynamics of Infectious Diseases

1 1 Introduction

2 2 Model-consistent estimation of R ₀

2.1 2.1 Fixed infectious period

2.2 2.2 Trapezoidal infection kernel

2.3 2.3 SEIR differential equation models

2.4 2.4 Exponentially distributed infectious period

3 3 Method and results

4 4 Conclusions and discussion

References

Acknowledgment

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendix 1: The derivation of R ^trap₀

Appendix 2: The proof of inequality (8)

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification (2000)

Navigation

Model-consistent estimation of the basic reproduction number from the incidence of an emerging infection

Abstract

Similar content being viewed by others

On the Reproduction Number of Epidemics with Sub-exponential Growth

A Note on Observation Processes in Epidemic Models

Population Dynamics of Infectious Diseases

1 1 Introduction

2 2 Model-consistent estimation of R 0

2.1 2.1 Fixed infectious period

2.2 2.2 Trapezoidal infection kernel

2.3 2.3 SEIR differential equation models

2.4 2.4 Exponentially distributed infectious period

3 3 Method and results

4 4 Conclusions and discussion

References

Acknowledgment

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendix 1: The derivation of R trap0

Appendix 2: The proof of inequality (8)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification (2000)

Search

Navigation

2 2 Model-consistent estimation of R ₀

Appendix 1: The derivation of R ^trap₀