3.1 Introduction

Two applications of multi-type branching processes to epidemic models are presented. The first application is to an SEIR epidemic model and the second application is to the same epidemic model but with dispersal. The SEIR epidemic is modeled as a two-type branching process. Occurrence of an outbreak depends on the number of exposed and infectious individuals. It is shown that the offspring pgfs for the exposed and infectious populations lead to an explicit formula for the probability of an outbreak. In the SEIR model with dispersal, the case of two regions with different healthcare situations are considered. One region has poor healthcare versus another region with excellent healthcare. It is shown that the rate and the direction of movement have a large impact on the occurrence of an outbreak. Branching process theory is used to investigate the probability of an outbreak when the movement rates differ between the two regions.

Although the SIR and SEIR epidemic models are simple, they are often used as a first approximation during or after disease outbreaks to provide estimates of the potential spread of the disease or to understand the pattern of spread. For example, SIR and SEIR epidemic models in conjunction with data provided useful information about the spread of the 2002–2003 SARS (Severe Acute Respiratory Syndrome) pandemic which began in China, the 2009–2010 H1N1 influenza pandemic which began in Mexico, and the 2014 Ebola outbreak in Africa [11, 22, 33].

3.2 SEIR Epidemic

Consider an SEIR epidemic model, where S, E, I, and R are the susceptible, exposed, infectious, and recovered individuals, respectively. With disease-related mortality rate α, the population size is not constant, \(S(t) + E(t) + I(t) + R(t) = N(t)\). The deterministic SEIR ODE model has the form:

$$\displaystyle{ \begin{array}{ll} \dfrac{dS(t)} {dt} & = -\beta \dfrac{S(t)} {N(t)}I(t) \\ \dfrac{dE(t)} {dt} & =\beta \dfrac{S(t)} {N(t)}I(t) -\delta E(t) \\ \dfrac{dI(t)} {dt} & =\delta E(t) -\gamma I(t) -\alpha I(t) \\ \dfrac{dR(t)} {dt} & =\gamma I(t).\end{array} }$$
(3.1)

Births, deaths, and temporary immunity are not included in this model. However, the basic reproduction number near the disease-free state has the same form as in the SIRS model considered in Chapter 2 [36]:

$$\displaystyle{ \mathcal{R}_{0} = \dfrac{\beta } {\gamma +\alpha }. }$$
(3.2)

For the CTMC SEIR epidemic model, let \(\mathbf{X}(t) = (X_{1}(t),X_{2}(t),X_{3}(t),X_{4}(t))\) denote the discrete random variables for the four states, (S(t), E(t), I(t), R(t)). The transition rates for the CTMC SEIR epidemic model (MC Rates) and those for the corresponding branching process approximation (BP rates) for exposed and infectious populations, X 2 and X 3 are given in Table 3.1. Because the rates are nonlinear, the solution of the deterministic model does not represent the mean of the stochastic model. Note that event 1 has a nonlinear transition rate in the MC model but a linear rate in the branching process approximation.

Table 3.1 Transition rates for the CTMC SEIR epidemic model (MC Rates) and for the corresponding branching process approximation for exposed and infectious individuals (BP Rates).

To compute the probability of epidemic extinction for the multi-type branching process, the pgfs for the random variables, X 2 and X 3, are defined. Applying the transition rates from Table 3.1, the pgfs for X 2 and X 3 are

$$\displaystyle\begin{array}{rcl} f_{1}(s_{1},s_{2})& =& \dfrac{\delta s_{2}} {\delta } {}\\ f_{2}(s_{1},s_{2})& =& \dfrac{\beta s_{1}s_{2} +\gamma +\alpha } {\beta +\gamma +\alpha }. {}\\ \end{array}$$

Although f 1 is a simple function, f 2 is not. The expectation matrix of the pgfs is

$$\displaystyle{M = \left [\begin{array}{*{10}c} 0 & 1\\ \dfrac{\beta } {\beta +\gamma +\alpha } & \dfrac{\beta } {\beta +\gamma +\alpha } \end{array} \right ].}$$

Matrix \(J =\varLambda (M - I)\) is

$$\displaystyle{J = \left [\begin{array}{*{10}c} -\delta & \delta \\ \beta &-\gamma -\alpha \end{array} \right ],}$$

where \(\varLambda =\mathrm{ diag}(\delta,\beta +\gamma +\alpha )\). Both matrices are irreducible. It is clear that if \(\mathcal{R}_{0} > 1\), the branching process is supercritical. In particular, if \(\mathcal{R}_{0} > 1\), the unique fixed point of the pgfs is (q 1 , q 2 ) ∈ (0, 1)2, where \(q_{i}^{{\ast}} = 1/\mathcal{R}_{0}\) (Whittle’s result). The difference between the SIR and SEIR CTMC models is that the exposed period increases the time until extinction and the presence of both exposed and infectious individuals increases the probability of an outbreak. This latter result can be seen in the probability of extinction (no major outbreak) for the CTMC SEIR epidemic model, which is given by

$$\displaystyle{\mathbb{P}_{0}(e_{0},i_{0}) \approx (1/\mathcal{R}_{0})^{e_{0}+i_{0} }.}$$

A numerical example of the SEIR CTMC model along with the deterministic solution is given in Figure 3.1. Three of the four sample paths represent an outbreak, whereas in one there is no outbreak. The probability of a major outbreak is \(1 - \mathbb{P}_{0}(1,0) = 0.625\).

Fig. 3.1
figure 1

Four sample paths of the CTMC SEIR epidemic model along with the deterministic solution (dashed curve). Parameter values are β = 0. 4, δ = 0. 4, γ = 0. 1, α = 0. 05, and ε = 0. Initial values are S(0) = 499, E(0) = 1, \(I(0) = 0 = R(0)\). The basic reproduction number \(\mathcal{R}_{0} = 2.67\). The probability of a major outbreak is \(1 - \mathbb{P}_{0}(1,0) = 0.625\).

If additional mortality occurs during the exposed period at rate ε X 2(t), then the pgf for X 2 is

$$\displaystyle{f_{1}(s_{1},s_{2}) = \dfrac{\delta s_{2}+\epsilon } {\delta +\epsilon }.}$$

The basic reproduction number for the ODE model with mortality during the exposed period differs from (3.2) and is equal to

$$\displaystyle{ \mathcal{R}_{0} = \dfrac{\beta \delta } {(\delta +\epsilon )(\gamma +\alpha )}. }$$
(3.3)

If \(\mathcal{R}_{0} > 1\), then the fixed point can be explicitly determined,

$$\displaystyle\begin{array}{rcl} q_{1}^{{\ast}}& =& \dfrac{\delta } {\delta +\epsilon } \dfrac{1} {\mathcal{R}_{0}} + \dfrac{\epsilon } {\delta +\epsilon } {}\\ q_{2}^{{\ast}}& =& \dfrac{1} {\mathcal{R}_{0}}. {}\\ \end{array}$$

A model with natural births and deaths in all stages yields a similar result [4]. The value for the probability of extinction is greater in the exposed period than in the infectious period, q 1  > q 2 . This is a reasonable result since in the exposed period, individuals may die with probability \(\epsilon /(\delta +\epsilon )\) before becoming infectious or become infectious but not transmit the disease with probability \(\delta /[(\delta +\epsilon )\mathcal{R}_{0}]\).

3.3 Epidemic Dispersal

Suppose disease is spread between two populations each occupying different regions or patches and modeled by the SEIR epidemic equations within each patch. In population 1, poor healthcare facilities result in frequent disease outbreaks. In population 2, better healthcare facilities and reduced mortality and recovery rates result in no major outbreaks. For population 1, the basic reproduction number is greater than one but for population 2, the basic reproduction number is less than one. With dispersal between these two populations, the outcome changes depending on the direction and the rate of dispersal.

Let the disease parameters for each of these populations be denoted as β i , δ i , γ i , and ε i , i = 1, 2. The rate of dispersal from population 1 to 2 is m 1 and the rate from population 2 to 1 is m 2. For simplicity, the model assumes that all individuals within each stage, S, E, I, or R, disperse at the same rates, i.e., with rates m 1 and m 2. The disease is spread by the movement of exposed or infectious individuals between these two populations. The deterministic model for population 1 with dispersal is

$$\displaystyle{ \begin{array}{ll} \dfrac{dS_{1}(t)} {dt} & = -\beta _{1} \dfrac{S_{1}(t)} {N_{1}(t)}I_{1}(t) - m_{1}S_{1}(t) + m_{2}S_{2}(t) \\ \dfrac{dE_{1}(t)} {dt} & =\beta _{1} \dfrac{S_{1}(t)} {N_{1}(t)}I_{1}(t) -\delta _{1}E_{1}(t) -\epsilon _{1}E_{1}(t) - m_{1}E_{1}(t) + m_{2}E_{2}(t) \\ \dfrac{dI_{1}(t)} {dt} & =\delta _{1}E_{1}(t) -\gamma _{1}I_{1}(t) -\alpha _{1}I_{1}(t) - m_{1}I_{1}(t) + m_{2}I_{2}(t) \\ \dfrac{dR_{1}(t)} {dt} & =\gamma _{1}I_{1}(t) - m_{1}R_{1}(t) + m_{2}R_{2}(t).\end{array} }$$
(3.4)

A similar system holds for population 2. Without dispersal, m i  = 0, the basic reproduction number \(\mathcal{R}_{0i}\), i = 1, 2, for each population is given by formula (3.3), where the parameters for population 1 or 2 have subscripts 1 or 2, respectively. We assume \(\mathcal{R}_{01} > 1\) and \(\mathcal{R}_{02} < 1\).

A branching process approximation for the corresponding CTMC SEIR model for two patches can be applied if the population size is large but the exposed and infectious population sizes are small. For the branching process, we are only interested in the exposed and infectious stages. The direction and the rate of movement of individuals in these disease stages have a large impact on the probability of an outbreak.

Let \(\mathbf{X}(t) = (X_{1}(t),X_{2}(t),X_{3}(t),X_{4}(t))\) denote the four discrete random variables for stages E 1,  I 1, E 2, and I 2, respectively. The four probability generating functions of the approximating branching process are

$$\displaystyle{ \begin{array}{ll} f_{1}(s_{1},s_{2},s_{3},s_{4})& = \dfrac{\delta _{1}s_{2} +\epsilon _{1} + m_{1}s_{3}} {\delta _{1} + m_{1} +\epsilon _{1}} \\ f_{2}(s_{1},s_{2},s_{3},s_{4})& = \dfrac{\beta _{1}s_{1}s_{2} +\gamma _{1} +\alpha _{1} + m_{1}s_{4}} {\beta _{1} +\gamma _{1} +\alpha _{1} + m_{1}} \\ f_{3}(s_{1},s_{2},s_{3},s_{4})& = \dfrac{\delta _{2}s_{4} +\epsilon _{2} + m_{2}s_{1}} {\delta _{2} +\epsilon _{2} + m_{2}} \\ f_{4}(s_{1},s_{2},s_{3},s_{4})& = \dfrac{\beta _{2}s_{3}s_{4} +\gamma _{2} +\alpha _{2} + m_{2}s_{2}} {\beta _{2} +\gamma _{2} +\alpha _{2} + m_{2}}. \end{array} }$$
(3.5)

If the spectral radius of the expectation matrix ρ(M) > 1, then the process is supercritical. A formula for the minimal fixed point of (3.5), (q 1 , q 2 , q 3 , q 4 ) can be computed numerically. In the supercritical case, the probability of no major outbreak is approximately

$$\displaystyle{\mathbb{P}_{0}(e_{10},i_{10},e_{20},i_{20}) = (q_{1}^{{\ast}})^{e_{10} }(q_{2}^{{\ast}})^{i_{10} }(q_{3}^{{\ast}})^{e_{20} }(q_{4}^{{\ast}})^{i_{20} },}$$

where e j0 and i j0 are the initial number of exposed and infectious individuals in patch j, respectively. The probability of a major outbreak is \(1 - \mathbb{P}_{0}(e_{10},i_{10},e_{20},i_{20}).\)

An example with equal dispersal rates for the two populations, m 1 = m 2, results in outbreaks in both populations. The probability of an outbreak increases in population 1 but decreases in population 2. In Figure 3.2, population 1 has \(\mathcal{R}_{01} = 2.67\) but in population 2 there is lower transmission, higher recovery, and lower mortality, so that \(\mathcal{R}_{02} = 0.89\). One infectious individual introduced into population 1 gives a probability for no major outbreak, \(\mathbb{P}_{0}(0,0,0,1) = 0.751\).

Fig. 3.2
figure 2

Four sample paths of the SEIR CTMC model with dispersal along with the ODE solution (dashed curve). Parameter values are β 1 = 0. 4, β 2 = 0. 2, \(\delta _{1} = 0.4 =\delta _{2}\), γ = 0. 1, γ 2 = 0. 05, α 1 = 0. 05, α 2 = 0. 025, \(\epsilon _{1} = 0 =\epsilon _{2}\), and \(m_{1} = 0.05 = m_{2}\). Initial values are S 1(0) = 500, S 2(0) = 499, \(E_{1}(0) = 0 = E_{2}(0)\), I 1(0) = 1, and \(R_{1}(0) = 0 = R_{2}(0)\). The basic reproduction numbers for each population are \(\mathcal{R}_{10} = 2.67\) and \(\mathcal{R}_{02} = 0.89\). Probability of no major outbreak is \(\mathbb{P}_{0}(0,0,0,1) = 0.751\).

With unequal dispersal between the two populations, the probability of an outbreak depends on the direction and magnitude of the dispersal rates (as in the birth-death-dispersal model in Chapter 1). In Figure 3.3, the fixed points (q 1 , q 2 , q 3 , q 4 ) of the pgfs in (3.5) are computed numerically given both m 1 and m 2 lie in the range [0, 0. 2]. If dispersal is greater toward the population with good healthcare facilities it is possible to eradicate disease in both populations, that is, no major outbreaks occur. Allowing movement out of the poor healthcare region but restricting movement into the region (bottom left graph in Figure 3.3) is a good strategy for disease control, but restricting movement out of the poor healthcare strategy but allowing movement into that region (bottom right graph in Figure 3.3) is a poor strategy.

Fig. 3.3
figure 3

Probability of no major outbreak for a CTMC SEIR epidemic model with dispersal between two populations as estimated by the fixed points q i . Dispersal from population 1 to 2 is m 1 and from population 2 to 1 is m 2. Parameter values are the same as in Figure 3.2, except for the dispersal parameters: β 1 = 0. 4, β 2 = 0. 2, \(\delta _{1} = 0.4 =\delta _{2}\), γ = 0. 1, γ 2 = 0. 05, α 1 = 0. 05, α 2 = 0. 025, \(\epsilon _{1} = 0 =\epsilon _{2}\).

3.4 Summary

The multi-type branching process application to an SEIR epidemic with dispersal illustrates the importance of controlling movement into and out of particular regions to prevent an outbreak. Although prevention and control measures are more complex in real epidemic or pandemic situations, the basic SIR and SEIR models are often used in conjunction with data to help estimate the potential spread of the disease, e.g., SARS, influenza, and Ebola [11, 22, 33]. The control measures in pandemic situations often include travel restrictions, quarantine, isolation, and drugs such as antiviral medication to prevent infection. Other specific applications of branching processes to infectious disease models include vector-transmitted diseases [4, 6, 9, 18], HIV infection, [12] and bovine respiratory syncytial virus [19].