1 Introduction

The outcome of a single epidemic involving two competing types of a pathogen is key to our theoretical understanding of several important epidemiological questions. For an outbreak of an emerging disease, e.g., an influenza pandemic, the outcome of such a mixed epidemic characterizes the early stages of pathogen adaptation to the new host population (Antia et al. 2003; Holmes 2009; Alexander and Day 2010; Kubiak et al. 2010). In influenza drift, a similar situation occurs in each season as the epidemic is seeded by a genetically and antigenically heterogeneous pathogen population or mutants arise during the early stages of the epidemic (Koelle et al. 2006; Boni et al. 2006; Lipsitch et al. 2007; Omori et al. 2010; Ho and Chao 2017). For plant disease evolution, the scenario describes the simultaneous spread of a wild-type and a mutant in an uninfected field (Ohtsuki and Sasaki 2006) and coexistence of multiple strains of (baculo) virus in annual insect populations is determined by similar epidemic interactions (Fleming-Davies et al. 2015). Furthermore, the modeling structure is analogous to that describing spontaneous behavioral changes in response to a severe epidemic (Poletti et al. 2009; Funk et al. 2009; Kiss et al. 2010).

Pathogen competition has been studied extensively in the theoretical literature for the case where the host–pathogen interaction occurs at an endemic equilibrium, i.e., in a situation where demographic turn-over and disease transmission have established an epidemiological equilibrium. Long-term coexistence at the endemic equilibrium is not possible (Bremermann and Thieme 1989) unless additional factors are included; see, for example, Gjini et al. (2016) and Lipsitch et al. (2009). The present study differs in that we are concerned with competition during a single epidemic, i.e., we study competition occurring in a host population whose immunity structure is dynamically changing in response to disease transmission and loss of immune hosts.

A previous study on competition during an epidemic by Saunders (1981) and Kendall and Saunders (1983) has focused on establishing the basic model structure and on characterizing the possible outcome of the epidemic in the case of full cross-immunity. Multi-strain epidemics have also been studied in the context of epidemics on networks (Funk and Jansen 2010; Marceau et al. 2011; Karrer and Newman 2011; Miller 2013; Hebert-Dufresne et al. 2013), but their results should be compared to the stochastic formulation by Svensson and Scalia Tomba (2001).

The present paper will rely heavily on the methods of Kendall and Saunders, and the expression “epidemics in competition” is a reference to the title of their original work. Specifically, we extend their results to the case where the two strains confer only partial cross-immunity. Partial cross-immunity is particularly important for influenza drift, though similar issues arise when modeling the introduction of dengue and malaria to new areas. In addition, partial immunity allows us to model early stages of pathogen speciation as our formulation will span the full range from two identical pathogens to two immunologically separate species. The existence of cross-reactions among strains is well documented. However, the exact effect is often unclear and may vary from system to system (Ackerman et al. 1990; Gomes and Medley 2001; Gog and Swinton 2002). In this paper, we focus on the case where infection cofers complete immunity to the infecting strain while cross-immunity acts by reducing infectivity during a subsequent infection with the other strain. Our main motivation for this choice is mathematical, as it is well known that this formulation is particularly amenable for analytic investigation (Gupta et al. 1998; Ferguson and Andreasen 2001).

In its simplest form—and omitting for now the possibility of partial cross-immunity—the problem may be described as follows: Imagine that a pathogen is introduced into a susceptible host population and that the pathogen occurs in two variants which differ in their transmission properties. What will be the outcome of such an epidemic in terms of the size of the epidemic and its distribution among the two types?

Clearly, the answer depends on the exact way in which the two pathogens are introduced. Figure 1 shows the range of possible outcomes, and in particular, Fig. 1a, c shows the extreme cases where one pathogen is introduced sufficiently early that its has caused and completed its epidemic prior to the arrival of the other type, while Fig. 1b shows an intermediate situation where the two epidemics overlap. The extreme cases are easy to characterize analytically as they consist of a pair of sequential epidemics, each involving only a single pathogen type (Andreasen and Sasaki 2006; Newman 2005). However, for the overlapping epidemics in Fig. 1b the outcome depends on the absolute size and composition of the initial introductions, suggesting that it would be necessary to include the stochastic effects of small pathogen population size during the early phases of the epidemic to obtain a complete characterization (Svensson and Scalia Tomba 2001).

Here, we will take another approach focusing on those aspects of the epidemic interaction that may be determined deterministically. This implies that we will not aim at providing a unique answer but rather provide a range of possible outcomes.

Fig. 1
figure 1

Three characteristic epidemics that can arise for a system of two competing viral strains conferring complete cross-protection in a homogeneously mixing host population. White area indicates prevalence of the most transmissible type (\({{{\mathfrak {R}}_0}}=2\)); black area prevalence of least transmissible type (\({{{\mathfrak {R}}_0}}=1.25\)). The three epidemic curves differ only in initial conditions

Our starting point is the two-strain epidemic with full cross-protection as this case leads to the least complicated model while still allowing us to present the basic model structure in Sect. 2, and to develop the necessary mathematical methods in Sect. 3. In Sect. 4, we extend the analysis to the case where partial cross-immunity acts by reducing infectivity for infected hosts who are already immune to the other strain, and in Sect. 5, we show how the strength of cross-immunity can span the full range of epidemics in competition.

2 Two Pathogen Types and Full Cross-Immunity

For a homogeneously mixing host population, the state of the epidemic may be captured by specifying the fraction of the host population that is susceptible S and the fractions that are currently infected by each pathogen type IY. Neglecting the possibility of coinfection and assuming an exponential infectious period with recovery rates \(\nu _{K}\) and transmission coefficients \(\beta _{K}, K=I,Y\), the transmission dynamics are described by

$$\begin{aligned} \dot{S}= & {} -\mathop \beta \nolimits _{I}SI - \mathop \beta \nolimits _{Y}SY \end{aligned}$$
(1)
$$\begin{aligned} \dot{I}= & {} \mathop \beta \nolimits _{I}SI-\nu _{I} I \end{aligned}$$
(2)
$$\begin{aligned} \dot{Y}= & {} \mathop \beta \nolimits _{Y}SY-\nu _{Y}Y . \end{aligned}$$
(3)

We will assume that the disease is non-lethal such that the population size remains constant and omit explicit reference to the dynamics of the recovered class as its size is redundant.

In the next section, we will show that for any nonnegative initial condition (siy) there exists a number \(s_\infty \) such that \(S(t)\rightarrow s_\infty \) and that \(I(t), Y(t)\rightarrow 0\) for \(t\rightarrow \infty .\) It is convenient to express the outcome in terms of the fraction of the susceptible population that remains uninfected at the end of the epidemic \(\sigma = s_\infty /s\) and the fraction of hosts that became infected by either strain during the entire course of the epidemic \(A_{I}= \nu _{I}\int _0^\infty I\, dt\) with a similar definition of \(A_Y\).

We notice that the quantities

$$\begin{aligned} \dot{S}/S= & {} -\mathop \beta \nolimits _{I} I -\mathop \beta \nolimits _{Y} Y \end{aligned}$$
(4)
$$\begin{aligned} \dot{S}+\dot{I}+\dot{Y}= & {} -\nu _{I} I-\nu _{Y} Y \end{aligned}$$
(5)

are linear expressions in I and Y. We will refer to quantities with this property as (generalized) first integrals.

First integrals are useful because we can obtain information about \(\sigma ,\) \(A_{I},\) and \(A_{Y}\) by integration of (45) over the entire epidemic period:

$$\begin{aligned} \int _0^\infty \dot{S}/S\, dt= & {} -\mathop \beta \nolimits _{I}\int ^\infty _0 I\,dt-\mathop \beta \nolimits _{Y}\int ^\infty _0 Y\,dt\\ \int ^\infty _0 \dot{S} + \dot{I} +\dot{Y} \,dt= & {} -\nu _{I}\int _0^\infty I\,dt-\nu _{Y}\int _0^\infty Y\,dt, \end{aligned}$$

which yields two conditions in the three unknowns \((\sigma ,A_I,A_Y)\)

$$\begin{aligned} -\,\log \sigma&= {\mathfrak {R}}_0^I A_I + {\mathfrak {R}}_0^Y A_Y \end{aligned}$$
(6)
$$\begin{aligned} s(1-\sigma ) + i+y&= A_I + A_Y. \end{aligned}$$
(7)

It is remarkable that the range of outcomes from the competing epidemics depends only on the reproduction numbers \({\mathfrak {R}}_0^K=\frac{\mathop \beta \nolimits _{K}}{\nu _K}\) and not explicitly on the growth rates as is the case for epidemics on networks (Karrer and Newman 2011).

The condition (7) may be seen as a simple balance equation dividing the total population between those still susceptible and those who were infected with either pathogen type at some point during the epidemic. The first condition (6) generalizes the final size equation to a mixed epidemic.

Fig. 2
figure 2

Graphical solution of the final size problem for an epidemic with two types of the disease. The fraction of hosts that are still susceptible at the end of the epidemic is denoted \(\sigma \). The value of \(\sigma \) depends on initial conditions, and thus, each value of \(\sigma \) corresponds to a possible outcome of the epidemic. A vertical line divides the unit interval into three segments representing (from bottom to top) the fraction of the population infected with type I, those infected with type Y and the fraction of the host population that remains uninfected during the entire epidemic. Solid points represent the outcomes of numerical solutions to the model (13) for initial conditions of the form \(I(0)= 0.00001/N, Y(0)= 0.99998/N\) with varying N. Panel a shows the case where sequential epidemics are not possible since \(\sigma _\infty ^Y<{\mathfrak {R}}_0^I\). Parameter values are \({\mathfrak {R}}_0^I=1.6,\) \({\mathfrak {R}}_0^Y=1.3,\) and \(\nu _I=\nu _Y=1\). Panel b shows the case where sequential epidemics may occur. The open circle represents the sequential epidemics where the inferior type has completed its epidemic prior to the arrival of the superior competitor. Sequential epidemics can arise only if \(1/{\mathfrak {R}}_0^I<\sigma ^Y_\infty \). Parameter values are \({\mathfrak {R}}_0^I=2.0,\) \({\mathfrak {R}}_0^Y=1.25,\) and \(\nu _I=\nu _Y=1\)

To determine the three quantities \((\sigma ,A_I,A_Y)\), one would need one additional condition. However, this third condition will depend on the size and composition of the pathogen introduction and we will not explore this issue here. We first discuss what information can be obtained from the two conditions (67). Due to the invariance of the first octant, we observe that every trajectory of the differential equation (13) with initial conditions of the form \(s,i,y\ge 0\) and \(s+i+y\le 1\) will satisfy (67). It is, however, not clear if every positive point \((\sigma ,A_I,A_Y)\) satisfying (67) corresponds to a solution of the differential equation.

Our primary interest is the condition during a pandemic arising from a newly introduced pathogen, so for now we focus on the situation where \(s\approx 1\) and \(0<i,y\ll 1\). Without loss of generality, we may also assume that \({\mathfrak {R}}_0^I>{\mathfrak {R}}_0^Y\).

Considering the fraction of the population that remains susceptible \(\sigma \) as a parameter, we can determine the fraction that become infected with each of the pathogen types as

$$\begin{aligned} A_I(\sigma )= & {} \frac{1}{{\mathfrak {R}}_0^I-{\mathfrak {R}}_0^Y}\left( {\mathfrak {R}}_0^Y(1-\sigma )+\log \sigma \right) \end{aligned}$$
(8)
$$\begin{aligned} A_Y(\sigma )= & {} \frac{-1}{{\mathfrak {R}}_0^I-{\mathfrak {R}}_0^I}\left( {\mathfrak {R}}_0^I(1-\sigma )+\log \sigma \right) \end{aligned}$$
(9)

The two curves are shown in Fig. 2. Remarkably, the magnitudes of \(A_I\) and \(A_Y\) are proportional to the final size expression \(f_L: \sigma \mapsto {\mathfrak {R}}_0^L(1-\sigma )+\log \sigma \) for the other type \(L=Y,I\). The graph for \(f_I\) may be seen from the curve marked \(A_Y\) in Fig. 2, and we have \(f_K(0^+)=-\infty \), \(f_K(1)=0\), and \(f_K'(1)= 1- {\mathfrak {R}}_0^K\) so that for \({\mathfrak {R}}_0^K>1,\) \(f_K\) has a unique root \(\sigma ^K_\infty \) in the interval (0, 1) giving the fraction of hosts that are susceptible at the end of an epidemic consisting entirely of type K pathogens. We observe that \(A_I\) is positive for \(\sigma < \sigma ^Y_\infty \). Similarly, \(A_Y\) is positive for \(\sigma > \sigma ^I_\infty \), and hence, feasible solutions occur only in the interval \( \sigma ^I_\infty \le \sigma \le \sigma ^Y_\infty \). Figure 2b suggests that not all feasible solutions may arise as outcome of the epidemic. We shall discuss this issue in the next section.

Remark In this section, we have characterized the epidemic in terms of the cumulated attack rate \(A_I\) as this allows for the simple interpretation of Fig. 2. For the more complex situation in Sect. 4, the cumulated force of infection \(\alpha = \int \varLambda \, dt= \int \beta I\, dt\)—or alternatively the probability of remaining uninfected \(\pi =\exp (-\,\int \varLambda \, dt)\)—gives a more compact description.

3 Elementary Properties of the Model

From a mathematical view point, model (13) is somewhat awkward as it has a continuum of equilibria where every point of the form (s, 0, 0) is a stationary point. Thus, the model is not “generic” and results from dynamical systems theory are of limited use. For the classical model of an epidemic with a single infectious agent, the model is typically solved by explicitly determining the solution curves using S as the independent variable. A similar approach is not available for our model, so instead we will in this section use elementary methods to study the map \(\varPhi \) that connects to every possible initial condition the corresponding final outcome of the epidemic. The results are essentially reformulations of the statements in (Kendall and Saunders 1983, Theorem 1).

Although our main interest is the limiting case where the initial infectious seed is small \(0<I(0), Y(0)\approx 0\), we first expand our problem allowing for any initial condition where no hosts are immune at the onset of the epidemic. This allows us to separate limiting processes in the time domain from limiting processes in the initial conditions.

If the entire population is either susceptible or infected at the onset of the epidemic, the initial condition \(\left( S(0),I(0),Y(0)\right) =(s,i,y)\) satisfies \(s+i+y=1\) so that the set of initial conditions may be described by the 2-D simplex

$$\begin{aligned} \varGamma = \{\ (i,y)\ |\ i,y\ge 0, \ 1\ge i+y \}. \end{aligned}$$

For the methods, we develop in this section it is natural to express the final size in terms of \(s_\infty \)—the actual size of the susceptible population—rather than to measure the final size relative to the size of the susceptible population at epidemic onset \(\sigma = s_\infty /s\) as we do in Sects. 2 and 4, but the conversion between \(\sigma \) and \(s_\infty \) is straightforward.

We first prove (the expected fact) that every trajectory goes to a limit point as time goes to infinity.

Theorem 1

For every initial condition in \(\varGamma \), there exists a number \(s_\infty \) so that the corresponding solution \(\left( S(t),I(t),Y(t)\right) \) to model (13) satisfies

$$\begin{aligned} \lim _{t\rightarrow \infty } S(t) = s_\infty \quad \lim _{t\rightarrow \infty } I(t)= \lim _{t\rightarrow \infty } Y(t) = 0. \end{aligned}$$

The theorem and its proof are generalizations of results by Brauer (2008). The proof builds on the following elementary

Lemma 1

(Brauer 2008) If f(t) is a nonnegative monotone non-increasing continuously differentiable function, then there exists \(f_\infty \ge 0\) such that as \(t\rightarrow \infty ,\) \(f(t)\rightarrow f_\infty ,\) and in addition \(f'(t)\rightarrow 0.\)

To see that Theorem 1 holds, observe that by the invariance of the planes \(I=0\) and \(Y=0\), we have that \(I,Y \ge 0\) and hence that S is a non-increasing function. Since \((S+I+Y)'=-(\nu _I I+\nu _Y Y),\) we conclude that if \(I(0)+Y(0)>0\) then \(S+I+Y\) is decreasing and hence there exists \(s_\infty \ge 0\) such that \((S(t)+I(t)+Y(t)) \rightarrow s_\infty \) as \(t\rightarrow \infty \). By Brauer’s lemma, we know that \(\nu _II+\nu _YY\rightarrow 0\) for \(t\rightarrow \infty .\) Hence, by the positivity of I and Y we have that \( I(t), Y(t)\rightarrow 0\) for \(t\rightarrow \infty \) and thus \(S(t)\rightarrow s_\infty \) for \(t\rightarrow \infty \).

Theorem 1 shows that we can define a map from the initial conditions to the corresponding limit point of the solution to (13)

$$\begin{aligned} \varPhi : (i,y)\mapsto s_\infty \end{aligned}$$

as a map of the form

$$\begin{aligned} \varPhi : \varGamma \rightarrow [0,1] \end{aligned}$$

Keeping in mind that by assumption \({{{\mathfrak {R}}_0^I}}>{{{\mathfrak {R}}_0^Y}},\) we can now state our main result about this map.

Theorem 2

(Kendall and Saunders 1983) The map \(\varPhi : \varGamma \mapsto [0,1]\) has the following properties

  1. 1.

    If \( \varPhi (i,y)<1/{{{\mathfrak {R}}_0^I}}\) then \(\varPhi \) is continuous at (iy).

  2. 2.

    If \(\varPhi (i,y)>1/{{{\mathfrak {R}}_0^I}}\) then \(i=0\).

  3. 3.

    Assume that \(\varPhi (0,y_0)>1/{{{\mathfrak {R}}_0^I}}\) and let \(\varPsi \) denote the unique solution to the equation

    $$\begin{aligned} \log \varPsi +{{{\mathfrak {R}}_0^I}}\varPhi (0,y_0)(1-\varPsi )=0, \end{aligned}$$
    (10)

    then

    $$\begin{aligned} \varPhi (i,y)\rightarrow \varPsi \ \hbox {for}\ (i,y)\rightarrow (0,y_0) \ \hbox {and} \ (i,y)\in \varGamma _+, \end{aligned}$$

    where \(\varGamma _+\) denotes the interior of \(\varGamma \).

Remark Properties 2 and 3 characterize sequential epidemics. Property 2 states that after the conclusion of an epidemic, the population can only support an additional epidemic if the first epidemic did not involve the superior competitor and Property 3 then states that if in addition the inferior competitor (Y) caused an epidemic sufficiently small such that the effective reproduction number of the superior competitor \(\varPhi (0,y){{{\mathfrak {R}}_0^I}}\) exceeds unity, then the superior strain (I) will produce a sequential epidemic if it is present \((i>0)\).

Property 3 also shows that in this situation, the map \(\varPhi \) is discontinuous at \((0,y_0).\) The limit \(\varPsi \) defined by (10) gives the final size—in terms of the fraction still susceptible after a double epidemic where the inferior strain (Y) has completed its epidemic before the superior strain (I) gives rise to an epidemic among those hosts that were still susceptible.

Proof of Theorem 2

The basic idea is to break \(\varPhi \) into a composite map where the first map takes (iy) into the value of the solution at a (suitably chosen) fixed time T. This map is continuous in general (Coddington and Levinson 1955). The second map takes (S(T), I(T), Y(T)) into \((S(\infty ),I(\infty ),Y(\infty ))\). For the selected T, we show that this map is bounded by suitable exponential decays toward the limit point and use this observation to establish continuity. The detailed proofs may be found in Kendall and Saunders (1983); we will only sketch how to prove claim 1. \(\square \)

Claim 1 Consider a point \((i_0,y_0)\in \varGamma \) with \(\varPhi (i_0,y_0)<1/{{{\mathfrak {R}}_0^I}}\) and let \(X_0(t)=(S_0(t),I_0(t),Y_0(t))\) denote the solution to (13) with initial conditions \((i_0,y_0).\) Let \(X_0^*=\left( \varPhi (i_0,y_0),0,0\right) \) be the limit of \(X_0(t)\) as \(t\rightarrow \infty \). Set \(\kappa =1/{{{\mathfrak {R}}_0^I}}-\varPhi (i_0,y_0)>0.\)

For a given \(1>\varepsilon >0\) set \(\varepsilon _1= \varepsilon \kappa /4\) and observe that there exists a T such that \(||X_0(t)-X_0^*||< \varepsilon _1\) for all \(t\ge T\) where ||.|| denotes the 2 or 3 dimensional Euclidean norm as appropriate. Since the solution of (13) at the finite time T depends continuously on initial conditions (Coddington and Levinson 1955), we can find \(\delta >0\) such that

$$\begin{aligned} ||(i,y)-(i_0,y_0)||<\delta \Rightarrow ||X(T)-X_0(T)||<\varepsilon _1 \end{aligned}$$

for the solution X(t) with initial conditions (iy). We need to show that \(||X^*-X^*_0||\!<\!\varepsilon .\)

We first observe that by construction, \(S(T)\le \varPhi (i_0,y_0)+\kappa /2,\) such that \(1/{{{\mathfrak {R}}_0^I}}- S(T)> 1/{{{\mathfrak {R}}_0^I}}-(\varPhi (i_0.y_0)+\kappa /2)>\kappa /2\). Since S(t) is decreasing, the inequality holds for all \(t\ge T.\) This result provides an estimate for I(t) since \(I' =\mathop \beta \nolimits _{I}(S-1/{{{\mathfrak {R}}_0^I}})I\le -\frac{1}{2}\mathop \beta \nolimits _{1}\kappa I\), for \(t\ge T\). We deduce that for \(t\ge T\) we have

$$\begin{aligned} I(t)\le I(T)\exp \left( -\,\frac{1}{2}\mathop \beta \nolimits _{I}\kappa t\right) , \end{aligned}$$

with a similar expression for Y(t). This provides us with suitable bound on how fast I and Y decay and thus a bound for the decay of \(S(t)-S(\infty ).\)

We can now study \(S(T)-S(\infty )\) and first observe that

$$\begin{aligned} \begin{aligned} \log S(\infty )/S(T)&= -\int _T^\infty \mathop \beta \nolimits _{I}I(\tau )+\mathop \beta \nolimits _{Y}Y(\tau )\,d\tau \\&\ge -\int _T^\infty \mathop \beta \nolimits _{I}I(T)e^{-\frac{1}{2}\mathop \beta \nolimits _{I}\kappa \tau } + \mathop \beta \nolimits _{Y}Y(T)e^{-\frac{1}{2}\mathop \beta \nolimits _{Y}\kappa \tau }\,d\tau \\&=-\frac{2}{\kappa }(I(T)+Y(T)). \end{aligned} \end{aligned}$$

A simple rearrangement yields

$$\begin{aligned} \begin{aligned} 0<S(T)-S(\infty )&\le \left( 1-\exp \left( -\frac{2}{\kappa }(I(T)+Y(T))\right) \right) S(T)\\&\le \frac{2}{\kappa }((I(T)+Y(T)) S(T)\le \frac{4}{\kappa }\varepsilon _1\le \varepsilon . \end{aligned} \end{aligned}$$

Finally, we notice that

$$\begin{aligned} ||X^*-X_0^*||\le ||X^*-X(T)|| + ||X(T)-X_0(T)||+ ||X_0(T)-X_0^*||, \end{aligned}$$

and that we have shown that each term is smaller than \(\varepsilon .\)

This concludes our discussion of the proof of Theorem 2.

Fig. 3
figure 3

The map \(\varPhi \) for values of (iy) close to the origin. Along the i-axis \(\varPhi (i,0)\rightarrow s_\infty ^Y\)—the size of an epidemic consisting only of type Y pathogens. On the y-axis \(\varPhi (0,y)\rightarrow s^I_\infty \)—the size of an epidemic consisting only of type I. If sequential epidemics are possible—that is, if \({{{\mathfrak {R}}_0^I}}s_Y^\infty >1\), the value of double limit is \(\lim _{y\rightarrow 0^+}\lim _{i\rightarrow 0^+} \varPhi (i,y)=s^\dagger \)—the final size of the susceptible population in an i-epidemic that occurs in a host population that has already been exposed to a y-epidemic (case shown). If sequential epidemics cannot occur (case not shown), we have \(\lim _{y\rightarrow 0^+}\lim _{i\rightarrow 0^+} \varPhi (i,y)=s_\infty ^Y\). In both cases, the map \(\varPhi \) is discontinuous at (0, 0). Parameter values used \({{{\mathfrak {R}}_0^I}}= 2, {{{\mathfrak {R}}_0^Y}}=1.2\)

To finish our characterization of the possible outcomes of the mixed epidemic for the case where \(i,y\ll 1\), we combine Theorem 2 with our observations in Sect. 2. Figure 3 shows the nature of \(\varPhi \) in a neighborhood of origin, and in particular, we notice that

$$\begin{aligned}&\lim _{i\rightarrow 0^+} \varPhi (i,0) = s^I_\infty \\&\lim _{y\rightarrow 0^+} \varPhi (0,y) = s_\infty ^Y\\&\lim _{y\rightarrow 0^+} \lim _{i\rightarrow 0^+} \varPhi (i,y) = s^\dagger \end{aligned}$$

where \(s^\dagger \) is the final size of the susceptible population after sequential epidemics if \({{{\mathfrak {R}}_0^I}}s_\infty ^Y>1,\) and \(s^\dagger = s_\infty ^Y\) if sequential epidemics are not possible. Thus, the map \(\varPhi \) is discontinuous at the origin—and if sequential epidemics are possible, it is in addition also discontinuous along the y-axis. Therefore, every neighborhood of the origin will contain (1) a point that is mapped into a value close to \(s^I_\infty \) and (2) a point that is mapped into a value close to \(s^\dagger \) and (3) a point that is mapped into a value close to \(s_\infty ^Y\)—where the two latter points may be identical in the case of no sequential epidemics. Since \(\varPhi \) is continuous in the interior of \(\varGamma \) and \(\varGamma \) is connected, we can find points that are mapped into every value in the interval \([s^I_\infty ,s^\dagger )\) while \(s_\infty ^Y\) may be an isolated point in the image when sequential epidemics can occur. This result explains the range of outcomes that is indicated in Fig. 2b.

Although it seems obvious, it is hard to prove that \(\varPhi \) does not have extreme values outside the interval \([s^I_\infty ,s^\dagger ).\)Kendall and Saunders (1983) provide a quite complicated proof of the claim—we will return to the issue in the discussion.

We conclude that it is not possible to uniquely determine a final size of the susceptible population in a mixed epidemic unless one makes further assumptions about the size and composition of the initial infection. We did, however, show that the final size of the susceptible population falls in the set \([s_\infty ^I,s^\dagger )\cup \{s_\infty ^Y\}\). Once \(s_\infty \) is determined, the corresponding attack rates for the two pathogens can be found using Eqs. (67).

4 Partial Cross-Immunity

A somewhat more complicated situation arises if infection with one strain confers only partial protection against infection with the other strain. Here, we assume that cross-immunity acts by reducing infectivity by a factor \(\tau ,\ 0\!<\!\tau \!<\!1\) during the second infection. Furthermore, we will assume that the two infections are independent in the sense that a host can become infected by the second strain while still being infected with the first strain and that recovery from the two infections is independent.

Let \(I_0\) denote the fraction of hosts that are currently infected with strain I and who have no immunity to strain Y, while \(I_1\) denotes the fraction of hosts who are infected with strain I and who are immune to strain Y at the time of infection. With our assumptions, the force of infection of strain I is

$$\begin{aligned} \varLambda _I= \mathop \beta \nolimits _{I}(I_0+\tau _Y I_1), \end{aligned}$$
(11)

with similar expressions for strain Y.

Let \(R_K, K=I,Y\) denote the fraction of hosts that are or have been infected with strain K and who have not yet been infected with strain \(L=Y,I.\) Since we assume that hosts are susceptible to a second infection immediately after acquiring their first infection, hosts infected with strain I flow directly from the susceptible class S into the \(R_I\)-class and we have

$$\begin{aligned} \dot{S}&= -(\varLambda _I+\varLambda _Y) S \end{aligned}$$
(12)
$$\begin{aligned} \dot{R}_K&= \varLambda _K S - \varLambda _L R_K, \quad \hbox {for} \quad K=I,Y\quad L=Y,I. \end{aligned}$$
(13)

In addition, we can keep track of infected hosts and their immune status at the time of infection

$$\begin{aligned} \dot{K}_0&= \varLambda _K S-\nu _K K_0 \quad \hbox {for} \quad K=I,Y\\ \dot{K}_1&= \varLambda _K R_L-\nu _K K_1 \quad \hbox {for} \quad K=I,Y \end{aligned}$$

which—using Eq. (11)—simplifies to

$$\begin{aligned} {{\dot{\varLambda }}}_K&=\mathop \beta \nolimits _{K}\varLambda _K(S+\tau _L R_L) -\nu _K\varLambda _K. \quad \hbox {for} \quad K=I,Y\quad L=Y,I. \end{aligned}$$
(14)

Equations (12, 13, 14) give a total of 5 equations yielding a closed system, describing the transmission dynamics of the two strains.

In addition, we need to specify initial conditions \(S(0) = s, R_I(0)=r_I, R_Y(0)=r_Y.\) Here, \(0\le s+r_I+r_Y\le 1\) and \(0<\varLambda _K(0)=l_K\) as in the previous section.

In order to obtain simple first integrals, we change to variables that account for the fraction of totally susceptible individuals S and the fraction of hosts susceptible to each of the two strains \(P_K= S+R_L\). In this representation, the model is

$$\begin{aligned} \begin{aligned} \dot{S}&= -(\varLambda _I+\varLambda _Y)S\\ \dot{P}_K&= - \varLambda _K P_K \quad \hbox {for}\quad K=I,Y\quad L=Y,I\\ {{\dot{\varLambda }}}_K&= \mathop \beta \nolimits _{K}((1-\tau _L) S+\tau _LP_K))\varLambda _K -\nu _K\varLambda _K \end{aligned} \end{aligned}$$
(15)

with initial conditions \((s,p_K,l_K)\) satisfying \(0\le p_I+p_Y-s\le 1, 0<s\le p_K\le 1\) and \(0\le l_K\). An extension of Theorem 1 shows that the trajectories all have \(\omega \)-limits of the form \(S(\infty )=s_\infty >0\), \(P_K(\infty ) = p^K_\infty \ge 0\) and \(\varLambda _K(\infty )=0\).

One may now look for first integrals as in the previous sections. There are three trivial integrals: \(\dot{S}/S\), \(\dot{P}_I/P_I, \dot{P}_Y/P_Y\). We next look for a suitable linear combination of the five dynamical equations that yields a generalization of the balance equation (7). To find such a combination, we seek a linear combination of the five dynamical equations such that the four quadratic terms \(\varLambda _KS\) and \(\varLambda _KP_K\) vanish. Thus, the coefficients of the quadratic terms must satisfy the (under-determined) homogeneous linear equations

where a row corresponds to a quadratic term and a column to a dynamical equation. Since the coefficient matrix has rank 4, the solution

$$\begin{aligned} \left( (1-\tau _I)(1-\tau _Y),\tau _Y(1-\tau _I), \tau _I(1-\tau _Y),\frac{1-\tau _I}{\mathop \beta \nolimits _{I}},\frac{1-\tau _Y}{\mathop \beta \nolimits _{Y}}\right) ^T \end{aligned}$$

is unique up to a scaling factor. We conclude that

$$\begin{aligned}&(1-\tau _I)(1-\tau _Y)\dot{S} + \tau _Y(1-\tau _I)\dot{P}_I+ \tau _I(1-\tau _Y)\dot{P}_Y + \frac{1-\tau _I}{\mathop \beta \nolimits _{I}}{{\dot{\varLambda }}}_I + \frac{1-\tau _Y}{\mathop \beta \nolimits _{Y}}{{\dot{\varLambda }}}_Y \nonumber \\&\quad =-\frac{\nu _I(1-\tau _I)}{\mathop \beta \nolimits _{I}}\varLambda _I -\frac{\nu _Y(1-\tau _Y)}{\mathop \beta \nolimits _{Y}}\varLambda _Y, \end{aligned}$$
(16)

which gives us our fourth and last first integral.

We can now formulate a necessary condition that must be satisfied by the outcome of the mixed epidemic in terms of \(\sigma =s_\infty /s\) (the fraction of those hosts that were fully susceptible at onset of the epidemic who are still fully susceptible at the end of the epidemic) \(\pi _K=P_K(\infty )/p_K,\) (with similar interpretation) and \(\alpha _K=\int _0^\infty \varLambda _K\, dt,\) (the cumulated force of infection). We find that the outcome of the epidemic must satisfy

$$\begin{aligned} \begin{aligned} \log \sigma&= -(\alpha _I+\alpha _Y)\\ \log \pi _K&= - \alpha _K\qquad K=I,Y\\ 0&= s(1-\tau _I)(1-\tau _Y)(1-\sigma )+p_I\tau _Y (1-\tau _I)(1-\pi _I)\\&\quad +\,p_Y\tau _I(1-\tau _Y)(1-\pi _Y) -\frac{\nu _I(1-\tau _I)}{\mathop \beta \nolimits _{I}}\alpha _I -\frac{\nu _Y(1-\tau _Y)}{\mathop \beta \nolimits _{Y}}\alpha _Y\\&\quad -\,\frac{1-\tau _I}{\mathop \beta \nolimits _{I}}\left( \varLambda _I(\infty )-l_I\right) -\frac{1-\tau _Y}{\mathop \beta \nolimits _{Y}}\left( \varLambda _Y(\infty )-l_Y\right) \end{aligned} \end{aligned}$$
(17)

Condition (17) gives us four equations in the five unknowns \((\sigma ,\pi _I,\pi _Y,\alpha _I,\alpha _Y)\), and we conclude that the outcome of the competing epidemics is determined up to one parameter, the value of which will depend on the details of the initial conditions—as in Sect. 2.

To further simplify the discussion, we assume that the initial infection is small \(0\le l_k\ll 1\) such that the last line of (17) vanishes. Substituting \(\sigma = \pi _I\pi _Y\) and \(\alpha _K=-\log \pi _K\) into the last equation in (17), we obtain after a suitable rearrangement, the condition

$$\begin{aligned} \begin{aligned}&F(\pi _I,\pi _Y) = -(1-\tau _I)(1-\tau _Y)(1-\pi _I)(1-\pi _Y)\\&\quad +\, \frac{1-\tau _I}{s{{{\mathfrak {R}}_0^I}}}\left( {\mathfrak {R}}_e^I(1-\pi _I)+\log \pi _I\right) + \frac{1-\tau _Y}{s{{{\mathfrak {R}}_0^Y}}}\left( {\mathfrak {R}}_e^Y(1-\pi _Y)+\log \pi _Y\right) =0, \end{aligned}\nonumber \\ \end{aligned}$$
(18)

where \({\mathfrak {R}}_e^K={{{\mathfrak {R}}_0^K}} \left( (1-\tau _L)s+\tau _Lp_K\right) \) is the effective reproduction number of the strain considering the preexisting immunity in the population. Equation (18) gives one equation in the two unknowns \((\pi _I,\pi _Y)\), and the corresponding curve in \((\pi _I,\pi _Y)\)-space gives us the range of possible outcomes of the competing epidemics in the sense that once \((\pi _I,\pi _Y)\) is determined, \(\sigma \) and \((\alpha _I,\alpha _y)\) may be found from (17).

Thus, to characterize the possible outcomes of the competing epidemics it suffices to describe the solutions to Eq. (18). We will assume that both strains can grow at the onset of the epidemic so that \({\mathfrak {R}}_e^K>1\) for both strains. In addition, we must have that \(0\le \pi _K\le 1\) to ensure that the solution is biologically meaningful.

Fig. 4
figure 4

Geometric analysis of the number of solutions to \(F(\pi _I,\pi _Y)=0\), Eq. (18). Since F is concave for fixed \(\pi _K,\) the equation has at most two solutions along a horizontal line (or along a vertical line). a (Sequential epidemics do not occur). The sign of F at the boundary of the unit square gives the number of solutions in the different regions. In the gray region, there is exactly one solution along each horizontal line; in the white area, there is between 0 and 2 solutions. The points \(\pi _\infty ^K\), \(K=I,Y\) denote the probability of remaining uninfected to strain K at the end of an epidemic consisting only of strain K. b (A sequential epidemic with a Y epidemic followed by an I epidemic may occur). The sign variation along the lines \(\pi _I=\pi _\infty ^I\) depends on the existence and magnitude of a IY-sequential epidemic \(\pi _\infty ^{IY}\). In the gray areas of region \(I+II\), there is one solution along a horizontal line—in the white areas of \(I+II\), there is none. The same analysis may be applied to vertical lines

The structure of Eq. (18) allows for an elementary analysis of its solutions as shown in Fig. 4. Let \(\pi _\infty ^K\) denote the non-trivial solution to the one-strain final size of the susceptible population, i.e., \(0<\pi _\infty ^K<1\) is the (unique) solution to

$$\begin{aligned} {\mathfrak {R}}_e^K(1-\pi ) + \log \pi = 0. \end{aligned}$$

Clearly, the equation \(F(\pi _I,\pi _Y) = 0\) has three simple solutions:

$$\begin{aligned} (\pi _\infty ^I,1)\quad (1,\pi _\infty ^Y) \quad (1,1). \end{aligned}$$

The lines \(\pi _K=\pi _\infty ^K, K=I,Y\) divide the unit square into four regions as indicated in Fig. 4. Along the line \(\pi _I=1,\) F changes sign as indicated in Fig. 4, while \(F(x,y)\rightarrow -\infty \) for \(x\rightarrow 0^+\) ensuring that F is negative close to the axis. Since \(\partial ^2F/\partial \pi _I^2 = - {{{\mathfrak {R}}_0^I}}/\pi _I^2 <0\), the function \(\pi _I\mapsto F(\pi _I,\pi _Y)\) is concave and along a horizontal line we have that

Lemma 2

For fixed \(\pi _Y^*\in (\pi ^Y_\infty ,1),\) Eq. (18) has exactly one solution \(\pi _I\in (0,1).\)

For fixed \(\pi _Y^*\in (0,\pi _\infty ^Y),\) Eq. (18) has 0, 1 or 2 solutions. The case of one solution occurs iff \(\partial F/\partial \pi _I\) vanishes at the solution point.

To determine more precisely the location of the solutions in the interval \((\pi _\infty ^Y,1),\) consider the function

$$\begin{aligned} \begin{aligned} \pi _Y&\mapsto F\left( \pi _\infty ^I,\pi _Y\right) \\&= \frac{1-\tau _Y}{s{{{\mathfrak {R}}_0^Y}}} \left( \left( -\,(1-\tau _I)\left( 1-\pi _\infty ^I\right) s{{{\mathfrak {R}}_0^Y}}+{\mathfrak {R}}_e^Y\right) (1-\pi _Y)+\log \pi _Y\right) \\&= \frac{1-\tau _Y}{s{{{\mathfrak {R}}_0^Y}}}\left( {\mathfrak {R}}_e^{IY}(1-\pi _y)+\log \pi _y\right) , \end{aligned} \end{aligned}$$
(19)

where

$$\begin{aligned} {\mathfrak {R}}_e^{IY}= {{{\mathfrak {R}}_0^Y}}\left( (1-\tau _I)\pi _\infty ^Is+\tau _I p_Y\right) \end{aligned}$$
(20)

is the effective sequential reproduction number for strain Y if the population has already experienced the full course of an epidemic with strain I. If \({\mathfrak {R}}_e^{IY}>1\), an IY sequential epidemic is possible and \(F=0\) has a solution of the form \((\pi _\infty ^I,\pi _\infty ^{IY})\) where \(\pi _\infty ^{IY}\) solves the final size equation \(F(\pi _\infty ^I,y)=0\).

If \({\mathfrak {R}}_e^{IY}<1\), then F is negative on the line segment \((\pi _\infty ^I,y).\) Looking along a horizontal line segment, \(y\in (0,1)\) there is one solution in region I and none in region II.

If \({\mathfrak {R}}_e^{IY}>1\), then F is positive on the line segment \((\pi _\infty ^I,y)\), \(y\in (1,\pi _\infty ^{IY})\) and for each fixed \(\pi _Y\) in this interval, \(F=0\) has exactly one solution in region II and none in region I. For \(y<\pi _\infty ^{IY}\), F is negative on the line \(\pi _I=\pi _\infty ^I\) and the location of the solution is reversed.

We can now use the implicit function theorem to see that the set of solutions consists of one or more smooth curves in the unit square and exclude the possibility of singular points:

Lemma 3

If \((\pi _I,\pi _Y)\in (0,1)\times (0,1)\), then

  1. 1.

    \(\partial F/\partial \pi _I = 0 \Rightarrow \pi _I>1/{\mathfrak {R}}_e^I\)

  2. 2.

    \(F=0 \wedge \partial F/\partial \pi _I = 0 \Rightarrow \pi _Y<\pi _\infty ^Y\)

Proof

We have that

$$\begin{aligned} \partial F/\partial \pi _I= & {} (1-\tau _I)(1-\tau _Y)(1-\pi _Y)+ \frac{1-\tau _I}{s{{{\mathfrak {R}}_0^I}}}\left( -\,{\mathfrak {R}}_e^I+\frac{1}{\pi _I}\right) \\> & {} \frac{1-\tau _I}{s{{{\mathfrak {R}}_0^I}}}\left( -\,{\mathfrak {R}}_e^I+\frac{1}{\pi _I}\right) . \end{aligned}$$

The last term can be negative only if \(\pi _I>1/{\mathfrak {R}}_e^I\).

To see the last claim, we observe that by the elementary inequality \(-\log u + u-1>0\) we have that

$$\begin{aligned} \begin{aligned} F+(1-\pi _I)\frac{\partial F}{\partial \pi _I}&= \frac{1-\tau _I}{s{{{\mathfrak {R}}_0^I}}}\left( \log \pi _I +\frac{1}{\pi _I}-1\right) \!+\! \frac{1-\tau _Y}{s{{{\mathfrak {R}}_0^Y}}}\left( {\mathfrak {R}}_e^Y(1-\pi _Y)+ \log \pi _Y\right) \\&> \frac{1-\tau _Y}{s{{{\mathfrak {R}}_0^Y}}}\left( {\mathfrak {R}}_e^Y(1-\pi _Y)+ \log \pi _Y\right) . \end{aligned} \end{aligned}$$

The last term can be negative only if \(\pi _Y<\pi _\infty ^Y\).

Since \(1/{\mathfrak {R}}_e^K>\pi _\infty ^K\), we observe that the curve of solutions can have a horizontal tangent only in region IV and a vertical tangent only in region II and that there are no singular points.

Considering the curve in terms of a function of \(\pi _Y\), we have that at the point \((\pi ^I_\infty ,1)\)

$$\begin{aligned} 0= F_I'((\pi _\infty ^I),1)\pi _I'+F_Y'(\pi _\infty ,1)= & {} \frac{1-\tau _I}{s{{{\mathfrak {R}}_0^I}}} \left( -\,{\mathfrak {R}}_e^I+\frac{1}{\pi _\infty ^I}\right) \pi _I'\\&+ \frac{1-\tau _Y}{s{{{\mathfrak {R}}_0^Y}}}\left( -\,{\mathfrak {R}}_e^{IY}+1\right) , \end{aligned}$$

showing that \(\pi _I\) is negative when \({\mathfrak {R}}_e^{IY}>1\) and positive when the condition is not satisfied.

We can now summarize this observation and Lemmas 2 and 3 in the following

Theorem 3

Assume that \({\mathfrak {R}}_e^K>1, K=I,Y\), let \(\pi _\infty ^K\) denote the final size of the susceptible population for each strain and let \({\mathfrak {R}}_e^{IY}\) given by Eq. (20) denote the effective reproduction number of strain Y for a IY-sequential epidemic. When \({\mathfrak {R}}_e^{IY}>1\), the size of the sequential epidemic is denoted \(\pi ^{IY}_\infty \).

The lines \(\pi _K=\pi _\infty ^K\) divide the unit square into four regions. The region \((\pi _\infty ^I,1)\times (\pi _\infty ^Y,1)\) is named region I, and the other three regions are named II, III and IV counter-clockwise as indicated in Fig. 4.

The solutions to Eq. (18) in the unit square consist of the isolated point (1, 1) and a curve \(\gamma (u) = (\pi _I(u),\pi _Y(u))\) with the following properties:

  1. 1.

    The curve \(\gamma \) enters the top of the unit square at \((\pi _\infty ^I,1)\).

    1. (a)

      If \(R_e^{IY}>1\), the tangent of \(\gamma \) at the point of entry lies in region II. In region II, \(\gamma \) is concave and it leaves region II at the point \((\pi _\infty ^I,\pi _\infty ^{IY})\) (cf. Fig. 5c, d).

    2. b)

      If \(R_e^{IY}<1\), the tangent of \(\gamma \) at the entry point lies in region I and \(\gamma \) does not intersect region II (cf. Fig. 5a, b).

  2. 2.

    In region I, \(\gamma \) is a monotonically decreasing single-valued function of \(\pi _I\).

  3. 3.

    The properties of \(\gamma \) are symmetric in I and Y.

  4. 4.

    No point in region III satisfies \(F=0.\)

The curve \(\gamma \) describes a necessary condition that must be satisfied by solutions to Eqs. (12, 13, 14) or equivalently to Eq. (15). More precisely, any \(\omega \)-limit point of a solution to the differential equation with initial conditions \(l_I, l_Y\ll 1\) satisfies

$$\begin{aligned} \left( S(\infty )/S(0),P_I(\infty )/P_I(0),P_Y(\infty )/P_Y(0),\varLambda _I(\infty ), \varLambda _Y(\infty )\right) \!\!=\! (\pi _i\pi _y,\pi _i,\pi _y,0,0), \end{aligned}$$

where \((\pi _I,\pi _Y)\) lies on \(\gamma \). One can use methods from the proof of Theorem 2 to prove that not all points on \(\gamma \) are limit points for solutions to (15). In fact, we find that the size of the sequential epidemic bounds the possible outcomes for epidemics where both strains are present, such that only those points in \(\gamma \) that lie in Region I of Fig. 4 plus the final outcome of single-strain epidemics are possible final sizes.

Fig. 5
figure 5

The possible outcomes of a two-strain epidemic. The solutions to the necessary condition (18) are shown in thin line, while the heavy line indicate those points that can arise as solutions to the epidemic model (15). Open circles indicate sequential epidemics, while filled circles indicate epidemics where at least one strain is absent. The panels a and b correspond to the two panels in Fig. 4. Parameter values common to all graphs are \({{{\mathfrak {R}}_0^I}}=2.0\), \({{{\mathfrak {R}}_0^Y}}=1.5\) and \(s=p_I=p_Y=1\). In a \(\tau _I=0.0, \tau _Y=0.0\); in b \(\tau _I=0.8, \tau _Y=0.2\); in c \(\tau _I=0.5, \tau _Y=0.6\); in d \(\tau _I=0.9, \tau _Y=0.8\)

This gives our final characterization of the outcome of “epidemics in competition.”

Theorem 4

Let

$$\begin{aligned} \varGamma _\delta =\big \{\ (s,p_I,p_Y,l_I,l_Y)\ \big | \ 0\le p_I+p_Y-s\le 1,\\ \quad 0<s\le p_K\le 1,\ 0\le l_k< \delta \ \big \} \end{aligned}$$

denote the set of biologically feasible initial conditions to Eq. (15) with initial infection less than \(\delta >0\) and let

$$\begin{aligned} \varPhi : \varGamma _\delta \rightarrow [0,1]\times [0,1] \end{aligned}$$

denote the map from initial conditions to \((\pi _I,\pi _Y)\), the limit of \((P_I(t)/P_I(0),P_Y(t)/P_Y(0))\) for \(t\rightarrow \infty \). Then, for \(\delta \rightarrow 0\) the set \(\varPhi (\varGamma _\delta )\) approaches the set consisting of the segment of the curve \(\gamma \) that lies in Region I in Fig. 4 combined with the three points \((\pi _\infty ^I,1)\), \((1,\pi _\infty ^Y)\) and (1, 1).

5 The Effect of Cross-Immunity on the Final Epidemic

To describe the biological implications of Theorem 4, let us consider how the strength of cross-immunity affects the possible outcomes in the situation where all hosts are initially susceptible to both strains such that \(s=p_I=p_Y=1\). The two extreme situations are well known: If there is no cross-immunity \(\tau _K=1\), the two diseases act independently and if both strains are initially present in the population, we expect the final size of the susceptible population to the strains to be \((\pi _\infty ^I,\pi _\infty ^Y)\). The case of full cross-immunity \(\tau _K=0\) was discussed in Sects. 23.

For intermediate values of \(\tau _K\), we first observe that with initial conditions \(s=p_I=p_Y=1\) both strains have effective reproduction numbers \({\mathfrak {R}}_e^K>1\) that are independent of the strength of the cross-immunity \(\tau _K\). Consequently, the final size of the susceptible population for a one-strain epidemic \(\pi _\infty ^K\) is also independent of \(\tau _K\) and the effective reproduction number for the sequential epidemic \({\mathfrak {R}}_e^{IY} = {{{\mathfrak {R}}_0^Y}} s \left( (1-\tau _I)\pi _\infty ^I +\tau _I\right) \) is linear in \(\tau _I\).

When there is no cross-immunity, \(\tau _k=1\), we have \({\mathfrak {R}}_e^{IY}={{{\mathfrak {R}}_0^Y}}\)—corresponding to the fact that an initial epidemic with strain I does not affect strain Y. When cross-immunity is perfect \(\tau _K=0\), we obtain (as expected) the same condition for a sequential epidemic as we found in Sect. 2. Thus, for weaker cross-immunity (\(\tau _K\) larger) the effective reproduction number of a KL-sequential epidemic increases and for some value of \(1> \tau _K\ge 0\), a KL-sequential epidemic is possible.

This scenario is depicted in Fig. 5. For strong cross-immunity (small \(\tau _K\)), the possible outcomes lie as shown in Fig. 5a—or if one type of sequential epidemic is possible either in (b) or (c). As cross-immunity weakens, the outcomes look like (b) or (c) and for even weaker cross-immunity as (d). As both \(\tau _k\) approaches zero, the two sequential final sizes move down toward the point \((\pi _\infty ^I,\pi _\infty ^Y)\) describing the final sizes of two independent epidemics.

6 Discussion

In this article, we have described the outcome of a mixed epidemic where two related strains of a pathogen compete for susceptible hosts during a single epidemic episode. The final result of such “epidemics in competition” depends critically on details of the initial conditions and possibly on stochastic variation during the early period of low disease prevalence. Rather than making specific assumptions about conditions during the onset, we have focused on those aspects of the epidemic that may be assessed by deterministic methods.

In general, the final outcome may be characterized by the number of hosts that have experienced each of the possible histories of infection during the epidemic. In the cases we have studied, these numbers fall on a single curve in the space of possible outcomes, with the caveat that epidemics where only one of the strains is present may lead to an outcome that is an isolated point which is not part of the curve for mixed epidemics. Such isolated points arise when conditions support a sequential epidemic, i. e. when a strain can produce an epidemic in a population that has already experienced the full course of an epidemic with the other strain.

These observations hold also for the case where strains exert only partial cross-immunity in the sense that infection with one strain reduces the transmission of the alternate strain during a subsequent infection. We finally showed that—as cross-immunity weakens—this model spans the full range of outcomes from the case of full cross-immunity to the case of completely unrelated strains.

This type of competition between two closely related strains represents the first step in the natural selection among new variants of a disease during outbreaks or pandemics. In focusing on the interaction during the course of the epidemic, the analysis offers a somewhat unusual way to study natural selection among micro-parasites as most studies of natural selection assumes that the disease is in a quasi-steady state determined by the balance between disease transmission and demographic turn-over in the host population. The present approach may be especially useful for rapidly evolving pathogens such as many RNA-viruses where the demographic balance is of little importance.

From a mathematical view point, the paper raises several interesting questions. As all models of a single epidemic, the present models are structurally unstable in the technical sense that small changes of the right hand side of the equations may lead to qualitatively different final states. This fact is somewhat obvious because even small levels of demographic turn-over alter the long-term behavior of the solutions. Still the single epidemic is worth studying as it is of major interest in applications to zoonotic outbreaks and pandemic influenza as well as forming the basic structure for much current theoretical work in particular in network theory. It would be interesting to know which types of perturbations—if any—the models are robust to.

The structural instability of the model also limits the available mathematical tools as most results from dynamical systems require that the model is generic. Thus, our basic results concerning the continuity of the map from initial conditions to outcome rely on quite basic theory, namely the fact that the value of the solution to an ODE after finite time depends continuously on initial conditions combined with explicit evaluation of the approach to the semi-stable equilibrium. How and when may this method be generalized?

Kendall and Saunders (1983) showed that the map from initial conditions to final size is monotonic in the sense that if the inoculum of one strain is reduced by a certain amount while the inoculum of the second strain is increased by the same amount, then the final number of hosts infected with the second type increases while the number infected with the first type decreases. To prove monotonicity, Kendall and Saunders used a complicated coupling argument; see Ball (1995). It may be possible to generalize this proof to the case of partial cross-immunity.

The most important mathematical challenge lies in the determination of the set of possible outcomes of two-strain epidemics. Locating the outcome involves two distinct limits: the \(\omega \)-limit for the solutions of the model’s differential equations and subsequently the limit where initial conditions for the infected classes go to zero mimicking the epidemic situation of a newly introduced pathogen. To handle the last limit, we relied on the fact that ad hoc methods yield a suitable number of first integrals of the model so that the relation between initial conditions and the \(\omega \)-limit was analytically available. The model of infectivity-reduction was carefully chosen to allow for this process. It is straightforward to see that the methods we have developed in Sect. 4 may also be applied to the case of polarized immunity. Polarized immunity refers to the situation where after infection with the first strain, a fraction of the hosts recognizes perfectly a challenge from the a second strain while the remainder of the hosts experience the second strain as a novel infection (Gog and Swinton 2002). However, it is not clear if our results—and in particular the one-dimensional nature of the outcome space—will generalize to the case where cross-immunity leads to a reduction in susceptibility.

Throughout the paper, we have assumed that the host population mixes homogeneously. It is well known that for a single pathogen in a heterogeneously mixing population, there exists a unique final size, but the size is in general not analytically available (Magal et al. 2016; Miller 2012; Andreasen 2011; Rass and Radcliffe 2003). The generalization to multi-strain epidemics is not clear. In fact, for heterogeneously mixing populations it is easy to find examples where the space of outcomes for a two-strain epidemic is multi-dimensional.