Endogenous viral mutations, evolutionary selection, and containment policy design

Mellacher, Patrick

doi:10.1007/s11403-021-00344-3

Endogenous viral mutations, evolutionary selection, and containment policy design

Regular Article
Open access
Published: 07 January 2022

Volume 17, pages 801–825, (2022)
Cite this article

Download PDF

You have full access to this open access article

Journal of Economic Interaction and Coordination Aims and scope Submit manuscript

Endogenous viral mutations, evolutionary selection, and containment policy design

Download PDF

Patrick Mellacher ORCID: orcid.org/0000-0001-6757-8104¹

3200 Accesses
5 Citations
8 Altmetric
Explore all metrics

Abstract

How will the novel coronavirus evolve? I study a simple epidemiological model, in which mutations may change the properties of the virus and its associated disease stochastically and antigenic drifts allow new variants to partially evade immunity. I show analytically that variants with higher infectiousness, longer disease duration, and shorter latent period prove to be fitter. “Smart” containment policies targeting symptomatic individuals may redirect the evolution of the virus, as they give an edge to variants with a longer incubation period and a higher share of asymptomatic infections. Reduced mortality, on the other hand, does not per se prove to be an evolutionary advantage. I then implement this model as an agent-based simulation model in order to explore its aggregate dynamics. Monte Carlo simulations show that a) containment policy design has an impact on both speed and direction of viral evolution, b) the virus may circulate in the population indefinitely, provided that containment efforts are too relaxed and the propensity of the virus to escape immunity is high enough, and crucially c) that it may not be possible to distinguish between a slowly and a rapidly evolving virus by looking only at short-term epidemiological outcomes. Thus, what looks like a successful mitigation strategy in the short run, may prove to have devastating long-run effects. These results suggest that optimal containment policy must take the propensity of the virus to mutate and escape immunity into account, strengthening the case for genetic and antigenic surveillance even in the early stages of an epidemic.

When might host heterogeneity drive the evolution of asymptomatic, pandemic coronaviruses?

Article 20 June 2022

Mutation induced infection waves in diseases like COVID-19

Article Open access 10 June 2022

Deconvolving mutational patterns of poliovirus outbreaks reveals its intrinsic fitness landscape

Article Open access 17 January 2020

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

In the autumn of 2021 it has become clear, that even wide-spread access to vaccines has not (yet) eliminated the threat of COVID-19, as new variants prove to be fitter and better able to escape immunity (Bernal et al. 2021; Hoffmann et al. 2021a, 2021b; Wall et al. 2021). Mutations have already started to become a concern in mid-2020 (Korber et al. 2020), whereas other early studies concluded that the evolutionary pace of SARS-CoV2 was too low to endanger vaccine efficacy (Dearlove et al. 2020). In the beginning of 2021, however, most experts believed that COVID-19 will become “endemic,” i.e., circulate perpetually in varied forms at least in certain areas (e.g., countries with low access to vaccines) (Phillips 2021).

In this article, i aim to shed light on two research questions which naturally arise in this situation: a) in which direction will the SARS-CoV2 virus causing COVID-19 evolve?, and b) which aggregate dynamics can we expect from an evolving virus in terms of infections and fatalities? In order to study these topics, i develop a parsimonious epidemiological model that simultaneously captures two types of viral evolution:

a) Genetic variation: Mutations can change the characteristics of a virus in a way that alter its evolutionary fitness, e.g., by increasing its transmissibility (Chen et al. 2020).

b) Antigenic variation (Yuan et al. 2021): Antigenic drift is a process in which the virus changes its antigenic profile. Since antibody response is targeted at the antigenic profile, an antigenic drift allows a virus to escape (some) immunity previously acquired by the viral host. Research on the influenza virus shows that genetic change is more gradual than antigenic evolution and that immune escape depends on the antigenic distance between two variants (Smith et al. 2004).

Both types of variation are subject to natural selection, which favors “useful” (Darwin 1859) variation, i.e., variation which increases the growth rate of the number of people infected by a variant. A recent study suggested that the viral evolution of SARS-CoV2 has been accelerating (McCarthy et al. 2021).

Theoretical modeling of fitness evolution usually relies on the so-called fitness landscape approach first introduced by Wright (1932). In this model, mutations cause a species (e.g., virus) to move along an n-dimensional landscape. Each spot in this landscape represents a phenotype associated with a given fitness value, which may be static or change over time due to, for instance, environmental effects (e.g., Wilke et al. 2001). This landscape may be multi-peaked: Such an approach (using a one-dimensional landscape) was recently used to study viral evolution by e.g., Rüdiger et al. (2020). It may also, however, be single-peaked, a case in which the species continuously approaches the peak. Gurevich et al. (2021) study the evolutionary competition between two distinct strains and find that increased testing favors a test-evasive strain.

In order to capture partial strain-dependent immunity, Roche et al. (2011) develop an agent-based model where previous infections provide partial cross-immunity depending on the evolutionary distance between the infecting variant and the variant that had caused an infection in the past (as observed empirically by Smith et al. 2004). An alternative approach is chosen by Griffin et al. (2020), who develop a parsimonious model with strain-dependent immunity that does not need to store the “antigenic history” (ibid) of each agent.

This paper contributes to the literature on the theoretical modeling (the impact) of viral mutations and—more specifically—COVID-19 variants (e.g., Roche et al. 2011; Basurto et al. 2021; Buckee et al. 2007; Cao et al. 2021; Gabler et al. 2021; Gurevich et al. 2021; Gordo et al. 2009; Griffin et al. 2020; Halley et al. 2021; Marquioni and Aguiar 2021; Pageaud et al. 2021; Rella et al. 2021; Rüdiger et al. 2020; Williams et al. 2021). My contribution is to (a) introduce a parsimonious model of endogenous viral evolution capturing both genetic and antigenic variation (i.e., evolving intrinsic and extrinsic fitness, see Smith et al. 2004), as well as imperfect cross-immunity, and (b) use this framework to study aggregate dynamics and the direction of viral evolution under varying containment policies.

This paper also contributes to the literature using agent-based modeling to studying the COVID-19 crisis (e.g., Basurto et al. 2021; Delli Gatti and Reissl 2020; Dignum et al. 2020; Gabler et al. 2021; Kerr et al. 2021; Lasser et al. 2020; Mellacher 2020, 2021a; Silva et al. 2020; Vermeulen et al., 2020; Wallentin et al. 2020).

Finally, this paper is also related to a literature to the study of evolutionary processes in economics as pioneered by Nelson and Winter (1982). This approach has been adopted in a particularly fruitful way using agent-based models in the field of innovation economics (e.g., Ma and Nakamori, 2005; Dosi et al. 2010), which inter alia considers the case of directed change due to evolutionary selection (Fanti, 2021; Hötte 2020; Mellacher and Scheuer 2021). Modeling the behavior of agents as an adaptive (boundedly rational) processes is another highly promising field of study where evolutionary processes are employed by economists. A recent example for such a model is developed by Lux (2021), who extends the classical SIR model to include adaptive endogenous social distancing.

The rest of this paper is structured as follows: The second section presents the basic model and analyzes the direction of evolution under varying containment scenarios analytically. The third section discusses the implementation of this model as an agent-based simulation model (ABM). The fourth section shows the results of a quantitative analysis of the ABM using Monte Carlo simulations. Finally, section five concludes.

2 SEPAIRD model and analytical results

2.1 Baseline model (without containment policies)

This section describes the baseline model using differential equations in order to investigate its basic properties analytically. In order to capture pre-symptomatic and asymptomatic infections, I extend the classical SIR model to incorporate the following compartments (which also denote the states to which agents in the ABM may belong): Susceptible (S), exposed (E), pre-symptomatic (P), (permanently) asymptomatic (A), (symptomatic) infected (I), recovered (R) or dead (D). Susceptibles may become infected with the virus, if they meet a person who belongs to the compartments P, A or I. Once infected, a susceptible becomes exposed. This is known as a latent period, in which the person neither displays any symptoms, nor is infectious. After the latent period, a person becomes pre-symptomatic. In this case, agents are able to transmit the disease, but do not display any symptoms (yet). Pre-symptomatic agents may later become either symptomatic infected or stay permanently asymptomatic. Finally, permanently asymptomatic will recover, whereas symptomatic infected may either recover or die. A similar approach is used by Lee et al. (2009). This rather detailed specification is necessary to disentangle the effects of various properties of the virus and its associated disease on the effective reproduction number $R_{t}$, which governs the growth rate of the number of infected.

Like the standard SIR model, this model operates on the simplifying homogenous mixing assumption, i.e., every member of compartment i has an equal probability to meet a member of compartment j.

Figure 1 gives a graphical overview of the model:

The laws of motion (omitting mutations and variants) of this model are given as follows, where $\beta$ denotes the average number of infectious contacts per period, $\frac{1}{\alpha }$ the average latent period, $\frac{1}{\mu }$ the average pre-symptomatic time, $\nu$ the share of symptomatic infections ($0 < \nu < 1$), $\frac{1}{\gamma }$ the average duration of symptoms (if the infection takes a symptomatic course), and $\lambda$ the chance to survive a symptomatic infection ($0 < { }\lambda { } < 1$):

$$ \begin{array}{*{20}c} {\dot{S}\left( t \right) = - S\left( t \right)\beta \left( {\frac{I\left( t \right) + P\left( t \right) + A\left( t \right)}{{N\left( t \right)}}} \right)} \\ \end{array} $$

(1)

$$ \begin{array}{*{20}c} {\dot{E}\left( t \right) = S\left( t \right)\beta \left( {\frac{I\left( t \right) + P\left( t \right) + A\left( t \right)}{{N\left( t \right)}}} \right) - E\left( t \right)\alpha } \\ \end{array} $$

(2)

$$ \begin{array}{*{20}c} {\dot{P}\left( t \right) = E\left( t \right)\alpha - P\left( t \right)\mu } \\ \end{array} $$

(3)

$$ \begin{array}{*{20}c} {\dot{A}\left( t \right) = P\left( t \right)\mu \left( {1 - \nu } \right) - A\left( t \right)\gamma } \\ \end{array} $$

(4)

$$ \begin{array}{*{20}c} {\dot{I}\left( t \right) = P\left( t \right)\mu \nu - I\left( t \right)\gamma } \\ \end{array} $$

(5)

$$ \begin{array}{*{20}c} {\dot{R}\left( t \right) = A\left( t \right)\gamma + I\left( t \right)\lambda \gamma } \\ \end{array} $$

(6)

$$ \begin{array}{*{20}c} {\dot{D}\left( t \right) = I\left( t \right)\left( {1 - \lambda } \right)\gamma } \\ \end{array} $$

(7)

The basic reproduction number is defined as the number of people infected by one infected person in an otherwise susceptible population. Since an infected person may belong to the compartments E (where they are not infectious), P, A or I (where they are infectious), it can be calculated by multiplying the number social contacts per time period with the time spent, i.e.,

$$ \begin{array}{*{20}c} {R_{0} = \frac{\beta }{\mu } + \frac{{\beta \left( {1 - \nu } \right)}}{\gamma } + \frac{\beta \nu }{\gamma }} \\ \end{array} $$

(8)

which collapses to:

$$ \begin{array}{*{20}c} {R_{0} = \frac{\beta }{\mu } + \frac{\beta }{\gamma }} \\ \end{array} $$

(9)

The effective reproduction number $R_{t}$ is the number of people infected by one infected person in a population which otherwise does not only consist of susceptibles. Assuming that infectious contacts are not directed toward any group (i.e., homogenous mixing), the share of these contacts with susceptibles is given by the number of susceptibles in the population, i.e., $\frac{S\left( t \right)}{{N\left( t \right)}}$. In the absence of any containment measures, the effective reproduction number is thus given by:

$$ \begin{array}{*{20}c} {R_{t} = \left( {\frac{\beta }{\mu } + \frac{\beta }{\gamma }} \right)\frac{S\left( t \right)}{{N\left( t \right)}}} \\ \end{array} $$

(10)

If $R_{t} < 1$, the virus will die out. If $R_{t} > 1$, the number of infected will grow exponentially. As such, $R_{t}$ is the key metric to capture the evolutionary fitness of a new variant. Accordingly, we can use partial derivatives to investigate how changes in the viral properties may affect $R_{t}$, i.e., to investigate the shape of the fitness landscape with regard to each viral property.

Differentiating $R_{t}$ with respect to each parameter and variable shows that it increases with $\beta$ (i.e., the transmissibility), decreases with $\gamma$ and $\mu$ (i.e., increases with the time in which an individual is infectious $\frac{1}{\mu } + \frac{1}{\gamma }$), increases with $S\left( t \right)$ (i.e., the susceptibles) and decreases with $N\left( t \right)$ (i.e., the total population).

From this analysis follows easily that those mutations are more successful, which increase the transmissibility and the infectious time. From the relationship with $S\left( t \right)$ follows that those mutations are more successful that a) exhibit lower cross-immunity (i.e., circumvents pre-existent immunity against other variants more effectively), and b) that are able to attack more quickly before the recipients are able to obtain cross-immunity from other variants, i.e., have a lower latent period. A decrease in $N\left(t\right)$ is achieved by a higher lethality of the disease caused by the variant. Such a decrease, however, affects all variants simultaneously and thus cannot be assumed to provide an evolutionary advantage to any specific variant.

2.2 Uniform social distancing

Uniform social distancing (which could also be imagined as a blanket lockdown or compulsory mask wearing) reduces the number of social contacts (or their infectiousness) for all individuals in the same way.^{Footnote 1} This can be modeled by substituting $\beta$ with $\beta \left( {1 - \delta } \right)$, where $\delta$ denotes the share of social contacts avoided during each period (or alternatively the reduction of infectiousness of each social contact). Accordingly, the laws of motion regarding S and E change:

$$ \begin{array}{*{20}c} {\dot{S}\left( t \right) = - S\left( t \right)\beta \left( {1 - \delta } \right)\left( {\frac{I\left( t \right) + P\left( t \right) + A\left( t \right)}{{N\left( t \right)}}} \right)} \\ \end{array} $$

(11)

$$ \begin{array}{*{20}c} {\dot{E}\left( t \right) = S\left( t \right)\beta \left( {1 - \delta } \right)\left( {\frac{I\left( t \right) + P\left( t \right) + A\left( t \right)}{{N\left( t \right)}}} \right) - E\left( t \right)\alpha } \\ \end{array} $$

(12)

The basic reproduction number is now:

$$ \begin{array}{*{20}c} {R_{0} = \left( {\frac{\beta }{\mu } + \frac{\beta }{\gamma }} \right)\left( {1 - \delta } \right)} \\ \end{array} $$

(13)

With the new effective reproduction number given by:

$$ \begin{array}{*{20}c} {R_{t} = \left( {\frac{\beta }{\mu } + \frac{\beta }{\gamma }} \right)\left( {1 - \delta } \right)\frac{S\left( t \right)}{{N\left( t \right)}}} \\ \end{array} $$

(14)

This does not change any of the considerations above. If, however, $\delta$ is interpreted as a reduction in infectiousness due to face masks or other protective equipment, any variants that circumvent these measures more effectively also have an evolutionary advantage.

2.3 Isolation of symptomatic cases

Another widespread measure to slow the spread of COVID-19 is to isolate symptomatic cases (or to encourage self-isolation upon developing symptoms). If we assume that all symptomatic cases are isolated, this changes the laws of motion regarding S and E accordingly (i.e., individuals in compartment I do not any longer spread the virus), if we assume that this measure is not combined with uniform social distancing):

$$ \begin{array}{*{20}c} {\dot{S}\left( t \right) = - S\left( t \right)\beta \left( {\frac{P\left( t \right) + A\left( t \right)}{{N\left( t \right)}}} \right)} \\ \end{array} $$

(15)

$$ \begin{array}{*{20}c} {\dot{E}\left( t \right) = S\left( t \right)\beta \left( {\frac{P\left( t \right) + A\left( t \right)}{{N\left( t \right)}}} \right) - E\left( t \right)\alpha } \\ \end{array} $$

(16)

Adapting Eq. 8, the basic reproduction number is now:

$$ \begin{array}{*{20}c} {R_{0} = \frac{\beta }{\mu } + \frac{{\beta \left( {1 - \nu } \right)}}{\gamma }} \\ \end{array} $$

(17)

Accordingly, the effective reproduction number is:

$$ \begin{array}{*{20}c} {R_{t} = \left( {\frac{\beta }{\mu } + \frac{{\beta \left( {1 - \nu } \right)}}{\gamma }} \right)\frac{S\left( t \right)}{{N\left( t \right)}}} \\ \end{array} $$

(18)

In this case, the results regarding the evolutionary advantage of an increase in $\beta$, a decrease in $\gamma$ and $\mu$, an increase in $S\left( t \right)$ still hold. In addition to that, however, $R_{t}$ decreases with $\nu$ (i.e., the share of symptomatic infections) and increases with the pre-symptomatic phase $\mu$, even if the total duration of being infectious ($\mu + \gamma$) is constant.^{Footnote 2}

If the lethality depends mechanically on the share of symptomatic infections, as suggested by above formulation (i.e., the chance to survive a symptomatic infection $\lambda$ is unchanged by a change in the share of symptomatic infections), the evolutionary selection mechanism also favors less deadly variants as a side effect of favoring asymptomatic infections. If, however, the survival chance is independent from the share of symptomatic infections, lethality is unaffected by evolutionary selection.

3 Agent-based model

Due to the interactions between the different variants of the virus, the outcomes of a full model covering endogenous variants cannot be analyzed purely by differential equations. I thus implement this model as a simple open-source agent-based simulation model in NetLogo (Wilensky 1999)^{Footnote 3} to a) confirm the analytical predictions of the model behavior and b) explore aggregate dynamics under varying scenarios.

In order to capture mutations, the interplay between the variants, while at the same time keeping the analysis as simple as possible, I make the following assumptions:

1.
Each agent may only be infected by one virus variant simultaneously.
2.
Whenever an individual is infected, the virus may mutate, creating a new variant which is an offspring of the infecting (“parent”) variant.
3.
A mutation randomly changes the properties of the virus (infectiousness, latent period, share of asymptomatic infections, incubation period, disease duration, lethality) with the means given by the actual values of the parent variant (see Eq. 19).
4.
Each virus variant belongs has an antigenic cluster that may change during a mutation, i.e., a mutation may be coupled with an antigenic drift.^{Footnote 4}
5.
A previous infection within the same antigenic cluster provides perfect cross-immunity between variants. It may provide cross-immunity between antigenic clusters depending on the antigenic (evolutionary) distance between two antigenic clusters.
6.
Each agent has an equal probability to meet another agent (i.e., homogeneous mixing).

The following sequence of events occurs during each simulation step:

1.
Infected agents meet other agents and may infect them, a process in which mutations and antigenic drifts may occur (described in more detail in Sect. 3.1).
2.
The disease progresses, i.e., agents may become infectious, develop symptoms and become isolated (if such a containment policy is active), recover or die (see Sect. 3.2).
3.
Infection statistics are updated (see Sect. 3.3).

3.1 Infections, mutations and antigenic drifts

In order to save computational resources, the model only explicitly processes social contacts of infectious agents who are not isolated. This concerns agents who are asymptomatic (A in the notation chosen in Sect. 2 of this paper), pre-symptomatic (P) and—depending on policy—also symptomatic infectious (I). Each agent of these types randomly meets $\eta$ other agents who are alive. Each social contact of an agent infected with variant $m$ with another agent who is neither immune to variant m, nor currently infected with any variant, causes an infection with probability $i_{m}$.^{Footnote 5}

Each infection may cause a mutation with a constant probability $\phi$. In this case, the newly infected agent is the first carrier of the newly emerged variant. During each mutation, all properties of the virus and its associated disease, namely infectiousness, latent period, incubation period, total infection duration, share of asymptomatic infections and lethality, are subject to stochastic multiplicative change, where $h_{i,k}$ is the property k of the variant i, which is an offspring of variant j.^{Footnote 6}:

$$ \begin{array}{*{20}c} {h_{i,k} = \left( {1 + \omega_{i,k} } \right)h_{j,k} } \\ \end{array} $$

(19)

where $\omega_{k,i}$ ($\omega_{k,i}$ >−1) is drawn from a normal distribution with a mean of $\theta$ and standard deviation of $\sigma^{I}$.^{Footnote 7}

$$ \begin{array}{*{20}c} {\omega_{k,i} \sim N\left( {\theta ,\sigma^{I} } \right)} \\ \end{array} $$

(20)

The mean ($\theta$) is set to 0 in order not to presuppose any direction of evolution. Instead, all properties change stochastically and those changes, which prove to be advantageous, assert themselves endogenously in the competition against the other variants. Thus, “each slight variation, if useful, is preserved” Darwin (1859, 61).

A fraction $\kappa $ of mutations is coupled by an antigenic drift, i.e., able to evade some immunity. Following empirical research on the influenza virus (Smith et al. 2004), this is modeled by assuming that each variant belongs to an “antigenic cluster.” Whenever an antigenic drift (i.e., a new immune escaping mutation) occurs, every agent who is immune to its ancestor may become immune to the new antigenic cluster with the probability $\psi^{I}$.

Figure 2 shows an example “phylogenetic tree”, which plots the evolution of the virus into different variants. The arrows point from ancestor variants to their descendants. Each descendant differs slightly from its ancestor. Two antigenic drifts (at ${\text{V}}_{1.2}$ and ${\text{V}}_{1.1.2}$) created three distinct antigenic clusters which are colored differently in this figure in order to highlight them. An infection with a variant from the yellow antigenic cluster (e.g., ${\text{V}}_{1.1.2.1}$) provides perfect immunity against other variants belonging to the yellow antigenic cluster. The chance that it also provides cross-immunity against variants belonging to the white antigenic cluster is $\psi^{I}$, which in turn provides cross-immunity against the green cluster again with probability $\psi^{I}$.

3.2 Disease progression

Whenever an individual n is infected with virus m, the actual latent period $l_{n,m}$, incubation time $b_{n,m}$, and the disease duration $d_{n,m}$ (all ≥ 0) are drawn from a normal distribution with the means given by the viral attributes (latent period $l_{m}$, incubation time $b_{m}$, disease duration $d_{m}$) and a standard deviation which is a fraction $\sigma^{II}$ of the respective attribute:

$$ \begin{array}{*{20}c} {l_{n,m} \sim N\left( {l_{m} , l_{m} \sigma^{II} } \right)} \\ \end{array} $$

(21)

$$ \begin{array}{*{20}c} {b_{n,m} \sim N\left( {b_{m} , b_{m} \sigma^{II} } \right)} \\ \end{array} $$

(22)

$$ \begin{array}{*{20}c} {d_{n,m} \sim N\left( {d_{m} , d_{m} \sigma^{II} } \right)} \\ \end{array} $$

(23)

These values are then rounded to the next integer. During each simulation step, a counter (which is initialized with 0 for newly infected individuals) records the time passed since becoming infected. Once it reaches l_nm, the individual is infectious and can infect others. At $b_{nm}$, it develops symptoms with probability $v_{m}$ and may isolate itself. At $d_{m}$ it either dies with a probability given by the lethality rate $f_{m}$ or otherwise recovers and becomes immune against this antigenic cluster.^{Footnote 8} If an agent has acquired immunity against at least one other variant, it additionally benefits from a “cross protection” $\psi^{II}$ that aims to account for the fact that immunity against other strains (or a vaccination) reduce the lethality even if it cannot prevent an infection. In such a case, the probability of dying is given by $f_{m} \left( {1 - \psi^{II} } \right)$.

Using a recursive function, the agent may also acquire immunity against the ancestor and/or descendants of this antigenic cluster with probability $\psi^{I}$, as well as the ancestor and possible descendants of those antigenic clusters, to which the agent just acquired cross-immunity with the same probability et cetera.

3.3 3.3 Parameters

The simulation is initialized with the parameters described in Table 1.

Table 1 Parameters of the simulation

Full size table

This calibration allows for a basic reproduction number of the SARS-CoV2 wild type of 2.5 and a share of pre-symptomatic infections of 50% as estimated by the CDC (2021). The incubation time, as well as the share of symptomatic cases are also taken from CDC (2021). The cross protection against a lethal infection was derived from Abu-Raddad et al. (2021), who estimate the effectiveness of a vaccine against lethal COVID-19 to be 97.5%. All other parameters are set to replicate stylized facts of the virus, such as a low propensity of the virus to mutate (which is certainly higher in my model than in the real world, as my model is only populated by 10,000 agents), and an even lower propensity for an antigenic drift to occur. Thus, the simulation results should not be interpreted as an accurate quantitative prediction of what will happen, but as an explorative scenario analysis.

4 Simulation results

In order to analyze the properties of the model, I rely on Monte Carlo simulations. Specifically, I run each scenario 100 times with fixed random seeds for 500 periods. I then analyze the results using the programming language R (R Core Team 2018) with the ggplot2 package (Wickham 2016). In doing so, I want to (a) get an idea of the “mean” simulation result, but also (b) about the distribution of results and their statistical significance. I thus rely on quantile regressions to capture aggregate dynamics, and notched box plots to interpret cumulative outputs at a single simulation step.

4.1 Viral evolution

In this subsection, I give an overview of viral evolution under varying mutation parameters, and show how they are affected by social distancing. Detailed results concerning the evolution of each property are in line with the analytical predictions and presented in the Appendix.

Figures 3, 4, 5, and 6 show that:

(a)
Evolutionary selection causes the virus to evolve differently in the face of “smart” containment policies aimed at isolating symptomatic individuals.
(b)
Social distancing is generally able to curb viral evolution.
(c)
Contrary to b), levels of social distancing that bring the effective reproduction number of the wild type in a population inhabited by a very large share of susceptibles close to 1 can cause variants to evolve to higher fitness than lower levels of social distancing, if there is high cross-immunity between the antigenic clusters, as can be seen in the bottom right part of the figures.

In order to present these results concisely, I compute mean $R_{0,m}$ and $R_{0,m}^{adapted}$, which accounts for the isolation of infected, in the following way for the active strains m at time step 500,^{Footnote 9} where, following the notation introduced above, $\eta $ denotes the number of daily contacts, $i_{m}$ the infectiousness, $d_{m}$ the duration, $l_{m}$ the latent period, $v_{m}$ the probability of developing symptoms, $b_{m}$ the incubation time. Please note that the latent period has to be deducted from the disease duration, as $d_{m}$ covers both the infectious and the pre-infectious (“latent”) period:

$$ R_{0,m} = \eta i_{m} (d_{m} - l_{m} ) $$

(24)

$$ R_{0,m}^{adapted} = (1 - v_{m} )\eta i_{m} (d_{m} - l_{m} ) + v_{m} \eta i_{m} (b_{m} - l_{m} ) $$

(25)

Figures 3 and 4 show the mean $R_{0}$ for scenarios in which symptomatic individuals are not isolated or isolated, respectively. We can see that isolating individuals exhibiting symptoms causes the virus to die out in more scenarios and to be less fit in the others with regard to $R_{0}$.

Figures 5 and 6 show the evolution of the mean relative adapted $R_{0}^{adapted}$, i.e. $\frac{{R_{0}^{adapted} }}{{R_{0} }}$ which describes how efficient isolation policies are in curbing the spread of the virus. If the relative adapted $R_{0}^{adapted}$ is equal to one, isolation of symptomatic individuals does not curb the spread of the virus at all. While isolation policies tend to become more efficient if they are not enacted (see Fig. 5), adaptive evolution causes them to become less efficient, if they are enacted (see Fig. 6).

4.2 The public health impact of an evolving virus

This subsection discusses the public health outcomes of an evolving virus and to which extent they can be used to identify the propensity of the virus to mutate. Figures 7 and 8 show public health outcomes in the first 100 time steps of the simulation, which cover (most of) the first wave for almost all scenarios.

There is no visible difference in mortality between the scenarios (assuming a 99% cross protection against a lethal infection) for a given level of social distancing. The number of infected agents is more sensitive to changes, as the second wave of infection is visible for scenarios without any cross-immunity between antigenic clusters and very low levels of social distancing. In more realistic scenarios, however, we also cannot distinguish between the scenarios with regard to the mutation chance and the cross immunity between antigenic clusters by looking only at public health outcomes of the first 100 days.

Things are different, however, if we look at a longer time horizon. Figure 9 shows the share of infected agents for the first 500 simulation steps, and Figs. 10 and 11 show the mortality rate for the same time horizon and cross-protection levels of 99% and 90%, respectively. Depending on the levels of cross-protection, the mortality rate added in the “endemic” phase can even surpass the mortality during the first wave.

Moreover, the marginal burden associated with not preventing a full-scale outbreak (i.e., the difference between 50 and 60% social distancing in our scenarios) may drastically increase even in a scenario with high cross-protection and moderate cross-immunity, as mutations and antigenic drifts increase the chances of each individual to become infected at least once in their lifetime.

4.3 Genetic and antigenic variation

Figures 12 and 13 show that the genetic and antigenic evolution of the virus are closely connected in my model: Fig. 8 shows the evolution of the maximum antigenic distance to the wild-type, where two antigenic clusters with an antigenic distance of 1 are separated by only one antigenic drift. Figure 9 shows the mean phylogenetic distance of all active variants to the wild-type, where two variants have a phylogenetic distance of 1 if they are separated by only one mutation.

5 Conclusion

I developed a simple theoretical model to study the genetic and antigenic evolution of a virus under varying containment scenarios in a stylized way. The properties of the wild-type (i.e., the first variant of the virus prior to any mutations) are calibrated to resemble (infections with) the SARS-CoV2 virus causing COVID-19. Despite its limitations, some of which I outline below, I derived several crucial insights from my model that are empirically testable:

First, containment policies have an impact on the speed of the evolution of the virus. All containment policies that successfully curb the number of infections reduce the number of mutations and thus also the rate at which the virus increases its fitness. If cross-immunity is high, however, the ultimate fitness of a virus may be higher if it circulates in a population engaging in medium-level social distancing than in a population not engaging in social distancing at all due to a slower, but longer, evolution of the virus.

Second, containment policies may also affect the direction of viral evolution. Namely, if symptomatic individuals are isolated, viral fitness increases with the incubation time and the share of asymptomatic infections. Those traits then assert themselves via the endogenous process of evolutionary selection, thus making “smart” isolation policies less effective over time.

Third, it is often not possible to distinguish between an “endemic” scenario, in which variations of the virus persist by continuously evolving and escaping immunity and a non-endemic scenario by looking only at public health outcomes during the first wave of infections. What seems to be a successful “herd immunity” strategy may plant the seeds for a long-term presence of a potentially lethal virus. Thus, it is crucial to monitor the number and severity of reinfections already in the early phases of an epidemic in order to assess a virus’ potential to become endemic and optimally design containment policies accordingly.

My model is limited in various ways: First, it assumes that each property of the virus and associated disease may be subject to the same process of stochastic change, which is an obvious stylization. Second, it assumes a lethality rate, and, more generally, all properties of the disease which are uniform across the population. Relaxing this assumption in order to account for e.g., an age-dependent severity of the disease, may influence the results by reducing mortality even in case of lower levels of cross-protection gained from a past infection (or vaccination). Third, my model does not consider behavioral heterogeneity within the population (Mellacher 2021b) or endogenous changes in the social distancing behavior of the population (Lux 2021; Proaño and Makarewicz 2021). Fourth, my model purely concentrates on public health outcomes and thus does not consider any societal or economic impact of social distancing.

Further research on this topic could go in several directions: First, it could address the limitations of this model by including endogenous viral evolution into a larger economic-epidemiological agent-based model (e.g., Basurto et al. 2020; Delli Gatti and Reissl 2020; Mellacher 2020). Second, one could aim to calibrate the parameters governing the viral evolution with empirical data in order to provide more accurate empirical forecasts. Third, this simple approach could be extended to incorporate crucial aspects of viral evolution that are not yet covered in these large agent-based models, but seem to play an important role empirically, such as vaccination campaigns or the spread of the virus in a multi-country world.

Code availability

The Code of the simulation model and a graphical user interface is available at https://github.com/patrickmellacher/viralmutations.

Notes

Please note that reducing infectiousness is only a perfect substitute to reducing social contacts in the standard homogeneous mixing SIR-framework. Gutin et al. (2021) develop a network-based SIR model and show that the effect of social distancing depends on the network structure of the social interactions.
Please note that this result holds even if we would assume that asymptomatic individuals are less infectious due to biological reasons (for instance, because coughing individuals spread the virus faster). In order to test this, replace the rate of infectious contacts $\beta$ with $\beta_{X}$ for asymptomatic and pre-symptomatic individuals and with $\beta_{Y}$ for symptomatic individuals. If all symptomatic individuals are isolated, the effective reproduction number becomes $R_{t} = \left( {\frac{{\beta_{X} }}{\mu } + \frac{{\beta_{X} \left( {1 - \nu } \right)}}{\gamma }} \right)\frac{S\left( t \right)}{{N\left( t \right)}}$. This does not affect our results and such a change does thus not result in a trade-off for the direction of viral evolution. If symptomatic individuals are not isolated, however, assuming that $\beta_{X} < \beta_{Y}$ would redirect viral evolution toward more symptomatic infections and a shorter incubation period.
The model features a Graphical User Interface and can be obtained at BLINDED.
In the case of the (well researched) influenza viruses, the genetic evolution is more gradual (and less punctuated) than the antigenic evolution, and influenza variants (strains) thus group into antigenic clusters (See Smith et al. 2004).
Please note that it is not necessary to disentangle these two effects in a classical SIR-type model as, on average, the following condition holds $\eta \varphi = \beta$, where $\beta$ denotes the number of infectious contacts per period as defined in Sect. 2.
This approach follows a simple multiplicative approach (e.g., Miller et al. 2018) for each dimension determining the fitness separately, as the fitness landscape is single-peaked or flat in each dimension.
In the model code, $\omega_{k,i}$ is set to be − 0.99 at minimum in order to avoid any negative values for $h_{i,k}$ even for extreme parameters of the random distribution.
Please note that under this specification, the lethality rate does not depend on the probability of developing symptoms.
If the virus died out before time step 500, it computes the last surviving strain.

References

Basurto A, Dawid H, Harting P, Hepp J, Kohlweyer D (2021) How to design virus containment policies? A joint analysis of economic and epidemic dynamics under the COVID-19 pandemic. Bielefeld working papers in economics and management no. 06–2021
Bernal JL, Andrews N, Gower C, Gallagher E, Simmons R, Thelwall S, Ramsay M (2021) Effectiveness of COVID-19 vaccines against the B. 1.617. 2 variant. N Engl J Med 385:585–594
Article Google Scholar
Buckee C, Danon L, Gupta S (2007) Host community structure and the maintenance of pathogen diversity. Proc Roy Soc B Biol Sci 274(1619):1715–1721
Google Scholar
Cao S, Feng P, Wang W, Shi Y, Zhang J (2021) Small-world effects in a modified epidemiological model with mutation and permanent immune mechanism. Nonlinear Dyn, pp 1–16
CDC (2021) Covid-19 pandemic planning scenarios. https://www.cdc.gov/coronavirus/2019-ncov/hcp/planning-scenarios.html (download on 2nd of July 2021)
Chen J, Wang R, Wang M, Wei GW (2020) Mutations strengthened SARS-CoV-2 infectivity. J Mol Biol 432(19):5212–5226
Article Google Scholar
Darwin C (1859) On the origin of species by means of natural selection, or the preservation of favoured races in the struggle for life. John Murray, London
Book Google Scholar
Dearlove B, Lewitus E, Bai H, Li Y, Reeves DB, Joyce MG, Rolland M (2020) A SARS-CoV-2 vaccine candidate would likely match all currently circulating variants. Proc Natl Acad Sci 117(38):23652–23662
Article Google Scholar
Delli Gatti D, Reissl S (2020) ABC: an agent based exploration of the macroeconomic effects of Covid-19. CESifo Working Paper No. 8763
Dignum F, Dignum V, Davidsson P, Ghorbani A, van der Hurk M, Jensen M, Verhagen H (2020) Analysing the combined health, social and economic impacts of the corovanvirus pandemic using agent-based social simulation. Mind Mach 30(2):177–194
Article Google Scholar
Fanti L (2021) ‘Kaldor Facts’ and the decline of Wage Share: An agent based-stock flow consistent model of induced technical change along Classical and Keynesian lines. J Evol Econ 31(2):379–415
Article Google Scholar
Gabler J, Raabe T, Röhrl K, von Gaudecker HM (2021) The effectiveness of strategies to contain SARS-CoV-2: testing, vaccinations, and NPIs. arXiv preprint arXiv:2106.11129
Gordo I, Gomes MGM, Reis DG, Campos PR (2009) Genetic diversity in the SIR model of pathogen evolution. PLoS ONE 4(3):e4876
Article Google Scholar
Griffin A, Roberts GO, Spencer SE (2020) An epidemic model for an evolving pathogen with strain-dependent immunity. Math Biosci 330:108480
Article Google Scholar
Gurevich Y, Ram Y, Hadany L (2021) Modeling the evolution of SARS-CoV-2 under non-pharmaceutical interventions. medRxiv
Gutin G, Hirano T, Hwang SH, Neary PR, Toda AA (2021) The effect of social distancing on the reach of an epidemic in social networks. J Econ Interac Coord 16:629–647
Article Google Scholar
Halley JM, Vokou D, Pappas G, Sainis I (2021) Evolving SARS-CoV-2 variants and mutational cascades. medRxiv
Hoffmann M, Hofmann-Winkler H, Krüger N, Kempf, A, Nehlmeier I, Graichen L, Pöhlmann S (2021a) SARS-CoV-2 variant B. 1.617 is resistant to Bamlanivimab and evades antibodies induced by infection and vaccination. Cell Reports, 109415
Hoffmann M, Arora P, Groß R, Seidel A, Hörnich BF, Hahn AS, Pöhlmann S (2021b) SARS-CoV-2 variants B. 1.351 and P. 1 escape from neutralizing antibodies. Cell 184(9):2384–2393
Hötte K (2020) How to accelerate green technology diffusion? Directed technological change in the presence of coevolving absorptive capacity. Energy Econ 85:104565
Article Google Scholar
Kerr CC, Stuart RM, Mistry D, Abeysuriya RG, Rosenfeld K, Hart GR, (2021) Covasim: an agent-based model of COVID-19 dynamics and interventions. MedRxiv, 2020–05
Korber B, Fischer WM, Gnanakaran S, Yoon H, Theiler J, Abfalterer W, Montefiori DC (2020) Tracking changes in SARS-CoV-2 Spike: evidence that D614G increases infectivity of the COVID-19 virus. Cell 182(4):812–827
Article Google Scholar
Lasser J, Zuber J, Sorger J, Klager E, Kletečka-Pulker M, Willschke H (2020) Agent-based simulations for optimized prevention of the spread of SARS-CoV-2 in nursing homes. arXiv preprint arXiv:2104.00550
Lee EK, Chen CH, Pietz F, Benecke B (2009) Modeling and optimizing the public-health infrastructure for emergency response. Interfaces 39(5):476–490
Article Google Scholar
Lux T (2021) The social dynamics of COVID-19. Physica a: Stat Mech Appl 567:125710
Article Google Scholar
Ma T, Nakamori Y (2005) Agent-based modeling on technological innovation as an evolutionary process. Eur J Oper Res 166(3):741–755
Article Google Scholar
Marquioni VM, de Aguiar MA (2021) Modeling neutral viral mutations in the spread of SARS-CoV-2 epidemics. PLoS ONE 16(7):e0255438
Article Google Scholar
McCarthy KR, Rennick LJ, Nambulli S, Robinson-McCarthy LR, Bain WG, Haidar G, Duprex WP (2021) Recurrent deletions in the SARS-CoV-2 spike glycoprotein drive antibody escape. Science 371(6534):1139–1142
Article Google Scholar
Mellacher P (2021a) What if Merkel had acted like Johnson against Covid-19? Investigación Económica 80(317):82–108
Article Google Scholar
Mellacher P, Scheuer T (2021) Wage inequality, labor market polarization and skill-biased technological change: an evolutionary (agent-based) approach. Comput Econ 58:233–278
Article Google Scholar
Mellacher P (2020) COVID-town: an integrated economic-epidemiological agent-based model. GSC Discussion Paper Series No. 23
Mellacher P (2021b) The impact of corona populism: empirical evidence from Austria and theory. GSC Discussion Paper Series No. 24
Miller CR, Van Leuven JT, Wichman HA, Joyce P (2018) Selecting among three basic fitness landscape models: additive, multiplicative and stickbreaking. Theor Popul Biol 122:97–109
Article Google Scholar
Nelson RR, Winter SG (1982) An evolutionary theory of economic change. Harvard University Press
Google Scholar
Pageaud S, Ponthus N, Gauchon R, Pothier C, Rigotti C, Eyraud-Loisel A (2021) Adapting French COVID-19 vaccination campaign duration to variant dissemination. medRxiv.
Phillips N (2021) The coronavirus is here to stay-here’s what that means. Nature 590(7846):382–384
Article Google Scholar
Proaño CR, Makarewicz T (2021) Belief-driven dynamics in a behavioral SEIRD macroeconomic model with sceptics. CAMA Working Paper 51/2021
R Core Team (2018) R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL: https://www.R-project.org/
Rella SA, Kulikova YA, Dermitzakis ET, Kondrashov FA (2021) Rates of SARS-CoV-2 transmission and vaccination impact the fate of vaccine-resistant strains. Sci Rep 11(1):1–10
Article Google Scholar
Roche B, Drake JM, Rohani P (2011) An Agent-Based Model to study the epidemiological and evolutionary dynamics of Influenza viruses. BMC Bioinform 12(1):1–10
Article Google Scholar
Rüdiger S, Plietzsch A, Sagués F, Sokolov IM, Kurths J (2020) Epidemics with mutating infectivity on small-world networks. Sci Rep 10(1):1–11
Article Google Scholar
Silva PC, Batista PV, Lima HS, Alves MA, Guimarães FG, Silva RC (2020) COVID-ABS: An agent-based model of COVID-19 epidemic to simulate health and economic effects of social distancing interventions. Chaos Solitons Fractals 139:110088
Article Google Scholar
Smith DJ, Lapedes AS, De Jong JC, Bestebroer TM, Rimmelzwaan GF, Osterhaus AD, Fouchier RA (2004) Mapping the antigenic and genetic evolution of influenza virus. Science 305(5682):371–376
Article Google Scholar
Vermeulen B, Müller M, Pyka A (2020) Social network metric-based interventions? Experiments with an agent-based model of the COVID-pandemic in a metropolitan region. J Artif Soc Soc Simul 24(3):6
Article Google Scholar
Wall EC, Wu M, Harvey R, Kelly G, Warchal S, Sawyer C, (2021) Neutralising antibody activity against SARS-CoV-2 VOCs B. 1.617. 2 and B. 1.351 by BNT162b2 vaccination. The Lancet 397(10292):2331–2333.
Wallentin G, Kaziyeva D, Reibersdorfer-Adelsberger E (2020) COVID-19 intervention scenarios for a long-term disease management. Int J Health Policy Manag 9(12):508
Google Scholar
Wickham H (2016) ggplot2: Elegant Graphics for Data Analysis. Springer, New York. ISBN 978–3–319–24277–4, https://ggplot2.tidyverse.org.
Wilensky U (1999) NetLogo. http://ccl.northwestern.edu/netlogo/
Wilke CO, Ronnewinkel C, Martinetz T (2001) Dynamic fitness landscapes in molecular evolution. Phys Rep 349(5):395–446
Article Google Scholar
Williams BJ, St-Onge G, Hébert-Dufresne L (2021) Localization, epidemic transitions, and unpredictability of multistrain epidemics with an underlying genotype network. PLoS Comput Biol 17(2):e1008606
Article Google Scholar
Yuan M, Huang D, Lee CCD, Wu NC, Jackson AM, Zhu X, Wilson IA (2021) Structural and functional ramifications of antigenic drift in recent SARS-CoV-2 variants. Science. https://doi.org/10.1126/science.abh1139
Article Google Scholar

Download references

Acknowledgements

I thank two anonymous referees and the editor, Prof. Thomas Lux, for their valuable and timely feedback, as well as their helpful suggestions for improvement. I also thank the participants of the 28th PhD/PostDoc ABM Webinar and the COLIBRI Day 2021 for their comments. All errors are mine.

Funding

Open access funding provided by University of Graz.

Author information

Authors and Affiliations

Graz Schumpeter Centre, University of Graz, Graz, Austria
Patrick Mellacher

Authors

Patrick Mellacher
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Patrick Mellacher.

Ethics declarations

Conflict of interest

None.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

This appendix shows the mean viral properties of surviving variants at simulation step 500 (or of those variants which died out last). Unless stated explicitly, these simulations cover the results of simulations without isolation of symptomatic individuals. These simulation results are in line with the analytical predictions presented in Sect. 2. See Figs.

14,

15,

16,

17,

18,

19,

20, and

21.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Mellacher, P. Endogenous viral mutations, evolutionary selection, and containment policy design. J Econ Interact Coord 17, 801–825 (2022). https://doi.org/10.1007/s11403-021-00344-3

Download citation

Received: 12 August 2021
Accepted: 14 December 2021
Published: 07 January 2022
Issue Date: July 2022
DOI: https://doi.org/10.1007/s11403-021-00344-3

Keywords

JEL Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Endogenous viral mutations, evolutionary selection, and containment policy design

Abstract

Similar content being viewed by others

When might host heterogeneity drive the evolution of asymptomatic, pandemic coronaviruses?

Mutation induced infection waves in diseases like COVID-19

Deconvolving mutational patterns of poliovirus outbreaks reveals its intrinsic fitness landscape

1 Introduction

2 SEPAIRD model and analytical results