Model
We modeled SARS-CoV-2 transmission according to a system of ordinary differential equations,
$$\begin{aligned} \begin{aligned} \frac{\text {d}S}{\text {d}t}&= \mu - \left( \delta + \beta (1-u) (\alpha A + I + H) + \iota + \nu \right) S \\ \frac{\text {d}E}{\text {d}t}&= \left( \beta (1-u) (\alpha A + I + H) \right) \left( S + (1-\epsilon ) V\right) + \iota S - \left( \delta + \rho \right) E \\ \frac{\text {d}A}{\text {d}t}&= (1-\sigma ) \rho E - \left( \delta + \gamma \right) A \\ \frac{\text {d}I}{\text {d}t}&= \sigma \rho E - \left( \delta + \gamma \right) I \\ \frac{\text {d}H}{\text {d}t}&= \gamma \kappa I - \left( \delta + \eta \right) H \\ \frac{\text {d}V}{\text {d}t}&= \nu S - \left( \delta + \beta (1-u) (\alpha A + I + H) (1-\epsilon ) \right) V \end{aligned} \end{aligned}$$
(1)
with variables and parameters defined in Tables 1 and 2.
Table 1 State variables in the model In the model, all individuals are initially susceptible, S, with infections introduced at a constant rate, \(\iota \), from outside the population. Individuals then transition to the exposed class, E, where they reside for an average of \(\rho ^{-1}\) days. A proportion \(\sigma \) experience a symptomatic infection, I. The remainder experience an asymptomatic infection, A, and have a fraction, \(\alpha \), of the infectiousness of symptomatic infections. Individuals reside in the I and A classes for an average of \(\gamma ^{-1}\) days. All asymptomatic infections and a proportion \(1-\kappa \) of symptomatic infections then recover and become fully immune to subsequent infection. The remaining proportion \(\kappa \) of symptomatic infections transition to the hospitalized class, H, from which they exit through either recovery or death after an average of \(\eta ^{-1}\) days.
Rather than track deaths due to COVID-19, D(t), as a state variable in Eq. (1), we assume that they follow directly from hospitalizations, H(t), according to
$$\begin{aligned} D(t) = \left\{ \begin{array}{ll} \eta H(t) \varDelta _-, &{} H(t)\le H_\text {max} \\ \eta H(t) \left( \varDelta _- + (\varDelta _+ - \varDelta _-) \left( 1 - e ^ {h (H(t) - H_\text {max})} \right) \right) , &{} H(t) > H_\text {max} \\ \end{array} \right. . \end{aligned}$$
This results in the probability of death moving beyond a minimum of \(\varDelta _-\) toward a maximum of \(\varDelta _+\) as H(t) exceeds \(H_\text {max}\), as illustrated in Fig. 1. The motivation for this choice is to account for the possibility that patients could experience increased mortality when the demand for certain resources, such as intensive care unit beds or ventilators, exceeds their availability. One other optimal control analysis of COVID-19 has incorporated a similar phenomenon (Piguillem and Shi 2020), albeit with a different functional form.
Individuals who are recovered and immune are not followed explicitly, as the model’s assumption of density-dependent transmission only requires specification of susceptible and infectious classes in the transmission term. Due to our assumption that rates of birth, \(\mu \), and death due to reasons other than COVID-19, \(\delta \), are equal, changes in population size over the course of the epidemic are modest, making the distinction between density- and frequency-dependent transmissions negligible.
The primary form of control in the model is achieved through the variable u, which represents a proportional reduction in the transmission coefficient, \(\beta \). As is standard for mass-action models of directly transmitted pathogens, \(\beta \) reflects the product of the rate at which susceptible and infectious individuals come into contact and the probability of transmission given that a contact has occurred (Keeling and Rohani 2007). Thus, a wide range of non-pharmaceutical interventions could result in changes in u, including school closures, work from home policies, and shelter in place mandates, as well as more targeted approaches, such as isolation based on self-awareness of symptoms or contact tracing. In addition to control through u, the V class represents individuals who have been vaccinated, with individuals entering V from S at rate \(\nu \) beginning on day \(\tau _\nu \). We assume that vaccination may not provide complete protection, resulting in vaccinated individuals becoming infected at a fraction \(1-\epsilon \) of the rate at which fully susceptible individuals become infected.
Basic Reproduction Number, \(R_0\)
At its core, the behavior of this model is similar to that of an SEIR model with demography. Because analyses of the transient and asymptotic properties of this class of models are plentiful in textbooks and elsewhere (e.g., Keeling and Rohani 2007), we omit such an analysis here. We do, however, derive a formula to express the basic reproduction number, \(R_0\), as a function of model parameters. This relationship plays a role in how we parameterize the model.
We use the next-generation method (van den Driessche and Watmough 2008) to obtain a formula describing \(R_0\) as a function of model parameters. This method depends on matrices \({\mathcal {F}}\) and \({\mathcal {V}}\), whose elements are defined as the rates at which secondary infections increase the ith compartment and the rates at which disease progression, death, and recovery decrease the ith compartment, respectively. For “disease compartments” E, A, I, and H, these matrices are defined as
$$\begin{aligned} \begin{aligned} {\mathcal {F}}&= \begin{bmatrix} \beta (\alpha A + I + H)S\\ 0\\ 0\\ 0 \end{bmatrix}\\ {\mathcal {V}}&= \begin{bmatrix} (\delta + \rho )E\\ -\,(1- \sigma )\rho E + (\delta + \gamma )A\\ -\,\sigma \rho E + (\delta + \gamma )I\\ -\,\gamma \kappa I + (\delta + \eta )H \end{bmatrix} . \end{aligned} \end{aligned}$$
(2)
These matrices are then used to define two others,
$$\begin{aligned} F= & {} \frac{\partial {\mathcal {F}}_i}{\partial x_j}(0,y_0) = \begin{bmatrix} 0 &{}\quad \alpha \beta &{} \quad \beta &{}\quad \beta \\ 0&{}\quad 0&{}\quad 0&{}\quad 0\\ 0&{}\quad 0&{}\quad 0&{}\quad 0\\ 0&{}\quad 0&{}\quad 0&{}\quad 0 \end{bmatrix}\nonumber \\ V= & {} \frac{\partial {\mathcal {V}}_i}{\partial x_j}(0,y_0) = \begin{bmatrix} \delta + \rho &{}\quad 0&{}\quad 0&{}\quad 0\\ -\,\rho (1 - \sigma )&{}\quad \delta + \gamma &{}\quad 0&{}\quad 0\\ -\,\rho \sigma &{}\quad 0&{}\quad \delta + \gamma &{}\quad 0\\ 0&{}\quad 0&{}\quad -\,\gamma \kappa &{}\quad \delta + \eta \end{bmatrix} , \end{aligned}$$
(3)
where x represents disease compartments and y non-disease compartments. The inverse of V is
$$\begin{aligned} V^{-1} = \begin{bmatrix} \frac{1}{\delta + \rho } &{}\quad 0 &{}\quad 0 &{}\quad 0\\ \frac{\rho (1 - \sigma )}{(\delta + \gamma )(\delta + \rho )}&{}\quad \frac{1}{\delta + \gamma } &{}\quad 0 &{}\quad 0\\ \frac{\rho \sigma }{(\delta + \gamma )(\delta + \rho )} &{}\quad 0 &{}\quad \frac{1}{\delta + \gamma } &{}\quad 0\\ \frac{\kappa \gamma \rho \sigma }{(\delta + \gamma )(\eta + \delta )(\delta + \rho )} &{}\quad 0 &{}\quad \frac{\kappa \gamma }{(\delta + \gamma )(\eta + \delta )} &{}\quad \frac{1}{\eta + \delta } \end{bmatrix} , \end{aligned}$$
(4)
which, along with F, defines the matrix \(K = FV^{-1}\). Specifying
$$\begin{aligned} K = \beta \begin{bmatrix} \frac{(1-\sigma )\alpha \rho (\delta +\eta ) + \rho \sigma (\delta + \eta ) + \kappa \gamma \rho \sigma }{(\delta +\gamma )(\delta +\rho )(\delta +\eta )} &{}\quad \frac{\alpha }{\delta + \gamma } &{}\quad \frac{\eta + \delta + \kappa \gamma }{(\delta + \gamma )(\eta + \delta )} &{}\quad \frac{1}{\eta + \delta }\\ 0 &{}\quad 0 &{}\quad 0 &{}\quad 0\\ 0 &{}\quad 0 &{}\quad 0 &{}\quad 0\\ 0 &{}\quad 0 &{} \quad 0 &{}\quad 0 \end{bmatrix}, \end{aligned}$$
(5)
we obtain
$$\begin{aligned} R_0 = \frac{\beta \rho \left( (1-\sigma )\alpha (\delta +\eta ) + \sigma (\delta + \eta ) + \sigma \kappa \gamma \right) }{(\delta +\gamma )(\delta +\rho )(\delta +\eta )} \end{aligned}$$
(6)
as the maximum eigenvalue of K.
Optimal Control Problem
The optimal control problem is to minimize the objective functional
$$\begin{aligned} J(u) = \int _{t_0}^{t_1} D(t)^2 + c \, u(t)^2 \text {d}t \end{aligned}$$
(7)
subject to the constraints of the state dynamics described in Eq. (1) and the initial conditions that \(S(t_0)=1\) and all other state variables equal zero at time \(t_0\). The parameter c weights the extent to which the control, u(t), is prioritized for minimization relative to deaths, D(t). Squared terms for D(t) and u(t) are chosen both for mathematical convenience in the case of u(t) and to more heavily penalize solutions for u(t) that permit relatively high values of D(t) or u(t) at any given time.
To find the optimal control, \(u^*(t)\), that minimizes J(u), we follow standard results from optimal control theory applied to systems of ordinary differential equations (Lenhart and Workman 2007). These techniques make use of Pontryagin’s Maximum Principle to determine the pointwise minimum of the Hamiltonian of the system, \({\mathcal {H}}\), using adjoint variables, \(\lambda \), that correspond to each of the state variables.
To solve for the adjoint variables, we first define the Hamiltonian,
$$\begin{aligned} {\mathcal {H}}= & {} D^2 + c \, u^2 + \lambda _S \left( \mu - \left( \delta + \beta (1-u) (\alpha A + I + H) + \iota + \nu \right) S \right) \nonumber \\&+\, \lambda _{E} \left( \beta (1-u) (\alpha A + I + H) \left( S + (1-\epsilon ) V\right) + \iota S - \left( \delta + \rho \right) E \right) \nonumber \\&+ \,\lambda _A \left( (1-\sigma )\rho E - \left( \delta + \gamma \right) A \right) + \lambda _I \left( \sigma \rho E - \left( \delta + \gamma \right) I \right) \nonumber \\&+\, \lambda _H \left( \gamma \kappa I - \left( \delta + \eta \right) H \right) \nonumber \\&+ \,\lambda _V \left( \nu S - \left( \delta + \beta (1-u) (\alpha A + I + H) (1-\epsilon ) \right) V \right) . \end{aligned}$$
(8)
We then define differential equations describing the behavior of each adjoint variable as the negative of the partial derivative of \({\mathcal {H}}\) with respect to the state variable corresponding to each adjoint variable. This yields
$$\begin{aligned} \begin{aligned} \frac{\text {d}\lambda _S}{\text {d}t} =\,&\lambda _S \left( \delta + \beta (1-u) (\alpha A + I + H) + \iota + \nu \right) \\&- \lambda _{E} \left( \beta (1-u) (\alpha A + I + H) + \iota \right) - \lambda _V (\nu ) \\ \frac{\text {d}\lambda _{E}}{\text {d}t} =\,&\lambda _{E} \left( \delta + \rho \right) - \lambda _A \left( (1-\sigma )\rho \right) - \lambda _I \left( \sigma \rho \right) \\ \frac{\text {d}\lambda _A}{\text {d}t} =\,&\lambda _S \left( \beta (1-u) \alpha S \right) - \lambda _{E} \left( \beta (1-u) \alpha (S+ (1-\epsilon ) V) \right) \\&+ \lambda _A (\delta + \gamma ) + \lambda _V \left( \beta (1-u) \alpha (1-\epsilon ) V \right) \\ \frac{\text {d}\lambda _I}{\text {d}t} =\,&\lambda _S \left( \beta (1-u) S \right) - \lambda _{E} \left( \beta (1-u) (S+ (1-\epsilon ) V) \right) \\&+ \lambda _I \left( \delta + \gamma \right) - \lambda _H \left( \gamma \kappa \right) + \lambda _V \left( \beta (1-u) (1-\epsilon ) V \right) \\ \frac{\text {d}\lambda _H}{\text {d}t} =&- \frac{\partial D^2}{\partial H} + \lambda _S \left( \beta (1-u)S \right) - \lambda _{E} \left( \beta (1-u) (S+ (1-\epsilon ) V) \right) \\&+ \lambda _H \left( \delta + \eta \right) + \lambda _V \left( \beta (1-u)(1-\epsilon )V \right) \\ \frac{\text {d}\lambda _V}{\text {d}t} =&-\lambda _{E} \left( \beta (1-u)(\alpha A+I+H)(1-\epsilon ) \right) \\&+\lambda _V \left( \delta + \beta (1-u)(\alpha A+I+H)(1-\epsilon ) \right) , \end{aligned} \end{aligned}$$
(9)
where
$$\begin{aligned} \frac{\partial D^2}{\partial H} = \left\{ \begin{array}{ll} 2 (\eta \varDelta _-)^2 H, &{} H \le H_\text {max} \\ 2 H ^{-1} D ^ 2 - 2 \eta h H D (\varDelta _+ - \varDelta _-) e^{h(H-H_\text {max})}, &{} H > H_\text {max} \\ \end{array} \right. . \end{aligned}$$
Equation (9) can be solved backward in time with transversality conditions at time \(t_1\) equal to zero for each adjoint variable.
To find the pointwise optimal control, \(u^*\), we find the value of u that minimizes \(\frac{\partial {\mathcal {H}}}{\partial u}\), which yields
$$\begin{aligned} u^* = \frac{-\beta (\alpha A+I+H)\left( \lambda _S S + \lambda _V (1-\epsilon ) V - \lambda _E (S + (1-\epsilon ) V \right) }{2c} . \end{aligned}$$
(10)
The optimal control is subject to a lower bound of 0 and an upper bound of \(u_\text {max}\), with those values used for \(u^*\) whenever the right-hand side of Eq. (10) yields values outside those bounds.
To find \(u^*(t)\) numerically, we use the forward–backward sweep method (Lenhart and Workman 2007), which involves first solving for the state variables forward in time, next solving for the adjoint variables backward in time, and then plugging the solutions for the relevant state and adjoint variables into Eq. (10), subject to bounds on u(t). As is often the case for optimal control problems (Lenhart and Workman 2007), we found that we needed to perform these steps iteratively and with a convex combination of controls across iterations to achieve convergence. Specifically, we repeated the forward–backward sweep process 50 times, after which we took newly proposed solutions of \(u^*(t)\) as the average of the 20 most recent solutions, until the algorithm was stopped after 2000 iterations. We assessed convergence with a statistic defined as
$$\begin{aligned} \frac{\sum _{\forall t} |u_\text {new}-u_\text {old}|}{\sum _{\forall t} |u_\text {new}|} , \end{aligned}$$
(11)
where smaller values indicate better convergence. All numerical solutions were obtained with a Runge–Kutta 4 routine implemented with the ode function from the deSolve package (Soetaert et al. 2010) in R.
Model Parameterization
In Table 2, we specify low, intermediate, and high values of each parameter. For some parameters, estimates matching our parameter definitions were taken from other studies. For other parameters, additional steps were necessary to match our parameter definitions. We elaborate on the latter below.
Transmission coefficient, \(\beta \) We based values of this parameter on assumptions about \(R_0\) by solving for \(\beta \) as a function of \(R_0\) and other parameters in Eq. (6). Because estimates of \(R_0\) for COVID-19 vary widely (Park et al. 2020), we chose values that span a range of estimates that may be applicable to the USA.
Background birth and death rates, \(\mu \) and \(\delta \) We parameterized \(\mu \) consistent with a rate of 3,791,712 births in a population of 331 million in the USA in 2018 (Martin et al. 2019). To achieve a constant population size in the absence of COVID-19, we set \(\delta \) equal to \(\mu \).
Probability of death among hospitalized cases, \(\varDelta \) and h We assume that \(\varDelta _-\) is equal to early estimates from the USA (2.6%) (Centers for Disese Control and Prevention 2020) and that \(\varDelta _+\) is equal to estimates from Italy (7.2%) (Onder et al. 2020). Estimates of how quickly \(\varDelta _+\) might be approached have not been made empirically, so the value of h was assumed. The intermediate value of \(h=701\) corresponds to an increase in H of 50% beyond \(H_\text {max}\) resulting in 50% of the maximum increase from \(\varDelta _-\) to \(\varDelta _+\).
Progression through hospitalization, \(\eta \) We used line-list data [1] to estimate a mean (13.2 days) and standard deviation (7.39 days) of the time between hospital admission and either discharge or death.
Timing of vaccine introduction, \(\tau _\nu \) Experts have stated that a vaccine against COVID-19 could be available to the public by spring 2021 (Amanat and Krammer 2020). We used April 1, 2021, as our default value for this parameter.
Vaccination rate, \(\nu \) In the 2009 H1N1 pandemic, 100 million doses of vaccine were administered between October 2009 and April 2010 [8,9]. Consistent with that, our default value of \(\nu \) resulted in 0.197% of the population being vaccinated each day.
Hospital capacity, \(H_\text {max}\) We based our definition of this parameter on hospital beds and did not include other aspects of hospital resources, such as ICU beds, ventilators, or hospital staffing. Specifically, we adopted an estimate of 312,090 beds available for COVID-19 patients in the USA by the Institute for Health Metrics and Evaluation [54].
Maximum effect of control, \(u_\text {max}\) A model incorporating survey data and age-based contact patterns estimated that social distancing could reduce transmission of SARS-CoV-2 by 73% (Jarvis et al. 2020). Based on this, we used 0.7 as an intermediate value of this parameter and 0.5 and 0.9 as lower and upper values.
Model Calibration
To obtain realistic behavior of the model, we calibrated it to match the cumulative number of reported deaths in the USA within the first 100 days of 2020 (i.e., by April 9), which we obtained from the New York Times [39]. We focused our calibration on the parameter \(\iota \), due to the difficulty of empirically estimating the rate at which imported infections appear. To obtain a value of \(\iota \) that resulted in the model matching the reported number of deaths, we simulated the model across 300 values of \(\iota \) evenly spaced between \(10^{-12}\) and \(10^{-4}\) on a log scale, performed a linear interpolation of the simulated number of deaths across those values of \(\iota \), and found the value of \(\iota \) that most closely matched reported deaths.
We calibrated the model under a total of 18 different parameter scenarios, crossing low, intermediate, and high values of \(R_0\) with low, intermediate, and high values of \(u_\text {max}\) and low and high values of \(\omega \). The latter represents the proportion of all deaths caused by COVID-19 that were reported. Because non-pharmaceutical interventions began going into effect in the USA within the timeframe of this calibration period, we calibrated the model subject to an assumed pattern of u(t) through the first 100 days of 2020. We chose a logistic functional form for this, with a minimum of 0 and a maximum of \(u_\text {max}\). Two parameters that control the midpoint and slope of the increase from 0 to \(u_\text {max}\) were selected by an informal process of trial and error, with the goal of having the model’s predictions of deaths over time match the timing of reported deaths under all 18 parameter scenarios. We also used this process to select the date on which importations were initiated through \(\iota \).