Introduction

The ongoing global pandemic of coronavirus disease 2019 (COVID-19) has caused devastating loss of human lives and inflicted severe economic burden in the US (WHO, 2020). By 20 March 2021, more than 510,000 people were killed by COVID-19, the unemployment reaches 11.5% (CBO, 2021), and fiscal shortfall reaches over 200 billion (The Council of State Governments, 2020). It is known that findings from scientific research on non-pharmaceutical interventions (NPIs) and pharmaceutical interventions (e.g., available vaccinations (Le et al., 2020; Bubar et al., 2021; Markoviv et al., 2021) and drugs (Rabby, 2020)) have successfully guided policymakers on implementing public health strategies during a seasonal or pandemic flu. To control the COVID-19 pandemic, countries of the world have deployed a wide range of (NPIs) (Haushofer and Metcalf, 2020; Priesemann et al., 2021), including personal NPIs (e.g., home isolation, hand hygiene, face mask), community NPIs (e.g., school closure, social distancing, limiting for mass gatherings), and environmental hygiene (e.g., surface cleaning) (CDC, 2021). Methodologies, from randomized controlled trials (Haushofer and Metcalf, 2020), econometric methods (Hsiang et al., 2020; Cho, 2020) to mathematical models (Dehning et al., 2020; Chen et al., 2020; Ruktanonchai et al., 2020; Brauner et al., 2021; Zhong et al., 2021; Yang et al., 2021; Wang et al., 2022), have been developed to measure the effects of these NPIs. Table 1 provides a brief overview of some existing methodologies for estimating NPIs’ effectiveness at reducing COVID-19 transmission. But many of them are simulation studies, which often cannot be used without predicting the pandemic when the NPIs are adjusted. The other data-driven studies, which mostly incorporate binary data of whether the NPIs are implemented instead of human behaviors/opinions toward interventions, would fail to capture the impact of human and social development. Moreover, without considering NPIs’ varieties on space, timing, and duration, we cannot understand whether these NPIs have had the desired effect of controlling the epidemic. In this study, we aim to tackle the challenge by modeling the COVID-19 spreading as a feedback control system, where the NPIs and vaccination note the controllers, and then to help policymakers determine the magnitude and timing of interventions’ deployment as circumstances change.

Table 1 Summary of methodologies on estimating NPIs’ effectiveness.

Engineering perspectives are useful in epidemic modeling (Yan et al., 2015), including the principle of control theory that provides a theoretical basis for NPIs’ functioning and transmission management. Control theory, originally developed for engineered systems with applications to power grids, manufacturing, aircraft, satellite, and robots, has been adapted to understand the controllability of complex systems emerging in ecology, biology, and society. The recent work about network control enables us to identify the minimal driver nodes (Liu et al., 2011) or the lowest control costs (Yan et al., 2015, 2012) for node control, edge control (Nepusz and Vicsek, 2012), target control (Gao et al., 2014), multilayer control (Pósfai et al., 2016; Menichetti et al., 2016), temporal control (Li et al., 2017), and data-driven control (Baggio et al., 2021). However, we continue to lack general answers to practically applying control theory to human and natural systems, like the dynamical system for disease spreading. The difficulty is rooted in, for example. to control a vehicle’s speed, we know the pedals and the steering wheel are the drivers prompting a car to move with the desired speed and in the desired direction, but the practical drivers are unknown for complex human and natural systems (Liu and Barabási, 2016; L'´ober, 2016). Specifically, when focusing on the COVID-19 pandemic, we hypothesize that non-pharmaceutical and pharmaceutical interventions are the “drivers” to determine the dynamics of the COVID-19 pandemic in each location through controlling the infection, recovery, and death rates (L'´ober, 2016). We validate this hypothesis by developing a parsimonious model that predicts how the interventions influence the spreads in 381 MSAs in the US and ultimately estimates the control of COVID-19.

Model

Model COVID-19 spreading as a feedback control system

Increasing evidence shows that the spread of COVID-19 follows compartmental models (Hsiang et al., 2020; Lai et al., 2020; López and Rodó, 2020; COVID et al., 2020), such as the SIRD (Susceptible–Infectious–Recovered–Death) model, which is mathematically described by the nonlinear equations expressing a population balance as follows (Keeling and Rohani, 2011):

$$\begin{array}{l}\dot{S}=\mu I(t)+\mu R(t)-\beta (t)S(t)I(t)-V(t),\\ \dot{I}=\beta (t)S(t)I(t)-\gamma (t)I(t)-\delta (t)I(t)-\mu I(t),\\ \dot{R}=\gamma (t)I(t)-\mu R(t),\\ \dot{D}=\delta (t)I(t),\end{array}$$
(1)

where S, I, R, and D are the susceptible, infected, recovered, and death numbers, respectively. S + I + R + D = Ω and Ω is the population at the given place. Here, V is the count of people fully vaccinated with an initial efficacy rate 90% (Thompson et al., 2021) (see that V(t) changes as a function of the cross variant immunity and waning immunity in Eq. (14)). The parameter μ is the crude birth and death rate, and the epidemiological parameters β, γ, and δ are the infection, recovery, and death rates (Θ = [β, γ, δ]T). Studies show that epidemiological parameters are time-dependent, which adapt accordingly to the change in interventions (Dehning et al., 2020; Chen et al., 2020). By defining β0, γ0, and δ0 as the infection, recovery and death rates before interventions (\({{{\Theta }}}_{0}={[{\beta }_{0},{\gamma }_{0},{\delta }_{0}]}^{{\mathrm {T}}}\)), we define Θ(t) = Θ0 + UΘ(t), where \({U}_{{{\Theta }}}(t)={[{U}_{\beta }(t),{U}_{\gamma }(t),{U}_{\delta }(t)]}^{{\mathrm {T}}}\) is a vector of the control input signals. As shown in Fig. 1a, the controllers UΘ(t) work as the edge control (Nepusz and Vicsek, 2012) and the vaccination V works as the node control (Liu et al., 2011). Based on nonlinear feedback control law, we develop the controllers UΘ(t) using the feedback linearization approach. Subject to our designed control actions, the SIRD model could perfectly track the reference trajectories which is generated from reported real-world pandemic data. As illustrated in Fig. 1b, the output model trajectory fits the real-world three-dimensional data when the feedback effect is included, while it fails to fit when it is not considered. Note that our approach is general and can be extended to other models that consider n-dimensional data.

Fig. 1: The SIRD-model based feedback control system reveals how the interventions’ magnitudes and timing govern the dynamics of disease.
figure 1

a Based on nonlinear feedback control law, we develop the controllers \({U}_{{{\Theta }}}={[{U}_{\beta },{U}_{\gamma },{U}_{\delta }]}^{{\mathrm {T}}}\) working as the edge control on infection rate, recovery rate, and death rate. On the other side, the vaccination V works as the node control on the susceptible population. b With these controllers, the output trajectory of disease X = [S, I, R, D]T of the feedback control system fit with the real-world infection data of the disease. c Through the model, which includes the nonlinearity or interactions between NPIs, the human behavior toward NPIs (e.g., stay-at-home order θs, face-mask wearing θf, and testing θg) are linked to the controllers UΘ through their effects (e.g., \({w}_{{{\Theta }}}^{{\mathrm {s}}}\), \({w}_{{{\Theta }}}^{{\mathrm {f}}}\), \({w}_{{{\Theta }}}^{{\mathrm {g}}}\)). Icon credits for c: stay-at-home logo, Tinypolkadoz/Dreamstime.com; face mask, Elenabsl/Dreamstime.com; testing, Zubada/Dreamstime.com; vaccination, www.creazilla.com.

Next, we propose the parsimonious model by employing the difference-in-difference estimation, which maps human behavior toward NPIs into the designed controller, as shown in Fig. 1c. Specifically, the human behavior toward NPIs \({{{{\boldsymbol{\theta }}}}}_{I}={[{\theta }_{{\mathrm {s}}},{\theta }_{{\mathrm {f}}},{\theta }_{{\mathrm {g}}}]}^{{\mathrm {T}}}\), which are designated as stay-at-home order θs, face-mask wearing θf, and testing θg, are measured according to Google Trend and survey data from YouGov (see Dataset for the details). The designed controller, \({U}_{{{\Theta }}}={[{U}_{\beta },{U}_{\gamma },{U}_{\delta }]}^{{\mathrm {T}}}\), are the changes in pre-intervention infection, recovery and death rates. Then, this model could measure the effect of NPIs θI on the controllers UΘ by comparing the changes in infection dynamics (designed controller) before and after the same region’s NPIs deployment. Beyond existing models, which assume the effects on policies are approximately linear (Hsiang et al., 2020), we also identify the interactions between NPI policies. Thus, we are able to compile the compound evolution of human behavior toward the NPIs, to the evolution of the control signals with their respective effects, \({{{{\boldsymbol{W}}}}}_{{{\Theta }}}^{{\mathrm {I}}}={[{w}_{{{\Theta }}}^{{\mathrm {s}}},{w}_{{{\Theta }}}^{{\mathrm {f}}},{w}_{{{\Theta }}}^{{\mathrm {g}}}]}^{{\mathrm {T}}}\).

In short, the feedback control system characterizes how the input of human behavior toward NPIs determines the output of designed controllers (changes in pre-intervention infection, recovery, and death rates) in the first step. And it exhibits how the designed controllers and vaccine rollout determine the final output of disease dynamics in the second step. Thus, the two-steps approach captures how the NPIs and vaccine rollout govern the disease dynamics, through combining the principles of control theory (L'´ober, 2016; Stewart et al., 2020) and statistical estimation (Hsiang et al., 2020; Gertler et al., 2016).

Deriving the feedback controllers

The feedback controllers measure the output of the SIRD model and then manipulate the inputs on infection rate, recovery rate, and death rate as needed to drive the model output toward the desired COVID-19 pandemic trajectory. Then we could rewrite the Eq. (1) as

$$\begin{array}{lll}\frac{{{{\rm{d}}}}I(t)}{{{{\rm{d}}}}t}&=&[{\beta }_{0}+{U}_{\beta }(t)]S(t)I(t)-[{\gamma }_{0}+{U}_{\gamma }(t)]I(t)-[{\delta }_{0}+{U}_{\delta }(t)]I(t)-\mu I(t)\\ \frac{{{{\rm{d}}}}R(t)}{{{{\rm{d}}}}t}&=&[{\gamma }_{0}+{U}_{\gamma }(t)]I(t)-\mu R(t)\\ \frac{{{{\rm{d(t)}}}}D}{{{{\rm{d}}}}t}&=&[{\delta }_{0}+{U}_{\delta }(t)]I(t)\end{array}$$
(2)

with the controller set \({U}_{{{\Theta }}}(t)={[{U}_{\beta },{U}_{\gamma },{U}_{\delta }]}^{{\mathrm {T}}}\). Uβ, Uγ, Uδ are the controllers on infection rate, recovery rate, and death rate. For the reference COVID-19 pandemic trajectory \({X}_{{\mathrm {d}}}={[{S}_{{\mathrm {d}}},{I}_{{\mathrm {d}}},{R}_{{\mathrm {d}}},{D}_{{\mathrm {d}}}]}^{{\mathrm {T}}}\) and its corresponding output trajectory X, the error dynamics XdX is governed by the following equations:

$$\dot{\tilde{I}}(t)=-{k}_{1}\tilde{I}(t),\quad \dot{\tilde{R}}(t)=-{k}_{2}\tilde{R(t)},\quad \dot{\tilde{D}}(t)=-{k}_{3}\tilde{D}(t),$$
(3)

where \(\tilde{I}={I}_{{\mathrm {d}}}-I\), \(\tilde{R}={R}_{{\mathrm {d}}}-R\), and \(\tilde{D}={D}_{{\mathrm {d}}}-D\) are the differences/errors between real-word data and simulated data. Here, k1, k2, and k3 are positive gains of the designed feedback controller. The solution to the linear ordinary differential equations above are expressed as

$$\tilde{I}(t)={e}^{-{k}_{1}t}\tilde{I}(0),\quad \tilde{R(t)}={e}^{-{k}_{2}t}\tilde{R}(0)\quad \tilde{D}(t)={e}^{-{k}_{3}t}\tilde{D}(0),$$
(4)

and the errors \((\tilde{I}(t),\,\tilde{R(t)},\tilde{D}(t))\to (0,\,0,\,0)\). Therefore, the error system is exponentially stable and X(t) → Xd(t). The simulation of the closed-loop system is performed selecting k1 = k2 = k3 = 0.1. The controllers are

$${U}_{\beta }(t)=\left\{\begin{array}{l}\frac{1}{S(t)I(t)}\left({\dot{I}}_{{\mathrm {d}}}(t)+\mu I(t)+[{\gamma }_{0}+{U}_{\gamma }(t)]I(t)+[{\delta }_{0}+{U}_{\delta }(t)]I(t)-{k}_{1}({I}_{{\mathrm {d}}}(t)-I(t))\right)-{\beta }_{0},\,\,{{\mbox{for}}}\,\,{U}_{\beta }(t) > -{\beta }_{0}\quad \\ -{\beta }_{0},\,\,{{\mbox{for}}}\,\,{U}_{\beta }(t)\le -{\beta }_{0}\quad \end{array}\right.$$
(5)
$${U}_{\gamma }(t)=\left\{\begin{array}{l}\frac{1}{I(t)}\left({\dot{R}}_{{\mathrm {d}}}(t)+\mu R(t)-{k}_{2}({R}_{{\mathrm {d}}}(t)-R(t))\right)-{\gamma }_{0},\,\,{{\mbox{for}}}\,\,{U}_{\gamma }(t) >-\!{\gamma }_{0}\quad \\ -{\gamma }_{0},\,\,{{\mbox{for}}}\,\,{U}_{\gamma }(t)\le -\!{\gamma }_{0}\quad \end{array}\right.$$
(6)
$${U}_{\delta }(t)=\left\{\begin{array}{l}\frac{1}{I(t)}\left({\dot{D}}_{{\mathrm {d}}}(t)-{k}_{3}({D}_{{\mathrm {d}}}(t)-D(t))\right)-{\delta }_{0},\,\,{{\mbox{for}}}\,\,{U}_{\delta }(t) > -\!{\delta }_{0}\quad \\ -{\delta }_{0},\,\,{{\mbox{for}}}\,\,{U}_{\delta }(t)\le -\!{\delta }_{0}\quad \end{array}\right.$$
(7)

The control is subject to a positivity constraint. To prevent non-physical transmission rate Uβ(t) + β0, recovery rate Uγ(t) + γ0, and death rate Uδ(t) + δ0, the controllers are designed as Uβ(t) ≥ −β0, Uγ(t) ≥ −γ0, Uδ(t) ≥ −δ0. Here \({{{\Theta }}}_{0}={[{\beta }_{0},{\gamma }_{0},{\delta }_{0}]}^{{\mathrm {T}}}\) is the pre-intervention epidemiological parameters learned by infection of ’New York’ MSA from 1 March 2020 to 13 March 2020 with Nelder-Mead simplex algorithm (Gao and Han, 2012). For the controllers, irrespective to the values of initial condition Θ0, the error system is stable and X(t) → Xd(t).

Using difference-in-difference method to learn the NPIs’ marginal effects to the controllers

For NPI set θI, we use the linear regression model for the difference-in-difference to calculate their effects on the vector of infection rate, recovery rate, and death rate UΘ(t) + Θ0. Consider the model

$${U}_{{{\Theta }}}(t)+{{{\Theta }}}_{0}=\lambda (t)+f({{{{\boldsymbol{\theta }}}}}_{{\mathrm {I}}}(t))+{\epsilon }_{{{\Theta }}}(t)$$
(8)

where λ(t) is the factor for time trend and ϵΘ(t) is the residual term. Then, the difference of outcome controllers from time t−1 to time t is

$$\begin{array}{l}{U}_{{{\Theta }}}(t)-{U}_{{{\Theta }}}(t-1)=[\lambda (t)+f({{{{\boldsymbol{\theta }}}}}_{{\mathrm {I}}}(t))+{\epsilon }_{{{\Theta }}}(t)]\\ \qquad\qquad\qquad\qquad\quad\,\,-\,[\lambda (t-1)+f({{{{\boldsymbol{\theta }}}}}_{{\mathrm {I}}}(t))+{\epsilon }_{{{\Theta }}}(t-1)]\end{array}$$
(9)

Adding up the all difference from time  0 to time t

$$\begin{array}{l}{U}_{{{\Theta }}}(t)-{U}_{{{\Theta }}}(0)=\mathop{\sum}\limits_{t^{\prime} \in [1,t]}\lambda (t^{\prime} )-\mathop{\sum}\limits_{t^{\prime} \in [0,t-1]}\lambda (t^{\prime} )+\mathop{\sum}\limits_{t^{\prime} \in [1,t]}{\epsilon }_{{{\Theta }}}(t^{\prime} )-\mathop{\sum}\limits_{t^{\prime} \in [0,t-1]}{\epsilon }_{{{\Theta }}}(t^{\prime} )\\ \qquad\qquad\qquad\qquad+\,f({{{{\boldsymbol{\theta }}}}}_{{\mathrm {I}}}(t))-f({{{{\boldsymbol{\theta }}}}}_{{\mathrm {I}}}(0))\\ \,\,\,\,\,\qquad\qquad\qquad\approx f({{{{\boldsymbol{\theta }}}}}_{{\mathrm {I}}}(t))-f({{{{\boldsymbol{\theta }}}}}_{{\mathrm {I}}}(0))\end{array}$$
(10)

As when t = 0 no NPIs are implemented, thus, UΘ(t) = 0 and f(θI(0)) = 0,

$${U}_{{{\Theta }}}(t)=f({{{{\boldsymbol{\theta }}}}}_{{{{\mathrm {{I}}}}}}(t))$$
(11)

Vaccination adoption model

Like the innovation adoption model, the daily newly vaccination adoption (vaccination coverage) will follow the bell curve, the normal distribution,

$$h(t,u,\sigma )=\hat{V}\frac{1}{\sigma \sqrt{2\pi }}{e}^{-\frac{1}{2}{\left(\frac{t-u}{\sigma }\right)}^{2}}$$
(12)

where \(\hat{V}\) is the saturation of vaccination coverage. The cumulative vaccination adoption (vaccination coverage) is \(H(t)=Pr(1\le x\le t)=\int\nolimits_{1}^{t}h(t,u,\sigma ){\mathrm {d}}t\) with t ∈ [1, Ts]. It means, the cumulative vaccination coverage will reach saturation of \(\hat{V}\) in Ts days. For testing the case of a well-controlled COVID 19, we tune the u and σ to fit the real-world vaccination coverage in the United States from 12 January 2021 to 12 January 2022 in Fig. 6.

To consider the waning nature of vaccine (Mallapaty et al., 2021), we define \(a(t-t^{\prime} )\) as the decreasing vaccine efficacy since \(t^{\prime}\) with initial efficacy 0.9,

$$a(t-t^{\prime} )=\left\{\begin{array}{l}0.9-b(t-t^{\prime} ),0.9 > =b(t-t^{\prime} )\\ 0,0.9\, < \, b(t-t^{\prime} )\end{array}\right.$$
(13)

where b is the decreasing rate. When \(0.9 \,<\, b(t-t^{\prime} )\), the vaccine totally loses its efficacy and the vaccinated people again become susceptible.

Then, the effectively vaccinated population at t is

$$V(t)=\mathop{\sum}\limits_{t^{\prime} \le t}a(t-t^{\prime} )h(t^{\prime} )-\mathop{\sum}\limits_{t^{\prime} \le t-1}a(t-1-t^{\prime} )h(t^{\prime} )$$
(14)

where \(h(t^{\prime} )\) is the count of new vaccinated population at \(t^{\prime}\).

Results

Tracking the pandemic’s trajectory with designed controllers

To validate its accuracy in predicting the disease evolution in the future and its applicability in determining the needed interventions to control the infection process, we test the NPIs data and infection data at 381 Metropolitan statistical areas (MSAs) from 1 April 2020 to 20 February 2021. The MSAs, defined as the core areas integrating social and economic adjacent counties, are contiguous areas of relatively high population and traffic density. We consider each MSA as a “closed population” the disease evolution could be modeled by the nonlinear SIRD dynamical model, Eq. (1), in a compact form, as

$$\dot{X}=F(X,{U}_{{{\Theta }}}+{{{\Theta }}}_{0}),$$
(15)

with X = [S, I, R, D]T is the state/output vector and \({U}_{{{\Theta }}}={[{U}_{\beta },{U}_{\gamma },{U}_{\delta }]}^{{\mathrm {T}}}\) is the input vector. Using feedback linearization control design, we construct a nonlinear feedback control law \({U}_{{{\Theta }}}=\phi (X,\dot{X},{X}_{{\mathrm {d}}},{\dot{X}}_{{\mathrm {d}}})\), to track the real-world pandemic trajectories \({X}_{{\mathrm {d}}}={[{S}_{{\mathrm {d}}},{I}_{{\mathrm {d}}},{R}_{{\mathrm {d}}},{D}_{{\mathrm {d}}}]}^{{\mathrm {T}}}\) (see Eqs. (2)–(7)) in Model). The feedback law ϕ(.) relies on the measurements of the model full state X and the reference trajectory (Xd) and their time derivative (\(\dot{X}\) and \({\dot{X}}_{{\mathrm {d}}}\)).

Theoretically, for each MSA, the output trajectory X perfectly fits the reported infection trajectory Xd when the control action UΘ is applied. This fact is illustrated in Fig. 2, which shows the evolution of the predicted and reported trajectories from 1 April 2020 to 20 February 2021 for two MSAs, namely the “New York” MSA (with New York as the core) and ’Houston’ MSA (with Houston as the core). Moreover, the excellent alignments between the real-world and model data of all 381 MSAs indicate our approach’s predictive power, as shown in Fig. 2b. On the other side, Fig. 2c shows all MSAs’ infection rate [β0 + Uβ(t)], recovery rate [γ0 + Uγ(t)], and death rate [δ + Uδ(t)], respectively, and mark out the respective rates for the two examples of MSA. Overall speaking, the infection rate and recovery rate decrease till May, rebound in June, decrease again to the lowest in September and start to fluctuate till February. The death rate continues to decrease in October and stays relatively constant beyond. As validated in literature (Pei et al., 2020), “New York” MSA enforces interventions more effectively and earlier, leading to a relatively lower infection rate, recovery rate, and death rate than most MSAs till October. One can observe that “Houston” MSA follows medium patterns.

Fig. 2: The designed feedback controllers UΘ drive the output trajectories (\(\dot{X}\)) in the SIRD model ingeniously fits the reported trajectories (\({\dot{X}}_{{\mathrm {d}}}\)) for 381 MSAs.
figure 2

a The reported data (dots) of infected/recovered/dead cases are fitted with the output trajectory (solid line) for example MSAs, i.e., “New York” MSA and “Houston” MSA. b Comparison between the reported data and output data at all MSAs. Each MSA represent a dot. c All MSAs' temporal infection rate, [β0 + Uβ(t)], recovery rate, [γ0 + Uγ(t)], and death rate, [δ0 + Uδ(t)] with marking out the examples MSAs' rates. d shows the MSA’s temporal effective reproductive ratio (Re) and e shows each MSA’s average effective reproductive ratio from 14 February 2020 to 20 February 2021 in the cartogram map, in which the geometry of regions is distorted according to their population. The effective reproductive ratio reaches the lowest as of 20 February 2021.

Having the daily transmission rate, recovery rate, and death rate, we compute the effective reproductive ratio \({R}_{e}(t)=\frac{[{\beta }_{0}+{U}_{\beta }(t)]S(t)}{[{\gamma }_{0}+{U}_{\gamma }(t)]+[{\delta }_{0}+{U}_{\delta }(t)]+\mu }\), representing the expected number of secondary infected cases at time t when no vaccinations are rolled out. The function Re(t) is a critical threshold in understanding whether the outbreak is under control. More precisely, if Re(t) < 1, then the ongoing outbreak will eventually fade out, whereas Re(t) > 1 means an acceleration of the infection dynamics leading to a substantial growth of infected cases and deaths. Fig. 2d, e, respectively visualize the temporal and spatial distribution of the MSAs’ effective reproductive ratio. Our results reveal that just a few MSAs’ effective reproductive ratios ever reach Re(t) < 1, implying the need to implement more rigorous interventions to achieve an effective control of the pandemic. For example, from 14 February 2021 to 20 February 2021, the average effective reproductive ratios in “New York” and “Houston” MSAs are evaluated as 1.468 and 1.393, respectively, revealing a critical need for stronger interventions.

Mapping human behaviors as actuators to steer NPIs as controllers

Although our controller, UΘ, precisely predicts the infectious, death toll, and recovery, it is thus far unknown how the measurable interventions change the controllers. Compared with the driving car, it is similar that we know the desired speed and direction in order to move the car to the target location, but we do not know what the pedals and the steering wheel for epidemic control are and how the changes in the pedals and the steering wheel determine the speed and direction. In another word, as depicted in Fig. 1, we assume that the changes of interventions, directly steering the control signals, will shape the disease dynamics when they are tightened and loosened to different levels. For multiple NPIs θI, we divide them into two sets, i.e., the set of community NPIs ϑc (e.g., social distancing and quarantine) and the set personal NPIs ϑp (e.g., face covering, test, and frequent hand wash). Then we develop the following mathematical model \({\hat{U}}_{{\Theta }}(t)={[{U}_{\beta },{U}_{\gamma },{U}_{\delta }]}^{\mathrm {T}}=f({{\boldsymbol{\theta }}}_{{\boldsymbol{I}}}(t),{{\boldsymbol{W}}}_{\Theta }^{I})\) as (see [Eqs. (8)–(11)] in Model)

$${\hat{U}}_{{{\Theta }}}(t)=\mathop{\prod}\limits_{i\in \{c,p\}}(1-\mathop{\sum}\limits_{j\in {\vartheta }^{i}}{w}_{{{\Theta }}}^{j}{\theta }_{j}(t))-1$$
(16)

where \({\hat{U}}_{{{\Theta }}}(t)\) is the estimate of the control action based on NPIs with their magnitudes θj(t) and their impact value \({w}_{{{\Theta }}}^{j}\). θI is the vector representing different NPIs \({{{{\boldsymbol{\theta }}}}}_{{{{\mathrm{I}}}}}={[{\theta }_{1},{\theta }_{2},...,{\theta }_{i},...,{\theta }_{n}]}^{{\mathrm {T}}}\). WΘ is a vector representing the effects of NPIs at θI and \({{{{\boldsymbol{W}}}}}_{{{\Theta }}}^{{\mathrm {I}}}=[{w}_{{{\Theta }}}^{1},{w}_{{{\Theta }}}^{2},...,{w}_{{{\Theta }}}^{i},...,{w}_{{{\Theta }}}^{n}]\) where \({w}_{{{\Theta }}}^{i}={[{w}_{\beta }^{i},{w}_{\gamma }^{i},{w}_{\delta }^{i}]}^{{\mathrm {T}}}\). For each specific NPI j, large \({w}_{{{\Theta }}}^{j}\) value demonstrates that NPI j has strong impact. If \({w}_{{{\Theta }}}^{j}={[0,0,0]}^{{\mathrm {T}}}\), the control \({\hat{U}}_{{{\Theta }}}(t)\) is independent with the NPI j. The term ∏i∈{c, p} indicates that community NPIs ϑc and personal NPIs \(\vartheta^{\rm{p}}\) have a joint effect on the controller. For either community NPIs ϑc or personal NPIs \(\vartheta^{\rm{p}}\), \(1-{\sum }_{{\theta }_{j}\in {\vartheta }^{i}}{w}_{{{\Theta }}}^{j}{\theta }_{j}(t)\) term denotes the combined impact of the NPI set. Usually, \({\hat{U}}_{{{\Theta }}}\) is a non-positive value, showing the reductions in each controller.

Given the collected data sets for eight NPIs (see Table 2), we evaluate the goodness of fit versus different combinations of NPIs to find an effective parsimonious model for Eq. (16). A parsimonious model, which has great explanatory predictive power, accomplishes a better prediction of controllers with as few NPIs as possible. With 70% of the dataset of NPIs as the training data, we use the mean absolute error to test the predictive accuracy for the rest 30% of the dataset. Figure 3 shows that including three NPIs in the model gains the highest predictive accuracy for the designed controllers (see Supplementary text for the details). We find that the most representative NPIs for Eq. (16) are: (1) stay-at-home order, represented by the normalized ratio of excessive time of staying at home, θs; (2) face-mask wearing, represented by the fraction of people wearing face masks, θf; (3) testing, represented by the normalized fraction of tested population, θg. Commonly, stay-at-home order and face-mask wearing have a positive impact on decreasing the number of reported cases. Differently, testing θg(t) may positively or negatively impact the reported infected cases. The reason is that more testing could allow identifying more cases when the number of testing is not sufficient, and yet, more testing may also lead to fewer infected cases (Chiu et al., 2020). Representing the three NPIs as \({{{{\boldsymbol{\theta }}}}}_{{\mathrm {I}}}={[{\theta }_{{\mathrm {s}}},{\theta }_{{\mathrm {f}}},{\theta }_{{\mathrm {g}}}]}^{{\mathrm {T}}}\), then,

$${\hat{U}}_{{{\Theta }}}(t)=\left(1-{w}_{{{\Theta }}}^{{\mathrm {s}}}{\theta }_{{\mathrm {s}}}(t)\right)\left(1-{w}_{{{\Theta }}}^{{\mathrm {f}}}{\theta }_{{\mathrm {f}}}(t)-{w}_{{{\Theta }}}^{{\mathrm {g}}}{\theta }_{{\mathrm {g}}}(t)\right)-1$$
(17)

In the following studies, we only use these three selected NPIs for predictions.

Table 2 Non-pharmaceutical interventions considered.
Fig. 3: NPIs selection for predicting the designed controllers UΘ.
figure 3

Given eight NPIs (a), i.e., stay-at-home order, school closure, quarantine, working from home, face-mask wearing, testing, frequent hand wash, and avoid crowding, we test the accuracy of the predictive model with different combinations of NPIs for Uβ (b), Uγ (c), and Uδ (d). The model achieves high parsimony (with fewer NPIs) and a high level of goodness of fit (with the lowest mean absolute error) using three NPIs, which are stay-at-home order, face-mask wearing, and testing.

Effects of NPIs on shaping the disease dynamics

Based of the parsimonious model of Eq. (17), we learn the parameter-by-intervention specific marginal effects \({w}_{{{\Theta }}}^{{\mathrm {s}}}\), \({w}_{{{\Theta }}}^{{\mathrm {f}}}\), and \({w}_{{{\Theta }}}^{{\mathrm {t}}}\), reflecting the variations of \({\hat{U}}_{{{\Theta }}}\) as a function of the evolution of the time-dependent local NPIs θs(t), θf(t), and θt(t) at MSAs. With the objective to directly infer the \({\hat{U}}_{{{\Theta }}}\) with the NPIs data, we first use 70% dataset of NPIs as the training data to learn the marginal effects. Then, we estimate the counterparts \({\hat{U}}_{{{\Theta }}}\) given as Eq. (17) using the 30% left testing datasets and evaluate the fit between UΘ and \({\hat{U}}_{{{\Theta }}}\). Taking the “New York” MSA and “Houston” MSA shown in Fig. 4a, b as illustrative examples; given the evolution of NPIs, the estimated control signals \({\hat{U}}_{\beta }\) and \({\hat{U}}_{\delta }\) with the learned marginal effects, fit the predicted model-based control law defined in terms of Uβ and Uδ. Based on the proportionality between Uγ(t) and Uβ(t) (see Fig. S1), i.e., Uβ(t)/Uγ(t) = 2.664, here, we only consider Uβ(t) and Uδ(t).

Fig. 4: The learned effects enable us to use the magnitude of NPIs to predict infection at each MSA.
figure 4

Using the 70% NPIs and infection data, we learned the NPIs' effects in the model of Eq. (17). Based on the learned effects, a, b illustrates how the magnitude of NPIs determine the controllers (\({\hat{U}}_{\beta }(t)\) and \({\hat{U}}_{\delta }(t)\)) in the example MSAs. To test the accuracy of model, we compare [\({\hat{U}}_{\beta }(t)\) and \({\hat{U}}_{\delta }(t)\)] with analytical controllers [Uβ(t) and Uδ(t)] with the left 30% NPIs and infection data (vertical shaded area) in the right-side plots of a, b. c All MSAs' magnitudes of NPIs. The solid line represents the average, and the shaded area represents the standard variance for all MSAs. Though the large difference between MSAs' magnitudes of interventions, in d, the marginal effects for stay-at-home order and face-mask wearing are mainly negative. Testing, which may help to find more infection and death, have either negative or positive effect. Thus, by taking the estimated controllers \({\hat{U}}_{{{\Theta }}}=[{\hat{U}}_{\beta },{\hat{U}}_{\gamma },{\hat{U}}_{\delta }]\) as the input of the SIRD model, we find the estimated infection/death assemble with reported infection/death with R2 > 0.9, see the two example dates in e and f. Figure S2 shows the estimation accuracy for all others dates. It should be noted that as the \({\hat{U}}_{\gamma }\) keeps constantly proportional to \({\hat{U}}_{\beta }\) (see Fig. S1), there is no need for further exploring \({\hat{U}}_{\gamma }\).

It is remarkable that for all MSAs, Eq. (17) stays robust to the huge heterogeneity of locality. As depicted in Fig. 4c, the NPIs have a great high standard deviation (σ) across MSAs, especially for stay-at-home order and face-mask wearing. One can notice that the average magnitude of stay-at-home order decreases from 80% to 40% recently with σ = 0.140. Besides, nearly 53.56% of people are wearing face masks when being outdoors with σ = 0.20 while the average magnitude of testing increases to 9.19% with σ = 0.032. To test the robustness of the model of Eq. (17), we trained and validated the model iteratively on different MSAs’ datasets for NPIs. The distributions of marginal effects across the three NPIs are shown in Fig. 4d. Most MSAs’ marginal effects for stay-at-home order and face-mask wearing are negative. The statistics imply the generic feature of Eq. (17) in capturing NPIs’ effects despite the huge regional heterogeneity of human behaviors. However, the average marginal effect of testing has two opposite outcomes. For \({\hat{U}}_{\beta }(t)\), 73.5% of MSAs’ testing’ marginal effects are positive, meaning 26.5% of them are negative. For \({\hat{U}}_{\delta }(t)\), 46.0% of MSAs’ testing’ marginal effects are positive, meaning 54.0% of them are negative. The positive effects suggest that testing is favorable to finding more infections and deaths, while the negative effects show that testing reduces infection or death.

Applying the estimated control signals \({\hat{U}}_{\beta }(t)\), \({\hat{U}}_{\delta }(t)\), and \({\hat{U}}_{\gamma }(t)={\hat{U}}_{\beta }(t)/2.264\) to the SIRD model, we find that the new output trajectories \(\hat{X}\) fit the reported infection Xd with R2 ≥ 0.9, as shown in Fig. S2. As well, Fig. 4e, f depict the predicted infected/dead cases with reported infected/dead cases on 20 October 2020 and 20 February 2021. All the results further validate the model in assessing NPIs’ effects on disease dynamics.

Needed NPIs and vaccinations for achieving R e to a manageable level

The two-steps approach, integrating the designed controllers in Eqs. (1)–(15)) and the effect of NPIs on controllers in Eqs. (17), has successfully mapped human behaviors to NPIs to infection rate, recovery rate, and death rate of the SIRD model. Then, the effective reproductive number Re can be equivalently translated in terms of magnitudes of NPIs (θI) and V (the ratio of people full vaccinated with an efficacy rate 90% (Thompson et al., 2021)), that is,

$${R}_{{\mathrm {e}}}(t)=\frac{{\beta }_{0}+f({{{{\boldsymbol{\theta }}}}}_{{\mathrm {I}}},{{{{\boldsymbol{W}}}}}_{\beta }^{{\mathrm {I}}})}{{\gamma }_{0}+{\delta }_{0}+\mu +f({{{{\boldsymbol{\theta }}}}}_{{\mathrm {I}}},{{{{\boldsymbol{W}}}}}_{\gamma }^{{\mathrm {I}}})+f({{{{\boldsymbol{\theta }}}}}_{{\mathrm {I}}},{{{{\boldsymbol{W}}}}}_{\delta }^{{\mathrm {I}}})}S(t).$$
(18)

From the equation above, we can determine the needed magnitude of NPIs for achieving Re(t) < 1, the criterion for the disease die out, under a given vaccination coverage. Here, the vaccination coverage is viewed as an open-loop control action or a given parameter whose assigned value affects the speed of propagation of the disease.

When there is no vaccination administered in the US (like before 12 January 2021 V = 0), the needed magnitude of NPIs to achieve an effective reproductive number is below the threshold, Re(t) < 1 (Dowdle, 1998), for ’New York’ and ’Houston’ MSAs are shown in Fig. 5a, b. The “triangular prism” in three dimensions represents the needed magnitude of the stay-at-home order, face-mask wearing, and testing. The horizontal slices of the “triangular prism” are the visualization for the needed magnitude of stay-at-home order and face-mask wearing if the magnitude of testing is fixed. If stay-at-home order and face-mask wearing interventions are enforced at a level greater than about 80%, Re(t) < 1, which forms a “triangle” as shown in Fig. 5a, b. As opposed to stay-at-home order and face-mask wearing, the testing intervention have a positive marginal effect on ’New York’ MSA and ’Houston’ MSA. Hence, in both cases, the “triangle” becomes smaller for larger testing. Some MSAs (’Log Angles’ MSA and ’Miami’ MSA in Fig. S4) have a negative marginal effect for testing intervention, and their “triangle” becomes bigger for larger testing. According to their testing interventions ’ marginal effects, the shapes of “triangular prism” for more MSAs are shown in Fig. S4.

Fig. 5: Needed NPIs to keep effective reproductive ratio Re ≤ 1 with different vaccine coverage.
figure 5

Take the “New York” MSA and “Houston” MSA as examples; a, b show the needed magnitude for NPIs in order to keep Re ≤ 1 with zero vaccine coverage. The horizontal slices of the “triangular prism” are the visualization for the needed magnitude of stay-at-home order and face-mask wearing if the magnitude of testing is fixed. c When people are fully vaccinated at different levels (vaccine coverage), how much extents of stay-at-home order and face-mask wearing could be relaxed while keeping Re = 1. The magnitude of testing is fixed, the same as each MSA’s recent testing capacity. These results demonstrate relaxing interventions need thorough and careful consideration when the ratios of full vaccinated people <0.6.

Since 12 January 2021, the US began the first jab of vaccine, and undoubtedly, Re(t) could be smaller with mass vaccinations according to the Eq. (18). For different ratios of fully vaccinated people with 90% efficiency, we could find at least how much NPIs could be relaxed (reduced) to achieve Re(t) = 1 in Fig. 5c. Setting aside testing intervention, more magnitude of stay-at-home order and face-mask wearing interventions could be eased if more people are fully vaccinated. However, when 60% of people are fully vaccinated, only 55.3% of stay-at-home order and 64.8% face-mask wearing could be eased. When 70% of people are fully vaccinated, the stay-at-home order and face-mask wearing could be eased entirely. However, to achieve Re(t) < 1, the policymakers should relax NPIs with more caution.

When the variants are considered in the model

The world has endured multiple variants of the SARS-CoV-2, and there is a possibility that COVID-19 will become endemic. Although it means large-scale NPIs enforced by governments are likely to lessen, it is worth noting that it also means the world may bear long-term suffering. A COVID endemic could claim millions more lives each year and cause devastating economic burdens on immunization, treatment, and prevention (Lee et al., 2020; Heywood and Macintyre, 2020). Additionally, it is possible that repeated tightening and loosening interventions are needed in the future due to recurrent outbreaks (Stewart et al., 2020; Metcalf et al., 2020; Thompson et al., 2020). Therefore, it is valuable to explore the “zero COVID,” which was achieved temporarily in New Zealand (Baker et al., 2020), Vietnam, Brunei, and Island states in the Carbbean (Lee et al., 2020). We utilize the vaccination data (CDC, 2021) and investigate the possibility of theoretically reaching zero infections, i.e., no new cases at least for three months in a given MSA (Diekmann et al., 2012) when within/without variants. Furthermore, the day when the following 3 months having zero new cases is defined as the curbing day.

We assume that the ratio of fully vaccinated people (also called vaccination coverage), follows the innovation adoption model according to Rogers (Oldenburg and Glanz, 2008), which is a normal distribution, see Eq. (12)). Then, the total number of immunized people is defined as V(t) in Eq. (14) with the waning vaccine effectiveness. As shown in Fig. 6a, the reported data reveal that the ratio of fully vaccinated people from 12 January 2021 to 18 February 2022 in the US increases to 64%. We obtained the vaccination coverage as a curve that gradually reaches saturation within example periods of 270 and 360 days (Ts) by fitting the real-world ratio. Considering a full vaccination of 60% of people after 360 days since 12 January 2021, the average curbing day for all MSAs corresponds to the hundred seventieth day. Under this configuration, from in Fig. 6b, one can notice that ’New York’ MSA reaches the curbing stage after 197 days while ’Houston’ MSA achieves a total curbing stage after 187 days. These results reveal that broader vaccination coverage further reduces both the time until the curbing day and the death toll. However, when we consider variants of concern: Alpha, Beta, Gamma, Delta, and Omicron in the model, we find that curbing day is hard to achieve in three years as new cases will re-emerge even after a temporary period of “zero new cases.” We simply assume that infection rates increase and death rates decrease according to the emergence of variants (see Table S2). Though highly contagious variants would potentially exacerbate the infections, their less-deadly characteristics would help reduce the deaths. (see Fig. 6c, d).

Fig. 6: Disease severity with/without variants of concern.
figure 6

By (1) defining the day reaching zero new infection for 90 days as the curbing day, (2) assuming that interventions (i.e., stay-at-home order, face-mask wearing, and testing) keep the same as that from 5 January 2021 to 11 February 2021, (3) consider the ratio of people fully vaccinated as the vaccination coverage, we test the curbing days for MSAs since 12 January 2021. a The vaccination adoption model H(t) illustrates how the saturation of people fully vaccinated is reached in period Ts. Specifically, the orange line is the reported cumulative vaccination coverage. b MSAs' estimated curbing days if 60% are full vaccinated with 360 days. c, d Box plots for MSAS' additional death in the following 1000 days for different saturation of vaccination coverage with/without the presence of variants.

Discussion

We present the COVID-19 pandemic as the feedback control system to show how the evolution of disease adapts to human behaviors to NPIs and the vaccine coverage. This approach enables us to model the linear and nonlinear effects of multiple NPIs on the SIRD model-based feedback controllers on the epidemiological parameters. By reducing eight NPIs to three representative ones (i.e., stay-at-home order, face-mask wearing, and testing), we obtain the model having great explanatory predictive power toward disease dynamics. By studying the NPIs at 381 MSAs from 1 April 2020 to 20 February 2021, we find that the two-steps approach is robust and efficient to predict daily infection and death accurately. Beyond in line with existing studies that NPIs are effective in suppressing the disease outbreaks (Lai et al., 2020; Flaxman et al., 2020; Group et al., 2021), we could directly link the NPIs’ marginal effects to the effective reproductive number Re. Without the need for up-to-date knowledge of current infections and ’nowcasting’. Besides accurate forecast on both case counts and deaths, this approach could provide practical information for policymakers regarding the extent of safely relaxing NPIs and the needed vaccination coverage to manage COVID-19 locally. The approach is also universal and thus can be used by policymakers elsewhere.

By analyzing the needed NPIs for keeping Re < 1, we find that MSAs should continue to enforce their stay-at-home order and face-mask wearing. Loosening the degree below 80% could lead to a resurgence of COVID-19. Our results show that MSAs should continue to require wearing masks and unless 60% of people are fully vaccinated, the MSA should not relax the stay-at-home order and face-mask wearing intervention.

It is currently being debated whether COVID-19 ("Zero COVID”) can be eliminated and some view it as the only means of preventing future crises and insist that mass vaccination is necessary (Lee et al., 2020; Heywood and Macintyre, 2020). However, the goal is undermined by the new variants. The continuous emergence of new, highly contagious, and less deadly variants gives rise to a long process with fluctuations in new infections. The best outcome would be that future mutation of the COVID-19 virus keeps less lethal and becomes endemic. Thus, to persistently fight against COVID-19, our experiments reinforce the argument that maintaining NPIs and encouraging people to get vaccines are necessary and key strategies to lower the new deaths as much as possible.

Our study has some limitations. The first limitation is rooted in the dataset for COVID-19 and the dataset for NPIs. Given the mass mild or asymptomatic infections, the inaccuracy of reported infections and deaths would increase the uncertainty of the SIRD model-based feedback controllers. The second limitation is assuming each MSA as the closed population, which ignores the fluctuation of infections caused by case importation and exportation. This simplification would influence the results from needed NPIs and PIs for suppressing COVID-19 to the elimination test. The third limitation is assuming vaccination adoption follows the curve of diffusion of innovations without considering people’s attitudes to vaccinations. As the attitudes towards vaccination vary by age, race, ethnicity, and education, it is hard to capture the full complexity. The fourth limitation is that our model outcomes are determined by the people’s responses to NPIs, which are time-dependent. When it’s being used for prediction, it would be more suitable for predicting short-term rather than long-term dynamics especially when the NPIs deployment changes. Nevertheless, our study provides practical insights into tightening or relaxing NPIs for the aim of living with COVID-19.