Bayesian-based predictions of COVID-19 evolution in Texas using multispecies mixture-theoretic continuum models

Jha, Prashant K.; Cao, Lianghao; Oden, J. Tinsley

doi:10.1007/s00466-020-01889-z

Bayesian-based predictions of COVID-19 evolution in Texas using multispecies mixture-theoretic continuum models

Original Paper
Published: 31 July 2020

Volume 66, pages 1055–1068, (2020)
Cite this article

Download PDF

Computational Mechanics Aims and scope Submit manuscript

Bayesian-based predictions of COVID-19 evolution in Texas using multispecies mixture-theoretic continuum models

Download PDF

2904 Accesses
30 Citations
Explore all metrics

Abstract

We consider a mixture-theoretic continuum model of the spread of COVID-19 in Texas. The model consists of multiple coupled partial differential reaction–diffusion equations governing the evolution of susceptible, exposed, infectious, recovered, and deceased fractions of the total population in a given region. We consider the problem of model calibration, validation, and prediction following a Bayesian learning approach implemented in OPAL (the Occam Plausibility Algorithm). Our goal is to incorporate COVID-19 data to calibrate the model in real-time and make meaningful predictions and specify the confidence level in the prediction by quantifying the uncertainty in key quantities of interests. Our results show smaller mortality rates in Texas than what is reported in the literature. We predict 7003 deceased cases by September 1, 2020 in Texas with $95\%$ CI 6802–7204. The model is validated for the total deceased cases, however, is found to be invalid for the total infected cases. We discuss possible improvements of the model.

Understanding COVID-19 Epidemics: A Multi-Scale Modeling Approach

Spatio-temporal predictive modeling framework for infectious disease spread

Article Open access 24 March 2021

Hybrid Modeling of COVID-19 Spatial Propagation over an Island Country

1 Introduction

Modeling the spreading of infectious diseases can help in extracting relevant information from the data, such as effective reproduction rates, mortality rates, contact rates, in exploring the effectiveness of various preventive measures and their effect on the epidemic, developing a deeper understanding of how the particular disease spreads and major features that support the spreading, [15]. During 2020, much of the world has been under lockdown due to a novel SARS-CoV-2 virus. The virus originated in Wuhan, China, and has resulted in the loss of many lives, loss of livelihood, and almost complete shutdown of economies. SARS-CoV-2 virus infection spreads via droplets generated during coughing/sneezing and touching contaminated surfaces. To slow down its spread, people are advised to maintain social distancing and in certain places even strict restriction on public movement is enforced. The virus seem to infect all groups, however, has been more deadly for older people and people with compromised immunity [8, 9, 18, 28]. While the search for vaccines and drugs are going on, the researchers across the globe have put in an effort to develop a model that captures the evolution of the epidemic and reveal the important parameters which help policy makers and medical professionals in devising preventive strategies.

Existing works on COVID-19 epidemic prediction include stochastic transmission models (compartmental models) based on the idea of putting individuals in different categories, such as susceptible (S), exposed (E), infected (I), recovered/removed (R), and deceased (D). Each category evolves in time often captured by systems of ODEs and are coupled to other species through the cross-interaction terms. These models average the individual interaction and therefore are more effective for large-scale epidemics. Such models have been applied to extract the relevant information such as the effective reproduction rate (number of secondary infections at given time) [2, 17], account for asymptotic cases on disease spread [26], and to test the effects of preventive measures such as social distancing and isolation [14]. The parameters in the cross-interaction terms can be either assumed to be constant in time or time varying to include the effect of various preventive policies such as social-distancing, strict isolation directives, in the model [10, 17]. Compartmental models can be generalized by allowing fields corresponding to each category to vary in space. The random (Brownian) motion of individuals is approximated by diffusion term whereas the interaction between categories is modeled by reaction terms, [16]. This results in a set of nonlinearly coupled reaction–diffusion PDEs (partial differential equations). Recently a model of this type has been applied to predict the spread of disease in Lombardy, Italy [30]. That work is based on the earlier model of the spread of rabies [16].

In this work, we consider a multi-species model of the evolution of COVID-19 which is characterized by a system of PDEs. The species comprise of the fractions of the population in the following categories: exposed but no symptoms, already infected, recovered after an infection, and those who succumbed to the virus. These fractions are treated as a continuous fields over a bounded domain in $\mathbb {R}^2$ (the state of Texas). The model considered in this work is inspired from the recent work [30] and earlier work [16].

The goal of this work is to apply a Bayesian learning in OPAL to model calibration, validation and prediction utilizing the spatially and temporally resolved data and to address uncertainties. We adopt the OPAL (Occam Plausibility Algorithm [11, 23, 24]) that is developed to deliver a systematic path toward valid prediction in the presence of uncertainties. Depending on the spatial resolution of interest (such as county, state, country), the data are available in real time listing the number of COVID-19 cases, number of recovered patients, number of patients deceased. An objective of this work is to leverage the data for model calibration and validation. Since the data are evolving, it is also necessary to evolve the model parameters (i.e. continuously update the model parameters over specified time intervals).

Bayesian learning consists of three key steps: calibration, validation, and prediction. In the calibration step, the model parameters are sampled through some assumed probability density function and, with an appropriate likelihood function, the probability densities (posteriors) of parameters are determined. This is the step where the data is sampled to obtain the pdf (probability distribution function) of model parameters. In the validation step, the model output is compared with the data and if the error is within a preset tolerance, the model is specified as Not Invalid. It can be noted that the calibration and validation steps are similar to the training and testing step in a deep learning framework; however, there is one important difference: in this Bayesian method, parameters are updated in the validation step as well. In all of the Bayesian steps, the likelihood function plays the key role. It assigns a penalty when the model output does not agree with the data. This is similar to the loss function in deep learning.

COVID-19 cases in Texas were initially slowly growing (period March 15–May 31). Near the end of May, the cases started rising rapidly. The jump in cases can be attributed to 1) opening up of businesses and less restriction in movement of people post May 20, and 2) the increase in COVID-19 testing resulting in large numbers of infected cases which may have gone unnoticed for mild cases otherwise. For our analysis, we consider the period June 1–June 30 as in this period the government policy remained somewhat the same making the assumption of temporally constant parameters less error-prone. We consider June 1–June 20 data for the calibration and June 21–June 30 for the validation of the model. Based on calibrated and validated model, we predict the total number of infected and deceased cases in 25 districts of Texas in the period July 1–September 1. It is shown that care is required in setting up a good model and scenario of prediction. Sensitivity analysis plays an important role in the model inference. For example, we show that when some of the model parameters were fixed to the values reported in previous studies, specifically the fatality rate (change from infected to deceased) and recovery rates (from infected to recovered), the total deceased cases will be insensitive to any change in parameters. The total deceased cases was mainly sensitive to the parameters described above and, by fixing them, we lose the ability to train our model. This also explains why in [30] authors saw large numbers of infected cases when they were trying to match the total deceased cases with data. Since the most sensitive parameters affecting total deceased cases was fixed in their study, they over estimate the infected cases to match the total deceased cases. Another important point emerging from our study is the sensitivity of the model output on the initial condition.

Results of the calibrated-validated model show a decrease in mortality rates for Texas as compared to mortality rates reported in literature [30]. Validation results show that the model parameters affecting total infected cases gets updated significantly, and, therefore, the total infected cases QoI computed from the validation posterior and calibration posterior show noticeable differences. This, we think, is natural as the number of infected cases is changing rapidly and the model is inadequate to accommodate this. Whereas, the total deceased cases QoI computed from the validation posterior and calibration posterior are close and we see small update of the parameters affecting the total deceased cases in the validation step. This suggests that the deceased cases in Texas has stabilized. We place higher confidence in our prediction of deceased cases. On June 30, 2020, there was total 2525 fatalities and total 175,977 infected cases all over Texas. We predict 7003 deceased cases and 301,658 infected cases by September 1, 2020 with uncertainty (standard deviation) 102 and 5786 respectively. The $95\%$ CI are 6802–7204 for deceased cases and 290,251–313,064 for infected cases.

The remainder of this paper is organized as follows: in Sect. 2, we describe the forward model. In Sect. 3, Bayesian approach in OPAL and associated components are presented. The discretization of the forward model, source and processing of the data and map are described in Sect. 4. Therein, we also present the sensitivity analysis results. In Sect. 5, we apply the Bayesian inference and show calibration-validation-prediction results. We discuss the prediction results and further improvements of the model in Sect. 6. We make the codes available publicly in website https://github.com/prashjha/BayesForSEIRD.

2 Reaction–diffusion model

Let $\Omega \subset \mathbb {R}^2$ be a simply-connected geographical region projected on a 2D plane. Let [0, T] be the time domain of interest. At any point $\varvec{x}\in \Omega $ and time $t\in [0,T]$, we introduce following real-valued fields:

$\phi _s = \phi _s(\varvec{x},t)$—susceptible population density (those not yet infected by COVID-19)
$\phi _e = \phi _e(\varvec{x},t)$—exposed population density (which are exposed but do not yet show symptoms)
$\phi _i = \phi _i(\varvec{x},t)$—infected population density
$\phi _r = \phi _r(\varvec{x},t)$—recovered population density
$\phi _d = \phi _d(\varvec{x},t)$—deceased population density
$\phi _n = \phi _n(\varvec{x},t)$—total population density

We have $\phi _n = \phi _s + \phi _e + \phi _i + \phi _r + \phi _d$. By population density, of course, we mean the number of individuals per unit area in $\Omega $. We assume $\phi _n(\varvec{x},t) >0$ for all $(\varvec{x},t) \in \Omega \times [0,T]$.

Assuming sufficient smoothness and differentiability, the density fields are governed by the following nonlinear coupled system of PDEs:

$$\begin{aligned} \partial _t \phi _s&= \alpha \phi _n - \left( 1 - \frac{A}{\phi _n} \right) \beta _i \phi _s \phi _i - \left( 1 - \frac{A}{\phi _n} \right) \beta _e \phi _s \phi _e\nonumber \\&\quad - \mu \phi _s \phi _n + \nabla \cdot (\phi _n \nu _s \nabla \phi _s), \nonumber \\ \partial _t \phi _e&= \left( 1 - \frac{A}{\phi _n} \right) \beta _i \phi _s \phi _i + \left( 1 - \frac{A}{\phi _n} \right) \beta _e \phi _s \phi _e - \sigma \phi _e\nonumber \\&\quad - \gamma _e \phi _e - \mu \phi _e \phi _n + \nabla \cdot (\phi _n \nu _e \nabla \phi _e), \nonumber \\ \partial _t \phi _i&= \sigma \phi _e - \gamma _d \phi _i - \gamma _r \phi _i - \mu \phi _i \phi _n + \nabla \cdot (\phi _n \nu _i \nabla \phi _i), \nonumber \\ \partial _t \phi _r&= \gamma _r \phi _i + \gamma _e \phi _e - \mu \phi _r \phi _n + \nabla \cdot (\phi _n \nu _r \nabla \phi _r), \nonumber \\ \partial _t \phi _d&= \gamma _d \phi _i, \end{aligned}$$

(1)

where $\gamma _a, a\in \{e,r\},$ are recovery rates, $\gamma _d$ is the mortality rate of COVID-19 infected patients, $\beta _a, a\in \{e,i\},$ are contact rates, $\nu _a, a\in \{s,e,i,r\},$ are diffusivity constants of various densities, $\alpha $ is the birth rate, $\sigma $ is the inverse of the incubation period, $\mu $ is the general mortality rate (non COVID-19), and A is the constant appearing due to Allee effect. Above model is based on commonly accepted physical processes, see [15]. Boundary conditions for fields are taken as Neumann conditions:

$$\begin{aligned} \nabla \phi _a \cdot \varvec{n}= 0, \qquad \forall (\varvec{x},t) \in \varOmega \times [0,T], \forall a\in \{s,e,i,r,d\}. \end{aligned}$$

(2)

Initial conditions are given by

$$\begin{aligned} \phi _a(\varvec{x}, 0) = \phi _a^0(\varvec{x}), \qquad \forall \varvec{x}\in \varOmega , \forall a\in \{s,e,i,r,d\}. \end{aligned}$$

(3)

We ignore natural death and birth i.e. $\alpha = \mu = 0$. We remark that designing initial condition for this model is bit more involved and we discuss this in some detail in Sect. 4. Initially we have the data for the total infected cases (sum of infected, recovered, and deceased), deceased cases, recovered cases, and total population. To obtain the remaining fields, namely, exposed cases and susceptible cases, we assume that at $t=0$, there is a real number R such that

$$\begin{aligned} \phi _e(\varvec{x}, 0) = R \phi _i(\varvec{x}, 0). \end{aligned}$$

(4)

Calculations show that the model output can be sensitive to the choice of R. We add it to our list of model parameters to be determined by Bayesian inference.

3 Bayesian learning

Suppose the SEIRD (susceptible-exposed-infected-recovered-deceased) model in (1) can be expressed concisely as

$$\begin{aligned} {\mathcal {A}}(\varvec{\theta }, S; \varvec{u}(\varvec{\theta }, S)) = 0, \end{aligned}$$

(5)

where $\varvec{\theta }= (A, \beta _e, \nu _s, \nu _i, \gamma _e, \gamma _r, \gamma _d, \sigma , R) \in \Theta \subset [0,\infty )^{N_\theta }$, $N_{\theta } = 9$, is the model parameter vector, S a scenario specifying the conditions on which the problem is posed, and $\varvec{u}(\varvec{\theta }, S) = (\phi _s, \phi _e, \phi _i, \phi _r, \phi _d)$ is the vector of population densities satisfying the forward problem for a given parameter $\varvec{\theta }$ and scenario S. We assume that $\beta _i = \beta _e$ and $\nu _r = \nu _e = \nu _s$ in rest of the article. This reduces the computational complexity of the model inference problem at a very little cost of approximation.

We assume that the solution $\varvec{u}(\varvec{\theta },S)$ belongs to the space $[L^2(0,T; H^1(\Omega ))]^5$, i.e., for each $a \in \{s,e,i,r,d\}$, the components of $\varvec{u}$ satisfy

$$\begin{aligned} \int _0^T ||\phi _a||^2_1 < \infty , \end{aligned}$$

(6)

where

$$\begin{aligned} ||\phi _a||_1 = \left( \int _\Omega |\phi _a|^2 + |\nabla \phi _a|^2 d\varvec{x}\right) ^{1/2} \end{aligned}$$

(7)

is the $H^1$ Hilbert space norm. For simplicity, we let V denote the solution space, where

$$\begin{aligned} V = [L^2(0,T; H^1(\Omega ))]^5. \end{aligned}$$

(8)

In the rest of the article, we will assume that for any $\varvec{\theta }\in \Theta $ and given scenario S, the solution $\varvec{u}= \varvec{u}(\varvec{\theta },S)$ of the forward problem (5) exists and belongs to the space V.

While the solution of (5) describes the distribution of the disease over the total geographical region of the state of Texas, we are ultimately interested in the specific quantities computed from the solution $\varvec{u}$, such as the total number of cases of infected and deceased patients in each of a set of 25 districts into which the state is partitioned. The quantity of interest Q (QoI or QoIs when more than one quantity of interest) is a functional defined on the space of solutions of forward model, i.e.,

$$\begin{aligned} Q : V \rightarrow \mathbb {R}; \qquad Q(\varvec{u}(\varvec{\theta }, S_p)) = \tilde{Q}(\varvec{\theta }), \end{aligned}$$

(9)

where $S_p$ is the prediction scenario, see [23, 24]. $\tilde{Q}$ is the random function since $\varvec{\theta }$ is a random variable. For the problem at hand, we consider the total cases of infected and deceased patients in each of 25 districts of Texas at prediction time $t=T_p$ as the QoIs.

Before we describe the three key steps in OPAL Bayesian learning, it is important to first discuss the various components of the approach.

3.1 Noise in the data and the model inadequacy

3.1.1 Experimental noise

Suppose g is the real data (the ground truth), $\varvec{Y}$ is the recorded data with some margin of error, and $\epsilon $ is the noise, then $\varvec{Y}$ must be related to g by some function f (unknown)

$$\begin{aligned} \varvec{Y}= f(g, \epsilon ). \end{aligned}$$

To proceed further, we need to assume some reasonable form of function f. Following common assumptions, we suppose that $\epsilon $ follows the Gaussian distribution with mean 0 and take $f(g,\epsilon ) = g + \epsilon $ (additive noise), resulting in

$$\begin{aligned} \varvec{Y}= g + \epsilon . \end{aligned}$$

(10)

Since $\varvec{Y}$ is a vector, $\epsilon = (\epsilon _1, .., \epsilon _N)$ where each $\epsilon _i$ is given by the Gaussian distribution with 0 mean and $\sigma _{i}$ standard deviation.

3.1.2 Model inadequacy

For a given parameter $\varvec{\theta }$ and scenario S, we compute the parameter-to-observable map $\varvec{d}(\varvec{\theta }, S)$ at some time t. The model is always imperfect so a model of inadequacy must be constructed. Following [22,23,24], we assume

$$\begin{aligned} g - \varvec{d}(\varvec{\theta }, S) = \gamma (\varvec{\theta },S), \end{aligned}$$

(11)

where $\gamma (\varvec{\theta },S)$ is the modeling error and may depend on the parameters and scenario. Dependence of $\gamma $ on $\varvec{\theta }$ and S is not known and therefore one has to develop hypothesis about its values.

It is possible to combine the data noise and model inadequacy and assume a probability distribution for the combined error $\gamma + \epsilon $. If done so, we have

$$\begin{aligned} \varvec{Y}- \epsilon - \varvec{d}(\varvec{\theta }) = \gamma (\varvec{\theta }, S) \Rightarrow \varvec{Y}- \varvec{d}(\varvec{\theta }) = \epsilon + \gamma (\varvec{\theta }, S), \end{aligned}$$

(12)

i.e., the difference between the recorded data and the model output is equal to the sum of the noise and the model inadequacy. In this article, we assume $\epsilon + \gamma (\varvec{\theta }, S) \sim {\mathcal {N}}(\mathbf {0}, \Sigma )$, where $\Sigma $ is the covariance matrix. Here $x\sim {\mathcal {N}}(\mu , \sigma )$ means that x is sampled from a Gaussian distribution or the x is the random variable with the probability distribution given by ${\mathcal {N}}(\mu , \sigma )$.

3.2 The likelihood function

To infer for the model parameters, we need a likelihood probability distribution function L that assigns a penalty depending on how far the model output $\varvec{d}$ for a given parameter $\varvec{\theta }$ is from the data $\varvec{Y}$. We let

$$\begin{aligned} L(\varvec{\theta }, S; t)&= {\mathcal {N}}(\varvec{Y}(t) - \varvec{d}(\varvec{\theta }, S; t), \Sigma ). \end{aligned}$$

(13)

where $\Sigma $ is covariance matrix. We also assume that the sum of noise and model inadequacy is uncorrelated and the covariance matrix, for the noise and model inadequacy, $\Sigma $ is a diagonal matrix. We may assume that $\Sigma = \Sigma (t)$, i.e. covariance depends on the time at which the model output and the data are compared.

3.3 Calibration, validation, and prediction

Model inference in OPAL Bayesian learning framework consists of three steps: calibration, validation, and prediction, see Fig. 2. These are described below.

3.3.1 Model calibration

In this step, the model parameters are tuned so that the statistics of the output of the model agrees with that of the data. We assume that the data for model calibration correspond to the calibration scenario $S_c$. Let $\pi _c(\varvec{\theta }|S_c)$ be the prior probability distribution of the model parameters, $\pi _c(\varvec{Y}_c|\varvec{\theta },S_c)$ be the conditional probability of the data when the parameter is fixed to $\varvec{\theta }$ (the likelihood function), $\pi _c(\varvec{\theta }|\varvec{Y}_c, S_c)$ be the conditional probability of the parameter for a given data $\varvec{Y}_c$ (posterior), and $\pi _c(\varvec{Y}_c|S_c)$ be the evidence. Bayes’ rule relates the prior, likelihood, and the posterior as follows:

$$\begin{aligned} \pi _c(\varvec{\theta }|\varvec{Y}_c, S_c) = \frac{\pi _c(\varvec{Y}_c|\varvec{\theta }, S_c) \pi _c(\varvec{\theta }|S_c)}{\pi _c(\varvec{Y}_c|S_c)}. \end{aligned}$$

(14)

The evidence is the marginalization of the numerator in (14) so that the posterior $\pi _c(\varvec{\theta }|\varvec{Y}_c,S_c)$ is integrated to 1 with respect to $\varvec{\theta }\in \Theta $. It is given by

$$\begin{aligned} \pi _c(\varvec{Y}_c|S_c) = \int _{\Theta } \pi _c(\varvec{Y}_c|\varvec{\theta }, S_c) \pi _c(\varvec{\theta }|S_c) d\varvec{\theta }. \end{aligned}$$

(15)

We consider log-normal priors to ensure that the samples of parameters remain positive, see Sect. 4 for more details. Suppose we consider COVID-19 data at first $N_c$ days as the calibration data, i.e. $\varvec{Y}_c = (\varvec{Y}(t_i))_{i=1}^{N_c}$. In the scenario $S_c$, we consider the total number of infected and deceased cases in whole of Texas as the data. The corresponding parameter-to-observable map is $\varvec{d}(\varvec{\theta }, S_c; t) = (d_1, d_2)$ with

$$\begin{aligned} d_1(\varvec{\theta }, S_c; t)&= \int _{\Omega } \sum _{a\in \{i,r,d\}}\phi _a(\varvec{x}, t) d\varvec{x}, \nonumber \\ d_2(\varvec{\theta }, S_c; t)&=\int _{\Omega } \phi _d(\varvec{x}, t) d\varvec{x}. \end{aligned}$$

(16)

The likelihood function is simply the product of likelihood function at each time $t_i$, $i = 1,2,\ldots ,N_c$,

$$\begin{aligned} \pi _c(\varvec{Y}_c|\varvec{\theta }, S_c)&= \prod _{i=1}^{N_c} L(\varvec{\theta }, S_c; t_i)\nonumber \\&= \prod _{i=1}^{N_c} \prod _{j=1}^2 {\mathcal {N}}(Y_{cj}(t_i) - d_j(\varvec{\theta }, S_c; t_i), \sigma _{j}(t_i)), \end{aligned}$$

(17)

where we substituted the form of $L(\varvec{\theta }, S_c; t_i)$ from (13) and assumed that the noise in the infected cases and the deceased cases are not correlated and may depend on the time of the data.

As the prior and the likelihood functions are known, we can solve for the posterior $\pi _c(\varvec{\theta }|\varvec{Y}_c, S_c)$ using (14). (14) is solved numerically using the MCMC algorithm. These steps assimilate the data into the posterior $\pi _c(\varvec{\theta }|\varvec{Y}_c, S_c)$ which informs the sampling of parameters for an accurate model output.

3.3.2 Model validation

In this step, we validate the model by first tuning the parameters using the validation data obtained under the validation scenario $S_v$, and then computing the QoIs at validation times and comparing it with the data. Let $\pi _v(\varvec{\theta }|S_v) = \pi _v(\varvec{\theta }| \varvec{Y}_c, S_c, S_v)$ be the prior for the validation step which is conditioned on the calibration data $\varvec{Y}_c$ and the scenario $S_c$. We take the calibration posterior $\pi _c(\varvec{\theta }|\varvec{Y}_c, S_c)$ as the validation prior, i.e. $ \pi _v(\varvec{\theta }| \varvec{Y}_c, S_c, S_v) = \pi _c(\varvec{\theta }|\varvec{Y}_c, S_c)$. Let $\pi _v(\varvec{Y}_v|\varvec{\theta }, \varvec{Y}_c, S_c, S_v)$ be the likelihood function, $\pi _v(\varvec{\theta }|\varvec{Y}_v, \varvec{Y}_c, S_c, S_v)$ be the validation posterior, and $\pi _v(\varvec{Y}_v|\varvec{Y}_c, S_c, S_v)$ be the evidence.

Suppose the COVID-19 data at times $t_i$, for $i = N_c+1, N_c+2, \ldots , N_c+N_v$ defines the validation data, i.e. $\varvec{Y}_v = (\varvec{Y}(t_i))_{i=N_c+1}^{N_v}$. Similar to calibration scenario, we let the total number of infected and deceased cases in Texas be the data in validation scenario $S_v$. P-to-o map $\varvec{d}(\varvec{\theta }, S_v;t)$ is defined similarly to (16). The likelihood function is

$$\begin{aligned}&\pi _v(\varvec{Y}_v|\varvec{\theta }, \varvec{Y}_c, S_c, S_v) = \prod _{i=1}^{N_v} L(\varvec{\theta }, S_v; t_{N_c+i})\nonumber \\&\quad = \prod _{i=1}^{N_v} \prod _{j=1}^2 {\mathcal {N}}(Y_{vj}(t_{N_c + i}) - d_j(\varvec{\theta }, S_v; t_{N_c + i}), \sigma _{j}(t_{N_c + i})), \end{aligned}$$

(18)

As in the calibration step, we solve for the posterior $\pi _v(\varvec{\theta }|\varvec{Y}_v)$ using (14). For the validation of the model, we sample the parameter from the validation posterior, solve the forward problem ${\mathcal {A}}(\varvec{\theta }, S_v; \varvec{u}(\varvec{\theta }, S_v)) = 0$, and compute the QoI Q. If the difference of prediction (QoI in other words) and the data $\varvec{Y}_v$ is within the tolerance $\gamma _{tol}$, i.e.

$$\begin{aligned} d(Q(\varvec{\theta }, S_v; t_{N_c+ N_v}), \varvec{Y}(t_{N_c+ N_v})) \le \gamma _{tol}, \end{aligned}$$

(19)

then we declare the model as Not Invalid. Here, $d(\cdot , \cdot )$ is the metric that compares the two random fields, [24]. For the current model, we perform validation as follows: we consider the total infected and deceased cases in Texas as QoI and compute the standard deviation $\sigma _{inf}$ and $\sigma _{dec}$ of the normalized error in infected cases and deceased cases. With tolerance $\gamma _{inf} = 0.08$ ($8\%$ error) and $\gamma _{ dec} = 0.04$ ($4\%$ error), we check if $\sigma _{inf} < \gamma _{inf}$ and $\sigma _{dec} < \gamma _{dec}$ to determine the validity of the model.

3.3.3 Model prediction

Suppose the model was found to be Not Invalid. We compute the total infected cases and total deceased cases, as well as the infected and deceased cases in each of 25 districts in Texas, at prediction days from $t=N_c + N_v + 1$ to $t = T_p$. The QoIs so obtained are the random fields. Standard deviation of QoI indicates the uncertainty in the predictions.

4 Numerical approximation of forward problem and sensitivity analysis

In this section, we outline the numerical approximation of the forward model. The data, obtained from https://www.dshs.texas.gov/coronavirus/additionaldata/, consists of total population, total infected cases (sum of active infected, recovered, and deceased cases), deceased cases, and recovered cases for each of the 254 county in Texas. The number of recovered cases are not exact as noted in the source of the data. We process the county-wise data to obtain the data for each district and also total data in Texas. We let data in period June 1–June 20 as the calibration data and June 21–June 30 as the validation data. We obtain the map of Texas along with the district boundaries in shapefile format from http://gis-txdot.opendata.arcgis.com/datasets/texas-state-boundary. To triangulate the Texas region, we follow these steps:

1.
Load the Texas map file in QGIS software. QGIS software is freely available.
2.
Coarse grain the outer boundary segments using Simplify tool in QGIS. The original map has few very small length segments which may create problem in triangulation or result in very fine mesh.
3.
Obtain the vertices using Extract Vertices tool in QGIS and save the vertices layer using save layer as option. Select As_XY in Graphical category while saving the file in a csv format.
4.
Prepare a Gmsh input file using vertices file for triangulation.

In Fig. 3, we show the triangulation along with the total cases of infection and the total fatal cases in 25 districts at the beginning of model inference, i.e., 1st June 2020.

Table 1 Prior probability data: parameter values from previous studies [3, 30]. The values are converted to the appropriate units

Full size table

4.1 Numerical discretization

Suppose $(\cdot , \cdot )$ denote the $L^2$ inner product over domain $\Omega $. We assume homogeneous Neumann boundary condition for all densities. Let $V_a = H^1(\Omega )$ for $a\in \{s,e,i,r,d,n\}$ and $V_a^h$ be the finite dimensional approximation of $V_a$ by continuous piecewise linear interpolations over triangulation $\mathcal{{T}}^h$. Suppose $\phi _{a_n} \in V_{a}^h$ denote the solution at time $t_n$. Given $\{\phi _{a_n}\}_{a\in \{s,e,i,r,d\}}$, we seek $\phi _{a_{n+1}}$ at time $t_{n+1} = t_n + \Delta t$. Since the equations for $\phi _a$ are nonlinearly coupled, we consider a fixed point iteration at each time step. Let $\phi _a^k,\phi _a^{k+1} \in V_{a}^h$ denote the current iteration and next iteration (unknown) approximation of $\phi _{a_{n+1}}$ and let $\tilde{\phi }_a \in V_{a}^h$ denote the test function. At iteration step k at time $t_n$, weak forms for susceptible, exposed, infected, and deceased density fields are as follows:

1. Susceptible

$$\begin{aligned}&(\phi _s^{k+1}, \tilde{\phi }_s) + \Delta t \left( \left( 1 - \frac{A}{\phi _n^k}\right) \beta _i\phi _i^k \phi _s^{k+1}, \tilde{\phi }_s \right) \nonumber \\&\qquad + \Delta t \left( \left( 1 - \frac{A}{\phi _n^k}\right) \beta _e\phi _e^k \phi _s^{k+1}, \tilde{\phi }_s \right) \nonumber \\&\qquad + \Delta t (\mu \phi _n^k \phi ^{k+1}_s, \tilde{\phi }_s) + \Delta t (\nu _s \phi _n^k \nabla \phi ^{k+1}_s, \nabla \tilde{\phi }_s) \nonumber \\&\quad = (\phi _{s_n}, \tilde{\phi }_s) + \Delta t(\alpha \phi _n^k, \tilde{\phi }_s). \end{aligned}$$

(20)

2. Exposed

$$\begin{aligned}&(\phi _e^{k+1}, \tilde{\phi }_e) + \Delta t((\sigma + \gamma _e) \phi _e^{k+1}, \tilde{\phi }_e) \nonumber \\&\qquad + \Delta t (\mu \phi _n^k \phi ^{k+1}_e, \tilde{\phi }_e) + \Delta t (\nu _e \phi _n^k \nabla \phi ^{k+1}_e, \nabla \tilde{\phi }_e) \nonumber \\&\quad = (\phi _{e_n}, \tilde{\phi }_e) + \Delta t \left( \left( 1 - \frac{A}{\phi _n^k}\right) \beta _i\phi _s^{k+1} \phi _i^k, \tilde{\phi }_e \right) \nonumber \\&\qquad + \Delta t \left( \left( 1 - \frac{A}{\phi _n^k}\right) \beta _e\phi _s^{k+1} \phi _e^k, \tilde{\phi }_e \right) . \end{aligned}$$

(21)

3. Infected

$$\begin{aligned}&(\phi _i^{k+1}, \tilde{\phi }_i) + \Delta t((\gamma _d + \gamma _r) \phi _i^{k+1}, \tilde{\phi }_i) \nonumber \\&\qquad + \Delta t (\mu \phi _n^k \phi ^{k+1}_i, \tilde{\phi }_i) + \Delta t (\nu _i \phi _n^k \nabla \phi ^{k+1}_i, \nabla \tilde{\phi }_i) \nonumber \\&\quad = (\phi _{i_n}, \tilde{\phi }_i) + \Delta t (\sigma \phi _e^{k+1}, \tilde{\phi }_i). \end{aligned}$$

(22)

4. Recovered

$$\begin{aligned}&(\phi _r^{k+1}, \tilde{\phi }_r) + \Delta t (\mu \phi _n^k \phi ^{k+1}_r, \tilde{\phi }_r) + \Delta t (\nu _r \phi _n^k \nabla \phi ^{k+1}_r, \nabla \tilde{\phi }_r) \nonumber \\&\quad = (\phi _{r_n}, \tilde{\phi }_r) + \Delta t (\gamma _r \phi _i^{k+1}, \tilde{\phi }_r) + \Delta t (\gamma _e \phi _e^{k+1}, \tilde{\phi }_r). \end{aligned}$$

(23)

5. Deceased

$$\begin{aligned} (\phi _d^{k+1}, \tilde{\phi }_d) = (\phi _{d_n}, \tilde{\phi }_d) + \Delta t (\gamma _d \phi _i^{k+1}, \tilde{\phi }_d). \end{aligned}$$

(24)

The fields $\phi _a^{k+1}$ are solved in the same order as their weak forms are presented above. Note that in weak form for $\phi _e$, we consider updated solution $\phi _s^{k+1}$ instead of $\phi _s^k$. Similar is true for other equations. Algorithm 1 provides the key steps required to solve the forward problem. The solver is implemented through FEniCS [1, 20]. The resulting linear systems in (20)–(24) are solved by the GMRES algorithm with the incomplete LU preconditioners.

4.2 Units

We let time be in the units of days and length in units of 100 km. Let $\rho _0 = 10000 \text { people}/(100 \text { km})^2$. The densities $\phi _a$, $a\{s,e,i,r,d,n\}$, are in units of $\rho _0$. The parameters $\alpha , \gamma _e, \gamma _r, \gamma _d, \sigma $ have unit of 1/day, parameters $\beta _i, \beta _e, \mu $ have unit of $1/(\rho _0 \text { day})$, A has unit of $1/\rho _0$, and $\nu _s, \nu _e, \nu _i, \nu _r$ have unit of $(100 \text { km})^2/(\rho _0 \text { day})$.

Table 2 The mean, the variance in ln() of the parameter space, and the approximated mode for each parameters of interest derived from the calibration posterior samples

Full size table

4.3 Initial condition

We obtain the total population, total infected cases, deceased cases, recovered cases for each of the county from the data at $t=0$ (1st June 2020). To specify the initial population densities $\phi _a$, $a\in {s,e,i,r,d}$, we proceed as follows: for $\phi _a$, $a\in \{i, r, d, n\}$, we consider the following sum of 254 Gaussian functions centered at the centroid of counties:

$$\begin{aligned} \phi _a(\varvec{x}, 0) = \sum _{i=1}^{254} A_i \exp \left[ -\frac{|\varvec{x}- \varvec{x}_{c,i}|^2}{2B_i^2} \right] , \end{aligned}$$

(25)

where $A_i$ is the amplitude of Gaussian, $B_i$ is the length scale controlling the decay, $\varvec{x}_{c,i}$ is the centroid of $i^{\text {th}}$ county. We take $B_i = \sqrt{\text {Area of county } i/(4\pi )}$ and choose amplitude such that the integration of individual Gaussian functions over the $\mathbb {R}^2$ is same as the number of cases (infected, recovered, deceased or total population depending on $a\in \{i, r, d\}$). This approach leads to $95.5\%$ of the number of cases in each county fall into a circle centered at its centroid with the radius approximated by square-root of its area. To determine the remaining two species, we first hypothesize that the exposed cases density is given by

$$\begin{aligned} \phi _e(\varvec{x}, 0) = R \phi _i(\varvec{x}, 0). \end{aligned}$$

(26)

Using the fact that $\phi _n = \sum _{a\in \{s,e,i,r,d\}} \phi _a$, we determine $\phi _s$. The parameter R in above is treated as the model parameter. We will see next the effect of parameter R on the total infected cases.

4.4 Sensitivity analysis

In this section, we perform a sensitivity analysis of quantities of interest on different parameters. We consider the total infected and total deceased cases as the QoI. We consider two settings. In the first setting, we fix $\gamma _e, \gamma _r, \gamma _d, \sigma $ according to Table 1 and let $\varvec{\theta }= (A, \beta _e, \nu _s, \nu _i, R)$ denote the model parameters. In the second setting, we include $\gamma _e,\gamma _r, \gamma _d, \sigma $ in the parameter list. In Table 1, we list the values of parameters reported in the previous study and their range considered in the sensitivity study.

We performed the convergence study to confirm that the model has been discretized correctly in space and time. The choice of mesh and time step were constrained by the fact that the PDEs have to be solved many times. The triangulation of map in this and the study in the next section consists of 2969 vertices and 5683 triangle elements. The mesh size is about 18.942 km. The final time is $T=20$ days and the size of time step is $\Delta t = 0.1$ day.

Table 3 The mean, the variance in ln() of the parameter space, and the approximated mode for each parameters of interest derived from the validation posterior samples

Full size table

We employ open source library SALib and use the method of Morris [6, 21] for sensitivity calculation. We generate 1200 and 2000 samples of parameters for setting 1 and 2, and compute the total infected and deceased case for each parameter. In Fig. 4, we plot the $\mu ^*$ (mean of the absolute value of the elementary effects), total infected and deceased QoIs at different parameter samples. From the plots, we note that while the variation in the infected QoI is very large indicating that the model can be calibrated. The variation in the deceased QoI is extremely small (below order 1) in setting 1 indicating that the model can not be calibrated for a given deceased data. The results of setting 2 show that the deceased QoI is most sensitive to parameters $\gamma _d, \sigma , \gamma _r$ and the other parameters have almost no effect. For this reason the parameters $\gamma _d, \sigma , \gamma _r$ are kept variable and learned from the data. Results also show the negligible effect of $\gamma _e, \nu _i$ on QoIs, and, therefore, their values are fixed from Table 1.

5 Inference results

We consider the total infected and deceased cases respectively in the period June 1–June 20 and June 21–June 30 as the calibration and validation data. We predict the number of infected and deceased cases for period July 1–September 1. We assume that $\nu _r = \nu _e = \nu _s$ and $\beta _i = \beta _e$. Based on the sensitivity study in the preceding section, we fix values of $\gamma _e$ and $\nu _i$ from Table 1 and consider $\varvec{\theta }= (A, \beta _e, \nu _s, \gamma _r, \gamma _d, \sigma , R)$. For the posterior sampling in the calibration and validation steps, we utilize the preconditioned Crank-Nicolson (pCN) algorithm implemented in the hIPPYlib library (version 3.0.0) [31, 32].

5.1 Calibration

We consider a log-normal prior with mode and variance for parameters given in Table 1. We note that for the parameter A, we consider a mode of 400 to ensure that A is not sampled often in a nonphysical regime. From the SEIRD model (1), the term $ \left( 1 - \frac{A}{\phi _n} \right) \beta _i \phi _s \phi _i $ represents the portion of the susceptible population transitioning to exposed due to infected population. If A is such that $1 - A/\phi _n < 0$ then the transmission direction is reversed which is nonphysical and undesired.

The pCN algorithm is employed for generating samples from the posterior distribution, which is ideal for the inference problems with high dimensional parameter spaces and Gaussian priors. We refer to the interested readers to [4, 5, 13] and the reference within for the theory associated with Markov chain Monte Carlo and the pCN algorithm. With multiple runs of the pCN algorithms with different step size factors $\beta $, we choose $\beta = 0.3$ to maximize the efficiency of the posterior samples.

A set of $\sim 3500$ calibration posterior samples are obtained through running 4 independent chains, with an average acceptance rate of $20\%$. The results for the Bayesian calibration of the model parameters, including both a typical chain evolution of the model outputs and the marginalized calibration posterior densities are shown in Fig. 5. The model outputs of the calibration posterior samples match the data with reasonable precision. The marginalized calibration posterior densities indicate a higher recovery rate, a lower mortality rate, and a longer incubation period compared to our prior assumptions, with the approximated modes at $\sim 1/13.6 \text { day}^{-1}$, $\sim 1/462 \text { day}^{-1}$, and $\sim 1/12.2 \text { day}^{-1}$ respectively. A summary of the mean, the variance, and the approximated mode is shown in Table 2. The model outputs for all the calibration posterior samples and the model output at the mean of the calibration posterior are plotted with the data in Fig. 6.

5.2 Validation

We approximate the calibration posterior density by a log-normal density using its mean and variance and use it as the prior density for the validation step. We employ the pCN algorithm with the step size factor $\beta = 0.3$ to sample from the validation posterior density. A set of $\sim 4500$ validation posterior samples are obtained through running 4 chains with an average acceptance rate of $30\%$. Using the validation posterior, we compute the total infected cases and deceased cases from $t = 20$ to $t=30$ and compare with the data. The standard deviation of the normalized error in total infected and deceased cases are found to be $0.0863 > \gamma _{inf} = 0.08$ and $0.0068 < \gamma _{dec} = 0.04$. This implies that the model is Invalid for the infected QoI and Not Invalid for the deceased QoI. This conclusion is strengthened by the plots in Fig. 8 which shows that the model under predicts the infected cases whereas the model prediction of the deceased cases is very close to the data.

5.3 Prediction

Using the validation posterior, we compute the total number of cases in 25 districts until September 1, 2020. The model predicts 7003 fatalities with $95\%$ CI 6802–7204 and 301658 total cases of COVID-19 infection with $95\%$ CI 290251–313064 in Texas by September 1, 2020. Uncertainty, in terms of the standard deviation of the quantity of interest, in the prediction for deceased and infected cases are 102 and 5786 respectively. Figure 9 shows the evolution of cases in Texas along with the confidence intervals. We select top five districts in terms of the total infected cases as of June 30, 2020 and plot the evolution of the cases in these five districts until September 1, 2020. Figure 11 shows the projection of the district QoIs by August 15 and September 1 on Texas map.

6 Conclusion

Bayesian techniques have been employed to predict the COVID-19 spread in Texas. The model is found to be adequate to predict the deceased cases, however, falls short for the infected cases. By September 1, we predict to see about 7003 fatalities and 301658 infected cases. Uncertainties, in terms of the standard deviation of the QoI distribution, in deceased and infected cases are about 102 and 5786. Calculations show the SEIRD model employed in this work is not valid for the prediction of the infected cases. The cases of COVID-19 infection has been rising rapidly and it may be the case that the model is not adequate to account for the rapid increase in cases.

Several extensions of the model can be considered. For example, the model parameters can be allowed to vary in time and space; see [10, 17] where the parameters in ODE based SEIRD model are considered to be time dependent. Physical landscape or heterogeneities can be added to the model by considering non-homogenous and possibly anisotropic diffusion models [16]; higher infection diffusivity in densely populated counties, anisotropic diffusion to include the effects of highways/freeways. Another aspect believed to play a major role in the dynamics of COVID-19 spread is the asymptomatic/mildly symptomatic cases which are often not accounted in the data, see [7, 12, 19, 25, 29]. This work can be extended, similar to [26, 27], by subdividing the infected portion of the population into the asymptomatic and symptomatic groups to account for the effects of the unreported cases. OPAL provides a framework to rank models and select the best model for prediction. It can be applied to different variants of the SIR model such as SIS (susceptible-infected-susceptible), SEIR, SIRD, MSIR (M stands for immunity inherited from mother), SEIIR (II for infected but asymptomatic and infected but symptomatic), etc and find the best model for the prediction of the infected cases.

References

Alnæs MS, Blechta J, Hake J, Johansson A, Kehlet B, Logg A, Richardson C, Ring J, Rognes ME, Wells GN (2015) The fenics project version 1.5. Archive of Numerical Software 3:
Anastassopoulou C, Russo L, Tsakris A, Siettos C (2020) Data-based analysis, modelling and forecasting of the covid-19 outbreak. PloS One 15:e0230405
Article Google Scholar
Backer JA, Klinkenberg D, Wallinga J (2020) Incubation period of 2019 novel coronavirus (2019-ncov) infections among travellers from Wuhan, China, 20–28 January 2020. Eurosurveillance 25:2000062
Article Google Scholar
Beskos A, Pinski FJ, Sanz-Serna JM, Stuart AM (2011) Hybrid Monte Carlo on Hilbert spaces. Stoch Process Appl 121:2201–2230
Article MathSciNet Google Scholar
Brooks S, Gelman A, Jones G, Meng X-L (2011) Handbook of Markov Chain Monte Carlo. CRC Press, Boca Raton
Book Google Scholar
Campolongo F, Cariboni J, Saltelli A (2007) An effective screening design for sensitivity analysis of large models. Environ Model Softw 22:1509–1518
Article Google Scholar
Chan JF-W, Yuan S, Kok K-H, To KK-W, Chu H, Yang J, Xing F, Liu J, Yip CC-Y, Poon RW-S et al (2020) A familial cluster of pneumonia associated with the 2019 novel coronavirus indicating person-to-person transmission: a study of a family cluster. The Lancet 395:514–523
Article Google Scholar
Chen N, Zhou M, Dong X, Qu J, Gong F, Han Y, Qiu Y, Wang J, Liu Y, Wei Y et al (2020) Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study. The Lancet 395:507–513
Article Google Scholar
Dowd JB, Andriano L, Brazel DM, Rotondi V, Block P, Ding X, Liu Y, Mills MC (2020) Demographic science aids in understanding the spread and fatality rates of covid-19. Proc Natl Acad Sci 117:9696–9698
Article Google Scholar
Dureau J, Kalogeropoulos K, Baguelin M (2013) Capturing the time-varying drivers of an epidemic using stochastic dynamical systems. Biostatistics 14:541–555
Article Google Scholar
Farrell K, Oden JT, Faghihi D (2015) A bayesian framework for adaptive selection, calibration, and validation of coarse-grained models of atomistic systems. J Comput Phys 295:189–208
Article MathSciNet Google Scholar
Gatto M, Bertuzzo E, Mari L, Miccoli S, Carraro L, Casagrandi R, Rinaldo A (2020) Spread and dynamics of the covid-19 epidemic in Italy: effects of emergency containment measures. Proc Natl Acad Sci 117:10484–10491
Article Google Scholar
Hairer M, Stuart AM, Vollmer SJ (2014) Spectral gaps for a metropolis-hastings algorithm in infinite dimensions. Ann Appl Probab 24:2455–2490
Article MathSciNet Google Scholar
Hellewell J, Abbott S, Gimma A, Bosse NI, Jarvis CI, Russell TW, Munday JD, Kucharski AJ, Edmunds WJ, Sun F, Flasche S, Quilty BJ, Davies N, Liu Y, Clifford S, Klepac P, Jit M, Diamond C, Gibbs H, van Zandvoort K, Funk S, Eggo RM (2020) Feasibility of controlling COVID-19 outbreaks by isolation of cases and contacts. Lancet Glob Health 8(4):e488–e496. https://doi.org/10.1016/S2214-109X(20)30074-7
Keeling MJ, Rohani P (2011) Modeling infectious diseases in humans and animals. Princeton University Press, Princeton
Book Google Scholar
Keller JP, Gerardo-Giorda L, Veneziani A (2013) Numerical simulation of a susceptible-exposed-infectious space-continuous model for the spread of rabies in raccoons across a realistic landscape. J Biol Dyn 7:31–46
Article MathSciNet Google Scholar
Kucharski AJ, Russell TW, Diamond C, Liu Y, Edmunds J, Funk S, Eggo RM, Sun F, Jit M, Munday JD, Davies N, Gimma A, van Zandvoort K, Gibbs H, Hellewell J, Jarvis CI, Clifford S, Quilty BJ, Bosse NI, Abbott S, Klepac P, Flasche S (2020) Early dynamics of transmission and control of COVID-19: a mathematical modelling study. Lancet Infect Dis 20(5):553–558. https://doi.org/10.1016/S1473-3099(20)30144-4
Li Q, Guan X, Wu P, Wang X, Zhou L, Tong Y, Ren R, Leung KSM, Lau EHY, Wong JY, Xing X, Xiang N, Wu Y, Li C, Chen Q, Li D, Liu T, Zhao J, Liu M, Tu W, Chen C, Jin L, Yang R, Wang Q, Zhou S, Wang R, Liu H, Luo Y, Liu Y, Shao G, Li H, Tao Z, Yang Y, Deng Z, Liu B, Ma Z, Zhang Y, Shi G, Lam TTY, Wu JT, Gao GF, Cowling BJ, Yang B, Leung GM, Feng Z (2020) Early transmission dynamics in Wuhan, China, of Novel Coronavirus–Infected Pneumonia. New Engl J Med 382(13):1199–1207. https://doi.org/10.1056/NEJMoa2001316
Li R, Pei S, Chen B, Song Y, Zhang T, Yang W, Shaman J (2020) Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (sars-cov-2). Science 368:489–493
Article Google Scholar
Logg A, Mardal K-A, Wells GN et al (2012) Automated solution of differential equations by the finite element method. Springer, Berlin
Book Google Scholar
Morris MD (1991) Factorial sampling plans for preliminary computational experiments. Technometrics 33:161–174
Article Google Scholar
Oden J (2017) Foundations of predictive computational sciences, ICES Reports
Oden JT (2018) Adaptive multiscale predictive modelling. Acta Numer 27:353
Article MathSciNet Google Scholar
Oden JT, Babuška I, Faghihi D (2017) Predictive computational science: computer predictions in the presence of uncertainty. Encyclopedia of Computational Mechanics Second Edition 1–26
Pan X, Chen D, Xia Y, Wu X, Li T, Ou X, Zhou L, Liu J (2020) Asymptomatic cases in a family cluster with sars-cov-2 infection. Lancet Infect Dis 20:410–411
Article Google Scholar
Park SW, Cornforth DM, Dushoff J, Weitz, JS (2020) The time scale of asymptomatic transmission affects estimates of epidemic potential in the covid-19 outbreak, Epidemics, p 100392
Peirlinck M, Linka K, Costabal FS, Bendavid E, Bhattacharya J, Ioannidis J, Kuhl E (2020) Visualizing the invisible: the effect of asymptomatic transmission on the outbreak dynamics of covid-19, medRxiv
Surveillances V (2020) The epidemiological characteristics of an outbreak of 2019 novel coronavirus diseases (covid-19)–hina. China CDC Weekly 2(2020):113–122
Google Scholar
Tang B, Xia F, Bragazzi NL, Wang X, He S, Sun X, Tang S, Xiao Y, Wu J (2020) Lessons drawn from china and south korea for managing covid-19 epidemic: insights from a comparative modeling study, medRxiv
Viguerie A, Lorenzo G, Auricchio F, Baroli D, Hughes TJR, Patton A, Reali A, Yankeelov TE, Veneziani A (2021) Simulating the spread of COVID-19 via a spatially-resolved susceptible–exposed–infected–recovered–deceased (SEIRD) model with heterogeneous diffusion. Appl Math Lett 111:106617. https://doi.org/10.1016/j.aml.2020.106617
Villa U, Petra N, Ghattas O (2019) hIPPYlib: an extensible software framework for large-scale inverse problems governed by PDEs; Part I: deterministic inversion and linearized Bayesian inference. arxiv:1909.03948
Villa U, Petra N, Ghattas O (2018) hIPPYlib: an extensible software framework for large-scale deterministic and Bayesian inverse problems. J Open Source Softw 3(30):940. https://doi.org/10.21105/joss.00940

Download references

Acknowledgements

This work was supported by funds provided by the Cockrell Family Regents Chair in Engineering $\#2$ at the University of Texas at Austin.

Author information

Authors and Affiliations

Oden Institute for Computational Engineering and Sciences, The University of Texas at Austin, Austin, USA
Prashant K. Jha, Lianghao Cao & J. Tinsley Oden

Authors

Prashant K. Jha
View author publications
You can also search for this author in PubMed Google Scholar
Lianghao Cao
View author publications
You can also search for this author in PubMed Google Scholar
J. Tinsley Oden
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Prashant K. Jha.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Jha, P.K., Cao, L. & Oden, J.T. Bayesian-based predictions of COVID-19 evolution in Texas using multispecies mixture-theoretic continuum models. Comput Mech 66, 1055–1068 (2020). https://doi.org/10.1007/s00466-020-01889-z

Download citation

Received: 15 July 2020
Accepted: 19 July 2020
Published: 31 July 2020
Issue Date: November 2020
DOI: https://doi.org/10.1007/s00466-020-01889-z

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Bayesian-based predictions of COVID-19 evolution in Texas using multispecies mixture-theoretic continuum models

Abstract

Similar content being viewed by others

Understanding COVID-19 Epidemics: A Multi-Scale Modeling Approach

Spatio-temporal predictive modeling framework for infectious disease spread

Hybrid Modeling of COVID-19 Spatial Propagation over an Island Country

1 Introduction

2 Reaction–diffusion model

3 Bayesian learning