Sequential time-window learning with approximate Bayesian computation: an application to epidemic forecasting

Valeriano, João Pedro; Cintra, Pedro Henrique; Libotte, Gustavo; Reis, Igor; Fontinele, Felipe; Silva, Renato; Malta, Sandra

doi:10.1007/s11071-022-07865-x

Sequential time-window learning with approximate Bayesian computation: an application to epidemic forecasting

Original Paper
Published: 25 September 2022

Volume 111, pages 549–558, (2023)
Cite this article

Download PDF

Nonlinear Dynamics Aims and scope Submit manuscript

Sequential time-window learning with approximate Bayesian computation: an application to epidemic forecasting

Download PDF

João Pedro Valeriano¹,
Pedro Henrique Cintra²,
Gustavo Libotte³^nAff4,
Igor Reis⁵,
Felipe Fontinele⁶,
Renato Silva³ &
…
Sandra Malta³

1 Citation
3 Altmetric
Explore all metrics

Abstract

The long duration of the COVID-19 pandemic allowed for multiple bursts in the infection and death rates, the so-called epidemic waves. This complex behavior is no longer tractable by simple compartmental model and requires more sophisticated mathematical techniques for analyzing epidemic data and generating reliable forecasts. In this work, we propose a framework for analyzing complex dynamical systems by dividing the data in consecutive time-windows to be separately analyzed. We fit parameters for each time-window through an approximate Bayesian computation (ABC) algorithm, and the posterior distribution of parameters obtained for one window is used as the prior distribution for the next window. This Bayesian learning approach is tested with data on COVID-19 cases in multiple countries and is shown to improve ABC performance and to produce good short-term forecasting.

Forecasting Multi-Wave Epidemics Through Bayesian Inference

Article 28 July 2021

Assessing parameter identifiability in compartmental dynamic models using a computational approach: application to infectious disease transmission models

Article Open access 14 January 2019

GrowthPredict: A toolbox and tutorial-based primer for fitting and forecasting growth trajectories using phenomenological growth models

Article Open access 18 January 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Since the onset of the novel Coronavirus (SARS-CoV-2) pandemic, computational methodologies have played a fundamental role in helping to understand the dynamics of the spread of the virus in society [1]. Computational models are capable of capturing, to a certain extent, the behavior of the data that describes the advance of the virus, making it possible to simulate predictive scenarios that collaborate with the decision-making of government authorities and in the allocation of medical and financial resources. Mathematical models and computer simulations can also provide relevant indicators to assist in the implementation of social distancing measures, hoping to stave off the advance of the disease. Adiga et al. [2] present a comparative analysis of computational models used to describe the behavior of the epidemic. Challenges of modeling COVID-19 are discussed in Refs. [3, 4], whereas Eker [5] analyzes the validity and usefulness of computational models in such context.

To date, the world has had more than 378 million confirmed cases, with more than 569 million individuals dead [6]. In several countries, the number of daily cases has already had at least two waves of infection, when a meaningful increase in the number of cases occurs after a significant drop in the number of new infections during the previous wave. Numerous compartmental models, which have been widely used to simulate the spreading of COVID-19 [7], in its canonical form, have no descriptive capacity to represent the behavior of data with multiple waves [8, 9]. Further drawbacks of the classical SIR model are discussed by Singh and Gupta [10]. This poses an even greater challenge when such models are used to simulate the spreading dynamics of COVID-19, requiring more sophisticated computational frameworks to be established, to provide more reliable results.

A growing body of literature has proposed computational models and techniques to overcome the difficulties imposed by the data when the epidemic is at an advanced stage. Of note, the variety of works related to the modeling of the second (and subsequent) waves of COVID-19 is more restricted than those related to the early stages of the pandemic. Below, we summarize the most relevant ones that we are aware of. Kaxiras and Neofotistos [11] extended the forced-SIR model, proposed in Ref. [12], which provides an approximate analytical solution for the differential equations that represent the well-known SIR model, to allow multiple waves to be captured; Cacciapaglia et al. [13] model the multi-wave pattern by considering a master equation for the time-evolution of the total number of infected individuals in particular locations. Such equation is based on the epidemic Renormalization Group (eRG) framework [14], which is extended to include the diffusion of the epidemic between multiple nearly isolated regions; Singh and Gupta [10] propose what they call the Generalized SIR (GSIR) model, which is an integrative model encompassing multiple waves that emerge and vanish within a time interval. Special solutions of the constituent waves of the model are demonstrated, employing well-known growth functions, leading to time-varying parameters and a closed-form solution of all the system parameters.

2 Motivation and objectives

As mentioned before, although there is no current closed definition for an infection wave, several countries have had more than one sequence of sharp increases followed by a substantial drops in the number of daily new cases, which is popularly characterized as an infection wave. Typical compartmental models (such as SIRD and SEIRD) are not capable of capturing this behavior considering its canonical structure [15]. Such restriction is a result of the basic assumptions behind the model, that the population is homogeneously mixed, resulting in one single infection wave until the so-called herd immunity is reached.

To account for inhomogeneous mixing in the population, reinfection due to poor immune response or immunity loss, specific social behaviors, or governmental policies that can change the infection dynamics, several groups work with modified SIRD/SEIRD models [16,17,18]. However, adding more compartments may drastically increase the number of parameters to be fitted in the model. For instance, Ramezani et al. [17] implement a modified SEIRD model to account for asymptomatic patients and individuals who self-isolate (SEARIDQ model), which uses a total of 14 parameters. Such an increase in the number of parameters also increases the chances of falling into a non-identifiable model, using the same dataset [19], given the same number of curves to be fitted. Although some techniques have been proposed to bypass this problem, they often require more data than what is available.

Another approach for capturing the complex dynamics of local epidemics is to use SIRD/SEIRD models with time-varying parameters. For example, if $\beta $ corresponds to the infection rate of susceptible individuals, the use of masks or social isolation, therefore, decreases its value [20, 21]. In this context, Dehning et al considered a SIR model with a time-varying infection rate and conducted Bayesian inference to identify times of variation of this rate and connect these times to non-pharmaceutical interventions implemented in Germany at the early stages of the COVID-19 epidemic [22]. The introduction of time-varying parameters usually requires confining their variation to an analytic function, which may not represent the true temporal dynamics of parameters, once many of them are not directly measurable. This choice also affects the generality of the model, as a particular choice of the functional form that describes a parameter may not apply in another context. Furthermore, compartmental models struggle to take into account testing and contact tracing in their dynamics, which further complicates the use of time-varying functions [23]. Even if we overcome this problem, the individual policies, social behavior and testing of each country should make the generalization of these models for other countries nearly impossible.

The challenge faced by epidemic models also increases, as new variants with higher transmissibility or immune escape emerge, such as the variants of concern (VOC) Alpha (B.1.1.7), Beta (B.1.351), Delta (B.1.617.2), Gamma (P.1), and Omicron (B.1.1.529). The appearance of each VOC is associated with local or global change in the temporal dynamics of parameters associated with the pandemic [24, 25]. For example, the Alpha variant is associated with a 50% increase in transmissibility. Such an increase may reflect on a change in $\beta $ over time as the variant spread through a region [26].

Aiming to provide an alternative to fitting the limited amount of data and producing accurate short-term predictions, we propose a time-window SEIRD model, with time-varying window size as the rate of effective parameter change is not the same throughout the epidemic, and different window sizes may be more appropriate at different times. The parameters of the model may be considered constant through the time-window being fitted (see Sect. 3.1 for further details) and the number of parameters of the model remains the same, decreasing the chances of falling into a non-identifiable problem. This procedure allows capturing the temporal variations of epidemiological parameters along time-windows without requiring the model to be defined with time-dependent parameters, making it possible to fit a curve with a simple model with piece-wise constant parameters within each time-window, emulating the behavior of time-varying parameters, but not defined by an analytic function. Time-window methods are common in nowcasting (correction for reporting delays) methods for epidemiological surveillance [27,28,29].

To the best of our knowledge, the framework proposed by Liao et al. [30] is the one that most resembles what is being proposed here. Although both methodologies use an approach in which data are divided into time-windows, the fundamental difference is that the methodology of Liao et al. [30] uses an exhaustive search associated with the least-squares method to determine the optimal parameters of the compartmental model, and a machine learning method is employed to track and predict the values of parameters, based on the variation of the values of the basic reproduction number and a growth rate in the historical data. On the other hand, we adopt a Bayesian approach, so that the knowledge obtained from past windows is propagated to the later windows, to gradually fit the data and compose the behavior of the model.

Keeping in mind the choice of using time-windows to analyze data, the first idea might be to deal with each window separately and fit every one of them independently. As we will show, this can be an inefficient approach, and we propose an alternative solution to connect information between consecutive time-windows and use this to improve model fitting. For this purpose, we need an inference algorithm capable of using information acquired in a previous window to fit the next one.

In this work, we choose to use ABC-SMC (approximate Bayesian computation with sequential Monte Carlo) [31], which generates a posterior distribution for the parameters of the model in an arbitrary window. This posterior distribution can then be used as the prior distribution for the next time-window, and this procedure goes for every following time-window in the dataset.

In Results section, we present the fitting of data on COVID-19 cumulative cases and deaths in Brazil, as a proof of concept of the improvement gained by fitting time-windows using past window posteriors instead of flat priors.

3 Methodology

3.1 Time-window fitting and the use of past window posteriors

Our goal is to analyze an epidemiological time series of cumulative infected and dead individuals, considering a model of coupled ordinary differential equations. We consider long enough time periods over which the data spam over, such that the epidemiological parameters change over time. Such a change can be due to a particular social behavior, governmental policies, environmental factors, or natural selection—all of which may lead to the emergence of multiple epidemic waves. In this case, one may suppose that the principles for the system’s time evolution are the same, but some of its properties have changed over time, that is, the model is the same over the time series, although the parameters probably change.

As we are not interested in a functional form for the time variation of the model parameters, we take an alternative approach. If one considers only a small enough time interval of the dataset, then this interval should be reasonably described by a model with constant parameters. Motivated by this fact, we divide the epidemic data into multiple time-windows, each to be fitted separately with the same model, but obtaining different sets of piece-wise constant parameters.

The fitting algorithm starts by choosing a number N of days to be considered in each time-window. We also need to choose by how many days one window shall be shifted from the previous. This shift will be denoted by d (days). Therefore, if the first time-window goes from day 1 until day N, the second time-window will go from day $1+d$ until day $N+d$. Notice that, if $d < N$, there will be an overlap of $N-d$ days between consecutive windows. We fit the model to the data of a time-window using the ABC-SMC algorithm that generates a posterior distribution for the parameters, which in turn can be used to make predictions for periods following the end of the current time-window.

The use of the ABC-SMC algorithm for fitting the model implies the choice of a prior distribution for the model’s parameters. For simplicity, for each time-window, one can start by adopting an uniform prior distributions for all parameters (with different ranges depending on the nature of each parameter), in the case of lack of knowledge to build more informative priors.

We propose a way to go beyond the flat prior simplification, still without considering the knowledge gained from the data, but only the knowledge obtained while fitting data on past time-windows. By hypothesis, if we consider that the data are described by an ordinary differential equation model with time-varying parameters, the difference between the distributions of such parameters for consecutive time-windows should be small, especially in the case that an overlap exists between consecutive time-windows. Therefore the posterior distribution obtained for the n-th window should provide a reasonable initial estimate—the prior distribution—for the $(n+1)$-th window. So we propose to use this approach instead of a flat prior in order to provide useful information for the ABC-SMC fitting algorithm, further optimizing the process.

3.2 Adaptive window size

A possible problem with dividing the epidemic data into multiple time-windows is how to choose the window sizes. It is important to notice that varying window sizes may be more appropriate for different windows of the time series. To counter that, we developed a simple algorithm to choose the window size of the n-th window based on the goodness-of-fit in the past two windows.

First, a given size $s_{1}$ is chosen for the first time-window, and we set the lower and upper bounds for window sizes, denoted by $s_{\mathrm {min}}$ and $s_{\mathrm {max}}$, respectively. In turn, the step size in the window size variation, $\Delta s$, is chosen to be the same as the offset d between the last days of consecutive time-windows. Then, the second time-window will have the same size as the first one, that is, $s_{2}=s_{1}$. But, starting from the third time-window, the window size will be chosen by the following algorithm: let $y_{m}^{i}$ be the actual data for the m-th day of the time-window, whereas ${\hat{y}}_{m}^{i}$ denotes the prediction of the model for the same day. If n indexes the size $s_{n}$ of the n-th time-window, the normalized root mean square deviation (NRMSD) for the i-th component of the data vector—denoted by $\varepsilon ^{i}_{n}$—over the same window is given by

$$\begin{aligned} \varepsilon ^{i}_{n} = \frac{\displaystyle \sqrt{\sum _{m} \frac{({\hat{y}}^{i}_{m} - y^{i}_{m})^2}{s_{n}}}}{y^{i}_{\mathrm {max}}-y^{i}_{\mathrm {min}}} \; , \end{aligned}$$

(1)

so that

$$\begin{aligned} \varepsilon _{n} = \sum \limits _{i} \varepsilon ^{i}_{n} \; , \end{aligned}$$

(2)

where i identifies the component of which the NRMSD is being calculated, the index m runs over the days inside the n-th time-window. Then, for the n-th window, with $n \ge 3$, the window size is chosen according to Algorithm 1.

The intuition behind this procedure is that smaller time-windows are easier to fit. In this case, we measure the goodness-of-fit by $\varepsilon $, so that the smaller $\varepsilon $, the better the quality of the fit. Therefore, if $\varepsilon $ increases from one time-window to the next, it can be understood that the fitting may require a greater deal of effort. If we assume that our model should give a good description of the data in a small enough time span, we could expect to improve the quality of the fit by making the time-window smaller, the way we proceed to the next time-window. On the other hand, if $\varepsilon $ decreases between two time-windows, recalling that consecutive windows, n and $(n+1)$, have an overlap of $s_{n+1} - \Delta s$ points, we can understand that the new $\Delta s$ points at the end of the $(n+1)$-th window are in good agreement, in terms of model parameters, with the behavior of the data in the n-th window. Therefore, increasing the window size can allow the simultaneous consideration of a larger range of the time-series that is related to the same set of parameters of the chosen model, decreasing the possibility of overfitting and improving generalization.

The lower bound $s_{\mathrm {min}}$ can be set considering the number of free parameters in the fitting problem, bearing in mind that fitting very few data points can lead to overfitting, so it is reasonable to have at least more data points than free parameters. For the upper bound $s_{\mathrm {max}}$, it is more complex to set a strict natural limit, but it is worth recalling the motivation regarding the approach to divide the data into time-windows: there is a limit on how long a fixed set of parameters can adequately fit the data, so we set an upper bound on the maximum expected range describable by a single constant set of parameters.

Section S2 of Supplementary Information text presents a practical comparison between considering fixed and adaptive window sizes, showing that the results are rather similar, but adaptive window size Algorithm 1 does not require one to choose a specific window size.

Figure 1 graphically summarizes the methodology described in this section for the inference of model parameters and generation of forecasts in each time-window of the considered data.

4 Application to epidemic forecasting

We implemented the time-window model with an ABC-SMC algorithm for curve fitting. This means that we divide the epidemic curve into multiple time-windows, which are considered separately by fitting a time-independent compartmental model. The epidemic model chosen is a SEIRD model including infection by pre-symptomatic individuals (for details see [32, 33]) described by ordinary differential equations system (3).

$\beta _I$ and $\beta _E$ stand for the infection rate of infected and exposed individuals, respectively, c represents the inverse of the incubation period, $\gamma $ and $\mu $ express the recovery and death rates, respectively. The model is solved using a 4th-order Runge–Kutta algorithm subjected to the constraint $N = S +E + I + R + D$, and with the initial conditions $S(0) = S_0 $, $E(0) = E_0 $, $ I(0) = I_0$, $R(0) = R_0 $ and $D(0) = D_0 $. All five parameters are set free for the fitting process, alongside the total population N, from which the initial condition for S is derived according to $S_0 = N - I_0 - E_0 - R_0 - D_0$.

$$\begin{aligned} \begin{aligned}&\frac{\mathrm {d}S}{\mathrm {d}t} = -\frac{\beta _I S I}{N} - \frac{\beta _E S E}{N} \\&\frac{\mathrm {d}E}{\mathrm {d}t} = \frac{\beta _I S I}{N} + \frac{\beta _E S E}{N} - \alpha E \\&\frac{\mathrm {d}I}{\mathrm {d}t} = \alpha E - (\gamma + \mu )I \\&\frac{\mathrm {d}R}{\mathrm {d}t} = \gamma I \\&\frac{\mathrm {d}D}{\mathrm {d}t} = \mu I \; . \end{aligned} \end{aligned}$$

(3)

For our analysis, we consider data on cumulative cases and deaths for different countries. Therefore, at the beginning of each time-window, we only have initial values for deaths $D_{0}$ and cumulative cases $C_{0}$. We need a way to estimate the initial values $E_{0}$, $I_{0}$ and $R_{0}$. For doing so, we define new parameters $c_{E}$ and $c_{R}$ to be fit together with the system of equations (3), such that $R_{0} = c_{R} (C_{0}-D_{0}) \Rightarrow I_{0} = (1-c_{R}) (C_{0}-D_{0})$, and $E_{0} = c_{E} (C_{0}-D_{0})$.

5 Results and discussion

Here, we present different comparisons between results from flat prior and past window posterior approaches, fitting the SEIRD model to epidemic data on cumulative cases and deaths of COVID-19 in Brazil. To run the ABC-SMC with adaptive time-window sizes, we set the minimum time-window length $s_{\mathrm {min}} = 10$ days and the maximum window length to $s_{\mathrm {max}} = 50$ days.

Before proceeding to the comparison between different approaches, we can already see, in Figure 2, the piece-wise 10-day predictions from fitting throughout the curve of cumulative cases of COVID-19 in Brazil. The curve is divided into four subplots for better visualization. Since windows are shifted by five days and predictions are computed for ten days, we only show forecasts of alternated windows, in order to avoid overlap in prediction curves. Although the first predictions tend to overestimate the growth due to a lack of information regarding the epidemic parameters, the remaining predictions describe the epidemic curve fairly well, capturing the general trend of cases over different scenarios. In supplementary figures S5 and S6 one can also see the fit, and the following forecast, for each separate time-window along the epidemic curve of Brazil for both approaches. In these windows, it is possible to see that the past window posterior approach leads to more consistent fittings, with smaller variation between different runs of the method.

Figure 3 shows the mean, with standard deviation, of the windows’ sizes, by window, over 10 executions of the ABC-SMC fitting, starting from a 30-day time-window. More important than the actual average window size is the fluctuation around it. The window size selection algorithm presents a better convergence to the optimal window size when combined with the past window posterior approach. This can also be seen as a hint to the convergence improvement of the ABC-SMC by the use of past window’s posteriors instead of flat priors.

We proceed by comparing values of $\varepsilon $ over each time-window considering the quality of both the fit and the prediction. Figures 4 and 5 show the fit and prediction NRMSDs, respectively, for each time-window of the data on Brazil. Using the past window posterior as an informative prior on the current value of the epidemiological parameters leads to an NRMSD approximately two orders of magnitude smaller. During the prediction procedure, the past window posterior approach also shows a smaller $\varepsilon $, this time different by one order of magnitude. Both approaches were fit with 1000 accepted samples in each posterior of the ABC-SMC algorithm, and the curves are the results of 10 runs of the fitting procedure.

The accumulation of information along the fitting of consecutive time-windows may be analyzed by considering the evolution of $\varepsilon $ through the epidemic data. During the first few time-windows, NRMSD curves in Figures 4 and 5 for the flat prior approach and the past window approach are similar to each other, which indicates that there is not enough information yet about the parameter’s values to be learned by the past window posterior approach. As we fit more time-windows, information is accumulated by the past window approach, leading to smaller NRMSD values.

Looking at the prediction error on each day of the prediction window, we get the heat map presented in Figure 6 comparing the error magnitude for each day of the prediction window in each of the time-windows of the curve. Here, the relative error is calculated as the difference between predicted daily cases and the actual data on it, divided by the data value for normalization. In both cases, the first days show larger errors. However, the past window posterior approach leads to smaller error by day for a longer period. Closer to the 35th time-window, the flat prior approach drastically increases its error through the prediction window (as shown by the purple color). This is further evidence that using the adaptive window method with the past window posterior approach is a more adequate method for generating forecasts for the next few days of the epidemic curve.

The same analysis presented so far is also done considering data from Germany, India, Japan, South Korea, USA and UK, and it can be found in section S5 of Supplementary Information. The results remained consistent for other countries, as one can see in Figure 7, even though the epidemic curves from these countries are quite different from one another, which indicates the robustness of the method.

Figure 7 shows the box-plots of the distribution of the ratio $\varepsilon _{\mathrm {flat}}/\varepsilon _{\mathrm {past}}$ between fit NRMSDs in each window, obtained via flat prior and past posterior approaches. We considered five different initial window sizes for each country studied and ten different executions of the inference algorithm. In all countries, over 96% of the ratio distribution is above 1, indicating that $\varepsilon _{\mathrm {flat}} > \varepsilon _{\mathrm {past}}$. Therefore, in more than 96% of the time, the past posterior approach leads to a better model fitting to the data.

We can conclude that separating complex dynamical data in time-windows can allow for its tractability through simple models, and we present a way to do this via approximate Bayesian computation. It is clear that considering data in the past, when choosing the prior distribution for a time-window, leads to better results. In this work, we consider on cases and deaths of COVID-19, but, in principle, this same methodology could be applied in many different scenarios involving dynamical quantities.

Data availability

All the code used to generate the presented results—together with some examples of the code output for ease of analysis by the interested reader—is available at the GitHub repository https://github.com/gustavolibotte/LNCC-COVID-19-prediction/tree/TMLearningABC.

References

Sonabend, R., Whittles, L.K., Imai, N., Perez-Guzman, P.N., Knock, E.S., Rawson, T., Gaythorpe, K.A., Djaafara, B.A., Hinsley, W., FitzJohn, R.G., et al.: Non-pharmaceutical interventions, vaccination, and the sars-cov-2 delta variant in england: a mathematical modelling study. The Lancet (2021). https://doi.org/10.1016/S0140-6736(21)02276-5
Adiga, A., Dubhashi, D., Lewis, B., Marathe, M., Venkatramanan, S., Vullikanti, A.: Mathematical models for COVID-19 pandemic: a comparative analysis. J. Indian Inst. Sci. 100(4), 793–807 (2020). https://doi.org/10.1007/s41745-020-00200-6
Article Google Scholar
Bertozzi, A.L., Franco, E., Mohler, G., Short, M.B., Sledge, D.: The challenges of modeling and forecasting the spread of COVID-19. Proceedings of the National Academy of Sciences 117(29), 16732–16738 (2020) arXiv:2004.04741. https://doi.org/10.1073/pnas.2006520117
Vespignani, A., Tian, H., Dye, C., Lloyd-Smith, J.O., Eggo, R.M., Shrestha, M., Scarpino, S.V., Gutierrez, B., Kraemer, M.U.G., Wu, J., Leung, K., Leung, G.M.: Modelling COVID-19. Nat. Rev. Phys. 2(6), 279–281 (2020). https://doi.org/10.1038/s42254-020-0178-4
Article Google Scholar
Eker, S.: Validity and usefulness of COVID-19 models. Humanit. Soc. Sci. Commun. 7(1), 54 (2020). https://doi.org/10.1057/s41599-020-00553-4
Article Google Scholar
Roser, M., Ritchie, H., Ortiz-Ospina, E., Hasell, J.: Coronavirus pandemic (COVID-19). Our World in Data (2020). Retrieved May 18, 2021 from https://ourworldindata.org/coronavirus
Massonis, G., Banga, J.R., Villaverde, A.F.: Structural identifiability and observability of compartmental models of the COVID-19 pandemic. Annu. Rev. Control (2020). https://doi.org/10.1016/j.arcontrol.2020.12.001
Article Google Scholar
Moein, S., Nickaeen, N., Roointan, A., Borhani, N., Heidary, Z., Javanmard, S.H., Ghaisari, J., Gheisari, Y.: Inefficiency of SIR models in forecasting COVID-19 epidemic: a case study of Isfahan. Sci. Rep. 11(1), 4725 (2021). https://doi.org/10.1038/s41598-021-84055-6
Article Google Scholar
Brauer, F.: Compartmental models in epidemiology, pp. 19–79. Springer, Berlin, Heidelberg (2008). https://doi.org/10.1007/978-3-540-78911-6_2
Singh, P., Gupta, A.: Generalized SIR (GSIR) epidemic model: an improved framework for the predictive monitoring of COVID-19 pandemic. ISA Trans. (2021). https://doi.org/10.1016/j.isatra.2021.02.016
Article Google Scholar
Kaxiras, E., Neofotistos, G.: Multiple epidemic wave model of the COVID-19 pandemic: modeling study. J. Med. Internet Res. (2020). https://doi.org/10.2196/20912
Article Google Scholar
Kaxiras, E., Neofotistos, G., Angelaki, E.: The first 100 days: modeling the evolution of the COVID-19 pandemic. Chaos Solitons Fractals 138, 110114 (2020). https://doi.org/10.1016/j.chaos.2020.110114
Article MathSciNet Google Scholar
Cacciapaglia, G., Cot, C., Sannino, F.: Multiwave pandemic dynamics explained: how to tame the next wave of infectious diseases. Sci. Rep. 11(1), 6638 (2021) arXiv:2011.12846. https://doi.org/10.1038/s41598-021-85875-2
Cacciapaglia, G., Sannino, F.: Interplay of social distancing and border restrictions for pandemics via the epidemic renormalisation group framework. Sci. Rep. 10(1), 15828 (2020). https://doi.org/10.1038/s41598-020-72175-4
Article Google Scholar
Moein, S., Nickaeen, N., Roointan, A., Borhani, N., Heidary, Z., Javanmard, S.H., Ghaisari, J., Gheisari, Y.: Inefficiency of sir models in forecasting covid-19 epidemic: a case study of isfahan. Sci. Rep. 11(1), 1–9 (2021). https://doi.org/10.1038/s41598-021-84055-6
Article Google Scholar
Batistela, C.M., Correa, D.P., Bueno, Á.M., Piqueira, J.R.C.: Sirsi compartmental model for covid-19 pandemic with immunity loss. Chaos Solitons Fractals 142, 110388 (2021). https://doi.org/10.1016/j.chaos.2020.110388
Article MathSciNet Google Scholar
Ramezani, S.B., Amirlatifi, A., Rahimi, S.: A novel compartmental model to capture the nonlinear trend of covid-19. Comput. Biol. Med. 134, 104421 (2021). https://doi.org/10.1016/j.compbiomed.2021.104421
Article Google Scholar
Asamoah, J.K.K., Jin, Z., Sun, G.-Q., Seidu, B., Yankson, E., Abidemi, A., Oduro, F., Moore, S.E., Okyere, E.: Sensitivity assessment and optimal economic evaluation of a new covid-19 compartmental epidemic model with control interventions. Chaos Solitons Fractals 146, 110885 (2021). https://doi.org/10.1016/j.chaos.2021.110885
Massonis, G., Banga, J.R., Villaverde, A.F.: Structural identifiability and observability of compartmental models of the covid-19 pandemic. Annu. Rev. Control (2020). https://doi.org/10.1016/j.arcontrol.2020.12.001
Article Google Scholar
He, S., Peng, Y., Sun, K.: Seir modeling of the covid-19 and its dynamics. Nonlinear Dyn. 101(3), 1667–1680 (2020). https://doi.org/10.1007/s11071-020-05743-y
Article Google Scholar
Calafiore, G.C., Novara, C., Possieri, C.: A time-varying sird model for the covid-19 contagion in italy. Annu. Rev. Control (2020). https://doi.org/10.1007/s11071-020-05743-y
Article MathSciNet Google Scholar
Dehning, J., Zierenberg, J., Spitzner, F.P., Wibral, M., Neto, J.P., Wilczek, M., Priesemann, V.: Inferring change points in the spread of covid-19 reveals the effectiveness of interventions. Science 369(6500), 9789 (2020). https://doi.org/10.1126/science.abb9789
Article Google Scholar
Sturniolo, S., Waites, W., Colbourn, T., Manheim, D., Panovska-Griffiths, J.: Testing, tracing and isolation in compartmental models. PLoS Comput. Biol. 17(3), 1008633 (2021). https://doi.org/10.1371/journal.pcbi.1008633
Article Google Scholar
Davies, N.G., Abbott, S., Barnard, R.C., Jarvis, C.I., Kucharski, A.J., Munday, J.D., Pearson, C.A., Russell, T.W., Tully, D.C., Washburne, A.D., et al.: Estimated transmissibility and impact of sars-cov-2 lineage b. 1.1. 7 in england. Science 372(6538) (2021). https://doi.org/10.1126/science.abg3055
Naveca, F.G., Nascimento, V., de Souza, V.C., de Lima Corado, A., Nascimento, F., Silva, G., Costa, Á., Duarte, D., Pessoa, K., Mejía, M., et al.: Covid-19 in amazonas, brazil, was driven by the persistence of endemic lineages and p. 1 emergence. Nat. Med. (2021). https://doi.org/10.1038/s41591-021-01378-7
Article Google Scholar
Lee, L.Y.W., Rozmanowski, S., Pang, M., Charlett, A., Anderson, C., Hughes, G.J., Barnard, M., Peto, L., Vipond, R., Sienkiewicz, A., Hopkins, S., Bell, J., Crook, D.W., Gent, N., Walker, A.S., Peto, T.E.A., Eyre, D.W.: Severe acute respiratory syndrome coronavirus 2 (sars-cov-2) infectivity by viral load, s gene variants and demographic factors, and the utility of lateral flow devices to prevent transmission. Clin. Infect. Dis. (2021). https://doi.org/10.1093/cid/ciab421
Article Google Scholar
Rotejanaprasert, C., Ekapirat, N., Areechokchai, D., Maude, R.J.: Bayesian spatiotemporal modeling with sliding windows to correct reporting delays for real-time dengue surveillance in thailand. Int. J. Health Geogr. 19(1), 1–13 (2020). https://doi.org/10.1186/s12942-020-00199-0
Article Google Scholar
McGough, S.F., Johansson, M.A., Lipsitch, M., Menzies, N.A.: Nowcasting by bayesian smoothing: a flexible, generalizable model for real-time epidemic tracking. PLoS Comput. Biol. 16(4), 1007735 (2020). https://doi.org/10.1371/journal.pcbi.1007735
Bastos, L.S., Economou, T., Gomes, M.F., Villela, D.A., Coelho, F.C., Cruz, O.G., Stoner, O., Bailey, T., Codeço, C.T.: A modelling approach for correcting reporting delays in disease surveillance data. Stat. Med. 38(22), 4363–4377 (2019). https://doi.org/10.1002/sim.8303
Article MathSciNet Google Scholar
Liao, Z., Lan, P., Liao, Z., Zhang, Y., Liu, S.: TW-SIR: time-window based SIR for COVID-19 forecasts. Sci. Rep. 10(1), 22454 (2020). https://doi.org/10.1038/s41598-020-80007-8
Article Google Scholar
Minter, A., Retkute, R.: Approximate bayesian computation for infectious disease modelling. Epidemics 29, 100368 (2019). https://doi.org/10.1016/j.epidem.2019.100368
Article Google Scholar
Loli Piccolomini, E., Zama, F.: Monitoring italian covid-19 spread by a forced seird model. PloS One 15(8), 0237417 (2020). https://doi.org/10.1371/journal.pone.0237417
Article Google Scholar
Rapolu, T., Nutakki, B., Rani, T.S., Bhavani, S.D.: A time-dependent seird model for forecasting the covid-19 transmission dynamics. medRxiv (2020). https://doi.org/10.1101/2020.05.29.20113571

Download references

Acknowledgements

The authors acknowledge the National Laboratory for Scientific Computing (LNCC/MCTI, Brazil) for providing HPC resources of the SDumont supercomputer, which have contributed to the research results reported within this paper. URL: http://sdumont.lncc.br

Funding

J. P. Valeriano acknowledges funding by FAPESP, process 2020/14169-0. I.Reis acknowledges funding by FAPESP, process 2021/02027-0. P. H. P. Cintra acknowledges funding by CAPES, process 88887.625345/2021-00. Gustavo Libotte is supported by a postdoctoral fellowship from the Carlos Chagas Filho Foundation for Supporting Research in the State of Rio de Janeiro (FAPERJ), grant number E-26/200.347/2021.

Author information

Gustavo Libotte
Present address: Department of Computational Modeling, Polytechnic Institute, Rio de Janeiro State University, Nova Friburgo, Brazil

Authors and Affiliations

Instituto de Física Teórica, Universidade Estadual Paulista, R. Dr. Bento Teobaldo Ferraz, 271, Bloco 2, Barra Funda, São Paulo, SP, 01140-070, Brazil
João Pedro Valeriano
Instituto de Física Gleb Wataghin, Universidade Estadual de Campinas, Rua Sérgio Buarque de Holanda, 777, Campinas, SP, 13083-859, Brazil
Pedro Henrique Cintra
Laboratório Nacional de Computção Científica, Av. Getulio Vargas, 333, Petrópolis, RJ, 25651-076, Brazil
Gustavo Libotte, Renato Silva & Sandra Malta
Instituto de Física de São Carlos, Universidade de São Paulo, Av. Trab. São Carlense, 400 - Parque Arnold Schimidt, São Carlos, SP, 13566-590, Brazil
Igor Reis
Department of Physics, University of Alberta, 116 St & 85 Ave, Edmonton, AB, T6G 2E1, Canada
Felipe Fontinele

Authors

João Pedro Valeriano
View author publications
You can also search for this author in PubMed Google Scholar
Pedro Henrique Cintra
View author publications
You can also search for this author in PubMed Google Scholar
Gustavo Libotte
View author publications
You can also search for this author in PubMed Google Scholar
Igor Reis
View author publications
You can also search for this author in PubMed Google Scholar
Felipe Fontinele
View author publications
You can also search for this author in PubMed Google Scholar
Renato Silva
View author publications
You can also search for this author in PubMed Google Scholar
Sandra Malta
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to João Pedro Valeriano.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file 1 (pdf 34151 KB)

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Valeriano, J.P., Cintra, P.H., Libotte, G. et al. Sequential time-window learning with approximate Bayesian computation: an application to epidemic forecasting. Nonlinear Dyn 111, 549–558 (2023). https://doi.org/10.1007/s11071-022-07865-x

Download citation

Received: 21 March 2022
Accepted: 02 September 2022
Published: 25 September 2022
Issue Date: January 2023
DOI: https://doi.org/10.1007/s11071-022-07865-x

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Sequential time-window learning with approximate Bayesian computation: an application to epidemic forecasting

Abstract

Similar content being viewed by others

Forecasting Multi-Wave Epidemics Through Bayesian Inference

Assessing parameter identifiability in compartmental dynamic models using a computational approach: application to infectious disease transmission models

GrowthPredict: A toolbox and tutorial-based primer for fitting and forecasting growth trajectories using phenomenological growth models

1 Introduction

2 Motivation and objectives

3 Methodology