The World Health Organization declared the coronavirus disease 2019 a pandemic on March 11, pointing to the over 118,000 cases in over 110 countries and territories around the world at that time [18]. The COVID-19 pandemic in Brazil began on February 26, 2020, when a man from Sao Paulo who returned from Italy tested positive for the virus. The first case of COVID-19 in Rio de Janeiro was confirmed on March 5, 2020. Nowadays, Brazil is considered the epidemic center of Latin America, occupying the second place in the total number of cases and, more recently, in the total number of deaths. Currently (July 9, 2020), Sao Paulo and Rio de Janeiro are the states with the highest number of deaths by the new coronavirus in Brazil, according to data from the Brazilian Health Ministry.
In the past few months, a considerable number of studies related to the evolution of COVID-19 in the world have been submitted and published. In the following, we describe some of these works.
Ahmadi et al. [1] developed mathematical models to predict the number of COVID-19 cases in Iran from April 3, 2020, to May 13, 2020. The unknown parameters in these models were estimated by running the fminsearch, a MATLAB function, which is a least-squares algorithm. Torrealba et al. [22] analyzed the modeling and prediction of COVID-19 in Mexico, from an initial approximation, and using the Gauss–Newton algorithm, the authors estimated parameters in the Gompertz and Logistic models. Articles [12, 15, 16] estimated the number of total COVID-19 cases and deaths in world using the Gompertz model. In [11], they analyzed the number of deaths by COVID-19 with social distancing in Brazil, Sao Paulo. Using the SEIR model, the authors recommend temporary lockdowns, however, great the economic and social costs. In [2], the authors studied the impact on the evolution of cases in Rio de Janeiro, Brazil, considering a Susceptible-Infectious-Quarantined-Recovered (SIQR) model with containment.
In this work, we model the near-future trajectory of the cumulative number of infections and deaths by COVID-19 for Brazil and two Brazilian states, given the cumulative number of infected cases and deaths from February 26, 2020 to July 2, 2020. Data were obtained from the Brazilian Ministry of Health until July 2 (https://covid.saude.gov.br), and we considered the cumulative reports from the date on which the first case was notified in Brazil and in each analyzed state. We also estimate the basic reproduction number \((R_0)\), which represents the average number of secondary infections generated by each infected person. To estimate the unknown parameters, we use the Gompertz model and propose a gradient type iterative method (Minimal error method).
The Gompertz model is one of the particular cases of the Richards model, as well as Brody, negative exponential, logistic and von Bertalanffy models, see [20, 21].
The growth functions can be grouped in three main categories: those without inflection point (Brody and negative models), those with sigmoidal shape and a fixed inflection point (Gompertz, logistic, and von Bertalanffy models), and those with a flexible inflection point (Richards model). The logistic, Gompertz, and von Bertalanffy models exhibit inflection points at about 50, 37, and 30% of the upper asymptote, respectively, which means that the Gompertz and von Bertalanffy processes are asymmetric, whereas the logistic is a symmetric process, see [19].
The Gompertz and the logistic models are the most frequently used sigmoid functions, and the literature on these models is extensive [3, 21]. In general, the cumulative number of deaths and cases by COVID-19 presents an asymmetrical sigmoidal growth curve. Therefore, using an inappropriate growth curve can have a substantial impact on forecasting [3].
The Gompertz model [5] was proposed by Benjamin Gompertz in 1825. Since then, this exponential model has been used to describe growth in plants, animals, bacteria, and cancer cells [22]. The Gompertz differential equation has the following form,
$$\begin{aligned} \displaystyle N_t=r N\ln \Big (\frac{K }{N}\Big ), \;\text { with }\;\;\;N(t_0)=N^{1}. \end{aligned}$$
(1)
where t represents time; N(t) is the cumulative population size at time t; r the intrinsic growth rate of model; K is the maximum value of model (N) when t goes to infinity; \(t_0\) represents the initial time; and \(N^1\) is the initial population or condition.
The analytic solution from (1) is
$$\begin{aligned} N(t)=Ke^{\ln (N^1/K)\times e^{-r(t-t_0)}}, \end{aligned}$$
(2)
where \(\lim \limits _{t\rightarrow \infty } N(t)=K\).
The turning point is the time at which the rate of accumulation changes from increasing to decreasing or vice versa and can be easily located by finding the inflection point of the epidemic curve, that is, the moment at which the trajectory begins to decline. Clearly, this quantity is of epidemiological importance, indicating either the beginning (i.e., the moment of acceleration after deceleration) or end (i.e., the moment of deceleration after acceleration) of a phase [6]. From Eq. (2), it is trivial to show that the inflection point is given by
$$\begin{aligned} t_{i.p}=t_0+\frac{\ln (\ln (K/N^1))}{r}. \end{aligned}$$
(3)
At the inflection point, the number of infected or killed is given by \(N(t_{i.p})=K/e\).
In this paper, we consider the discrete function, from Eq. (2) and for \(t_0=1\),
$$\begin{aligned} N^i= \displaystyle Ke^{\ln (N^1/K)\times e^{-r(i-1)}},\quad i =1,2, \ldots ,m; \end{aligned}$$
(4)
where i represents time in days and \(N^i\) is the cumulative number of infected cases (or deaths) at day i.
Table 1 Estimates of parameters unknown In this paper, we consider that equality (3) is the inflection point of Eq. (4).
We denote \({{\varvec{M}}}=\left( N^1,N^2,\ldots ,N^m\right) \in {\mathbb {R}}^m\). The inverse problem is to estimate \({{\varvec{x}}}=(K,r)\), from (4), given \({{\varvec{M}}}\).