1 Introduction

The outbreak of the coronavirus disease (COVID-19) in 2019 occurred in Wuhan, China, at the end of 2019. This is a severe respiratory syndrome caused by the novel coronavirus of zoonotic origin [1]. The Chinese government has implemented many measures, including the establishment of specialized hospitals and restrictions on travel, to reduce the spread. By April 20, 2020, the outbreak in China has been basically controlled. However, the outbreak is still rampant all over the world. At present, the USA, Italy, Spain and other countries are still in the rising stage of the outbreak. It has posed a great threat to the public health and safety of the world.

At present, many countries have adopted mitigation measures to restrict travel and public gatherings, which have a serious impact on the economy. Therefore, it is very important to predict the development trend of this epidemic and estimate the peak of the isolated cases. The epidemic model is a basic tool to research the dynamic behaviors of disease and predict the spreading trend of disease. Establishing a reasonable epidemic model can effectively characterize the development process of the disease. Ricardo [2] researched a fractional SEIR model with treatment and given some sufficient conditions to ensure the local stability of equilibrium points. Yang and Xu [3] considered a fractional-order SEIR model with general incidence rate and established the local and uniform asymptotic stability of equilibria. However, these studies remain at the theoretical stage and do not use real data to analyze the trend of specific epidemics. In the outbreak of COVID-19, three types of data are usually available: isolated cases, recovered cases and death cases. The traditional fractional SEIR model cannot describe the trend of isolated individuals and dead individuals caused by diseases. Hence, it difficult for the traditional SEIR model to use the information contained in isolated cases and death cases. Therefore, a generalized SEIR epidemic model is proposed in this paper and is defined as SEIQRP model. Some qualitative properties of this model are first analyzed, including the existence and uniqueness of the disease-free and the endemic equilibrium points. Then, conditions are also established to ensure the local asymptotic stability of both disease-free and endemic equilibrium points.

So far, many scholars have researched COVID-19 from different perspectives [1, 4,5,6,7,8]. In [9], the epidemics trend of COVID-19 in China was predicted under public health interventions. In [10], the basic reproduction number of COVID-19 in China was estimated and the data-driven analysis was performed in the early phase of the outbreak. Peng [11] proposed a generalized SEIR model to analyze the spread of COVID-19 in China. The model can describe the trends of isolated individuals, recovered individuals and dead individuals, and make full use of the information contained in the real data. However, the model does not take into account the infectivity of patients in the incubation period. As we all know, one of the important reasons that COVID-19 can cause a global outbreak is that the epidemic has a long incubation period. Hence, considering the infectivity of the patients in the incubation period is in line with the actual situation. In order to predict the cumulative number of confirmed cases and combine the actual measures taken by the government on the outbreak of COVID-19, we further put forward an improved SEIQRP model, denoted by SEIQRPD model. At present, we can obtain the epidemic data of COVID-19 outbreak in the USA before April 20, 2020. We use these data and the improved model to predict the epidemic trend of the USA in the next two weeks, and estimate the peak of isolated cases. Firstly, the data before April 5 were selected to identify the model parameters, and the prediction ability of the improved model with the epidemiological data from April 6 to April 20 is verified. Then, the cumulative number of confirmed cases and isolated cases are predicted in the next two weeks. The peak of isolated cases is thus predicted.

In recent years, with the continuous development of fractional calculus theory, fractional-order system modeling approaches have been applied in various engineering and non-engineering fields [12,13,14]. Compared with the short memory of the integer derivative, the fractional derivative has the information of the whole time interval or long memory. It is more accurate to describe the biological behavior of population by using fractional differential equation.

The rest of this manuscript is structured as below. In Sect. 2.1, the fractional SEIQRP model and the modified SEIQRP model is proposed. Some qualitative properties of the SEIQRP model are discussed in Sect. 2.2. In Sect. 2.3, the prediction ability of the modified SEIQRP model is verified by using real data. The disease development in the USA in the next two weeks after April 21, 2020, is predicted. Finally, conclusion and some future works are discussed in Sect. 3.

2 Preliminaries and model derivation

2.1 Preliminaries

In this section, some useful lemmas and definitions will be given to analyze some results of this paper.

Definition 1

[15] The Caputo fractional-order derivative is given below

$$\begin{aligned}_{0}^{C}D^{\alpha }_{t}f(t)=\left\{ \begin{array}{rcl} &{}\frac{1}{\varGamma (n-\alpha )}\int _{0}^{t}\frac{f^{(n)}(\xi )}{(t-\xi )^{1+\alpha -n}}\mathrm{d}\xi , &{}n-1<\alpha <n,\\ &{}\frac{\mathrm{d}^{n}}{\mathrm{d}t^{n}}f(t), &{}\alpha =n, \end{array} \right. \end{aligned}$$

where n is the smallest positive integer not less than \(\alpha \).

Definition 2

[15, 16] The Mittag–Leffler function is given below

$$\begin{aligned}E_{\alpha }(z)=\sum _{k=0}^{\infty }\frac{z^{k}}{\varGamma (k\alpha +1)},\end{aligned}$$

where \(n-1\le \alpha \le n\).

Consider the following n-dimensional fractional-order differential equation system

$$\begin{aligned} _{0}^{C}D^{\alpha }_{t}X(t)=AX(t),\;\;x(0)=x_{0}, \end{aligned}$$
(1)

where \(\alpha \in (0,1)\), \(X(t)=(x_{1}(t),x_{2}(t),\ldots ,x_{n}(t))^{T}\) is an n-dimensional state vector and A is an \(n\times n\) constant matrix.

Lemma 1

[17] For the corresponding linear time-invariant system (1), the following results are true:

(i) The zero solution is asymptotically stable, if and only if all eigenvalues \(s_{j}\) \((j=1,2,\ldots ,n)\) of A satisfy \(|\arg (s_{j})|>\frac{\alpha \pi }{2}\).

(ii) The zero solution is stable, if and only if all eigenvalues \(s_{j}\) of A satisfy \(|\arg (s_{j})|\ge \frac{\alpha \pi }{2}\) and eigenvalues with \(|\arg (s_{j})|=\frac{\alpha \pi }{2}\) have the same algebraic multiplicity and geometric multiplicity.

2.2 SEIQRP model

The outbreak of COVID-19 has had a great impact on the economic growth of any country and daily life of any human. In order to control and prevent the possible outbreak of infectious diseases like the COVID-19 or other insect diseases in the future, it is very important to establish an appropriate model. The transmission diagram of the generalized SEIR model proposed in this paper is shown in Fig. 1. We divide the total population into six distinct epidemic classes: susceptible, exposed, infectious, quarantined, recovered and insusceptible. We will represent the number of individuals at time t in the above classes by S(t), E(t), I(t), Q(t), R(t) and P(t), respectively. The specific explanations of the above six categories are as follows:

Fig. 1
figure 1

Transmission diagram for the model involving six population classes

  • Susceptible S(t): the number of uninfected individuals at the time t.

  • Exposed E(t): the number of infected individuals at the time t, but still in incubation period (without clinical symptoms and low infectivity).

  • Infectious I(t): the number of infected individuals at the time t (with obvious clinical symptoms).

  • Quarantined Q(t): the number of individuals who have been diagnosed and isolated at the time t.

  • Recovered R(t): the number of recovered individuals at the time t.

  • Insusceptible P(t): the number of susceptible individuals who are not exposed to the external environment at the time t.

The incidence rate plays a very important role in the epidemic, which can describe the evolution of infectious disease. According to the spread of different diseases in different regions, there are many forms of incidence rate [18,19,20]. We expect to establish a generalized incidence rate, which can contain most of these specific forms. Further, in order to match the dimensions of fractional-order model with respect to time t, Diethelm[21] pointed out that both the left-hand and the right-hand sides of fractional-order epidemic model must have the same dimensions. Inspired by the work of[3, 21, 22], the fractional-order SEIQRP model with general incidence rate is given by

$$\begin{aligned} \left\{ \begin{aligned} _{0}^{C}D^{\alpha }_{t}S(t)&=\varLambda -\beta _{1}^{\alpha }f_{1}(S(t))g_{1}(I(t))\\ {}&\quad -\beta _{2}^{\alpha }f_{2}(S(t))g_{2}(E(t))-\mu ^{\alpha } S(t)-\rho ^{\alpha } S(t),\\ _{0}^{C}D^{\alpha }_{t}E(t)&=\beta _{1}^{\alpha }f_{1}(S(t))g_{1}(I(t))+\beta _{2}^{\alpha }f_{2}(S(t))g_{2}(E(t))\\ {}&\quad -\varepsilon ^{\alpha } E(t)-\mu ^{\alpha } E(t),\\ _{0}^{C}D^{\alpha }_{t}I(t)&=\varepsilon ^{\alpha } E(t)-(\delta ^{\alpha }+\mu ^{\alpha })I(t),\\ _{0}^{C}D^{\alpha }_{t}Q(t)&=\delta ^{\alpha } I(t)-(\lambda ^{\alpha }+\kappa ^{\alpha }+\mu ^{\alpha })Q(t),\\ _{0}^{C}D^{\alpha }_{t}R(t)&=\lambda ^{\alpha } Q(t)-\mu ^{\alpha } R(t),\\ _{0}^{C}D^{\alpha }_{t}P(t)&=\rho ^{\alpha } S(t)-\mu ^{\alpha } P(t), \end{aligned} \right. \end{aligned}$$
(2)

where \(\alpha \in (0,1)\), \(\varLambda \) is the inflow number of susceptible individuals, \(\beta _{1}\) and \(\beta _{2}\) denote the infection rates of the infected individuals and the exposed individuals, respectively, \(\rho \) is the protection rate and \(\varepsilon \) represents the incubation rate. \(\delta \) is the rate at which symptomatic infections are diagnosed and quarantined. \(\lambda \) and \(\kappa \) represent the cure rate of isolated individuals and the death rate caused by the disease, respectively, \(\mu \) is the natural mortality, and the incidence rates \(\beta _{1}f_{1}(S)g_{1}(I)\) and \(\beta _{2}f_{2}(S)g_{2}(E)\) are used to describe the transmission of diseases, which satisfy the following conditions [23]:

(i) \(f_{1}(0)=f_{2}(0)=g_{1}(0)=g_{2}(0)=0\),

(ii) \(f_{1}(S)>0\), \(f_{2}(S)>0\), \(g_{1}(I)>0\), \(g_{2}(E)>0\), for any S, E, \(I>0\),

(iii) \(f'_{1}(S)>0\), \(f'_{2}(S)>0\), \(g'_{1}(I)>0\), \(g'_{2}(E)>0\), for any S, E, \(I>0\),

(iv) \(g'_{1}(I)-\frac{g_{1}(I)}{I}\le 0\), \(g'_{2}(E)-\frac{g_{2}(E)}{E}\le 0\), for any E, \(I>0\).

2.3 Modified SEIQRP model

In the following, we will combine the actual situation and government mitigation policy to improve the SEIQRP model (2). Firstly, during the outbreak of COVID-19, we need to make reasonable assumptions according to the actual mitigation policies and circumstances.

  • During the COVID-19 outbreak, population mobility is strictly controlled by many countries. In particular, the policy of city closure was implemented in Hubei Province, China. Therefore, the impact of migration will not be considered in the improved model.

  • For the prediction of this short-term virus transmission, the impact of natural mortality will not be considered. It is assumed that novel coronavirus is the only cause of death during the outbreak.

  • In order to predict the trend of cumulative confirmed cases, it is necessary to simulate the number of death cases. Therefore, a new class D(t) to the SEIQRP model will be added in the modified model, which denotes the number of death cases at time t.

  • The time-varying cure rate, mortality rate and infection rate will be applied to the improved model. This can better simulate the impact of the improvement of medical conditions and government control on individuals in reality.

Based on the above assumptions and analysis, the following improved model for COVID-19 is proposed.

$$\begin{aligned} \left\{ \begin{aligned} _{0}^{C}D^{\alpha }_{t}S(t)&=-\beta _{1}^{\alpha }(t)f_{1}(S(t))g_{1}(I(t))\\ {}&\quad -\beta _{2}^{\alpha }f_{2}(S(t))g_{2}(E(t))-\rho ^{\alpha } S(t),\\ _{0}^{C}D^{\alpha }_{t}E(t)&=\beta _{1}^{\alpha }(t)f_{1}(S(t))g_{1}(I(t))\\ {}&\quad +\beta _{2}^{\alpha }f_{2}(S(t))g_{2}(E(t))-\varepsilon ^{\alpha } E(t),\\ _{0}^{C}D^{\alpha }_{t}I(t)&=\varepsilon ^{\alpha } E(t)-\delta ^{\alpha } I(t),\\ _{0}^{C}D^{\alpha }_{t}Q(t)&=\delta ^{\alpha } I(t)-(\lambda ^{\alpha }(t)+\kappa ^{\alpha }(t))Q(t),\\ _{0}^{C}D^{\alpha }_{t}R(t)&=\lambda ^{\alpha }(t) Q(t),\\ _{0}^{C}D^{\alpha }_{t}P(t)&=\rho ^{\alpha } S(t),\\ _{0}^{C}D^{\alpha }_{t}D(t)&=\kappa ^{\alpha }(t) Q(t), \end{aligned} \right. \end{aligned}$$
(3)

where \(\beta _{1}(t)=\sigma _{1}\exp {(-\sigma _{2} t)}\), \(\lambda (t)=\lambda _{1}(1-\exp {(\lambda _{2}t)})\) and \(\kappa (t)=\kappa _{1}\exp {(-\kappa _{2}t)}\). The parameters \(\sigma _{1}\), \(\sigma _{2}\), \(\lambda _{1}\), \(\lambda _{2}\), \(\kappa _{1}\) and \(\kappa _{2}\) are all positive, where \(\sigma \) is used to simulate the intensity of government control. It should be emphasized that the protection rate \(\rho \) for susceptible individuals also reflects the intensity of government control.

3 Qualitative analysis of the SEIQRP model

3.1 The existence and uniqueness of equilibrium point

Obviously, the right-hand side of system (2) satisfies the local Lipschitz condition; then, there exists a unique solution of system (2) [15, 24].

Theorem 1

The solutions of system (2) are nonnegative, and the closed set \(\Omega =\{(S,E,I,Q,R,P)\in \mathbb {R}_{+}^{6}: 0\le S+E+I+Q+R+D\le \frac{\varLambda }{\mu ^{\alpha }}\}\) is a positive invariant set of system (2).

Proof

In order to investigate the non-negativity of solutions of system (2), we consider the following system

$$\begin{aligned} \left\{ \begin{aligned} _{0}^{C}D^{\alpha }_{t}S_{1}(t)&=-\beta _{1}^{\alpha }f_{1}(S(t))g_{1}(I(t))-\beta _{2}^{\alpha }f_{2}(S(t))g_{2}(E(t))\\ {}&-\mu ^{\alpha } S(t)-\rho ^{\alpha } S(t),\\ _{0}^{C}D^{\alpha }_{t}E_{1}(t)&=\beta _{1}^{\alpha }f_{1}(S(t))g_{1}(I(t))+\beta _{2}^{\alpha }f_{2}(S(t))g_{2}(E(t))\\ {}&-\varepsilon ^{\alpha } E(t)-\mu ^{\alpha } E(t),\\ _{0}^{C}D^{\alpha }_{t}I_{1}(t)&=\varepsilon ^{\alpha } E(t)-(\delta ^{\alpha }+\mu ^{\alpha })I(t),\\ _{0}^{C}D^{\alpha }_{t}Q_{1}(t)&=\delta ^{\alpha } I(t)-(\lambda ^{\alpha }+\kappa ^{\alpha }+\mu ^{\alpha })Q(t),\\ _{0}^{C}D^{\alpha }_{t}R_{1}(t)&=\lambda ^{\alpha } Q(t)-\mu ^{\alpha } R(t),\\ _{0}^{C}D^{\alpha }_{t}P_{1}(t)&=\rho ^{\alpha } S(t)-\mu ^{\alpha } P(t), \end{aligned} \right. \end{aligned}$$
(4)

where the initial conditions are \(S_{10}=0\), \(E_{10}=0\), \(I_{10}=0\), \(Q_{10}=0\), \(R_{10}=0\) and \(P_{10}=0\). (0, 0, 0, 0, 0, 0) is the unique solution of the above system. According to the fractional-order comparison theorem [25], one can deduce that the solutions of system (2) satisfy \(S(t)\ge 0\), \(E(t)\ge 0\), \(I(t)\ge 0\), \(Q(t)\ge 0\), \(R(t)\ge 0\) and \(P(t)\ge 0\). By adding six equations of system (2), one can deduce

$$\begin{aligned} \begin{aligned} _{0}^{C}D^{\alpha }_{t}N(t)&=\varLambda -\mu ^{\alpha } N(t)-\kappa ^{\alpha } Q(t)\\ {}&\le \varLambda -\mu ^{\alpha } N(t). \end{aligned} \end{aligned}$$

By applying the fractional-order comparison theorem, one has

$$\begin{aligned} N(t)\le (-\frac{\varLambda }{\mu ^{\alpha }}+N(0))E_{\alpha }(-\mu ^{\alpha } t^{\alpha })+\frac{\varLambda }{\mu ^{\alpha }}. \end{aligned}$$

When \(N(0)\le \frac{\varLambda }{\mu ^{\alpha }}\), since \(E_{\alpha }(-\mu ^{\alpha } t^{\alpha })\ge 0\), we have

$$\begin{aligned} S(t)+E(t)+I(t)+Q(t)+R(t)+P(t)\le \frac{\varLambda }{\mu ^{\alpha }}. \end{aligned}$$

Thus, we can draw the result of Theorem 1.

System (2) has an obvious disease-free equilibrium point \(M_{0}=(S_{0},0,0,0,0,P_{0})\), where

$$\begin{aligned} S_{0}=\frac{\varLambda }{\mu ^{\alpha }+\rho ^{\alpha }},\;\;P_{0}=\frac{\rho ^{\alpha }\varLambda }{\mu ^{\alpha }(\mu ^{\alpha }+\rho ^{\alpha })}. \end{aligned}$$

In order to obtain the endemic equilibrium point of system (2), we set:

$$\begin{aligned} \left\{ \begin{aligned}&\varLambda -\beta _{1}^{\alpha }f_{1}(S(t))g_{1}(I(t))-\beta _{2}^{\alpha }f_{2}(S(t))g_{2}(E(t))\\ {}&-\mu ^{\alpha } S(t)-\rho ^{\alpha } S(t)=0,\\&\beta _{1}^{\alpha }f_{1}(S(t))g_{1}(I(t))+\beta _{2}^{\alpha }f_{2}(S(t))g_{2}(E(t))\\ {}&-\varepsilon ^{\alpha } E(t)-\mu ^{\alpha } E(t)=0,\\&\varepsilon ^{\alpha } E(t)-(\delta ^{\alpha }+\mu ^{\alpha })I(t)=0,\\&\delta ^{\alpha } I(t)-(\lambda ^{\alpha }+\kappa ^{\alpha }+\mu ^{\alpha })Q(t)=0,\\&\lambda ^{\alpha } Q(t)-\mu ^{\alpha } R(t)=0,\\&\rho ^{\alpha } S(t)-\mu ^{\alpha } P(t)=0, \end{aligned} \right. \end{aligned}$$
(5)

which implies

$$\begin{aligned} S=\frac{\varLambda -(\varepsilon ^{\alpha }+\mu ^{\alpha })E}{\mu ^{\alpha }+\rho ^{\alpha }},\;\;I=\frac{\varepsilon ^{\alpha } E}{\sigma ^{\alpha }+\mu ^{\alpha }}. \end{aligned}$$

Combining the above equations and the second equation of (5), one has

$$\begin{aligned} \begin{aligned} (\mu ^{\alpha }+\varepsilon ^{\alpha })E&=\beta _{1}^{\alpha }f_{1}\left( \frac{\varLambda -(\mu ^{\alpha }+\varepsilon ^{\alpha })E}{\mu ^{\alpha }+\rho ^{\alpha }}\right) g_{1}\left( \frac{\varepsilon ^{\alpha } E}{\mu ^{\alpha }+\delta ^{\alpha }}\right) \\ {}&+\beta _{2}^{\alpha }f_{2}\left( \frac{\varLambda -(\mu ^{\alpha }+\varepsilon ^{\alpha })E}{\mu ^{\alpha }+\rho ^{\alpha }}\right) g_{2}(E). \end{aligned} \end{aligned}$$
(6)

Define

$$\begin{aligned} \begin{aligned} \varphi (E)&=\beta _{1}^{\alpha }f_{1}\left( \frac{\varLambda -(\mu ^{\alpha }+\varepsilon ^{\alpha })E}{\mu ^{\alpha }+\rho ^{\alpha }}\right) g_{1}\left( \frac{\varepsilon ^{\alpha } E}{\mu ^{\alpha }+\delta ^{\alpha }}\right) \\ {}&+\beta _{2}^{\alpha }f_{2}\left( \frac{\varLambda -(\mu ^{\alpha }+\varepsilon ^{\alpha })E}{\mu ^{\alpha }+\rho ^{\alpha }}\right) g_{2}(E)-(\mu ^{\alpha }+\varepsilon ^{\alpha })E. \end{aligned} \end{aligned}$$
(7)

Note that \(\varphi (0)=0\) and \(\varphi (\frac{\varLambda }{\mu ^{\alpha }+\varepsilon ^{\alpha }})=-\varLambda <0\). In order to show that \(\varphi (E)=0\) has at least one positive root in the interval \((0,\frac{\varLambda }{\mu ^{\alpha }+\varepsilon ^{\alpha }}]\), we need to prove that \(\varphi '(0)>0\). Hence,

$$\begin{aligned} \begin{aligned} \varphi '(E)&=-\frac{(\mu ^{\alpha }+\varepsilon ^{\alpha })\beta _{1}^{\alpha }}{\mu ^{\alpha }+\rho ^{\alpha }}f'_{1}\left( \frac{\varLambda -(\mu ^{\alpha }+\varepsilon ^{\alpha })E}{\mu ^{\alpha }+\rho ^{\alpha }}\right) g_{1}\left( \frac{\varepsilon ^{\alpha } E}{\mu ^{\alpha }+\delta ^{\alpha }}\right) \\ {}&+\frac{\varepsilon ^{\alpha } \beta _{1}^{\alpha }}{\mu ^{\alpha }+\delta ^{\alpha }}f_{1}\left( \frac{\varLambda -(\mu ^{\alpha }+\varepsilon ^{\alpha })E}{\mu ^{\alpha }+\rho ^{\alpha }}\right) g'_{1}\left( \frac{\varepsilon ^{\alpha } E}{\mu ^{\alpha }+\delta ^{\alpha }}\right) \\ {}&-\frac{(\mu ^{\alpha }+\varepsilon ^{\alpha })\beta _{2}^{\alpha }}{\mu ^{\alpha }+\rho ^{\alpha }}f'_{2}\left( \frac{\varLambda -(\mu ^{\alpha }+\varepsilon ^{\alpha })E}{\mu ^{\alpha }+\rho ^{\alpha }}\right) g_{2}(E)\\ {}&+\beta _{2}^{\alpha }f_{2}\left( \frac{\varLambda -(\mu ^{\alpha }+\varepsilon ^{\alpha })E}{\mu ^{\alpha }+\rho ^{\alpha }}\right) g'_{2}(E)-(\mu ^{\alpha }+\varepsilon ^{\alpha }). \end{aligned} \end{aligned}$$
(8)

Therefore, we have

$$\begin{aligned} \begin{aligned} \varphi '(0)&=\frac{\varepsilon ^{\alpha }\beta _{1}^{\alpha }}{\mu ^{\alpha }+\delta ^{\alpha }}f_{1}(S_{0})g'_{1}(0)+\beta _{2}^{\alpha }f_{2}(S_{0})g'_{2}(0)-(\mu ^{\alpha }+\varepsilon ^{\alpha })\\ {}&=(\mu ^{\alpha }+\varepsilon ^{\alpha })({{ R }}_{0}-1), \end{aligned} \end{aligned}$$

where the basic reproduction number is given by

$$\begin{aligned} \begin{aligned} { R }_{0}&=\frac{\varepsilon ^{\alpha }\beta _{1}^{\alpha }}{(\mu ^{\alpha }+\varepsilon ^{\alpha })(\mu ^{\alpha }+\delta ^{\alpha })}f_{1}(S_{0})g'_{1}(0)\\ {}&+\frac{\beta _{2}^{\alpha }}{\mu ^{\alpha }+\varepsilon ^{\alpha }}f_{2}(S_{0})g'_{2}(0). \end{aligned} \end{aligned}$$
(9)

If \({ R }_{0}>1\), system (2) has at least one endemic equilibrium point \(M_{*}=(S_{*},E_{*},I_{*},Q_{*},R_{*},P_{*})\), where

$$\begin{aligned} S_{*}=\frac{\varLambda -(\mu ^{\alpha }+\varepsilon ^{\alpha })E_{*}}{\mu ^{\alpha }+\rho ^{\alpha }},\;\;I_{*}=\frac{\varepsilon ^{\alpha } E_{*}}{\mu ^{\alpha }+\delta ^{\alpha }}, \\ Q_{*}=\frac{\delta ^{\alpha } I_{*}}{\mu ^{\alpha }+\lambda ^{\alpha }+\kappa ^{\alpha }},\;\;R_{*}=\frac{\lambda ^{\alpha } Q_{*}}{\mu ^{\alpha }},\;\;P_{*}=\frac{\rho ^{\alpha } S_{*}}{\mu ^{\alpha }}. \end{aligned}$$

In the following section, we show that endemic equilibrium point \(M_{*}\) is unique. According to the above analysis and hypothesis (i)–(iii), one has

$$\begin{aligned} \begin{aligned} \varphi '(E_{*})&=-\frac{(\mu ^{\alpha }+\varepsilon ^{\alpha })\beta _{1}^{\alpha }}{\mu ^{\alpha }+\rho ^{\alpha }}f'_{1}(S_{*})g_{1}(I_{*})+\frac{\varepsilon ^{\alpha }\beta _{1}^{\alpha }}{\mu ^{\alpha }+\delta ^{\alpha }}f_{1}(S_{*})g'_{1}(I_{*})\\ {}&-\frac{(\mu ^{\alpha }+\varepsilon ^{\alpha })\beta _{2}^{\alpha }}{\mu ^{\alpha }+\rho ^{\alpha }}f'_{2}(S_{*})g_{2}(E_{*})+\beta _{2}^{\alpha }f_{2}(S_{*})g'_{2}(E_{*})\\ {}&-(\mu ^{\alpha }+\varepsilon ^{\alpha })\\&=-\frac{(\mu ^{\alpha }+\varepsilon ^{\alpha })\beta _{1}^{\alpha }}{\mu ^{\alpha }+\rho ^{\alpha }}f'_{1}(S_{*})g_{1}(I_{*})\\ {}&-\frac{(\mu ^{\alpha }+\varepsilon ^{\alpha })\beta _{2}^{\alpha }}{\mu ^{\alpha }+\rho ^{\alpha }}f'_{2}(S_{*})g_{2}(E_{*})\\ {}&+\frac{\varepsilon ^{\alpha }\beta _{1}^{\alpha }}{\mu ^{\alpha }+\delta ^{\alpha }}f_{1}(S_{*})g'_{1}(I_{*})-\frac{\varepsilon ^{\alpha }\beta _{1}^{\alpha }f_{1}(S_{*})g_{1}(I_{*})}{(\mu ^{\alpha }+\delta ^{\alpha })I_{*}}\\ {}&+\beta _{2}^{\alpha }f_{2}(S_{*})g'_{2}(E_{*})-\frac{\beta _{2}^{\alpha }f_{2}(S_{*})g_{2}(E_{*})}{E_{*}}\\&=-\frac{(\mu ^{\alpha }+\varepsilon ^{\alpha })\beta _{1}^{\alpha }}{\mu ^{\alpha }+\rho ^{\alpha }}f'_{1}(S_{*})g_{1}(I_{*})\\ {}&-\frac{(\mu ^{\alpha }+\varepsilon ^{\alpha })\beta _{2}^{\alpha }}{\mu ^{\alpha }+\rho ^{\alpha }}f'_{2}(S_{*})g_{2}(E_{*})\\ {}&+\frac{\varepsilon ^{\alpha }\beta _{1}^{\alpha }}{\mu ^{\alpha }+\delta ^{\alpha }}f_{1}(S_{*})[g'_{1}(I_{*})-\frac{g_{1}(I_{*})}{I_{*}}]\\ {}&+\beta _{2}^{\alpha }f_{2}(S_{*})[g'_{2}(E_{*})-\frac{g_{2}(E_{*})}{E_{*}}]. \end{aligned} \end{aligned}$$
(10)

By hypothesis (iv), this implies \(\varphi '(E_{*})<0\). If there is another endemic equilibrium point \(M_{**}\), then \(\varphi '(E_{**})\ge 0\) holds, which contradicts the previous discussion. Hence, system (2) has a unique endemic equilibrium point \(M_{*}\) when \({ R }_{0}>1\). Based on the above analysis, the following results can be obtained.

Theorem 2

System (2) has a unique disease-free equilibrium point \(M_{0}\), if \({ R }_{0}<1\). System (2) has a unique endemic equilibrium point \(M_{*}\), if \({ R }_{0}>1\).

3.2 Stability analysis

In this section, the local asymptotic stability of disease-free equilibrium point \(M_{0}\) and endemic equilibrium point \(M_{*}\) for system (2) is discussed.

Theorem 3

With regard to system (2), the disease-free equilibrium point \(M_{0}\) is locally asymptotic stability, if \( R_{0} <1\); the disease-free equilibrium point \(M_{0}\) is unstable, if \( R_{0} >1\).

Proof

The Jacobian matrix of system (2) at the disease-free equilibrium point \(M_{0}\) is

$$\begin{aligned} J_{M_{0}}= \begin{bmatrix} J_{11} &{} J_{12}\\ J_{21} &{} J_{22} \end{bmatrix}, \end{aligned}$$

where

$$\begin{aligned}J_{11}= \begin{bmatrix} -\mu ^{\alpha }-\rho ^{\alpha } &{} -h_{2} &{} -h_{1} \\ 0 &{} h_{2}-\mu ^{\alpha }-\varepsilon ^{\alpha } &{} h_{1} \\ 0 &{} \varepsilon ^{\alpha } &{} -\mu ^{\alpha }-\delta ^{\alpha } \end{bmatrix},\end{aligned}$$
$$\begin{aligned} J_{12}= \begin{bmatrix} 0 &{} 0 &{} 0 \\ 0 &{} 0 &{} 0 \\ 0 &{} 0 &{} 0 \end{bmatrix},\;\; J_{21}= \begin{bmatrix} 0 &{} 0 &{} \delta ^{\alpha } \\ 0 &{} 0 &{} 0 \\ \rho ^{\alpha } &{} 0 &{} 0 \end{bmatrix}, \end{aligned}$$
$$\begin{aligned} J_{22}= \begin{bmatrix} -\mu ^{\alpha }-\kappa ^{\alpha }-\lambda ^{\alpha } &{} 0 &{} 0\\ \lambda ^{\alpha } &{} -\mu ^{\alpha } &{} 0\\ 0 &{} 0 &{} -\mu ^{\alpha } \end{bmatrix}, \end{aligned}$$

where \(h_{1}=\beta _{1}^{\alpha }f_{1}(S_{0})g'_{1}(0)\) and \(h_{1}=\beta _{2}^{\alpha }f_{2}(S_{0})g'_{2}(0)\). The corresponding characteristic equation is

$$\begin{aligned} \begin{aligned} H(s)&=|sE-J_{M_{0}}|\\ {}&=(s+\mu ^{\alpha })^{2}(s+\mu ^{\alpha }+\rho ^{\alpha })(s+\mu ^{\alpha }+\lambda ^{\alpha }+\kappa ^{\alpha })H_{1}(s), \end{aligned} \end{aligned}$$
(11)

where

$$\begin{aligned} \begin{aligned} H_{1}(s)&=s^{2}+(\delta ^{\alpha }+2\mu ^{\alpha }+\varepsilon ^{\alpha }-\beta _{2}^{\alpha }f_{2}(S_{0})g'_{2}(0))s\\ {}&+(\mu ^{\alpha }+\delta ^{\alpha })(\mu ^{\alpha }+\varepsilon ^{\alpha })\\ {}&-\beta _{2}^{\alpha }(\mu ^{\alpha }+\delta ^{\alpha })f_{2}(S_{0})g'_{2}(0)-\varepsilon ^{\alpha }\beta _{1}^{\alpha }f_{1}(S_{0})g'_{1}(0). \end{aligned} \end{aligned}$$
(12)

The characteristic equation \(H(s)=0\) has four obvious negative characteristic roots, which are denoted by \(s_{1}=s_{2}=-\mu ^{\alpha }\), \(s_{3}=-\mu ^{\alpha }-\rho ^{\alpha }\) and \(s_{4}=-\mu ^{\alpha }-\lambda ^{\alpha }-\kappa ^{\alpha }\), respectively. The discriminant of \(H_{1}(s)\) in quadratic form is

$$\begin{aligned} \begin{aligned} \Delta&=[\delta ^{\alpha }+2\mu ^{\alpha }+\varepsilon ^{\alpha }-\beta _{2}^{\alpha }f_{2}(S_{0})g'_{2}(0)]^2\\ {}&-4[(\mu ^{\alpha }+\delta ^{\alpha })(\mu ^{\alpha }+\varepsilon ^{\alpha })\\ {}&-\beta _{2}^{\alpha }(\mu ^{\alpha }+\delta ^{\alpha })f_{2}(S_{0})g'_{2}(0)-\varepsilon ^{\alpha }\beta _{1}^{\alpha }f_{1}(S_{0})g'_{1}(0)]\\&=(\delta ^{\alpha }-\varepsilon ^{\alpha }+\beta _{2}^{\alpha }f_{2}(S_{0})g'_{2}(0))^{2}+4\varepsilon ^{\alpha }\beta _{1}^{\alpha }f_{1}(S_{0})g'_{1}(0)\\ {}&>0. \end{aligned} \end{aligned}$$

This implies that the other two eigenvalues \(s_{5}\) and \(s_{6}\) of characteristic equation \(H(s)=0\) are real roots. Hence,

$$\begin{aligned} s_{5}+s_{6}=-(\delta ^{\alpha }+2\mu ^{\alpha }+\varepsilon ^{\alpha }-\beta _{2}^{\alpha }f_{2}(S_{0})g'_{2}(0)), \\ s_{5}s_{6}=(\mu ^{\alpha }+\delta ^{\alpha })(\mu ^{\alpha }+\varepsilon ^{\alpha })(1-{ R }_{0}). \end{aligned}$$

If \({ R }_{0}<1\), then one can obtain \(s_{5}+s_{6}<0\) and \(s_{5}s_{6}>0\), which imply \(s_{5}<0\) and \(s_{6}<0\). If \({ R }_{0}>1\), one has \(s_{5}s_{6}<0\), which imply \(s_{5}>0\) or \(s_{6}>0\). It follows from Lemma 1 that the proof is completed.

Further, we will show the locally asymptotic stability of the endemic equilibrium point \(M_{*}\) of system (2). Similarly, the corresponding Jacobian matrix of system (2) at \(M_{*}\) is

$$\begin{aligned} J_{M_{*}}= \begin{bmatrix} J_{1} &{} J_{2}\\ J_{3} &{} J_{4} \end{bmatrix}, \end{aligned}$$

where

$$\begin{aligned} J_{1}= \begin{bmatrix} -l_{3}-l_{4}-\mu ^{\alpha }-\rho ^{\alpha } &{} -l_{2} &{} -l_{1} \\ l_{3}+l_{4} &{} l_{2}-\varepsilon ^{\alpha }-\mu ^{\alpha } &{} l_{1}\\ 0 &{} \varepsilon ^{\alpha } &{} -\delta ^{\alpha }-\mu ^{\alpha } \end{bmatrix}, \end{aligned}$$
$$\begin{aligned} J_{2}= \begin{bmatrix} 0 &{} 0 &{} 0\\ 0 &{} 0 &{} 0\\ 0 &{} 0 &{} 0\\ \end{bmatrix},\;\; J_{3}= \begin{bmatrix} 0 &{} 0 &{} \delta ^{\alpha } \\ 0 &{} 0 &{} 0 \\ \rho ^{\alpha } &{} 0 &{} 0 \\ \end{bmatrix}, \end{aligned}$$
$$\begin{aligned} J_{4}= \begin{bmatrix} -\mu ^{\alpha }-\lambda ^{\alpha }-\kappa ^{\alpha } &{} 0 &{} 0\\ \lambda ^{\alpha } &{} -\mu ^{\alpha } &{} 0\\ 0 &{} 0 &{} -\mu ^{\alpha }\\ \end{bmatrix}, \end{aligned}$$

with

$$\begin{aligned} l_{1}= & {} \beta _{1}^{\alpha }f_{1}(S_{*})g'_{1}(I_{*}),\;\;l_{2}=\beta _{2}^{\alpha }f_{2}(S_{*})g'_{2}(E_{*}), \\ l_{3}= & {} \beta _{1}^{\alpha }f'_{1}(S_{*})g_{1}(I_{*}),\;\;l_{4}=\beta _{2}^{\alpha }f'_{2}(S_{*})g_{2}(E_{*}). \end{aligned}$$

Hence, the corresponding characteristic equation is

$$\begin{aligned} \begin{aligned} L(s)&=|sE-J_{M_{*}}|\\&=(s+\mu ^{\alpha })^{2}(s+\mu ^{\alpha }+\lambda ^{\alpha }+\kappa ^{\alpha })L_{1}(s), \end{aligned} \end{aligned}$$
(13)

where

$$\begin{aligned} L_{1}(s)=s^{3}+a_{1}s^{2}+a_{2}s+a_{3}, \end{aligned}$$

with

$$\begin{aligned} a_{1}= & {} \varepsilon ^{\alpha }+3\mu ^{\alpha }+\delta ^{\alpha }+\rho ^{\alpha }-l_{2}+l_{3}+l_{4},\\ a_{2}= & {} (\mu ^{\alpha }+\varepsilon ^{\alpha }-l_{2})(2\mu ^{\alpha }+\delta ^{\alpha }+\rho ^{\alpha })+(\mu ^{\alpha }+\delta ^{\alpha })(\mu ^{\alpha }+\rho ^{\alpha })\\&\quad +(2\mu ^{\alpha }+\delta ^{\alpha }+\varepsilon ^{\alpha })(l_{3}+l_{4})-\varepsilon ^{\alpha } l_{1},\\ a_{3}= & {} (\mu ^{\alpha }+\delta ^{\alpha })(\mu ^{\alpha }+\rho ^{\alpha })(\mu ^{\alpha }+\varepsilon ^{\alpha }-l_{2})\\&\quad +(\mu ^{\alpha }+\delta ^{\alpha })(\mu ^{\alpha }+\varepsilon ^{\alpha })(l_{3}+l_{4})-\varepsilon ^{\alpha }(\mu ^{\alpha }+\rho ^{\alpha })l_{1}. \end{aligned}$$

By the corresponding results in [26, 27], let

$$\begin{aligned} \begin{aligned} D _{1}(L_{1}(s))&= {\left| \begin{array}{ccccc} 1 &{} a_{1} &{} a_{2} &{} a_{3} &{} 0\\ 0 &{} 1 &{} a_{1} &{} a_{2} &{} a_{3}\\ 3 &{} 2la_{1} &{} a_{2} &{} 0 &{} 0\\ 0 &{} 3 &{} 2a_{1} &{} a_{2} &{} 0\\ 0 &{} 0 &{} 3 &{} 2a_{1} &{} a_{2} \end{array} \right| }\\&=18a_{1}a_{2}a_{3}+a_{1}^{2}a_{2}^{2}-4a_{1}^{3}a_{3}-4a_{2}^{3}-27a_{3}^{2}. \end{aligned} \end{aligned}$$
(14)

Then, the following result can be obtained.

Theorem 4

With regard to system (2), assume that \({ R }_{0}>1\),

  1. (i)

    If \( D _{1}(L_{1}(s))>0\), \(a_{1}>0\), \(a_{3}>0\) and \(a_{1}a_{2}>a_{3}\), then the endemic equilibrium point \(M_{*}\) is locally asymptotically stable.

  2. (ii)

    If \( D _{1}(L_{1}(s))<0\), \(a_{1}>0\), \(a_{2}>0\) and \(a_{3}>0\), then the endemic equilibrium point \(M_{*}\) is locally asymptotically stable for \(\alpha \in (0,\frac{2}{3})\).

  3. (iii)

    If \( D _{1}(L_{1}(s))<0\), \(a_{1}<0\) and \(a_{2}<0\), then the endemic equilibrium point \(M_{*}\) is unstable for \(\alpha \in (\frac{2}{3},1)\).

  4. (iv)

    If \( D _{1}(L_{1}(s))<0\), \(a_{1}>0\), \(a_{2}>0\) and \(a_{1}a_{2}=a_{3}\), then for \(\alpha \in (0,1)\), the endemic equilibrium point \(M_{*}\) is locally asymptotically stable.

Proof

Based on the previous discussion, the characteristic equation \(L(s)=0\) has three obvious negative roots \(s_{1}=s_{2}=-\mu ^{\alpha }\) and \(s_{3}=-\mu ^{\alpha }-\lambda ^{\alpha }-\kappa ^{\alpha }\). In order to investigate the stability of equilibrium point \(M_{*}\), we only need to discuss the range of the root of \(L_{1}(s) = 0\), denoted by \(s_{4}\), \(s_{5}\) and \(s_{6}\).

(i) By the results in [27], if \( D _{1}(L_{1}(s))>0\), then \(s_{4}\), \(s_{5}\) and \(s_{6}\) are real roots. Further, by Routh–Hurwitz criterion, the necessary and sufficient conditions for \(s_{i}(i=4,5,6)\) to lie in the left half plane are

$$\begin{aligned} a_{1}>0,\;\;a_{3}>0,\;\;a_{1}a_{2}>a_{3}. \end{aligned}$$

That is to say, under the above conditions, the roots of \(L_{1}(s)=0\) satisfy

$$\begin{aligned} |\arg (s_{i})|>\frac{\pi }{2}>\frac{\alpha \pi }{2}\;\;(i=4,5,6). \end{aligned}$$

Therefore, \(M_{*}\) is locally asymptotically stable and (i) holds

(ii) By the results in [27], if \( D _{1}(L_{1}(s))<0\), then \(L_{1}(s)=0\) has a real root and a pair of conjugate complex roots, denoted by \(s_{4}\), \(m+ni\) and \(m-ni\), respectively. Thus, one has

$$\begin{aligned} \begin{aligned} L_{1}(s)&=s^{3}+a_{1}s^{2}+a_{2}s+a_{3}\\&=(s-s_{4})(s-m-ni)(s-m+ni). \end{aligned} \end{aligned}$$

By calculation,

$$\begin{aligned} a_{1}=-s_{4}-2m,\;\;a_{2}=2s_{4}m+m^{2}+n^{2},\;\;a_{3}=-s_{4}(m^{2}+n^{2}). \end{aligned}$$

The conditions \(a_{1}>0\), \(a_{2}>0\) and \(a_{3}>0\) imply that

$$\begin{aligned} -s_{1}>2m,\;\;m^{2}+n^{2}>-2s_{4}m,\;\;s_{4}<0. \end{aligned}$$

Further, one has

$$\begin{aligned} m^{2}+m^{2}\tan ^{2}\theta>-2s_{4}m>4m^{2}. \end{aligned}$$

That is to say \(\tan ^{2}\theta >3\), which implies

$$\begin{aligned} \theta =|\arg (s)|>\frac{\pi }{3}. \end{aligned}$$

Therefore, in order to ensure the establishment of \(|\arg (s)|>\frac{\alpha \pi }{2}\), we must have \(\alpha <\frac{2}{3}\). Thus, (ii) holds.

The proof of conclusions (iii) and (iv) is similar to that of conclusion (ii); hence, we omit it.

4 Numerical simulations

4.1 Data sources

The data used in this paper are from the Johns Hopkins University Center for Systems Science and Engineering (https://github.com/CSSEGISandData/COVID-19). the Johns Hopkins University publishes data of accumulated and newly confirmed cases, recovered cases and death cases of COVID-19 from January 22, 2020.

4.2 Analysis of the SEIQRP model

In the following discussion, the standard incidence rate [28] is used to describe the transmission of COVID-19 and is given by

$$\begin{aligned} \beta _{1}^{\alpha }f_{1}(S)g_{1}(I)=\frac{\beta _{1}^{\alpha }SI}{N},\;\;\beta _{2}^{\alpha }f_{2}(S)g_{2}(E)=\frac{\beta _{2}^{\alpha }SE}{N}, \end{aligned}$$

where N represents the total population of the region at the initial time. Hence,

$$\begin{aligned} \left\{ \begin{aligned} _{0}^{C}D^{\alpha }_{t}S(t)&=\varLambda -\frac{\beta _{1}^{\alpha }S(t)I(t)}{N}-\frac{\beta _{2}^{\alpha }S(t)E(t)}{N}\\ {}&-\mu ^{\alpha } S(t)-\rho ^{\alpha } S(t),\\ _{0}^{C}D^{\alpha }_{t}E(t)&=\frac{\beta _{1}^{\alpha }S(t)I(t)}{N}+\frac{\beta _{2}^{\alpha }S(t)E(t)}{N}\\ {}&-\varepsilon ^{\alpha } E(t)-\mu ^{\alpha } E(t),\\ _{0}^{C}D^{\alpha }_{t}I(t)&=\varepsilon ^{\alpha } E(t)-(\delta ^{\alpha }+\mu ^{\alpha })I(t),\\ _{0}^{C}D^{\alpha }_{t}Q(t)&=\delta ^{\alpha } I(t)-(\lambda ^{\alpha }+\kappa ^{\alpha }+\mu ^{\alpha })Q(t),\\ _{0}^{C}D^{\alpha }_{t}R(t)&=\lambda ^{\alpha } Q(t)-\mu ^{\alpha } R(t),\\ _{0}^{C}D^{\alpha }_{t}P(t)&=\rho ^{\alpha } S(t)-\mu ^{\alpha } P(t). \end{aligned} \right. \end{aligned}$$
(15)

The effectiveness of model (15) in describing the spread of COVID-19 is illustrated by selecting the confirmed, cured and death cases in the USA. According to the real data provided by the Johns Hopkins University, the outbreak in the USA has not been brought under full control. The data of confirmed cases, recovered cases and death cases are selected from January 22, 2020, to April 20, 2020. Assuming that the confirmed individuals will be isolated, then

$$\begin{aligned} Isolated=Confirmed-Recovered-Death.\nonumber \\ \end{aligned}$$
(16)

This hypothesis is in line with the actual situation. Hence, we can obtain the real data of isolated cases. Let \(e_{Q}=Q_{r}-Q_{m}\), \(e_{R}=R_{r}-R_{m}\) and \(e_{D}=D_{r}-D_{m}\), where \(Q_{r}\), \(R_{r}\) and \(D_{r}\) denote the real data of isolated cases, recovered cases and death cases, respectively, \(Q_{m}\), \(R_{m}\) and \(D_{m}\) represent the measured value of isolated cases, recovered cases and death cases, respectively. Here, we consider a minimization objective function as:

$$\begin{aligned} J_{min}=\sum _{i=1}^{T}(e_{Q}^{2}(i)+e_{R}^{2}(i)+e_{D}^{2}(i)), \end{aligned}$$
Table 1 Summary table of the parameter identification for model (15) after using least squares fitting to real data from January 22, 2020, to April 20, 2020
Fig. 2
figure 2

Number of isolated cases predicted and recovered cases predicted by the model (15) for the USA

Table 2 Summary of parameter identification of model (17) (data used from January 22, 2020, to April 20, 2020)

where T is the number of real data of isolated cases available. The internal function lsqcurvefit of MATLAB is used to minimize the above objective function value. Similar methods are adopted for subsequent parameter identification. Through the fractional predictor–corrector method [29, 30] and the least squares fitting [31], we can identify the parameters of the model (15) through the real data, which is given in Table 1. For more detail with respect to fractional predictor–corrector algorithm, one can refer to [29, 30].

Based on the parameters in Table 1, we can make a simple prediction of isolated cases and recovered cases in the USA, which is shown in Fig. 2. We need to emphasize that the peak here represents the number of isolated cases rather than the cumulative number of confirmed cases.

Using the parameters in Table 1, it can be calculated by using (9) that \({ R }_{0}=0.0289<1\). By the conclusion in Theorem 3, the disease-free equilibrium point is local asymptotic stable. The SEIQRP model has a basic guiding significance for predicting and fitting spreading dynamics of COVID-19. However, the prediction of this model for COVID-19 is relatively rough; we still need to improve model (15) according to actual mitigation policies and research objectives. According to the analysis in Sect. 2.1, we choose the SEIQRPD model to predict the trend of the epidemic in the USA under reasonable assumptions.

4.3 The SEIQRPD model for the prediction of COVID-19

Similarly, the standard incidence rate is used to describe the transmission of COVID-19, and the fractional SEIQRPD model can be expressed as

$$\begin{aligned} \left\{ \begin{aligned} _{0}^{C}D^{\alpha }_{t}S(t)&=-\frac{\beta _{1}^{\alpha }(t)S(t)I(t)}{N}-\frac{\beta _{2}^{\alpha }S(t)E(t)}{N}-\rho ^{\alpha } S(t),\\ _{0}^{C}D^{\alpha }_{t}E(t)&=\frac{\beta _{1}^{\alpha }(t)S(t)I(t)}{N}+\frac{\beta _{2}^{\alpha }S(t)E(t)}{N}-\varepsilon ^{\alpha } E(t),\\ _{0}^{C}D^{\alpha }_{t}I(t)&=\varepsilon ^{\alpha } E(t)-\delta ^{\alpha } I(t),\\ _{0}^{C}D^{\alpha }_{t}Q(t)&=\delta ^{\alpha } I(t)-(\lambda ^{\alpha }(t)+\kappa ^{\alpha }(t))Q(t),\\ _{0}^{C}D^{\alpha }_{t}R(t)&=\lambda ^{\alpha }(t) Q(t),\\ _{0}^{C}D^{\alpha }_{t}P(t)&=\rho ^{\alpha } S(t),\\ _{0}^{C}D^{\alpha }_{t}D(t)&=\kappa ^{\alpha }(t) Q(t). \end{aligned} \right. \end{aligned}$$
(17)
Fig. 3
figure 3

Fitting effect of the improved model (17) on the outbreak in Hubei, Guangdong, Hunan and Zhejiang

It should be emphasized that the numerical properties of model (17) are mainly studied. Different from the analysis of model (2) and model (15), the fractional-order \(\alpha \) of model (3) and model (17) is not required to be within (0, 1). Therefore, in the parameter identification of model (17), the search range of parameter \(\alpha \) is considered as (0, 2). When \(\alpha =1\), the fractional-order SEIQRPD model is similar to the integer-order model used in [24]. According to the data provided by Johns Hopkins University, by April 20, 2020, the outbreak in China has been basically controlled. In many provinces of China, the number of new cases per day is increasing in single digits. This means that the data in China contain more information about the spreading dynamics of COVID-19. Therefore, the data in Hubei, Guangdong, Hunan and Zhejiang are selected to research the fitting effect of the model (17). According to the real data of these four regions in China, the parameters of the model (17) are identified, and the results are shown in Table 2. The model (17) successfully capture the trend of the outbreak, which is shown in Fig. 3.

According to Definition 1, when \(\alpha =1\), the fractional-order system (17) becomes integer-order system. Tables 5 and 6, respectively, show the prediction results of cumulative confirmed cases and isolated cases by the integer-order system corresponding to model (17). By comparing Table 3 with Tables 5 and 4 with Table 6, we can find that the fractional system (17) has better prediction effect than its corresponding integer system.

Table 3 Summary of real and predicted data for the cumulative confirmed cases in the USA from April 6, 2020, to April 20, 2020
Table 4 Summary of real and predicted data for the isolated cases in the USA from April 6, 2020, to April 20, 2020
Table 5 Summary of real and predicted data for the cumulative confirmed cases in the USA from April 6, 2020, to April 20, 2020, when \(\alpha =1\)
Table 6 Summary of real and predicted data for the isolated cases in the USA from April 6, 2020, to April 20, 2020, when \(\alpha =1\)

At present, we can obtain the epidemic data of the USA from January 22, 2020, to April 20, 2020. We need to preprocess the data to remove the data smaller than 0.5% of the current highest number of confirmed cases. In order to test the prediction ability of the SEIQRPD model (17) for the development process of the epidemic in the USA, we select the data before April 5 to identify the parameters of the SEIQRPD model (17). Furthermore, in order to illustrate the ability of the SEIQRPD model (17) in predicting the outbreak, we compared the real data and fitted data after April 5, which is given in Tables 3, 4 and Fig. 4. According to the results in Tables 3 and 4, it can be found that the real values of current isolated cases and cumulative confirmed cases fall within the range of 94–106% of the predicted values. This shows that the SEIQRPD model (17) can effectively predict the data in the next two weeks.

The data before April 20, 2020, are selected to identify the parameters of the improved model (17), and the results are shown in Table 7. The cumulative number of confirmed cases and the number of isolated cases after two weeks are predicted, which is given in Table 8.

Fig. 4
figure 4

Based on the data of the USA from January 22 to April 5, 2020, to verify the accuracy of the forecast for the next 15 days

Table 7 Summary table of the parameter identification for model (17) (data used from January 22, 2020, to April 20, 2020)
Table 8 Summary of predicted data for the USA from April 21, 2020, to May 5, 2020

The isolated cases in the USA will peak on June 18, 2020, with the peak of \(1.0431\times 10^{6}\), which is shown in Fig. 5. We must point out that the peak forecast here is only based on the data before April 20, 2020. It is only a rough estimate from the point of view of numerical simulation. The peak value will be greatly influenced by the policy and environment. The change of policy and environment makes the prediction of peak value very difficult. For the peak prediction, we will do further research in the future work.

5 Conclusion

We first propose the fractional SEIQRP model with generalized incidence rates. Some qualitative properties of the SEIQRP model are discussed. In order to predict COVID-19 effectively, we propose an improved SEIQRPD model according to the actual mitigation situation. According to the data of the USA before April 5, 2020, the trend of the outbreak in the USA from April 6 to April 20 is successfully predicted as compared to the real records. Then, using the data before April 20, 2020, we forecast the trend of the outbreak in the USA in the next two weeks and estimate the peak of isolated cases and the date of the peak.

Fig. 5
figure 5

Based on the data of the USA from January 22 to April 20, 2020, to verify the accuracy of the forecast for the next 15 days

The improved SEIQRP model proposed in this paper successfully captures the trend of COVID-19. The long-term prediction needs to adjust the model appropriately according to the change of policy and medical level. We will discuss in the future work.