1 Introduction

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) emerged in December 2019 and caused numerous COVID-19 outbreaks worldwide [1]. Neither a vaccine against the virus nor a specific treatment was available at that moment [2]. Infected individuals may develop some typical symptoms, including fever, continuous cough, fatigue, headache, and sore throat. More than 50% of the severe cases may die of the infection [3], while some exposed and asymptomatic infectious people are also contagious [4], which complicates the efforts to control the pandemic through intervention strategies [5].

Effective surveillance and early detection of SARS-CoV-2 transmission in unaffected regions are essential to mitigate the initial spread of the infection and substantially reduce health care, economic, and societal burden [6]. The findings of the Global Health Security Index [7] show that only 19% of countries can detect and report epidemics of potential international concerns rapidly, and less than 5% of countries can respond to and mitigate the spread of an epidemic promptly. Hence, almost no countries are fully prepared for epidemics or pandemics [8]. The National Health System (NHS) of the United Kingdom (UK) has evolved as one of the largest healthcare systems in the world, which is based on need instead of an individual’s ability to pay for the treatment [9]. However, the UK was one of the most affected countries by COVID-19 in the world.

The UK medical community suggested that the government should have followed the early recommendation of the World Health Organization (WHO) based on responses to past epidemics, including SARS in 2003, MERS in 2012, and H1N1 [10]. The UK did not initiate a massive testing program for COVID-19 suspected patients and provide appropriate personal protective equipment (PPE) and pharmaceutical resources for healthcare workers [11]. Thus, coping with this emerging infectious disease and its uncontrollable worldwide expansion has become a great challenge. The mitigating efforts during the early COVID-19 outbreak from January 31 to November 5, 2020 in the UK could be analyzed in five stages. The first stage was from January 31, 2020, when the first two confirmed cases were reported by the UK and hospitalized, till March 23, 2020, when the UK implemented a national lockdown. The second stage was from March 23 to May 13, 2020, when the UK ended the national lockdown. The third stage was from May 13 to July 4, 2020, during which the UK gradually reopened. The fourth stage was from July 4 to September 24, 2020, during which the UK returned to normal life. The fifth stage was from September 24 to November 5, 2020, during which the UK intensified the NPIs due to the second outbreak.

The effectiveness of these interventions can be quantified by an effective reproduction number \(R_{e}(t)\), which represents the average number of infections produced by an infected individual during the variable infectious period [12]. If \(R_{e}(t)<1\), the epidemic will die out, otherwise, the number of new infections keeps increasing until the epidemic peaks when \(R_{e}(t)=1\). The effective reproduction number is a function of three factors, namely, the fraction of the population that is immune to the infection, the number of contacts between individuals after implementing NPIs or using PPE, and the length of time during which an infectious person is spreading the virus. Because the fraction of people who were immune to COVID-19 infection in the UK was still tiny, the epidemic did not stop because of herd immunity. People reduced their number of contacts and used PPE to prevent the infection. The improvement of detection efficiency could shorten the transmission time of SARS-CoV-2 [13]. These strategies were implemented together to bring \(R_{e}(t)\) below 1 and flatten the epidemic curve. Thus, the trend of the epidemic growth across the UK was no longer exponential since the strengthening of the NPIs [13].

Mathematical models play a crucial role in epidemiology for understanding and predicting the spread of infectious diseases. Two commonly used approaches are modeling using differential equations and modeling using agent-based network models. In the case of differential equations, these models describe the interactions between different compartments of a population (e.g., susceptible, infected, recovered) using differential equations [1416]. They are often used to capture the average behavior of a population and provide insights into the dynamics of disease transmission. On the other hand, agent-based network models simulate the interactions between individual agents in a population, allowing for a more detailed representation of heterogeneities and network structures [1719]. These models capture the behavior and movement patterns of individuals and can provide insights into the impact of individual-level factors on disease spread. We developed a system of ordinary differential equations (ODEs) to study the transmission dynamics of COVID-19 in the UK. The uncertainty in the model parameters can lead to significant inaccuracy in model projections [20]. We used a Markov Chain Monte Carlo (MCMC) method to estimate the mean values of these parameters and quantify this uncertainty [21]. We applied the Delayed Rejection Adaptive Metropolis (DRAM) algorithm to ensure the accuracy of parameter estimations [22].

We fitted the model with the daily number of confirmed new cases and daily number of deaths in the five stages from January 31 to November 5, 2020. We drove the effective reproduction numbers for the five stages. We used the partial rank correlation coefficient (PRCC) for the global sensitivity analysis. The model predicted the future trends of the epidemic and evaluated the effectiveness of NPIs under different scenarios.

2 The mathematical model

Based on the clinical progress of COVID-19, the epidemiological characteristics, and the countermeasures in the UK, we developed a compartmental model where individuals are divided into susceptible (S), exposed (E), symptomatic infectious (I), asymptomatic infectious (A), hospitalized (H), recovered (R), dead (D), quarantined susceptible (\(S_{q}\)), and quarantined exposed (\(E_{q}\)) compartments.

The schematic diagram for the model is illustrated in Figure 1, and the model parameters as well as initial values are shown in Tables 1 and 2. Let the transmission probability be \(\beta (t)\) and the contact rate be \(c(t)\). The susceptible individuals can be infected by E, A, or I. A proportion, \(q(t)\), of individuals exposed to the infection is quarantined through contact tracing. The quarantined individuals can move to the state \(S_{q}\) or \(E_{q}\) at the rate \((1-\beta (t))c(t)q(t)\) or \(\beta (t)c(t)q(t)\), depending on whether they are effectively infected. The parameter λ is the release rate of quarantined uninfected contacts, and \(\delta _{q}(t)\) is the rate at which quarantined infected individuals are hospitalized. The proportion, \(1-q(t)\), of individuals exposed to the infection who are missed from the contact tracing and enter the exposed state, E, at the rate \(\beta (t)c(t)(1-q(t))\). The fraction ρ of exposed individuals progresses to the symptomatic infectious state I at the rate ρα, while the rest of the exposed individuals enter the asymptomatic infectious state, A, at the rate \((1-\rho ) \alpha\). The asymptomatic infectious individuals recover at the rate \(\gamma _{A}\). The symptomatic infectious individuals are hospitalized at the rate \(\delta _{I}(t)\) and recover at the rate \(\gamma _{H}\), or die at the rate \(\mu (t)\). The resulting system of ODEs is

$$\begin{aligned} { } \textstyle\begin{cases} S'(t)=-[\beta (t)c(t)+c(t)q(t)(1-\beta (t))]\frac{S}{N}(\nu E+\theta A+I)+ \lambda S_{q}, \\ E'(t)=\beta (t)c(t)(1-q(t))\frac{S}{N} (\nu E+\theta A+I) -\alpha E, \\ I'(t)=\rho \alpha E-(\gamma _{I}+\delta _{I}(t)+\mu (t)) I, \\ A'(t)=(1-\rho )\alpha E- \gamma _{A} A, \\ S'_{q}(t)=c(t)q(t)(1-\beta (t))\frac{S}{N}(\nu E+\theta A+I)-\lambda S_{q}, \\ E'_{q}(t)=c(t)q(t)\beta (t)\frac{S}{N} (\nu E+\theta A+I) -\delta _{q}(t) E_{q}, \\ H'(t)=\delta _{I}(t) I+\delta _{q}(t) E_{q}-(\gamma _{H}+\mu (t))H, \\ R'(t)=\gamma _{I} I+\gamma _{A} A+\gamma _{H} H, \\ D'(t)=\mu (t)(I+H). \end{cases}\displaystyle \end{aligned}$$
(1)
Figure 1
figure 1

Schematic diagram of Model (1). The model that is consisted of nine compartments, namely, susceptible (S), exposed (E), symptomatic infectious (I), asymptomatic infectious (A), hospitalized (H), recovered (R), dead (D), quarantined susceptible (\(S_{q}\)), and quarantined exposed (\(E_{q}\)), describes the transmission dynamic of SARS-CoV-2

Table 1 Parameters of Model (1)
Table 2 Initial values of Model (1)

The total number of individuals is

$$ N(t)=S(t)+E(t)+I(t)+A(t)+S_{q}(t)+E_{q}(t)+H(t)+R(t)+D(t). $$

We assume that the transmission rate, \(\beta (t)\), the contact rate, \(c(t)\), the disease-induced mortality rate, \(\mu (t)\), the rate at which symptomatic infectious individuals are hospitalized, \(\delta _{I}(t)\), the quarantined proportion of close contacts, \(q(t)\), and the rate at which quarantined infected individuals are hospitalized, \(\delta _{q}(t)\), all vary with time t. These time-dependent parameters reflect the effect of varying NPIs over the five stages.

3 Parameter estimation and data fitting

The cumulative number of confirmed cases, the total number of deaths, the daily number of confirmed new cases, and the daily number of deaths were obtained from the official UK Government website [2].

We assumed \(S(0)=66{,}573{,}504\), which was the total population of the UK [31]. On January 31, 2020, as the first two confirmed cases were reported by the UK and hospitalized, we assumed \(H(0)=2\), and January 31, 2020, was the start date. At this time, no one was recovered, quarantined, or dead, \(R(0)=S_{q}(0)=E_{q}(0)=D(0)=0\). The parameters used in the simulations were \(\alpha =1/5\) [23], \(\rho =0.6\) [24], \(\gamma _{I}=1/12\) [25], \(\gamma _{H}=1/10\) [26], \(\gamma _{A}=1/9.5\) [27], \(\nu =\theta =0.5\) [28, 29], and \(\lambda =1/14\) [30].

Let \(\mathcal{C}(t,\chi )\) and \(\mathcal{D}(t,\chi )\) represent the cumulative number of confirmed cases and cumulative number of deaths, respectively. The dynamic equations of \(\mathcal{C}(t,\chi )\) and \(\mathcal{D}(t,\chi )\) are as follows:

$$ \textstyle\begin{cases} \frac{\text{d}\mathcal{C}(t,\chi )}{\text{d}t}=\delta _{I}(t) I+\delta _{q}(t) E_{q}, \\ \frac{\text{d}\mathcal{D}(t,\chi )}{\text{d}t}=\mu (t) (I+H), \end{cases} $$

where χ represents an unknown parameter set. The daily confirmed cases and deaths are

$$ \textstyle\begin{cases} \mathcal{P}_{\mathcal{C}}(i,\chi )= \int _{\text{day }i} \bigl( \delta _{I}(t) I+\delta _{q}(t) E_{q} \bigr)\,\text{d}t, \\ \mathcal{P}_{\mathcal{D}}(i,\chi )= \int _{\text{day }i}\mu (t) (I+H) \,\text{d}t. \end{cases} $$

We used the MCMC method to estimate \(\beta (t)\), \(c(t)\), \(\mu (t)\), \(\delta _{I}(t)\), \(q(t)\), and \(\delta _{q}(t)\) by fitting the model to the number of confirmed new cases and daily number of deaths. Other parameters were kept constant, as listed in Table 1. We ran the DRAM algorithm for 20,000 iterations with the last 5000 iterations converged after the initial ‘burn-in’ period. The Geweke convergence diagnostic was employed to assess the convergence of chains. More details on the MCMC method can be found in Appendix A.

Figures 2(A)–(D) show the fitting curves for the number of confirmed new cases, the cumulative number of confirmed cases, the daily number of deaths, and the cumulative number of deaths in five stages from January 31 to November 5, 2020, respectively. The symbols ‘+’ represent the reported number of confirmed new cases in (A), the cumulative number of confirmed cases in (B), the daily number of deaths in (C), and the cumulative number of deaths in (D), respectively. The shadow represents 95% confidence intervals (CIs). The dotted lines indicate the division of each stage. Figure 10 shows a high correlation between the number of estimated cases and the number of reported cases, indicating a good fitting effect.

Figure 2
figure 2

The fitting curves for the five stages (from January 31 to November 5, 2020). (A) The fitting curves for the number of confirmed new cases in the five stages. (B) The fitting curves for the cumulative number of confirmed cases in the five stages. (C) The fitting curves for the daily number of deaths in the five stages. (D) The fitting curves for the cumulative number of deaths in the five stages. The symbols ‘+’ represent the reported number of confirmed new cases in (A), the cumulative number of confirmed cases in (B), the daily number of deaths in (C), and the cumulative number of deaths in (D), respectively. The shaded parts correspond to 95% CIs. The dotted lines indicate the dividing lines of each stage

MCMC approach provided the estimated mean value of \(\beta (t)\), \(c(t)\), \(\mu (t)\), and \(\delta _{I}(t)\) in five stages, \(q(t)\) and \(\delta _{q}(t)\) in three stages as shown in Table 1 and Figure 9. We used profile analysis to verify that all these parameters are identifiable when fitting this model to the data [32].

4 The effective reproduction number \(R_{e}(t)\)

The effective reproduction number, \(R_{e}(t)\), is the mean number of secondary cases produced by an infected individual at any time, t, during an epidemic, which can be used to quantify the effectiveness of different mitigation strategies [33]. The effective reproduction numbers, \(R_{e}(t)\) is

$$ R_{e}(t)=R^{E}_{e}(t)+R^{A}_{e}(t)+R^{I}_{e}(t), $$
(2)

where

$$\begin{aligned} &R^{E}_{e}(t)=\beta (t)c(t) \bigl(1-q(t) \bigr) \frac{\nu}{\alpha} \frac{S(t)}{N(t)}, \\ &R^{A}_{e}(t)=\beta (t)c(t) \bigl(1-q(t) \bigr) \frac{\theta (1-\rho )}{\gamma _{A}}\frac{S(t)}{N(t)}, \\ &R^{I}_{e}(t)=\beta (t)c(t) \bigl(1-q(t) \bigr) \frac{\rho}{\gamma _{I}+\delta _{I}(t)+\mu (t)}\frac{S(t)}{N(t)}. \end{aligned}$$

Here, the effective reproduction numbers associated with the exposed population, \(R^{E}_{e}(t)\), represents the number of secondary cases produced by an exposed infected individual at any time during the incubation period, where \(\nu \beta (t)c(t)(1-q(t))\) is the transmission rate of exposed individuals who are missed through the contact tracing, and \(1/\alpha \) is the average length of the incubation period.

The effective reproduction numbers associated with the asymptomatic infected population, \(R^{A}_{e}(t)\), is the number of secondary cases produced by an asymptomatic infectious individual at any time during the asymptomatic infectious period. The factor \(1-\rho \) is the probability that an infectious individual is asymptomatic. In this expression, θ represents a reduction in the transmission rate of asymptomatic infectious individuals and \(1/\gamma _{A}\) is the average length of the asymptomatic infectious period.

The effective reproduction numbers associated with the symptomatic infected population, \(R^{I}_{e}(t)\), is the number of secondary cases produced by a symptomatic infectious individual during the infectious period, weighted by ρ, the probability that an infectious individual shows symptoms. Here, \(1/(\gamma _{I}+\delta _{I}(t)+\mu (t))\) is the average length of the symptomatic infectious period. The values for each of these reproduction numbers can be approximated using the parameters estimated by the DRAM algorithm.

Figure 3 shows the effective reproduction number, \(R_{e}(t)\), plotted as a function of time in each of the five stages. The areas from dark to light colors correspond to 50%, 90%, 95%, and 99% CIs. The mean values of for five stages are 3.7070 (95% CI: 3.3890–4.0255), 0.9500 (95% CI: 0.9216–0.9784), 0.0465 (95% CI: 0.0063–0.0867), 1.4470 (95% CI: 1.3990–1.5000), and 1.1605 (95% CI: 1.108–1.2075), respectively.

Figure 3
figure 3

The effective reproduction number, \(R_{e}(t)\), varies with time in the five stages (from January 31 to November 5, 2020). The areas from dark to light colors correspond to 50%, 90%, 95%, and 99% CIs

5 The global sensitivity analysis

We use the PRCC to study the global sensitivity of the parameters of Model (1). According to the estimated results of these parameters in Table 1, we choose a normal distribution for these parameters, where the mean and standard deviation are given in Table 1. The values of PRCCs for each stage are plotted in Figure 4 to show the importance of the parameters. The input parameters and output variables are positively (or negatively) correlated if the values of PRCCs are positive (or negative). The input parameters and output variables are strongly (moderately or weakly) correlated if the absolute values of PRCCs are between 0.4 and 1 (between 0.2 and 0.4 or between 0 and 0.2).

Figure 4
figure 4

The global sensitivity analysis. The light gray areas represent PRCC values corresponding to weak correlation, the dark gray areas represent PRCC values corresponding to moderate correlation, and the white areas represent PRCC values corresponding to strong correlation

In Figure 4, the effective reproduction number, \(R_{e}(t)\), and the transmission rate, \(\beta (t)\), are strongly positively correlated in stages 1, 2, and 3, moderately positively correlated in stage 4, and weakly correlated in stage 5. The effective reproduction number, \(R_{e}(t)\), and the contact rate, \(c(t)\), are strongly positively correlated in stages 1 and 2, moderately positively correlated in stages 3 and 4, and weakly correlated in stage 5. The effective reproduction number, \(R_{e}(t)\), and the rate at which symptomatic infectious individuals are hospitalized, \(\delta _{I}(t)\), are strongly negatively correlated in stages 1, 2, and 5, and weakly correlated in stages 3 and 4. The effective reproduction number, \(R_{e}(t)\), and the quarantined proportion of close contacts, \(q(t)\), are strongly negatively correlated in stage 3, moderately negatively correlated in stage 5, and weakly correlated in stage 4.

6 Predicting the epidemic propagation

Using our model with estimated parameters, we predict the trend of the early COVID-19 epidemic in the UK and evaluate the effectiveness of NPIs. In Figure 5(A), the purple curve shows that the number of confirmed new cases peaks on May 1, 2020, and the peak size is 465,000, if the NPIs of the first stage continue being implemented. The green curve shows that the number of confirmed new cases would be close to zero by November 18, 2022, if the NPIs of the second stage continue. The blue curve shows that the number of confirmed new cases would be close to zero by September 19, 2020, if the NPIs of the third stage continue. The pink curve shows that the number of confirmed new cases with the NPIs of the fourth stage peaks on December 25, 2020, and the peak size is 69,400. The cyan curve shows that the number of confirmed new cases peaks on January 31, 2021, and the peak size is 39,890 with the NPIs of the fifth stage.

Figure 5
figure 5

The impact of different NPIs on epidemic propagation. (A) The impact of different NPIs on the number of confirmed new cases. (B) The impact of different NPIs on the cumulative number of confirmed cases. (C) The impact of different NPIs on the daily number of deaths. (D) The impact of different NPIs on the cumulative number of deaths. The purple, green, blue, pink, and cyan curves show the number of confirmed new cases with the NPIs of the five stages in (A), the cumulative number of confirmed cases with the NPIs of the five stages in (B), the daily number of deaths with the NPIs of the five stages in (C), the cumulative number of deaths with the NPIs of the five stages in (D), respectively

In Figure 5(B), the purple curve shows that the final cumulative number of confirmed cases would be around 13,260,000, if the NPIs during the first stage continue. The green curve shows that the final cumulative number of confirmed cases would be around 817,100, if the NPIs in the second stage remain. The blue curve shows that the final cumulative number of confirmed cases would be around 330,700, if the NPIs in the third stage remain. The pink curve shows that the cumulative number of confirmed cases would be around 16,760,000 by the end of 2022, if the NPIs in the fourth stage remain. The cyan curve shows that the cumulative number of confirmed cases would be around 13,660,000 by the end of 2022, if the NPIs in the fifth stage remain.

In Figure 5(C), the purple curve shows that the daily number of deaths peaks on May 2, 2020, and the peak size is 10,030 if the NPIs of the first stage continue. The green curve shows that the daily number of deaths would be close to zero by June 11, 2022, if the NPIs of the second stage continue. The blue curve shows that the daily number of deaths would be close to zero by September 7, 2020, if the NPIs of the third stage continue. The pink curve shows that the number of confirmed new cases peaks on January 5, 2021, and the peak size is 598 with the NPIs of the fourth stage. The cyan curve shows that the number of confirmed new cases peaks on February 10, 2021, and the peak size is 487 with the NPIs of the fifth stage.

In Figure 5(D), the purple curve shows that the final cumulative number of deaths would be around 317,100, if the NPIs during the first stage continue. The green curve shows that the final cumulative number of deaths would be around 125,900, if the NPIs in the second stage remain. The blue curve shows that the final cumulative number of deaths would be around 41,360, if the NPIs in the third stage remain. The pink curve shows that the cumulative number of deaths would be around 185,000 by the end of 2022, if the NPIs in the fourth stage remain. The cyan curve shows that the cumulative number of deaths would be around 203,900 by the end of 2022, if the NPIs in the fifth stage remain.

The findings reveal the significant impact of NPIs on the dynamics of the disease spread. The higher peaks of confirmed new cases and daily deaths associated with the NPIs of the first stage emphasize the importance of implementing timely and stringent measures to mitigate the spread of the disease. The lower cumulative number of cases and deaths in the second and third stages underscores the effectiveness of NPIs in containing and managing the outbreak. Particularly, the lowest cumulative number of cases and deaths in the third stage signifies the critical role of NPIs at this phase. However, the proportion of deaths in the five stages is 2.4%, 15%, 13%, 1.1%, and 1.5%, respectively, which corresponds to the mortality rate \(\mu (t)\) in the five stages, as shown in Figure 9. We found that although the cumulative number of cases and deaths in the second and third stages is lower than in other stages, the proportion of deaths is higher than in other stages. Moreover, the effectiveness of close contact tracing and lockdown measures in reducing the peaks of confirmed new cases and daily deaths highlights the proactive measures that can be taken to curb the impact of the disease. These findings emphasize the critical importance of implementing targeted and robust NPIs, alongside proactive contact tracing and stringent lockdown measures, in safeguarding public health and reducing the burden on healthcare systems.

From November 5, 2020, the community in the UK implemented lockdown again, which could reduce the transmission rate, \(\beta (t)\), and the contact rate, \(c(t)\). In Figures 11 and 6, we simulate the effect of lockdown.

Figure 6
figure 6

The impact of reducing the contract rate, c, on epidemic propagation. (A) The impact of decreasing the contract rate, c, on the number of confirmed new cases. (B) The impact of decreasing the contract rate, c, on the cumulative number of confirmed cases. (C) The impact of decreasing the contract rate, c, on the daily number of deaths. (D) The impact of decreasing the contract rate, c, on the cumulative number of deaths

In Figure 11, the cyan curves display the daily number of confirmed cases in (A), the cumulative number of confirmed cases in (B), the daily number of deaths in (C), and the cumulative number of deaths in (D), assuming the NPIs in the fifth stage remain in effect with a given transmission rate (\(\beta =0.0658\)). The purple, green, and blue dotted lines represent the projected statistics for the number of confirmed new cases in (A), the cumulative number of confirmed cases in (B), the daily number of deaths in (C), and the cumulative number of deaths in (D) when the transmission rates are reduced by 5%, 15%, and 45%, respectively. From a qualitative perspective, it is anticipated that by December 31, 2022, under the scenarios of reduced transmission rates, there will be noticeable decreases in the cumulative number of confirmed cases and deaths. Specifically, a 5% reduction in the transmission rate is expected to lead to a moderate decrease in both confirmed cases and deaths. A 15% reduction is projected to result in a significant reduction, while a 45% reduction is anticipated to have a substantial impact, with considerable decreases in both confirmed cases and deaths. These findings suggest that implementing measures to lower the transmission rate can have a positive influence on controlling the spread of the epidemic and mitigating its impact on public health.

In Figure 6, the cyan curves show the daily number of confirmed cases in (A), the cumulative number of confirmed cases in (B), the daily number of deaths in (C), and the cumulative number of deaths in (D) with NPIs at stage five remaining constant (\(c=11.318\)). The purple, green, and blue dotted lines show the same variables when contact rates are reduced by one (e.g., \(c=11.318-1\)), two (e.g., \(c=11.318-2\)), and three (e.g., \(c=11.318-3\)), respectively. By December 31, 2022, the cumulative number of confirmed cases will reduce by 30.63%, 61.54%, and 77.02%, respectively, and the cumulative number of deaths will reduce by around 25.06%, 50.37%, and 63.05%, respectively. We can anticipate that a reduction in contact rates will lead to a gradual decrease in the daily number of newly confirmed cases and daily deaths. This implies that altering the contact rates will slow down the spread of the disease and lower the overall number of cases. Moreover, the growth rates of cumulative confirmed cases and cumulative deaths will also be influenced, with larger reductions in contact rates resulting in more significant decreases. The qualitative analysis suggests that reducing contact rates will have a positive impact on disease transmission and case numbers, potentially slowing the spread of the outbreak and reducing mortality rates.

Next, we test the impact of the rate at which symptomatic infectious individuals are hospitalized and the quarantined proportion of close contacts on epidemic propagation. In Figure 12, we simulate the effect of increasing the rate of hospitalization for symptomatic infectious individuals, \(\delta _{I}(t)\). The cyan curves show the daily number of confirmed cases in (A), the cumulative number of confirmed cases in (B), the daily number of deaths in (C), and the cumulative number of deaths in (D) with NPIs at stage five remaining constant (\(\delta _{I}=0.0309\)). The purple, green, and blue dotted lines represent the same variables when \(\delta _{I}(t)\) increases by 1.5, 2.5, and 5.5 times, respectively. We found that increasing the rate at which symptomatic infectious individuals are hospitalized can significantly affect the epidemic propagation. A higher hospitalization rate can lead to a decrease in the daily number of confirmed cases and deaths, as well as a reduction in the cumulative number of confirmed cases and deaths over time. Specifically, a higher hospitalization rate can result in a more rapid isolation and treatment of infected individuals, which in turn can reduce the spread of the disease within the community. This can lead to a decline in the overall number of cases and deaths, as well as a slower growth rate in the cumulative numbers. Furthermore, by quarantining a higher proportion of close contacts, there can be a further reduction in the transmission of the disease. This can potentially limit the number of new cases and decrease the overall impact of the epidemic.

In Figure 7, we simulate the effect of increasing the quarantined proportion of close contacts, \(q(t)\). The cyan curves show the daily number of confirmed cases in (A), the cumulative number of confirmed cases in (B), the daily number of deaths in (C), and the cumulative number of deaths in (D), if the NPIs in the fifth stage remain (\(q=0.8231\)). The purple, green, and blue dotted lines represent the number of confirmed new cases in (A), the cumulative number of confirmed cases in (B), the daily number of deaths in (C), and the cumulative number of deaths in (D), when the quarantined proportion of close contact individual, \(q(t)\), increases by 1%, 3%, and 5%, respectively. We found that increasing the proportion of quarantine for close contacts may have a significant impact on the spread of the epidemic. By isolating close contacts more widely, it may effectively reduce the spread of the virus within the community, thereby lowering the daily number of confirmed cases and deaths. A higher quarantine proportion may lead to the cutting off of more potential chains of virus transmission, thus slowing the spread of the epidemic. This could potentially decrease the number of new cases and reduce the risk of infection within the community. Additionally, increasing the quarantine for close contacts may limit the continued spread of the epidemic, thereby reducing the strain on medical resources and systems. This could help in reducing the cumulative number of confirmed cases and deaths.

Figure 7
figure 7

The impact of increasing the quarantined proportion of close contact individuals, q, on epidemic propagation. (A) The impact of increasing the quarantined proportion of close contact individuals, q, on the number of confirmed new cases. (B) The impact of increasing the quarantined proportion of close contact individuals, q, on the cumulative number of confirmed cases. (C) The impact of increasing the quarantined proportion of close contact individuals, q, on the daily number of deaths. (D) The impact of increasing the quarantined proportion of close contact individuals, q, on the cumulative number of deaths

Finally, we predict the epidemic propagation under different scenarios. In Figure 8, the cyan curves show the daily number of confirmed cases in (A), the cumulative number of confirmed cases in (B), the daily number of deaths in (C), and the cumulative number of deaths in (D), if the NPIs in the fifth stage remain (\(\beta =0.0659\), \(c=11.318\), \(\delta _{I}=0.0309\), and \(q=0.8231\)). The purple dotted lines represent the number of confirmed new cases in (A), the cumulative number of confirmed cases in (B), the daily number of deaths in (C), and the cumulative number of deaths in (D), when \(\beta =0.95\times 0.0659\), \(c=11.318-1\), \(\delta _{I}=0.0309\), and \(q=0.8231\). The green dotted lines represent the number of confirmed new cases in (A), the cumulative number of confirmed cases in (B), the daily number of deaths in (C), and the cumulative number of deaths in (D), when \(\beta =0.55\times 0.0659\), \(c=11.318-3\), \(\delta _{I}=0.0309\), and \(q=0.8231\). The blue dotted lines represent the number of confirmed new cases in (A), the cumulative number of confirmed cases in (B), the daily number of deaths in (C), and the cumulative number of deaths in (D), when \(\beta =0.3\times 0.0659\), \(c=11.318-6\), \(\delta _{I}=1.7\times 0.0309\), and \(q=1.03\times 0.8231\), where the values of β, c, and \(\delta _{I}\) are approximate to the second stage (\(\beta =0.0197\), \(c=5.6702\), and \(\delta _{I}=0.0536\), as shown in Table 1).

Figure 8
figure 8

Predicting the epidemic propagation. (A) Predicting the number of confirmed new cases. (B) Predicting the cumulative number of confirmed cases. (C) Predicting the daily number of deaths. (D) Predicting the cumulative number of deaths

The predictions suggest that implementing different levels of interventions can have a significant impact on reducing the spread of the disease and lowering the associated mortality by December 31, 2022. Under the purple scenario, which represents a moderate decrease in transmission and contact rates, there is an expected decrease in the cumulative number of confirmed cases by approximately 48.32%. This indicates that even small reductions in transmission and contact rates can lead to a noticeable decrease in the spread of the disease. The green and blue scenarios, with more substantial decreases in transmission and contact rates as well as an increase in isolation rate, show even larger reductions in the cumulative number of confirmed cases, by approximately 87.25% and 88.78% respectively. This suggests that implementing more aggressive interventions can result in a drastic reduction in the spread of the disease within the population. Furthermore, under all scenarios, there is an expected decrease in the cumulative number of deaths by approximately 39.53%, 71.44%, and 72.73% under the purple, green, and blue scenarios, respectively. This indicates that these interventions can also have a positive impact on reducing mortality associated with the disease, and the more intensive interventions have a greater potential for saving lives.

In summary, the findings indicate that even small reductions in transmission and contact rates can lead to a noticeable decrease in the spread of the disease, while more aggressive interventions can result in a significant reduction in both the spread of the disease and its associated mortality. These results highlight the potential impact of different levels of interventions in managing the disease outbreak.

7 Discussion and conclusions

We fitted our compartmental differential equation model to the number of confirmed new cases and daily number of deaths in the five stages from January 31 to November 5, 2020.

The model projections indicated that if the NPIs in the fifth stage remained, the epidemic could last more than two years. In this case, the cumulative number of confirmed cases would be around 13,660,000, and the cumulative number of deaths would be around 203,900 by December 31, 2022.

The simulations also showed that if community lockdown started from November 5, 2020, the transmission rate, the contact rate, and the rate at which symptomatic infectious individuals are hospitalized were approximately equal to those in the second stage of the most strict NPIs, and the proportion of quarantined close contacts increased by 3%, then the epidemic could die out as early as January 12, 2021, the final cumulative number of confirmed cases would be around 1,533,000, as shown in Figure 8(B), and the final cumulative number of deaths would be around 55,610, as shown in Figure 8(D). On November 5, 2020, the UK did not implement NPIs as strictly as the second stage. Hence, we assumed the transmission rate reduced by 45%, the contact rate reduced by 26.5%, the hospitalization rate and proportion of quarantined close contacts were the same as those in the fifth stage, then the epidemic could die out around July 4, 2021, the final cumulative number of confirmed cases would be around 1,741,000, and the final cumulative number of deaths would be around 58,240. Finally, we assumed that the UK implemented weak interventions from November 5, 2020, that were, the transmission rate decreased by 5%, the contact rate reduced by 8.84%, the hospitalization rate and proportion of quarantined close contacts were the same as those in the fifth stage, then the epidemic could last more than two years, the cumulative number of confirmed cases would be around 7,060,000, and the cumulative number of deaths would be around 123,300 by December 31, 2022. Even if the weak NPIs were implemented, the cumulative number of confirmed cases would reduce by 48.32%, and the cumulative number of deaths would reduce by around 39.53% by December 31, 2022. Hence, the NPIs could save a large number of lives.

Here, we demonstrated data-driven studies, assuming that multiple NPIs including but not limited to quarantine, contact tracing, social distancing, self-isolation, and community lockdown. The health and economic impacts of different interventions need to be balanced in the short and long term in any society. The epidemiological data implied that no country had yet seen enough infections to prevent the second wave of transmission by herd immunity if lockdown interventions were relaxed by November 2020.

The dynamics of person-to-person transmission are mainly driven by contacts between individuals [34, 35], which can be heterogeneous due to age and location, and to viral loads of infected individuals. Once the travel restrictions are lifted, human mobility will contribute to the rapid epidemic propagation within the UK [36].

In our study, we found that NPIs such as quarantine, contact tracing, social distancing, self-isolation, and community lockdown play a crucial role in controlling the spread of epidemics. This is consistent with the findings of previous studies [3739]. The data-driven studies presented in our study highlight the potential of these interventions to significantly impact the trajectory of the epidemic. It is clear that while these NPIs can have profound health benefits in terms of reducing the cumulative number of confirmed cases and deaths, the economic and social impacts also need to be considered. Our simulation results emphasize the need to balance the short-term and long-term effects of these interventions on both health and the economy. Additionally, the study underscores the importance of continued vigilance and caution, as lifting lockdown interventions too early could potentially lead to a second wave of transmission. Overall, the dynamic nature of person-to-person transmission and the influence of human mobility after travel restrictions are lifted further emphasize the significance of effective NPIs in mitigating epidemic propagation.

These projections all depend on the sensitivity of the simulations to the model assumptions. Our model did not account for the heterogeneity in the behavior of individuals, the population density, or the disease mortality as a function of age. We are continuing to refine our model to understand the impact of these effects and capture them more realistically in future models.