Introduction

Coronaviruses have long been an endemic source of human and animal infections. SARS-CoV-2 appeared in late 20191,2,3, causing the potentially fatal illness COVID-19. Due largely to human mobility patterns and infection characteristics, SARS-CoV-2 became a pandemic4, which a year after emergence continues to wreak public health, social, and economic havoc across the globe. While a variety of strategies have been implemented to attempt to control the spread of disease5,6, the death toll continues to rise and infection rates are uncontrolled in many parts of the world.

Knowing the extent of population exposure to SARS-CoV-2 has important implications for public policy measures, including ongoing efforts to contain infection7. Yet, while SARS-CoV-2 has an extensive global reach, the portion of populations currently and previously infected remains unclear. Viral assays only capture recent infections, and even then can be unreliable8. Serological antibody testing to ascertain levels of prior exposure is burdensome, with unreliable testing validity and the potential to lead to systematic under-counting9,10,11, with corrections to test sensitivity being subject to spectrum bias12. Tracking epidemic data across populations and evaluating the time evolution of key markers such as death rates provides a powerful complementary approach to well-designed seroprevalence studies.

For this analysis, we evaluate the growth and decline of case numbers during the well-defined epidemic phases. Generally, when a fully susceptible population is exposed to a new and contagious pathogen, an epidemic starts with an initial seeding phase when a small number of individuals are infected. Unconnected local outbreaks with unpredictable spread can occur due to variability in individual behaviour, the effects of potential ‘super-spreading’ events, and the non-uniform success of containment measures. This is followed by a phase of free exponential growth, during which ‘track and trace’ based containment is no longer feasible. Exponential spread of infections \(I(t) = {\text {e}}^{(\beta -\gamma )t}\) over time t happens at a rate \(1/\tau = \beta -\gamma > 0\), where \(\tau\) is the exponential time constant, when the average rate of new secondary infections caused by each primary infectious individual (\(\beta\)) exceeds the rate of recovery (\(\gamma\)). Note that here ‘recovering’ is defined as ceasing to be both infectious and susceptible. By reported daily death counts13, the disease dynamics of COVID-19 have fulfilled the condition of exponential growth and decline in nearly all parts of the world. For four countries highly affected by COVID-19 and with reliable reporting indices, we utilise this epidemic data to estimate the fraction of sub-populations which remain susceptible to SARS-CoV-2, which is a key parameter governing the course of the epidemic.

Results

While onset of the exponential phase varies due to different seeding times, we find that the time constant of exponential growth is very similar over a wide geographical range. Figure 1A illustrates this using data14 from four highly-affected countries: Spain, Italy, the United Kingdom, and the United States.

The exponential growth is sustained as long as S, defined as the ratio of susceptible individuals to the whole population, is close to 100%, and \(\beta\) and \(\gamma\) are kept constant. Slowing spread and ultimately reversal to an exponential decline can be driven by an increase of \(\gamma\), through interventions such as anti-viral therapies that reduce infectiousness time. Such therapies were not available for COVID-19 during the study period. Consequently, reductions of new COVID-19 infections can only be attributed to reductions in either \(\beta\) or S. Public health interventions aim to reduce \(\beta\) through hygiene, physical distancing, and mobility restriction measures designed to reduce contact frequencies5,6,15. In the following analysis we argue changes in COVID-19 disease dynamics are due to reductions of \(\beta\) and S, with different relative importance across geographic areas. In particular, a significant reduction of S is observed in the main metropolitan regions of the four countries investigated here.

Figure 1
figure 1

(A) Reported number of daily deaths due to SARS-CoV-2 infection (smoothed by a moving average over 7 days), over the initial phase of the outbreak in early-2020, for four highly affected countries (see “Methods”). The dashed line (shaded area) represents the mean (range of) early exponential growth with a time constant \(\tau = 3.4(2)\,\) days. The lines are spline interpolations to guide the eye. (B) Comparison of different metrics that potentially affect infection contact rates, shown for the example of the whole of the United Kingdom and London, demonstrating their time-dependence during the first introduction of public health interventions in early-2020.

Observed impact of susceptibility reduction on infection dynamics

Variability in the nature and timing of public health interventions and resulting population behaviours are eventually reflected in variations across countries in the slowdown and exponential decline of the daily death counts. Figure 1B illustrates, for the specific example of the United Kingdom and London, the timelines of public awareness (measured through frequencies of relevant internet search queries16), government restriction stringency (measured as aggregate ‘Oxford’ stringency index17), and visits to public meeting places (restaurants18, grocery stores, pharmacies, and public transit stations19). Qualitatively, all measures correlate with one another, with both voluntary behavioural changes and the stringency of public health interventions resulting in reduced frequency of visits to locations of high infection risk.

While it is difficult to quantitatively associate a time-dependent reduction of \(\beta (t)\) with these data, the observed location data in Fig. 1B (also shown for additional investigated areas in Supplementary Fig. S1), as well as the representative transit station data in Fig. 2 (lower panels), indicate that the public response between areas within one country is similar in timing and magnitude of deviation from baseline. However, the seeding of the epidemic in each country’s main metropolitan region (Italy-Milan/Lombardy; Spain-Madrid; England-London; USA-New York City) consistently occurs earlier than in the rest of the corresponding country, as exhibited in Fig. 2 (upper panels). As a consequence, the time period between initial epidemic onset and reduction of \(\beta\) taking place (driven by public behaviour change) differs in a situation in which public behaviour is modified in similar ways and at similar times. We therefore consider pairwise comparisons of main metropolitan regions and rest of countries a suitable tool to investigate the relative impact of reductions in S. Note that for our analysis of Italy we focus on the Lombardy region, whose population is dominated by Milan—the country’s largest metropolitan area. In addition, we used mortality data for England as a comparison for London, with the assumption that public health interventions in London best match those of England out of the whole of the UK.

Figure 2
figure 2

Population-normalised reported daily deaths due to SARS-CoV-2 infection (on a logarithmic scale) over the initial outbreak in early-2020, broken down into most populous metropolitan region (blue) and surrounding country (red). Data is shown together with fits and confidence intervals (see “Methods” for details). The stringency index is grey-scaled in the background, and has been shifted in time by \(+20\) days to approximately indicate when the effect of public restrictions on death rates could begin to become measurable. This shift includes the expected period of 5 days from exposure to symptom onset, plus the subsequent mean period of 15 days from onset to death, reported in the literature20. Public transit station use (bottom panels) over the same period is also shown for comparison, with faded lines for real-time data and bold lines after applying the same expected exposure-to-death time-shift described previously.

Figure 2 (upper panels) shows reported daily death counts14,21 for the four investigated countries, broken down into the data from the main metropolitan region and the rest of the country. Each of the corresponding eight data sets shows an initial exponential growth phase, followed by a transition to an exponential decline. Seeding of the epidemic consistently happened earlier in the metropolitan regions than elsewhere in each country, leading to higher population-normalised daily death counts in these areas. Yet, the time constant of the exponential growth is equal within errors for each metropolitan-country pair (except in the United States due to its heterogeneity), suggesting that the contact rate among a fully susceptible population is not critically dependent on factors related to population density22,23. In contrast, the subsequent decline is consistently but to a varying extent faster in the metropolitan regions than in the rest of the country. Together with the notably earlier transition from the exponential growth phase and the observation that public policies within countries were similar, this indicates that the reduction of S played a more important role in the metropolitan regions relative to the rest of the country in all four cases. The extracted exponential growth and decline time constants are given in the Supplementary Table S1.

Quantitatively, the time constants \(\tau _m\) and \(\tau _c\) of the exponential decay phase provide a relative measure of the reduction of susceptibility in each metropolitan region (\(S^m\)) compared with the remainder of the country (\(S^c\)). The observation of an extended period of purely exponential decline, together with a stable interval of public health interventions, shown in Figs. 1B and 2 (lower panels), (even when taking into account a delayed impact of \(\approx 20\) days of their effect on the daily death counts20,24), implies that both \(\beta (t)\) and S(t) do not vary much during this phase. In this case, the decay time constants can be used to yield the ratio of fractions of the population that remain susceptible to the virus, provided \(\beta =\beta _m=\beta _c\) remains equal across each country25. From the fits presented in Fig. 2, we find for the ratios \(S^m/S^c\) values of \(0.86\pm 0.05\) (Madrid/Spain), \(0.89\pm 0.04\) (Lombardy/Italy), \(0.70\pm 0.06\) (London/England), and \(0.59\pm 0.05\) (New York/United States). These ratios represent upper bounds for \(S^m\), since \(S^c\) is by definition \(\le 1\). Note that if instead \(\beta ^m\ne \beta ^c\), then the ratio of susceptibilities will be corrected by a factor \(\beta ^c/\beta ^m\) (see “Methods”).

Figure 3
figure 3

The peak daily death counts from early-2020 plotted against the delay between public behaviour change and the deviation from exponential growth leading up to the peak. A correction factor from excess death data was applied to the reported rates (open circles) to derive the expected rates (solid circles), which are linked. The four countries exclude the corresponding metropolitan regions. The inset illustrates, for the example of England, a death count calibration by comparing reported COVID-19 deaths to the all-cause excess death data.

We directly compare the relative timing of the epidemic evolution with public policy and behaviour changes (Fig. 2). The results corroborate the finding that a reduction of S had already begun to reduce new infection rates in the metropolitan regions when public behaviour changes started to have a significant impact. The difference in the effect of S for all eight geographic areas is shown in Fig. 3, where the peak number of daily deaths is plotted against the time delay between when visits to public meeting places are reduced and when death rates begin to slow (see “Methods”). The country-level data cluster around a common delay time that is similar to the reported average time between viral exposure and death20,24. This suggests that the main driver for the initial reduction in death (infection) rates is changes in public behaviour. This is not the case for the metropolitan region data, which in turn cluster around a common peak daily death count. This clustering suggests that the larger early spread of the disease was sufficient to reduce infection and death rates through reduction of S in addition to the effects based on behavioural changes.

Modelling and forecasting

To further investigate the dynamics of the epidemic in each area, we employ the well-established compartmental Susceptible-Infected-Recovered (SIR) model26,27, and solve the differential equations simultaneously for each pair of country and metropolitan regions, with coupling between the two (see “Methods” for details). In this model, the population is assumed to consist of homogeneously interacting individuals, neglecting typically higher level of potentially infectious contacts within sub-communities than between such communities. This assumption can lead to systematic underestimates of fractions of residual population susceptibility. The solutions for each area are used to estimate S and the time dependence of it, as well as to explore the potential future evolution of the infection dynamics. We incorporate public behaviour changes into the system by adjusting the trajectory of \(\beta (t)\) equally for each metropolitan region and country pair, informed by Fig. 2 (lower panels). To evaluate the effect of initial public health measures on S, we only include the death counts until mid-May 2020, to avoid any effects of the relaxation of restrictions on the data.

In contrast to the model-independent slope comparison from the previous section, solving the model equations also provides absolute values for the susceptibility and infection fatality rate \(\mu\) in each area, which are shown together in Table 1. Using both approaches, we consistently find a comparatively weaker reduction in S outside the early-seeded metropolitan regions.

Table 1 Results of solving the SIR model with a time-dependent contact rate in each area.
Figure 4
figure 4

SIR model and forecast throughout 2020 for London (blue) and the rest of England (red). Adjusted daily death counts are shown pre and post relaxation of interventions. Note that the vertical axis is displayed logarithmically. The hypothetical scenarios of no reduction in susceptibility S (a) and of no interventions (b) are displayed. Extrapolations for scenarios of continued stringent interventions (c), of ending interventions (d\(_1\)), and of a partial relaxation of stringency consistent with post-relaxation observations (d\(_2\)) are shown. Inset shows the reduction of S for London and the rest of England, together with fractions of negative serology testing for London.

Figure 4 shows the result of the model for the specific example of England and London, demonstrating that public policy interventions helped cap the death counts in both cases, but that a reduction of S is also a significant driver, particularly in London. Without relaxation of initial interventions, the model projects that containable infection levels would have occurred in mid-July and mid-August 2020 in London and the rest of England, respectively (solid lines c in Fig. 4). Note that this threshold was chosen to be \(\lesssim 10\) new daily cases per million population, in accordance with infection levels in countries with well-developed public health infrastructure (e.g. New Zealand, South Korea, Cuba, and Fiji), with proactive testing and ‘track and trace’ interventions, resulting in lasting containment approaching elimination of locally-acquired cases. In reality, a relaxation of restrictions (early-June 2020) and easing of travel restrictions were initiated. As a result, \(\beta\) increased, and slowed down the decline in death rates. In a forward projection based on a continuation of these new conditions with no further changes to \(\beta\), containable infection levels in London and the rest of England could have been reached in mid-August and the end of 2020, respectively (dash-dotted lines d\(_{2}\)). As further relaxations are underway, these dates may be further delayed if no compensating changes in public behaviour are introduced to maintain the early-July value of \(\beta =0.135\ {\text {day}}^{-1}\). We note that this value is already greater than the observed value of \(\gamma\) (see “Methods”), and thus would have been enough to drive another period of exponential growth in the absence of any reduced susceptibility effect.

We also model the hypothetical scenario of a full return to pre-intervention conditions, constituting the second wave (dash-dotted lines d\(_{1}\)). In this projection, infection rates begin to rise but the second peak for the metropolitan regions is reduced with respect to the first peak, while in contrast the second peak for countries is of higher magnitude than the first one, due to the fact that the height of the peak scales with S.

In Fig. 4 (inset) the resulting evolution of S(t) from the model for London and the rest of England is shown, together with the results of serology antibody tests for London28, which have recently been suspected to underestimate the actual (yet potentially temporary) immunity levels due to the lack of sensitivity of the tests to T-cell based immunity10. In both the slope-comparison method as well as the method based on the numerical solution of the coupled SIR model, the metropolitan regions exhibit a lower absolute value of susceptibility when compared with the rest of their respective countries.

Discussion

We have found that the dynamics of the SARS-CoV-2 pandemic have been influenced both by policy interventions and by reduced population susceptibility, with a relatively stronger contribution from susceptibility changes in early-seeded metropolitan regions. This difference appears to arise solely from earlier seeding of the disease in these areas. Our data provide a method of upward revision to the reported prevalence estimates derived from laboratory testing. In addition, while infection and thus death rates are a function of S and \(\beta\), the effect of S reduction is more durable, and \(\beta\) will rise whenever restrictions are relaxed unless protective measures prove to be fully compensatory. If they are not, a second peak may occur. The reduction in S would not necessarily avoid this, but would mitigate peak height. Forward projections based on the observed data also indicate that later-seeded areas are relatively more vulnerable in a second wave, which has implications for the distribution of healthcare resources. It is important to note that the first peaks in the scenario modelled without any interventions are significantly higher, and could lead to hospital admission numbers with a real risk of exceeding available capacities, which would in turn lead to an over-proportionally large negative public health impact with even larger death counts.

Our model assumes a homogeneously mixing population. Yet, real-life infection spreading is heterogeneous, with some sub-regions of cities or neighbourhoods more heavily affected than others, as well as demographic or geographical groups mixing more strongly internally than between each other. As a result of these heterogeneities, susceptibility-mediated effects on epidemic dynamics can occur at lower exposure levels than would be required for a homogeneously mixed and exposed population29. As a consequence, our absolute estimates of remaining susceptibilities in each region may need to be interpreted as effective values that cannot be used to directly infer cumulative fractions of the populations having been infected with the virus. For example, the figures for S reported in Table 1 cannot be read as 19\(\%\) (45\(\%\)) of the population in London (rest of England) have been exposed to SARS-CoV-2. Instead the observed epidemic evolution in the heterogeneous population follows a pattern that would be expected for a hypothetical homogeneous population in which the exposure rates had reached these levels.

The ratios of effective exposures between the areas are likely to represent the true exposure ratios. For example, we find that the fraction of the population that has been exposed to SARS-CoV-2 is 2.4 times greater in London than in the rest of England. For comparison, the REACT230 study of antibody prevalence finds a very similar ratio of 2.8 between London and England (excluding London) exposure. Similar calculations for Madrid/Spain and Lombardy/Italy yield ratios of 3.4 and 5.3, respectively, which are consistent with the values of 3.331 and 5.032 produced by each country’s corresponding national seroprevalence studies from the relevant time period. While our absolute effective exposure levels are higher than the actual levels, the REACT2 study could underestimate current immunity levels by a factor of two or higher, as T-cell based immunity can exist with negative laboratory antibody response10. This factor of underestimated population immunity levels is time-dependent, as waning of antibody levels cannot be assumed to coincide with or be proportional to the timescales of waning immunity. The use of low blood volume finger-prick rather than standard high-volume samples derived from venous puncture, as well as testing bias introduced through inhomogeneous test responsiveness in the selected test population (albeit corrected for through post-stratification), are further limitations of the REACT2 study. The intrinsic underestimate of the remaining susceptible fraction of the population in our approach and the systematic overestimate of antibody serology studies such as the REACT2 study show that best estimates of the true value of S will benefit from a combination of both methods. While both methods are likely to provide more reliable quantitative information on relative values of S between different regions, a combined approach based on both population statistics and sufficiently large-scale serosurveys could be helpful deriving robust absolute values of S.

Our modelling also assumes lasting immunity following acute infection, in keeping with normal immunological responses to viral infections33. Yet, for the purposes of our projections, immunity need only be preserved for as long as infection rates are uncontrollable. The level used for ‘controllable disease’ was chosen in accordance with data from countries achieving containment, and assumes a goal of elimination utilising, among other measures, tracking and tracing. It is possible that technological improvements will allow a successful track-and-trace strategy at higher infection levels. Forward-projecting continuations of modest relaxations of restrictions appears to make containment achievable, but observed slow down in the decline of death rates indicates that this will be reached later than would have occurred under more stringent restrictions. Mass vaccination programmes, not yet in existence during the timeframes studied here, can positively influence the infection dynamics. Our approach can be adapted to incorporate these programmes, as vaccinations further reduce S. In the simplest case, the initial value of S is adjusted to the size of the non-vaccinated population. As long as large numbers of vaccinations occur amidst active infection dynamics, the actual rate equations need to be modified.

Methods

Extraction of epidemic parameter values

The key parameters of an epidemic can be extracted from epidemiological data (infection, hospitalisation or death rates), as long as populations are sufficiently large for demographic variations to average out, reported data are reliable, and interventions are relatively uniformly implemented. The similar observed exponential growth illustrates that this condition is fulfilled for the four countries of our study (the United States is an exception in some respects due to the heterogeneity of interventions and population behaviour). A key quantity is the time evolution of number of infected individuals. Due to insufficient quantity and quality of testing however, reported daily deaths D(t) are likely a more reliable measure of COVID-19 pandemic dynamics than the number of confirmed cases. The infection fatality rate (IFR) \(\mu\), assumed to be a constant with time, is the probability that an infected individual dies from the disease and therefore validates D(t) as a proportional measure of infections, and our main findings do not depend on knowing the exact value of \(\mu\).

For our investigated areas, D(t) is obtained from Refs.14,21. Data for metropolitan regions is directly extracted from these sources, whereas data for the remainder of the country is derived as the national minus metropolitan data. Finally, data are normalised to the corresponding populations in each region34.

The initial growth of infections I is observed to be exponential in time \(I(t+\Delta t) = I(t) e^{\,\Delta t/\tau _i}\), demonstrated in Fig. 1A, when the population is assumed to be fully susceptible, and the mean time constant \(\tau _i\) characterising this initial growth is measured to be \(\tau _i = (3.4\pm 0.2)\) days across the four countries. The serial time \(T_s\) is defined as the average time between primary and secondary infections, and in addition the average number of secondary cases arising from a typical primary case is known as the basic reproduction number \(R_0=\beta /\gamma\). This means that, on average, after a time period \(T_s\) has elapsed the fraction of the population infected is scaled up by a factor \(R_0\), and therefore \(I(t+T_s) = I(t) e^{\,T_s/\tau _i} = R_0\, I(t)\), which leads to \(R_0 = \beta /\gamma = e^{\,T_s/\tau _i}\), providing a link between the observed growth rate and the epidemiological rates. Using the clinically determined serial time \(T_s = (4.0\pm 0.4)\) days35,36, we find \(R_0 = e^{\,T_s/\tau _i} = (3.2\pm 0.3)\), which is consistent with the literature6,37,38. It follows that the contact rate is \(\beta = (0.43 \pm 0.02 )\;{\text {d}}^{-1}\) in an unconstrained population, and the recovery rate is \(\gamma = (0.130 \pm 0.013 )\;{\text {d}}^{-1}\), corresponding to a mean infectious period of \((7.7\pm 0.7)\) days, again consistent with the literature39,40.

Turning now to the exponential decay phase, if the fraction of population that is susceptible S is instead different from 1, and has some other constant value, then the contact rate is simply modified accordingly and the final characteristic exponential decay time is given by

$$\begin{aligned} 1/\tau _f=S\beta _f-\gamma \end{aligned}$$
(1)

provided \(\beta _f\) is also constant (for example during a period with little or no change in public behaviour). This allows us to examine the ratio of susceptible populations between a metropolitan region (m) and the rest of its corresponding country (c) through the relation

$$\begin{aligned} \frac{S^{m}\beta ^m_f}{S^{c}\beta ^c_f}=\frac{1/\tau ^{m}_f+\gamma }{1/\tau ^{c}_f+\gamma }, \end{aligned}$$
(2)

by measuring the time constants of the exponential declines. If \(\beta _f\) is further assumed to take the same absolute value in the two areas, this equation simplifies to

$$\begin{aligned} \frac{S^{m}}{S^{c}}=\frac{1/\tau ^{m}_f+\gamma }{1/\tau ^{c}_f+\gamma }, \end{aligned}$$
(3)

providing a direct way to estimate the ratio of relative susceptibilities in each area.

Relating public health interventions (PHIs) to epidemic data

Variations in the stringency of country-wide public health interventions among countries have been extensively analysed by the Oxford group17. These data are not available at a regional (sub-country) level, and are not a direct measure of actual public behavior. We use cellular device location data19 to overcome both these limitations, and to evaluate the effect of PHIs. Such data are available for several high infection risk location categories, including retail and recreation, groceries and pharmacies, parks, transit stations, workplaces, and residential. Progressively introduced use of personal protective equipment, in particular face masks, will generally play a role in reducing \(\beta\) but in the time frame under investigation here this is not likely to be important as recommendations and significant population uptake occurred later in the course of the epidemic. We plot the cellular location data to show the relative change compared to the baseline value from the first five weeks of 2020 for each of the eight areas studied. Due to similarities of the relative timings and changes in magnitude of these measures, in principle any of them could be used as proxy for \(\beta (t)\). Based on the comparably low noise level as well as the consistent and clear functional shape of the location data at public transit stations, we chose these data for our analyses. Transit data, shown in Fig. 2 (lower panels), includes use of subway stations, taxi stands, sea ports, and other travel-related locations.

Empirically we find that the transit station location data are well described by a sigmoid function

$$\begin{aligned} {\text {TS}}(t) = \frac{{\text {TS}}_l}{1 + {\text {exp}}\left[ -(t-t_0)/{\text {TS}}_w \right] }, \end{aligned}$$
(4)

with a zero reference level (by definition the pre-pandemic normal level for early times \(t\ll t_0\)). The remaining fitting parameters characterise the magnitude of the response (\({\text {TS}}_l\)), the time (\(t_0\)) when the public behaviour change happened and time duration (\({\text {TS}}_w\)) over which the change occurred. As a comparable measure of the time when a significant reduction of \(\beta\) is expected to occur, we choose the time \(t_{50\%}\) when \({\text {TS}}(t)\) is reduced by \(50\%\) from normal use as determined by the fit for each area, with the error determined as the one standard deviation functional prediction interval.

As the time evolution of D(t) follows exponential trajectories (growth and decline) during periods of (near) constant parameters \(\beta\), \(\gamma\) and S, we display D(t) on a logarithmic scale and apply fit functions to \(D_{\text {log}}(t)=\log [D(t)]\) to avoid heteroscedastic bias. On the logarithmic scale, the exponential growth and decline phases are transformed to increasing and decreasing linear slopes, respectively. Therefore, as a suitable model function for fitting we use

$$\begin{aligned} D_{\text {log}}(t) = D_u\cdot t + (D_l-D_u)\cdot D_w\cdot \log \left( e^{t/D_w} + e^{t_0/D_w}\right) + C, \end{aligned}$$
(5)

which represents an interpolation between an increasing linear function with slope \(D_u\) for \(t \ll t_0\), and a decreasing linear function with slope \(D_l\) for \(t \gg t_0\). The temporal width of the transition between them is characterised by \(D_w\). This function corresponds to an analytically-integrated sigmoid (which itself is an interpolation between two constant values, representing the constant slopes). We then fit this model to the logarithm of the data in two steps: first, we individually fit the data at early and late times to linear functions using ordinary least-squares regression to obtain \(D_u\) and \(D_l\), respectively. The results of this stage are shown as the dashed lines in Fig. 2 (upper panels). These values are then used as constrained parameters of the full function Eq. (5) in a second fitting stage, the result of which is indicated by the solid lines in Fig. 2 (upper panels). We use the full fits to extract both the peak value of the fit to D(t) and the time \(t_{10\%}\) when D(t) reaches \(10\%\) of the value that would have been reached at the same time if the exponential growth trajectory had continued, indicated by the dotted lines in Fig. 2 (upper panels). This is a measure of when the dynamics has significantly departed from the exponential growth regime, and begins to transition to a decline. This point is located close to the peak, but is not biased by the slope of the exponential decline in the same way that the peak time is. The error of \(t_{10\%}\) is determined again as the one standard deviation functional prediction interval. For all fitting, we restrict data to a time window starting with exponential growth and ending just before the introduction of the first relaxation in PHIs where an uptick in \(\text {TS}(t)\) is observed (mid-May 2020).

We use the relative delay \(\Delta t=t_{10\%}- t_{50\%}\) as a measure of the influence of public behaviour change on \(\beta\) and the rate of new exposure to the virus. The expectation for a PHI-driven scenario, versus an effect of a reduction in S, is that \(\Delta t\) is comparable to the typical time from exposure to death, which is on the order of 20 days20,24.

Corrected death counts

For COVID-19 dynamics, the attributed daily death rates are a more reliable metric than the number of confirmed cases, as they are not dependent on testing practices, and should be less prone to under-reporting. However, daily death rates are still subject to variations in reporting methods and in how each region defines what is classified as a COVID-related death41. Alternatively, excess deaths can be calculated by examining how many additional deaths (from all causes) have occurred above a baseline value that would be expected for the same time of year had the epidemic not happened13. While excess deaths statistics include deaths indirectly related to COVID-19 (for example, ‘collateral’ deaths due to healthcare systems being overwhelmed, or reduced visits to emergency departments), and the baseline value for expected deaths may not exactly reflect the current situation42 (for example, in lockdown conditions the number of road traffic accidents will be lower than typical historical values), the number of excess deaths is nevertheless widely agreed to be the most reliable indicator to reflect the state of the epidemic, and alleviates some of the shortcomings associated with the reported daily deaths data41,43.

We compare the time evolution of D(t) (aggregated to weekly level) to that of weekly excess deaths during the spring of 2020 relative to the median value of the historical data up to the past five years, where available44. Accounting for the delay that it takes in processing and registering the deaths in each case, we apply a constant correction factor to the reported daily deaths. These factors faithfully reflect the registered excess deaths, and by area were 1.9 (Italy), 2.2 (England), 1.4 (Spain), 1.5 (US), and we found that these same factors applied equally well to each of the corresponding metropolitan regions, respectively. For the purpose of fitting analysis, we continue to work with the scaled reported deaths because these data are available at a daily granularity, whereas the excess deaths are only provided weekly. The scaling of the data is shown in Fig. 3 (inset) for the example of England.

SIR model with time-varying \(\beta (t)\)

Substantial efforts have gone into describing and predicting epidemic dynamics of many diseases, especially of COVID-19, through Monte Carlo-type discrete45,46,47 or continuous differential equation models48,49. Typically, these models require many input parameters such as fine-grained demographic and populations’ behavioural details. They are often critically dependent on these parameters with corresponding large variations between studies in terms of predicted outcomes and inevitable divergence from actual observations made after the forecasts50. In our work we instead aimed to understand the reduction of S and its impact in a largely model-independent way. In order to still derive quantitative trends and estimates, we chose the SIR model as the simplest epidemic model that takes time-dependent susceptibility (immunity) into account.

The well-established SIR framework is a deterministic model using a system of coupled ordinary differential equations, and is based on the compartmentalisation of populations into the fraction of individuals who are susceptible, S, infectious, I, and no longer susceptible (removed), R. The R compartment includes individuals who have recovered from disease and survived, along with those who have died. The rate coefficients \(\beta\) and \(\gamma\) describe the flow from the S to the I and from the I to the R compartments, respectively, and the evolution of the system is described by the set of differential equations

$$\begin{aligned} {\dot{S}}&=-\beta S I \end{aligned}$$
(6)
$$\begin{aligned} {\dot{I}}&=(\beta S-\gamma )I \end{aligned}$$
(7)
$$\begin{aligned} {\dot{R}}&=\gamma I, \end{aligned}$$
(8)

where the dot indicates a derivative with respect to time. These equations have previously been solved analytically for a time-independent \(\beta\)51. Here we do not include any background loss rate from disease-induced mortality, since its effect would be negligible compared with the epidemic rates - particularly in the early phase of the outbreak studied in this paper, when the total number of deaths is relatively small compared with the overall population size.

Since public policy interventions and public behaviour changes (whether or not they are influenced by these interventions) affect the value of \(\beta\), we introduce a time dependence of that parameter into the model \(\beta =\beta (t)\). Since no known intervention affects the duration of infectiousness, we treat \(\gamma\) as constant. Following the empirically found functional shape of the public response as representatively measured through use of public transit, we also model \(\beta (t)\) as a sigmoid with

$$\begin{aligned} \beta (t)=\beta _i+\frac{\beta _f-\beta _i}{1+\exp [-(t-t_{\text {int}})/t_w]}. \end{aligned}$$
(9)

The shape of this function implies that there was only one period of relevant change to \(\beta\) from its unrestricted initial value \(\beta _i\) to a later final value \(\beta _f\). This is justified as long as public policy relaxations do not take effect on D(t) and as long as compliance with them remains high (constant). We therefore restrict our main analysis to times that are not affected by such changes and only consider D(t) data until the end of May 2020, as no significant corresponding changes in \({\text {TS}}(t)\) are discernible prior to early-May (considering the delay between exposure and death).

This simple model is based on several idealised assumptions. It does not account for inhomogeneity of the population or of the mixing dynamics29, and so stochastic, regional and demographic effects are averaged out to effective parameters. In particular, it is known that \(\mu\) is a parameter whose value strongly increases with age, but also with strong dependence on health status52. This means that not only the demographic composition of an investigated population needs to be considered, but also the dynamics of epidemic spread relative to it (e.g. the timing of seeding the virus in settings or population groups with specific demographic makeup such as schools, high-frequency business travellers or care homes). As a consequence, the infection fatality rates reported in the literature have varied, with some convergence in the range of 0.5–\(1\%\)53. Findings outside this range with potential country to country variations may still occur and be meaningful as effective average figures. Possible changes of \(\mu (t)\) with time during the course of the epidemic have not been considered here, but are not likely to be important over the period studied in light of the high quality of exponential fits to the D(t) data.

Since S is generally time-dependent over the course of the epidemic, we use a standard Runge–Kutta iterative algorithm to calculate numerical solutions of the system of differential equations [68], and incorporate a time dependence to \(\beta (t)\) using Eq. (9). A cost function is generated, derived from the mean-squared deviation between the numerical solutions for the epidemic trajectory and the observed data \(D_{\text {log}}(t)\), in each geographical area considered. The individual cost functions for the metropolitan region and corresponding country are combined (with equal weighting) to simultaneously respect both sets of data in each pair, and a Nelder–Mead based optimisation method is then used to minimise this total cost function. This procedure provides our best-fit parameters for the epidemic dynamics.

Additionally in this process, coupling is introduced between each metropolitan region and its corresponding country by constraining the values of \(\beta (t)\) over time to be equal between the areas in each pair, as informed by the location data (Figs. 1B and 2 lower panels). Constraining the values of \(\beta\) in the metropolitan region and the rest of the country to be equal \(\left[ \beta ^m(t) = \beta ^c(t)\right]\) is based on the assumption that public health interventions have equal effects on each area. This is supported by the observation that the time constants for initial exponential growth and the public behaviour changes (in timing and magnitude) are very similar for countries and their metropolitan regions following nationally homogeneously-imposed public health interventions, with the exception of the United States as already noted. The initial value of \(\beta _i\) is determined by the previously determined (see above) exponential growth phase time constant. Free optimisation parameters then relate to timing of epidemic seeding and the nature of the change in \(\beta\) (namely \(t_{\text {int}}\) and \(t_w\)).

To account for demographic differences, we use the distributions of ages from population statistics in each of the eight geographical areas along with the reported age-dependence of the IFR24,52 to calculate the average apparent IFR for each metropolitan region (\(\mu ^m\)) and rest of country (\(\mu ^c\)). Then, rather than constrain our model by exactly these determined IFR values (which may be prone to systematic bias in the clinical data), we instead couple the fitting procedure between metropolitan region and rest of country by prescribing the ratio \(\mu ^m/\mu ^c\) in each case. The obtained ratios used as constraints for each fit were 0.664 (London/England), 0.985 (Lombardy/Italy), 0.887 (Madrid/Spain), and 0.854 (New York/US), and so the metropolitan regions have a generally younger population structure (and therefore lower effective infection fatality rate). Finally, the absolute values of \(\mu\) are then returned as fit parameters (see Supplementary Table S1), and are consistent with those reported in the literature53. An exception is the higher value obtained for Italy of \(\mu =1.95\%\), which may be attributed in part to a comparably older population structure, and potentially the increased impact on healthcare capacities.

The model can be extended by incorporating further changes in \(\beta (t)\) as public policy interventions and/or compliance and public behaviour change. For the example of England and London we note that a slight increase of \(\beta ^c=\beta ^m = 0.134\, {\text {day}}^{-1}\) is consistent with the observations of D(t) from mid-May until the end of June 2020. The continued match of the data with equal values of \(\beta (t)\) while the exponential decay constants differ between London and the rest of England further strengthens the hypothesis that public policy interventions had an equal effect on \(\beta\) across both areas.