We focus on two aspects of COVID-19 mortality over the first three months of the pandemic spanning from January 23rd, 2020 to April 28th, 2020, which we refer to as the first pandemic phase: first, the dynamic of COVID-19 mortality rates out of the total population, examining the weekly growth rate of the new mortality rate per capita; and second, the empirical shape of the mortality rate curve from the onset to the local peak of the first pandemic phase, examining three dependent variables discussed below. To filter out noise in the daily mortality data, we construct a 7-day rolling average of the daily mortality rate per capita and use these series of averages in our estimations and peak identifications. For simplicity, the mortality rate mentioned hereafter is referred to as the 7-day rolling average of the mortality rate.
Although examining the growth rate of the new mortality rate per capita provides evidence on how mortality rates evolve over time, it is not trivial to conclude which variables can characterize the cross-country difference in the empirical shape of the mortality curve from the onset to the local peak of the first pandemic phase. We argue that three outcome variables related to the local peak matter: first, the new mortality rate at the peak; second, the pandemic duration to the peak; third, the ratio of the new mortality rate at the peak to the pandemic duration to the peak. We illustrate this idea by comparing country cases. Figure 5 shows the daily new mortality rate curves for several countries: the left one comparing that of Hungary and Norway, the middle one comparing to that of Denmark and Norway, and the last one comparing that of Austria, Estonia, and Greece. The left figure shows that although the peak mortality rates of Hungary and Norway are around the same, their durations to the peak are different: Norway reached the peak faster, and thus had a steeper mortality curve. The middle figure shows that although the durations to the peak of Denmark and Norway are around the same (i.e., both around 25 days), their peak mortality rates are different: Denmark’s mortality rate climbed to a higher level before going down, and thus had a steeper mortality curve. Hence, a lower peak mortality rate or a longer duration to the peak implies a flatter mortality curve; however, only on the condition that all other things are held equal. Once both the peak mortality rate and the duration to the peak change in the same direction, it may be ambiguous whether the mortality curve is flattened or not, as demonstrated in the right figure of Fig. 5. Both Estonia and Greece realized a lower peak mortality rate and a shorter duration to the peak than Austria; however, it is obvious that Estonia has a much steeper mortality curve while Greece has a much flatter mortality curve compared with Austria. This implies that the ratio of the peak mortality rate to the duration to the peak also plays a key role in understanding the empirical mortality curve. Hence, it follows that all three outcome variables related to the peak (including the daily new mortality rate at the peak, the duration to the peak, and the ratio of the peak rate to the duration) together characterize the empirical shape of the mortality curve from the onset to the local peak of the first pandemic phase. By knowing how these three outcome variables are impacted by government pandemic policies or country-specific structural variables can we uncover to what extent these factors account for the pattern of mortality’s climb to the local peak of the first pandemic phase.
Policy Stringency and Mortality Dynamics
We start with a panel study of mortality growth rate dynamics, using the week-over-week growth rate of the new mortality rate per capita, accounting for containment and closure policy interventions (see Oxford’s COVID-19 Government Response Tracker).Footnote 9 Specifically, our dependent variable yi, t in country i on date t is defined as
$$ {y}_{i,t}=\log \left({MortalityRate}_{i,t}\right)-\log \left({MortalityRate}_{i,t-7}\right), $$
(1)
where MortalityRatei, t is the new mortality rate in country i on date t. A lower growth rate of the new mortality rate implies a flattening of the mortality curve.
Our first benchmark estimation uses the method of local projections (Jordà 2005), examining future (or current) mortality growth rate as a function of current (or past) mortality growth rate and degree of policy stringency. We aim to understand to what degree policy interventions are associated with future mortality growth, and therefore, the evolution of the pandemic. Local projections do not only simplify our problem but also produce robust estimates under misspecification.Footnote 10 Specifically, our model is
$$ {y}_{i,t+h}={\alpha}_i(h)+{\delta}_t(h)+\gamma (h){SI}_{i,t}+\beta (h){y}_{i,t-1}+{\varepsilon}_{i,t}(h), $$
(2)
where yi, t + h is the week-on-week growth rate of the new mortality rate in country i at date t + h for h = {14, 21, 28}. SIi, t is the Stringency Index constructed in the Oxford COVID-19 Government Response Tracker, an aggregate measure of the overall stringency of containment and closure policies,Footnote 11 at date t and yi, t − 1 is the one-day lagged mortality growth rate. Fixed effects are denoted as ai and δt, representing the country and time fixed effects, respectively. We choose to examine the response of new mortality growth with respect to SI at horizons no shorter than two weeks, with reference to studies on the incubation and death periods: The incubation period is 6 days on average (McAloon et al. 2020), and the death period (number of days from symptom onset to death) ranges from 12 to 15 days for different age groups on average according to CDC estimates.Footnote 12 The collection of estimates \( \hat{\gamma}(h) \) for h = {14, 21, 28} trace out the dynamic impact of stringency policies on mortality growth at the weekly frequency.Footnote 13
Additionally, we study the heterogeneity in the association between policy interventions and mortality growth by estimating the model with interaction terms between the Stringency Index and country-specific social and economic variables
$$ {y}_{i,t+h}={\alpha}_i(h)+{\delta}_t(h)+\gamma (h){SI}_{i,t}+\beta (h){y}_{i,t-1}+\theta (h){SI}_{i,t}\ast {x}_i+{\varepsilon}_{i,t}(h), $$
(3)
where xi is the country-specific variable of interest. We consider: the proportion of the elderly population (people aged 65 and over), the proportion of the urban population, proportion of employment in vulnerable sectors, population density, the logarithm of GNI per capita, health expenditure (% of GDP), population-weighted exposure to ambient PM2.5 pollution, the logarithm of tourist arrivals and departures, level of democracy, and country location measured with latitude and longitude.Footnote 14
Cross-Country Differences in Peak Mortality
We follow with a cross-country analysis examining the mortality rate at the peak, a key moment in the first quasi-bell curve, which puts hospitals’ capacity to their most severe test. The quasi-bell shapes are normalized by the day of the first significant death. As discussed before, we consider three outcome variables related to the peak: first, the logged peak mortality rate, second, the pandemic duration to the first peak (PD), and third, the ratio of the logged peak mortality rate to the PD. We opt to use linear regression analysis to examine the cross-country difference in the logged peak mortality rate and the ratio of the logged peak mortality rate to the PD and use survival analysis to examine the cross-country difference in the PD. Our cross-country peak mortality data is calculated from the sample from January 23rd, 2020 to April 28th, 2020, during which many OECD countries and emerging market economies finished their ride up to the first peak of the first quasi-bell in terms of contagion per capita and fatality per capita.
Our cross-sectional linear regression model is
$$ {y}_i={\beta}_0+{Z}_i\gamma +{X}_i\beta +{\varepsilon}_i, $$
(4)
where yi is the logged peak mortality rate or the ratio of the logged peak mortality rate to the PD in country i. A higher peak mortality rate implies a larger inflow of patients, stretching hospitals’ capacity. Accounting for the pandemic duration to the first peak, a higher ratio of the logged peak mortality rate to the PD implies a steeper mortality curve, characterized by either a larger or a faster patient inflow that could potentially overwhelm the healthcare system. We include a set of potential endogenous variables Zi. First, one may be interested in whether the cross-country difference in the intensity of the COVID-19 outbreak explains the cross-country difference in the empirical shape of the mortality curve. We include log(Early Mortalityi), the logged cumulative mortality rate in the first week after the first death, and Early Mortality Growthi, the growth rate of daily mortality rate in the first week after the first death, to control for the cross-country difference in the initial level and growth of the mortality rate. Second, one may be interested in whether proactive stringency (i.e., stringency policies in place before the first reported death) policy interventions influence the cross-country difference in the empirical shape of the mortality curve. We also include Early SIi, the average of the Stringency Index (SI) from its first non-zero value to the first death, accounting for how strict government interventions were before the first confirmed death, and Days from First SI to First Deathi, the number of days from the first non-zero SI to the first death, accounting for how early government interventions are implemented. Additionally, to account for cross-country differences in how aggressively countries respond to the pandemic and increase their policy intensities, we include a variable which we refer to as the SI Deltai (or Stringency Delta), calculated as the difference between a country’s maximum level of SI and its initial level of SI (SIi, 0), normalized by the number of days (Ti) between them:
$$ SI\ Delt{a}_i=\frac{\left[\max \left(S{I}_i\right)-S{I}_{i,0}\right]}{T_i}. $$
(5)
A higher maximum SI and/or a shorter time to the maximum SI will yield a higher SI Delta. Third, we include Early Mobilityi, the weekly average mobility index in terms of walking (reported by Apple) in the week before the first death. We emphasize that while these variables are important to investigate, all of them are endogenous, as they are calculated over part of the first wave period of COVID-19. We also include a set of country-specific control variables Xi that we take as exogenous, including the proportion of the elderly population (people aged 65 and over), the proportion of the urban population, proportion of employment in vulnerable sectors, population density, GNI per capita, health expenditure (% of GDP), level of democracy, and country location measured with latitude and longitude.
Cross-Country Differences in Time-to-Peak
We then proceed with a survival analysis studying the association between the pandemic duration to the first peak (PD) and a set of explanatory variables. We focus on the survival function of the mortality peaking.
$$ S(t)=\Pr \left(\mathrm{T}>\mathrm{t}\right), $$
(6)
which is defined as the probability that the PD is later than date t, which is the probability that the mortality peaks after date t. A higher probability that the PD is later than a certain date implies a longer PD, which could have ambiguous implications. On the one hand, it suggests a slower surge in hospitalization that could ease the burden on the healthcare system. On the other hand, a longer time-to-peak may imply a longer-lived, poorly managed pandemic.Footnote 15 Our benchmark specification is the Cox proportional hazards model (Cox 1972), which examines the relationship between the hazard function and a set of explanatory variables.Footnote 16 The hazard function is defined as the probability that the peak is on date t conditional on that the peak is reached until date t or later,
$$ \lambda (t)=\underset{dt\to 0}{\lim}\frac{\Pr \left(t\le T<t+ dt\right)}{dt\cdotp S(t)}=-\frac{S^{\prime }(t)}{S(t)}. $$
(7)
Our benchmark Cox proportional hazards model is
$$ \lambda \left(t|{Z}_i,{X}_i\right)={\lambda}_0(t)\exp \left({\beta}_0+{Z}_i\gamma +{X}_i\beta \right), $$
(8)
where λ(t| Xi) is the hazard function for country i on date t, conditioning on a set of endogenous variables Zi and exogenous variables Xi, and λ0(t) is the baseline hazard function. In addition to the same set of endogenous and exogenous variables as in the cross-country regression analysis, we also include the endogenous logged peak new mortality rate, log(Peak Mortalityi), to control for the cross-country difference in the peak level of mortality rates. The specification of the Cox Model implies that the effect on the hazard function of a one-unit increase in one covariate w ∈ {Z, X} with coefficient δ is to multiply the hazard function by eδ, and that the effect on the survival function is to raise it to a power given by the effect on the hazard function
$$ {S}_1(t)={S}_0{(t)}^{e^{\delta }}, $$
(9)
where S1(t) is the survival function on date t for a group with a one-unit higher value of the covariate w, all other variables held constant.
Limitations
We wish to briefly call out the limitations of our research design. First, our estimates cannot (and should not) be interpreted as causal. What we are reporting, across all models, are associations. Some of our variables are endogenous, which may bias our estimates. Moreover, given our choice to investigate country factors one-by-one, our regression estimates may also be biased from omitted variables. To overcome these challenges, we are in the process of collecting additional data at varying levels of detail to help deal with these issues, with the aim of achieving cleaner identification going forward. Nonetheless, under such data constraints, we believe our approach strikes a balance between parsimony, robustness, and informativeness.