1 Introduction

Economic fallout following major health shocks is well documented in historical case studies. The Black Death was paired with a considerable economic disruption and change (Jedwab et al. 2022; Alfani 2013) and the Spanish Flu caused economic disaster in many developed countries (Barro et al. 2020; Barro and Ursúa 2008). Pandemics are some of the most insidious health shocks as they often arrive unexpectedly with wide-reaching effects. Their economic impacts are a broad field of inquiry since pandemics can arise from many different viruses and can propagate through several economic channels including investment, employment and consumption, though the prime mover is the prospect of mortality.

Table 1 List of major influenza events 1870–2016.
Table 2 Working-age male mortality effects on cyclical log real GDP per capita
Table 3 Working-age population mortality effects on cyclical log real GDP per capita
Table 4 Comparison of 90% confidence intervals for \(\beta\) if exclusion restriction fails
Table 5 2SLS estimates: mortality effects on the employment-to-population ratio

The causal link between health and long-term economic outcomes has been established in the literature (Sharma 2018) and its dynamics well known (Swift 2011). Less understood are the short-run economic impacts of shocks to population health. Strauss and Thomas (1998) review a related literature on health and individual productivity and yet, the extent to which these, or other, health shocks combine to cause recessions is not well documented. This is, perhaps, surprising given the attention paid to business cycle mitigation by governments and central banks worldwide.

This paper offers insights from numerous influenza shocks occurring for the past 150 years which appear to coincide with economic slowdowns in developed economies. Examples include the Asian Flu (1957–58) and the Swine Flu that accompanied the slow recovery from the 2008 financial crisis. Specific to the USA, several economic downturns or slowdowns appear to coincide with influenza pandemics that strike every 28 years on average (Mackellar 2007: 430).Footnote 1 Major troughs in the USA business cycle occur in March of 1919, April 1958 and June 2009. As well, Italy experienced approximately 20 business cycles between 1861 and 2000 and saw major troughs in 1920 and 1958 (Delli Gatti et al. 2005: 83) and additionally was quite hard hit in the aftermath of the 2008–09 Great Recession by the 2009–2011 Sovereign Debt Crisis.Footnote 2

To provide evidence with a causal interpretation, we constrain our study to a single channel by which these pandemics are plausibly exogenous in our context in terms of their contribution to business cycles: mortality.Footnote 3 The causal mechanism we examine is appropriate for an analysis of short-run pandemic effects which can be expected to propagate through reductions in labour supply (Bloom et al. 2022). Because the scientific community recognizes influenza as a continued threat for future pandemics (Palese 2004), learning from past influenza pandemics remains an important empirical endeavour. Our analysis of less-studied influenza shocks is also motivated by literature showing that different health shocks (i.e. different causal pathways) can have different impacts; For example, Sharma (2018) finds a strong causal effect of population health on GDP through human capital, while Acemoglu and Johnson (2007) find no meaningful effect from epidemiological improvements.

Our contribution to the literatureFootnote 4 is twofold. First, we establish the causal impact of pandemics as one type of exogenous health event on the business cycle. This analysis of short-run economic fallout adds to the broader literature linking health to economic performance, including Weil (2007). Aside from case studies of pandemics, existing causal analyses focus on the long-run and obscure the short-run impacts we measure here. For example, Sharma (2018) uses 10-year averaged data while Acemoglu and Johnson (2007) study 40- and 60-year spans. Second, our analysis extends the historical record of pandemic case studies by documenting the importance of several less-studied health shocks. In focusing on influenza pandemics, our analysis complements evidence on the macroeconomic impacts from the Spanish Flu (Barro et al. 2020; Karlsson et al. 2014). Whereas large-scale health shocks (e.g. the Black Death 1347–1352) had obvious economic consequences, we show that mortality from numerous smaller, but nonetheless serious, pandemics caused GDP to decrease from trend by about 0.3% for each additional death per 1000 persons of working age, on average.

Identifying the economic impacts of historical pandemic events is challenging because data surrounding these shocks are available with limited frequency: annual, rather than quarterly or monthly. Pandemics are also likely to have heterogeneous effects across countries and by pandemic event (Alfani 2013). For example, Clay et al. (2018) find regional pollution differences to be a factor in Spanish Flu mortality. Our data combine the Jordà–Schularick–Taylor (JST) Macrohistory Database (Jordà et al. 2017) with the Human Mortality Database (2020), hereafter the HMD, to form an unbalanced panel across 16 countries spanning 1870–2016. One advantage of these historical panel data is to capture effects across several countries and several events. This exercise will identify a causal parameter that represents a weighted local average treatment effect (LATE) across pandemics. Because Abadie (2003) and others highlight the difficulty interpreting weighted LATE parameters, we supplement our main findings with details on which events weigh most heavily in our causal framework, that is, when and where influenza pandemics induce mortality the most.

Our estimates address endogeneity between economic performance and mortality—an important consideration since national wealth may also affect public health. We first demonstrate considerable drops in real log GDP per capital coincident with influenza pandemics that are evident in the raw data and then provide two-stage least squares (2SLS) estimates of pandemic-induced mortality shocks on the cyclical component of log GDP. The pandemic-induced mortality effects we measure using our instrumental variable approach are larger than the GDP–mortality relationship suggested by OLS regressions that may suffer from endogeneity. Our specifications include a wide range of controls under which case we are confident that pandemic timing is a plausibly exogenous instrument. However, we also present Conley et al. (2012) bounds for our estimates under departures from the exclusion restriction. Where data permit, we also provide ancillary evidence in favour of the causal channel we identify. Pandemic-induced mortality effects on the cyclical component of GDP can be expected to propagate through reductions in labour supply. Estimates suggest that influenza pandemics may cause meaningful decreases in the employment-to-population ratio. In addition, our results are robust to several important considerations, alternative measures of the cyclical component of GDP, alternative mortality measures and a variety of covariates.

2 Literature review

A substantial literature details the contribution of health status and pandemics to historical economic events. Our focus on historical short-run effects situates the current analysis in a relatively sparser literature. Alfani and Murphy (2017), Alfani and Percoco (2019) and Jedwab et al. (2022) note that major pre-industrial events including the Black Death caused asymmetric economic shocks across Europe because of differences in population density and economic development. Important immediate effects are also evident from the COVID-19 pandemic (Baker et al. 2020). Our empirical approach is most like Barro et al. (2020), where Spanish Flu mortality is shown to have decreased short-run real GDP per capita by 3%. In a review of empirical approaches, Bloom et al. (2022) argue that these growth-type regressions may be a suitable strategy when panel data are available. Barro and Ursúa (2008) use similar data to study economic crises. Their results suggest that the Spanish Flu was the fourth-worst contraction in recent history after the two World Wars and the Great Depression.

The Spanish Flu receives particular attention in the literature. Karlsson et al. (2014), for example, find little discernible effect on earnings but increased poorhouse rates and a reduced return to capital across Swedish regions. Garrett (2008, 2009) finds that mortality decreased the supply of manufacturing workers and increased the marginal product of labour, the marginal product of capital and real wages in the USA. Brainerd and Siegler (2003) argue that USA states with higher influenza mortality during the Spanish Flu era subsequently experienced higher per capita income growth rates. Beach et al. (2022) revisit the Spanish Flu’s impact to provide lessons for COVID-19, noting deeper recessions in countries with higher 1918 influenza mortality. Aassve et al. (2021) using respondent attitudes to a general social survey also find that experiencing the Spanish Flu likely had permanent impacts on the level of social trust that was passed on to the descendants of Flu survivors which can also impact economic development.

Our focus on short-run or business cycle effects differs from the larger literature on long-run impacts of health shocks (e.g. Acemoglu and Johnson 2007; Barro 2013; Bloom et al. 2004). Pamuk (2007) argues that the great divergence in economic growth among western economies may be rooted in the effects of the Black Death. Arora (2001) finds that long-term health measures including stature and life expectancy appear to have permanently altered the growth paths for major industrialized countries over the course of 125 years. Jordà et al. (2020a, b) link pandemics and the natural rate of interest since the fourteenth century, finding that interest rates fall by about 1.5 per cent—lasting 20 years because pandemics reduce labour relative to capital.

Pandemic effects on the macroeconomy manifest through several channels. Following the insights from Bloom et al. (2022), our approach will examine one such channel that is associated with short-run impacts: mortality. Grimm (2010) notes that mortality shocks induce expenses and income loss but also reduce the number of household consumption units. Given that influenza pandemics effects differ across age cohorts, this latter point would particularly apply to the Spanish Flu which had high mortality among prime working-age adults. The 2009 pandemic had short-run hospitalization costs exceeding 20 million GBP in the UK (Lau et al. 2019) and decreased labour supply considerably in Chile (Duarte et al. 2017).

Our analysis also relates to the wider literature relating health to economic growth that is often summarized by the Preston curve (Preston 1975) which noted a long-term relationship between life expectancy and per capita income over time with recent research noting more than proportional increases in life expectancy at higher per capita income levels (de la Escosura 2023). Fogel (1994) noted the positive long-run relationship between nutrition improvements, human health capital and economic growth. Indeed, this literature is not settled with regard to causality. For example, Ye and Zhang (2018) examine 15 OECD countries and 5 developing countries from 1971 to 2015 and find a range of results from no causality to a unidirectional relationship in either direction to bi-directional causality using Granger tests. Bloom et al. (2019) also consider bi-directional causality between health status and per capita GDP as well as the presence of confounding factors noted by Deaton (2013) including education, technological progress and institutional quality. Further nuances include whether specific diseases are communicable (e.g. influenza pandemics) or non-communicable (e.g. cardiovascular, diabetes) and whether longer-term effects on health will arise through life expectancy or infant mortality (Bloom et al. 2019; Suhrcke and Urban 2010).

Finally, our findings contribute to understanding the historical influence of pandemic health shocks in the broader literature on the causes of business cycles. This literature is primarily structural models that parameterize the economy. The current analysis can be seen as preliminary causal evidence to assist with the inclusion of health shocks, specifically recurring influenza pandemics. Known factors affecting the business cycle are many, including interest rates, financial crises and oil prices, all of which may propagate differently through consumption, investment and other components of GDP. Some reviews of this literature are available in Stock and Watson (1999), for the USA and Sensier et al. (2004) for Europe.

3 Pandemic events

Influenza pandemics worldwide since 1870 are presented in Table 1. These pandemics are, by definition, global events, and we model them as such in the subsequent analysis. The influenza pandemic during 1873 to 1875 was preceded by equine influenza in the USA and Canada (Judson 1873). The loss of working animals in the nineteenth century in addition to any animal to human transmission was a serious economic effect, and it should be noted the contraction of the 1870s was particularly severe in some countries.Footnote 5 The 1889 –1892 Russian Flu pandemic had an estimated global death toll of 1 million people, and its spread was facilitated by the rapid population growth and urbanization of the nineteenth century. However, recent evidence asks the question whether this event may have been a coronavirus rather than influenza (Brüssow and Brüssow 2021).Footnote 6 The 1918–1920 Spanish Flu is the most devastating pandemic event of recent history infecting nearly one-third of the world’s population and killing an estimated 50 to 100 million people (Mamelund 2008, p 601). These pandemics spread globally given transportation improvements over the course of the nineteenth and early twentieth century. Thus, while the severity of these pandemics certainly differed by country, the timing of pandemic onset in annual data does not. In the post-World War II period, the spread of air travel made the rapid spread of pandemics an even greater concern and pandemic declaration by the WHO can be considered particularly definitive during this period. The 1957–1958 Asian Flu and the 1968–1970 Hong Kong Flu were major events with global death tolls estimated at 2 million and 1 million, respectively.

4 Data

4.1 Economic panel data

Our analysis employs release 5 of the JST Database, which spans 16 developed countries during 1870–2016. Countries are: Australia, Belgium, Canada, Denmark, Finland, France, Italy, Japan, Netherlands, Norway, Portugal, Spain, Sweden, Switzerland, the UK and the USA.Footnote 7 These data form an unbalanced panel since some covariates are not available for some countries in earlier years or during war years. Appendix Table 7 summarizes these limitations. Fortunately, our outcomes are constructed from real gross domestic product (GDP) per capita, which is continuously available for all countries from 1870 through 2016.

Our analysis examines the cyclical component of log GDP per capita, denoted \(\widetilde{lGDP}\). We take the natural log of the JST variable “real GDP per capita index” (2005 = 100) and detrend the resulting variable three different ways: using the HP (Hodrick and Prescott 1997) filter,Footnote 8 the BK bandpass filter (Baxter and King 1999),Footnote 9 and linear detrending (LDT) that allows for a break mid-series in 1946. This break allows for the considerable change observed in the average GDP series around the time of the first influenza vaccine. Our preferred estimates use the HP filter, since our results are then more comparable to the broader business cycle literature. However, the HP filter has been criticized as potentially being arbitrary (Hamilton 2018).Footnote 10 Presenting HP results alongside others, particularly the BK band pass filter, demonstrates the robustness of our findings to the particulars of any one detrending procedure and builds confidence in our findings.

The \(\widetilde{lGDP}\) measure is advantageous for capturing the business cycle in our data. Changes in log GDP approximate percentage change, which aids in cross-country comparisons since larger economies experience larger nominal GDP fluctuations. Furthermore, the HP and BK filters isolate precisely those high-frequency movements in the time series that we wish to examine, separately by country.Footnote 11 Both of these filters isolate the cyclical component from a potentially nonlinear trend and thus may remove more persistent periods of growth that a linear detrending procedure would not. Measured deviations in these series, then, might be expected to be smaller than measured deviations in the LDT series.

To illustrate the net effect of influenza pandemic onset on economies in our raw data, we reorganize the data into pandemic spells. We then plot average log GDP across years to pandemic onset. Because advancements in medical technology and living standards may have allowed pandemics to propagate differently over time, we break the data into two large spans. A natural break point is 1946, in the light of the importance of the 1940s “international epidemiological transition” for population growth (Omran 1971), economic growth (Acemoglu and Johnson 2007) and because of the onset of flu vaccines.Footnote 12 A precipitous drop in log GDP at pandemic onset is evident in Fig. 1. These data are highly suggestive of pandemic-induced downturns. The dynamics of log GDP differ somewhat in the lead-up to the pandemic, primarily because of war effects in the pre-1946 era that we will account for fully in regression models to follow.

Fig. 1
figure 1

Average log GDP and pandemic onset, before and after 1946. Data source: Jordà et al. (2017). Author calculations on spell level data centred with zero at pandemic onset averaged by year-to-pandemic. Local linear regression (bandwidth of 1) overlaid

Examining pandemic effects through mortality permits an imperfect measure of the intensity of a pandemic. Pandemics may differ following improvements in public health and medical technologies that vary across countries and have reduced infectious disease mortality during the years 1870–2016 (Cutler et al. (2006) discuss mortality determinants). Annual death rates by sex and age are available from the Human Mortality Database (HMD) for most of the time series: Sweden, France, Belgium, Denmark, the Netherlands and Norway start from 1870. Italy, Switzerland and Spain start from 1872, 1876 and 1908, respectively.

We construct two mortality measures pertinent to the question at hand. The first is an annual death rate among working-age males (\({D}^{\mathrm{M}}\)), generated for each of the 16 countries (\(j\)) using counts of age-specific (\(a\)) deaths among males (\(\mathrm{MD}\)) and male population (\(\mathrm{MPOP}\)):

$${D}_{jt}^{\mathrm{M}}=\frac{\left(\sum_{a=16}^{65}{\mathrm{MD}}_{jt}\left(a\right)\right)}{\left(\sum_{a=16}^{65}{\mathrm{MPOP}}_{jt}\left(a\right)\right)}$$
(1)

\({D}^{\mathrm{M}}\) captures mortality among men ages 16 − 65 providing a measure that should capture effects on the population most directly responsible for labour supply during the period of analysis.Footnote 13 Figure 2 illustrates that, in the spell-arranged data, pandemic onset coincides with a considerable jump in \({D}^{M}.\) The increase is particularly evident pre-1946, when medical interventions were more limited. However, even after 1946 the increase in death rates is much larger than any other year-over-year increase observed in the data.

Fig. 2
figure 2

Mortality among working-age males and pandemic onset. Data sources: Jordà et al. (2017) and the Human Mortality Database (2020). Author calculations on spell level data centred with zero at pandemic onset averaged by year-to-pandemic. Local linear regression (bandwidth of 1) overlaid

The focus on male mortality is more defendable earlier in the time series. Because female labour force participation has risen dramatically starting in the mid twentieth century, we also generate a working-age population mortality rate \({D}^{\mathrm{P}}\) (for both sexes combined) following an equivalent formula. A considerable jump in this variable at pandemic onset is also seen in Fig. 3. It will turn out that our results are largely immune to sex differences in mortality rates.

Fig. 3
figure 3

Mortality among working-age population (both sexes) and pandemic onset. Data sources: Jordà et al. (2017) and the Human Mortality Database (2020). Author calculations on spell level data centred with zero at pandemic onset averaged by year-to-pandemic. Local linear regression (bandwidth of 1) overlaid

5 Model and identification

Conditional on disruptions to consumption and investment, pandemics can be expected to influence the economy through mortality. Indeed, mortality is identified as a key short-run mechanism in a recent review of empirical approaches (Bloom et al. 2022). This mechanism manifests in lower GDP largely through decreased labour supply, for which there is evidence where data exist (see correlation plots for Sweden, the USA and the UK in Appendix Figure 7).Footnote 14 We cannot rule out the existence of other mechanisms behind this causal channel, however, including the possibility of behavioural changes in workplaces and the like, that might follow from mortality or accompanying morbidity.Footnote 15

Equations (2) and (3) detail our empirical model, to be estimated by 2SLS. Mortality measures (\(D\)) are instrumented by binary pandemic timing indicators (\(\mathrm{Flu}\)). The parameter we wish to identify is \(\beta\), the pandemic-induced effect of mortality on various measures of the cyclical component of log GDP per capita, \(\widetilde{lGDP}\). Wald (1940) notes that \(\beta\) effectively compares the correlation of \(\widetilde{lGDP}\) and mortality in pandemic periods to non-pandemic periods. Our estimates will embody the magnitude of the pandemic by measuring the differential mortality rates induced by pandemics across countries.

$${D}_{jt}= {\psi }_{j}+\delta {\mathrm{Flu}}_{t}+{{\varvec{X}}}_{1jt}^{^{\prime}}{\varvec{\pi}}+{{\varvec{X}}}_{2jt}^{^{\prime}}{\varvec{\phi}}+{{\varvec{\varepsilon}}}_{{\varvec{j}}{\varvec{t}}}$$
(2)
$${\widetilde{lGDP}}_{jt}={\alpha }_{j}+\beta {D}_{jt}+{{\varvec{X}}}_{1jt}^{^{\prime}}{\varvec{\eta}}+{{\varvec{X}}}_{2jt}^{^{\prime}}{\varvec{\theta}}+{u}_{jt}$$
(3)

All specifications include country-specific fixed effects (\(\alpha_j\)),Footnote 16 which may help account for unmeasured contextual factors important to pandemics (Alfani 2022). Time-varying covariates include the vector \({{\varvec{X}}}_{1}\) comprising dummies for the two world wars, the real short-term interest rate and exchange rate adjusted wheat and oil prices.Footnote 17 We also include a dummy variable to allow for a break in the GDP series in 1946, which is circa the introduction of vaccines.Footnote 18 We further include the vector \({{\varvec{X}}}_{2}\) in some specifications, which includes shares of GDP components from the national accounting identity and the debt-to-GDP ratio.Footnote 19 We cluster standard errors by country because the disturbance term in Eq. (2) may be subject to serial correlation. Because the number of clusters in our data (16) may be too low for reliable inference, we also compute clustered p-values using the wild restricted efficient (WRE) bootstrap method (Davidson and MacKinnon 2010).Footnote 20

Our identification strategy requires that \(Flu\) induce a natural experiment in mortality rates. Pandemics are certainly exogenous events. This is particularly true in annual data where the speed at which pandemics propagate or the order in which they spread across countries does not vary; these world events affect all countries in the data in the same years. One limitation of any pandemic study where all countries are “treated” together is that accounting for time effects becomes challenging. We largely sidestep this concern by estimating \(\widetilde{lGDP}\), which is already stripped of any trend. It will also turn out that our results are robust to international or country-specific time trends.

Instrumental variables in macroeconomic analyses often fail to fulfil exclusion restrictions (Bazzi and Clemens 2013). In our case, \({{\varvec{X}}}_{2}\) controls may be particularly important to ensuring the exclusion restriction holds. It is difficult to imagine other ways, aside from health (i.e. mortality), that pandemic timing in annual data could affect the cyclical component of GDP when holding constant consumption, investment, exports and government spending. Nevertheless, we illustrate the impact of possible departures from the exclusion restriction on confidence intervals for \(\beta\).

The influenza-induced shocks we measure with \(\beta\) comprise a weighted local average treatment effect (LATE) because pandemic impacts are not homogeneous events. Angrist and Imbens (1995) show that monotonicity is required to identify a weighted LATE. Figure 4 illustrates that pandemic timing has a monotonic effect on our mortality measures, \({D}^{\mathrm{M}}\) and \({D}^{\mathrm{P}}\), by confirming that the cumulative density function (CDF) of the endogenous variable during pandemics is always to the right of the CDF for non-pandemics.Footnote 21

Fig. 4
figure 4

Empirical CDFs of endogenous variables conditional on covariates. Data sources: Jordà et al. (2017) and the Human Mortality Database (2020). Plots illustrate the monotonicity principle of Angrist and Imbens (1995) for both endogenous measures of mortality. ECDF’s generated from using control variables from the first stage (Eq. 2)

The weighed LATE we identify can be challenging to interpret because the weights vary by country and by event. Because the weights are proportional to the distance between the CDFs, we illustrate which countries and time periods can be expected to receive higher weight in by plotting the difference in CDFs (shaded) against kernel density estimates for different countries and decades in Figs. 5 and 6. All countries have considerable coverage over the shaded range, suggesting all contribute considerably to \(\beta\). Weights are largest over the range (− 1.5, 1.5), and over this range countries that contribute more than their share of observations include Finland, France, Italy, the Netherlands, Norway, Portugal, Spain, Sweden, Switzerland and the UK.Footnote 22 Thus, we identify a LATE more heavily reflective of Europe. The decadal analysis, in Fig. 6, suggests that the LATE we identify is reflective more heavily of the 1880–1910s and the 1940–1990s than other decades. The former is particularly sensible, as the Spanish Flu (1918) is the most substantive pandemic event in our data.

Fig. 5
figure 5

Analysis of influenza pandemic timing as an instrumental variable, by country. Data sources: Jordà et al. (2017) and the Human Mortality Database (2020). Shaded grey area is the difference in empirical CDFs of the residual working-age male death rate from Fig. 4a (non-pandemic–pandemic), scaled on the right vertical axis. Overlaid is the density of residual working-age male death rate by country, scaled on the left vertical axis

Fig. 6
figure 6

Analysis of influenza pandemic timing as an instrumental variable, by decade. Data sources: Jordà et al. (2017) and the Human Mortality Database (2020). Shaded grey area is the difference in empirical CDFs of the residual working-age male death rate from Fig. 4a (non-pandemic–pandemic), scaled on the right vertical axis. Overlaid is the density of residual working-age male death rate by decade, scaled on the left vertical axis

6 Results

Our findings establish a robust causal impact of pandemic-induced mortality on the business cycle. Mortality shocks identified by our pandemic timing instrument decrease \(\widetilde{lGDP}\) across different measures of the cycle and different mortality rates. Estimates of pandemic-induced mortality among working-age males are summarized in Panel A of Table 2.Footnote 23 In columns 1–3, the business cycle is captured using HP-filtered \(\widetilde{lGDP}\). Column 1 is the baseline specification with only fixed effects as controls. In columns 2 and 3, we add vectors \({{\varvec{X}}}_{1}\) and \({{\varvec{X}}}_{2}\) successively. Estimates are very stable across all three columns; however, we interpret column 3 as it is the most conservative in the sense that our exclusion restriction is mostly likely to be satisfied in this case. The coefficient of − 0.003 can be interpreted as follows: each additional death per 1000 (among working-age males) causes a contemporaneous average decrease in GDP per capita of about 0.3 per cent. To put this magnitude in the context, from 1917 to 1918 the average mortality rate in our data rose by about 8 deaths per 1000, suggesting that influenza-induced mortality alone would have contracted GDP from trend by an average of about 2.4%, which is about half the observed contraction from trend in our data for this period.Footnote 24

Columns 4–6 repeat with an alternative outcome measure: the BK-filtered \(\widetilde{lGDP}\). The results are highly similar despite the differences in how these two filters capture the business cycle. All specifications are statistically significant based on cluster-robust inference. The WRE bootstrap provides 2SLS inference that is more conservative with few clusters. Resulting p-values suggest statistical significance for all specifications aside from column 1 and allow us to be specific about the probability of a type 1 error in our preferred specifications: statistical significance is attained at the 6.7 and 7.2 per cent levels in columns 3 and 6, respectively.

First-stage results in Panel B of Table 2 suggest that our instruments are strong. As expected, pandemic timing has a strong positive impact on \({D}^{\mathrm{M}}\), conditional on whichever covariates are included. We assess instrument weakness following Montiel Olea and Pflueger (2013) by comparing the effective \(F\)-statistic for exclusion of \(\mathrm{Flu}\) to critical values at the 5% level, where these critical values indicate a fraction of the “worst-case” bias that remains after instrumenting. Our instrument is sufficiently strong that we are able to reject a worst-case bias of 10% or more in the most conservative specifications (columns 3 and 6) and to reject a bias of 5% or more elsewhere. Thus, we are confident that our instrument is sufficiently strong to identify pandemic-induced impacts on mortality in our data.Footnote 25

For comparison, OLS estimates in Panel C illustrate the correlation, pandemic or not, between mortality among working-age males and the business cycle. Unlike 2SLS estimates that exploit the exogenous pandemic shocks to identify mortality impacts, OLS estimates may suffer from identification concerns mentioned earlier so we believe our identification strategy to be important. OLS estimates are about one-third of the size and are not statistically significant in specifications that hold constant the covariates \({{\varvec{X}}}_{2}\). A weaker relationship when including non-pandemic mortality is expected as pandemics likely may have a stronger impact for a host of reasons, including their unexpected nature and considerable increases in unmeasurable morbidity challenges. Outside of a major health shock, national wealth is less correlated with life expectancy in the countries we study. Deaton (2003) shows that all are found on the flat portion of the Preston curve.

We now examine the same models through the lens of pandemic-induced mortality among both sexes. This alternative measure, \({D}^{P}\), may be important because our time series span periods from the past where the workforce was male dominated and periods closer to the present where the workforce is much more evenly distributed across the sexes. 2SLS impacts in Panel A of Table 3 are highly similar to those based on \({D}^{\mathrm{M}}\), too alike to draw any conclusions about nuances in mortality across the sexes. Instead, it appears that pandemic-induced working-age mortality, broadly defined, has a robust negative impact on the business cycle.

Our estimates suggest that mortality is one causal channel through which influenza pandemics contribute meaningfully to the business cycle. On average, these events increase mortality rates in our data by 1.73 deaths per 1000. The mortality consequences alone, then, suggest that influenza pandemics over the past 150 years decreased GDP per capita below its trend by half a percentage point on average. While modest compared with the drop in US GDP during the COVID pandemic or the Spanish Flu, it is not a trivial effect. Our 2SLS framework does not model the dynamics of how these shocks propagate over time once they hit. Such an analysis is better explored in a structural model, which would be motivated by the causal findings of this paper. Furthermore, while our analysis does include the Spanish Flu, several other influenza pandemics might be considered less-severe health shocks and the data available cover industrialized countries where, from a global perspective, impacts may have been less drastic.

Our interpretation of \(\widehat{\beta }\) depends on the argument that pandemic events are exogenous. Although we believe it to be so, particularly with covariates \({{\varvec{X}}}_{2}\), this assumption is fundamentally untestable. Conley et al. (2012) provide a “local to zero” method to estimate confidence intervals around \(\beta\) in cases where the exclusion restriction does not hold perfectly (where \(Flu\) affects \(\widetilde{lGDP}\) through channels other than \(D\), captured by some parameter \(\gamma\)). The distribution for \(\gamma\) is assumed to be \(\gamma \sim N(\mathrm{0,0.001})\), which is conservative in the sense that it allows \(\gamma\) to approach the full size of our measured effect of \(\beta\).Footnote 26 For our preferred specification, column (3) of Tables 2 and 3, 90% local to zero confidence intervals are presented alongside cluster-robust confidence intervals that assume the exclusion restriction holds (\(\gamma =0\)) in Table 4.Footnote 27 The bounds for \(\beta\) do not change significantly, which suggests that our estimates are reliable even in the unlikely case that \(\gamma \ne 0.\) This is to be expected as violations of the exclusion restriction are most egregious when instruments are weak, which is not our case.

6.1 Further robustness checks

Our findings are robust to the choice of business cycle measure. Both the HP and BK filters isolate particular frequencies in the data and thus generate cyclical measures that is relative to a nonlinear trend. However, we also illustrate that our results hold when using a linear detrending procedure (LDT) whereby \(\widetilde{lGDP}\) is comprised of residuals from country-specific regressions of \(\mathrm{log\,GDP}\) on the year of observation and the \(1946\) dummy variable. This measure is impacted negatively in all specifications (columns 5–8 of Tables 9 and 11 in the Appendix). Coefficients are much larger, which is to be expected as deviations from a linear trend over such a long period are necessarily larger than deviations from the nonlinear HP or BK trends that fluctuate slowly to capture sustained periods of higher growth.

We also consider more carefully the context of our most significant pandemic. The year 1918 overlaps both WW1 and the Spanish Flu. This matters because wartime contractions during our period of analysis are more than twice as large as non-wartime contractions among OECD countries (Barro and Ursúa 2008). WW1, in particular, was among the most significant contractions in the period of analysis with total deaths of combatants and civilians of about 20 million (Mougel 2011).Footnote 28Although the vector \({{\varvec{X}}}_{1}\) includes a WW1 timing control, we may not fully disentangle differential mortality impacts of the during this crucial year. To provide some evidence that WW1 is not confounding the results, we adjust \({D}^{\mathrm{M}}\) for the year 1918 using estimates from Barro et al. (2020), where separate death rates for the Spanish Flu and WW1 are available for all countries in our data aside from Finland.Footnote 29 The results using these adjusted mortality numbers are virtually unchanged (see Appendix Table 14).

An alternative way to measure pandemic-induced mortality is to examine excess mortality from influenza pandemics. Measurement concerns prevent us from adopting this as our main approach; any estimate of excess mortality introduces further uncertainty. Nevertheless, OLS estimates using excess mortality should be broadly similar to our IV results. Following Murray et al. (2006), we use the mean of adjacent year values twice the duration of the pandemic to establish a baseline mortality rate, separately for each pandemic in each country.Footnote 30 Excess mortality (\({\mathrm{EM}}_{jt}\)) is the difference between this baseline and original mortality data. Equation (4) details our calculation for a pandemic of duration \(S\) beginning in year \(T\), using the mortality rate \({D}^{\mathrm{M}}\) as an example.

$${\mathrm{EM}}_{jt}^{M}=\left({D}_{jt}^{\mathrm{M}}-\left[\frac{{\sum }_{t=T-S}^{T-1}{D}_{jt}^{\mathrm{M}}+{\sum }_{t=T+S}^{T+2S-1}{D}_{jt}^{\mathrm{M}}}{2S}\right]\right)$$
(4)

Estimates of the effect of excess mortality on the cyclical component of GDP in Appendix Table 15 suggest very similar impacts to the IV estimates in Tables 2 and 3. Point estimates using \({\mathrm{EM}}_{jt}\) exceed IV estimates somewhat but fall well within the confidence interval of the IV estimates, and so, we cannot rule out that the IV estimate might exceed these estimates with stronger instruments.

We consider the possibility that any subtle trends in any of our covariates and/or mortality rates might lead to spurious findings, even if it is unlikely given that our outcome is detrended. 2SLS estimates in Appendix Table 16 present results, for both the HP- and BK-filtered outcomes and for both mortality measures \({D}^{\mathrm{M}}\) and \({D}^{P}\), that include overall time trends (columns 1–4) and country-specific time trends (columns 5–8). The results are very similar, if not slightly larger. However, as the instruments are also slightly weaker, we cannot rule out that this difference is the result of a small increase in bias.

The impact measured above represents an average effect across all pandemics. However, not all events were equally severe. In particular, the flu events spanning 1873–75, 1899–1900 and 1946 may not be considered by some to have been full-fledged pandemics. A LATE that is not identified using the variation across these events might be expected to measure a stronger response. We redefine our instrument to treat these years as non-pandemic events and present 2SLS estimates in Table 17. As expected, point estimates are indeed larger although confidence intervals overlap. These results are suggestive that stronger pandemics had stronger effects, although it is worth noting that the indicator for more severe pandemics, while still a strong predictor of \({D}^{P}\) is somewhat weaker as an instrument for \({D}^{\mathrm{M}}\).

Finally, we rule out the possibility that our results are driven entirely by the most severe pandemic in the data, the Spanish Flu. Appendix Table 18 reports 2SLS results excluding the years 1918–1920. The results are broadly similar, particularly in columns 3 and 6 which are conditional on covariates that account for the overlapping world war events and their economic impacts, which might be particularly problematic as confounders during this period.

6.2 Employment

Labour supply is expected to be the main mechanism behind the causal link between mortality and GDP. In this section, we examine the potential intermediate effect that pandemic-induced mortality has on the employment-to-population ratio, \(E,\) where data permit (USA 1890–1990, UK 1870–2011 and Sweden 1870–2000). While Appendix Figure 7 illustrates correlation between \(E\) and \({D}^{\mathrm{M}}\), this is merely suggestive of an intermediating role. Instead, we re-estimate Eqs. (2) and (3), replacing \(E\) as the outcome variable. Table 5 shows results for \({D}^{\mathrm{M}}\) and \({D}^{P}\) mortality measures. Point estimates suggest a considerable negative impact on the employment ratio in these three countries for each additional death per 1000. However, the instrument is not strong enough to consider these estimates anything beyond suggestive. This is not surprising since \(N=275\) with covariates. Further, inference based on three clusters is certainly unreliable and WRE bootstrapped p-values suggest statistical significance well above the 10% level. Thus, we cannot as confidently place any emphasis on causal results for employment.

7 Discussion and conclusion

Our results suggest that over the course of the period 1870–2016, influenza shocks decreased the short-run (cyclical) component of GDP by increasing mortality rates among the working-age population. In addition to contributing to a growing knowledge of pandemic events, the influenza shocks we examine comprise a quasi-experimental setting under which we can identify causal effects of mortality on the business cycle. This causal channel appears to act, in part, through a reduction of labour supply, though it may capture other population health effects from these pandemic events. These events on average increased mortality rates in our data by 1.73 deaths per 1000 and suggest that influenza pandemics over the past 150 years decreased GDP per capita below its trend by half a percentage point on average.

While meaningful and significant, the effects we identify are modest enough that we would not claim mortality as the only channel by which pandemics contribute to the business cycle. Mortality effects from influenza pandemic shocks were predominant in the late nineteenth century and early twentieth century, which featured more rudimentary medical technology and treatments. Yet, our findings suggest that they continue to be worth consideration even after the introduction of vaccines. Even where mortality from influenza is not particularly egregious, behavioural responses in the workforce from attempts to reduce mortality impact from infection spread or accompanying morbidity (Bloom et al. 2022) are also likely to be important in the short run. Given that mortality may only be the tip of the proverbial iceberg, it should be noted that comparing morbidity and mortality shocks remains a promising avenue for further research pending the availability of long-run data on illnesses and pandemics as may certainly be the case in the wake of the COVID-19 pandemic and the evidence regarding the effects of “Long Covid” (Bach 2022; World Bank 2022). Indeed, going forward the macroeconomic effects of COVID will be an interesting area of further research given the novelty of the coronavirus, the extent of lockdowns, the rapid development and deployment of vaccines as well the magnitude of fiscal and monetary responses relative to previous pandemic periods.

Other important channels are at play which we hold constant in our analysis. Consider disruptions to trade, investment and particularly consumption, which comprises close to two-thirds of GDP in most developed countries (Attanasio 1999). For example, interplay between consumption and mortality might follow from reduced mobility due to taking private measures, lockdowns that are implemented in response to rising mortality, or from deferred consumerism when employment is less stable. A prominent example is the loss of 2.8 billion USD by the Mexican tourism sector during H1N1 (Rassy and Smith 2013). Ultimately, our results also motivate the inclusion of influenza pandemics as an overlooked health shock in structural models of health and the business cycle, building on such as those proposed by Bloom et al. (2022), which have been motivated by the recent and severe COVID-19 pandemic. The impact of influenza pandemics on business cycle fluctuations, while moderate is nevertheless of significance and important to appreciate given that economic history shows they have happened before and will inevitably happen again.