Introduction

Life expectancy is a widely reported statistic, for example by official governmental agencies every year, or various United Nations agencies such as the Human Development Program (United Nations Development Programme, 2022). However, a common (but incorrect) interpretation of period life expectancy \({e}_{0}\) is that it measures actual lifespan in that period’s population. In other words, the number of deaths in an observed population should be \(\left({\text{t}}\text{otal }{\text{p}}{\text{opulation}}\right)/{e}_{0}\). But in several countries, the actual number of deaths is much smaller than the ratio would predict. For example, in the United States with a population of 337 million and a life expectancy at birth of 77 years, we should expect an annual number of deaths of \(\left(337/77\right)=4.38\) million. But the actual number is about 3.28 million (Department of Economic United Nations & Population Division Social Affairs, 2022). Such a large over-prediction is surprising (at least to us).

Mere inequality is of course no surprise to a demographer—as the latter would say, only in a "stationary" population is the average age at death given by \({e}_{0}\), and only in a stationary population do deaths equal \(\left({\text{t}}\text{otal }{\text{p}}{\text{opulation}}\right)/{e}_{0}\). That corresponding stationary population is of course defined by having zero growth and the mortality rates of an observed population. In an observed population, the number of deaths depends on the actual age structure, not on a hypothetical "stationary" age structure. One can readily imagine population structures where the number of deaths is much higher (e.g., a population with everyone over 70) or much lower (e.g., a population where everyone is between 15 and 25) than in a stationary population. But such examples are unrealistic and they also cut both ways, in the sense of implying either higher or lower actual deaths. Here we answer the following questions that grow out of the above observations. Why do we so easily find cases where the ratio \(\left({\text{t}}\text{otal }{\text{p}}{\text{opulation}}\right)/{e}_{0}\) is greater than the number of deaths? Is the over-prediction typical, and if so, why? What do we find in a global sample, using the United Nations World Population Prospects (hereafter, WPP) (Department of Economic United Nations & Population Division Social Affairs, 2022), where data cover a wide range but data quality is uneven? What do we find in high-quality data, as in the Human Mortality Database (hereafter HMD) (Berkeley USA University of California & Max Planck Institute For Germany, 2022)? Does the pattern of age-structures change in a common way across countries, and if so why?

It is obvious that crude death rate depends on the age-patterns of both mortality and population, but how? We derive an equation that reveals how these two factors determine the difference between the actual crude death rate and the stationary crude death rate. For small perturbations, we obtain a sensitivity for the crude death rate. Our sensitivity has a pleasing mathematical similarity to the famous Keyfitz (1977) and Demetrius (1978) "entropy"-like sensitivity of life expectancy. As in the latter case (Vaupel, 1986), we show that our result reveals the age-specific sensitivity of crude death rate to change in the population age structure, and that the sensitivity differs in high and low mortality contexts. Then we use our analysis to show that small deviations in age-structure from stationarity always lead to over-prediction. Our result suggests that the extent of over-prediction depends on the size of the deviation from stationarity.

Next we explore crude death rates across many countries and time periods, using data from both HMD and WPP to compute the difference between crude death rate and \(\left(1/{e}_{0}\right)\). We find a striking global over-prediction of the crude death rate. Then we use long-term Swedish data to provide a useful perspective on the crude death rate over a long period with distinct demographic regimes. Of course, over-prediction does not always occur, and we show when under-prediction occurs and discuss reasons why.

Finally, we explore the deviation from stationarity in populations across many countries and periods. We use data from the WPP and the HMD to show that the age-structural deviation from stationarity is usually small enough that over-prediction will almost always be found. Then we examine more closely how the deviation from stationarity changes over time. Using examples, we argue that the demographic transition produced deviations in the form of population bulges that have always led to what we call over-prediction. We also provide a longer-term perspective by examining age-structural change in Sweden from 1751 to 2021. We conclude with a brief discussion.

Crude death rate and age structure

Consider a population (in a specified year) in which individuals at age \(x\) have a death rate \(\mu \left(x\right)\) and a probability \(l\left(x\right)\) of surviving to at least age \(x\). We consider only females here; two sexes can be included with more algebra. Say that the population’s age-structure is described by a fraction \(u\left(x\right)\) of individuals at age \(x\). The per-capita death rate is called (by demographers) the crude death rate, and is

$${C}_{D}= {\text{deaths per capita}}=\int dx \mu (x) u(x).$$
(1)

We use a per-capita rate so the results apply for any total population.

In a stationary population with the same age-specific mortality rates, the fraction \({u}_{s}\left(x\right)\) of individuals at age \(x\) is \(\left(l\left(x\right)/{e}_{0}\right)\), and the crude death rate is

$$\begin{array}{c}{C}_{Ds}=\left(\frac{1}{{e}_{0}}\right).\end{array}$$
(2)

(The above follows from (1), using \(l\left(0\right)=1\)). How do these two crude rates compare?

Sensitivity of crude death rate

Say that an observed population has a non-stationary age structure:

$$\begin{array}{c}u\left(x\right)={\left[{u}_{s}\left(x\right)\right]}^{\left(1+g\left(x\right)\right)}.\end{array}$$
(3)

Then Eq. (1) implies that the crude death rate is

$$\begin{array}{c}{C}_{D}=\int dx\hspace{0.17em}\mu \left(x\right)\hspace{0.17em}{\left[{u}_{s}\left(x\right)\right]}^{\left(1+g\left(x\right)\right)}.\end{array}$$
(4)

Here and later the lower limit on the integral is always zero; the upper limit may be taken as infinity, or some finite number that exceeds the largest age at death. The difference between the crude death rates in an actual and a corresponding stationary population is

$$\begin{array}{c}\delta {C}_{D}=\left[{C}_{D}-\left(\frac{1}{{e}_{0}}\right)\right]. \end{array}$$
(5)

This difference is a sensitivity when \(g\left(x\right)\) is small.

Take \(g\left(x\right)\) to be small enough so that we can take logarithms in (3) and expand to get:

$$\begin{array}{c}[{u}_{s}\left(x\right)]{}^{\left(1+g\left(x\right)\right)}\simeq {u}_{s}\left(x\right)+g\left(x\right){u}_{s}\left(x\right)log{u}_{s}\left(x\right).\end{array}$$
(6)

This is of course only a linear expansion, but later numerical work shows that the approximation is reasonable for \(g\left(x\right)\) as large as say \(0.2\), which corresponds to a 20% deviation from stationary age structure. Using this expansion in (5), the sensitivity of the crude death rate is thus

$$\begin{array}{c}\delta {C}_{D}={C}_{D}-\left(\frac{1}{{e}_{0}}\right)\simeq \int dxg\left(x\right)\hspace{0.17em}\mu \left(x\right)\hspace{0.17em}{u}_{s}\left(x\right)\mathrm{log}{u}_{s}\left(x\right)\end{array}$$
(7)
$$\begin{array}{c}=\int dx\hspace{0.17em}g\left(x\right)\hspace{0.17em}\mu \left(x\right)\hspace{0.17em}\left(\frac{l\left(x\right)}{{e}_{0}}\right)\hspace{0.17em}\mathrm{log}\left(\frac{l\left(x\right)}{{e}_{0}}\right).\end{array}$$
(8)

As we next show, the above expression quantifies the intuition that the actual value of sensitivity depends on the age pattern of the deviations \(g\left(x\right)\).

Sensitivity and entropy-like measures

The sensitivity (8) shows that the contribution of a deviation in the age-structure at age \(x\) is proportional to the ratio:

$$\begin{array}{c}-\mu \left(x\right)\hspace{0.17em}\left(\frac{l\left(x\right)}{{e}_{0}}\right)\hspace{0.17em}log\left(\frac{l\left(x\right)}{{e}_{0}}\right).\end{array}$$
(9)

We use the minus sign because the logarithm is always negative. Thus a positive deviation at any age, \(g\left(x\right)>0\), will decrease \(\delta {C}_{D}\), and the converse.

There is a striking similarity between the sensitivity in (8) and the sensitivity of life expectancy to mortality change (Demetrius, 1978; Goldman & Lord, 1986; Keyfitz, 1977; Vaupel, 1986). In the latter, \(g\left(x\right)\) is a small fractional change in mortality rate at age \(x\), and the resulting change in life expectancy is

$$\begin{array}{c}\delta {e}_{0}=\int dx\hspace{0.17em}g\left(x\right)\hspace{0.17em}l\left(x\right)\hspace{0.17em}\mathrm{log}l\left(x\right)\hspace{0.17em}.\end{array}$$
(10)

Thus the contribution of a change at age \(x\) to \(\delta {e}_{0}\) is proportional to the ratio

$$\begin{array}{c}-l\left(x\right)\hspace{0.17em}logl\left(x\right),\end{array}$$
(11)

as discussed by Vaupel (1986). If we ignore the mortality factor in (9), the age-pattern of the remaining contribution to \(\delta {C}_{D}\) is similar to that of \(\delta {e}_{0}\). As we now show, the mortality factor in (9) does not alter the essential pattern.

Figure 1 shows the age-specific sensitivity weights from Eq. (9) for Sweden in four different years (data from the HMD). In high-mortality regimes (the earliest year, 1850) with high infant mortality and old-age mortality, the maximum value of sensitivity weight (0.013, 12.6% of the overall weight) occurs at age zero. With the decline in infant mortality (1950), the sensitivity weight at age zero drops dramatically and the maximum value shifts to age 80. As the mortality rate decreases further, the maximum value moves to age 87 and age 90 in 2000 and 2019, respectively. This figure supports demographic intuition, and shows that a decrease in the proportion of very young and/or old people will lead to a high positive \(\delta {C}_{D}\). But do we often see such an age pattern, or the opposite, and why?

Fig. 1
figure 1

Sensitivity Weight in Sweden from HMD

Sensitivity and the Kullback–Leibler distance

We now focus on the age-structure of the population. In the definition of \(g\left(x\right)\) in (3), take logarithms on both sides to see that

$$\begin{array}{c}g\left(x\right)\hspace{0.17em}log{u}_{s}\left(x\right)=log\left(\frac{u\left(x\right)}{{u}_{s}\left(x\right)}\right).\end{array}$$
(12)

Now insert this in (8) to get our first expression for sensitivity:

$$\begin{array}{c}\delta {C}_{D}=\int dx\hspace{0.17em}\mu \left(x\right)\hspace{0.17em}{u}_{s}\left(x\right)\hspace{0.17em}\mathrm{log}\left(\frac{u\left(x\right)}{{u}_{s}\left(x\right)}\right)\hspace{0.17em}.\end{array}$$
(13)

The integrand on the right measures the difference between the real age-structure \(u\left(x\right)\) and the stationary structure \({u}_{s}\left(x\right)\) for a given age \(x\). The overall departure from stationarity can be measured by the similar Kullback–Leibler distance (Kullback & Leibler, 1951):

$$\begin{array}{c}{K}_{u}=-\int dx\hspace{0.17em}\hspace{0.17em}{u}_{s}\left(x\right)\hspace{0.17em}\mathrm{log}\left(\frac{u\left(x\right)}{{u}_{s}\left(x\right)}\right)\ge 0\hspace{0.17em}.\end{array}$$
(14)

The inequality on the right above is well known.

Given that mortality rates are bounded between some \(A>0\) and some \(B>0\), the sensitivity in Eq. (13) is bounded between two negative numbers:

$$\begin{array}{c}-A\hspace{0.17em}{K}_{u}>\int dx\hspace{0.17em}\mu \left(x\right)\hspace{0.17em}{u}_{s}\left(x\right)\hspace{0.17em}\mathrm{log}\left(\frac{u\left(x\right)}{{u}_{s}\left(x\right)}\right)>-B\hspace{0.17em}{K}_{u}\hspace{0.17em}.\end{array}$$
(15)

Thus we have a key result: when a population has a structure slightly different from stationary, we always have an over-estimation:

$$\begin{array}{c}{\text{C}}{\text{r}}{\text{u}}{\text{d}}{\text{e}} \, {\text{d}}{\text{e}}{\text{a}}{\text{t}}{\text{h}} \, {\text{r}}{\text{a}}{\text{t}}{\text{e}} \, {C}_{D}<\left(\frac{1}{{e}_{0}}\right)={C}_{Ds}.\end{array}$$

And from (14, 15), the over-estimation should increase with the Kullback–Leibler distance.

Observed crude death rates

We have shown mathematically that over-prediction is typical in a population with about 20% deviation from stationary. But the deviations in observed populations could be larger, or different. How typical is over-prediction in populations across time and space?

We find, as shown in Fig. 2, that most populations display the pattern we expect: negative sensitivity and over-prediction. The difference between the actual crude death rate (\({C}_{D}\)) and \(\left(1/{e}_{0}\right)\) is negative in most populations and years in the HMD (94.54%) and in the WPP (95.41%).

Fig. 2
figure 2

The sensitivity of crude death rate (\({\mathrm{C}}_{\mathrm{D}}\)\({\mathrm{C}}_{\mathrm{Ds}}\)) and life expectancy at birth (\({\mathrm{e}}_{0}\)) in countries from HMD and WPP

An interesting feature of this comparison (Fig. 2) is that under-prediction (i.e., \({C}_{D}<\left(1/{e}_{0}\right)\)) typically occurs when life expectancy is very high or very low relative to the average. In countries with low \({e}_{0}\), we likely have limited or poor data on population at old ages. In countries with high \({e}_{0}\), we likely have populations with long-term low fertility and thus a historically unusual age structures, dominated by an increasing percentage of older individuals. Clearly, over-prediction will not be seen in such populations.

Figure 3 reveals the long-term pattern of the sensitivity of crude death rate in Sweden. In the pre-demographic transition stage (until about 1850) we expect the population to fluctuate around stationarity, and indeed the difference between actual and stationary crude death rates fluctuates around 0. During the demographic transition (beginning in 1850 but most pronounced in the period 1900–1950) the differences become negative and decrease to a low around 1950 and then increase. Additional perspective on these changes is provided below, when we examine the changing pattern of age structure.

Fig. 3
figure 3

The sensitivity of crude death rate (\({\mathrm{C}}_{\mathrm{D}}\)\({\mathrm{C}}_{\mathrm{Ds}}\)) in Sweden from HMD and WPP, 1751–2021

Changing age structures: patterns and causes

The deviation in age-structure is key to the crude death rate, but how far are populations from their corresponding stationary structures? To answer this question, we again use annual data from the HMD for 40 countries over a range of years (the longest period is 1751–2021 for Sweden). We also examined a larger set of data for the period 1950–2021 from the WPP for 126 countries and areas whose populations were over 5 million. Smaller countries were omitted as too sensitive to fluctuations.

Age-structural deviations

Recall that we compare a population’s actual age structure \(u\left(x\right)\), and the corresponding stationary structure \({u}_{s}\left(x\right)\). These yield the proportional deviations \(g\left(x\right)\) (defined in (3)). They also yield the observed crude death rate \({C}_{D}\) and the crude death rate \({C}_{Ds}\) for the corresponding stationary population with the same death rates.

Using the data sources above, we compute and present the frequency distribution of the \(g\left(x\right)\) values in Fig. 4. The peak values of \(g\left(x\right)\) occur near but slightly above 0, and most of values of \(g\left(x\right)\) are under 20% (about 8 in 10 for the WPP data, and about 9.5 in 10 for the HMD data). Hence age-specific deviations in observed populations from stationarity are often small enough that over-prediction will be found.

Fig. 4
figure 4

Frequency of \(\mathrm{g}\left(\mathrm{x}\right)\) value in countries from HMD and WPP. Dashed lines are at values ± 0.2; HMD data in green and WPP data in yellow

Overall deviation

We used the Kullback–Leibler distance (\({K}_{u}\)) to measure the overall deviation in the age distribution relative to the stationary distribution. Figure 5 shows the frequency and distribution of the \({K}_{u}\) values. All the \({K}_{u}\) values observed here are (naturally) positive, with peaks at 0.02 and means at 0.04 (HMD) and 0.13 (WPP). Therefore, observed age distributions are almost always non-stationary. Given that WPP data cover a wider range and are of uneven quality, we expect and find that WPP data show a wider range than HMD data, ranging from 0 to 0.5.

Fig. 5
figure 5

Frequency of \({\mathrm{K}}_{\mathrm{u}}\) value in countries from HMD and WPP. HMD data in green (or blue) and WPP data in yellow

Note that qualitatively similar results are obtained using an alternative indicator:

$$\begin{array}{c}{S}_{u}=\int dx\hspace{0.17em}{\left(\hspace{0.17em}u\left(x\right)\hspace{0.17em}-{u}_{s}\left(x\right)\hspace{0.17em}\right)}^{2}\hspace{0.17em},\end{array}$$
(16)

and are in the Appendix (Figures 10, 11, 12, 13).

Trends in overall deviation

To capture trends, we first present the \({K}_{u}\) values for three countries with high-quality data in Europe, Asia, and North America from 1900 to 2020 (Fig. 6). Taking Italy (left panel) as an example, if we ignore the wide swings due to the 1918 influenza epidemic and the two World Wars (Glei et al., 2015), \({K}_{u}\) values rose from 1900 to 1950 and steadily declined thereafter, stabilizing at levels varying between 0.02 and 0.03 after 1975. Japan and the United States follow a similar trajectory, as do many other industrialized countries.

Fig. 6
figure 6

\({\mathrm{K}}_{\mathrm{u}}\) value in 3 countries from HMD and WPP, 1900–2020

For a long-term perspective, consider Sweden (Fig. 7) with data starting in 1751. We ignore noise in the early historical data (especially for years preceding 1870–80) caused by lower data quality (Barbieri et al., 2015). In the years before the Industrial Revolution (1750–1850), age structure fluctuated with no systematic trend, as seen in the \({K}_{u}\) values in Sweden for that period. As industrialization took hold, and the demographic transition began, the distance \({K}_{u}\) first rose till about 1900, and then fell steadily until leveling off at a small value after about 2000.

Fig. 7
figure 7

\({\mathrm{K}}_{\mathrm{u}}\) value in Sweden from HMD and WPP, 1751–2021

Bulges in age distribution

The distance \({K}_{u}\) measures the deviation across the entire age distribution, but does not provide age-specific information (e.g., which distribution at a certain age is lower or higher). So here we compare the observed and stationary age distributions in a more detailed way by plotting their ratio.

Start with Denmark (Fig. 8, top-left panel). In the pre-demographic transition stage in which both birth and death rates are high (1850), the ratios decreased with age, falling below 1 at age 35, so the actual population was “younger” than the stationary population. In the early stage the demographic transition (1950) resulted in a bulge at ages 0–5. In later years that bulge travels to later ages, whereas there is a continuing reduction in birth rates and thus in the percentage as at young ages.

Fig. 8
figure 8

Ratio of the age distributions of the observed and stationary population in 6 countries from HMD, 1850–2019

Those changing depressions and bulges are observed in many other industrialized countries, as seen in the figure. A striking common feature in those bulges is that they are concentrated in the age range 10–50, which typically is the lowest mortality range. In other words, there is a larger proportion of individuals with lower mortality in the observed population than that in the stationary population. This explains the over-prediction.

In many countries, the above temporal patterns lead to a negative correlation between deviation and sensitivity, as shown in Fig. 9. What this means is that the difference between the actual crude death rate and the stationary value becomes more negative (and so larger in magnitude) as the distance \({K}_{u}\) decreases. A similar negative correlation is seen in for many years in the long-term data for Sweden. Focusing on the period 1900 to 1970, Fig. 3 shows that \({C}_{D}\) falls relative to the stationary \({C}_{Ds}\), while over the same period Fig. 7 shows that the distance \({K}_{u}\) decreases.

Fig. 9
figure 9

The deviation (\({\mathrm{K}}_{\mathrm{u}}\)) and the sensitivity (\({\mathrm{C}}_{\mathrm{D}}\)\({\mathrm{C}}_{\mathrm{Ds}}\)) in 6 countries from HMD and WPP, 1751–2021

Finally here we note that the distance \({K}_{u}\) is the main driver of the crude death rate (as compared with shifts in the age-pattern of mortality). Thus, there is relatively high discrimination of the deviation for distinguishing the mismatch pattern ("actual deaths > expected deaths" and "actual deaths < expected deaths"), as the corresponding area under the receiver operating characteristic curve (AUC) for \({K}_{u}\) is 0.74 in HMD and 0.7 in WPP data.

Stable vs. stationary populations

Much interest in demography has been devoted to stationary populations (Coale, 1972; Keyfitz, 1965). McCann (1976) analyzed crude death rate in stable populations growing at some constant rate, distinct from the stationary populations we consider here. Given that populations are rarely stable, and indeed the demographic transition means that growth rates must change with time, analyses based on stable growth are unlikely to be useful here. Even so, any period fertility and mortality leads to a growth rate and a purely hypothetical stable population. So we asked whether that stable age-distribution could be used instead of the stationary distribution. The answer was negative, but we present the comparison in the Appendix.

Discussion

The over-prediction of the crude death rate in terms of life expectancy at birth is widely observed in practice and is puzzling for non-demographers and demographers alike. Here we explored this discrepancy in several ways. We showed analytically that whenever a population age distribution is close to stationary, the stationary population has many more deaths than the actual number. Secondly, our formulas measure precisely the age-specific sensitivity of the crude death rate. In this context, we mention also the more general results in the interesting papers by Aburto et al. (2020), Vaupel (2021) and Nigri et al. (2022).

We used a large data set and found that over-prediction of crude death rate is widespread. The deviation between the observed and stationary populations is an important driver of the mismatch in the actual and expected number of deaths. We present both short-term (decades) and long-term (centuries) of analysis suggesting that the demographic transition produced persistent bulges at ages where the mortality rates are lower. The difference between actual deaths and stationary deaths is mainly driven by the central age groups. This is interesting since this means that more people than we might expect (under stationarity or stability) arrive at adult age. These factors help explain why we find widespread over-prediction of the actual number of deaths.