Data
We use the estimates of male period life expectancy at birth from the UN World Population Prospects (WPP) 2008 Revision from 1950 through 2005 (United Nations 2009). Period life expectancy refers to the life expectancy of a hypothetical cohort subjected to current mortality rates throughout its life (Preston et al. 2001: Section 3.1). Because of the significant impact of the HIV/AIDS epidemic on mortality rates, we do not include countries with a generalized HIV/AIDS epidemic in this analysis. We base our results on data from 158 countries, comprising about 90 % of the world’s population.
The UN produces estimates of age-specific mortality and period life expectancy at birth for 230 countries and areas, updated every two years. The data available vary widely between countries, with only 89 countries (39 %) having good vital registration data allowing direct and accurate estimation of age-specific mortality rates. Other countries have incomplete vital registration data (32 countries), summary estimates of child and adult mortality (38 countries), or estimates of child mortality only (49 countries), based on surveys, censuses or administrative records. Twenty-two small areas have no recent data at all.
There are many problems with the available data, particularly in the majority of countries without good vital registration data. These include the absence of age and sex breakdowns in census data, highly questionable census counts, incomplete geographical coverage, and major divergences between mortality estimates from different sources. The UN adjusts the available data in light of knowledge about the biases and quality of the different data sources. About half the countries lack age-specific mortality data; in these cases, the UN uses model life tables and relational models to estimate life expectancy from the available summary information about child and adult mortality.
There is considerable regional variation in the availability of reliable data. Of the 50 countries in Asia, 56 % have “reliable” or “fairly reliable” vital statistics, whereas 95 % of the countries in Europe and North America have reliable statistics. This number decreases dramatically in Africa, where only 5 of the 54 countries (9 %) maintain “reliable” or “fairly reliable” vital statistics (United Nations 2006).
Current UN Population Projection Methodology
Currently, the UN projects life expectancy at birth deterministically. The life expectancy at birth, \(\ell _{c,t + 1}\), for country c, in the next five-year period, \(t + 1\), is projected to be the life expectancy in the current time period, \(\ell _{c,t}\), plus the expected gain in life expectancy, \(g(\ell _{c,t})\). Observed five-year gains in life expectancy for 158 countries from 1950 to 2005 are plotted in Fig. 1. This figure highlights the nonconstant rate of change in life expectancy. To capture this, the UN has developed models that represent the gains in life expectancy by a double-logistic function of current life expectancy. The five deterministic UN models are shown in the bottom panel, where models vary by pace of gains in life expectancy.
The double-logistic function (Meyer 1994) has six parameters, as illustrated in Fig. 2. Four of them identify intervals of life expectancy when the rate of life expectancy gains is changing, one describes the approximate maximum gain in life expectancy, and the last parameter gives the asymptotic rate of gains as life expectancy increases. For each country, a UN analyst chooses one of five prescribed choices of the six parametersFootnote 1 by assessing the recently observed pace of mortality decline (United Nations 2009). The model implies that beyond a certain point, life expectancy increases at an effectively constant rate. This is consistent with research indicating that there is no evidence of an upper limit to life expectancy (Cohen and Oppenheim 2012; Oeppen and Vaupel 2002).
The logistic function has been used for more than a century to model population growth. Marchetti et al. (1996) showed that a sum of logistic functions can be used to model not only the adoption and substitution of competing technological innovations (Fisher and Pry 1971; Meyer 1994; Meyer et al. 1999) but also the social diffusion, learning, and adoption of new ideas, norms, attitudes, and behaviors associated with the fertility and mortality transitions (Marchetti 1997; Marchetti et al. 1996; Potter et al. 2010) or nuptiality (Goldstein and Kenney 2001; Hernes 1972; Li and Wu 2008).
The transition from high to low mortality can be decomposed into two processes, each of which can be approximated by a logistic function. The first process consists of initial slow growth and diffusion of progress against mortality (e.g., small mortality improvements at low levels of life expectancy associated with diffusion of hygiene and improved nutrition), followed by a period of accelerated improvements, especially for infants and children (e.g., larger gains associated with greater social and economic development, and mass immunization). The second process kicks in once the easiest gains have been achieved against infectious diseases, and produces continuing gains against noncommunicable diseases. These improvements occur at a slower pace because of ever-greater challenges to the prevention of premature deaths at older ages resulting from cardiovascular diseases or neoplasms, and to the delay of the onset of aging (Fogel 2004; Riley 2001).
To summarize, the UN projects life expectancy in the next time period deterministically using the equation
$$ \ell_{c,t+1} = \ell_{c,t} + g(\ell_{c,t}). $$
(1)
The expected five-year gain in life expectancy is a double-logistic function of the current level of life expectancy—namely,
$$ \begin{array}{lll} g(\ell_{c,t}| \boldsymbol{\uptheta}^{c}) &=& \frac{k^{c}}{1+\exp\left(-\frac{A_{1}}{\Delta_{2}^{c}}\left(\ell_{ct}-\Delta_{1}^{c} - A_{2} \Delta_{2}^{c}\right)\right)} \\&& + \frac{z^{c}-k^{c}}{1+\exp\left(-\frac{A_{1}} {\Delta_{4}^{c}} \left(\ell_{ct}-\sum_{i=1}^{3}\Delta_{i}^{c}-A_{2} \Delta_{4}^{c}\right)\right)}. \end{array} $$
(2)
In Eq. (2), \(\boldsymbol {\uptheta }^{c} = (\Delta _{1}^{c},\Delta _{2}^{c},\Delta _{3}^{c},\Delta _{4}^{c},k^{c},z^{c})\) are the six parameters of the double-logistic function for country c, whose meaning is illustrated in Fig. 2. The vector \(\boldsymbol {\uptheta }^{c}\) of the parameters for country c are chosen by a UN analyst from the five possibilities \((\boldsymbol {\uptheta }^{\text {Very Slow}}, \boldsymbol {\uptheta }^{\text {Slow}}, \boldsymbol {\uptheta }^{\text {Medium}}, \boldsymbol {\uptheta }^{\text {Fast}},\boldsymbol {\uptheta }^{\text {Very Fast}})\). The constants \(A_{1} = 4.4, \, A_{2} = 0.5\) are chosen so that the parameters \(\{ \Delta _{i}^{c}: \, i=1,2,3,4 \}\) are on an interpretable scale, but they are arbitrary in that they could be changed without altering the results, provided that their product, \(A_{1} A_{2}\), remains unchanged.
Stochastic Model
The UN projection method is deterministic and does not account for uncertainty. We now extend it to a stochastic model to allow for uncertainty. This involves two extensions. The first allows for stochastic changes within a country by replacing the deterministic model in Eq. (1) with a stochastic one by adding a random perturbation to Eq. (1). It then becomes a random walk with drift, where the drift term is given by the double-logistic function.
The second extension is to allow the parameters of the double-logistic function to vary between countries over a continuous range rather than among the current five UN possibilities. The resulting hierarchical model is
$$ \ell_{c,t+1} = \ell_{c,t} + g\left(\ell_{c,t}| \boldsymbol{\uptheta}^{(c)} \right) + \upvarepsilon_{c,t+1} , $$
(3)
where
$$\begin{array}{rll} g\left(\ell_{c,t}| \boldsymbol{\uptheta}^{c} \right) &=& \text{Double-Logistic function with parameters }\boldsymbol{\uptheta}^{c}{\kern-1.25pt} , \\ \boldsymbol{\uptheta}^{c} &=& \left(\Delta_{1}^{c},\Delta_{2}^{c},\Delta_{3}^{c},\Delta_{4}^{c},k^{c},z^{c}\right){\kern-1.25pt} , \\ \Delta_{i}^{c}|\upsigma_{\Delta_{i}} &\stackrel{\textrm iid}{\sim}& \text{Normal}_{[0,100]}\left(\Delta_{i}, \upsigma_{\Delta_{i}}^{2}\right){\kern-1.25pt}, \;\;\;\; i=1,\ldots,4 , \\ k^{c} |\upsigma_{k} & \stackrel{\textrm iid}{\sim} & \text{Normal}_{[0,10]}\left(k, \upsigma_{k}^{2}\right){\kern-1.25pt} , \\ z^{c} | \upsigma_{z}& \stackrel{\textrm iid}{\sim} & \text{Normal}_{[0,1.15]}\left(z, \upsigma_{z}^{2}\right){\kern-1.25pt}, \end{array}$$
where Normal\(_{[a,b]} (\upmu , \upsigma ^{2})\) denotes a normal distribution with mean \(\upmu \) and standard deviation \(\upsigma \), truncated to lie between a and b.
This model allows us to pool information about the rates of gains across countries by assuming that each set of country-specific double-logistic parameters is randomly sampled from a common truncated normal distribution. The normal distribution is truncated such that all the double-logistic parameters are positive.
The parameter \(z^{c}\) is the asymptotic average rate of increase in life expectancy per five-year period. Our prior distribution for this is informed by the results of Oeppen and Vaupel (2002), who found a strong positive linear trend in the “best practices” life expectancy (i.e., the highest life expectancy in a given year) from the mid-nineteenth century through 2000. By assuming that \(z^{c}\) is nonnegative, we are assuming that life expectancy will continue to increase, on average. In their regression of highest male life expectancy on year, Oeppen and Vaupel (2002) estimated a slope of 1.11 years per five-year period, with \(R^{2} = .98\). Because this is the rate of increase for “best practices” countries, we assume that the asymptotic rate of increase for any given country will not exceed the upper bound of a 99.9 % confidence interval for this estimate—namely, 1.15.
To specify the distribution of the random perturbations, \(\upvarepsilon _{c,t}\), we first estimated the model assuming them to be normally distributed with a constant variance, using the estimation method described later. Figure 3 shows the absolute residuals from this fit with a fitted regression spline. The spread of the residuals clearly decreases with increasing life expectancy. To account for this, we modeled \(\upvarepsilon _{c,t}\) as normally distributed with standard deviation proportional to the regression spline fitted to the absolute residuals shown in Fig. 3, so that
$$ \upvarepsilon_{ct} \stackrel{\textrm iid}{\sim} N(0, (\upomega\times f(l_{c,t-1}))^{2}) . $$
(4)
Our stochastic model builds on that proposed by Alkema et al. (2011) for probabilistic projection of the TFR for all countries. However, it differs in several respects. The double logistic model for the gains in life expectancy is more general than that for total fertility rate, since it asymptotes at a nonzero level, \(z^{c}\), which is estimated from the data for each country c. Also, prior information about the range of plausible values of \(z^{c}\) is available from other research, and this is incorporated explicitly via the Bayesian prior distribution. In the TFR model, in contrast, the prior distributions were largely uninformative.
Parameter Estimation
We adopt a Bayesian approach to estimating our model, making it a Bayesian hierarchical model. This requires specifying prior distributions for the 13 world parameters of the model: \((\Delta _{i}, \upsigma ^{2}_{\Delta _{i}})\) for \(i=1,\ldots ,4\); k, \(\upsigma ^{2}_{k}\), z, \(\upsigma ^{2}_{z}\), and \(\upomega \). We specify prior distributions that are proper but much more diffuse than the posterior distributions.
We set \(\Delta _{i} \sim N_{[0,100]}\left (a_{i},\updelta _{i}^{2}\right )\) for \(i=1,\ldots ,4\), \(k \sim N_{[0,10]}\left (a_{5}, \updelta _{5}^{2}\right )\) and \(z\,\sim N_{[0,1.15]}\left (a_{6}, \updelta _{6}^{2}\right )\). We set \((a_{1},\ldots ,a_{6})\) to the values specifying the UN medium-pace model: \((15.77,40.97,0.21,19.82,2.93,0.40)\). We set \(\left (\updelta _{1}^{2},\ldots ,\updelta _{6}^{2}\right )\) to the variances of the parameters among the different UN models.
For the world variance parameters—\(\upsigma ^{2}_{\Delta _{i}} (i=1,\ldots ,4)\), \(\upsigma ^{2}_{k}\), and \(\upsigma ^{2}_{z}\)—we used inverse-gamma prior distributions with 4 degrees of freedom (i.e., a shape parameter equal to 2). To set the parameters of these priors, we first fit the double-logistic model by least squares to the data from each country individually; then, for each parameter, we computed the empirical average squared deviations from the values for the UN medium-pace model. Next, we set the prior means of the reciprocals of the world variance parameters equal to the reciprocals of these values. This yielded rate parameters \((15.6^{2}, 23.5^{2}, 14.5^{2}, 14.7^{2}, 3.5^{2}, \) and \( 0.6^{2})\) for the six inverse-gamma prior distributions. The resulting prior distributions are guaranteed to be much more spread out than the posterior distribution. Finally, a diffuse Uniform [0,10] prior was used for \(\upomega \).
Experiments showed that the results were insensitive to changes in these priors, which is to be expected because the resulting prior distribution is much more spread out than the posterior distribution.
The posterior distribution of the world and country-level parameters was approximated by Markov chain Monte Carlo implemented in R. The approximately 1,000 parameters were updated one at a time, using Gibbs sampling (Gelfand and Smith 1990), Metropolis-Hastings sampling (Chib and Greenberg 1995; Hastings 1970), or slice sampling (Neal 2003). We used three chains, each the length of 100,000 scans, with a burn-in of 10,000 scans. Visual inspection of the trace plots, the Raftery-Lewis diagnostic (Raftery and Lewis 1992), and the Gelman-Rubin statistic (Gelman and Rubin 1992) indicated that the chains had converged and had explored the posterior distribution enough to yield good estimates of posterior quantiles of interest.
A free publicly available R software package called bayesLife is available to implement the method (Ševčíková and Raftery 2011). An additional R package called bayesDem, also freely and publicly available, provides a graphical user interface for bayesLife (Ševčíková 2011).