Keywords

2.1 Introduction

Probabilistic population forecasting has recently received growing attention from researchers and, to a lesser extent, from official agencies, which traditionally derive population projections deterministically. As discussed in Keilman et al. (2002) and Keilman (2018), there are three main approaches to stochastic population forecasting. The first approach relies on the theory of time series, with models suggested both by the frequentist and the Bayesian approaches. The best-known time series approach in a classical framework is due to Lee and Carter (1992), originally proposed to forecast mortality, and later modified to address fertility forecasting, see Lee (1993) and Lee and Tuljapurkar (1994). Many extensions, generalizations and modifications have been proposed: see, among others, Booth et al. (2002), Booth and Tickle (2008), Booth (2006) Cairns et al. (2006, 2011), Hyndman and Ullah (2007), Hyndman and Booth (2008), and Hyndman et al. (2013). Using the Bayesian approach, Alkema et al. (2011) suggest a Bayesian hierarchical time series model for fertility forecasting, Raftery et al. (2013) for mortality forecasting, and Bijak and Wiśniowski (2010) and Bijak and Bryant (2016) for migration forecasting. As a sign that probabilistic approaches are entering the mainstream of demographic forecasting, in 2014 the United Nations for the first time issued official probabilistic population projections for all countries up to 2100, see Alkema et al. (2015).

The second approach derives population forecasts based on the extrapolation of empirical errors. The observed errors from historical forecasts are used in the assessment of the uncertainty, see, among others, Stoto (1983). Following this approach, Alho and Spencer (1990) proposed the Scaled Model of Error, which was used to obtain stochastic population forecasts within the UPE (Uncertainty Population of Europe) project, see Alders et al. (2007).

The third approach, known as random scenarios or expert based approach, derives the forecast distribution of demographic components based on suitably elicited expert evaluations on their future trend, see, among others, Lutz et al. (1998). This is the approach that we follow in the present paper. The advantages and disadvantages of methods that rely on expert evaluations have been widely discussed in the literature. Goldstein (2004) and Lutz and Goldstein (2004), among others, stress how the random-scenario approach might be appealing to official agencies, due to its simplicity, the fact that its framework is based on scenarios, and the direct involvement of experts. The use of expert opinions allows taking into account behavioural theories on the future of the population (as argued by Lutz 2013) and allows incorporating in the forecasting exercise the knowledge of trends (such as policy changes and environmental changes) that might have an impact on the population dynamics. A further important advantage of expert based forecasting is that it does not require data on the past and therefore can be especially useful for developing countries, for which past data are usually poor. The main criticism of the expert based approach is related to the well-known and widely observed tendency of experts to underestimate the uncertainty. Keilman (1990) observed that, particularly when recent trends have been stable, the overconfidence of experts results in overly narrow prediction intervals. Among others, Alho and Spencer (1990) stress the conservativeness of expert opinions with respect to the decline in mortality, while Lee (1993) and Booth (2006) express concerns with respect to the accuracy of fertility forecasts. A further recognized drawback is that a forecast approach based on expert evaluations needs to focus on summary indicators of the demographic changes, and therefore turns out to be inflexible in forecasting age-schedules. Moreover, existing random-scenario methods, being generally based on trajectories that are obtained by the interpolation of a starting known and a final random value, are characterized by a variance and covariance structure which is not particularly flexible. Finally it is commonly emphasized that it is not easy to elicit from experts opinions on the across-time correlations for a single indicator and correlations between indicators.

Our method derives probabilistic population forecasts based on expert opinions in such a way as to take into account relations both between the demographic components and between the expert evaluations. As for the first kind of relation, between demographic components, there is a certain debate about the advisability and/or need to model such dependence. Indeed, if for some pairs of indicators the dependence is not under scrutiny, as for male and female life expectancies at birth, in other cases, as for immigration and fertility, it is more questionable. It is common practice in population forecasting to assume independence between the three components of demographic change, fertility, mortality and migration, for which separate forecasts are provided. Our method lets expert evaluations on the future trends of the demographic components drive the detection of the presence and assessment of the strength of any relations between them. Indeed, in principle the method we suggest can take into account any type of dependence between any pair of indicators. Our method does not exclude independence between the demographic components, an independence that can be the result of expert opinions.

Our method also takes into account any dependencies between expert evaluations. Indeed, we expect that experts who have been trained and work in the same field would share a certain amount of knowledge and information, which could induce associations between their opinions. In an expert based approach, an important and delicate issue to face is how to combine the opinions provided by several experts. A wide literature is available on the problem of the aggregation of expert opinions, see, among others, Genest et al. (1986) for a review. Popular pooling methods suggest combining expert opinions by working out averages. For instance, the linear rule derives the collective assignment through their (possibly weighted) average. Similarly, one can define geometric or logarithmic pooling rules. Such pooling techniques take into account the variability of the expert evaluations, but do not take into account their potential associations, associations that we think cannot be neglected. Here we suggest a method for combining expert opinions that allows modelling both the associations between them and their diversity, taking into account several sources of uncertainty.

We suggest combining the expert evaluations by resorting to the so-called Supra-Bayesian method of pooling, introduced by Morris (1974) and then developed by many authors, see, among others, French (1980, 1981), Winkler (1981), Lindley (1983, 1985), Gelfand et al. (1995), and Roback and Givens (2001). In this approach, expert opinions on unknown quantities are treated as observations and combined based on the theoretical framework provided by the Bayesian approach to statistics. The analyst specifies a likelihood function, to be parametrized in terms of the unknown objects, and a prior distribution for the parameters. The posterior distribution, obtained by applying Bayes’s theorem, updates the analyst’s prior opinion, on the basis of the evaluations provided by the experts, and can then be used as a forecast distribution for the unknown quantities of interest. This approach takes into account and exploits the variability of the expert evaluations. Hence, the larger the number of experts, the more informative the procedure is.

In the next section, we provide a description of the method that was first suggested in Billari et al. (2014). We discuss in detail the elicitation procedure, the model, and the Markov Chain Monte Carlo algorithm. In Sect. 2.3 we describe the results of applying the model to forecasting of the Italian population from 2010 to 2065. In Sect. 2.4 some concluding remarks are provided.

2.2 The Supra-Bayesian Forecasting Method

As is common practice in the expert based approach, our method focusses on summary indicators of the three components of demographic change: fertility, mortality and migration. Population forecasts by age and sex are then obtained, relying on the commonly used cohort-component model, with age-schedules derived from the corresponding summary indicators, based on suitable models. In the following we describe the method, considering the case of two summary indicators R 1 and R 2 to be jointly forecast from time t 0 to time T. The inputs of the method are the expert opinions, which we presume to have been elicited according to the conditional elicitation procedure suggested in Billari et al. (2012).

The elicitation procedure works as follows. Split the forecast interval [t 0, T] into two subintervals, considering a time point t 1 in it. In the first stage, the expert is asked to provide a forecast for each indicator at time t 1 and at time T, and an upper quantile for one of the two indicators at time t 1, say for instance R 1, as a value such that R 1 takes on a greater value with a predetermined probability. In the implementation of the method, this probability is set equal to 10%. In the second stage, the expert is asked to provide the following conditional forecasts:

  • A forecast and an upper quantile at t 1 for the second indicator R 2 presuming that R 1 takes at t 1 a value equal to the elicited upper quantile and the forecast respectively;

  • A forecast and an upper quantile at T for R 1 presuming that it takes at t 1 a value equal to the elicited upper quantile and the forecast respectively;

  • Three different forecasts at T for R 2 presuming three different combinations of values for R 1 at t 1 and T and R 2 at t 1.

In order to understand how the indicators’ mean and variance, along with their correlations, can be derived from the elicited values, consider the case of one single indicator. In the case of the forecast of one single indicator, the expert should provide at the first stage forecasts for times t 1 and T, say \(m_{t_1}\) and m T, and an upper quantile at time t 1, say \(q_{t_1}\) as a value such that there is a probability equal to α that the indicator takes on a value greater than \(q_{t_1}\). We assume Gaussian distributions for the indicator at the two time points, with means \(m_{t_1}\), respectively, m T. Under the Gaussian assumption, the variance \(\sigma ^2_{t_1}\) of the indicator at time t 1 can be easily derived from \(m_{t_1}\) and \(q_{t_1}\) as follows:

$$\displaystyle \begin{aligned}\sigma^2_{t_1}=\left(\frac{q_1-m_1}{z_{1-\alpha}}\right)^2\end{aligned}$$

with z 1−α being the quantile of order 1 − α of a standard Gaussian random variable.

At the second stage, the expert is asked to provide a forecast, say \(m_{T|t_1}\), of the indicator at time T presuming that it takes at time t 1 a value equal to the elicited quantile \(q_{t_1}\) and an upper quantile of the indicator at time T, say \(q_{T|t_1}\), presuming that at time t 1 the indicator is \(m_{t_1}\). Under the assumption of Gaussian distributions at the two time points, the conditional distribution of the indicator at T given that it is equal to \(q_{t_1}\) at t 1 is Gaussian, has mean \(m_{T|t_1}\) and variance \(\sigma ^2_{T|t_1}\) that can be derived as before from \(m_{T|t_1}\) and \(q_{T|t_1}\). The conditional distribution of the indicator is in this way completely specified, so that the correlation between the indicator at the two time points can be derived from standard results on Gaussian distributions.

This method can be easily generalized to the case of two indicators to be jointly forecast at t 1 and T. Therefore, the elicitation procedure allows indirectly eliciting across-time correlations for a single indicator, as the correlation between the rates at the two considered time points t 1 and T, and correlations both at the same time and across time for a pair of indicators, by asking for conditional forecasts.

This elicitation procedure yields vectors of forecasts of the two indicators at the two time points and their covariance matrix, one vector for each expert. In the method we suggest, the forecasts and the covariance matrices are used in a different way. We follow the Supra-Bayesian approach and suggest treating as data the forecasts provided by each expert at the two time points. In a Bayesian approach to inference, the analyst should, then, specify both the likelihood function, describing the random mechanism generating the evaluations and therefore to be parametrized in terms of the demographic summary indicators, and a prior distribution of these parameters, incorporating any information the expert has on them.

The likelihood function shapes the dependences between the expert evaluations. In Lindley (1983, 1985) a multivariate Gaussian distribution is used. Such a choice is motivated primarily by mathematical convenience, since it simplifies all computations related to the derivation of the posterior distributions. Nevertheless, the construction of a likelihood function of this kind is cumbersome, due to the large number of terms to be specified. Indeed, in the case of opinions elicited on several indicators at different time points, the choice of a multivariate Gaussian distribution requires the specification of all marginal means and variances and covariances.

Albert et al. (2012) suggest relying on a hierarchical random effects model, as a more parsimonious approach. At the beginning of the analysis, the experts are grouped by the analyst into a fixed number of homogeneity classes, corresponding to similar backgrounds or similar schools of thought. At the first level of the hierarchy, the opinions provided by the experts belonging to the same group are assumed to have the same distribution, indexed by parameters varying across groups. Then the different groups are assumed to have a common knowledge that is linked through a common distribution assigned to the group parameters and indexed by the parameter that represents the object of the expert evaluations. Finally, at the last level, a prior is assigned to this parameter, representing the overall uncertainty of the elicitation.

We suggest choosing a mixture model for the likelihood. Through this choice, we assume, as in Albert et al. (2012), that there are several different random mechanisms generating the expert evaluations, but we do not know which is the random mechanism generating the evaluations provided by each expert. Again, we presume that the experts can be grouped into a given number of classes, based on their shared knowledge and information, but for each expert we do not know which is the class the expert belongs to. We let the opinions provided by the experts determine their group membership, so as to implicitly derive the dependence structure of the expert evaluations.

On the side of the prior distributions, as in Albert et al. (2012), the group centres are assumed to be independent and to have the same distribution, centred at the vector of summary indicators. In this way, we take into account the heterogeneity of the expert evaluations due to their possessing different pieces of information. Finally, we use the elicited covariance matrices to specify the prior distribution of the unknown clusters covariance matrix.

The resulting hierarchical model can be schematized as in Fig. 2.1, for the case of K experts where x i is the vector of forecasts provided by expert i on the two indicators at two time points and \(R= (R_{1t_1},R_{1T},R_{2t_1},R_{2T})\) with R jt being the random variable associated with indicator j at time t. The evaluations of the two summary indicators at the two time points are assumed to be conditionally independent and drawn from a mixture of J multivariate Gaussian distributions of dimension 4, each denoted by N 4(μ j, Σj), for j = 1, ⋯ , J and with J fixed by the analyst, being the number of groups of experts, with weights p 1, …, p J. We assume in this way that each expert evaluation is distributed according to N 4(μ j, Σj) with probability p j. As for the prior distributions, the group means μ j are assumed to be independent conditional on the covariance matrix Σj and distributed according to a multivariate Gaussian distribution centred at the vector of summary indicators at the two time points R, and with covariance matrix equal to Σj scaled by k 0 so as to end up with a diffuse prior, as discussed below. The covariance matrices Σj are assumed to be independent and identically distributed according to an inverse-Wishart distribution with scale matrix Σ0 and n 0 degrees of freedom. The group probabilities p 1, …, p J are assumed to have a Dirichlet distribution with parameters (α 1, …, α J). The vector of summary indicators at the two time points R is assumed to have a multivariate Gaussian distribution. It is worth emphasizing that this choice of prior distributions ensures conditional conjugacy (see, among others, Lavine and West 1992), which is something we draw on in the design of the Markov Chain Monte Carlo algorithm needed for the simulation of the posterior distribution of the vector of summary indicators R, described later.

Fig. 2.1
figure 1

The mixture model

The analyst needs then to specify the number J of components of the mixture and the parameters of the priors Σ0, k 0, n 0, α, μ R, ΣR. The number of components can be chosen by fitting models with different J and then comparing them on the basis of indexes such as the Bayesian Information Criterion (BIC) or the Akaike Information Criterion (AIC). Since Σ0 is the centre of the prior on the groups covariance matrix, we suggest specifying it based on the elicited covariance matrices. In our implementation of the model, we set Σ0 equal to the arithmetic average of these covariance matrices, scaled so as to increase the variance of the elicited indicators. In this way we take into consideration and can correct the over-confidence of the experts, who tend to underestimate the variability of their forecasts. Since μ R is the centre of the prior assigned to vector R, it represents a prior guess of the future values of the indicators and can then be specified using all available information. For instance, it can be fixed based on the central scenarios provided by national and international statistical agencies.

As for the remaining hyper-parameters, we suggest specifying them so as to end up with very diffuse priors. In this way, the posterior distribution can be mainly determined by the data, the expert elicited forecasts. Indeed, k 0 and n 0 affect the spread of the prior distributions on the group means and on the group covariances, respectively: the smaller they are, the larger is the spread. We suggest setting them as small as possible in order to increase the variability of the priors. Due to the properties of the Dirichlet distribution, the smaller is the value of α j, the larger is the variability. Moreover α j is the probability for an expert to belong to group j. A standard choice to depict no prior information on the group membership is \(\alpha _j=\frac {1}{J}\). ΣR is the covariance matrix of the prior distribution on R. We suggest choosing rather high variances so as to end up with a diffuse prior, and setting the covariances equal to 0, which corresponds to assuming the a priori independence of the indicators.

The joint posterior distribution of the indicators \((R_{1t_1},R_{1T},R_{2t_1},R_{2T})\) can then be used as their forecast distribution at the two considered time points. Since this cannot be expressed in closed form, we suggest a Markov Chain Monte Carlo algorithm to draw samples from it. More precisely, we develop an auxiliary variables Gibbs-sampler, with full-conditionals that are all available in closed form due to the conditional conjugacy ensured by the choice of the prior distributions. For each observation, we introduce at each iteration an auxiliary variable Z i taking values in {1, 2, …, J}, which flags its group membership and is updated each iteration. At each iteration of the algorithm, the group means and covariance matrices are updated by drawing them from a multivariate Gaussian distribution and an inverse-Wishart distribution respectively, the vector of latent variables is updated by drawing each component from a discrete distribution on {1, 2, …, J}, the vector of group probabilities (p 1, …, p J) is updated by drawing it from a Dirichlet distribution and the vector R of summary indicators is updated by sampling it from a multivariate Gaussian distribution. The draws of the summary indicators from the joint posterior distribution are used as forecasts of the two summary indicators at the two time points, while forecasts for all points of the interval are obtained by resorting to suitable interpolation methods. In the application discussed in the next section, standard elementary quadratic interpolation techniques are used. As a by-product, the draws of the latent variables Z 1, …, Z K can be used for the estimation of the composition of the groups, that is, the clustering of experts in the J groups.

The Matlab package supraBayesian_popproj, downloadable from the web site of the publication, provides the codes implementing the Gibbs-sampler along with the codes for the derivation of the population forecasts by age and sex based on the simulations from the posterior distribution of the summary indicators.

2.3 An Application: Forecasting the Italian Population

In this section we illustrate an application of our forecasting method. The experts opinion used as inputs of the model were elicited according to the described procedure, through a questionnaire administered in 2012 in collaboration with the Italian Statistical Office (ISTAT). Experts were provided with information on the latest scenarios depicted by Eurostat and by the United Nations on the Italian summary indicators of demographic change. In 2015 the first official probabilistic population forecasts of the Italian population were issued by ISTAT starting from such elicited opinions. The Italian Statistical Office followed the method suggested in Billari et al. (2012) for the derivation of expert-based forecasts of the summary indicators. In the ISTAT forecasting exercise, the indicators were treated as independent and a multivariate Gaussian distribution was taken as the forecast distribution, with mean and covariance matrix obtained by averaging across the experts’ elicitations. In 2017, ISTAT provided an update of the population projections of 2015, based on the same elicited opinions; a detailed description of the implemented methodology is provided in ISTAT (2017).

The forecasting period was 2010–2065 and was split into two sub-intervals, employing 2030 as the midpoint. The opinions were elicited on the following summary indicators: Total Fertility Rate, Mean Age at Birth, Male and Female Life Expectancies at Birth, Total Number of Immigrants and of Emigrants. The opinions on Total Fertility Rate and Total Number of Immigrants were jointly elicited, as were the opinions on Male and Female Life Expectancies at birth. Figure 2.2 displays the forecasts of the Total Fertility Rate and of the Total Number of Immigrants at 2030 and 2065 provided by 14 experts, while Fig. 2.3 depicts the corresponding correlations indirectly elicited.

Fig. 2.2
figure 2

Expert forecasts of Total Fertility Rate and Total Number of Immigrants

Fig. 2.3
figure 3

Elicited correlations: Total Fertility Rate (average number of children per woman) and Total Number of Immigrants (in thousands)

With the Total Fertility Rate, there was low variability across expert evaluations: almost all the experts foresee a moderate increase in the rate from 2030 to 2065. With the Total Number of Immigrants, the evaluations show a higher variability, especially for 2065; the majority of experts forecast a decrease in the Total Number of Immigrants. As to the correlations, there is a general agreement on a positive high correlation between Total Number of Immigrants at 2030 and Total Number of Immigrants at 2065 and on a positive moderate/high correlation between Total Fertility Rate at 2030 and at 2065. For the majority of experts there is a positive correlation between Total Number of Immigrants at 2030 and Total Fertility Rate at 2030 and no correlation between the two rates at 2065. With regard to the correlation between Total Number of Immigrants and Total Fertility Rate at two different time points, for one-half of the experts there is no correlation and for the other half a moderate/high negative correlation between Total Number of Immigrants at 2030 and Total Fertility Rate at 2065, while all experts agree on there being no correlation between Total Fertility Rate at 2030 and Total Number of Immigrants at 2065.

Figure 2.4 presents the forecasts of the Life Expectancies for males and females and Fig. 2.5 presents the corresponding correlations. Note that the forecasts were provided by 16 experts, but only nine provided all inputs needed for the derivation of the correlations. All forecasts show a low variability, both at 2030 and 2065. With regard to the correlations, there is agreement among the experts on the correlation of Male Life Expectancy at the two time points and on the correlation between Male and Female Life Expectancies at 2030, all experts forecasting a positive high correlation. Similarly, almost all experts forecast a positive high correlation between Female Life Expectancy at 2030 and Male Life Expectancy at 2065. Regarding the correlation between Female Life Expectancy at the two time points, we observe for three experts a positive high correlation, for one expert a negative high correlation, and for all other experts, correlations almost equal to zero. In the case of the correlations between Male and Female Life Expectancies at 2065, three experts forecast a negative high correlation, one expert a very low positive correlation and the remaining five experts a positive high correlation.

Fig. 2.4
figure 4

Expert forecasts of Male and Female Life Expectancies (years)

Fig. 2.5
figure 5

Elicited correlations: Male and Female Life Expectancies

Similar disagreement is expressed about the correlation between Male Life Expectancy at 2030 and Female Life Expectancy at 2065: Two experts forecast a negative high correlation, three experts a zero correlation and four experts a positive high correlation.

Figure 2.6 displays the forecasts and correlations for Total Number of Emigrants provided by 16 experts. There is a high variability in the forecasts, both for 2030 and 2065. In particular, we can notice that six experts provided the same forecasts at 2030 and 2065, this is the reason why in the top panel of Fig. 2.6 only red asterisk is displayed for these experts. Regarding the across-time correlations, almost all experts forecast a positive high correlation. We could work out correlations only for 14 experts, since two of them did not provide the needed conditional forecasts.

Fig. 2.6
figure 6

Expert forecasts (in thousands) and correlations, Total Number of Emigrants

Based on the results of the elicitation procedure, the forecasting method explained in the previous section was then used to simulate the joint forecast distribution of Total Fertility Rate and Total Number of Immigrants at 2030 and 2065 and of the joint forecast distribution of Male and Female Life Expectancies at 2030 and 2065. The same method was applied to the separate simulation of the forecast distributions of the Total Number of Emigrants and of the Mean Age at Birth at 2030 and 2065. The prior parameters were specified as described in the previous section. In particular, the means and variances of the priors for the summary indicators were specified based on the ISTAT scenarios available in 2012: μ R was set equal to the vector of central scenarios and the variances for ΣR were derived from the high–low ISTAT scenarios available in 2012. The covariances were all fixed to 0. The mixture model was fit for different choices of the number J of components of the mixture, ranging from two to five. The model with two components was selected, since it had the smallest BIC.

The results shown in Tables 2.1, 2.2, 2.3, and 2.4 were obtained through a long run of the MCMC algorithm that provided 20,000 samples from the joint posterior distribution of the indicators at the two time points, 2030 and 2065; the first 10,000 were discarded, as burn-in. The convergence of the algorithm was assessed though different techniques, the trace plots of the chains run for Total Fertility Rate and Total Number of Immigrants and discarding the first 10,000 draws are depicted in Fig. 2.7. The analysis can be replicated using the Matlab code “supraBayesian_popproj” available in the online material of this book.

Fig. 2.7
figure 7

Trace plots, TFR as average number of children per woman, Total number of Immigrants in thousands

Table 2.1 2030 and 2065 Demographic Indicators: Prior, expert evaluations and posterior means and standard deviations, TFR as average number of children per woman, Life Expectancies as number of years, Total Number of Immigrants and of Emigrants in thousands
Table 2.2 Prior and posterior correlations, Total Number of Immigrants and Total Fertility Rate
Table 2.3 Prior and posterior correlations, Male and Female Life Expectancies at birth
Table 2.4 Total population forecasts and ISTAT estimates (in millions)

Table 2.1 shows the prior and posterior means and standard deviations for the summary indicators at 2030 and 2065, along with the arithmetic average and standard deviations of the corresponding expert opinions. For all indicators, as expected the posterior standard deviation at 2030 is smaller than the one at 2065, and both posterior standard deviations are smaller than the prior ones, since noninformative priors are used. Our forecasts show a lower variability compared against the one induced by ISTAT scenarios. The ISTAT central scenario, used as prior mean, predicts a Total Fertility rate equal to 1.5 both for 2030 and 2065, the arithmetic average of the expert opinions is 1.55 at 2030 and 1.65 at 2065, and our model predicts, as posterior mean, 1.53 at 2030 and 1.64 at 2065. The same kind of pattern can be observed for the Total Number of Immigrants, for which the ISTAT central scenario, used as prior mean, predicts for 2030 321,000 and for 2065 304,000; the arithmetic average of the expert elicitations is around 254,000 for 2030 and 212,000 for 2065; while the model forecasts, as posterior mean of the indicator, are 280,000 for 2030 and 262,000 for 2065. Regarding the Life Expectancies, the ISTAT central scenario, used as prior mean, predicts a Male Life Expectancy equal to 82.80 at 2030 and equal to 86.60 at 2065, a Female Life Expectancy equal to 87.70 at 2030 and equal to 91.50 at 2065, while the arithmetic averages of the expert opinions are 83.01 for 2030 and 86.96 for 2065 for males and 87.24 and 90.88 for females. The mixture model predicts posterior means of Male Life Expectancy equal to 82.93 at 2030 and to 86.89 at 2065, and a Female Life Expectancy equal to 87.21 at 2030 and to 91.02 at 2065. The ISTAT central scenario on Total Number of Emigrants predicts 101,000 emigrants in 2030 and 128,000 in 2065; the arithmetic average of the expert evaluations is 70,000 and 62,810 for 2030 and 2065 respectively; and the model predicts a Total Number of Emigrants equal to 91,480 in 2030 and 91,010 in 2065.

Table 2.2 provides the prior and posterior correlations at the same time (2030 and 2065) and across time for the Total Fertility Rate and the Total Number of Immigrants, and the correlations at the same time and across time between the two summary indicators. It is worth emphasizing that the prior correlations are derived from Σ0, which was obtained as the scaled arithmetic average of the covariance matrices elicited from each expert, while the posterior correlations are obtained from the 10,000 draws of the two rates at the two time points. The model predicts a moderate positive posterior across-time correlation for the Total Number of Immigrants and a moderate/low positive across-time correlation for Total Fertility Rate. All posterior correlations between the two rates are around zero, apart from the correlation between Total Number of Immigrants at 2030 and Total Fertility Rate at 2030, equal to 0.1288. The forecast of this positive, even though weak, correlation is in concordance with Sobotka (2003), Sobotka et al. (2008), Haug et al. (2002), Coleman (2006), and Goldstein et al. (2009), who argue that fertility rates in many European countries may have been increased by the compositional effect of the rising share of higher-fertility immigrants. The fact that the correlation between the two rates is almost zero at 2065 is due, in our opinion, to the difficulty for the experts to express, even indirectly, opinions on the long term associations.

Table 2.3 presents the prior and posterior correlations at 2030 and 2065 and across-time for the Male and Female Life Expectancy. Based on the elicited opinions, our model predicts a moderate/high correlation between Male Life Expectancy at 2030 and 2065, between Male and Female Life Expectancy at 2030, and between Female Life Expectancy at 2030 and Male Life Expectancy at 2065. All other correlations are predicted to be around zero.

For each of the summary indicators, from the 10,000 values obtained as draws from the corresponding posterior distribution, 10,000 trajectories over the time interval from 2010 to 2065 are obtained by relying on standard quadratic interpolation techniques. The forecast of the Italian Population from 2010 to 2065 was then derived based on the cohort-component model. The inputs of the model are the age- and time-specific fertility rates, age- and time-specific male and female survival rates, and age- and time- specific net migration rates, obtained from the corresponding summary indicators by applying standard smoothing techniques. In particular, the matrices of male and the matrices of female age- and time-specific mortality rates are obtained from the corresponding life expectancies at birth on the basis of the extended model life tables provided by the United Nations. The matrices of age- and time-specific fertility rates are derived from the vectors of total fertility rates and the vectors of mean maternal ages at birth, using a rescaled normal model. For migration, the matrices of male and female age-specific net migration flows are derived from the corresponding vectors of total net flows, applying a rescaled gamma model. This is a simplifying assumption that assumes the absence of pre-school, retirement, and post-retirement peaks in the age profile of migrations, with the only peak being related to labour migration.

Starting from an estimated total population at 2010 of 60,343 million, our model predicts a slight increase at 2030, with the total population forecast to be 61,795 million with an 85% forecast interval ranging from 60,137 million to 63,475 million. After 2030, the total population is predicted to decrease, reaching 57,146 million, with an 85% forecast interval from 50,135 to 64,503 million. As expected, the latter forecasts have a higher variability.

Table 2.4 presents the Italian population forecasts and prediction intervals obtained through our method and the values estimated by ISTAT from 2011 to 2018. Overall, our forecasts are above the ISTAT estimates, with differences in absolute value ranging from 142,000 in 2011 to 1,265,000 in 2018. One explanation of this over-prediction might be found in Table 2.1, where we see that on average, expert opinions at 2030 and especially at 2065 on Total Fertility are well above what is expected by the ISTAT central scenarios, and the same for Male Life Expectancy. It is as well plausible that the experts did not perceive the persistence of the great recession, which was linked to lower fertility (see Goldstein et al. 2013, Comolli and Bernardi 2015, Comolli 2017 and Matysiak et al. 2018) and to lower levels of net migration (see Anelli and Peri 2017), leading to smaller population sizes. The failure of our method to capture the decrease in the total population estimated by ISTAT from 2014 to 2018 might be due as well to the interpolation techniques used for the derivation of the forecast indicators between the starting time 2010 and 2030 and between 2030 and 2065.

2.4 Concluding Remarks

The method we have suggested makes explicit use of expert evaluations to derive probabilistic forecasts of the future trends in the population by age and sex. Our method makes use of expert opinions not only about the expected future behaviour of the demographic components but also about the across-time correlations of single indicators and about the correlations between the indicators. The expert evaluations are then combined in such a way as to take into account their associations. The advantages and limitations of an expert based approach have been discussed in the Introduction. Here, it is worth emphasizing the fact that experts are always involved in the population forecast at different levels of the forecasting procedure and to different degrees. In the time series approach, experts contribute to the choice of the model and the specification of the prior distributions. In the extrapolation from past errors approach, experts provide the central trajectories and contribute to the evaluation of the forecasts. Furthermore, we do not neglect information on past trends when considering expert evaluations as the main source for deriving the population forecasts. Indeed, expert evaluations should be based as well on such information. Our method allows taking into account the overconfidence of experts in their opinions, which might produce an undervaluation of the uncertainty of the forecasts. The entire process is treated within the formal framework provided by the Bayesian paradigm.

Our modelling strategy has some specific limitations. The main limitation is that we have focussed on summary indicators of the demographic changes, which are then converted into age schedules based on parametric models. An extension of the method is in principle feasible, the main difficulty being related to the elicitation of opinions on curves, depicting age patterns. Moreover, our method does not take into account the uncertainty in the initial distributions of the population by age and sex, this being particularly problematic in the case of inconsistencies between the census-based and register-based population records. Experts could be asked to express their opinion on the initial structure by age and sex of the population as well. Lastly, our method exploits expert opinions to derive the forecast distribution of two summary indicators at two time points, while forecasts for the years between the starting one and the midpoint, t 1 and between t 1 and the final time T, are obtained relying on standard interpolation techniques. In principle, our method can be generalized to the case of more than two indicators at more than two time points. The main limitation is on the side of the inputs of the forecasting procedure for the indicators. The indirect elicitation of the correlations requires, as seen in Sect. 2.2, questions on conditional forecasts that in the case of more than two time points and more than two indicators can be extremely cumbersome. More work should then be devoted to the selection of suitable interpolation techniques and experts could be involved in this choice as well, by asking them to express their opinion on the expected trends between the considered time points.

As a general consideration, the performance of the forecasting procedure relies on the number of experts and their commitment. The application of the method discussed in the previous section was based on the results of the first round of the questionnaire, when at most 16 experts contributed. A new round of the questionnaire is currently running, the results of which are not yet available. However, almost 100 experts have contributed, and we expect a better performance of the method here suggested.