Abstract
Organisations that develop demographic projections usually propose several variants with different demographic assumptions. Existing criteria for selecting a preferred projection are mostly based on retrospective comparisons with observations, and a prospective approach is needed. In this work, we use the mean–variance scaling (spatial variance function) of human population densities to select among alternative demographic projections. We test against observed and projected Norwegian county population density using two spatial variance functions, Taylor’s law (TL) and its quadratic generalisation, and compare each function’s parameters between the historical data and six demographic projections, at two different time scales (long term: 1978–2010 vs. 2011–2040; and short term: 2006–2010 vs. 2011–2015). We find that short-term projections selected by TL agree more accurately than the other projections with the recent county density data and reflect the current high rate of international migration to and from Norway. The variance function method implemented here provides an empirical test of an ex ante approach to evaluating short-term human population projections.
Similar content being viewed by others
Notes
The geographical boundaries of Hordaland and Rogaland changed on January 1, 2002, with a transfer of one municipality, which led to a 0.5% increase and a 1.2% decrease in the respective population density (Cohen et al. 2013).
2006–2010 data were selected so that historical period and projection period were of equal length. Choosing historical time series as long as the projections is a common practice in forecast accuracy evaluation (Smith et al. 2013).
References
Alho, J., & Spencer, B. D. (1997). The practical specification of the expected error of population forecasts. Journal of Official Statistics, 13, 203–225.
Alkema, L., Raftery, A. E., Gerland, P., Clark, S. J., Pelletier, F., Buettner, T., et al. (2011). Probabilistic projections of the total fertility rate for all countries. Demography, 48, 815–839.
Anderson, R. M., Gordon, D. M., Crawley, M. J., & Hassell, M. P. (1982). Variability in the abundance of animal and plant species. Nature, 296, 245–248.
Anderson, R. M., & May, R. M. (1988). Epidemiological parameters of HIV transmission. Nature, 333, 514–519.
Bailey, M. A. (2016). Real econometrics: The right tools to answer important questions. New York, NY: Oxford University Press.
Bongaarts, J., & Bulatao, R. A. (2000). Beyond six billion: Forecasting the world’s population. Washington, D.C.: Panel on Population Projections, Committee on Population, Commission on Behavioral and Social Sciences and Education, National Research Council.
Box, G. E. P., & Cox, D. R. (1964). An analysis of transformations. Journal of the Royal Statistical Society: Series B, 26(2), 211–252.
Breusch, T. S., & Pagan, A. R. (1979). A simple test for heteroscedasticity and random coefficient variation. Econometrica, 47, 1287–1294.
Brunborg, H., & Cappelen, Å. (2010). Forecasting migration flows to and from Norway using an econometric model. In Work session on demographic projections, Eurostat methodological working papers, pp. 321–344. http://ec.europa.eu/eurostat/documents/3888793/5848129/KS-RA-10-009-EN.PDF/fbaff784-8e78-417b-8b51-7f8fdf2f3031.
Brunborg, H., & Texmon, I. (2011). Befolkningsframskrivning 2011–2100: Modell og forutsetninger. Økonomiske Analyser, 4, 33–45.
Cappelen, Å., Ouren, J., & Skjerpen, T. (2011). Effects of immigration policies on immigration to Norway 1969–2010. Report. Statistics Norway. http://www.udi.no/en/statistics-and-analysis/research-and-development-reports/effects-of-immigration-policies-on-immigration-to-norway-1969-2010-2011/. Accessed 8 August 2015.
Cochrane, D., & Orcutt, G. H. (1949). Application of least squares regression to relationships containing auto-correlated error terms. Journal of the American Statistical Association, 44(245), 32–61. doi:10.1080/01621459.1949.10483290.
Cohen, J. E. (1986). Population forecasts and confidence intervals for Sweden: A comparison of model-based and empirical approaches. Demography, 23(1), 105–126. (erratum 25(2), 315, 1988).
Cohen, J. E., Roig, M., Reuman, D. C., & GoGwilt, C. (2008). International migration beyond gravity: A statistical model for use in population projections. Proceedings of the National Academy of Sciences USA, 105(40), 15269–15274.
Cohen, J. E., Xu, M., & Brunborg, H. (2013). Taylor’s law applies to spatial variation in a human population. Genus, 69(1), 25–60.
Coleman, C. D., & Swanson, D. A. (2007). On MAPE-R as a measure of cross-sectional estimation and forecast accuracy. Journal of Economic and Social Measurement, 32(4), 219–233.
D’Agostino, R. B. (1970). Transformation to normality of the null distribution of G1. Biometrika, 57(3), 679–681.
Dietz, T., Rosa, E. A., & York, R. (2007). Driving the human ecological footprint. Frontiers in Ecology and the Environment, 5, 13–18.
Eisler, Z., Bartos, I., & Kertész, J. (2008). Fluctuation scaling in complex systems: Taylor’s law and beyond. Advances in Physics, 57(1), 89–142.
Fox, J. (2008). Applied regression analysis and generalised linear models (2nd ed.). New York: Sage.
Fox, J., & Weisberg, S. (2011). An R companion to applied regression (2nd ed.). Thousand Oaks, CA: Sage.
Gerland, P., Raftery, A. E., Ševcíková, H., Li, N., Gu, D., Spoorenberg, T., et al. (2014). World population stabilization unlikely this century. Science, 346(6206), 234–237.
Goel, S., Hofman, J. M., Lahaie, S., Pennock, D. M., & Watts, D. J. (2010). Predicting consumer behavior with Web search. Proceedings of the National Academy of Sciences USA, 107(41), 17486–17490.
Greig, A., Dewhurst, J., & Horner, M. (2014). An application of Taylor’s power law to measure overdispersion of the unemployed in English labor markets. Geographical Analysis, 47(2), 121–133.
Hanley, Q. S., Khatun, S., Yosef, A., & Dyer, R.-M. (2014). Fluctuation scaling, Taylor’s law, and crime. PLoS ONE, 9(10), e109004. doi:10.1371/journal.pone.0109004.
Heyde, C. C., & Cohen, J. E. (1985). Confidence intervals for demographic projections based on products of random matrices. Theoretical Population Biology, 27(2), 120–153.
Horst, C., Carling, J., & Ezzati, R. (2010). Immigration to Norway from Bangladesh, Brazil, Egypt, India, Morocco and Ukraine. In PRIO Paper. Oslo: Peace Research Institute Oslo.
Keeling, M., & Grenfell, B. (1999). Stochastic dynamics and a power law for measles variability. Philosophical Transactions of the Royal Society B, 354(1384), 769–776.
Komsta, L., & Novomestky, F. (2015). Moments: Moments, cumulants, skewness, kurtosis and related tests. R package version 0.14. http://CRAN.R-project.org/package=moments
Lee, R. D., & Carter, L. R. (1992). Modelling and forecasting US mortality. Journal of the American Statistical Association, 87(419), 659–671.
Lee, R. D., & Tuljapurkar, S. (1994). Stochastic population forecasts for the United States: Beyond high, medium, and low. Journal of the American Statistical Association, 89(428), 1175–1189.
Lutz, W., & Goujon, A. (2004). Literate life expectancy: charting the progress in human development. In W. Lutz & W. Sanderson (Eds.), The end of world population growth in the 21st century: new challenges for human capital formation and sustainable development (pp. 159–186). London: Earthscan.
Lutz, W., Butz, W. P., & KC, S. (2014). World population and human capital in the twenty first century. New York: Oxford University Press.
Lutz, W., Sanderson, W., & Scherbov, S. (1997). Doubling of world population unlikely. Nature, 387, 803–805.
Lutz, W., Sanderson, W., & Scherbov, S. (2001). The end of world population growth. Nature, 412, 543–545.
Prais, S. J., & Winsten, C. B. (1954). Trend estimators and serial correlation. Cowles Commission Discussion Paper No. 383. Chicago, IL.
Raftery, A. E., Chunn, J. L., Gerland, P., & Ševcíková, H. (2013). Bayesian probabilistic projections of life expectancy for all countries. Demography, 50, 777–801.
Rayer, S. (2007). Population forecast accuracy: Does the choice of summary measure of error matter? Population Research and Policy Review, 26(2), 163–184.
R Core Team. (2015). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/
Smith, S. K., Tayman, J., & Swanson, D. A. (2013). A practitioner’s guide to state and local population projections. New York: Springer.
Statistics Norway. (2011). Population projections, 2011–2100. Report. http://www.ssb.no/en/befolkning/statistikker/folkfram/aar/2011-06-16. Accessed 7 April 2015.
Statistics Norway. (2015). Immigrants and Norwegian-born to immigrant parents. Report. http://www.ssb.no/en/befolkning/statistikker/innvbef. Accessed 7 April 2015.
StatBank Norway. (2015). Table: 05196: Population, by sex, age and citizenship. Data. https://www.ssb.no/statistikkbanken/selecttable/hovedtabellHjem.asp?KortNavnWeb=folkemengde&CMSSubjectArea=befolkning&PLanguage=1&checked=true. Accessed 7 April 2015.
Stoto, M. A. (1983). The accuracy of population projections. Journal of the American Statistical Association, 78(381), 13–20.
Swanson, D. A., Tayman, J., & Barr, C. F. (2000). A note on the measurement of accuracy for subnational demographic estimates. Demography, 37(2), 193–201.
Swanson, D. A., Tayman, J., & Bryan, T. M. (2011). MAPE-R: A rescaled measure of accuracy for cross-sectional subnational population forecasts. Journal of Population Research, 28(2), 225–243.
Taylor, L. R. (1961). Aggregation, variance and the mean. Nature, 189, 732–735.
Taylor, L. R., Perry, J. N., Woiwod, I. P., & Taylor, R. A. J. (1988). Specificity of the spatial power-law exponent in ecology and agriculture. Nature, 332, 721–722.
Taylor, L. R., & Taylor, R. A. J. (1977). Aggregation, migration and population mechanics. Nature, 265, 415–421.
Taylor, L. R., Woiwod, I. P., & Perry, J. N. (1978). The density-dependence of spatial behaviour and the rarity of randomness. Journal of Animal Ecology, 47(2), 383–406.
The World Bank. (2014). Population estimates and Projections. Report. http://data.worldbank.org/data-catalog/population-projection-tables. Accessed 7 April 2015.
United Nations. (2014). World population prospects: The 2012 revision. Report. Department of Economic and Social Affairs, Population Division. http://esa.un.org/wpp/. Accessed 7 April 2015.
US Census Bureau. (2014). 2014 National population projections. Report. http://www.census.gov/population/projections/data/national/2014.html. Accessed 7 April 2015.
Wickham, H. (2009). ggplot2: Elegant graphics for data analysis. New York: Springer.
Acknowledgements
Two reviewers and Associate Editor Rebecca Kippen provided helpful comments. This work was partially supported by National Science Foundation Grants EF-1038337 and DMS-1225529 to the Rockefeller University. JEC thanks Priscilla K. Rogerson for help.
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Below is the link to the electronic supplementary material.
Appendix
Appendix
Overview of MAPE-R
Rescaled mean absolute percentage error (MAPE-R) is a modified post hoc measure of population forecast accuracy based on mean absolute percentage error (MAPE) developed by Swanson et al. (2000). Since absolute percentage errors (APEs) between observed and projected population sizes are often right-skewed, the central tendency measure, MAPE, is heavily influenced by outliers and tends to overestimate the true error (Rayer 2007). To describe the percentage errors more robustly, taking account of their large outliers, Swanson et al. (2000) first calculated a modified Box-Cox transformation of APEs. The parameter of the modified Box-Cox transformation (Box and Cox 1964) was selected to maximize the likelihood that the transformed APEs were symmetrical, so that the mean became appropriate. Symmetry of APE was evaluated using D’Agostino’s test of skewness (D’Agostino 1970), where a rejection of the null hypothesis of zero skewness indicated that APE was not symmetrically distributed and the Box-Cox transformation was necessary. Then Swanson et al. (2000) calculated the inverse of the mean transformed APEs (MAPE-R) to return them to the original data scale.
Details of the theory and empirical test of MAPE-R can be found in Coleman and Swanson (2007) and Swanson et al. (2000, 2011).
Implementation of MAPE-R
We followed the procedure outlined in Swanson et al. (2011) to implement MAPE-R. To measure the accuracy of Norwegian county population projections between 2011 and 2015 for each of the six demographic projections and each year from 2011 to 2015, we first examined the distribution of APE between the projected and observed county population density and tested its skewness using the right-sided D’Agostino’s test of skewness with significance level 0.1 (Table S1). Here a 0.1 significance level was used instead of a lower value to avoid ‘‘a greater cost in terms of a downwardly biased measure of accuracy in not transforming a potentially skewed distribution’’ (Swanson et al. 2011). If the hypothesis that the skewness of APEs was zero was rejected (P < 0.1), then for each projection from each year, we performed the Box-Cox transformation for APE of each county, estimated the transformation parameter (λ from −2 to 2) that maximized the likelihood function for symmetry, averaged the transformed APEs over counties, and calculated MAPE-R using the inverse of Box-Cox transformation (Box and Cox 1964). Finally, the five MAPE-Rs (one from each year from 2011 to 2015) of each projection were compared among the six projections using the one-way ANOVA for the equality of their inter-annual average (see Materials and methods). We used MAPE for comparison if the hypothesis that the APEs had zero skewness was not rejected. Skewness and the corresponding P of APE and transformed APE (APE-T) were listed in Table S1.
D’Agostino’s test of skewness was performed using the function ‘‘agostino.test’’ from R package ‘‘moments’’ (Komsta and Novomestky 2015). Parameter λ at which the likelihood function achieved maxima was found using R function ‘‘optimize’’ (R Core Team 2015). R code for the calculation of MAPE-R was given below.
Rights and permissions
About this article
Cite this article
Xu, M., Brunborg, H. & Cohen, J.E. Evaluating multi-regional population projections with Taylor’s law of mean–variance scaling and its generalisation. J Pop Research 34, 79–99 (2017). https://doi.org/10.1007/s12546-016-9181-0
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12546-016-9181-0