Abstract
Statistical methods of dimension reduction and classification are used to obtain homogeneous local-area clustering with regard to the most relevant demographic parameters. The dimension reduction is conducted in two stages using Principal Component Analysis and a modified k-mean procedure is proposed to determine the final clusters. This clustering will be useful in future demographic studies at a local level, in particular to obtain forecasts of demographic rates and population projections. The region of Castile and León in Spain is used to illustrate the method. A Poisson model is used to explore the advantages of the new clustering over the more conventional classification based on provinces.
Similar content being viewed by others
References
Aggarwal, C.C. 2001. A human-computer cooperative system for effective high dimensional clustering. Pp. 201–206 inProceedings of the Seventh ACM SIGKDD. New York: ACM Press.
Chatfield, C. and A.J. Collins. 1980.Introduction to Multivariate Analysis. London: Chapman and Hall.
Cuesta-Albertos, J.A.A. Gordaliza and C. Matrán. 1997. Trimmed k-means. An attempt to robustify quantizers.Annals of Statistics 25: 553–576.
García-Escudero, L.A. and A. Gordaliza. 1999. Robustness properties of k-means and trimmed k-means.Journal of the American Statistical Association 97: 956–999.
Gordaliza, A. 1991. Best approximations to random variables based on trimming procedures.Journal of Approximation Theory 64: 162–180.
Harrell, F.E. 2001.Regression Modeling Strategies. New York: Springer-Verlag.
Hartigan, J. 1975.Clustering Algorithms. New York: John Wiley and Sons.
Jackson, J.E. 1991.A User's Guide to Principal Components. New York: John Wiley and Sons.
Lindsey, J.K. 1997.Applying Generalized Linear Models. London: Chapman and Hall.
Martin, D., A. Nolan and M. Tranmer. 2001. The application of zone-design methodology in the 2001 UK Census.Environment and Planning A. 33: 1949–1962.
Mayo Iscar, A. 2001. Estimación de parámetros en mezclas de poblaciones normales no homogéneas. Ph.D. Thesis, Universidad de Valladolid.
Ramirez, G. and J.M. Reguera. 1994.Modelo Funcional de la Territorialización de Servicios en Castilla y León. Valladolid: Junta de Castilla y León.
Rees, P. 1997. Problems and solutions in forecasting geographical populations.Journal of the Australian Population Association 14(2): 145–166.
Rogers, A. 1992. Heterogeneity and selection in multistate population analysis.Demography 29(1): 31–38.
Rogers, A. 1995.Multiregional Demography. Principles, Methods and Extensions. New York: Wiley.
Smith, S.K., J. Taiman and D.A. Swanson. 2001.State and Local Population Projections. New York: Kluwer Academic.
Vaupel, J.W. and A.I. Yashin. 1985. Heterogeneity's ruses: some surprising effects of selection on population dynamics.American Statistician 39: 176–185.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Sabater, C.R., Alvarez Esteban, P.C., Iscar, A.M. et al. Clustering to reduce regional heterogeneity: A spanish case-study. Journal of Population Research 21, 73–93 (2004). https://doi.org/10.1007/BF03032211
Issue Date:
DOI: https://doi.org/10.1007/BF03032211