Abstract
In this article we study the prediction problem in small geographic areas in the situation where the survey data does not cover a substantial percentage of these areas. In such situation, the application of the Spatial Fay–Herriot model may involve a difficult and subtle process of determining neighboring areas. Ambiguity in definition of neighbors can potentially produce a problem of sensitivity of the conclusions to these definitions. In this article, we attempt to remedy this problem by incorporating random effects for higher level administrative divisions into the model. In this setting, only the higher-level random effects are supposed to have spatial correlations. This may potentially reduce the problem of ambiguity in the definition of spatial neighbors, provided that all higher level administrative divisions are represented in the sample. We also show that predicting in non-sampled areas is considerably more straightforward under the proposed model, as opposed to the case where the Spatial Fay–Herriot model is applied. In addition, we propose two new predictors for out-of-sample areas, under the spatial Fay–Herriot model. In order to compare the performance of the aforementioned models, we use the data from the Demographic and Family Health Survey of the year 2021, and the National Census carried out in 2017.
Similar content being viewed by others
References
Anselin L (1992) Spatial econometrics, Methods and models. Kluwer, Boston
Banerjee S, Carlin B, Gelfand A (2004) Hierarchical modeling and analysis for spatial data. Chapman and Hall, New York
Burgard JP, Morales D, Wölwer AL (2022) Small area estimation of socioeconomic indicators for sampled and unsampled domains. AStA Adv Stat Anal 106(2):287–314. https://doi.org/10.1007/s10182-021-00426-4
Cressie N (1993) Statistics for spatial data. Wiley, New York
Cressie N, Chan NH (1989) Spatial modeling of regional variables. J Am Stat Assoc 84:393–401
Datta GS, Lahiri PS (2000) A unified measure of uncertainty of estimated best linear unbiased predictors in small area estimation problems. Stat Sin 10:613–627
Datta GS, Lahiri PS, Maiti T (2002) Empirical Bayes estimation of median income of four-person families by state using time series and cross-sectional data. J Stat Plan Inference 102(1):83–97. https://doi.org/10.1016/S0378-3758(01)00173-2
Datta GS, Rao JNK, Smith DD (2005) On measuring the variability of small area estimators under a basic area level model. Biometrika 92:183–196
Erciulescu AL, Franco C, Lahiri P (2021) Use of administrative records in small area estimation. In: Chun P, Larson M (eds) Administrative records for survey methodology. Wiley, New York, pp 231–267
Esteban MD, Morales D, Pérez A, Santamaría L (2012) Small area estimation of poverty proportions under area-level time models. Comput Stat Data Anal 56:2840–2855
Fay RE (1987) Application of multivariate regression to small domain estimation. In: Platek R, Rao JNK, Särndal CE, Singh MP (eds) Small area statistics. Wiley, New York, pp 91–102
Fay RE, Herriot RA (1979) Estimates of income for small places: an application of James-Stein procedures to census data. J Am Stat Assoc 74:269–277
Getis A, Aldstadt J (2004) Constructing the spatial weights matrix using a local statistic. Geogr Anal 36:90–104. https://doi.org/10.1111/j.1538-4632.2004.tb01127.x
Ghosh M, Nangia N, Kim DH (1996) Estimation of median income of four-person families: a Bayesian time series approach. J Am Stat Assoc 91:1423–1431. https://doi.org/10.1080/01621459.1996.10476710
Hall P, Maiti T (2006) On parametric bootstrap methods for small area prediction. J R Stat Soc B 68:221–238
Harville D, Jeske D (1992) Mean squared error of estimation or prediction under a general linear model. J Am Stat Assoc 87:724–731
Jiang J, Lahiri PS, Wan SM (2002) A unified Jackknife theory for empirical best prediction with M-estimation. Ann Stat 30:1782–1810
Kackar RN, Harville DA (1984) Approximations for standard errors of estimators for fixed and random effects in mixed models. J Am Stat Assoc 79:853–862
Marcis L, Morales D, Pagliarella MC, Salvatore R (2023) Three-fold Fay-Herriot model for small area estimation and its diagnostics. Stat Methods Appl. https://doi.org/10.1007/s10260-023-00700-6
Marhuenda Y, Molina I, Morales D (2013) Small area estimation with spatio temporal Fay-Herriot models. Comput Stat Data Anal 58:308–325
Molina I, Salvati N, Pratesi M (2009) Bootstrap for estimating the MSE of the spatial EBLUP. Comput Stat 24:441–458
Moran PAP (1950) Notes on continuous stochastic phenomena. Biometrika 37(1):17–23
Petrucci A, Salvati N (2006) Small area estimation for spatial correlation in watershed erosion assessment. J Agric Biol Environ Stat 11(2):169–182
Pfefermann D (2002) Small area estimation- new developments and directions. Int Stat Rev 70:125–143
Pfeffermann D, Tiller RB (2005) Bootstrap approximation to prediction MSE for state-space models with estimated parameters. J Time Ser Anal 26:893–916
Prasad NGN, Rao JNK (1990) New important developments in small area estimation. J Am Stat Assoc 85(409):163–171
Pratesi M, Salvati N (2008) Small area estimation: the EBLUP estimator based on spatially correlated random area effects. Stat Method Appl 17(1):113–141
Pratesi M, Salvati N (2009) Small area estimation in the presence of correlated random area effects. J Off Stat 25(1):37–53
Rao JNK, Yu M (1994) Small area estimation by combining time series and cross-sectional data. Can J Stat 22:511–528
Sikov A, Cerda-Hernandez J (2023) Estimating the prevalence of anemia rates among children under five in Peruvian districts with a small sample size. Stat Methods Appl. https://doi.org/10.1007/s10260-023-00698-x
Singh BB, Shukla K, Kundu D (2005) Spatial-temporal models in small area estimation. Surv Methodol 31(2):183–195
Torabi M, Rao JNK (2014) On small area estimation under a sub-area model. J Multivar Anal 127:36–55
Zadlo T (2015) On prediction for correlated domains in longitudinal surveys. Commun Stat—Theory Methods 44(4):683–697. https://doi.org/10.1080/03610926.2013.857867
Bivand RS, Pebesma E, Gomez-Rubio V (2013) Applied spatial data analysis with R, 2nd edn. Springer, New York. https://asdar-book.org/
Chen S, Lahiri P (2003) A comparison of different MSPE estimators of EBLUP for the Fay–Herriot model. In: Proceedings of the section on survey research methods. American Statistical Association, Washington, pp 903–911
Datta GS, Fay RE, and Ghosh M (1991) Hierarchical and empirical Bayes multivariate analysis in small area estimation. In: Proceedings of Bureau of the Census 1991 annual research conference, U.S. Bureau of the Census, Washington, DC, pp 63–79
INEI, Perú (2019) Encuesta Demográfica y de Salud Familiar-ENDES. https://proyectos.inei.gob.pe/microdatos/
Rao JNK, Molina I (2015) Small area estimation. Wiley series in survey methodology, 2nd edn. Wiley, Hoboken
Saei A, Chambers R (2003). Small Area Estimation Under Linear and Generalized Linear Mixed Models With Time and Area Effects. Southampton Statistical Sciences Research Institute. Working Paper M03/15. https://eprints.soton.ac.uk/8165/1/8165-01.pdf
Saei A, Chambers R (2005) Empirical Best Linear Unbiased Prediction for Out of Sample Areas. Southampton Statistical Sciences Research Institute. Methodology Working Paper M05/03. https://eprints.soton.ac.uk/14073/
Funding
This research is supported by a grant from the Unidad de Investigaciones de la FIEECS-UNI.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
We wish to confirm that there are no known Conflict of interest associated with this publication and there has been no significant financial support for this work that could have influenced its outcome.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Sikov, A., Cerda-Hernandez, J. Prediction in non-sampled areas under spatial small area models. Stat Methods Appl (2024). https://doi.org/10.1007/s10260-024-00754-0
Accepted:
Published:
DOI: https://doi.org/10.1007/s10260-024-00754-0