Abstract
Two-phase sampling is a cost-effective method of data collection using outcomedependent sampling for the second-phase sample. In order to make efficient use of auxiliary information and to improve domain estimation, mass imputation can be used in two-phase sampling. Rao and Sitter (1995) introduce mass imputation for two-phase sampling and its variance estimation under simple random sampling in both phases. In this paper, we extend the Rao-Sitter method to general sampling design. The proposed method is further extended to mass imputation for categorical data. A limited simulation study is performed to examine the performance of the proposed methods.
Similar content being viewed by others
References
Breidt, F. J., McVey, A., & Fuller, W. A. (1996). Two-phase estimation by imputation. Journal of the Indian Society of Agricultural Statistics, 49, 79–90.
Chipperfield, J., Chessman, J., & Lim, R. (2012). Combining household surveys using mass imputation to estimate population totals. Australian & New Zealand Journal of Statistics, 54(2), 223–238.
Fay, R. (1991). A design-based perspective on missing data variance. In Proceedings of the 1991 annual research conference, us bureau of the census, Vol. 429 (p. 440).
Firth, D., & Bennett, K. (1998). Robust models in probability sampling. Journal of the Royal Statistical Society. Series B. Statistical Methodology, 60(1), 3–21.
Fuller, W. A. (1998). Replication variance estimation for two-phase samples. Statistica Sinica, 1153–1164.
Fuller, W. A. (2009). Sampling statistics. John Wiley & Sons.
Fuller, W. A., & Kim, J. K. (2005). Hot deck imputation for the response model. Survey Methodology, 31(2), 139.
Hidiroglou, M. (2001). Double sampling. Survey Methodology, 27(2), 143–154.
Kim, J. K. (2011). Parametric fractional imputation for missing data analysis. Biometrika, 98(1), 119–132.
Kim, J. K., & Haziza, D. (2014). Doubly robust inference with missing data in survey sampling. Statistica Sinica, 24(1), 375–394.
Kim, J. K., Navarro, A., & Fuller, W. A. (2006). Replication variance estimation for two-phase stratified sampling. Journal of the American Statistical Association, 101(473), 312–320.
Kim, J. K., & Rao, J. N. (2012). Combining data from two independent surveys: a model-assisted approach. Biometrika, 99(1), 85–100.
Legg, J. C., & Fuller, W. A. (2009). Two-phase sampling. Handbook of Statistics, 29, 55–70.
Moore, R., & Robbins, N. (2004). A study of mass imputation in small-area estimation. In Joint statistical meeting. Toronto, Canada.
Neyman, J. (1938). Contribution to the theory of sampling human populations. Journal of the American Statistical Association, 33(201), 101–116.
Rao, J. N., & Sitter, R. (1995). Variance estimation under two-phase sampling with application to imputation for missing data. Biometrika, 82(2), 453–460.
Thompson, M. E., & Wu, C. (2008). Simulation-based randomized systematic pps sampling under substitution of units. Survey Methodology, 34(1), 3.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Park, S., Kim, J.K. Mass imputation for two-phase sampling. J. Korean Stat. Soc. 48, 578–592 (2019). https://doi.org/10.1016/j.jkss.2019.03.002
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1016/j.jkss.2019.03.002