Abstract
Auxiliary information is often used to improve the precision of estimators of the finite population cumulative distribution function through the use of superpopulation models. A variety of approaches are available to construct such estimators, including design-based, model-based and model-assisted methods. The superpopulation modeling framework can be either parametric or nonparametric, and the estimators can be constructed as either linear or nonlinear functions of the observations. In this article, we argue that model-assisted estimators based on a nonparametric model are a good overall choice for distribution function estimators, because they have good efficiency properties and are robust against model misspecification. When such estimators are constructed as linear functions of the data, they are also easily incorporated into the existing survey estimation paradigm through the use of survey weights. Theoretical properties of nonparametric distribution function estimators based on local linear regression are derived, and their practical behavior is evaluated in a simulation study.
Similar content being viewed by others
References
Bowman, A.W., Azzalini, A., 2003. Computational aspects of nonparametric smoothing with illustrations from the sm library. Computational Statistics & Data Analysis, 42, 545–560.
Breidt, F.J., Opsomer, J.D., 2000. Local polynomial regression estimators in survey sampling. Ann. Statist., 28, 1026–1053.
Chambers, R.L., Dorfman, A.H., Hall, P., 1992. Properties of estimators of the finite population distribution function. Biometrika, 79, 577–82.
Chambers, R.L., Dorfman, A.H., Wehrly, T.E., 1993. Bias robust estimation in finite populations using nonparametric calibration. Journal of the American Statistical Association, 88, 268–277.
Chambers, R.L., Dunstan, R., 1986. Estimating distribution functions from survey data. Biometrika, 73, 597–604.
Dorfman, A.H., Hall, P., 1993. Estimators of the finite population distribution function using nonparametric regression. Ann. Statist., 21, 1452–1475.
Fan, J., 1992. Design-adaptive nonparametric regression. Journal of the American Statistical Association, 87, 998–1004.
Horvitz, D.G., Thompson, D.J., 1952. A generalization of sampling without replacement from a finite universe. Journal of the American Statistical Association, 47, 663–685.
Kuk, A.Y.C., 1988. Estimation of distribution functions and medians under sampling with unequal probabilities. Biometrika, 75, 97–103.
Kuo, L., 1988. Classical and prediction approaches to estimating distribution functions from survey data. Proceedings of the Section on Survey Research methods, American Statistical Association, 280–285.
Rao, J.N.K., Kovar, J.G., Mantel, H.J., 1990. On estimating distribution functions and quantiles from survey data using auxiliary information. Biometrika, 77, 365–75.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Johnson, A.A., Breidt, F.J. & Opsomer, J.D. Estimating Distribution Functions from Survey Data Using Nonparametric Regression. J Stat Theory Pract 2, 419–431 (2008). https://doi.org/10.1080/15598608.2008.10411884
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1080/15598608.2008.10411884