Abstract
Variance estimation is a fundamental problem in statistical modelling and plays an important role in the inferences after model selection and estimation. In this paper, we focus on several nonparametric and semiparametric models and propose a local averaging method for variance estimation based on the concept of partial consistency. The proposed method has the advantages of avoiding the estimation of the nonparametric function and reducing the computational cost and can be easily extended to more complex settings. Asymptotic normality is established for the proposed local averaging estimators. Numerical simulations and a real data analysis are presented to illustrate the finite sample performance of the proposed method.
Similar content being viewed by others
References
Albright SC, Winston WL, Zappe CJ (1999) Data analysis and decision making with Microsoft excel. Duxbury, Pacific Grove, CA
Box GEP, Ramirez J (1986) Studies in quality improvement: signal to noise ratios, performance criteria and statistical analysis: part II. Center for quality and productive improvement. Report 12, University of Wisconsin
Brown LD, Levine M (2007) Variance estimation in nonparametric regression via the differnce sequence method. Ann Stat 35:2219–2232
Brown LD, Levine M, Wang L (2016) A semiparametric multivariate partially linear model: a difference approach. J Stat Plan Inference 178:99–111
Butt WR (1984) Practical immunoassay: the state of the art. CRC Press
Cai TT, Levine M, Wang L (2009) Variance function estimation in multivariate nonparametric regression with fixed design. J Multivar Anal 100:126–136
Carroll RJ (1987) Truncated and censored samples from normal populations. J Am Stat Assoc 82(399):952–952
Carroll RJ, Ruppert D, (1988) Transformation and weighting in regression. CRC Press
Cui X, Lu Y, Peng H (2014) Estimation of partially linear regression model under partial consistency property. arXiv:1401.2163
Dette H, Munk A, Wagner T (1998) Estimating the variance in nonparametric regression—what is a reasonable choice? J R Stat Soc B 60:751–764
Fan J, Peng H (2004) Nonconcave penalized likelihood with a diverging number of parameters. Ann Stat 32:928–961
Fan J, Yao Q (1998) Efficient estimation of conditional variance functions in stochastic regression. Biometrika 85:645–660
Fan J, Zhang C, Zhang J (2001) Generalized likelihood ratio statistics and Wilks phenomenon. Ann Stat 29:153–193
Fan J, Peng H, Huang T (2005) Semilinear high-dimensional model for normalization of microarray data: a theoretical analysis and partial consistency. J Am Stat Assoc 100:781–796
Fan J, Guo S, Hao N (2012) Variance estimation using refitted cross-validation in ultrahigh dimensional regression. J R Stat Soc B 74:37–65
Gasser T, Sroka L, Jennen-Steinmetz C (1986) Residual variance and residual pattern in nonlinear regression. Biometrika 73:625–633
Hall P, Marron JS (1990) On variance estimation in nonparametric regression. Biometrika 77:415–419
Klipple K, Eubank RL (2007) Difference based variance estimators for partially linear models. Festschrift in Honor of Distinguished Professor Mir Masoom Ali On the Occasion of His Retirement, Muncie, IN, USA, pp 313–323
Müller U, Schick A, Wefelmeyer W (2003) Estimating the error variance in nonparametric regression by a covariate-matched U-statistic. Stat J Theor Appl Stat 37:179–188
Neyman J, Scott EL (1948) Consistent estimates based on partially consistent observations. Econometrica 16:1–32
Park CG, Kim I, Lee Y-S (2012) Error variance estimation via least squares for small sample nonparametric regression. J Stat Plan Inference 142:2369–2385
Rice J (1984) Bandwidth choice for nonparametric regression. Ann Stat 12:1215–1230
Tong T, Wang Y (2005) Estimating residual variance in nonparametric regression using least squares. Biometrika 92:821–830
van der Vaart AW (1998) Asymptotic statistics. Cambridge University Press, Cambridge
Wang L, Brown LD, Cai TT, Levine M (2008) Effect of mean on variance function estimation in nonparametric regression. Ann Stat 36:619–641
Wang L, Brown LD, Cai TT (2011) A difference based approach to the semiparametric partial linear model. Electron J Stat 5:619–641
Zhang W, Lee SY (2000) Variable bandwidth selection in varying-coefficient models. J Multivar Anal 74:116–134
Acknowledgements
The authors would like to thank the Editor, associate editor, and anonymous referees for their helpful comments and suggestions that have helped us to substantially improve the quality of the paper.
Author information
Authors and Affiliations
Corresponding author
Additional information
Tao Huang’s research was supported in part by the State Key Program in the Major Research Plan of NSFC (No. 91546202), NSFC (No. 11771268) and Program for Innovative Research Team of SHUFE. Heng Peng’s research was supported part by CEGR Grant of the Research Grant Council of Hong Kong (Nos. HKBU202012 and HKBU12302615), FRG grant from the Hong Kong Baptist University (Nos. FRG214-15/064 and FRG2/16-17/042).
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Zhao, J., Peng, H. & Huang, T. Variance estimation for semiparametric regression models by local averaging. TEST 27, 453–476 (2018). https://doi.org/10.1007/s11749-017-0553-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11749-017-0553-3