Abstract
In this paper, we present a variable selection procedure by using basis function approximations and a partial group SCAD penalty for semiparametric varying coefficient partially linear models with longitudinal data. With appropriate selection of the tuning parameters, we establish the oracle property of this procedure. A simulation study is undertaken to assess the finite sample performance of the proposed variable selection procedure.
Similar content being viewed by others
References
Fan J.Q., Huang T. (2005) Profile likelihood inference on semiparametric varying-coefficient partially linear models. Bernoulli 11: 1031–1057
Fan J.Q., Li R. (2001) Variable selection via nonconcave penalized likelihood and its oracle properties. Journal of the American Statistical Association 96: 1348–1360
Fan J.Q., Li R. (2004) New estimation and model selection procedures for semiparametric modeling in longitudinal data analysis. Journal of American Statistical Association 99: 710–723
Fan J.Q., Lv J. (2008) Sure independence screening for ultrahigh dimensional feature space. Journal of the Royal Statistical Society: Series B 70: 849–911
Fan J.Q., Huang T., Li R. (2007) Analysis of longitudinal data with semiparametric estimation of covariance function. Journal of the American Statistical Association 102: 632–641
Frank I.E., Friedman J.H. (1993) A statistical view of some chemometrics regression tools. Technometrics 35: 109–148
Li Q., Huang C.J., Li D., Fu T.T. (2002) Semiparametric smooth coefficient models. Journal of Business & Economic Statistics 20: 412–422
Li R., Liang H. (2008) Variable selection in semiparametric regression modeling. The Annals of Statistics 36: 261–286
Lin X. H., Carroll R. J. (2001) Semiparametric regression for clustered data using generalized estimating equations. Journal of the American Statistical Association 96: 1045–1056
Schumaker L.L. (1981) Spline functions. Wiley, New York
Sun Y., Wu H. (2005) Semiparametric time-varying coefficients regression model for longitudinal data. Scandinavian Journal of Statistics 32: 21–47
Tibshirani R. (1996) Regression shrinkage and selection via the LASSO. Journal of the Royal Statistical Society: Series B 58: 267–288
Wang L., Li H., Huang J.Z. (2008) Variable selection in nonparametric varying coefficient models for analysis of repeated measurements. Journal of the American Statistical Association 103: 1556–1569
Xie H., Huang J. (2009) SCAD-penalized regression in high-dimensional partially linear models. The Annals of Statistics 37: 673–696
Xue L.G., Zhu L.X. (2007a) Empirical likelihood semiparametric regression analysis for longitudinal data. Biometrika 94: 921–937
Xue L.G., Zhu L.X. (2007b) Empirical likelihood for a varying coefficient model with longitudinal data. Journal of the American Statistical Association 102: 642–654
You J.H., Zhou Y. (2006) Empirical likelihood for semiparametric varying-coefficient partially linear regression models. Statistics & Probability Letters 76: 412–422
Zhang W., Lee S.Y., Song X. (2002) Local polynomial fitting in semivarying coefficient models. Journal of Multivariate Analysis 82: 166–188
Zou H. (2006) The adaptive lasso and its oracle properties. Journal of the American Statistical Association 101: 1418–1429
Author information
Authors and Affiliations
Corresponding author
About this article
Cite this article
Zhao, P., Xue, L. Variable selection in semiparametric regression analysis for longitudinal data. Ann Inst Stat Math 64, 213–231 (2012). https://doi.org/10.1007/s10463-010-0312-7
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10463-010-0312-7