Abstract
Additive varying coefficient models are a natural extension of multiple linear regression models, allowing the regression coefficients to be functions of other variables. Therefore these models are more flexible to model more complex dependencies in data structures. In this paper we consider the problem of selecting in an automatic way the significant variables among a large set of variables, when the interest is on a given response variable. In recent years several grouped regularization methods have been proposed and in this paper we present these under one unified framework in this varying coefficient model context. For each of the discussed grouped regularization methods we investigate the optimization problem to be solved, possible algorithms for doing so, and the variable and estimation consistency of the methods. We investigate the finite-sample performance of these methods, in a comparative study, and illustrate them on real data examples.
Similar content being viewed by others
References
Antoniadis A, Gijbels I, Verhasselt A (2012a) Variable selection in additive models using P-splines. Technometrics 54(4):425–438
Antoniadis A, Gijbels I, Verhasselt A (2012b) Variable selection in varying coefficient models using P-splines. J Comput Graph Stat 21(3):638–661
Avalos M, Grandvalet Y, Ambroise C (2003) Regularization methods for additive models. In: Advances in intelligent data analysis V. Lecture notes in computer science 2810, pp 509–520
Bach F (2008) Consistency of the group Lasso and multiple kernel learning. J Mach Learn Res 9:1179–1225
Bhatti M, Bracken P (2006) The calculation of integrals involving b-splines by means of recursion relations. Appl Math Comput 172:91–100
Bickel PJ, Ritov Y, Tsybakov A (2009) Simultaneous analysis of Lasso and Dantzig selector. Ann Stat A 37(4):1705–1732
Birgin EG, Martinez J, Raydan M (2000) Nonmonotone spectral projected gradient methods on convex sets. SIAM J Optim 10:1196–1211
Breheny P, Huang J (2009) Penalized methods for bi-level variable selection. Stat Interface 2:369–380
Breheny P, Huang J (2011) Coordinate descent algorithms for nonconvex penalized regression, with applications to biological feature selection. Ann Appl Stat 5:32–253
Brumback B, Rice J (1998) Smoothing spline models for the analysis of nested and crossed samples of curves (with discussion). J Am Stat Assoc 93:961–994
Chen R, Tsay RS (1993) Functional-coefficient autoregressive models. J Am Stat Assoc 88:298–308
de Boor C (1978) A pratical guide to splines. Springer, New York
Donoho D, Johnstone I (1995) Adapting to unknown smoothness via wavelet shrinkage. J Am Stat Assoc 90:1200–1224
Efron B, Hastie T, Johnstone I, Tibshirani R (2004) Least angle regression. Ann Stat 32:407–489
Fan J, Li R (2001) Variable selection via nonconcave penalized likelihood and its oracle properties. J Am Stat Assoc 96:1348–1360
Fan J, Peng H (2004) Nonconcave penalized likelihood with a diverging number of parameters. Ann Stat 32:928–961
Fan J, Zhang J-T (2000) Two-step estimation of functional linear models with applications to longitudinal data. J R Stat Soc Ser B 62:303–322
Fan J, Zhang C, Zhang J (2001) Generalized likelihood ratio statistics and wilks phenomenon. Ann Stat 29:153–193
Figueiredo MAT, Nowak R, Wright S (2007) Gradient projection for sparse reconstruction: application to compressed sensing and other inverse problems. IEEE J Select Topics Signal Process 1:586–597
Harrison D, Rubinfeld D (1978) Hedonic prices and the demand for clean air. J Environ Econ Manag 5:81–102
Hastie TJ, Tibshirani RJ (1993) Varying-coefficient models. J R Stat Soc Ser B 55:757–796
Hoover D, Rice J, Wu C, Yang L-P (1998) Nonparametric smoothing estimates of time-varying coefficient models with longitudinal data. Biometrika 85:809–822
Huang JZ, Wu CO, Zhou L (2002) Varying-coefficient models and basis function approximation for the analysis of repeated measurements. Biometrika 89:111–128
Huang J, Horowitz J, Ma S (2007) Asymptotic properties of bridge estimators in sparse high-dimensional regression models. Ann Stat 36:587–613
Huang J, Ma S, Xie H, Zhang C-H (2009) A group bridge approach for variable selection. Biometrika 96(2):339–355
Huang J, Breheny P, Ma S (2012) A selective review of group selection in high dimensional models. Stat Sci 27(4):481–499
Huang J, Zhang T (2010) The benefit of group sparsity. Ann Stat 38:1978–2004
Kaslow RA, Ostrow DG, Detels R, Phair JP, Polk BF, Rinaldo CR (1987) The multicenter aids cohort study: rationale, organization and selected characteristics of the participants. Am J Epidemiol 126:310–318
Kim Y, Choi H, Oh H (2008) Smoothly clipped absolute deviation on high dimensions. J Am Stat Assoc 103:1665–1673
Knight K, Fu W (2000) Asymptotics for Lasso-type estimators. Ann Stat 28:1356–1378
Li R, Liang H (2008) Variable selection in semiparametric regression modeling. Ann Stat 36:261–286
Lin B, Zhang H (2006) Component selection and smoothing in multivariate nonparametric regression. Ann Stat 32:2272–2297
Liu H, Zhang J (2008) On the \(\ell _1\)–\(\ell _q\) regularized regression. Technical report. Carnegie Mellon University, Pittsburgh
Meier L, Bühlman P (2007) Smoothing \(\ell _1\)-penalized estimators for high-dimensional time-course data. Electron J Stat 1:597–615
Meier L, van de Geer S, Bühlman P (2008) The group Lasso for logistic regression. J R Stat Soc Ser B 70:53–71
Nürnberger G (1989) Approximation by spline functions. Springer, New York
Qingguo T, Longsheng C (2012) Componentwise B-spline estimation for varying coefficient models with longitudinal data. Stat Pap 53(3):629–652
Ramsay J, Silverman B (1997) The analysis of functional data. Springer, Berlin
Rice J (2004) Functional and longitudinal data analysis: perspectives on smoothing. Stat Sin 14:631–647
van den Berg E, Schmidt M, Friedlander M, Murphy K (2008) Group sparsity via linear-time projection. Department of Computer Science, University of British Columbia, Vancouver
Wang H, Leng C (2007) Unified Lasso estimation with least squares approximation. J Am Stat Assoc 102:1039–1048
Wang H, Xia Y (2009) Shrinkage estimation of the varying coefficient model. J Am Stat Assoc 104:747–757
Wang L, Chen G, Li H (2007) Group scad regression analysis for microarray time course gene expression. Bioinformatics 23:1486–1494
Wang L, Li H, Huang J (2008) Variable selection in nonparametric varying-coefficient models for analysis of repeated measurements. J Am Stat Assoc 103:1556–1569
Wei X, Huang J, Li H (2011) Variable selection and estimation in high-dimensional varying-coefficient models. Stat Sin 21:1515–1540
Wu C, Yu K, Chiang C (2000) A two-step smoothing method for varying coefficient models with repeated measurements. Ann Inst Stat Math 52:519–543
Yuan M, Lin Y (2006) Model selection and estimation in regression with grouped variables. J R Stat Soc Ser B 68:49–67
Zhang C (2010) Nearly unbiased variable selection under minimax concave penalty. Ann Stat 38:894–942
Acknowledgments
The authors thank the editor and two reviewers for their detailed reading of the manuscript and their valuable comments and suggestions that led to a considerable improvement of the paper. Support from the IAP Research Network nr. P6/03 and P7/06 of the Federal Science Policy, Belgium, is acknowledged. The second author also gratefully acknowledges financial support by the projects GOA/07/04 and GOA/12/014 of the Research Fund KULeuven and the FWO-Project G.0328.08N of the Flemish Science Foundation.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Antoniadis, A., Gijbels, I. & Lambert-Lacroix, S. Penalized estimation in additive varying coefficient models using grouped regularization. Stat Papers 55, 727–750 (2014). https://doi.org/10.1007/s00362-013-0522-1
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00362-013-0522-1