Skip to main content
Log in

Variable selection in high-dimensional double generalized linear models

  • Regular Article
  • Published:
Statistical Papers Aims and scope Submit manuscript

Abstract

In this paper we are concerned with the problems of variable selection and estimation in double generalized linear models in which both the mean and the dispersion are allowed to depend on explanatory variables. We propose a maximum penalized pseudo-likelihood method when the number of parameters diverges with the sample size. With appropriate selection of the tuning parameters, the consistency of the variable selection procedure and asymptotic properties of the resulting estimators are established. We also carry out simulation studies and a real data analysis to assess the finite sample performance of the proposed variable selection procedure, showing that the proposed variable selection method works satisfactorily.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Bergman B, Hynén A (1997) Dispersion effects from unreplicated designs in the 2k-p series. Technometrics 39: 191–198

    MATH  MathSciNet  Google Scholar 

  • Breiman L (1995) Better subset selection using nonnegative garrote. Technometrics 37: 373–384

    Article  MATH  MathSciNet  Google Scholar 

  • Cao CZ, Lin JG, Zhu LX (2010) Heteroscedasticity and/or autocorrelation diagnostics in nonlinear models with AR(1) and symmetrical errors. Stat Papers 51: 813–836

    Article  MATH  MathSciNet  Google Scholar 

  • Davidian M, Carroll RJ (1988) A note on extended quasi-likelihood. J R Stat Soc B 50: 74–82

    MathSciNet  Google Scholar 

  • Efron B, Hastie T, Johnstone I, Tibshirani R (2004) Least angle regression. Ann Stat 32: 407–451

    Article  MATH  MathSciNet  Google Scholar 

  • Engel J, Huele AF (1996) A generalized linear modeling approach to robust design. Technometrics 38: 365–373

    Article  MATH  Google Scholar 

  • Fan J, Li R (2001) Variable selection via nonconcave penalized likelihood and its oracle properties. J Am Stat Assoc 96: 1348–1360

    Article  MATH  MathSciNet  Google Scholar 

  • Fan J, Peng H (2004) Nonconcave penalized likelihood with diverging number of parameters. Ann Stat 32: 928–961

    Article  MATH  MathSciNet  Google Scholar 

  • Fu WJ (1998) Penalized regression: the bridge versus the LASSO. J Comput Graph Stat 7: 397–416

    Google Scholar 

  • Gijbels I, Prosdocimi I, Claeskens G (2010) Nonparametric estimation of mean and dispersion functions in extended generalized linear models test. doi:10.1007/s11749-010-0187-1

  • Huang J, Horowitz JL, Ma SG (2006) Asymptotic properties of bridge estimators in sparse high-dimensional regression models. Ann Stat 36: 587–613

    Article  MathSciNet  Google Scholar 

  • Kou CF, Pan JX (2009) Variable selection for joint mean and covariance models via penalized likelihood. http://www.manchester.ac.uk/mims/eprints,MIMSEPrint:2009.49

  • Li R, Liang H (2008) Variable selection in semiparametric regression modeling. Ann Stat 36: 261–286

    Article  MATH  MathSciNet  Google Scholar 

  • Lin L (2004) Generalized quasi-likelihood. Stat Papers 45: 529–544

    Article  MATH  Google Scholar 

  • McCullagh P, Nelder JA (1989) Generalized linear models. Chapman and Hall, London

  • Nair VN, Pregibon D (1986) A data analysis strategy for quality engineering experiments. AT&T Tech J 65: 73–84

    Article  Google Scholar 

  • Nair VN, Pregibon D (1988) Analyzing dispersion effects from replicated factorial experiments. Technometrics 30: 247–257

    Article  MATH  MathSciNet  Google Scholar 

  • Nelder JA (2000) Quasi-likelihood and pseudo-likelihood are not the samething. J Appl Stat 27: 1007–1011

    Article  Google Scholar 

  • Nelder JA, Pregibon D (1987) An extended quasi-likelihood function. Biometrika 74: 221–232

    Article  MATH  MathSciNet  Google Scholar 

  • Nelder JA, Wedderburn RWM (1972) Generalized linear models. J R Stat Soc A 135: 370–384

    Article  Google Scholar 

  • Rigby RA, Stasinopolous DM (1996) Mean and dispersion additive models. In: Hardle W, Schimek MG (eds) Statical theory and computational aspects of smoothing. Physica-Verlag, Berlin, pp 215–230

  • Smyth GK (1989) Generalized linear models with varying dispersion. J R Stat Soc B 51: 47–60

    MathSciNet  Google Scholar 

  • Tibshirani R (1996) Regression shrinkage and selection via the LASSO. J R Stat Soc B 58: 267–288

    MATH  MathSciNet  Google Scholar 

  • Van der Vaart AW (1998) Asymptotic statics. Cambridge University Press, Cambridge

  • Wang DR, Zhang ZZ (2009) Variable selection in joint generalized linear models. Chin J Appl Probab Stat 25: 245–256

    MATH  Google Scholar 

  • Wang H, Li R, Tsai C (2007) Tuning parameter selectors for the smoothly clipped absolute deviation method. Biometrika 94: 553–568

    Article  MATH  MathSciNet  Google Scholar 

  • Wang H, Li B, Leng C (2009) Shrinkage tuning parameter selection with a diverging number of parameters. J R Stat Soc B 71: 671–683

    Article  MATH  MathSciNet  Google Scholar 

  • Wedderburn RWM (1974) Quasi-likelihood function, generalized linear models and the Gauss-Newton method. Biometrika 61: 439–447

    MATH  MathSciNet  Google Scholar 

  • Zhang ZZ, Wang DR (2011) Simultaneous variable selection for heteroscedastic regression models. Sci China Math 54(3): 515–530

    Article  MATH  MathSciNet  Google Scholar 

  • Zhou HB, You JH, Zhou B (2010) Statical inference for fixed-effects partially linear regression models with errors in variables. Stat Papers 51: 629–650

    Article  MATH  MathSciNet  Google Scholar 

  • Zou H (2006) The adaptive lasso and its oracle properties. J Am Stat Assoc 101: 1418–1429

    Article  MATH  Google Scholar 

  • Zou H, Li R (2008) One-step sparse estimates in nonconcave penalized likelihood models. Ann Stat 36: 1509–1533

    Article  MATH  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhongzhan Zhang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Xu, D., Zhang, Z. & Wu, L. Variable selection in high-dimensional double generalized linear models. Stat Papers 55, 327–347 (2014). https://doi.org/10.1007/s00362-012-0481-y

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00362-012-0481-y

Keywords

Mathematics Subject Classification (2000)

Navigation