Abstract
The generalized linear model is a very important tool for analyzing real data in several application domains where the relationship between the response and explanatory variables may not be linear or the distributions may not be normal in all the cases. Quite often such real data contain a significant number of outliers in relation to the standard parametric model used in the analysis; in such cases inference based on the maximum likelihood estimator could be unreliable. In this paper, we develop a robust estimation procedure for the generalized linear models that can generate robust estimators with little loss in efficiency. We will also explore two particular special cases in detail—Poisson regression for count data and logistic regression for binary data. We will also illustrate the performance of the proposed estimators through some real-life examples.
Similar content being viewed by others
References
Aeberhard WH, Cantoni E, Heritier S (2014) Robust inference in the negative binomial regression model with an application to falls data. Biometrics 70(4):920–931. doi:10.1111/biom.12212
Basu A, Harris IR, Hjort NL, Jones MC (1998) Robust and efficient estimation by minimizing a density power divergence. Biometrika 85(3):549–559. doi:10.1093/biomet/85.3.549
Bianco AM, Boente G, Rodrigues IM (2013) Resistant estimators in Poisson and Gamma models with missing responses and an application to outlier detection. J Multivar Anal 114:209–226. doi:10.1016/j.jmva.2012.08.008
Bianco AM, Yohai VJ (1996) Robust estimation in the logistic regression model. In: Rieder H (ed) Robust statistics, data analysis, and computer intensive methods. Springer, New York, pp 17–34
Cantoni E (2004) A robust approach to longitudinal data analysis. Can J Stat 32(2):169–180. doi:10.2307/3315940
Cantoni E, Ronchetti E (2001) Robust inference for generalised linear models. J Am Stat Assoc 96(455):1022–1030. doi:10.1198/016214501753209004
Croux C, Haesbroeck G (2003) Implementing the Bianco and Yohai estimator for logistic regression. Comput Stat Data Anal 44(1–2):273–295. doi:10.1016/S0167-9473(03)00042-2
Ghosh A, Basu A (2013) Robust estimation for independent non-homogeneous observations using density power divergence with applications to linear regression. Electron J Stat 7:2420–2456. doi:10.1214/13-EJS847
Ghosh A, Basu A (2015) Robust estimation for non-homogeneous data and the selection of the optimal tuning parameter: the density power divergence approach. J Appl Stat. doi:10.1080/02664763.2015.1016901
Hampel FR, Ronchetti EM, Rousseeuw PJ, Stahel WA (1986) Robust statistics: the approach based on influence functions. Wiley, New York
Heritier S, Cantoni E, Copt S, Victoria-Feser MP (2009) Robust methods in biostatistics. Wiley, New York
Hong C, Kim Y (2001) Automatic selection of the tuning parameter in the minimum density power divergence estimation. J Korean Stat Soc 30:453–465
Hosseinian S (2009) Robust inference for generalized linear models: binary and poisson regression. Thesis, Ecole Polytechnique Federal de Lausanne
Huber PJ (1964) Robust estimation of a location parameter. Ann Math Stat 35(1):73–101. doi:10.1214/aoms/1177703732
Huber PJ (1983) Minimax aspects of bounded-influence regression (with discussion). J Am Stat Assoc 78:66–80. doi:10.1080/01621459.1983.10477928
Leppik IE et al (1985) A double-blind crossover evaluation of progabide in partial seizures. Neurology 35:285
McCullagh P, Nelder JA (1989) Generalized linear models, 2nd edn. Chapman & Hall, London
Morgenthaler S (1992) Least-absolute-deviations fits for generalized linear models. Biometrika 79(4):747–754. doi:10.1093/biomet/79.4.747
Phelps K (1982) Use of the complementary log-log function to describe dose–response relationships in insecticide evaluation field trials. In: Gilchrist R (ed) Lecture notes in statistics, vol. 14. GLIM.82: Proceedings of the international conference on generalized linear models. Springer, New York
Preisser JS, Qaqish BF (1999) Robust regression for clustered data with applications to binary regression. Biometrics 55:574–579. doi:10.1111/j.0006-341X.1999.00574.x
Thall PF, Vail SC (1990) Some covariance models for longitudinal count data with overdispersion. Biometrics 46(3):657–671
Warwick J, Jones MC (2005) Choosing a robustness tuning parameter. J Stat Comput Simul 75:581–588. doi:10.1080/00949650412331299120
Williams DA (1987) Generalised linear model diagnostics using the deviance and single case deletions. Appl Stat 36:181–191
Acknowledgments
The authors thank the editor, the associate editor and three anonymous referees for several useful suggestions that led to an improved version of the manuscript.
Author information
Authors and Affiliations
Corresponding author
Additional information
This is part of the Ph.D. research work of A. Ghosh at the Indian Statistical Institute.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Ghosh, A., Basu, A. Robust estimation in generalized linear models: the density power divergence approach. TEST 25, 269–290 (2016). https://doi.org/10.1007/s11749-015-0445-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11749-015-0445-3