Abstract
This chapter introduces some of the necessary tools for detecting violations of the assumptions in a glm, and then discusses possible solutions. The assumptions of the glm are first reviewed (Sect. 8.2), then the three basic types of residuals (Pearson, deviance and quantile) are defined (Sect. 8.3). The leverages are then given in the glm context (Sect. 8.4) leading to the development of standardized residuals (Sect. 8.5). The various diagnostic tools for checking the model assumptions are introduced (Sect. 8.7) followed by techniques for identifying unusual and influential observations (Sect. 8.8). Comments about using each type of residual and the nomenclature of residuals are given in Sect. 8.6. We then discuss techniques to remedy or ameliorate any weaknesses in the models (Sect. 8.9), including the introduction of quasi-likelihood (Sect. 8.10). Finally, collinearity is discussed (Sect. 8.11).
Since all models are wrong the scientist must be alert to what is importantly wrong. It is inappropriate to be concerned about mice when there are tigers abroad.
Box [ 1 , p. 792]
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Box, G.E.P.: Science and statistics. Journal of the American Statistical Association 71, 791–799 (1976)
Breslow, N.E., Clayton, D.G.: Approximate inference in generalized linear mixed models. Journal of the American Statistical Association 88(421), 9–25 (1993)
Data Desk: Data and story library (dasl) (2017). URL http://dasl.datadesk.com
Diggle, P.J., Tawn, J.A., Moyeed, R.A.: Model-based geostatistics. Applied Statistics 47(3), 299–350 (1998)
Dunn, P.K., Smyth, G.K.: Randomized quantile residuals. Journal of Computational and Graphical Statistics 5(3), 236–244 (1996)
Gotway, C.A., Stroup, W.W.: A generalized linear model approach to spatial data analysis and prediction. Journal of Agricultural, Biological and Environmental Statistics 2(2), 157–178 (1997)
Hardin, J.W., Hilbe, J.M.: Generalized Estimating Equations. Chapman and Hall/CRC, Boca Raton (2012)
Johnson, B., Courtney, D.M.: Tower building. Child Development 2(2), 161–162 (1931)
Maron, M.: Threshold effect of eucalypt density on an aggressive avian competitor. Biological Conservation 136, 100–107 (2007)
McCullagh, P., Nelder, J.A.: Generalized Linear Models, second edn. Chapman and Hall, London (1989)
McCulloch, C.E.: Generalized linear mixed models. Institute of Mathematical Statistics (2003)
Pierce, D.A., Shafer, D.W.: Residuals in generalized linear models. Journal of the American Statistical Association 81, 977–986 (1986)
Pregibon, D.: Goodness of link tests for generalized linear models. Applied Statistics 29(1), 15–24 (1980)
Singer, J.D., Willett, J.B.: Improving the teaching of applied statistics: Putting the data back into data analysis. The American Statistician 44(3), 223–230 (1990)
Smyth, G.K.: Australasian data and story library (Ozdasl) (2011). URL http://www.statsci.org/data
Solow, A.R., Smith, W.: Detecting in a heterogenous community sampled by quadrats. Biometrics 47(1), 311–317 (1991)
Weisberg, S.: Applied Linear Regression. John Wiley and Sons, New York (1985)
Williams, D.A.: Generalized linear models diagnostics using the deviance and single-case deletions. Applied Statistics 36(2), 181–191 (1987)
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Science+Business Media, LLC, part of Springer Nature
About this chapter
Cite this chapter
Dunn, P.K., Smyth, G.K. (2018). Chapter 8: Generalized Linear Models: Diagnostics. In: Generalized Linear Models With Examples in R. Springer Texts in Statistics. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-0118-7_8
Download citation
DOI: https://doi.org/10.1007/978-1-4419-0118-7_8
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-0117-0
Online ISBN: 978-1-4419-0118-7
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)