Abstract
Regression diagnostics are introduced for parameters in marginal association models for clustered binary outcomes in an implementation of generalized estimating equations. Estimating equations for intracluster correlations facilitate computational formulae for one-step deletion diagnostics in an extension of earlier work on diagnostics for parameters in the marginal mean model. The proposed diagnostics measure the influence of an observation or a cluster of observations on the estimated regression parameters and on the overall fit of the model. The diagnostics are applied to data from four research studies from public health and medicine.
Similar content being viewed by others
References
Banerjee, M., Frees, E.W.: Influence diagnostics for linear longitudinal models. J. Am. Stat. Assoc. 92, 999–1005 (1997)
Belsley, D.A., Kuh, E., Welsch, R.E.: Regression Diagnostics. Wiley, New York (1980)
Carey, V., Zeger, S.L., Diggle, P.: Modeling multivariate binary data with alternating logistic regressions. Biometrika 80, 517–526 (1993)
Christensen, R., Pearson, L.M., Johnson, W.: Case-deletion diagnostics for mixed models. Technometrics 34, 38–45 (1992)
Cook, R.D., Weisberg, S.: Residuals and Influence in Regression. Chapman and Hall, London (1982)
Diggle, P.J., Heagerty, P., Liang, K.-Y., Zeger, S.L.: Analysis of Longitudinal Data, 2nd edn. Oxford University Press, Oxford (2002)
Donner, A., Klar, N.: Design and Analysis of Cluster Randomization Trials in Health Research. Arnold, London (2000)
Fay, M.P.: Measuring a binary response’s range of influence in logistic regression. Am. Stat. 56, 5–9 (2002)
Hammill, B.G., Preisser, J.S.: A SAS/IML program for GEE and regression diagnostics. Comput. Stat. Data Anal. 51(2), 1197–1212 (2006)
Haslett, J.: A simple derivation of deletion diagnostic results for the general linear model with correlated errors. J. Roy. Stat. Soc. B 61, 603–609 (1999)
Haslett, J., Dillane, D.: Application of “delete = replace” to deletion diagnostics for variance component estimation in linear mixed model. J. Roy. Stat. Soc. B 66, 131–143 (2004)
Kuk, A.Y.C., Nott, D.J.: A pairwise likelihood approach to analyzing correlated binary data. Stat. Probab. Lett. 47, 329–335 (2000)
le Cessie, S., van Houwelingen, J.C.: Logistic regression for correlated binary data. Appl. Stat. 43, 95–108 (1994)
Li, C., Wang, W.H.: Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection. Proc. Natl. Acad. Sci. 98, 31–36 (2001)
Liang, K.-Y., Zeger, S.L.: Longitudinal data analysis using generalized linear models. Biometrika 73, 13–22 (1986)
Liang, K.-Y., Zeger, S.L., Qaqish, B.F.: Multivariate regression analysis for categorical data. J. Roy. Stat. Soc. B 54, 3–40 (1992)
Lipsitz, S.R., Fitzmaurice, G.M.: Estimating equations for measures of association between repeated binary responses. Biometrics 52, 903–912 (1996)
Lipsitz, S.R., Laird, N.M., Harrington, D.P.: Generalized estimating equations for correlated binary data: using the odds ratio as a measure of association. Biometrika 78, 153–160 (1991)
Mancl, L.A., DeRouen, T.A.: A covariance estimator for GEE with improved small-sample properties. Biometrics 57, 126–134 (2001)
Pregibon, D.: Logistic regression diagnostics. Ann. Stat. 9, 705–724 (1981)
Preisser, J.S., Qaqish, B.F.: Deletion diagnostics for generalized estimating equations. Biometrika 83, 551–562 (1996)
Preisser, J.S., Qaqish, B.F.: Robust regression for clustered data with application to binary responses. Biometrics 55, 574–579 (1999)
Preisser, J.S., Arcury, T.A., Quandt, S.A.: Detecting patterns of occupational illness clustering with alternating logistic regressions applied to longitudinal data. Am. J. Epidemiol. 158, 495–501 (2003)
Preisser, J.S., Young, M.L., Zaccaro, D.J., Wolfson, M.: An integrated population-averaged approach to the design, analysis, and sample size determination of cluster-unit trials. Stat. Medicine 22, 1235–1254 (2003)
Prentice, R.L.: Correlated binary regression with covariates specific to each binary observation. Biometrics 44, 1033–1048 (1988)
Qu, A., Song, P.: Assessing robustness of generalised estimating equations and quadratic inference functions. Biometrika 91, 447–459 (2004)
Ridout, M.S.: Discussion of the paper by Liang, Zeger, and Qaqish, “Multivariate regression analysis for categorical data”. J. Roy. Stat. Soc. B 54, 35 (1992)
Sharples, K., Breslow, N.: Regression analysis of correlated binary data: some small sample results for the estimating equation approach. J. Stat. Comput. Simul. 42, 1–20 (1992)
Williams, D.A.: Generalized linear model diagnostics using the deviance and single case deletions. Appl. Stat. 36, 181–191 (1987)
Xiang, L.M., Tse, S.K., Lee, A.H.: Influence diagnostics for generalized linear mixed models: application to clustered data. Comput. Stat. Data Anal. 40, 759–774 (2002)
Ziegler, A., Arminger, G.: Parameter estimation and regression diagnostics using generalized estimating equations. In: Faulbaum, F., Bandilla, W. (eds.) StatSoft 95—Advances in Statistical Software 5, pp. 229–237. Lucius & Lucius, Stuttgart (1996)
Ziegler, A., Blettner, M., Kastner, C., Chang-Claude, J.: Identifying influential families using regression diagnostics for generalized estimating equations. Genet. Epidemiol. 15, 341–353 (1998)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Preisser, J.S., Perin, J. Deletion diagnostics for marginal mean and correlation model parameters in estimating equations. Stat Comput 17, 381–393 (2007). https://doi.org/10.1007/s11222-007-9031-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11222-007-9031-1