Skip to main content

Advertisement

Log in

Deletion diagnostics for marginal mean and correlation model parameters in estimating equations

  • Published:
Statistics and Computing Aims and scope Submit manuscript

Abstract

Regression diagnostics are introduced for parameters in marginal association models for clustered binary outcomes in an implementation of generalized estimating equations. Estimating equations for intracluster correlations facilitate computational formulae for one-step deletion diagnostics in an extension of earlier work on diagnostics for parameters in the marginal mean model. The proposed diagnostics measure the influence of an observation or a cluster of observations on the estimated regression parameters and on the overall fit of the model. The diagnostics are applied to data from four research studies from public health and medicine.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Banerjee, M., Frees, E.W.: Influence diagnostics for linear longitudinal models. J. Am. Stat. Assoc. 92, 999–1005 (1997)

    Article  MATH  Google Scholar 

  • Belsley, D.A., Kuh, E., Welsch, R.E.: Regression Diagnostics. Wiley, New York (1980)

    MATH  Google Scholar 

  • Carey, V., Zeger, S.L., Diggle, P.: Modeling multivariate binary data with alternating logistic regressions. Biometrika 80, 517–526 (1993)

    Article  MATH  Google Scholar 

  • Christensen, R., Pearson, L.M., Johnson, W.: Case-deletion diagnostics for mixed models. Technometrics 34, 38–45 (1992)

    Article  MATH  Google Scholar 

  • Cook, R.D., Weisberg, S.: Residuals and Influence in Regression. Chapman and Hall, London (1982)

    MATH  Google Scholar 

  • Diggle, P.J., Heagerty, P., Liang, K.-Y., Zeger, S.L.: Analysis of Longitudinal Data, 2nd edn. Oxford University Press, Oxford (2002)

    Google Scholar 

  • Donner, A., Klar, N.: Design and Analysis of Cluster Randomization Trials in Health Research. Arnold, London (2000)

    Google Scholar 

  • Fay, M.P.: Measuring a binary response’s range of influence in logistic regression. Am. Stat. 56, 5–9 (2002)

    Article  Google Scholar 

  • Hammill, B.G., Preisser, J.S.: A SAS/IML program for GEE and regression diagnostics. Comput. Stat. Data Anal. 51(2), 1197–1212 (2006)

    Article  Google Scholar 

  • Haslett, J.: A simple derivation of deletion diagnostic results for the general linear model with correlated errors. J. Roy. Stat. Soc. B 61, 603–609 (1999)

    Article  MATH  Google Scholar 

  • Haslett, J., Dillane, D.: Application of “delete = replace” to deletion diagnostics for variance component estimation in linear mixed model. J. Roy. Stat. Soc. B 66, 131–143 (2004)

    Article  MATH  Google Scholar 

  • Kuk, A.Y.C., Nott, D.J.: A pairwise likelihood approach to analyzing correlated binary data. Stat. Probab. Lett. 47, 329–335 (2000)

    Article  MATH  Google Scholar 

  • le Cessie, S., van Houwelingen, J.C.: Logistic regression for correlated binary data. Appl. Stat. 43, 95–108 (1994)

    Article  MATH  Google Scholar 

  • Li, C., Wang, W.H.: Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection. Proc. Natl. Acad. Sci. 98, 31–36 (2001)

    Article  MATH  Google Scholar 

  • Liang, K.-Y., Zeger, S.L.: Longitudinal data analysis using generalized linear models. Biometrika 73, 13–22 (1986)

    Article  MATH  Google Scholar 

  • Liang, K.-Y., Zeger, S.L., Qaqish, B.F.: Multivariate regression analysis for categorical data. J. Roy. Stat. Soc. B 54, 3–40 (1992)

    MATH  Google Scholar 

  • Lipsitz, S.R., Fitzmaurice, G.M.: Estimating equations for measures of association between repeated binary responses. Biometrics 52, 903–912 (1996)

    Article  MATH  Google Scholar 

  • Lipsitz, S.R., Laird, N.M., Harrington, D.P.: Generalized estimating equations for correlated binary data: using the odds ratio as a measure of association. Biometrika 78, 153–160 (1991)

    Article  Google Scholar 

  • Mancl, L.A., DeRouen, T.A.: A covariance estimator for GEE with improved small-sample properties. Biometrics 57, 126–134 (2001)

    Article  Google Scholar 

  • Pregibon, D.: Logistic regression diagnostics. Ann. Stat. 9, 705–724 (1981)

    MATH  Google Scholar 

  • Preisser, J.S., Qaqish, B.F.: Deletion diagnostics for generalized estimating equations. Biometrika 83, 551–562 (1996)

    Article  MATH  Google Scholar 

  • Preisser, J.S., Qaqish, B.F.: Robust regression for clustered data with application to binary responses. Biometrics 55, 574–579 (1999)

    Article  MATH  Google Scholar 

  • Preisser, J.S., Arcury, T.A., Quandt, S.A.: Detecting patterns of occupational illness clustering with alternating logistic regressions applied to longitudinal data. Am. J. Epidemiol. 158, 495–501 (2003)

    Article  Google Scholar 

  • Preisser, J.S., Young, M.L., Zaccaro, D.J., Wolfson, M.: An integrated population-averaged approach to the design, analysis, and sample size determination of cluster-unit trials. Stat. Medicine 22, 1235–1254 (2003)

    Article  Google Scholar 

  • Prentice, R.L.: Correlated binary regression with covariates specific to each binary observation. Biometrics 44, 1033–1048 (1988)

    Article  MATH  Google Scholar 

  • Qu, A., Song, P.: Assessing robustness of generalised estimating equations and quadratic inference functions. Biometrika 91, 447–459 (2004)

    Article  MATH  Google Scholar 

  • Ridout, M.S.: Discussion of the paper by Liang, Zeger, and Qaqish, “Multivariate regression analysis for categorical data”. J. Roy. Stat. Soc. B 54, 35 (1992)

    Google Scholar 

  • Sharples, K., Breslow, N.: Regression analysis of correlated binary data: some small sample results for the estimating equation approach. J. Stat. Comput. Simul. 42, 1–20 (1992)

    Article  MATH  Google Scholar 

  • Williams, D.A.: Generalized linear model diagnostics using the deviance and single case deletions. Appl. Stat. 36, 181–191 (1987)

    Article  MATH  Google Scholar 

  • Xiang, L.M., Tse, S.K., Lee, A.H.: Influence diagnostics for generalized linear mixed models: application to clustered data. Comput. Stat. Data Anal. 40, 759–774 (2002)

    Article  MATH  Google Scholar 

  • Ziegler, A., Arminger, G.: Parameter estimation and regression diagnostics using generalized estimating equations. In: Faulbaum, F., Bandilla, W. (eds.) StatSoft 95—Advances in Statistical Software 5, pp. 229–237. Lucius & Lucius, Stuttgart (1996)

    Google Scholar 

  • Ziegler, A., Blettner, M., Kastner, C., Chang-Claude, J.: Identifying influential families using regression diagnostics for generalized estimating equations. Genet. Epidemiol. 15, 341–353 (1998)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to John S. Preisser.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Preisser, J.S., Perin, J. Deletion diagnostics for marginal mean and correlation model parameters in estimating equations. Stat Comput 17, 381–393 (2007). https://doi.org/10.1007/s11222-007-9031-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11222-007-9031-1

Keywords

Navigation