Skip to main content
Log in

Fractional imputation for incomplete two-way contingency tables

  • Published:
Metrika Aims and scope Submit manuscript

Abstract

Imputation procedures such as fully efficient fractional imputation (FEFI) or multiple imputation (MI) create multiple versions of the missing observations, thereby reflecting uncertainty about their true values. Multiple imputation generates a finite set of imputations through a posterior predictive distribution. Fractional imputation assigns weights to the observed data. The focus of this article is the development of FEFI for partially classified two-way contingency tables. Point estimators and variances of FEFI estimators of population proportions are derived. Simulation results, when data are missing completely at random or missing at random, show that FEFI is comparable in performance to maximum likelihood estimation and multiple imputation and superior to simple stochastic imputation and complete case anlaysis. Methods are illustrated with four data sets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Blumenthal S (1968) Multinomial sampling with partially categorized data. J Am Stat Assoc 63: 542–551

    Article  MathSciNet  MATH  Google Scholar 

  • Box GEP, Tiao GC (1992) Bayesian inference in statistical analysis. Wiley Classics Library Edition. Wiley, New York

    Book  Google Scholar 

  • Chen TT, Fienberg SE (1974) Two-dimensional contingency tables with both completely and partially cross-classified data. Biometrics 30: 629–642

    Article  MathSciNet  MATH  Google Scholar 

  • Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm (with discussion). J Royal Stat Soc Ser B 39: 1–38

    MathSciNet  MATH  Google Scholar 

  • Fay RE (1996) Alternative paradigms for the analysis of imputed survey data. J Am Stat Assoc 91: 490–498

    MATH  Google Scholar 

  • Kalton G, Kish L (1981) Two efficient random imputation procedures. In: Proceedings of the survey research methods section. American Statistical Association, pp 146–151

  • Kim JK, Fuller W (2004) Fractional Hot deck imputation. Biometrika 91: 559–578

    Article  MathSciNet  MATH  Google Scholar 

  • Little RJA (1982) Models for nonresponse in sample surveys. J Am Stat Assoc 77: 237–250

    MathSciNet  MATH  Google Scholar 

  • Little RJA, Rubin DB (2002) Statistical analysis with missing data. Wiley, New York

    MATH  Google Scholar 

  • Magder LS (2003) Simple approaches to assess the possible impact of missing outcome information on estimates of risk ratios, odds ratios, and risk differences. Controlled Clin Trials 24: 411–421

    Article  Google Scholar 

  • Meng XL, Rubin DB (1991) Using EM to obtain asymptotic variance-covariance matrices: the SEM algorithm. J Am Stat Assoc 86: 899–909

    Google Scholar 

  • Meng XL, Rubin DB (1993) Maximum likelihood estimation via the ECM algorithm: a general framework. Biometrika 80: 267–278

    Article  MathSciNet  MATH  Google Scholar 

  • Molenberghs G, Kenward MG, Goetghebeur E (2001) Sensitivity analysis for incomplete contingency tables: the Slovenian plebiscite case. Appl Stat 50: 15–29

    Article  MATH  Google Scholar 

  • Rubin DB (1976) Infernece and missing data (with discussion). Biometrika 63: 581–592

    Article  MathSciNet  MATH  Google Scholar 

  • Rubin DB (1978) Multiple imputation in sample surveys—a phenomenological bayesian approach to nonresponse. In: Proceedings of the survey research methods section. American Statistical Association, pp 20–34

  • Rubin DB (1987) Multiple imputation for nonresponse in surveys. Wiley, New York

    Book  Google Scholar 

  • Rubin DB (1996) Multiple imputation after 18+ years. J Am Stat Assoc 91: 473–489

    MATH  Google Scholar 

  • Rubin DB, Stern HS, Vehovar V (1995) Handling don’t know survey responses: the case of the Slovenian plebiscite. J Am Stat Assoc 90: 822–828

    Google Scholar 

  • Schafer JL (1997) Analysis of incomplete multivariate data. Chapman & Hall, London

    Book  MATH  Google Scholar 

  • van Dyk DA, Meng XL, Rubin DB (1995) Maximum likelihood estimation via the ECM algorithm: computing the asymptotic variance. Stat Sinica 5: 55–75

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shin-Soo Kang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kang, SS., Koehler, K.J. & Larsen, M.D. Fractional imputation for incomplete two-way contingency tables. Metrika 75, 581–599 (2012). https://doi.org/10.1007/s00184-011-0343-y

Download citation

  • Received:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00184-011-0343-y

Keywords

Navigation