Skip to main content
Log in

Evaluation of missing data mechanisms in two and three dimensional incomplete tables

  • Published:
Journal of the Korean Statistical Society Aims and scope Submit manuscript

Abstract

The analysis of incomplete contingency tables is a practical and an interesting problem. In this paper, we provide characterizations for the various missing mechanisms of a variable in terms of response and non-response odds for two and three dimensional incomplete tables. Log-linear parametrization and some distinctive properties of the missing data models for the above tables are discussed. All possible cases in which data on one, two or all variables may be missing are considered. We study the missingness of each variable in a model, which is more insightful for analyzing cross-classified data than the missingness of the outcome vector. For sensitivity analysis of the incomplete tables, we propose easily verifiable procedures to evaluate the missing at random (MAR), missing completely at random (MCAR) and not missing at random (NMAR) assumptions of the missing data models. These methods depend only on joint and marginal odds computed from fully and partially observed counts in the tables, respectively. Finally, some real-life datasets are analyzed to illustrate our results, which are confirmed based on simulation studies.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Baker, S. G., & Laird, N. M. (1988). Regression analysis for categorical variables with outcome subject to nonignorable nonresponse. Journal of the American Statistical Association, 83, 62–69.

    Article  MathSciNet  Google Scholar 

  • Baker, S. G., Rosenberger, W. F., & Dersimonian, R. (1992). Closed-form estimates for missing counts in two-way contingency tables. Statistics in Medicine, 11, 643–657.

    Article  Google Scholar 

  • Clarke, P. S., & Smith, P. W. F. (2005). On maximum likelihood estimation for log-linear models with non-ignorable non-responses. Statistics & Probability Letters, 73, 441–448.

    Article  MathSciNet  Google Scholar 

  • Ghosh, S., & Vellaisamy, P. (2016b). Closed form estimates for missing counts in multidimensional incomplete tables. arXiv:1602.00947.

    Google Scholar 

  • Ghosh, S., & Vellaisamy, P. (2016a). On the occurrence of boundary solutions in multidimensional incomplete tables. Statistics & Probability Letters, 119, 63–75.

    Article  MathSciNet  Google Scholar 

  • Ghosh, S., & Vellaisamy, P. (2017). On the occurrence of boundary solutions in two-way incomplete tables. In REVSTAT. (forthcoming).

    Google Scholar 

  • Kim, S., Park, Y., & Kim, D. (2015). Onmissing-at-random mechanismintwo-way incomplete contingency tables. Statistics & Probability Letters, 96, 196–203.

    Article  MathSciNet  Google Scholar 

  • Little, J. A., & Rubin, D. B. (2002). Statistical analysis with missing data (2nd ed.). New York: Wiley.

    Book  Google Scholar 

  • Molenberghs, G., Beunckens, C., Sotto, C., & Kenward, M. G. (2008). Every missing not at random model has a missingness at random counterpart with equal fit. Journal of the Royal Statistical Society: Series, 70, 371–388.

    Article  MathSciNet  Google Scholar 

  • Molenberghs, G., Kenward, M. G., & Goetghebeur, E. (2001). Sensitivity analysis for incomplete contingency tables: the Slovenian plebiscite case. Journal of the Royal Statistical Society. Series C. Applied Statistics, 50, 15–29.

    Article  Google Scholar 

  • Park, Y., Kim, D., & Kim, S. (2014). Identification of the occurrence of boundary solutions in a contingency table with nonignorable nonresponse. Statistics & Probability Letters, 93, 34–40.

    Article  MathSciNet  Google Scholar 

  • Rochani, H. D., Vogel, R. L., Samawi, H. M., & Linder, D. F. (2017). Estimates for cell counts and common odds ratio in three-way contingency tables by homogeneous log-linear models with missing data. AStA Advances in Statistical Analysis, 101, 51–65.

    Article  MathSciNet  Google Scholar 

  • Rubin, D. B., Stern, H. S., & Vehovar, V. (1995). Handling Don’t know survey responses: the case of the Slovenian plebiscite. Journal of the American Statistical Association, 90, 822–828.

    Google Scholar 

  • Vansteelandt, S., Goetghebeur, E., Kenward, M. G., & Molenberghs, G. (2006). Ignorance and uncertainty regions as inferential tools in a sensitivity analysis. Statistica Sinica, 16, 953–979.

    MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sayan Ghosh.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ghosh, S., Vellaisamy, P. Evaluation of missing data mechanisms in two and three dimensional incomplete tables. J. Korean Stat. Soc. 48, 297–313 (2019). https://doi.org/10.1016/j.jkss.2018.11.008

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1016/j.jkss.2018.11.008

Keywords

Navigation