Skip to main content

Cross Tabulation and Categorical Data Analysis

  • Chapter
  • First Online:
Introduction to Statistical Methods in Pathology

Abstract

Often, we have questions about associations of events or variables with each other or their correlation with each other. For example, in pathology we commonly face the question of association of a test result with a disease status. In statistics, the process of testing the association between events is called hypothesis testing. If the variables are categorical (i.e., they can only assume finite discrete values), a common approach to hypothesis testing is to employ cross tabulation.

Cross tabulation is the summarization of categorical data into a table with each cell in the table containing the frequency (either raw or proportional) of the observations that fit the categories represented by that cell. The summary data presented in cross-tabulated form then can be used for many statistical tests most of which follow a distribution called chi-squared distribution.

In this chapter, we explain the concept of hypothesis testing and introduce the most common statistical tests used in hypothesis testing of categorical data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 139.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Strike PW. Statistical methods in laboratory medicine. New York: Butterworth-Heinemann; 2014.

    Google Scholar 

  2. Elliott AC, Woodward WA. Statistical analysis quick reference guidebook: with SPSS examples. Thousand Oaks: Sage; 2007.

    Book  Google Scholar 

  3. Agresti A, Kateri M. Categorical data analysis. Berlin Heidelberg: Springer; 2011.

    Book  Google Scholar 

  4. Fisher RA. On the interpretation of X2 from contingency tables, and the calculation of P. J R Stat Soc. 1922;85(1):87–94.

    Article  Google Scholar 

  5. Simpson EH. The interpretation of interaction in contingency tables. J R Stat Soc Ser B Methodol. 1951;13:238–41.

    Google Scholar 

  6. Wilson EB, Hilferty MM. The distribution of chi-square. Proc Natl Acad Sci. 1931;17(12):684–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Eisenhauer JG. Degrees of freedom. Teach Stat. 2008;30(3):75–8.

    Article  Google Scholar 

  8. Sharpe D. Your chi-square test is statistically significant: Now what? Practical Assessment, Research & Evaluation. 2015;20:1–10.

    Google Scholar 

  9. Scheaffer RL, Yes N. Categorical data analysis: NCSSM Statistics Leadership Institute, USA; 1999. (online publication accessible at: http://courses.ncssm.edu/math/Stat_Inst/PDFS/Categorical%20Data%20Analysis.pdf)

  10. Fleiss JL. Categorical Data Analysis. J Am Stat Assoc. 1991;86(416):1140–1.

    Article  Google Scholar 

  11. Mantel N. Chi-square tests with one degree of freedom; extensions of the Mantel-Haenszel procedure. J Am Stat Assoc. 1963;58(303):690–700.

    Google Scholar 

  12. Trajman A, Luiz RR. McNemar X2 test revisited: comparing sensitivity and specificity of diagnostic examinations. Scand J Clin Lab Invest. 2008;68(1):77–80.

    Article  CAS  PubMed  Google Scholar 

  13. Routledge R. Fisher’s exact test. In: Encyclopedia of biostatistics. New York: John Wiley Publishing; 2005.

    Google Scholar 

  14. Cohen J. Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit. Psychol Bull. 1968;70(4):213.

    Article  CAS  PubMed  Google Scholar 

  15. Fleiss JL, Cohen J. The equivalence of weighted kappa and the intraclass correlation coefficient as measures of reliability. Educ Psychol Meas. 1973;33(3):613–9.

    Article  Google Scholar 

  16. Zhou XH, McClish DK, Obuchowski NA. Statistical methods in diagnostic medicine. John Wiley & Sons: New York; 2009.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this chapter

Cite this chapter

Momeni, A., Pincus, M., Libien, J. (2018). Cross Tabulation and Categorical Data Analysis. In: Introduction to Statistical Methods in Pathology . Springer, Cham. https://doi.org/10.1007/978-3-319-60543-2_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-60543-2_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-60542-5

  • Online ISBN: 978-3-319-60543-2

  • eBook Packages: MedicineMedicine (R0)

Publish with us

Policies and ethics