Categorical Data Analysis
- 128 Citations
- 1 Mentions
- 11k Downloads
Introduction
A categorical variable consists of a set of non-overlapping categories. Categorical data are counts for those categories. The measurement scale is ordinal if the categories exhibit a natural ordering, such as opinion variables with categories from “strongly disagree” to “strongly agree.” The measurement scale is nominal if there is no ordering. The types of possible analysis depend on the measurement scale.
When the subjects measured are cross-classified on two or more categorical variables, the table of counts for the various combinations of categories is a contingency table. The information in a contingency table can be summarized and further analyzed through appropriate measures of association and models. A standard reference on association measures is Goodman and Kruskal (1979).
References and Further Reading
- Agresti A (2002) Categorical data analysis, 2nd edn. Wiley, New YorkzbMATHGoogle Scholar
- Agresti A (2010) Analysis of ordinal categorical data, 2nd edn. Wiley, New YorkzbMATHGoogle Scholar
- Bishop YMM, Fienberg SE, Holland PW (1975) Discrete multivariate analysis: theory and practice. MIT Press, CambridgezbMATHGoogle Scholar
- Goodman LA, Kruskal WH (1979) Measures of association for cross classifications. Springer, New YorkzbMATHGoogle Scholar
- Hosmer DW, Lemeshow S (2000) Applied logistic regression, 2nd edn. Wiley, New YorkzbMATHGoogle Scholar
- McCullagh P (1980) Regression models for ordinal data (with discussion). J R Stat Soc B 42:109–142zbMATHMathSciNetGoogle Scholar
- Nelder J, Wedderburn RWM (1972) Generalized linear models. J R Stat Soc A 135:370–384Google Scholar
- Stokes ME, Davis CS, Koch GG (2000) Categorical data analysis using the SAS system, 2nd edn. SAS Institute, CaryGoogle Scholar
- Thompson LA (2008) R (and S-PLUS) manual to accompany Agresti’s Categorical data analysis (2002), 2nd edn. https://home.comcast.net/~lthompson221/Splusdiscrete2.pdf