Ordinary Least Squares Regression of Ordered Categorical Data: Inferential Implications for Practice
Ordered categorical variables are frequently encountered as response variables in many disciplines. Agricultural examples include quality assessments of soil or food products, and evaluation of lesion severity, such as teat ends status in dairy cattle. Ordered categorical responses (OCR) are characterized by multiple levels recorded on a ranked scale, whereby levels appraise order but may not be informative of relative magnitude or proportionality between levels. A number of statistically sound methods are available in the standard toolbox to deal with OCR, such as constrained cumulative logit and probit models; however, these are commonly underutilized in practice. Instead, ordinary least squares linear regression (OLSLR) is often employed to infer upon OCR, despite violation of basic model assumptions. In this study, we investigate the inferential implications of OLSLR-based inference on OCR using simulated data to explore realized Type I error rate and realized statistical power under a variety of scenarios. The design of the simulation study was motivated by a data application, thus considering increasing number of levels and various frequency distributions of the OCR. We then illustrate inferential performance of OLSLR relative to a probit regression model fitted to an OCR using a survey dataset of veterinarian antimicrobial use in cattle feedlots. This article has supplementary material online.
Key WordsType I error Statistical power Violation of assumptions
Unable to display preview. Download preview PDF.
- Dean, W. R., McIntosh, W. A., Scott, H. M., and Barling, K. S. (2011), “The Role of Trust and Moral Obligation in Beef Cattle Feedlot Veterinarians’ Contingent Adoption of Antibiotic Metaphylaxis Recommendations,” International Journal of Sociology of Agriculture and Food, 18, 104–120. Google Scholar
- Hsu, T., and Feldt, L. S. (1969), “The Effect of Limitations on the Number of Criterion Score Values on the Significance Level of the F-Test,” American Educational Research Journal, 6, 515–527. Google Scholar
- Kutner, M., Nachtsheim, C. J., Neter, C. J., and Li, W. (2005), Applied Linear Statistical Models, Boston: McGraw-Hill Irwin. Google Scholar
- Radostits, O. M. (2001), Herd Health: Food Animal Production Medicine (3 ed.), Philadelphia: W. B. Saunders Company, pp. 397–433. Google Scholar
- Wright, S. (1934), “An Analysis of Variability in Number of Digits in an Inbred Strain of Guinea Pigs,” Genetics, 19, 506–536. Google Scholar