Skip to main content
Log in

Ordered Regression Models: a Tutorial

  • Published:
Prevention Science Aims and scope Submit manuscript

Abstract

Ordinal outcomes are common in the social, behavioral, and health sciences, but there is no commonly accepted approach to analyzing them. Researchers make a number of different seemingly arbitrary recoding decisions implying different levels of measurement and theoretical assumptions. As a result, a wide array of models are used to analyze ordinal outcomes, including the linear regression model, binary response model, ordered models, and count models. In this tutorial, we present a diverse set of ordered models (most of which are under-utilized in applied research) and argue that researchers should approach the analysis of ordinal outcomes in a more systematic fashion by taking into consideration both theoretical and empirical concerns, and prioritizing ordered models given the flexibility they provide. Additionally, we consider the challenges that ordinal independent variables pose for analysts that often go unnoticed in the literature and offer simple ways to decide how to include ordinal independent variables in ordered regression models in ways that are easier to justify on conceptual and empirical grounds. We illustrate several ordered regression models with an empirical example, general self-rated health, and conclude with recommendations for building a sounder approach to ordinal data analysis.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Notes

  1. There are constrained and unconstrained partial models, which yields four versions of each approach.

  2. For a discussion of the constrained partial model, see Fullerton and Xu (2016, p.65).

  3. Espinosa and Hennig (2019) developed an ordinal model that allows one to impose a monotonicity constraint on ordinal independent variables via constrained MLE (p.872). The models we present in this study do not impose this inequality constraint. Ordinal patterns may emerge for groups of binary independent variables, but they are not imposed in the models.

  4. The “poor” category only had 1.53% of the overall sample. The LR test for combining the “fair” and “poor” categories was not significant (Χ2 = 9.798, p = 0.367), providing an empirical justification for this approach.

  5. For illustrations of stage and adjacent models, see Fullerton and Xu (2016, 2018) and Bauldry et al. (2018). We also present examples for these approaches in the online appendix.

  6. The “fairly likely” category only had 4.36% of the sample, and the “very likely” category only had 2.59% of the sample. The LR tests for combining these categories was not significant (Χ2 = 6.688, p = 0.571).

  7. We used the gologit2 (Williams 2006) command in Stata to estimate the cumulative models. Although the focus in this example is on cumulative models, we also estimated stage and adjacent models for comparison purposes. The results are substantively similar to the cumulative results. We used the gencrm command (Bauldry et al., 2018) and the regular mlogit command in Stata to estimate the stage and adjacent models, respectively. See the online appendix for the results and details regarding estimation using Stata.

References

  • Agresti, A. (2010). Analysis of ordinal categorical data (2nd ed.). Wiley

    Book  Google Scholar 

  • Amemiya, T. (1981). Qualitative response models: a survey. Journal of Economic Literature, 19, 1483–1536

    Google Scholar 

  • Bauldry, S., Xu, J., & Fullerton, A. S. (2018). Gencrm: a new command for generalized continuation-ratio models. Stata Journal, 18, 924–936

    Article  Google Scholar 

  • Bourdieu, P., Chambordeon, J., & Passeron, J. (1991). The craft of sociology: Epistemological preliminaries. Walter de Gruyter. Berlin, Germany.

    Book  Google Scholar 

  • Brant, R. (1990). Assessing proportionality in the proportional odds model for ordinal logistic regression. Biometrics, 46, 1171–1178

    Article  CAS  PubMed  Google Scholar 

  • Buis, M. L. (2011). The consequences of unobserved heterogeneity in a sequential logit model. Research in Social Stratification and Mobility, 29, 247–262

    Article  Google Scholar 

  • Bürkner, P.-C., & Vuorre, M. (2019). Ordinal regression models in psychology: a tutorial. Advances in Methods and Practices in Psychological Science, 2, 77–101

    Article  Google Scholar 

  • Cameron, S. V., & Heckman, J. J. (1998). Life cycle schooling and dynamic selection bias: Models and evidence for five cohorts of American males. Journal of Political Economy, 106, 262–333

    Article  Google Scholar 

  • Cheng, S., & Long, J. S. (2007). Testing for IIA in the multinomial logit model. Sociological Methods and Research, 35, 583–600

    Article  Google Scholar 

  • Clogg, C. C., & Shihadeh, E. S. (1994). Statistical models for ordinal variables. Sage

    Google Scholar 

  • Cox, D. R. (1972). Regression models and life-tables. Journal of the Royal Statistical Society Series B, 34, 187–220

    Google Scholar 

  • Espinosa, J., & Hennig, C. (2019). A constrained regression model for an ordinal response with ordinal predictors. Statistics and Computing, 29, 869–890

    Article  Google Scholar 

  • Fienberg, S. E. (1980). The analysis of cross-classified categorical data (2nd ed.). MIT Press

    Google Scholar 

  • Fullerton, A. S. (2009). A conceptual framework for ordered logistic regression models. Sociological Methods and Research, 38, 306–347

    Article  Google Scholar 

  • Fullerton, A. S., & Dixon, J. C. (2009). Racialization, asymmetry, and the context of welfare attitudes in the American states. Journal of Political and Military Sociology, 37, 95–120

    Google Scholar 

  • Fullerton, A. S., & Xu, J. (2012). The proportional odds with partial proportionality constraints model for ordinal response variables. Social Science Research, 41, 182–198

    Article  PubMed  Google Scholar 

  • Fullerton, A. S., & Xu, J. (2016). Ordered regression models: Parallel, partial, and non-parallel alternatives. Chapman & Hall/CRC Press

    Book  Google Scholar 

  • Fullerton, A. S., & Xu, J. (2018). Constrained and unconstrained partial adjacent category logit models for ordinal response variables. Sociological Methods and Research, 47, 169–206

    Article  Google Scholar 

  • Goodman, L. A. (1983). The analysis of dependence in cross-classifications having ordered categories, using log-linear models for frequencies and log-linear models for odds. Biometrics, 39, 149–160

    Article  CAS  PubMed  Google Scholar 

  • Greene, W. H., & Hensher, D. A. (2010). Modeling ordered choices: A primer. Cambridge, UK: Cambridge University Press.

  • Hedeker, D. R., Mermelstein, R. J., & Weeks, K. A. (1999). The thresholds of change model: An application to analyzing stages of change data. Annals of Behavioral Medicine, 21, 61–70

    Article  CAS  PubMed  Google Scholar 

  • Lieberson, S. (1985). Making it count. University of California Press

    Google Scholar 

  • Likert, R. (1932). A technique for the measurement of attitudes. Archives of Psychology, 140, 1–55

    Google Scholar 

  • Long, J. S. (1997). Regression models for categorical and limited dependent variables. Thousand Oaks, CA: Sage

  • Long, J. S., & Freese, J. (2014). Regression models for categorical dependent variables using Stata (3rd ed.). College Station, TX: Stata Press

  • Maddala, G. S. (1983). Limited-dependent and qualitative variables in econometrics. Cambridge University Press

    Book  Google Scholar 

  • Mare, R. D. (2011). Introduction to symposium on unmeasured heterogeneity in school transition models. Research in Social Stratification and Mobility, 29, 239–245

    Article  PubMed  PubMed Central  Google Scholar 

  • McCullagh, P. (1980). Regression models for ordinal data. Journal of the Royal Statistical Society Series B, 42, 109–142

    Google Scholar 

  • McCullagh, P., & Nelder, J. A. (1989). Generalized linear models (2nd ed.). Chapman & Hall

    Book  Google Scholar 

  • McFadden, D. (1973). Conditional logit analysis of qualitative choice behavior. In P. Zarembka (Ed.), Frontiers in econometrics (pp. 105–142). Academic Press

    Google Scholar 

  • McKelvey, R. D., & Zavoina, W. (1975). A statistical model for the analysis of ordinal level dependent variables. Journal of Mathematical Sociology, 4, 103–120

    Article  Google Scholar 

  • Nadler, J. T., Weston, R., & Voyles, E. C. (2015). Stuck in the middle: The use and interpretation of mid-points in items on questionnaires. Journal of General Psychology, 142, 71–89

    Article  PubMed  Google Scholar 

  • Peterson, B., & Harrell, F. E., Jr. (1990). Partial proportional odds models for ordinal response variables. Applied Statistics, 39, 205–217

    Article  Google Scholar 

  • Tucker, G., Adams, R., & Wilson, D. (2013). Observed agreement problems between sub-scales and summary components of the SF-36 Version 2 — An alternative scoring method can correct the problem. PLoS ONE , 8, e61191

  • Tucker, G., Adams, R., & Wilson, D. (2014). Results from several population studies show that recommended scoring methods of the SF-36 and the SF-12 may lead to incorrect conclusions and subsequent health decisions. Quality of Life Research, 23, 2195–2203

    Article  PubMed  Google Scholar 

  • Tutz, G. (1991). Sequential models in categorical regression. Computational Statistics & Data Analysis, 11, 275–295

    Article  Google Scholar 

  • Williams, R. (2006). Generalized ordered logit/partial proportional odds models for ordinal dependent variables. Stata Journal, 6, 58–82

    Article  Google Scholar 

  • Williams, R. (2009). Using heterogeneous choice models to compare logit and probit coefficients across groups. Sociological Methods and Research, 37, 531–559

    Article  Google Scholar 

  • Williams, R. (2016). Understanding and interpreting generalized ordered logit models. Journal of Mathematical Sociology, 40, 7–20

    Article  Google Scholar 

  • Xie, Y. (2011). Values and limitations of statistical models. Research in Social Stratification and Mobility, 29, 343–349

    Article  PubMed  PubMed Central  Google Scholar 

  • Xu, J., Bauldry, S., & Fullerton, A. S. (2019). Bayesian approaches to assessing the parallel lines assumption in cumulative ordered logit models. Sociological Methods and Research (In Press). https://doi.org/10.1177/0049124119882461

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Andrew S. Fullerton.

Ethics declarations

Ethical Approval

The OSU Institutional Review Board granted the study exempt status.

Informed Consent

Not applicable.

Conflict of Interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Fullerton, A.S., Anderson, K.F. Ordered Regression Models: a Tutorial. Prev Sci 24, 431–443 (2023). https://doi.org/10.1007/s11121-021-01302-y

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11121-021-01302-y

Keywords

Navigation