Skip to main content

Clustering Ordinal Data via Latent Variable Models

  • Conference paper
  • First Online:
Algorithms from and for Nature and Life

Abstract

Item response modelling is a well established method for analysing ordinal response data. Ordinal data are typically collected as responses to a number of questions or items. The observed data can be viewed as discrete versions of an underlying latent Gaussian variable. Item response models assume that this latent variable (and therefore the observed ordinal response) is a function of both respondent specific and item specific parameters. However, item response models assume a homogeneous population in that the item specific parameters are assumed to be the same for all respondents. Often a population is heterogeneous and clusters of respondents exist; members of different clusters may view the items differently. A mixture of item response models is developed to provide clustering capabilities in the context of ordinal response data. The model is estimated within the Bayesian paradigm and is illustrated through an application to an ordinal response data set resulting from a clinical trial involving self-assessment of arthritis.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Agresti, A. (2010). Analysis of ordinal categorical data. Hoboken: Wiley.

    Book  MATH  Google Scholar 

  • Albert, J. H., & Chib, S. (1993). Bayesian analysis of binary and polychotomous response data. Journal of the American Statistical Association, 88, 669–679.

    Article  MathSciNet  MATH  Google Scholar 

  • Cowles, M. K. (1996). Accelerating Monte Carlo Markov chain convergence for cumulative-link generalized linear models. Journal of the American Statistical Association, 6, 101–111.

    MathSciNet  Google Scholar 

  • Fox, J. P. (2010). Bayesian item response modeling. New York: Springer.

    Book  MATH  Google Scholar 

  • Frühwirth-Schnatter, S. (2001). Markov chain Monte Carlo estimation of classical and dynamic switching and mixture models. Journal of the American Statistical Association, 96, 194–209.

    Article  MathSciNet  MATH  Google Scholar 

  • Frühwirth-Schnatter, S. (2004). Estimating marginal likelihoods for mixture and Markov switching models using bridge sampling techniques. Statistica Sinica, 6, 831–860.

    Google Scholar 

  • Frühwirth-Schnatter, S. (2006). Finite mixture and Markov switching models. New York: Springer.

    MATH  Google Scholar 

  • Geweke, J., & Zhou, G. (1996). Measuring the price of arbitrage theory. The Review of Financial Studies, 9, 557–587.

    Article  Google Scholar 

  • Gormley, I. C., & Murphy, T. B. (2008). A mixture of experts model for rank data with applications in election studies. The Annals of Applied Statistics, 2(4), 1452–1477.

    Article  MathSciNet  MATH  Google Scholar 

  • Jacobs, R. A., Jordan, M. I., Nowlan, S. J., & Hinton, G. E. (1991). Adaptive mixture of local experts. Neural Computation, 3, 79–87.

    Article  Google Scholar 

  • Johnson, V. E., & Albert, J. H. (1999). Ordinal data modeling. New York: Springer.

    MATH  Google Scholar 

  • Lipsitz, S. R., & Zhao, L. (1994). Analysis of repeated categorical data using generalized estimating equations. Statistics in Medicine, 13, 1149–1163.

    Article  Google Scholar 

  • Lopes, H. F., & West, M. (2004). Bayesian model assessment in factor analysis. Statistica Sinica, 14, 41–67.

    MathSciNet  MATH  Google Scholar 

  • McLachlan, G. J., & Peel, D. (2000). Finite mixture models. New York: Wiley.

    Book  MATH  Google Scholar 

  • McNicholas, P. D., & Murphy, T. B. (2008). Parsimonious Gaussian mixture models. Statistics and Computing, 18(3), 285–296.

    Article  MathSciNet  Google Scholar 

  • Meng, X. L., & Wong, W. H. (1996). Simulating ratios of normalizing constants via a simple identity: a theoretical exploration. The Econometrics Journal, 7, 143–167.

    Google Scholar 

  • Von Davier, M. & Yamamoto, K. (2004). Partially observed mixtures of IRT models: an extension of the generalized partial credit model. Applied Psychological Measurement, 28(6), 389–406.

    Article  MathSciNet  Google Scholar 

Download references

Acknowledgements

This work has emanated from research conducted with the financial support of Science Foundation Ireland under Grant Number 09/RFP/MTH2367.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Damien McParland .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer International Publishing Switzerland

About this paper

Cite this paper

McParland, D., Gormley, I.C. (2013). Clustering Ordinal Data via Latent Variable Models. In: Lausen, B., Van den Poel, D., Ultsch, A. (eds) Algorithms from and for Nature and Life. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-319-00035-0_12

Download citation

Publish with us

Policies and ethics