A mixed-effects location-scale model for ordinal questionnaire data

Hedeker, Donald; Mermelstein, Robin J.; Demirtas, Hakan; Berbaum, Michael L.

doi:10.1007/s10742-016-0145-9

A mixed-effects location-scale model for ordinal questionnaire data

Published: 11 April 2016

Volume 16, pages 117–131, (2016)
Cite this article

Health Services and Outcomes Research Methodology Aims and scope Submit manuscript

Donald Hedeker¹,
Robin J. Mermelstein²,
Hakan Demirtas² &
…
Michael L. Berbaum²

815 Accesses
6 Citations
Explore all metrics

Abstract

In health studies, questionnaire items are often scored on an ordinal scale, for example on a Likert scale. For such questionnaires, item response theory (IRT) models provide a useful approach for obtaining summary scores for subjects (i.e., the model’s random subject effect) and characteristics of the items (e.g., item difficulty and discrimination). In this article, we describe a model that allows the items to additionally exhibit different within-subject variance, and also includes a subject-level random effect to the within-subject variance specification. This permits subjects to be characterized in terms of their mean level, or location, and their variability, or scale, and the model allows item difficulty and discrimination in terms of both random subject effects (location and scale). We illustrate application of this location-scale mixed model using data from the Social Subscale of the Drinking Motives Questionnaire assessed in an adolescent study. We show that the proposed model fits the data significantly better than simpler IRT models, and is able to identify items and subjects that are not well-fit by the simpler models. The proposed model has useful applications in many areas where questionnaires are often rated on an ordinal scale, and there is interest in characterizing subjects in terms of both their mean and variability.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Estimating meaningful thresholds for multi-item questionnaires using item response theory

Article Open access 13 February 2023

Why item response theory should be used for longitudinal questionnaire data analysis in medical research

Article Open access 30 July 2015

Advances in nonparametric item response theory for scale construction in quality-of-life research

Article Open access 09 November 2021

References

Agresti, A., Lang, J.B.: A proportional odds model with subject-specific effects for repeated ordered categorical responses. Biometrika 80, 527–534 (1993)
Article Google Scholar
Aitkin, M.: Modelling variance heterogeneity in normal regression using GLIM. Appl. Stat. 36, 332–339 (1987)
Article Google Scholar
Bock, R.D., Aitken, M.: Marginal maximum likelihood estimation of item parameters: application of an EM algorithm. Psychometrika 46, 443–459 (1981)
Article Google Scholar
Cooper, M.L.: Motivations for alcohol use among adolescents: development and validation of a four-factor model. Psychol. Assess. 6, 117–128 (1994)
Article Google Scholar
Cox, C.: Location-scale cumulative odds models for ordinal data: a generalized non-linear model approach. Stat. Med. 14, 1191–1203 (1995)
Article CAS PubMed Google Scholar
Dierker, L., Mermelstein, R.: Early emerging nicotine-dependence symptoms: a signal of propensity for chronic smoking behavior in adolescents. J. Pediatr. 156, 818–822 (2010)
Article CAS PubMed PubMed Central Google Scholar
Dorman, J.: The effect of clustering on statistical tests: an illustration using classroom environment data. Educ. Psychol. 28, 583–595 (2008)
Article Google Scholar
Ezzet, F., Whitehead, J.: A random effects model for ordinal responses from a crossover trial. Stat. Med. 10, 901–907 (1991)
Article CAS PubMed Google Scholar
Harvey, A.C.: Estimating regression models with multiplicative heteroscedasticity. Econometrica 44, 461–465 (1976)
Article Google Scholar
Hedeker, D., Berbaum, M., Mermelstein, R.: Location-scale models for multilevel ordinal data: between- and within-subjects variance modeling. J. Probab. Stat. Sci. 4, 1–20 (2006)
Google Scholar
Hedeker, D., Demirtas, H., Mermelstein, R.J.: A mixed ordinal location-scale model for analysis of ecological momentary assessment data. Stat. Interface 2, 391–402 (2009)
Article PubMed PubMed Central Google Scholar
Hedeker, D., Gibbons, R.D.: A random-effects ordinal regression model for multilevel analysis. Biometrics 50, 933–944 (1994)
Article CAS PubMed Google Scholar
Hedeker, D., Gibbons, R.D.: Longitudinal Data Analysis Longitudinal Data Analysis. Wiley, New York (2006)
Google Scholar
Hedeker, D., Mermelstein, R.J.: A multilevel thresholds of change model for analysis of stages of change data. Multivar. Behav. Res. 33, 427–455 (1998)
Article Google Scholar
Hedeker, D., Mermelstein, R.J., Flay, B.R.: Models for intensive longitudinal data. In: Walls, T.A., Schafer, J.L. (eds.) Application of Item Response Theory Models for Intensive Longitudinal Data, pp. 84–108. Oxford University Press, New York (2006)
Chapter Google Scholar
Ishwaran, H., Gatsonis, C.: A general class of hierarchical ordinal regression models with applications to correlated ROC analysis. Can. J. Stat. 28, 731–750 (2000)
Article Google Scholar
Johnson, T.R.: On the use of heterogeneous thresholds ordinal regression models to account for individual differences in response style. Psychometrika 68, 563–583 (2003)
Article Google Scholar
Leckie, G., French, R., Charlton, C., Browne, W.: Modeling heterogeneous variance-covariance components in two-level models. J. Educ. Behav. Stat. 39, 307–332 (2014)
Article Google Scholar
McCullagh, P.: Regression models for ordinal data (with discussion). J. R. Stat. Soc. Ser. B 42, 109–142 (1980)
Google Scholar
McCullagh, P., Nelder, J.A.: Generalized Linear Models, 2nd edn. Chapman and Hall, New York (1989)
Book Google Scholar
Rijmen, F., Tuerlinckx, F., De Boeck, P., Kuppens, P.: A nonlinear mixed model framework for item response theory. Psychol. Methods 8, 185–205 (2003)
Article PubMed Google Scholar
Saei, A., McGilchrist, C.A.: Longitudinal threshold models with random components. J. R. Stat. Soc. Ser. D (Stat.) 47, 365–375 (1998)
Article Google Scholar
Samejima, F.: Estimation of latent ability using a response pattern of graded scores. Psychom. Monogr. 17, 1–100 (1969)
Article Google Scholar
Shiffman, S., Waters, A., Hickcox, M.: The nicotine dependence syndrome scale: a multidimensional measure of nicotine dependence. Nicotine Tob. Res. 6, 327–348 (2004)
Article CAS PubMed Google Scholar
Skrondal, A., Rabe-Hesketh, S.: Generalized Latent Variable Modeling: Multilevel, Longitudinal, and Structural Equation Models. Chapman and Hall/CRC, New York (2004)
Book Google Scholar
Toledano, A.Y., Gatsonis, C.: Ordinal regression methodology for ROC curves derived from correlated data. Stat. Med. 15, 1807–1826 (1996)
Article CAS PubMed Google Scholar
Tosteson, A.N., Begg, C.B.: A general regression methodology for ROC curve estimation. Med. Decis. Mak. 8, 204–215 (1988)
Article CAS Google Scholar
Tutz, G., Hennevogl, W.: Random effects in ordinal regression models. Comput. Stat. Data Anal. 22, 537–557 (1996)
Article Google Scholar

Download references

Funding

This study was funded by National Cancer Institute grant P01 CA98262 (Mermelstein, PI) and National Heart Lung and Blood Institute grant R01 HL121330 (Hedeker & Dunton).

Author information

Authors and Affiliations

Department of Public Health Sciences, University of Chicago, 5841 S. Maryland Ave., Room W254, MC2000, Chicago, IL, 60637, USA
Donald Hedeker
University of Illinois at Chicago, Chicago, IL, USA
Robin J. Mermelstein, Hakan Demirtas & Michael L. Berbaum

Authors

Donald Hedeker
View author publications
You can also search for this author in PubMed Google Scholar
Robin J. Mermelstein
View author publications
You can also search for this author in PubMed Google Scholar
Hakan Demirtas
View author publications
You can also search for this author in PubMed Google Scholar
Michael L. Berbaum
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Donald Hedeker.

Ethics declarations

Conflict of interest

All authors declare that he/she has no conflict of interest.

Ethical approval

All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.

Informed consent

Informed consent was obtained from all individual participants included in the study.

Additional information

Supported by National Cancer Institute grant P01 CA98262 (Mermelstein, PI) and National Heart Lung and Blood Institute grant R01 HL121330 (Hedeker & Dunton). Thanks to Oksana Pugach for aiding in carrying out the simulation studies.

Appendix: SAS PROC NLMIXED syntax

Below is a sample of syntax necessary to run the mixed-effects ordinal location-scale model described in this article. Uppercase letters are used for SAS specific syntax and lowercase letters are used for user defined entities. In terms of the variables used in this syntax, id is a subject identifier, y denotes the ordinal outcome and x1 to x3 are item indicators. Here, for simplicity, we illustrate the syntax consdering only three items and three response categories. The two cumulative logits are clogit1 and clogit2, and the two cumulative probabilities are cprob1 and cprob2. The random location effect is named theta, the random scale effect is zeta, and their correlation is corr. The location model is summarized by loc, while the scale model is given by scale.

Users must provide starting values for all parameters on the PARMS statement. To do so, it is beneficial to run the model in stages using estimates from a prior stage as starting values and setting the additional parameters to zero or some small value. For example, one can start by estimating a random-intercepts ordinal model with item difficulty (beta1-beta3), item discrimination (delta1-delta3), and threshold parameter (alpha2). Estimates of these parameters can then be specified as starting values in a model that adds in the WS variance parameters (gamma1-gamma3). Finally, the full model with the additional parameters (tau1-tau3 and cor) can be estimated. In practice, this approach works well with PROC NLMIXED, which sometimes has difficulties in converging to a solution for complex models. Furthermore, for complex models, it is sometimes the case that the default convergence criteria is not strict enough. In the above syntax, the convergence criteria is specified as GCONV=1e-12 on the PROC NLMIXED statement.

Table 1 Simulation results for 4-parameter location-scale model

Full size table

Table 2 Simulation results for 2- and 3-parameter models

Full size table

Table 3 4-parameter location-scale model estimates (95 % confidence intervals)

Full size table

Table 4 Subjects with high, near-zero, and low scale: Random effect estimates and observed item responses

Full size table

Table 5 4-parameter location-scale model: Male gender DIF estimates (95 % confidence intervals)

Full size table

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hedeker, D., Mermelstein, R.J., Demirtas, H. et al. A mixed-effects location-scale model for ordinal questionnaire data. Health Serv Outcomes Res Method 16, 117–131 (2016). https://doi.org/10.1007/s10742-016-0145-9

Download citation

Received: 30 November 2015
Revised: 25 March 2016
Accepted: 01 April 2016
Published: 11 April 2016
Issue Date: September 2016
DOI: https://doi.org/10.1007/s10742-016-0145-9

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A mixed-effects location-scale model for ordinal questionnaire data

Abstract

Access this article

Similar content being viewed by others

Estimating meaningful thresholds for multi-item questionnaires using item response theory

Why item response theory should be used for longitudinal questionnaire data analysis in medical research

Advances in nonparametric item response theory for scale construction in quality-of-life research

References

Funding