Multidimensional item response theory models for testlet-based doubly bounded data

Liu, Chen-Wei

doi:10.3758/s13428-023-02272-5

Multidimensional item response theory models for testlet-based doubly bounded data

Original Manuscript
Published: 20 November 2023

(2023)
Cite this article

Behavior Research Methods Aims and scope Submit manuscript

Chen-Wei Liu ORCID: orcid.org/0000-0003-2826-465X¹

208 Accesses
1 Altmetric
Explore all metrics

Abstract

A testlet-based visual analogue scale (VAS) is a doubly bounded scaling approach (e.g., from 0% to 100% or from 0 to 1) composed of multiple adjectives, nouns, or sentences (statements/items) within testlets for measuring individuals’ attitudes, opinions, or career interests. While testlet-based VASs have many advantages over Likert scales, such as reducing response style effects, the development of proper statistical models for analyzing testlet-based VAS data lags behind. This paper proposes a novel beta copula model and a competing logit-normal model based on the item response theory framework, assessed by Bayesian parameter estimation, model comparison, and goodness-of-fit statistics. An empirical career interest dataset based on a testlet-based VAS design was analyzed using the proposed models. Simulation studies were conducted to assess the two models’ parameter recovery. The results show that the beta copula model had superior fit in the empirical data analysis, and also exhibited good parameter recovery in the simulation studies, suggesting that it is a promising statistical approach to testlet-based doubly bounded responses.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1

Analyses of Testlet Data

The Dirichlet Dual Response Model: An Item Response Model for Continuous Bounded Interval Responses

Article Open access 20 July 2023

An Extended Item Response Tree Model for Wording Effects in Mixed-Format Scales

References

Aitchison, J., & Begg, C. B. (1976). Statistical diagnosis when basic cases are not classified with certainty. Biometrika, 63(1), 1–12. https://doi.org/10.1093/biomet/63.1.1
Article Google Scholar
Akaike, H. (1974). A new look at the statistical model identification. IEEE Transactions on Automatic Control, 19(6), 716–723. https://doi.org/10.1109/TAC.1974.1100705
Article Google Scholar
Bürkner, P.-C., Schulte, N., & Holling, H. (2019). On the statistical and practical limitations of thurstonian IRT models. Educational and Psychological Measurement, 79(5), 827–854. https://doi.org/10.1177/0013164419832063
Article PubMed PubMed Central Google Scholar
Bond, M., & Pilowsky, I. (1966). Subjective assessment of pain and its relationship to the administration of analgesics in patients with advanced cancer. Journal of Psychosomatic Research, 10(2), 203–208. https://doi.org/10.1016/0022-3999(66)90064-x
Article PubMed Google Scholar
Bond, T. G., Yan, Z., & Heene, M. (2020). Applying the Rasch model: Fundamental Measurement in the Human Sciences. Routledge. https://doi.org/10.4324/9780429030499
Book Google Scholar
Brehm, J., & Gates, S. (1993). Donut shops and speed traps: evaluating models of supervision on police behavior. American Journal of Political Science, 37(2), 555–581. https://doi.org/10.2307/2111384
Article Google Scholar
Cribari-Neto, F., & Zeileis, A. (2010). Beta regression in R. Journal of Statistical Software, 34(2), 1–24. https://doi.org/10.18637/jss.v034.i02
Article Google Scholar
de Valpine, P., Turek, D., Paciorek, C. J., Anderson-Bergman, C., Lang, D. T., & Bodik, R. (2017). Programming with models: writing statistical algorithms for general model structures with NIMBLE. Journal of Computational and Graphical Statistics, 26(2), 403–413. https://doi.org/10.1080/10618600.2016.1172487
Article Google Scholar
Deonovic, B., Bolsinova, M., Bechger, T., & Maris, G. (2020). A rasch model and rating system for continuous responses collected in large-scale learning systems. Frontiers in Psychology, 11, 500039. https://doi.org/10.3389/fpsyg.2020.500039
Article PubMed PubMed Central Google Scholar
Ferrando, P. J. (2001). A nonlinear congeneric model for continuous item responses. British Journal of Mathematical and Statistical Psychology, 54(2), 293–313. https://doi.org/10.1348/000711001159573
Article PubMed Google Scholar
Ferrari, S., & Cribari-Neto, F. (2004). Beta regression for modelling rates and proportions. Journal of Applied Statistics, 31(7), 799–815. https://doi.org/10.1080/0266476042000214501
Article Google Scholar
Freyd, M. (1923). The graphic rating scale. Journal of Educational Psychology, 14(2), 83–102. https://doi.org/10.1037/h0074329
Article Google Scholar
Gelman, A., & Rubin, D. B. (1992). Inference from iterative simulation using multiple sequences. Statistical Science, 7(4), 457–472. https://doi.org/10.1214/ss/1177011136
Article Google Scholar
Grün, B., Kosmidis, I., & Zeileis, A. (2012). Extended Beta regression in R: shaken, stirred, mixed, and partitioned. Journal of Statistical Software, 48(11), 1–25. https://doi.org/10.18637/jss.v048.i11
Article Google Scholar
Gupta, A. K., & Nadarajah, S. (2004). Handbook of beta distribution and its applications. CRC Press. https://doi.org/10.1201/9781482276596
Book Google Scholar
Hastings, W. K. (1970). Monte Carlo sampling methods using markov chains and their applications. Biometrika, 57(1), 97–109. https://doi.org/10.1093/biomet/57.1.97
Article Google Scholar
Heidelberger, P., & Welch, P. D. (1981). A spectral method for confidence interval generation and run length control in simulations. Communications of the ACM, 24(4), 233–245. https://doi.org/10.1145/358598.358630
Article Google Scholar
Heller, G. Z., Manuguerra, M., & Chow, R. (2016). How to analyze the visual analogue scale: Myths, truths and clinical relevance. Scandinavian Journal of Pain, 13(1), 67–75. https://doi.org/10.1016/j.sjpain.2016.06.012
Article PubMed Google Scholar
Holland, J. L. (1959). A theory of vocational choice. Journal of Counseling Psychology, 6(1), 35–45. https://doi.org/10.1037/h0040767
Article Google Scholar
Holland, J. L. (1973). Making vocational choices: A theory of careers. Prentice-Hall. https://doi.org/10.1016/0022-4405(74)90056-9
Book Google Scholar
Holmes, J. B., & Schofield, M. R. (2022). Moments of the logit-normal distribution. Communications in Statistics-Theory and Methods, 51(3), 610–623. https://doi.org/10.1080/03610926.2020.1752723
Article Google Scholar
Johnson, N. L. (1949). Systems of frequency curves generated by methods of translation. Biometrika, 36(1/2), 149–176. https://doi.org/10.2307/2332539
Article PubMed Google Scholar
Lancaster, T. (2000). The incidental parameter problem since 1948. Journal of Econometrics, 95(2), 391–413. https://doi.org/10.1016/S0304-4076(99)00044-5
Article Google Scholar
Levy, R., & Svetina, D. (2011). A generalized dimensionality discrepancy measure for dimensionality assessment in multidimensional item response theory. British Journal of Mathematical and Statistical Psychology, 64(2), 208–232. https://doi.org/10.1348/000711010X500483
Article PubMed Google Scholar
Lewandowski, D., Kurowicka, D., & Joe, H. (2009). Generating random correlation matrices based on vines and extended onion method. Journal of Multivariate Analysis, 100(9), 1989–2001. https://doi.org/10.1016/j.jmva.2009.04.008
Article Google Scholar
Li, P. (2018). Efficient MCMC estimation of inflated beta regression models. Computational Statistics, 33(1), 127–158. https://doi.org/10.1007/s00180-017-0747-x
Article Google Scholar
Liu, C.-W., & Wang, W.-C. (2019). A general unfolding IRT model for multiple response styles. Applied Psychological Measurement, 43(3), 195–210. https://doi.org/10.1177/0146621618762743
Article PubMed Google Scholar
Liu, C. W., & Chalmers, R. P. (2021). A note on computing Louis’ observed information matrix identity for IRT and cognitive diagnostic models. British Journal of Mathematical and Statistical Psychology, 74(1), 118–138. https://doi.org/10.1111/bmsp.12207
Article PubMed Google Scholar
Makalic, E., & Schmidt, D. F. (2022). An efficient algorithm for sampling from sink (x) for generating random correlation matrices. Communications in Statistics - Simulation and Computation, 51(5), 2731–2735. https://doi.org/10.1080/03610918.2019.1700277
Article Google Scholar
Manuguerra, M., Heller, G. Z., & Ma, J. (2020). Continuous ordinal regression for analysis of visual analogue scales: The R Package ordinalCont. Journal of Statistical Software, 96, 1–25. https://doi.org/10.18637/jss.v096.i08
Article Google Scholar
Marley, J. K., & Wand, M. P. (2010). Non-standard semiparametric regression via BRugs. Journal of Statistical Software, 37, 1–30. https://doi.org/10.18637/jss.v037.i05
Article Google Scholar
Marsh, H. W. (1989). Confirmatory factor analyses of multitrait-multimethod data: Many problems and a few solutions. Applied Psychological Measurement, 13(4), 335–361. https://doi.org/10.1177/014662168901300402
Article Google Scholar
Maydeu-Olivares, A., & Brown, A. (2010). Item response modeling of paired comparison and ranking data. Multivariate Behavioral Research, 45(6), 935–974. https://doi.org/10.1080/00273171.2010.531231
Article PubMed Google Scholar
Mellenbergh, G. J. (1994). A unidimensional latent trait model for continuous item responses. Multivariate Behavioral Research, 29(3), 223–236. https://doi.org/10.1207/s15327906mbr2903_2
Article PubMed Google Scholar
Metropolis, N., Rosenbluth, A. W., Rosenbluth, M. N., Teller, A. H., & Teller, E. (1953). Equation of state calculations by fast computing machines. The Journal of Chemical Physics, 21(6), 1087–1092. https://doi.org/10.1063/1.1699114
Article Google Scholar
Molenaar, D., Cúri, M., & Bazán, J. L. (2022). Zero and one inflated item response theory models for bounded continuous data. Journal of Educational and Behavioral Statistics, 47(6), 693–735. https://doi.org/10.3102/10769986221108455
Article Google Scholar
Neyman, J., & Scott, E. L. (1948). Consistent estimates based on partially consistent observations. Econometrica: Journal of the Econometric Society, 16(1), 1–32. https://doi.org/10.2307/1914288
Noel, Y. (2014). A beta unfolding model for continuous bounded responses. Psychometrika, 79(4), 647–674. https://doi.org/10.1007/s11336-013-9361-1
Article PubMed Google Scholar
Noel, Y., & Dauvier, B. (2007). A beta item response model for continuous bounded responses. Applied Psychological Measurement, 31(1), 47–73. https://doi.org/10.1177/0146621605287691
Article Google Scholar
Paolino, P. (2001). Maximum likelihood estimation of models with beta-distributed dependent variables. Political Analysis, 9(4), 325–346. https://doi.org/10.1093/oxfordjournals.pan.a004873
Article Google Scholar
Philip, B. K. (1990). Parametric statistics for evaluation of the visual analog scale. Anesthesia & Analgesia, 71(6), 710. https://doi.org/10.1213/00000539-199012000-00027
Article Google Scholar
Ramalho, E. A., Ramalho, J. J., & Murteira, J. M. (2011). Alternative estimating and testing empirical strategies for fractional regression models. Journal of Economic Surveys, 25(1), 19–68. https://doi.org/10.1111/j.1467-6419.2009.00602.x
Article Google Scholar
Rasch, G. (1960). Probabilistic models for some intelligence and attainment tests. Danish Institute for Educational Research. https://doi.org/10.4135/9781412961288n335
Book Google Scholar
Reips, U.-D., & Funke, F. (2008). Interval-level measurement with visual analogue scales in internet-based research: VAS generator. Behavior Research Methods, 40(3), 699–704. https://doi.org/10.3758/BRM.40.3.699
Article PubMed Google Scholar
Revuelta, J., Hidalgo, B., & Alcazar-Córcoles, M. Á. (2022). Bayesian estimation and testing of a beta factor model for bounded continuous variables. Multivariate Behavioral Research, 57(1), 57–78. https://doi.org/10.1080/00273171.2020.1805582
Article PubMed Google Scholar
Richards, F. (1959). A flexible growth function for empirical use. Journal of Experimental Botany, 10(2), 290–301. https://doi.org/10.1093/jxb/10.2.290
Article Google Scholar
Rubin, D. B. (1984). Bayesianly justifiable and relevant frequency calculations for the applies statistician. The Annals of Statistics, 1151–1172. https://doi.org/10.1214/aos/1176346785
Samejima, F. (1973). Homogeneous case of the continuous response model. Psychometrika, 38(2), 203–219. https://doi.org/10.1007/BF02291114
Article Google Scholar
Sinharay, S., Johnson, M. S., & Stern, H. S. (2006). Posterior predictive assessment of item response theory models. Applied Psychological Measurement, 30(4), 298–321. https://doi.org/10.1177/0146621605285517
Article Google Scholar
Sklar, M. (1959). Fonctions de répartition à n dimensions et leurs marges. Publications de l’Institutde Statistique de l’Université de Paris, 8, 229–231. https://doi.org/10.4000/cpuc.315
Article Google Scholar
Smithson, M., & Merkle, E. C. (2014). Generalized linear models for categorical and continuous limited dependent variables. CRC Press. https://doi.org/10.1201/b15694-12
Book Google Scholar
Smithson, M., Merkle, E. C., & Verkuilen, J. (2011). Beta regression finite mixture models of polarization and priming. Journal of Educational and Behavioral Statistics, 36(6), 804–831. https://doi.org/10.3102/1076998610396893
Article Google Scholar
Sung, Y.-T., & Wu, J.-S. (2018). The visual analogue scale for rating, ranking and paired-comparison (VAS-RRP): a new technique for psychological measurement. Behavior Research Methods, 50(4), 1694–1715. https://doi.org/10.3758/s13428-018-1041-8
Article PubMed PubMed Central Google Scholar
Tadikamalla, P. R., & Johnson, N. L. (1982). Systems of frequency curves generated by transformations of logistic variables. Biometrika, 69(2), 461–465. https://doi.org/10.1093/biomet/69.2.461
Article Google Scholar
Thurstone, L. (1927). A law of comparative judgment. Psychological Review, 34, 273–286. https://doi.org/10.1037/h0070288
Article Google Scholar
Vehtari, A., Gelman, A., & Gabry, J. (2017). Practical bayesian model evaluation using leave-one-out cross-validation and WAIC. Statistics and Computing, 27(5), 1413–1432. https://doi.org/10.1007/s11222-016-9696-4
Article Google Scholar
Verkuilen, J., & Smithson, M. (2012). Mixed and mixture regression models for continuous bounded responses using the beta distribution. Journal of Educational and Behavioral Statistics, 37(1), 82–113. https://doi.org/10.3102/1076998610396895
Article Google Scholar
Wang, W.-C., & Chen, C.-T. (2005). Item parameter recovery, standard error estimates, and fit statistics of the WINSTEPS program for the family of rasch models. Educational and Psychological Measurement, 65(3), 376–404. https://doi.org/10.1177/0013164404268673
Article Google Scholar
Watanabe, S. (2010). Asymptotic equivalence of bayes cross validation and widely applicable information criterion in singular learning theory. Journal of Machine Learning Research, 11(12), 3571–3594. https://doi.org/10.1007/s11222-016-9657-y
Article Google Scholar
Wewers, M. E., & Lowe, N. K. (1990). A critical review of visual analogue scales in the measurement of clinical phenomena. Research in Nursing & Health, 13(4), 227–236. https://doi.org/10.1002/nur.4770130405
Article Google Scholar
Wright, B. D., & Masters, G. N. (1982). Rating scale analysis. MESA press.
Google Scholar

Download references

Author Note

The first author acknowledges the grant support from the National Science and Technology Council, Grant Number MSTC 110-2410-H-003-054-MY2.

Author information

Authors and Affiliations

Department of Educational Psychology and Counseling, National Taiwan Normal University, Taipei, Taiwan
Chen-Wei Liu

Authors

Chen-Wei Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chen-Wei Liu.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Open Practices Statements

The materials for the simulation study are available at https://github.com/cwliu007/mirt_bd.

Appendices

Appendix 1

Appendix 2

Appendix 3

Appendix 4

Appendix 5

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Liu, CW. Multidimensional item response theory models for testlet-based doubly bounded data. Behav Res (2023). https://doi.org/10.3758/s13428-023-02272-5

Download citation

Accepted: 13 October 2023
Published: 20 November 2023
DOI: https://doi.org/10.3758/s13428-023-02272-5

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multidimensional item response theory models for testlet-based doubly bounded data

Abstract

Access this article