Abstract
A testlet-based visual analogue scale (VAS) is a doubly bounded scaling approach (e.g., from 0% to 100% or from 0 to 1) composed of multiple adjectives, nouns, or sentences (statements/items) within testlets for measuring individuals’ attitudes, opinions, or career interests. While testlet-based VASs have many advantages over Likert scales, such as reducing response style effects, the development of proper statistical models for analyzing testlet-based VAS data lags behind. This paper proposes a novel beta copula model and a competing logit-normal model based on the item response theory framework, assessed by Bayesian parameter estimation, model comparison, and goodness-of-fit statistics. An empirical career interest dataset based on a testlet-based VAS design was analyzed using the proposed models. Simulation studies were conducted to assess the two models’ parameter recovery. The results show that the beta copula model had superior fit in the empirical data analysis, and also exhibited good parameter recovery in the simulation studies, suggesting that it is a promising statistical approach to testlet-based doubly bounded responses.
Similar content being viewed by others
References
Aitchison, J., & Begg, C. B. (1976). Statistical diagnosis when basic cases are not classified with certainty. Biometrika, 63(1), 1–12. https://doi.org/10.1093/biomet/63.1.1
Akaike, H. (1974). A new look at the statistical model identification. IEEE Transactions on Automatic Control, 19(6), 716–723. https://doi.org/10.1109/TAC.1974.1100705
Bürkner, P.-C., Schulte, N., & Holling, H. (2019). On the statistical and practical limitations of thurstonian IRT models. Educational and Psychological Measurement, 79(5), 827–854. https://doi.org/10.1177/0013164419832063
Bond, M., & Pilowsky, I. (1966). Subjective assessment of pain and its relationship to the administration of analgesics in patients with advanced cancer. Journal of Psychosomatic Research, 10(2), 203–208. https://doi.org/10.1016/0022-3999(66)90064-x
Bond, T. G., Yan, Z., & Heene, M. (2020). Applying the Rasch model: Fundamental Measurement in the Human Sciences. Routledge. https://doi.org/10.4324/9780429030499
Brehm, J., & Gates, S. (1993). Donut shops and speed traps: evaluating models of supervision on police behavior. American Journal of Political Science, 37(2), 555–581. https://doi.org/10.2307/2111384
Cribari-Neto, F., & Zeileis, A. (2010). Beta regression in R. Journal of Statistical Software, 34(2), 1–24. https://doi.org/10.18637/jss.v034.i02
de Valpine, P., Turek, D., Paciorek, C. J., Anderson-Bergman, C., Lang, D. T., & Bodik, R. (2017). Programming with models: writing statistical algorithms for general model structures with NIMBLE. Journal of Computational and Graphical Statistics, 26(2), 403–413. https://doi.org/10.1080/10618600.2016.1172487
Deonovic, B., Bolsinova, M., Bechger, T., & Maris, G. (2020). A rasch model and rating system for continuous responses collected in large-scale learning systems. Frontiers in Psychology, 11, 500039. https://doi.org/10.3389/fpsyg.2020.500039
Ferrando, P. J. (2001). A nonlinear congeneric model for continuous item responses. British Journal of Mathematical and Statistical Psychology, 54(2), 293–313. https://doi.org/10.1348/000711001159573
Ferrari, S., & Cribari-Neto, F. (2004). Beta regression for modelling rates and proportions. Journal of Applied Statistics, 31(7), 799–815. https://doi.org/10.1080/0266476042000214501
Freyd, M. (1923). The graphic rating scale. Journal of Educational Psychology, 14(2), 83–102. https://doi.org/10.1037/h0074329
Gelman, A., & Rubin, D. B. (1992). Inference from iterative simulation using multiple sequences. Statistical Science, 7(4), 457–472. https://doi.org/10.1214/ss/1177011136
Grün, B., Kosmidis, I., & Zeileis, A. (2012). Extended Beta regression in R: shaken, stirred, mixed, and partitioned. Journal of Statistical Software, 48(11), 1–25. https://doi.org/10.18637/jss.v048.i11
Gupta, A. K., & Nadarajah, S. (2004). Handbook of beta distribution and its applications. CRC Press. https://doi.org/10.1201/9781482276596
Hastings, W. K. (1970). Monte Carlo sampling methods using markov chains and their applications. Biometrika, 57(1), 97–109. https://doi.org/10.1093/biomet/57.1.97
Heidelberger, P., & Welch, P. D. (1981). A spectral method for confidence interval generation and run length control in simulations. Communications of the ACM, 24(4), 233–245. https://doi.org/10.1145/358598.358630
Heller, G. Z., Manuguerra, M., & Chow, R. (2016). How to analyze the visual analogue scale: Myths, truths and clinical relevance. Scandinavian Journal of Pain, 13(1), 67–75. https://doi.org/10.1016/j.sjpain.2016.06.012
Holland, J. L. (1959). A theory of vocational choice. Journal of Counseling Psychology, 6(1), 35–45. https://doi.org/10.1037/h0040767
Holland, J. L. (1973). Making vocational choices: A theory of careers. Prentice-Hall. https://doi.org/10.1016/0022-4405(74)90056-9
Holmes, J. B., & Schofield, M. R. (2022). Moments of the logit-normal distribution. Communications in Statistics-Theory and Methods, 51(3), 610–623. https://doi.org/10.1080/03610926.2020.1752723
Johnson, N. L. (1949). Systems of frequency curves generated by methods of translation. Biometrika, 36(1/2), 149–176. https://doi.org/10.2307/2332539
Lancaster, T. (2000). The incidental parameter problem since 1948. Journal of Econometrics, 95(2), 391–413. https://doi.org/10.1016/S0304-4076(99)00044-5
Levy, R., & Svetina, D. (2011). A generalized dimensionality discrepancy measure for dimensionality assessment in multidimensional item response theory. British Journal of Mathematical and Statistical Psychology, 64(2), 208–232. https://doi.org/10.1348/000711010X500483
Lewandowski, D., Kurowicka, D., & Joe, H. (2009). Generating random correlation matrices based on vines and extended onion method. Journal of Multivariate Analysis, 100(9), 1989–2001. https://doi.org/10.1016/j.jmva.2009.04.008
Li, P. (2018). Efficient MCMC estimation of inflated beta regression models. Computational Statistics, 33(1), 127–158. https://doi.org/10.1007/s00180-017-0747-x
Liu, C.-W., & Wang, W.-C. (2019). A general unfolding IRT model for multiple response styles. Applied Psychological Measurement, 43(3), 195–210. https://doi.org/10.1177/0146621618762743
Liu, C. W., & Chalmers, R. P. (2021). A note on computing Louis’ observed information matrix identity for IRT and cognitive diagnostic models. British Journal of Mathematical and Statistical Psychology, 74(1), 118–138. https://doi.org/10.1111/bmsp.12207
Makalic, E., & Schmidt, D. F. (2022). An efficient algorithm for sampling from sink (x) for generating random correlation matrices. Communications in Statistics - Simulation and Computation, 51(5), 2731–2735. https://doi.org/10.1080/03610918.2019.1700277
Manuguerra, M., Heller, G. Z., & Ma, J. (2020). Continuous ordinal regression for analysis of visual analogue scales: The R Package ordinalCont. Journal of Statistical Software, 96, 1–25. https://doi.org/10.18637/jss.v096.i08
Marley, J. K., & Wand, M. P. (2010). Non-standard semiparametric regression via BRugs. Journal of Statistical Software, 37, 1–30. https://doi.org/10.18637/jss.v037.i05
Marsh, H. W. (1989). Confirmatory factor analyses of multitrait-multimethod data: Many problems and a few solutions. Applied Psychological Measurement, 13(4), 335–361. https://doi.org/10.1177/014662168901300402
Maydeu-Olivares, A., & Brown, A. (2010). Item response modeling of paired comparison and ranking data. Multivariate Behavioral Research, 45(6), 935–974. https://doi.org/10.1080/00273171.2010.531231
Mellenbergh, G. J. (1994). A unidimensional latent trait model for continuous item responses. Multivariate Behavioral Research, 29(3), 223–236. https://doi.org/10.1207/s15327906mbr2903_2
Metropolis, N., Rosenbluth, A. W., Rosenbluth, M. N., Teller, A. H., & Teller, E. (1953). Equation of state calculations by fast computing machines. The Journal of Chemical Physics, 21(6), 1087–1092. https://doi.org/10.1063/1.1699114
Molenaar, D., Cúri, M., & Bazán, J. L. (2022). Zero and one inflated item response theory models for bounded continuous data. Journal of Educational and Behavioral Statistics, 47(6), 693–735. https://doi.org/10.3102/10769986221108455
Neyman, J., & Scott, E. L. (1948). Consistent estimates based on partially consistent observations. Econometrica: Journal of the Econometric Society, 16(1), 1–32. https://doi.org/10.2307/1914288
Noel, Y. (2014). A beta unfolding model for continuous bounded responses. Psychometrika, 79(4), 647–674. https://doi.org/10.1007/s11336-013-9361-1
Noel, Y., & Dauvier, B. (2007). A beta item response model for continuous bounded responses. Applied Psychological Measurement, 31(1), 47–73. https://doi.org/10.1177/0146621605287691
Paolino, P. (2001). Maximum likelihood estimation of models with beta-distributed dependent variables. Political Analysis, 9(4), 325–346. https://doi.org/10.1093/oxfordjournals.pan.a004873
Philip, B. K. (1990). Parametric statistics for evaluation of the visual analog scale. Anesthesia & Analgesia, 71(6), 710. https://doi.org/10.1213/00000539-199012000-00027
Ramalho, E. A., Ramalho, J. J., & Murteira, J. M. (2011). Alternative estimating and testing empirical strategies for fractional regression models. Journal of Economic Surveys, 25(1), 19–68. https://doi.org/10.1111/j.1467-6419.2009.00602.x
Rasch, G. (1960). Probabilistic models for some intelligence and attainment tests. Danish Institute for Educational Research. https://doi.org/10.4135/9781412961288n335
Reips, U.-D., & Funke, F. (2008). Interval-level measurement with visual analogue scales in internet-based research: VAS generator. Behavior Research Methods, 40(3), 699–704. https://doi.org/10.3758/BRM.40.3.699
Revuelta, J., Hidalgo, B., & Alcazar-Córcoles, M. Á. (2022). Bayesian estimation and testing of a beta factor model for bounded continuous variables. Multivariate Behavioral Research, 57(1), 57–78. https://doi.org/10.1080/00273171.2020.1805582
Richards, F. (1959). A flexible growth function for empirical use. Journal of Experimental Botany, 10(2), 290–301. https://doi.org/10.1093/jxb/10.2.290
Rubin, D. B. (1984). Bayesianly justifiable and relevant frequency calculations for the applies statistician. The Annals of Statistics, 1151–1172. https://doi.org/10.1214/aos/1176346785
Samejima, F. (1973). Homogeneous case of the continuous response model. Psychometrika, 38(2), 203–219. https://doi.org/10.1007/BF02291114
Sinharay, S., Johnson, M. S., & Stern, H. S. (2006). Posterior predictive assessment of item response theory models. Applied Psychological Measurement, 30(4), 298–321. https://doi.org/10.1177/0146621605285517
Sklar, M. (1959). Fonctions de répartition à n dimensions et leurs marges. Publications de l’Institutde Statistique de l’Université de Paris, 8, 229–231. https://doi.org/10.4000/cpuc.315
Smithson, M., & Merkle, E. C. (2014). Generalized linear models for categorical and continuous limited dependent variables. CRC Press. https://doi.org/10.1201/b15694-12
Smithson, M., Merkle, E. C., & Verkuilen, J. (2011). Beta regression finite mixture models of polarization and priming. Journal of Educational and Behavioral Statistics, 36(6), 804–831. https://doi.org/10.3102/1076998610396893
Sung, Y.-T., & Wu, J.-S. (2018). The visual analogue scale for rating, ranking and paired-comparison (VAS-RRP): a new technique for psychological measurement. Behavior Research Methods, 50(4), 1694–1715. https://doi.org/10.3758/s13428-018-1041-8
Tadikamalla, P. R., & Johnson, N. L. (1982). Systems of frequency curves generated by transformations of logistic variables. Biometrika, 69(2), 461–465. https://doi.org/10.1093/biomet/69.2.461
Thurstone, L. (1927). A law of comparative judgment. Psychological Review, 34, 273–286. https://doi.org/10.1037/h0070288
Vehtari, A., Gelman, A., & Gabry, J. (2017). Practical bayesian model evaluation using leave-one-out cross-validation and WAIC. Statistics and Computing, 27(5), 1413–1432. https://doi.org/10.1007/s11222-016-9696-4
Verkuilen, J., & Smithson, M. (2012). Mixed and mixture regression models for continuous bounded responses using the beta distribution. Journal of Educational and Behavioral Statistics, 37(1), 82–113. https://doi.org/10.3102/1076998610396895
Wang, W.-C., & Chen, C.-T. (2005). Item parameter recovery, standard error estimates, and fit statistics of the WINSTEPS program for the family of rasch models. Educational and Psychological Measurement, 65(3), 376–404. https://doi.org/10.1177/0013164404268673
Watanabe, S. (2010). Asymptotic equivalence of bayes cross validation and widely applicable information criterion in singular learning theory. Journal of Machine Learning Research, 11(12), 3571–3594. https://doi.org/10.1007/s11222-016-9657-y
Wewers, M. E., & Lowe, N. K. (1990). A critical review of visual analogue scales in the measurement of clinical phenomena. Research in Nursing & Health, 13(4), 227–236. https://doi.org/10.1002/nur.4770130405
Wright, B. D., & Masters, G. N. (1982). Rating scale analysis. MESA press.
Author Note
The first author acknowledges the grant support from the National Science and Technology Council, Grant Number MSTC 110-2410-H-003-054-MY2.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open Practices Statements
The materials for the simulation study are available at https://github.com/cwliu007/mirt_bd.
Appendices
Appendix 1
Appendix 2
Appendix 3
Appendix 4
Appendix 5
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Liu, CW. Multidimensional item response theory models for testlet-based doubly bounded data. Behav Res (2023). https://doi.org/10.3758/s13428-023-02272-5
Accepted:
Published:
DOI: https://doi.org/10.3758/s13428-023-02272-5