Abstract
Test equating methods are widely used in order to make comparable different test forms administered at different occasions to different test takers. Although software for test equating is currently available, in this paper we focus the attention on four different R packages which can facilitate test equating for researchers and test developers. This paper list the different R packages which are available at the moment. Examples are provided for the equate, equateIRT, kequate, and the SNSequate packages. Additional features of these packages are discussed as well.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Albano, A. D. (2016). Equate: An R package for observed-score linking and equating. Journal of Statistical Software, 74(8), 1–36.
Andersson, B., & Wiberg, M. (2017). Item response theory observed-score kernel equating. Psychometrika, 82(1), 48–66.
Andersson, B., Bränberg, K., & Wiberg, M. (2013). Performing the kernel method of test equating with the package kequate. Journal of Statistical Software, 55(6), 1–25.
Battauz, M. (2015). equateIRT: An R package for IRT test equating. Journal of Statistical Software, 68(7), 1–22.
Battauz, M. (2017). equateMultiple: Equating of multiple forms using item response theory methods. https://cran.r-project.org/web/packages/equateMultiple/index.html, R Package Version 0.0.0.
Braun, H., & Holland, P. (1982). Observed-score test equating: A mathematical analysis of some ets equating procedures. In: P. Holland & D. Rubin (Eds.), Test equating (Vol. 1, pp. 9–49). New York: Academic Press.
Choi, S., Gibbons, L., & Crane, P. (2011). lordif: An R package for detecting differential item functioning using iterative hybrid ordinal logistic regression/item response theory and monte carlo simulations. Journal of Statistical Software, 39(8), 1–30.
von Davier. A. A., Holland, P., & Thayer, D. (2004). The kernel method of test equating. New York: Springer.
von Davier, M., & von Davier, A. (2011). A general model for IRT scale linking and scale transformations. In A. von Davier (Ed.), Statistical models for test equating, scaling, and linking (Vol. 1, pp. 225–242). New York: Springer.
González, J. (2014). SNSequate: Standard and nonstandard statistical models and methods for test equating. Journal of Statistical Software, 59(7), 1–30.
González, J., & Wiberg, M. (2017). Applying test equating methods using R. New York: Springer.
González, J., Wiberg, M., & von Davier. A. A. (2016). A note on the Poisson’s binomial distribution in item response theory. Applied Psychological Measurement, 40(4), 302–310.
Häggström, J., & Wiberg, M. (2014). Optimal bandwidth selection in observed-score kernel equating. Journal of Educational Measurement, 51(2), 201–211.
Kolen, M., & Brennan, R. (2014). Test equating, scaling, and linking: Methods and practices (3rd ed.). New York: Springer.
Lord, F. (1980). Applications of item response theory to practical testing problems. Hillsdale: Lawrence Erlbaum Associates.
Lord, F., & Wingersky, M. (1984). Comparison of IRT true-score and equipercentile observed-score “equatings”. Applied Psychological Measurement, 8(4), 453–461.
Partchev, I. (2014). Irtoys: Simple interface to the estimation and plotting of IRT models. http://CRAN.R-project.org/package=irtoys, R Package Version 0.1.7.
Rizopoulos, D. (2006). ltm: An R package for latent variable modeling and item response theory analyses. Journal of Statistical Software, 17(5), 1–25.
Robitzsch, A. (2016). sirt: Supplementary item response theory models. https://cran.r-project.org/web/packages/sirt/index.html, R Package Version 1.12.2.
Wallin, G., & Wiberg, M. (2019). Propensity scores in kernel equating for non-equivalent groups. Journal of Edcucational and Behavioral Statistics, 44(4), 390–414.
Wallin, G., Häggström, J., & Wiberg, M. (2018). How to select the bandwidth in kernel equating? – An evaluation of five different methods. In M. Wiberg, S. Culpeppar, R. Janssen, J. Gonzaĺez, & D. Molenaar (Eds.), Quantitative Psychology. The 82nd Annual Meeting of the Psychometric Society, Zurich, 2017 (pp 91–100). Springer.
Weeks, J. P. (2010). plink: An R package for linking mixed-format tests using IRT-based methods. Journal of Statistical Software, 35(12), 1–33. http://www.jstatsoft.org/v35/i12/.
Wiberg, M., & Bränberg, K. (2015). Kernel equating under the non-equivalent groups with covariates design. Applied Psychological Measurement, 39(5), 349–361.
Acknowledgements
This research was partially funded by the Swedish Research Council grant number 2014-578.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Wiberg, M., González, J. (2020). Practical Implementation of Test Equating Using R. In: Wiberg, M., Molenaar, D., González, J., Böckenholt, U., Kim, JS. (eds) Quantitative Psychology. IMPS 2019. Springer Proceedings in Mathematics & Statistics, vol 322. Springer, Cham. https://doi.org/10.1007/978-3-030-43469-4_10
Download citation
DOI: https://doi.org/10.1007/978-3-030-43469-4_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-43468-7
Online ISBN: 978-3-030-43469-4
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)