A Comparison of Item Parameter and Standard Error Recovery Across Different R Packages for Popular Unidimensional IRT Models

  • Conference paper
Quantitative Psychology (IMPS 2016)

Part of the book series: Springer Proceedings in Mathematics & Statistics (PROMS, volume 196)

Abstract

With the advent of the free statistical language R, several item response theory (IRT) programs have been introduced as psychometric packages in R. These R programs have the advantage over commercial software of being free and open source. However, in research and practical settings, the quality of results produced by free programs may be called into question. The aim of this study is to provide information on how well these free R IRT programs recover item parameters and their standard errors. The study conducts a series of simulation-based comparisons for popular unidimensional IRT models: the Rasch, 2-parameter logistic, 3-parameter logistic, generalized partial credit, and graded response models. The R IRT programs included in the present study are “eRm,” “ltm,” “mirt,” “sirt,” and “TAM.” This study also reports the convergence rates given by both “eRm” and “ltm” and the elapsed times for the estimation of the models under different simulation conditions.

Notes

  1. Note that this study used the latest version of each package available at the time of the study: “eRm” (0.15-6; November 12, 2015), “ltm” (1.0-0; December 20, 2013), “TAM” (1.15-0; December 15, 2015), “sirt” (1.8-9; June 28, 2015), and “mirt” (1.15; January 21, 2016).

  2. Note that “mirt” actually uses “+ intercept”; for consistency with the “ltm” expression, “– intercept” is used in this article.
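
The sign convention in Note 2 can be sketched for the 2-parameter logistic model as follows. This is an illustration only: the symbols (a_i for the slope, d_i for the "mirt" intercept, d_i^* for the intercept under the "– intercept" convention, theta_j for the latent trait) are assumed here and are not taken from the chapter.

```latex
% "mirt" slope-intercept form ("+ intercept") for item i and person j
P(X_{ij} = 1 \mid \theta_j) = \frac{1}{1 + \exp\left[-(a_i \theta_j + d_i)\right]}

% "- intercept" form described in Note 2; d_i^{*} is a label assumed for this sketch
P(X_{ij} = 1 \mid \theta_j) = \frac{1}{1 + \exp\left[-(a_i \theta_j - d_i^{*})\right]},
\qquad d_i^{*} = -d_i
```

Under the "– intercept" form, writing the exponent as a_i(theta_j - b_i) gives d_i^* = a_i b_i, so the reported intercept has the same sign as the item difficulty b_i whenever the slope is positive.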

References

  • P. Chalmers, mirt: a multidimensional item response theory package for the R environment. J. Stat. Softw. 48(6), 1–29 (2012)

  • C. DeMars, Recovery of graded response and partial credit parameters in MULTILOG and PARSCALE. Paper presented at the annual meeting of American Educational Research Association, Chicago, IL, 2002

  • T. Kiefer, A. Robitzsch, M. Wu, TAM: Test Analysis Modules. R package version 1.15-0, 2015. http://CRAN.R-project.org/package=TAM

  • P. Mair, R. Hatzinger, M.J. Maier, eRm: Extended Rasch Modeling. R package version 0.15-6, 2015. http://CRAN.R-project.org/package=eRm

  • G.N. Masters, A Rasch model for partial credit scoring. Psychometrika 47, 149–174 (1982)

  • E. Muraki, A generalized partial credit model: application of an EM algorithm. Appl. Psychol. Meas. 16, 159–176 (1992)

  • E. Muraki, R.D. Bock, PARSCALE 3: IRT Based Test Scoring and Item Analysis for Graded Items and Rating Scales [Computer Software] (Scientific Software, Chicago, IL, 1997)

  • T. Pan, O. Zhang, A comparison of parameter recovery using different computer programs and the latent trait models R-Packages in estimating the graded response model. Paper presented at the annual meeting of American Educational Research Association, Philadelphia, PA, 2014

  • R Core Team, R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, 2015. ISBN: 3-900051-07-0. http://www.R-project.org/

  • M.J. Ree, Estimating item characteristic curves. Appl. Psychol. Meas. 3, 371–385 (1979)

  • D. Rizopoulos, ltm: an R package for latent variable modeling and item response theory analyses. J. Stat. Softw. 17(5), 1–25 (2006)

  • A. Robitzsch, sirt: supplementary item response theory models. R package version 1.8-9, 2015. http://CRAN.R-project.org/package=sirt

  • T. Rusch, P. Mair, R. Hatzinger, Psychometrics with R: a review of CRAN packages for item response theory. Discussion Paper Series/Center for Empirical Research Methods, 2013/2. WU Vienna University of Economics and Business, Vienna, 2013

  • F. Samejima, Estimation of ability using a response pattern of graded scores. Psychometrika Monograph, No. 17, 1969

  • S. Tao, B. Sorenson, M. Simons, Y. Du, Item parameter recovery accuracy: Comparing PARSCALE, MULTILOG and flexMIRT. Paper presented at the 2014 National Council on Measurement in Education Annual Meeting, Philadelphia, PA, 2014

  • D. Thissen, MULTILOG: Multiple Category Item Analysis and Test Scoring Using Item Response Theory [Computer Software] (Scientific Software, Chicago, IL, 1991)

Corresponding author

Correspondence to Taeyoung Kim.

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Kim, T., Paek, I. (2017). A Comparison of Item Parameter and Standard Error Recovery Across Different R Packages for Popular Unidimensional IRT Models. In: van der Ark, L.A., Wiberg, M., Culpepper, S.A., Douglas, J.A., Wang, WC. (eds) Quantitative Psychology. IMPS 2016. Springer Proceedings in Mathematics & Statistics, vol 196. Springer, Cham. https://doi.org/10.1007/978-3-319-56294-0_36
