Advertisement

A regularization approach for the detection of differential item functioning in generalized partial credit models

  • Gunther SchaubergerEmail author
  • Patrick Mair
Article

Abstract

Most common analysis tools for the detection of differential item functioning (DIF) in item response theory are restricted to the use of single covariates. If several variables have to be considered, the respective method is repeated independently for each variable. We propose a regularization approach based on the lasso principle for the detection of uniform DIF. It is applicable to a broad range of polytomous item response models with the generalized partial credit model as the most general case. A joint model is specified where the possible DIF effects for all items and all covariates are explicitly parameterized. The model is estimated using a penalized likelihood approach that automatically detects DIF effects and provides trait estimates that correct for the detected DIF effects from different covariates simultaneously. The approach is evaluated by means of several simulation studies. An application is presented using data from the children’s depression inventory.

Keywords

Differential item functioning DIF Generalized partial credit model Regularization Lasso GPCMlasso 

Notes

Acknowledgements

We thank Rachel Vaughn-Coaxum and John Weisz for providing the data of the Children’s Depression Inventory and for comments that helped to improve the manuscript.

References

  1. Akaike, H. (1974). A new look at statistical model identification. IEEE Transactions on Automatic Control, 19, 716–723.CrossRefGoogle Scholar
  2. Andrich, D. (1978). A rating formulation for ordered response categories. Psychometrika, 43(4), 561–573.CrossRefGoogle Scholar
  3. Birnbaum, A. (1968). Some latent trait models and their uses in inferring an examinee’s ability. In F. F M Lord, & M. Novick (Eds.) Statistical theories of mental rest score, reading (pp. 397–479). Boston: Addison-Wesley.Google Scholar
  4. Bollmann, S., Berger, M., & Tutz, G. (2018). Item-focused trees for the detection of differential item functioning in partial credit models. Educational and Psychological Measurement, 78(5), 781–804.  https://doi.org/10.1177/0013164417722179.
  5. Chang, H. H., Mazzeo, J., & Roussos, L. (1996). Detecting DIF for polytomously scored items an adaptation of the SIBTEST procedure. Journal of Educational Measurement, 33(3), 333–353.CrossRefGoogle Scholar
  6. Choi, S., Gibbons, L., & Crane, P. (2011). Lordif: An R package for detecting differential item functioning using iterative hybrid ordinal logistic regression/item response theory and Monte Carlo simulations. Journal of Statistical Software, 39(1), 1–30. https://www.jstatsoft.org/index.php/jss/article/view/v039i08.  https://doi.org/10.18637/jss.v039.i08 Google Scholar
  7. Choi, S. W., with contributions from Laura E Gibbons, & Crane, P. K. (2016). Lordif: Logistic ordinal regression differential item functioning using IRT. https://CRAN.R-project.org/package=lordif, r package version 0.3-3.
  8. Crane, P. K., Gibbons, L. E., Jolley, L., & van Belle, G. (2006). Differential item functioning analysis with ordinal logistic regression techniques: DIFdetect and difwithpar. Medical Care, 44(11), 115–123.CrossRefGoogle Scholar
  9. Hoerl, A. E., & Kennard, R. W. (1970). Ridge regression: Bias estimation for nonorthogonal problems. Technometrics, 12, 55–67.CrossRefGoogle Scholar
  10. Holland, P. W., & Thayer, D. T. (1988). Differential item performance and the Mantel–Haenszel procedure. Test Validity, 129–145.Google Scholar
  11. Komboz, B., Strobl, C., & Zeileis, A. (2018). Tree-based global model tests for polytomous Rasch models. Educational and Psychological Measurement, 78, 128–166.  https://doi.org/10.1177/0013164416664394 CrossRefGoogle Scholar
  12. Kovacs, M. (2004) Children’s Depression Inventory (CDI). Toronto: Multi-Health Systems Inc.Google Scholar
  13. Magis, D., Bèland, S., Tuerlinckx, F., & Boeck, P. (2010). A general framework and an R package for the detection of dichotomous differential item functioning. Behavior Research Methods, 42(3), 847–862.  https://doi.org/10.3758/BRM.42.3.847 CrossRefGoogle Scholar
  14. Mantel, N. (1963). Chi-square tests with one degree of freedom; extensions of the Mantel–Haenszel procedure. Journal of the American Statistical Association, 58(303), 690–700.Google Scholar
  15. Mantel, N., & Haenszel, W. (1959). Statistical aspects of the analysis of data from retrospective studies of disease. Journal of the National Cancer Institute, 22(4), 719–748. http://jnci.oxfordjournals.org/content/22/4/719.short, http://jnci.oxfordjournals.org/content/22/4/719.full.pdf+html.  https://doi.org/10.1093/jnci/22.4.719 Google Scholar
  16. Masters, G. (1982). A Rasch model for partial credit scoring. Psychometrika, 47(2), 149–174.CrossRefGoogle Scholar
  17. Meinshausen, N., & Bühlmann, P. (2010). Stability selection. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 72(4), 417–473.CrossRefGoogle Scholar
  18. Muraki, E. (1990). Fitting a polytomous item response model to Likert-type data. Applied Psychological Measurement, 14(1), 59–71.  https://doi.org/10.1177/014662169001400106 CrossRefGoogle Scholar
  19. Muraki, E. (1992). A generalized partial credit model: Application of an EM algorithm. Applied Psychological Measurement, 16(2), 159–176.CrossRefGoogle Scholar
  20. Penfield, R. D. (2007). Assessing differential step functioning in polytomous items using a common odds ratio estimator. Journal of Educational Measurement, 44(3), 187–210. http://www.jstor.org/stable/20461856.CrossRefGoogle Scholar
  21. Penfield, R. D., Gattamorta, K., & Childs, R. A. (2009). An NCME instructional module on using differential step functioning to refine the analysis of DIF in polytomous items. Educational Measurement: Issues and Practice, 28(1), 38–49.CrossRefGoogle Scholar
  22. R Core Team (2018). R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria, http://www.R-project.org/.
  23. Reise, S. P., Widaman, K. F., & Pugh, R. H. (1993). Confirmatory factor analysis and item response theory: two approaches for exploring measurement invariance. Psychological Bulletin, 114(3), 552–566.CrossRefGoogle Scholar
  24. Schauberger, G. (2018). GPCMLasso: Differential item functioning in generalized partial credit models. R package version 0.1-1.Google Scholar
  25. Schwarz, G. (1978). Estimating the dimension of a model. Annals of Statistics, 6, 461–464.CrossRefGoogle Scholar
  26. Shah, R. D., & Samworth, R. J. (2013). Variable selection with error control: another look at stability selection. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 75(1), 55–80.  https://doi.org/10.1111/j.1467-9868.2011.01034.x CrossRefGoogle Scholar
  27. Swaminathan, H., & Rogers, H. J. (1990). Detecting differential item functioning using logistic regression procedures. Journal of Educational Measurement, 27(4), 361–370.  https://doi.org/10.1111/j.1745-3984.1990.tb00754.x CrossRefGoogle Scholar
  28. Thissen, D., Steinberg, L., & Wainer, H. (1993). Detection of differential item functioning using the parameters of item response models. In P. W. Holland, & H. Wainer (Eds.) Differential item functioning (pp. 67–113).Google Scholar
  29. Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 58, 267–288.Google Scholar
  30. Tutz, G., & Schauberger, G. (2015). A penalty approach to differential item functioning in Rasch models. Psychometrika, 80(1), 21–43.  https://doi.org/10.1007/s11336-013-9377-6 CrossRefGoogle Scholar
  31. Vaughn-Coaxum, R. A., Mair, P., & Weisz, J. R. (2016). Racial/ethnic differences in youth depression indicators. Clinical Psychological Science, 4(2), 239–253.  https://doi.org/10.1177/2167702615591768 CrossRefGoogle Scholar
  32. Welch, C., & Hoover, H. (1993). Procedures for extending item bias detection techniques to polytomously scored items. Applied Measurement in Education, 6(1), 1–19.  https://doi.org/10.1207/s15324818ame0601_1 CrossRefGoogle Scholar
  33. Zou, H. (2006). The adaptive lasso and its oracle properties. Journal of the American Statistical Association, 101(476), 1418–1429.CrossRefGoogle Scholar
  34. Zou, H., & Hastie, T. (2005). Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 67, 301–320.CrossRefGoogle Scholar
  35. Zumbo, B. (1999) A handbook on the theory and methods of differential item functioning (DIF). Ottawa: National Defense Headquarters.Google Scholar

Copyright information

© The Psychonomic Society, Inc. 2019

Authors and Affiliations

  1. 1.Department of Sport and Health Sciences, Chair of EpidemiologyTechnical University of MunichMunichGermany
  2. 2.Department of StatisticsLMU MunichMunichGermany
  3. 3.Department of PsychologyHarvard UniversityCambridgeUSA

Personalised recommendations