Abstract
Cognitive diagnostic assessment (CDA) seeks to promote targeted instruction that addresses learners’ strengths and weaknesses within a specific domain. Most previous CDA applications, however, have implemented only a single cognitive diagnosis model (CDM), and few studies have investigated the applicability of the multi-CDM approach to educational assessment data. To this end, the present study used the multi-CDM to diagnose the performance of 740 college freshmen on a reading comprehension test designed by the PELDiaG (Personalized English Learning: Diagnosis & Guidance) research team at a key university in China. Model-data fit results showed that the multi-CDM outperformed any single CDM and also enhanced the interpretation of inter-skill relationships. Diagnostic information at both the group and individual levels was then extracted and synthesized into fine-grained diagnostic feedback. The findings provide further evidence for the practicality of the multi-CDM in reading comprehension tests. Finally, limitations and suggestions for further research are presented.
Acknowledgements
We would like to thank all members of the PELDiaG research team for their great efforts in constructing the diagnostic reading test prior to this study. We also express our sincere gratitude to the two anonymous reviewers for their constructive comments and suggestions, which helped refine this manuscript.
Funding
This work was supported by the National Social Science Fund of China under Grant No. 17BYY015, and the National Education Examinations Authority—British Council English Assessment Research under Grant No. EARG2020004.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Cite this article
Du, W., Ma, X. Probing what’s behind the test score: application of multi-CDM to diagnose EFL learners’ reading performance. Read Writ 34, 1441–1466 (2021). https://doi.org/10.1007/s11145-021-10124-x