Applying item response theory and computer adaptive testing: the challenges for health outcomes assessment

Fayers, Peter M.

doi:10.1007/s11136-007-9197-1

Applying item response theory and computer adaptive testing: the challenges for health outcomes assessment

Published: 07 April 2007

Volume 16, pages 187–194, (2007)
Cite this article

Quality of Life Research Aims and scope Submit manuscript

Peter M. Fayers^1,2

866 Accesses
57 Citations
Explore all metrics

Abstract

Objectives

We review the papers presented at the NCI/DIA conference, to identify areas of controversy and uncertainty, and to highlight those aspects of item response theory (IRT) and computer adaptive testing (CAT) that require theoretical or empirical research in order to justify their application to patient reported outcomes (PROs).

Background

IRT and CAT offer exciting potential for the development of a new generation of PRO instruments. However, most of the research into these techniques has been in non-healthcare settings, notably in education. Educational tests are very different from PRO instruments, and consequently problematic issues arise when adapting IRT and CAT to healthcare research.

Results

Clinical scales differ appreciably from educational tests, and symptoms have characteristics distinctly different from examination questions. This affects the transferring of IRT technology. Particular areas of concern when applying IRT to PROs include inadequate software, difficulties in selecting models and communicating results, insufficient testing of local independence and other assumptions, and a need of guidelines for estimating sample size requirements. Similar concerns apply to differential item functioning (DIF), which is an important application of IRT. Multidimensional IRT is likely to be advantageous only for closely related PRO dimensions.

Conclusions

Although IRT and CAT provide appreciable potential benefits, there is a need for circumspection. Not all PRO scales are necessarily appropriate targets for this methodology. Traditional psychometric methods, and especially qualitative methods, continue to have an important role alongside IRT. Research should be funded to address the specific concerns that have been identified.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

The Use of Cronbach’s Alpha When Developing and Reporting Research Instruments in Science Education

Article Open access 07 June 2017

Reporting reliability, convergent and discriminant validity with structural equation modeling: A review and best-practice recommendations

Article Open access 30 January 2023

How to use and assess qualitative research methods

Article Open access 27 May 2020

References

Berkson, J. (1944). Application of the logistic function to bio-assay. Journal of the American Statistical Society, 39, 357–365.
CAS Google Scholar
Rasch, G. (1960). Probabilistic models for some intelligence attainment tests. Copenhagen: Danish Institute for Educational Research.
Google Scholar
Cox, D. R. (1972). Regression models and life tables (with discussion). Journal of the Royal Statistical Society series B, 34, 187–220.
Google Scholar
McCullagh, P. (1980). Regression models for ordinal data (with discussion). Journal of the Royal Statistical Society series B, 42, 109–142.
Google Scholar
Van der Linden, W. J., & Hambleton, R. K. (1996). Item response theory: brief history, common models, and extensions. In W. J. Van der Linden & R. K. Hambleton (Eds.), Handbook of modern item response theory (p. 23). New York: Springer.
Samejima, F. (1969). Estimation of latent ability using a response pattern of graded scores. (Psychometric Monograph No. 17). Iowa City: Psychometric Society.
Google Scholar
Muraki, E. (1992). A generalized partial credit model: Application of an EM algorithm. Applied Psychological Measurement, 16, 159–176.
Article Google Scholar
Fayers, P. M., & Hand, D. J. (1997). Factor analysis, causal indicators and quality of life. Quality of Life Research, 6, 139–150.
PubMed CAS Google Scholar
Fayers, P. M, & Hand, D. J. (2002). Causal variables, indicator variables and measurement scales: An example from quality of life. Journal of the Royal Statistical Society series A, 165, 233–261.
Google Scholar
Embretson, S. E., & Reise, S. P. (2000). Item response theory for psychologists (p. 123). Mahwah: Lawrence Erlbaum Associates.
Wright, B. D. (1992). IRT in the 1990s: Which models work best? Rasch Measurement Transactions, 6, 196–200.
Google Scholar
Donaldson, M. (2006). Using patient-reported outcomes in clinical oncology practice: benefits, challenges, and next steps. Expert Review Pharmacoeconomics Outcomes Research, 6, 87–95.
Article Google Scholar
De Boer, A. G. E. M., Van Lanschot, J. J. B., Stalmeier, P. F. M. et al. (2004). Is a single-item visual analogue scale as valid, reliable and responsive as multi-item scales in measuring quality of life? Quality of Life Research, 13, 311–320.
Article PubMed Google Scholar
Collins, S. L., Edwards, J., Moore, R. A et al. (2001). Seeking a simple measure of analgesia for mega-trials: Is a single global assessment good enough? Pain, 91, 189–194.
Article PubMed CAS Google Scholar
Bernhard, J., Sullivan M., Hurny C. et al. (2001). Clinical relevance of single item quality of life indicators in cancer clinical trials. British Journal of Cancer, 84, 1156–1165.
Article PubMed CAS Google Scholar
Petersen, M. Aa., Groenvold, M., Aaronson, N., Fayers, P. M., Sprangers, M. A., & Bjorner, J. B. (2006). Multidimensional computerized adaptive testing of the EORTC QLQ-C30: Basic developments and evaluation. Quality of Life Research, 15, 315–329.
Article PubMed Google Scholar
Haley, S. M., Ni, P. S., Ludlow, L. H., & Fragala-Pinkham, M. A. (2006). Measurement precision and efficiency of multidimensional computer adaptive testing of physical functioning using the pediatric evaluation of disability inventory. Archives of Physical Medicine and Rehabilitation, 87, 1223–1229.
Article PubMed Google Scholar
Wang, W.-C., Chen, P.-H., & Cheng, T.-Y. (2004). Improving measurement precision of test batteries using multidimensional item response models. Psychological Methods, 9, 116–136.
Article PubMed Google Scholar
Embretson, S. E., & Reise, S. P. (2000). Item response theory for psychologists (p. 278). Mahwah: Lawrence Erlbaum Associates.
Bair, M. J., Robinson, R. L., Katon, W., & Kroenke, K. (2003). Depression and pain co-morbidity: a literature review. Archives of Internal Medicine, 163, 2433–2445.
Article PubMed Google Scholar
Ruoff, G. E. (1996). Depression in the patient with chronic pain. Journal of Family Practice, 43, S25–S33.
PubMed CAS Google Scholar
Alonso, J., Angermeyer, M. C., Bernert, S. et al. (2004). Prevalence of mental disorders in Europe: Results from the European Study of the Epidemiology of Mental Disorders (ESEMeD) project. Acta Psychiatrica Scandinavica Suppl 420, 21–27.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Public Health, Institute of Applied Health Sciences, University of Aberdeen Medical School, Polwarth Building, Foresterhill, Aberdeen, AB25 2ZD, UK
Peter M. Fayers
Faculty of Medicine, Pain and Palliation Research Group, Norwegian University of Science and Technology, Trondheim, Norway
Peter M. Fayers

Authors

Peter M. Fayers
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Peter M. Fayers.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Fayers, P.M. Applying item response theory and computer adaptive testing: the challenges for health outcomes assessment. Qual Life Res 16 (Suppl 1), 187–194 (2007). https://doi.org/10.1007/s11136-007-9197-1

Download citation

Received: 25 August 2006
Accepted: 13 February 2007
Published: 07 April 2007
Issue Date: August 2007
DOI: https://doi.org/10.1007/s11136-007-9197-1

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Applying item response theory and computer adaptive testing: the challenges for health outcomes assessment