Migrating from a legacy fixed-format measure to CAT administration: calibrating the PHQ-9 to the PROMIS depression measures

Gibbons, Laura E.; Feldman, Betsy J.; Crane, Heidi M.; Mugavero, Michael; Willig, James H.; Patrick, Donald; Schumacher, Joseph; Saag, Michael; Kitahata, Mari M.; Crane, Paul K.

doi:10.1007/s11136-011-9882-y

Migrating from a legacy fixed-format measure to CAT administration: calibrating the PHQ-9 to the PROMIS depression measures

Published: 16 March 2011

Volume 20, pages 1349–1357, (2011)
Cite this article

Quality of Life Research Aims and scope Submit manuscript

Laura E. Gibbons¹,
Betsy J. Feldman²,
Heidi M. Crane²,
Michael Mugavero³,
James H. Willig⁴,
Donald Patrick⁵,
Joseph Schumacher⁶,
Michael Saag⁷,
Mari M. Kitahata² &
…
Paul K. Crane¹

927 Accesses
52 Citations
Explore all metrics

An Erratum to this article was published on 21 November 2012

Abstract

Purpose

We provide detailed instructions for analyzing patient-reported outcome (PRO) data collected with an existing (legacy) instrument so that scores can be calibrated to the PRO Measurement Information System (PROMIS) metric. This calibration facilitates migration to computerized adaptive test (CAT) PROMIS data collection, while facilitating research using historical legacy data alongside new PROMIS data.

Methods

A cross-sectional convenience sample (n = 2,178) from the Universities of Washington and Alabama at Birmingham HIV clinics completed the PROMIS short form and Patient Health Questionnaire (PHQ-9) depression symptom measures between August 2008 and December 2009. We calibrated the tests using item response theory. We compared measurement precision of the PHQ-9, the PROMIS short form, and simulated PROMIS CAT.

Results

Dimensionality analyses confirmed the PHQ-9 could be calibrated to the PROMIS metric. We provide code used to score the PHQ-9 on the PROMIS metric. The mean standard errors of measurement were 0.49 for the PHQ-9, 0.35 for the PROMIS short form, and 0.37, 0.28, and 0.27 for 3-, 8-, and 9-item-simulated CATs.

Conclusions

The strategy described here facilitated migration from a fixed-format legacy scale to PROMIS CAT administration and may be useful in other settings.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Confirmatory Factor Analysis of the Kessler-6 Psychological Distress (K6) Scale in a Community Sample of People Living with Severe and Persistent Mental Illness: a Bifactor Model

Article 12 December 2022

Health, Health-Related Quality of Life, and Quality of Life: What is the Difference?

Article 18 February 2016

Differential item functioning of the PROMIS physical function, pain interference, and pain behavior item banks across patients with different musculoskeletal disorders and persons from the general population

Article 02 January 2019

Abbreviations

CAT:: Computerized adaptive testing
CFA:: Confirmatory factor analysis
CFI:: Comparative Fit Index
DIF:: Differential item functioning
PHQ-9:: Patient Health Questionnaire from the PRIME-MD depression measure
PRO:: Patient-reported outcome
PROMIS:: Patient-Reported Outcome Measurement Information System
RMSEA:: Root mean square error of approximation
SD:: Standard deviation
SEM:: Standard error of measurement
TLI:: Tucker–Lewis Index
UW:: University of Washington
UAB:: University of Alabama at Birmingham

References

Cella, D., Yount, S., Rothrock, N., Gershon, R., Cook, K., Reeve, B., et al. (2007). The Patient-Reported Outcomes Measurement Information System (PROMIS): Progress of an NIH Roadmap cooperative group during its first two years. Medical Care, 45(5 Suppl 1), S3–S11.
Article PubMed Google Scholar
Reeve, B. B., Hays, R. D., Bjorner, J. B., Cook, K. F., Crane, P. K., Teresi, J. A., et al. (2007). Psychometric evaluation and calibration of health-related quality of life item banks: Plans for the Patient-Reported Outcomes Measurement Information System (PROMIS). Medical Care, 45(5 Suppl 1), S22–S31.
Article PubMed Google Scholar
Cella, D., Riley, W. T., Stone, A., Rothrock, N., Reeve, B., Yount, S. E., et al. (in press). Initial item banks and first wave testing of the Patient-Reported Outcomes Measurement Information System (PROMIS) network: 2005–2008. Journal of Clinical Epidemiology.
Bjorner, J. B., Kosinski, M., & Ware, J. E., Jr. (2003). Calibration of an item pool for assessing the burden of headaches: An application of item response theory to the headache impact test (HIT). Quality of Life Research, 12(8), 913–933.
Article PubMed Google Scholar
Bjorner, J. B., Chang, C. H., Thissen, D., & Reeve, B. B. (2007). Developing tailored instruments: Item banking and computerized adaptive assessment. Quality of Life Research, 16(Suppl 1), 95–108.
Article PubMed Google Scholar
Cella, D., Gershon, R., Lai, J. S., & Choi, S. (2007). The future of outcomes measurement: Item banking, tailored short-forms, and computerized adaptive assessment. Quality of Life Research, 16(Suppl 1), 133–141.
Article PubMed Google Scholar
Fayers, P. M. (2007). Applying item response theory and computer adaptive testing: The challenges for health outcomes assessment. Quality of Life Research, 16(Suppl 1), 187–194.
Article PubMed Google Scholar
Choi, S. W., Reise, S. P., Pilkonis, P. A., Hays, R. D., & Cella, D. (2010). Efficiency of static and computer adaptive short forms compared to full-length measures of depressive symptoms. Quality of Life Research, 19(1), 125–136.
Article PubMed Google Scholar
Dorans, N. J. (2007). Linking scores from multiple health outcome instruments. Quality of Life Research, 16(Suppl 1), 85–94.
Article PubMed Google Scholar
Crane, P. K., Narasimhalu, K., Gibbons, L. E., Mungas, D. M., Haneuse, S., Larson, E. B., et al. (2008). Item response theory facilitated cocalibrating cognitive tests and reduced bias in estimated rates of decline. Journal of Clinical Epidemiology, 61(10), 1018–1027.
Article PubMed Google Scholar
Bjorner, J. B., Kosinski, M., & Ware, J. E., Jr. (2003). Using item response theory to calibrate the Headache Impact Test (HIT) to the metric of traditional headache scales. Quality of Life Research, 12(8), 981–1002.
Article PubMed Google Scholar
Ware, J. E., Jr., Kosinski, M., Bjorner, J. B., Bayliss, M. S., Batenhorst, A., Dahlof, C. G., et al. (2003). Applications of computerized adaptive testing (CAT) to the assessment of headache impact. Quality of Life Research, 12(8), 935–952.
Article PubMed Google Scholar
Kitahata, M. M., Rodriguez, B., Haubrich, R., Boswell, S., Mathews, W. C., Lederman, M. M., et al. (2008). Cohort profile: The Centers for AIDS Research Network of Integrated Clinical Systems. International Journal of Epidemiology, 37(5), 948–955.
Article PubMed Google Scholar
Lawrence, S. T., Willig, J. H., Crane, H. M., Ye, J., Aban, I., Lober, W., et al. (2010). Routine, self-administered, touch-screen, computer-based suicidal ideation assessment linked to automated response team notification in an HIV primary care setting. Clinical Infectious Diseases, 50(8), 1165–1173.
Article PubMed Google Scholar
Crane, H. M., Lober, W., Webster, E., Harrington, R. D., Crane, P. K., Davis, T. E., et al. (2007). Routine collection of patient-reported outcomes in an HIV clinic setting: The first 100 patients. Current HIV Research, 5(1), 109–118.
Article PubMed CAS Google Scholar
Pilkonis, P. A., Choi, S. W., Reise, S. P., Stover, A. M., Riley, W. T., & Cella, D. (under review). Item banks for measuring emotional distress from the Patient-Reported Outcomes Measurement Information System (PROMIS). Depression, Anxiety, and Anger.
Kroenke, K., Spitzer, R. L., Williams, J. B., & Lowe, B. (2010). The Patient Health Questionnaire Somatic, Anxiety, and Depressive Symptom Scales: A systematic review. General Hospital Psychiatry, 32(4), 345–359.
Article PubMed Google Scholar
Spitzer, R. L., Kroenke, K., & Williams, J. B. (1999). Validation and utility of a self-report version of PRIME-MD: the PHQ primary care study. Primary Care Evaluation of Mental Disorders. Patient Health Questionnaire. JAMA, 282(18), 1737–1744.
Article PubMed CAS Google Scholar
Crane, P. K., Gibbons, L. E., Willig, J. H., Mugavero, M. J., Lawrence, S. T., Schumacher, J. E., et al. (2010). Measuring depression and depressive symptoms in HIV-infected patients as part of routine clinical care using the 9-item Patient Health Questionnaire (PHQ-9). AIDS Care, 22(7), 874–885.
Article PubMed CAS Google Scholar
Teresi, J. A., Ocepek-Welikson, K., Kleinman, M., Eimicke, J. P., Crane, P. K., Jones, R. N., et al. (2009). Analysis of differential item functioning in the depression item bank from the Patient Reported Outcome Measurement Information System (PROMIS): An item response theory approach. Psychology Science Quarterly, 51(2), 148–180.
PubMed Google Scholar
Muthén, L. K., & Muthén, B. O. (1998–2007). Mplus: Statistical analysis with latent variables. Los Angeles, CA: Muthén & Muthén.
Wirth, R. J., & Edwards, M. C. (2007). Item factor analysis: Current approaches and future directions. Psychol Methods, 12(1), 58–79.
Article PubMed CAS Google Scholar
Forero, C. G., & Maydeu-Olivares, A. (2009). Estimation of IRT graded response models: Limited versus full information methods. Psychological Methods, 14(3), 275–299.
Article PubMed Google Scholar
Samejima, F. (1969). Estimation of latent ability using a response pattern of graded scores. Psychometrika Monograph, No. 17.
StataCorp. (2009). Stata statistical software: Release 11. College Station, TX: StataCorp LP.
Google Scholar
Muraki, E., & Bock, D. (2003). PARSCALE for Windows. Chicago: Scientific Software International.
Google Scholar
Kroenke, K., Spitzer, R. L., & Williams, J. B. (2001). The PHQ-9: Validity of a brief depression severity measure. Journal of General Internal Medicine, 16(9), 606–613.
Article PubMed CAS Google Scholar
Choi, S. W. (2009). Firestar: Computerized adaptive testing (CAT) simulation program for polytomous IRT models. Applied Psychological Measurement, 33(8), 644–645.
Article PubMed Google Scholar
Fries, J. F., Cella, D., Rose, M., Krishnan, E., & Bruce, B. (2009). Progress in assessing physical function in arthritis: PROMIS short forms and computerized adaptive testing. Journal of Rheumatology, 36(9), 2061–2066.
Article PubMed Google Scholar
Rose, M., Bjorner, J. B., Becker, J., Fries, J. F., & Ware, J. E. (2008). Evaluation of a preliminary physical function item bank supported the expected advantages of the Patient-Reported Outcomes Measurement Information System (PROMIS). Journal of Clinical Epidemiology, 61(1), 17–33.
Article PubMed CAS Google Scholar
Crane, H. M., Grunfeld, C., Harrington, R. D., Uldall, K. K., Ciechanowski, P. S., & Kitahata, M. M. (2008). Lipoatrophy among HIV-infected patients is associated with higher levels of depression than lipohypertrophy. HIV Medicine, 9(9), 780–786.
Article PubMed CAS Google Scholar
Hansson, M., Chotai, J., Nordstom, A., & Bodlund, O. (2009). Comparison of two self-rating scales to detect depression: HADS and PHQ-9. British Journal of General Practice, 59(566), e283–e288.
Article PubMed Google Scholar
Wittkampf, K. A., Naeije, L., Schene, A. H., Huyser, J., & van Weert, H. C. (2007). Diagnostic accuracy of the mood module of the Patient Health Questionnaire: A systematic review. General Hospital Psychiatry, 29(5), 388–395.
Article PubMed Google Scholar
Coons, S. J., Gwaltney, C. J., Hays, R. D., Lundy, J. J., Sloan, J. A., Revicki, D. A., et al. (2009). Recommendations on evidence needed to support measurement equivalence between electronic and paper-based patient-reported outcome (PRO) measures: ISPOR ePRO Good Research Practices Task Force report. Value Health, 12(4), 419–429.
Article PubMed Google Scholar

Download references

Acknowledgments

This work was supported by National Institutes of Health grants U01 AR 057954, R01 MH 084759, P30 AI 27757, P30 AI 27767, R24 AI 067039, K23 MH 082641, and the Mary Fisher CARE Fund. The Patient-Reported Outcomes Measurement Information System (PROMIS) is an NIH Roadmap initiative to develop a computerized system measuring PROs in respondents with a wide range of chronic diseases and demographic characteristics. PROMIS II was funded by cooperative agreements with a Statistical Center (Northwestern University, PI: David F. Cella, PhD, 1U54AR057951), a Technology Center (Northwestern University, PI: Richard C. Gershon, PhD, 1U54AR057943), a Network Center (American Institutes for Research, PI: Susan (San) D. Keller, PhD, 1U54AR057926) and thirteen Primary Research Sites (State University of New York, Stony Brook, PIs: Joan E. Broderick, PhD and Arthur A. Stone, PhD, 1U01AR057948; University of Washington, Seattle, PIs: Heidi M. Crane, MD, MPH, Paul K. Crane, MD, MPH, and Donald L. Patrick, PhD, 1U01AR057954; University of Washington, Seattle, PIs: Dagmar Amtmann, PhD, and Karon Cook, PhD1U01AR052171; University of North Carolina, Chapel Hill, PI: Darren A. DeWalt, MD, MPH, 2U01AR052181; Children’s Hospital of Philadelphia, PI: Christopher B. Forrest, MD, PhD, 1U01AR057956; Stanford University, PI: James F. Fries, MD, 2U01AR052158; Boston University, PIs: Stephen M. Haley, PhD, and David Scott Tulsky, PhD, 1U01AR057929; University of California, Los Angeles, PIs: Dinesh Khanna, MD, and Brennan Spiegel, MD, MSHS, 1U01AR057936; University of Pittsburgh, PI: Paul A. Pilkonis, PhD, 2U01AR052155; Georgetown University, Washington DC, PIs: Carol. M. Moinpour, PhD, and Arnold L. Potosky, PhD, U01AR057971; Children’s Hospital Medical Center, Cincinnati, PI: Esi M. Morgan Dewitt, MD, 1U01AR057940; University of Maryland, Baltimore, PI: Lisa M. Shulman, MD, 1U01AR057967; and Duke University, PI: Kevin P. Weinfurt, PhD, 2U01AR052186). NIH Science Officers on this project have included Deborah Ader, PhD, Vanessa Ameen, MD, Susan Czajkowski, PhD, Basil Eldadah, MD, PhD, Lawrence Fine, MD, DrPH, Lawrence Fox, MD, PhD, Lynne Haverkos, MD, MPH, Thomas Hilton, PhD, Laura Lee Johnson, PhD, Michael Kozak, PhD, Peter Lyster, PhD, Donald Mattison, MD, Claudia Moy, PhD, Louis Quatrano, PhD, Bryce Reeve, PhD, William Riley, PhD, Ashley Wilder Smith, PhD, MPH, Susana Serrate-Sztein,MD, Ellen Werner, PhD, and James Witter, MD, PhD. This manuscript was reviewed by PROMIS reviewers before submission for external peer review. See the Web site at http://www.nihpromis.org for additional information on the PROMIS initiative.

Author information

Authors and Affiliations

General Internal Medicine, University of Washington, Box 359780, Harborview Medical Center, 325 Ninth Ave, Seattle, WA, 98104, USA
Laura E. Gibbons & Paul K. Crane
Allergy and Infectious Diseases, University of Washington, Box 359931, Harborview Medical Center, 325 Ninth Ave, Seattle, WA, 98104, USA
Betsy J. Feldman, Heidi M. Crane & Mari M. Kitahata
Department of Medicine, Division of Infectious Disease, University of Alabama at Birmingham, 1530 Third Ave S, CCB 142, Birmingham, AL, 35294-2050, USA
Michael Mugavero
Department of Medicine, Division of Infectious Disease, University of Alabama at Birmingham, 1530 Third Ave S, CCB 178, Birmingham, AL, 35294-2050, USA
James H. Willig
Department of Health Services, University of Washington, Box 359455, 4333 Brooklyn Ave NE, Seattle, WA, 98195-9455, USA
Donald Patrick
Division of Preventive Medicine, School of Medicine, University of Alabama at Birmingham, 1717 11th Avenue South, 616 Medical Towers Building, Birmingham, AL, 35209, USA
Joseph Schumacher
Center for AIDS Research, University of Alabama at Birmingham, 845 19th Street South, BBRB 256, Birmingham, AL, 35294-2050, USA
Michael Saag

Authors

Laura E. Gibbons
View author publications
You can also search for this author in PubMed Google Scholar
Betsy J. Feldman
View author publications
You can also search for this author in PubMed Google Scholar
Heidi M. Crane
View author publications
You can also search for this author in PubMed Google Scholar
Michael Mugavero
View author publications
You can also search for this author in PubMed Google Scholar
James H. Willig
View author publications
You can also search for this author in PubMed Google Scholar
Donald Patrick
View author publications
You can also search for this author in PubMed Google Scholar
Joseph Schumacher
View author publications
You can also search for this author in PubMed Google Scholar
Michael Saag
View author publications
You can also search for this author in PubMed Google Scholar
Mari M. Kitahata
View author publications
You can also search for this author in PubMed Google Scholar
Paul K. Crane
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Laura E. Gibbons.

Additional information

An erratum to this article can be found online at http://10.1007/s11136-012-0313-5.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (DOC 272 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gibbons, L.E., Feldman, B.J., Crane, H.M. et al. Migrating from a legacy fixed-format measure to CAT administration: calibrating the PHQ-9 to the PROMIS depression measures. Qual Life Res 20, 1349–1357 (2011). https://doi.org/10.1007/s11136-011-9882-y

Download citation

Accepted: 01 March 2011
Published: 16 March 2011
Issue Date: November 2011
DOI: https://doi.org/10.1007/s11136-011-9882-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Migrating from a legacy fixed-format measure to CAT administration: calibrating the PHQ-9 to the PROMIS depression measures