Mapping the Patient-Reported Outcomes Measurement Information System (PROMIS-29) to EQ-5D-5L

Aghdaee, Mona; Gu, Yuanyuan; Sinha, Kompal; Parkinson, Bonny; Sharma, Rajan; Cutler, Henry

doi:10.1007/s40273-022-01157-3

Mapping the Patient-Reported Outcomes Measurement Information System (PROMIS-29) to EQ-5D-5L

Original Research Article
Open access
Published: 07 November 2022

Volume 41, pages 187–198, (2023)
Cite this article

Download PDF

You have full access to this open access article

PharmacoEconomics Aims and scope Submit manuscript

Mapping the Patient-Reported Outcomes Measurement Information System (PROMIS-29) to EQ-5D-5L

Download PDF

3244 Accesses
2 Citations
14 Altmetric
1 Mention
Explore all metrics

Abstract

Background and Objective

The Patient-Reported Outcomes Measurement Information System (PROMIS-29) is gaining popularity as healthcare system funders increasingly seek value-based care. However, it is limited in its ability to estimate utilities and thus inform economic evaluations. This study develops the first mapping algorithm for estimating EuroQol 5-Dimension 5-Level (EQ-5D-5L) utilities from PROMIS-29 responses using a large dataset and through extensive comparisons between econometric models.

Methods

An online survey was conducted to collect responses to PROMIS-29 and EQ-5D-5L from the general Australian population (N = 3013). Direct and indirect mapping methods were explored, including linear regression, Tobit, generalised linear model, censored regression model, beta regression (Betamix), the adjusted limited dependent variable mixture model (ALDVMM) and generalised ordered logit. The most robust model was selected by assessing the performance based on average ten-fold cross-validation geometric mean absolute error and geometric mean squared error, the predicted mean, maximum and minimum utilities, as well as the fitting across the entire distribution.

Results

The direct approach using ALDVMM was considered the preferred model based on lowest geometric mean absolute error and geometric mean squared error in cross-validation (0.0882, 0.0299) and its superiority in predicting the actual observed mean, full health states and lower utility extremes. The robustness and precision in prediction across the entire distribution of utilities with ALDVMM suggest it is an accurate and valid mapping algorithm. Moreover, the suggested mapping algorithm outperformed previously published algorithms using Australian data, indicating the validity of this model for economic evaluations.

Conclusions

This study developed a robust algorithm to estimate EQ-5D-5L utilities from PROMIS-29. Consistent with the recent literature, the ALDVMM outperformed all other econometric models considered in this study, suggesting that the mixture models have relatively better performance and are an ideal candidate model for mapping.

Mapping PROMIS Global Health Items to EuroQol (EQ-5D) Utility Scores Using Linear and Equipercentile Equating

Article 14 July 2017

Estimating EQ-5D utilities based on the Short-Form Long Term Conditions Questionnaire (LTCQ-8)

Article Open access 14 August 2020

Mapping of Family Reported Outcome Measure (FROM-16) scores to EQ-5D: algorithm to calculate utility values

Article Open access 25 February 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

FormalPara Key Points for Decision Makers

The Patient-Reported Outcomes Measurement Information System (PROMIS-29) is a validated and widely used measure of health-related quality of life in clinical settings, and it has an accompanying value set to calculate utilities. However, it is of limited use in economic evaluations as EuroQol 5-Dimension 5-Level (EQ-5D-5L) is the common measure of interest in economic evaluations.
This study compares a wide range of direct and indirect mapping approaches, including finite mixtures of beta regression (Betamix) and an adjusted limited dependent variable mixture model (ALDVMM) to map PROMIS-29 to EQ-5D-5L.
This study finds mixture models to have relatively better performance and ideal candidate models for mapping.
This study generates a mapping algorithm for transforming PROMIS-29 scores to EQ-5D-5L utilities.

1 Introduction

Healthcare system funders are increasingly seeking value-based care making patient-reported outcome measures suitable to estimate the impact of health interventions on health status, and track patients’ health changes over time. One of the popular patient-reported outcome measures is the Patient-Reported Outcomes Measurement Information System (PROMIS^®), developed by the National Institutes of Health in the USA in 2004. The PROMIS^® consists of an item bank of questions that can be administered using standardised or custom short forms, profiles and computer adaptive tests [1]. The item bank questions are calibrated using psychometric techniques such as item response theory [2], which ensures they can be administered flexibly while remaining comparable.

One of the standardised instruments under the PROMIS^® is the PROMIS-29 (Profile V2.1) which measures seven domains, i.e. depression, anxiety, physical function, pain interference, fatigue, sleep disturbance, and the ability to participate in social roles and activities. Each domain includes an item bank, assessed with four questions ranked by patients on a five-point numeric scale. Higher scores indicate worse outcomes for anxiety, depression, fatigue, ability to participate in social roles and activities, and pain interference domains, and lower scores reflect worse outcomes for physical function and sleep disturbances domains. All domains refer to the previous 7 days except physical function, which has no time frame. In addition, there is also a question on pain intensity on an 11-point rating scale (0–10) [3].

To facilitate interpretation, the output from PROMIS^® domains can be presented as a T-score, which is a standardised score developed using a representative sample of the population. It is centred around a score of 50, and 10 is equal to one standard deviation from the reference population (usually the US general population) [4]. Thus, a T-score of 70 is two standard deviations above or below (depending on the wording of the question) the average of the reference population.

The construct validity and sensitivity of PROMIS-29 (Profile V2.1, V2.0) to changes in health status have been confirmed by several studies on patients with musculoskeletal pain [5], systemic sclerosis [6] and multiple chronic conditions [7], as well as kidney transplant recipients [8]. It has been translated into numerous languages and has been culturally adapted to several nations, which has led to its extensive adoption internationally [9,10,11].

Economic evaluation regulatory agencies such as the National Institute for Health and Care Excellence (NICE) in the UK, and the Pharmaceutical Benefits Advisory Committee (PBAC) and Medical Services Advisory Committee (MSAC) in Australia recommend using quality-adjusted life-years (QALYs) as the standard metric of health benefits when conducting economic evaluations of health technologies [12,13,14,15]. Health-state utility values (henceforth utilities) are typically estimated using a multi-attribute utility instrument [16]. Patients complete a questionnaire describing their health states, upon which a preference-based algorithm is applied to estimate utilities. The EuroQol five dimensions measure (EQ-5D) is the multi-attribute utility instrument most widely used to measure utilities in clinical trials and related studies and is the National Institute for Health and Care Excellence’s (NICE) preferred instrument for cost-utility evaluations in health technology assessments [12, 17]. The EQ-5D is the most widely used generic, preference-based, health outcome measure [18]. This popularity can be attributed to its format, worldwide validated translations and available value sets. The EQ-5D has been used globally in patients with various health conditions; this allows the comparisons of its utility values across different diseases and populations. The EQ-5D is simple to administer and score and has been proven to yield psychometric properties comparable to more comprehensive instruments [19]. The older version of this instrument has three levels, with a range from no problems to extreme problems. However, the newer version of the instrument (EQ-5D-5L) was introduced recently to increase the sensitivity of the instrument and to address the ceiling effect, leading to 5⁵ or 3125 possible health states [20].

One approach to estimate utilities is to conduct valuations whereby people express their preferences for different health states, thus utilities, described by the PROMIS^® using a time trade-off, standard gamble or discrete-choice experiment study [21]. Two such valuation studies have been conducted to date [22, 23], both in the USA. Dewitt et al. developed the PROMIS-Preference (PROPr) scoring system for valuations using standard gamble and seven domains of PROMIS^® [22], including cognition, depression, physical function, pain interference, fatigue, sleep disturbance, and the ability to participate in social roles and activities. However, PROPr includes a cognition domain that is not included in PROMIS-29 and needs to be estimated from other PROMIS 29 domains [24]. Craig et al. developed a value set based on PROMIS-29 (Profile V1.0) using a discrete-choice experiment approach [23]. However, this value set resulted in much lower utility estimates than those produced by other health instruments—the mean was only 0.16 in a sample drawn from the US general population [25]. Moreover, this version of PROMIS-29 is now retired and is substantially different from its newer versions in the domain of “social roles and activities”. Hartman and Craig identified this underestimation as a rescaling issue and proposed an algorithm to convert PROMIS-29 utilities to EQ-5D-3L utilities [26].

The other approach to utility estimation is to develop a “mapping” algorithm from PROMIS-29 scores to utilities [27]. Guidelines from decision-making authorities recommend using mapping to estimate utilities in the absence of direct utility measures [14, 28, 29]. A review of National Institute for Health and Care Excellence (NICE) health technology appraisals found that mapping was adopted in over a quarter of submissions between the years 2004 and 2008 [30]. Mapping can be ‘direct’ whereby utilities are directly estimated from explanatory variables, or ‘indirect’ whereby the probabilities for each response are predicted, and the relevant tariff is used to estimate utilities. The resulting algorithm can be applied to the PROM data collected in the study or clinical trial to estimate the associated utilities and quality-adjusted life-years [31].

There are three studies that map PROMIS-29 to utility measures [32]. Revicki et al. conducted the first mapping study, predicting EQ-5D-3L utilities (US valuation) from PROMIS-29 responses using data from 1658 people in the USA [33]. The algorithm was developed using ordinary least squares (OLS) and the best-performing model accounted for 57% of the variance in the EQ-5D-3L utilities. Hays et al. predicted Health Utilities Index Mark 3 utilities (Canadian valuation) from PROMIS-29 (Profile V2.0) responses, using data from 3000 people recruited in the USA [25]. The algorithm was developed using OLS and linear equating and the best-performing model accounted for 64% of the variance in the Health Utilities Index Mark 3 utilities. Klapproth et al. predicted EQ-5D-5L crosswalk utilities from PROMIS-29 responses using data from people recruited in the UK (n = 1509), France (n = 1501) and Germany (n = 1502) [34]. The EQ-5D-5L crosswalk utilities (using the 3L value sets for health states described by the 5L version) were used because the value set for the 5L version was not available for all three countries. The mapping algorithms were estimated using stepwise OLS with backward selection, starting with full models that incorporated linear, quadratic and cubic functional forms. The authors found that the use of polynomials improved the predictive performance, particularly for poorer health states. However, the study did not explore other regression models. While OLS is one of the most widely adopted direct approaches, it can suffer from poor accuracy as utilities are often highly skewed and bounded at one [35,36,37,38]. Moreover, the use of EQ-5D-5L crosswalk utilities introduces an additional level of uncertainty in the prediction.

The primary aim of this study is to develop the first mapping algorithm to predict EQ-5D-5L (non-crosswalk) utilities from PROMIS-29 responses. A value set for the 5L version is adopted to avoid the potential bias caused by using the crosswalk utilities. In previous research, in the absence of a value set for the 5L version, the value set for the 3L version has been used to predict the 5L utilities (crosswalk utilities). The secondary aim is to explore and compare various mapping approaches, including a range of regression models and direct and indirect (response) mapping. Several advanced methods such as finite mixtures of beta regression (Betamix) and the adjusted limited dependent variable mixture model (ALDVMM) are considered [39, 40].

2 Methodology

An online survey was conducted in June 2020 using the PureProfile online panel provider to collect responses from 3013 Australians representative of the general population. Respondents completed the PROMIS-29 (Profile V2.1), EQ-5D-5L and the Charlson Comorbidity Index (CCI) questionnaires [3, 17, 41]. Information on age, sex, state of residence and postcode was also collected.

2.1 Source and Target Measures

2.1.1 Source Measure: PROMIS-29 (Profile V2.1)

The PROMIS-29 (Profile V2.1) consists of 29 questions measuring seven domains of health, including depression, anxiety, physical function, pain interference, fatigue, sleep disturbance, and the ability to participate in social roles and activities [3]. Respondents rate each question based on the degree of severity or impairment in a given symptom or function. The PROMIS-29 (Profile V2.1) also assesses pain intensity using a single numeric rating item where the severity level ranges from 0 (no pain) to 10 (worst possible pain).

2.1.2 Target Measure: EQ-5D-5L

Our target measure was the five-level EQ-5D version, EQ-5D-5L, which was introduced in 2009 in an attempt to improve the instrument sensitivity and to address ceiling effects, as compared to the previous three-level version EQ-5D-3L [20]. This new version consists of five questions about mobility, self-care, usual activities, pain/discomfort and anxiety/depression. Each dimension comprises five levels, with a range from having no problems to having extreme problems [17]. The EQ-5D-5L utilities were estimated by applying an Australian valuation set developed using a discrete-choice experiment approach [42] (see Table 4, Approach 5 of their paper)^{Footnote 1}.

2.1.3 Charlson Comorbidity Index

The self-reported CCI was also included in the questionnaire as a potential predictor in the mapping model. The index consists of questions about the presence of 12 defined medical conditions, whether receiving treatment for the problem or not, and whether the medical condition has limited the respondent’s activities. An individual can receive a maximum of 3 points for each medical condition: 1 point for the presence of the condition, 1 for receiving treatment and an additional 1 point if the condition affects their usual activities[41].

2.2 Estimation and Validation Samples

In-sample cross-validation was performed for model comparisons using a ten-fold technique as there was no access to an external validation sample [43]. The dataset was divided randomly into ten subsamples, and nine subsamples were utilised as the “estimation sample”, and the remaining data were considered the “validation sample”. This process was repeated ten times, with each of the ten subsamples used once as the validation data.

2.3 Statistical Analysis

This mapping study explored a wide range of regression models for the best-performing algorithm. All statistical analyses were conducted in STATA 16. Methodological guidance recommended by the Mapping onto Preference-Based Measures Reporting Standards checklist [44], and reporting standards guidance outlined in the 2017 International Society for Pharmacoeconomics and Outcomes Research Task Force report were followed [31] (see Tables A1 and A2 in the Electronic Supplementary Material [ESM]).

2.4 Modelling Approach

The direct mapping approaches predict utilities directly by regressing EQ-5D-5L utilities on the PROMIS-29 domains or items. In estimation with direct mapping, five sets of explanatory variables were considered as initial candidate predictors. The final predictors were selected using the forward stepwise selection (at the 5% significance level). Set 1 was based on the Revicki et al. study, which predicted EQ-5D-3L utilities from PROMIS-29 domains’ T-scores [33]. Set 2 was based on Klapproth et al., which considered T-scores (in a linear, quadratic or cubic functional form) as well as age and sex as predictors [34]. The other three sets of explanatory variables were based on PROMIS-29 domains and items of raw responses:

Set 3: each of the seven domains of PROMIS-29 (raw scores) plus pain intensity, sex, age and age squared.
Set 4: each of the PROMIS-29 items as a continuous variable, plus sex, age and age squared.
Set 5: each of the PROMIS-29 items as a categorical variable, plus sex, age and age squared.

The direct approach considered a range of regression methods including OLS, Tobit, generalised linear model (GLM), censored least absolute deviation (CLAD) model, Betamix and ALDVMM.

The OLS method has several limitations, as mentioned previously [35,36,37,38]. Alternative econometric methods including Tobit, GLM and CLAD were therefore adopted to overcome these limitations [45,46,47]. The Tobit model accounts for utilities bounded at one, while the GLM approach can account for the non-normal distribution of utilities and better handles skewed data than linear regression [48]. The CLAD approach uses median values instead of a mean value, which is more robust to outliers and beneficial when a ceiling effect is present [45, 46, 49].

The mixture models, ALDVMM and Betamix, were also considered because of their flexibility and ability to accommodate multimodality [39, 40]. The ALDVMM prevents prediction outside the feasible range by limiting the dependent variable. It also accounts for the unfeasible gap between full health and the next possible health state [50]. The ALDVMM accounts for multimodality by modelling utilities as a mixture of multiple components, each representing a cluster of respondents with similar utility scores. Here, the estimated models consisted of two and three components. Betamix was used for its ability to model skewed and multimodal data measured on a zero to one interval [51, 52].

In the GLM estimation, the modified Park test identified a Poisson family distribution for the EQ-5D-5L utilities [53], and the Pregibon Link test and the modified Hosmer–Lemeshow test identified a power function with a power of 0.50 as the link function [54, 55]. In the estimation of ALDVMM^{Footnote 2}, two to four components were considered but the model with four components failed to converge. The predicted utilities were trucated at one if they were higher than one.

For indirect mapping, the responses to each EQ-5D-5L question were the dependent variables, and then the predicted responses were combined to predict utilities. As each question was modelled separately, each mapping algorithm consisted of five separate models. One set of explanatory variables was considered in indirect mapping, including PROMIS-29 items (as categorical variables), age, age squared and sex. We considered the generalised ordered logit (GLOGIT) model, which first predicts responses, and then utilities [46, 56, 57]. Monte Carlo simulations were used to generate the predicted responses to each dimension of the EQ-5D-5L within the GLOGIT approach.

The robustness of both direct and indirect method results was examined by considering two additional sets of explanatory variables:

Set 6: each of the PROMIS-29 items as a categorical variable, plus CCI, age and age squared.
Set 7: each of the PROMIS-29 items as a categorical variable, plus the interaction between CCI and age (including age squared).

2.5 Measures of Predictive Accuracy

The predictive accuracy of the models was compared using a number of metrics, including the geometric mean absolute error (GMAE)^{Footnote 3} and the geometric mean squared error (GMSE)^{Footnote 4} across ten-fold cross-validation. The geometric mean, instead of the arithmetic mean, was adopted because the former is less affected by outliers [58]. The other metrics adopted were focused on the accuracy in predicting the mean, minimum and maximum utilities of the sample. An overall ranking of the models was produced to accommodate all the criteria. Moreover, the distribution of the observed utilities was plotted against the predicted utilities to visually assess each method’s performance.

3 Results

3.1 Descriptive Statistics

The survey was completed by 3013 respondents. The average age of respondents was 47 years, and 51% of the sample were female. Respondent characteristics were representative of the Australian population across sex, age and jurisdictional location. Table 1 provides descriptive statistics of the study sample. The average EQ-5D-5L utility was 0.83 (standard deviation = 0.25). In the entire sample, 25% of the respondents had a utility equal to 1 (full health), and 2.5% had a utility of less than zero (poorest health state).

Table 1 Descriptive statistics

Full size table

The correlations between PROMIS-29 domains and EQ-5D-5L utilities and its dimensions (measured by Spearman’s ρ) are presented in Table 2, suggesting strong and statistically significant correlations between the EQ-5D-5L utilities and PROMIS-29 domains [Spearman’s rho (ρ) = 0.69 to −0.38; p = 0.000, 95% confidence interval]. High correlations are desirable as the accuracy of a mapping algorithm depends on the magnitude of overlap between the source and target measures [59]. The highest correlation was between the EQ-5D-5L utilities and the physical function domain of PROMIS-29 (ρ = 0.69), closely followed by the pain interference domain (ρ = −0.67). The lowest correlation was between the EQ-5D-5L utilities and the sleep disturbance domain (ρ = −0.38). The correlations between EQ-5D-5L dimensions and PROMIS-29 domains were overall statistically significant, except those between the mobility dimension of EQ-5D-5L and the anxiety, depression, fatigue and sleep disturbance domains of PROMIS-29. The highest correlation existed between the physical function domain of PROMIS-29 and the mobility dimension of EQ-5D-5L, likely owing to the fact that both instruments have similar questions regarding mobility and physical functions.

Table 2 Correlations between Patient-Reported Outcomes Measurement Information System (PROMIS-29) domains and EuroQol 5-Dimension 5-Level (EQ-5D-5L) dimensions

Full size table

3.2 Model Comparison

Table 3 presents the performance of each model for all metrics. For the direct mapping, models using predictors set 5 consistently performed better than those using sets 1–4, which is sensible because the ordinal nature of PROMIS-29 responses was not considered in the four sets. Set 2, which included polynomial forms of PROMIS-29 domains T-scores, performed better than sets 1, 3 and 4 in predicting lower utilities. This is consistent with Klapproth et al.’s finding that accounting for the non-linearity improves the prediction performance for poorer health states [34]. Only models using set 5 were discussed below and considered in the formal ranking.

Table 3 Predictive performance of models results from ten-fold cross-validation

Full size table

The ALDVMM-2 part model and GLM resulted in the lowest geometric mean absolute error scores 0.0882 and 0.0883, respectively. The former also clearly outperformed the other models by producing the lowest geometric mean squared error. GLM, Betamix and ALDVMM-2 part were able to correctly predict the observed mean (0.8279). Whilst none of the models accurately predicted the observed minimum utility (−0.3770), GLM (−0.3335) and ALDVMM-2 part (−0.3246) dominated the other models. The ALDVMM performance was not improved by adding another component and the ALDVMM-2 part outperformed the ALDVMM-3 part. OLS and CLAD, after truncation, were able to recover the maximum utility. For the indirect mapping, GLOGIT performed poorly for almost all the metrics, but it did predict the maximum utility correctly.

Table 4 ranks the models’ performances for each of the five criteria and reports the average of the five ranking scores. The ALDVMM-2 part model was the clear winner with a score of 1.8, followed by GLM with a score of 2.2. The two worst-performing models were GLOGIT (6.4) and Tobit (6.2).

Table 4 Ranking the performance of models

Full size table

Figure 1 compares the distribution of predicted utilities using explanatory variables set 5 with the observed distribution of the EQ-5D-5L utilities. This helps visually assess the performance of models across the whole distribution. The three mixture models overall fitted better, particularly towards both ends of the distribution. The scatterplots between observed and estimated utilities using explanatory variables set 5 are also generated to assess the predictive performance regarding how close each estimated utility is to its corresponding observed EQ-5D-5L utility (presented in Fig. A1 of the ESM). While it clearly demonstrates the poor predictive performance of GLOGIT, it is somewhat difficult to differentiate the performances of the other models.

As previously mentioned, predictors set 6 and 7 were also considered to test the robustness of models. The results were consistent with those using set 5. Adding the CCI improved predictions, particularly for lower utilities, which was expected as the CCI is a measure of comorbidities and thus severity of a respondent’s health state. However, the models using the CCI as a predictor are not suitable for producing mapping algorithms as CCI data are not always available.

3.3 Mapping Function

The ALDVMM-2 part model using the explanatory variables set 5 was considered the best-performing model. The mapping algorithm based on this model using the full dataset is presented in Table A4 of the ESM. The variance covariance matrix is provided in the ESM, which enables researchers to conduct a probabilistic sensitivity analysis in future cost-effectiveness analyses.

3.4 Comparison to Other Studies

We further compared the predictive performance between our mapping algorithm and the other two published mapping algorithms [33, 34]. The detail of this comparison is presented in Table A5 of the ESM.

4 Discussion

This paper develops the first algorithm mapping PROMIS-29 responses to the EQ-5D-5L (non-crosswalk) utilities using a large Australian dataset representative of the population. This algorithm predicted utilities with high precision. In particular, the mapping algorithm performed well in predicting lower health utilities, which previous studies struggled to achieve [33, 34]. This is particularly important as PROMIS-29 is often used for the population with multiple chronic conditions, who might have lower health utilities.

Although the differences in GMAE and GMSE appear small, these differences are more pronounced when the observed and predicted utilities are compared. For example, the estimated utility of a patient responding to the EQ-5D-5L with mobility = 1, self-care = 2, usual activity = 3, pain and discomfort = 2, and anxiety and depression = 3 is 0.7508. Using the OLS with the set 4 mapping algorithm predicts this utility to be between 0.6105 and 0.8638 (mean = 0.7865) depending on patient characteristics such as age and sex. Our best-performing model ALDVMM predicts these utilities to be between 0.6663 and 0.8314 (mean = 0.7612). This difference in utilities can make a substantial difference to the results of economic evaluations.

The mapping algorithm was developed through a rigorous process. We considered a large number of econometric models, including direct and indirect mapping as well as different model specifications and functional forms of the predictors. We also adopted five different metrics for comparison and produced ranking scores accommodating all the metrics. A distribution plot of the predicted and observed utilities was further generated for visual assessment. The ALDVMM-2 part model outperformed all the other models, including OLS adopted by the two previous mapping studies from PROMIS to EQ-5D [31,32,33,34]. Our results suggest that using models that accommodate multimodality may achieve superior performance; this has also been found in other mapping studies [60,61,62,63,64,65,66]. In our study, this superiority was particularly pronounced by the models’ accuracy in predicting utilities with a range from 0.4 to 0.8. While OLS is the most commonly used method for mapping, and it is easy to execute [35], its shortcomings may lead to a biased estimation of utilities and therefore suboptimal resource allocation.

We also explored different types of predictors. In addition to PROMIS-29 domains and their T-scores considered in Revicki et al. and Klapproth et al. [33, 34], we considered PROMIS-29 item raw scores, which accounts for the extra information at the individual question level. Moreover, the T-scores were calculated for the US general population and may not be appropriate for the other population. Another key difference to the former studies is that we included the PROMIS-29 pain intensity question, which was left out in previous studies.

Our study will enable the estimation of utilities from the PROMIS-29 in Australia for the first time. This is important as PROMIS-29 is one of the critical measures of health outcomes in Australia [67]. Previous mapping studies using PROMIS-29 were conducted in the USA and Europe [33, 34], and Klapproth et al. found limited generalisability of mapping algorithms across the countries [34].

There have been a few attempts to generate utilities directly by valuing PROMIS-29 [22, 23]. However, these studies are often subject to limitations. Craig et al. developed a value set that produced significantly lower utilities than expected [23]. The Dewitt et al. valuation set was developed based on the PROPr form, which uses a cognition domain not included in PROMIS-29 [22]. This has made it difficult to calculate a PROPr utility score when a PROMIS profile instrument is used. To further explore the valuation method for direct utility estimation is warranted.

There are several limitations in the study. First, the participants in this study were from an online panel. They had a similar distribution of age, sex and jurisdiction location to the Australian general population; however, self-selection bias might have occurred. Second, only internal cross-validation was used given the unavailability of external datasets containing PROMIS-29 and EQ-5D-5L, which is similar to the existing literature. Third, the analyses were based on only a single dataset. As with all mapping studies, there is uncertainty around the results and the differences in the measures of model performance. Future research using different datasets, sample sizes and types of respondents would further validate the proposed mapping algorithm.

5 Conclusions

This study developed an algorithm to map the PROMIS-29 to EQ-5D-5L utilities in Australia. The robustness and prediction precision of ALDVMM confirmed it as a superior mapping approach compared with other regression methods. With this algorithm, PROMIS-29 can be translated to quality-adjusted life-years for use in economic evaluations. This study extends the usefulness of PROMIS-29 from measuring patient outcomes in a clinical setting to a broader economic evaluation tool that can enable more efficient, healthcare resource allocation decisions.

Notes

While this study has not been published yet, we could access it through the authors. The valuation in this study is based on a new approach and constructed to estimate a utility algorithm with main effects plus interaction terms for pairs of attributes where both appear at levels 4 or 5, thus suggesting a more precise valuation of health status, particularly for individuals in poor health.
The ALDVMM was estimated using the Stata command “aldvmm”.
The MAE was calculated as the mean of the absolute values of the difference between the observed and predicted EQ-5D-5L utilities. Then, the GMAE was calculated by taking the tenth root over the product of individual MAEs.
The MSE were computed as the mean squared differences between the predicted and observed EQ-5D-5L utilities. The GMSE was calculated by taking the tenth root over the product of individual MSEs.

References

Cella D, Yount S, Rothrock N, Gershon R, Cook K, Reeve B, et al. The Patient-Reported Outcomes Measurement Information System (PROMIS): progress of an NIH toadmap cooperative group during its first two years. Med Care. 2007;45(5 Suppl. 1):S3.
Article Google Scholar
Fries JF, Krishnan E, Rose M, Lingala B, Bruce B. Improved responsiveness and reduced sample size requirements of PROMIS physical function scales with item response theory. Arthritis Res Ther. 2011;13(5):1–8.
Article Google Scholar
Hays R, Spritzer K, Schalet B, Cella D. PROMIS®-29 v2.0 profile physical and mental health summary scores. Qual Life Res. 2018;27(7):1885–91. https://doi.org/10.1007/s11136-018-1842-3.
Article Google Scholar
Instruments PAP. A brief guide to the PROMIS profile instruments for adult respondents. PPROMI System; 2015.
Deyo RA, Katrina R, Buckley DI, Michaels L, Kobus A, Eckstrom E, et al. Performance of a Patient Reported Outcomes Measurement Information System (PROMIS) short form in older adults with chronic musculoskeletal pain. Pain Med. 2016;17(2):314–24. https://doi.org/10.1093/pm/pnv046.
Article Google Scholar
Morrisroe K, Stevens W, Huq M, Sahhar J, Ngian G-S, Zochling J, et al. Validity of the PROMIS-29 in a large Australian cohort of patients with systemic sclerosis. J Scleroderma Relat Disord. 2017;2(3):188–95.
Article Google Scholar
Rose AJ, Bayliss E, Huang W, Baseman L, Butcher E, Garcia RE, et al. Evaluating the PROMIS-29 v2.0 for use among older adults with multiple chronic conditions. Qual Life Res. 2018;27(11):2935–44. https://doi.org/10.1007/s11136-018-1958-5.
Article Google Scholar
Tang E, Ekundayo O, Peipert JD, Edwards N, Bansal A, Richardson C, et al. Validation of the Patient-Reported Outcomes Measurement Information System (PROMIS)-57 and -29 item short forms among kidney transplant recipients. Qual Life Res. 2019;28(3):815–27. https://doi.org/10.1007/s11136-018-2058-2.
Article Google Scholar
Mahmoud GA, Rady HM, Mostafa AM. Cross cultural adaptation and validation of an Arabic version of selected PROMIS measures for use in rheumatoid arthritis patients. Egypt Rheumatol. 2019;41(3):177–82.
Article Google Scholar
Devine J, Klasen F, Moon J, Herdman M, Hurtado M, Castillo G, et al. Translation and cross-cultural adaptation of eight pediatric PROMIS^® item banks into Spanish and German. Qual Life Res. 2018;27(9):2415–30.
Article CAS Google Scholar
Coste J, Rouquette A, Valderas J, Rose M, Leplege A. The French PROMIS-29. Psychometric validation and population reference values. Rev Epidemiol Sante Publique. 2018;66(5):317–24.
Article CAS Google Scholar
National Institute for Clinical Excellence. NICE technology appraisals: a guide to the methods of technology appraisal. London: National Institute for Clinical Excellence; 2013.
Google Scholar
Department of Health. Procedure guidance for listing medicines on the Pharmaceutical Benefits Scheme (including consideration of vaccines for the National Immunisation Program). Version 1. Canberr (ACT): Commonwealth of Australia; 2016.
Medical Services Advisory Committee. Technical guidelines for preparing assessment reports for the Medical Services Advisory Committee: medical service type: therapeutic (version 2.0). Department of Health, Commonwealth of Australia; Canberra; 2016.
Guidelines for the economic evaluation of health technologies: Canada. 4th ed. Ottawa: CADTH; 2017.
Torrance GW, Furlong W, Feeny D, Boyle M. Multi-attribute preference functions. Pharmacoeconomics. 1995;7(6):503–20.
Article CAS Google Scholar
Herdman M, Gudex C, Lloyd A, Janssen MF, Kind P, Parkin D, Bonsel G, Badia X. Development and preliminary testing of the new five-level version of EQ-5D (EQ-5D-5L). Qual Life Res. 2011;20(10):1727–36.
Brauer CA, Rosen AB, Greenberg D, Neumann PJ. Trends in the measurement of health utilities in published cost-utility analyses. Value Health. 2006;9(4):213–8.
Article Google Scholar
de Willige GV, Wiersma D, Nienhuis FJ, Jenner JA. Changes in quality of life in chronic psychiatric patients: a comparison between EuroQol (EQ-5D) and WHOQoL. Qual Life Res. 2005;14(2):441–51.
Article Google Scholar
Herdman M, Gudex C, Lloyd A, Janssen M, Kind P, Parkin D, et al. Development and preliminary testing of the new five-level version of EQ-5D (EQ-5D-5L). Qual Life Res. 2011;20(10):1727–36. https://doi.org/10.1007/s11136-011-9903-x.
Article CAS Google Scholar
Drummond MF, Sculpher MJ, Torrance GW, O’Brien BJ, Stoddart GL. Methods for the economic evaluation of health care programmes. 3rd ed. Oxford: Oxford University Press; 2005.
Google Scholar
Dewitt B, Feeny D, Fischhoff B, Cella D, Hays RD, Hess R, et al. Estimation of a preference-based summary score for the Patient-Reported Outcomes Measurement Information System: the PROMIS^®-Preference (PROPr) scoring system. Med Decis Making. 2018;38(6):683–98.
Article Google Scholar
Craig BM, Reeve BB, Brown PM, Cella D, Hays RD, Lipscomb J, et al. US valuation of health outcomes measured using the PROMIS-29. Value Health. 2014;17(8):846–53. https://doi.org/10.1016/j.jval.2014.09.005.
Article Google Scholar
Dewitt B, Jalal H, Hanmer J. Computing PROPr utility scores for PROMIS^® profile instruments. Value Health. 2020;23(3):370–8.
Article Google Scholar
Hays R, Revicki D, Feeny D, Fayers P, Spritzer K, Cella D. Using linear equating to Map PROMIS® global health items and the PROMIS-29 V2.0 profile measure to the Health Utilities Index Mark 3. Pharmacoeconomics. 2016;34(10):1015–22. https://doi.org/10.1007/s40273-016-0408-x.
Article Google Scholar
Hartman JD, Craig BM. Comparing and transforming PROMIS utility values to the EQ-5D. Qual Life Res. 2018;27(3):725–33. https://doi.org/10.1007/s11136-017-1769-0.
Article Google Scholar
Kearns B, Ara R, Wailoo A, Manca A, Alava MH, Abrams K, et al. Good practice guidelines for the use of statistical regression models in economic evaluations. Pharmacoeconomics. 2013;31(8):643–52. https://doi.org/10.1007/s40273-013-0069-y.
Article Google Scholar
The National Institute for Clinical Excellence. Technology appraisals: a guide to the methods of technology appraisal. London: National Institute for Clinical Excellence; 2013.
Google Scholar
Department of Health. Guidelines for preparing a submission to the Pharmaceutical Benefits Advisory Committee. Version 5.0. Commonwealth of Australia; Canberra; 2016.
Tosh JC, Longworth LJ, George E. Utility values in National Institute for Health and Clinical Excellence (NICE) technology appraisals. Value Health. 2011;14(1):102–9. https://doi.org/10.1016/j.jval.2010.10.015.
Article Google Scholar
Wailoo AJ, Hernandez-Alava M, Manca A, Mejia A, Ray J, Crawford B, et al. Mapping to estimate health-state utility from non-preference-based outcome measures: an ISPOR Good Practices for Outcomes Research Task Force Report. Value Health. 2017;20(1):18–27. https://doi.org/10.1016/j.jval.2016.11.006.
Article Google Scholar
Pan T, Mulhern B, Viney R, Norman R, Tran-Duy A, Hanmer J, et al. Evidence on the relationship between PROMIS-29 and EQ-5D: a literature review. Qual Life Res. 2022;31(1):79–89.
Article Google Scholar
Revicki DA, Kawata AK, Harnam N, Chen WH, Hays RD, Cella D. Predicting EuroQol (EQ-5D) scores from the patient-reported outcomes measurement information system (PROMIS) global items and domain item banks in a United States sample. Qual Life Res. 2009;18(6):783–91. https://doi.org/10.1007/s11136-009-9489-8.
Article Google Scholar
Klapproth CP, van Bebber J, Sidey-Gibbons C, Valderas J, Leplege A, Rose M, et al. Predicting EQ-5D-5L crosswalk from the PROMIS-29 profile for the United Kingdom, France, and Germany. Health Qual Life Outcomes. 2020;18(1):1–13.
Article Google Scholar
Brazier JE, Yang Y, Tsuchiya A, Rowen DL. A review of studies mapping (or cross walking) non-preference based measures of health to generic preference-based measures. Eur J Health Econ. 2010;11(2):215–25. https://doi.org/10.1007/s10198-009-0168-z.
Article Google Scholar
Ara R, Brazier J. Deriving an algorithm to convert the eight mean SF-36 dimension scores into a mean EQ-5D preference-based score from published studies (where patient level data are not available). Value Health. 2008;11(7):1131–43. https://doi.org/10.1111/j.1524-4733.2008.00352.x.
Article Google Scholar
Rowen D, Brazier J, Roberts J. Mapping SF-36 onto the EQ-5D index: how reliable is the relationship? Health Qual Life Outcomes. 2009;7:27. https://doi.org/10.1186/1477-7525-7-27.
Article Google Scholar
Crott R, Briggs A. Mapping the QLQ-C30 quality of life cancer questionnaire to EQ-5D patient preferences. Eur J Health Econ. 2010;11(4):427–34. https://doi.org/10.1007/s10198-010-0233-7.
Article Google Scholar
Woodcock F, Doble B, Consortium C. Mapping the EORTC-QLQ-C30 to the EQ-5D-3L: an assessment of existing and newly developed algorithms. Med Decis Making. 2018;38(8):954–67. https://doi.org/10.1177/0272989X18797588.
Article Google Scholar
Ward Fuller G, Hernandez M, Pallot D, Lecky F, Stevenson M, Gabbe B. Health state preference weights for the Glasgow Outcome Scale following traumatic brain injury: a systematic review and mapping study. Value Health. 2017;20(1):141–51. https://doi.org/10.1016/j.jval.2016.09.2398.
Article Google Scholar
Chaudhry S, Jin L, Meltzer D. Use of a self-report-generated Charlson Comorbidity Index for predicting mortality. Med Care. 2005;43(6):607–15.
Article Google Scholar
Norman R, Viney R, Mulhern B, Brazier JE, Ratcliffe J, Lancsar E, et al. A large Australian DCE with duration and dead to value EQ-5D-5L health states. EuroQol Meeting; 21–22 September 2017; Barcelona.
Fushiki T. Estimation of prediction error by using K-fold cross-validation. Stat Comput. 2011;21(2):137–46.
Article Google Scholar
Petrou S, Rivero-Arias O, Dakin H, Longworth L, Oppe M, Froud R, et al. Preferred reporting items for studies mapping onto preference-based outcome measures: the MAPS statement. Pharmacoeconomics. 2015;33(10):985–91. https://doi.org/10.1007/s40273-015-0319-2.
Article Google Scholar
Sullivan PW, Ghushchyan V. Mapping the EQ-5D index from the SF-12: US general population preferences in a nationally representative sample. Med Decis Making. 2006;26(4):401–9. https://doi.org/10.1177/0272989X06290496.
Article Google Scholar
Kaambwa B, Bryan S, Barton P. Relationship between the EuroQoL-5D and Barthel Index: Examining the use of proxy outcome measures for older people. Med Decis Making. 2007;27(E5).
Brennan DS, Spencer AJ. Mapping oral health related quality of life to generic health state values. BMC Health Serv Res. 2006;6:96. https://doi.org/10.1186/1472-6963-6-96.
Article Google Scholar
Dakin H, Petrou S, Haggard M, Benge S, Williamson I. Mapping analyses to estimate health utilities based on responses to the OM8-30 Otitis Media Questionnaire. Qual Life Res. 2010;19(1):65–80. https://doi.org/10.1007/s11136-009-9558-z.
Article Google Scholar
Shaw JW, Pickard AS, Yu S, Chen S, Iannacchione VG, Johnson JA, et al. A median model for predicting United States population-based EQ-5D health state preferences. Value Health. 2010;13(2):278–88. https://doi.org/10.1111/j.1524-4733.2009.00675.x.
Article Google Scholar
Gray LA, Wailoo AJ, Hernandez AM. Mapping the FACT-B instrument to EQ-5D-3L in patients with breast cancer using adjusted limited dependent variable mixture models versus response mapping. Value Health. 2018;21(12):1399–405. https://doi.org/10.1016/j.jval.2018.06.006.
Article Google Scholar
Lamu AN, Chen G, Gamst-Klaussen T, Olsen JA. Do country-specific preference weights matter in the choice of mapping algorithms? The case of mapping the Diabetes-39 onto eight country-specific EQ-5D-5L value sets. Qual Life Res. 2018;27(7):1801–14. https://doi.org/10.1007/s11136-018-1840-5.
Article Google Scholar
Paolino P. Maximum likelihood estimation of models with beta-distributed dependent variables. Polit Anal. 2001;9(4):325–46.
Article Google Scholar
Manning WG, Mullahy J. Estimating log models: to transform or not to transform? J Health Econ. 2001;20(4):461–94.
Article CAS Google Scholar
Pregibon D. Goodness of link tests for generalized linear models. J R Stat Soc C-Appl. 1980;29(1):15–24.
Google Scholar
Hosmer DW Jr, Lemeshow S, Rodney X. Applied logistic regression. New York: Wiley; 2013.
Book Google Scholar
Gray AM, Rivero-Arias O, Clarke PM. Estimating the association between SF-12 responses and EQ-5D utility values by response mapping. Med Decis Making. 2006;26(1):18–29. https://doi.org/10.1177/0272989X05284108.
Article Google Scholar
Fryback DG, Lawrence WF, Martin PA, Klein R, Klein BE. Predicting quality of well-being scores from the SF-36: results from the Beaver Dam Health Outcomes Study. Med Decis Making. 1997;17(1):1–9. https://doi.org/10.1177/0272989X9701700101.
Article CAS Google Scholar
Gu Y, Fiebig DG, Cripps E, Kohn R. Bayesian estimation of a random effects heteroscedastic probit model. Econometrics J. 2009;12(2):324–39.
Article Google Scholar
Longworth L, Rowen D. Mapping to obtain EQ-5D utility values for use in NICE health technology assessments. Value Health. 2013;16(1):202–10. https://doi.org/10.1016/j.jval.2012.10.010.
Article Google Scholar
Hernandez-Alava M, Wailoo A, Wolfe F, Michaud K. The relationship between EQ-5D, HAQ and pain in patients with rheumatoid arthritis. Rheumatology. 2013;52(2):944–50.
Article CAS Google Scholar
Khan I, Morris S. A non-linear beta-binomial regression model for mapping EORTC QLQ-C30 to the EQ-5D-3L in lung cancer patients: a comparison with existing approaches. Health Qual Life Outcomes. 2014;12(1):163.
Article Google Scholar
Young TA, Mukuria C, Rowen D, Brazier JE, Longworth L. Mapping functions in health-related quality of life: mapping from two cancer-specific health-related quality-of-life instruments to EQ-5D-3L. Med Decis Making. 2015;35(7):912–26.
Article Google Scholar
Basu A, Manca A. Regression estimators for generic health-related quality of life and quality-adjusted life years. Med Decis Making. 2012;32(1):56–69.
Article Google Scholar
Gray Hernandez-Alava MH. A command for fitting mixture regression models for bounded dependent variables using the beta distribution. Stata J. 2018;18(1):51–75.
Article Google Scholar
Hernandez-Alava M, Wailoo A. Fitting adjusted limited dependent variable mixture models to EQ-5D. Stata J. 2015;15(3):737–50.
Article Google Scholar
Yang F, Wong C, Luo N, Piercy J, Moon R, Jackson J. Mapping the kidney disease quality of life 36-item short form survey (KDQOL-36) to the EQ-5D-3L and the EQ-5D-5L in patients undergoing dialysis. Eur J Health Econ. 2019;20(8):1195–206. https://doi.org/10.1007/s10198-019-01088-5.
Article CAS Google Scholar
Monitoring and Evaluation Framework. Integrated care strategy. NSW Ministry of Health; Sydney; 2016.

Download references

Author information

Authors and Affiliations

Macquarie University Centre for the Health Economy, Macquarie University, Level 5, 75 Talavera Road, Sydney, NSW, 2109, Australia
Mona Aghdaee, Yuanyuan Gu, Bonny Parkinson, Rajan Sharma & Henry Cutler
Department of Economics, Macquarie Business School, Macquarie University, Sydney, NSW, Australia
Kompal Sinha
Australian Institute of Health Innovation (AIHI), Macquarie University, Sydney, NSW, Australia
Mona Aghdaee, Yuanyuan Gu, Bonny Parkinson, Rajan Sharma & Henry Cutler

Authors

Mona Aghdaee
View author publications
You can also search for this author in PubMed Google Scholar
Yuanyuan Gu
View author publications
You can also search for this author in PubMed Google Scholar
Kompal Sinha
View author publications
You can also search for this author in PubMed Google Scholar
Bonny Parkinson
View author publications
You can also search for this author in PubMed Google Scholar
Rajan Sharma
View author publications
You can also search for this author in PubMed Google Scholar
Henry Cutler
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mona Aghdaee.

Ethics declarations

Funding

Open Access funding enabled and organized by CAUL and its Member Institutions. This project was funded by the Agency for Clinical Innovation, Sydney, NSW, Australia.

Conflicts of interest/competing interests

All authors have no conflicts of interest that are directly relevant to the contents of this study.

Ethics approval

Ethical approval for this study was obtained from the Macquarie University Ethics Committee on 28 May, 2020 (Reference number: 52020674715853).

Consent to participate

Informed consent was obtained from all individual participants included in the study.

Consent for publication

Not available.

Availability of data and material

Because of ethical concerns associated with sensitive data, supporting data are not available.

Code availability statement

The codes are available upon request to the corresponding author.

Author contributions

MA, YG, KS, BP, RS and HC contributed to the conception and design of this mapping study. MA conducted the statistical analysis. All the authors contributed to the interpretation of data, drafting the article, revising it critically for the intellectual content and final approval of the version to be published.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 586 kb)

Supplementary file2 (XLSX 233 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License, which permits any non-commercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc/4.0/.

Reprints and permissions

About this article

Cite this article

Aghdaee, M., Gu, Y., Sinha, K. et al. Mapping the Patient-Reported Outcomes Measurement Information System (PROMIS-29) to EQ-5D-5L. PharmacoEconomics 41, 187–198 (2023). https://doi.org/10.1007/s40273-022-01157-3

Download citation

Accepted: 10 May 2022
Published: 07 November 2022
Issue Date: February 2023
DOI: https://doi.org/10.1007/s40273-022-01157-3

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Mapping the Patient-Reported Outcomes Measurement Information System (PROMIS-29) to EQ-5D-5L

Abstract

Background and Objective

Methods

Results

Conclusions

Similar content being viewed by others

Mapping PROMIS Global Health Items to EuroQol (EQ-5D) Utility Scores Using Linear and Equipercentile Equating

Estimating EQ-5D utilities based on the Short-Form Long Term Conditions Questionnaire (LTCQ-8)

Mapping of Family Reported Outcome Measure (FROM-16) scores to EQ-5D: algorithm to calculate utility values

1 Introduction

2 Methodology

2.1 Source and Target Measures

2.1.1 Source Measure: PROMIS-29 (Profile V2.1)

2.1.2 Target Measure: EQ-5D-5L

2.1.3 Charlson Comorbidity Index

2.2 Estimation and Validation Samples

2.3 Statistical Analysis

2.4 Modelling Approach

2.5 Measures of Predictive Accuracy

3 Results

3.1 Descriptive Statistics

3.2 Model Comparison

3.3 Mapping Function

3.4 Comparison to Other Studies

4 Discussion

5 Conclusions

Notes

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Funding

Conflicts of interest/competing interests

Ethics approval

Consent to participate

Consent for publication

Availability of data and material

Code availability statement

Author contributions

Supplementary Information

Supplementary file1 (DOCX 586 kb)

Supplementary file2 (XLSX 233 kb)

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation