Introduction

Opioid use disorder (OUD) represents a significant public health challenge in the United States (US) as more than 700,000 people have died from an opioid overdose since 1999 [1]. Cost-effectiveness analysis of interventions to mitigate OUD-related health harms is an important tool for guiding public policy responses [2]. Cost-effectiveness studies typically rely on models that represent the condition’s clinical progression through different health states for which the health-related quality of life (HRQoL) has been previously estimated and expressed in terms of health state utility values (HSUVs) on a scale with reference points at 0 (dead) and 1 (perfect health) [3]. The process of estimating HSUVs, which are used to calculate quality-adjusted life years (QALYs) in cost-effectiveness models, is a challenging endeavor [4] and a recent study by Barbosa and colleagues observed that there is a dearth of evidence enabling their estimation for OUD [5]. When looking at the use of HSUVs in OUD models, Barbosa et al. showed that there was repeated use of one source of evidence across models in the published literature [6]. Despite the extensive use of these estimates, the values were elicited in a sample of British participants and so are not representative of the health preferences of the US, which is the focus of the current paper. HSUVs should ideally reflect the preferences of the jurisdiction under investigation given that there are important differences between values elicited in different countries [7]. More recently, a study by Wittenberg and colleagues elicited HSUVs in a US population-representative sample (n = 2,054) to determine their perception of the quality of life effects of OUD [8].

The primary objective of our study is to provide a new set of HSUVs for OUD in the US based on data collected in people experiencing the key health states of interest from multiple studies funded by the National Institute on Drug Abuse (NIDA) using an “off-the-shelf” value set associated with the descriptive system of the EQ-5D-3 L. While HRQoL data are routinely collected in NIDA-funded clinical trials [9], the instruments used vary across studies which undermines their comparability [10], potentially affecting cost-effectiveness assessments [11]. We resolved this issue by employing mapping algorithms to estimate HSUVs [12]. Having harmonized the heterogeneous HRQoL data, we estimated HSUVs for twelve OUD states, corresponding to different combinations of the following factors: opiate type, frequency and mode of administration, receipt of medications for OUD, and HIV diagnosis. The rationale for the selection of these factors was based on a review of cost-effectiveness models of interventions addressing OUD in the published literature, which found that they corresponded to the most frequently observed states with HSUVs assigned to them for the estimation of QALYs. To identify key drivers of HRQoL among people living with OUD, we investigated associations between these factors, as well as age, and HSUV estimates through conducting a series of regression analyses. A secondary objective of the paper is to explore the impact of the withdrawal symptoms associated with OUD when estimating HSUVs for OUD cost-effectiveness models. Despite being a fundamental concern for patients with OUD, no existing models have sought to reflect these symptoms, raising a concern about the representation of the experiences of people with OUD in cost-effectiveness models [13,14,15].

Methods

Evidence of requirements for economic models in OUD

Following recommended methods [4], we established which health states are most important in cost-effectiveness models for OUD. Existing cost-effectiveness models in OUD were examined to identify the defined health states that had HSUVs assigned to them for the estimation of QALYs. Existing studies were selected from systematic reviews of cost-effectiveness studies in OUD [5, 16, 17]. Cost-effectiveness studies released after the publication of the systematic reviews were also obtained by searching studies citing the reviews on Google Scholar. Table 1 presents a list of variables related to the health states represented in the models identified. The variable most frequently represented was reported (or suspected) extra-medical opioid use, followed by treatment with medications for OUD. Other key descriptors include drug-injecting behaviors, HIV infection status, and hepatitis C virus infection status.

Table 1 Variables associated with health state utility values in opioid use disorder (OUD) cost-effectiveness models

Data obtained from the NIDA data share initiative

Trial data were acquired from the NIDA website if they fulfilled a series of inclusion criteria. First, they needed to have data collected in patients receiving any of the following diagnoses: (i) opioid dependence according to the definition set out in the fourth edition of the Diagnostic and Statistical Manual of Mental Disorders [35], (ii) OUD according to the definition set out in the fifth edition of the Diagnostic and Statistical Manual of Mental Disorders [36], (iii) opioids as the substance of major drug use as determined by an interviewer, or (iv) receiving methadone for OUD. Second, they needed generic measures of HRQoL collected in these patient populations, which yielded data collected using the EQ-5D-3 L, the 12-Item Short Form Survey (SF-12) Version 1 and the SF-12 Version 2 measures. Additional searches were conducted to identify alternative HRQoL measures with a view to expanding the number of trials that could be included in the analysis. Alternative measures listed in the NIDA website were cross-referenced against a list of all existing mapping algorithms [37] but no measures were identified that could be linked to preference-based instruments via mapping algorithms. Finally, the trials needed to collect variables aligning with the primary health states of interest in OUD models, namely those relating to the receipt of medications for OUD and reported extra-medical opioid use. There were no variables that made a distinction between patients being either engaged in extra-medical opioid use or not. Instead, a continuous variable was chosen capturing the number of days of extra-medical opioid use in the past 30 days. For the latter, extra-medical opioid use was captured using self-reported responses to the Addiction Severity Index (ASI) assessment tool, in addition to data from urine drug screening tests. Finally, data on self-reported withdrawal symptoms, as measured by the Subjective Opiate Withdrawal Scale (SOWS), was obtained for study NCT02032433 to explore the consistency of the regression results when this was included as a covariate. For each of the trials obtained, the associated documentation was checked to identify variables in Table 1. Missing data.

A description of missing data in the studies can be found in Appendix A. For observations with partially missing HRQoL data, the mapping procedure, described in the next section, was used to predict EQ-5D-3 L scores based on the items where data were not available.

Methods for equating different health-related quality of life measures with external evidence

The EQ-5D-3 L was designated as the target measure for valuing HRQoL in this study for both evidential and practical reasons. First, previous research has validated its use in people with OUD through assessments of its content validity, construct validity, tests for evidence of floor and ceiling effects, and responsiveness [38]. Moreover, datasets were available that permitted the development of mapping algorithms between the EQ-5D-3 L and SF-12 measures. US population-based preference weights were assigned to EQ-5D-3 L responses to estimate health index values [39]. For studies that did not collect the EQ-5D-3 L, a two-step mapping procedure, shown in Fig. 1, was employed to predict HRQoL responses on the EQ-5D-3 L scale. The first step involved developing mapping algorithms to quantify the relationship between each version of the SF-12 and the EQ-5D-3 L. No studies were identified in the published literature mapping between the SF-12 Version 2 and the EQ-5D-3 L with US population values. Although previous studies have been conducted which map between the SF-12 Version 1 and the EQ-5D-3 L [40, 41], new mapping algorithms were developed to make use of recent methodological advances from the published literature. Methodological insights from the field of psychometrics research were integrated within a latent variable modeling framework to characterize the relationship between HRQoL measures in terms of a shared, latent factor, such that the different measures were assumed to capture alternative realizations of the same underlying health state [42, 43]. Separate mapping algorithms were required for the different versions of the SF-12 given that there are important differences in the item wording and response options [44, 45]. Two datasets from the Medical Expenditure Panel Survey [46, 47], collected in samples of the US general population, were obtained to equate different measures of HRQoL using a latent variable modeling framework. In accordance with the ‘Preferred Reporting Items for Studies Mapping onto Preference-based Outcome Measures: The MAPS Statement’ [48], details about the development of the mapping algorithms can be found in Appendices B and C in the supplementary materials.

Fig. 1
figure 1

Flow diagram showing two-step mapping procedure for equating health-related quality of life measures

Statistical analysis of health state utility data

A regression analysis was conducted by fitting a mixed-effects model to the pooled HSUV data using the glmmTMB package in R [49]. A beta-binomial distribution was assumed to account for the bounded nature of HSUVs [50]. Random effects were applied to the model intercepts to account for correlations between repeated observations on the same study participant at different points in time.

The main fixed effects in the model included three variables on the self-reported number of days of opioid use in the past 30 days, corresponding to the use of heroin, other opiates and methadone respectively. A dummy variable was also included to indicate whether participants reported injecting as their main route of opioid administration. Other main fixed effects included age, HIV diagnosis, and a dummy variable indicating whether participants were receiving ongoing treatment with medications for OUD, specifically either methadone or buprenorphine. Naltrexone was not considered due to its fundamentally different treatment mechanism as an opioid antagonist. We sought to distinguish between the two variables corresponding to methadone use. One refers to methadone use as recorded either in the treatment log or in the concomitant medication records. The second variable corresponds to the self-reported number of days of methadone use in the past 30 days in the Addiction Severity Index questionnaire. Exploratory analyses indicate that responses to the latter variable overwhelmingly refer to methadone use outside of medicated settings (see Table D3 in Appendix D).

Interactions between the main fixed effects variables corresponding to opioid use (medicated or non-medicated) were included to account for participants using multiple types of opioids. This is a critical issue given that individuals’ drug choices are influenced by the availability of drug types, as well as financial and social considerations [51]. As such, it is reasonable to expect that the relationship between each of these variables and HSUVs might vary according to the remaining drug use variables in the regression model. All predictor variables were retained in the model regardless of whether they were significant or not because the exclusion of predictors from cost-effectiveness models on the grounds of statistical significance could bias the results if these predictors have important impacts on costs and health consequences [52, 53]. The inclusion of non-significant predictors in the model ensures that uncertainty can be appropriately characterized in cost-effectiveness assessments for decision making, including the value of further data collection to resolve sample uncertainty [3]. A sensitivity analysis was conducted to explore differences in outcomes across the studies (see Appendix F).

A secondary analysis was conducted in a subset of the data (study NCT02032433) that included the SOWS measure as an additional covariate to quantify the detrimental health effects of withdrawal symptoms. One hundred and forty-one observations were dropped from the NCT02032433 data due to missing SOWS data. A regression analysis was then run on this data. Only one of the study participants in the NCT02032433 data had an HIV positive diagnosis so this variable was dropped from the analysis. Aside from this change and the additional inclusion of the SOWS variable, the model specification was the same as that in the main analysis.

The R script for all the analyses in this study can be found in the supplementary materials. Our interpretations of the regression analyses are concerned with three sets of outputs: (i) the direct outputs from the regression analysis which show parameter estimates on the beta-binomial scale, (ii) contrasts of the estimated marginal effects for predictor variables to provide a more tangible interpretation of the results [54, 55], and (iii) adjusted predictions of HSUVs for different health states of interest in OUD models.

Ethics approval

The Institutional Review Board of the University of California San Diego determined that an ethics review was not required because this study relied on the use of secondary de-identified data (Project #805,662).

Results

Datasets

Six studies meeting the inclusion criteria were identified through the NIDA Data Share Initiative. Table 2 provides information on the characteristics of these studies. These studies included data for five out of the nine variables listed in Table 1. In addition to the variables relating to the levels and types of extra-medical opioid use, the ASI captured information indicating whether participants reported injecting as their main route of opioid administration. One study (NCT01612169) captured data in people living with HIV and another (NCT02032433) collected information on participants’ HIV status at baseline. For the remaining datasets, study participants were assumed to be HIV negative. Two studies collected information on the presence of hepatitis C virus antibodies in study participants’ blood samples. Unfortunately, these data did not distinguish between active and resolved hepatitis C virus infection, so it was not possible to explore associations between HSUVs and the presence of active hepatitis C virus infection. In addition, data were not available to explore associations with either incarceration status, pregnancy, birth outcomes, or hypoxia. Table A1 in the supplementary materials describes the inclusion and exclusion of observations for the analysis. Just over 6% of observations were dropped, either due to a misalignment in the timing of the collection of variables (2.3%), a failure to provide any HRQoL responses (2.0%), missing responses for variables pertaining to extra-medical opioid use (1.1%), or because self-reported extra-medical opioid use conflicted with urine drug screen results (0.8%). The SOWS measure was collected in 91% observations in the NCT02032433 study.

Table 2 Characteristics of studies obtained from the NIDA Data Share Initiative

Descriptive statistics

Table 3 provides descriptive statistics on the demographic, behavioral, and treatment characteristics of study participants in the data analysis sample. These results show that there were relatively few data points where participants were receiving treatment with medications for OUD. Participants from study NCT00316277 had, on average, a shorter history of extra-medical opioid use and a lower prevalence of injecting drugs when compared to those in the other studies. We provide additional descriptive statistics (Tables D1 to D3) and figures (Figures D1 to D6) in Appendix D showing the different patterns of self-reported opioid use and treatment with medications for OUD to inform the choice of interactions between the main fixed effects variables.

Table 3 Demographic, behavioral, and treatment characteristics of study participants in the data analysis

Statistical analysis of health state utility data

The results from the main regression analysis showed that, on average, an additional day of heroin use in the past 30 days equates to a reduction in HSUVs of 0.002. This was smaller than the 0.003 reduction associated with an additional day of using other opiates, although people reporting heroin use were more likely to report injecting as their main route of administration compared to people reporting the use of other opiates (62% versus 18%), which is associated with an additional 0.043 reduction in health state utility. Table 4 shows contrasts from the main regression analysis and Table E1 in Appendix E shows results from the same analysis, with outputs on the beta-binomial scale. An HIV positive diagnosis was associated with a 0.074 reduction in HSUVs. The contrasts for the variables relating to heroin use and other opiate use were both found to be statistically significant. The interaction terms associated with these variables are retained in the model, despite them not being statistically significant, given that their exclusion on the grounds of statistical significance could potentially bias the cost-effectiveness results. The regression coefficients for the dummy variables indicating whether injecting was the main route of administration and identifying whether individuals had an HIV positive diagnosis were both statistically significant. All model coefficients related to the receipt of medications for OUD treatment were statistically non-significant, along with the corresponding HSUV contrast estimate.

Table 4 Estimated Health State Utilities’ (HSU) effects associated with contrasts for the predictor variables

HSUV predictions from the main regression analysis show that uncertainty surrounding estimates is much higher for predictor combinations involving medications for OUD treatment compared to those without medications for OUD treatment. This uncertainty may be partly attributed to the small number of data points that were collected in people receiving medications for OUD treatment. Table 5 shows HSUV predictions for different combinations of predictor values considered to be relevant to OUD models. We specify multiple scenarios to reflect the possibility of extra-medical opioid use occurring, at a reduced frequency, alongside medications for OUD treatment, which are denoted by the health states numbered 3, 4, 9, and 10.

Table 5 Predicted Health State Utility (HSU) values for health states of interest in opioid use disorders (OUD) models

We found that age-adjusted HSUV predictions for the health states with no extra-medical opioid use were very similar to age-adjusted HSUV norms for the general population and for people living with HIV from the published literature [56, 57]. This finding is illustrated in Fig. 2, which shows the mean HSUV predictions for each of the health states specified in Table 5 with additional adjustments for age. The dashed lines represent the age-adjusted HSUV norms (i.e. mean values) for general and populations living with HIV from the published literature. As such, predictions under the dashed lines fall below the age-adjusted norms. The HSUV predictions for people not engaging in extra-medical opioid use increasingly exceed HSUV norms among the higher age groups. The generalizability of the predictions for people older than 60 years old is doubtful given that this age group constituted only 1% of the sample data.

Fig. 2
figure 2

health state utility prediction from main analysis. The solid lines correspond to the various health states of interest in OUD cost-effectiveness models. The dotted lines reflect HSUV estimates from the published literature for the general population in the plot on the left and for populations living with HIV in the plot on the right

In the sensitivity analysis to explore differences in outcomes across the studies, the model coefficients for study membership were statistically non-significant. Furthermore, the inclusion of study-specific dummy variables did not yield any changes in the mean predicted HSUVs larger than 0.031. Results from the sensitivity analysis can be found in Appendix F.

The results from the secondary analysis suggest that withdrawal symptoms are a fundamental driver of health state utilities in people with OUD. The coefficient for the SOWS measure was found to be statistically significant and a one-point change on this measure was associated with a 0.0049 reduction in HSUV (results in Table 6). The coefficients for the other predictors were all statistically non-significant (results shown in Table E2 in Appendix E). The HSUV predictions from the secondary analysis exhibit much more variability than those from the main analysis. Table 7 shows HSUV predictions from the secondary analysis with stratification of predictions according to different levels of heroin use and different categories of withdrawal symptoms (where the rows labelled A, B, and C refer to moderate, severe, and worst possible symptoms).

Table 6 Estimated Health State Utilities’ (HSU) effects associated with contrasts for the predictor variables in the secondary analysis
Table 7 Predicted Health State Utility (HSU) values for health states while controlling for subjective opiate withdrawal scale (SOWS) score

Discussion

The results in this study show that HSUVs estimated among people with OUD are negatively associated with increased extra-medical opioid use, drug injecting, HIV positive diagnosis and increased age. The observed associations between HSUVs and treatment with medications for OUD are not statistically significant. However, HRQoL gains from medications for OUD treatment could still be achieved indirectly in a cost-effectiveness model through reductions in non-medicated opioid use or from reductions in the transmission of HIV. Regardless of these mechanisms for HRQoL gains, previous studies have shown that overdose mortality is the primary driver of QALY calculations in OUD rather than changes in HRQoL [18, 34].

This study expands on previous research by using responses from patients experiencing the key health states of interest rather than through vignette studies conducted in samples of the general population [6, 8]. The HSUVs estimated in this study are higher than those for equivalent health states derived in the vignette studies. For instance, HSUV predictions from the main analysis were 0.799 for people without HIV who inject heroin on a daily basis, compared to mean estimates of 0.588 and 0.574 in the studies by Connock et al. and Wittenberg et al., respectively [6, 8]. The lower estimates observed in the vignette studies may be attributable to the use of condition-specific labels in the valuation exercises, as opposed to the “generic” health dimensions of the EQ-5D-3 L.

Strengths and limitations

One advantage of estimates in this study is that they reflect values from the US EQ-5D-3 L tariff and facilitate the comparable measurement of health benefits across studies using this instrument. This comparability helps to promote consistency in the use of cost-effectiveness evidence to inform policy decisions [58], which is especially important given the popularity of the EQ-5D [59]. Some further strengths of this study include its alignment with the needs of cost-effectiveness models in OUD, the systematic effort to make comprehensive use of relevant evidence in keeping with the principles of evidence-based medicine [60], and the use of a generic preference-based HSUV measure that has been validated in OUD [38].

One limitation of our study was that the data did not permit a delineation of the different phases of treatment with medications for OUD (i.e. induction/starts, stabilization, and maintenance), which could have important implications for HRQoL. Our results found that HSUV predictions involving treatment with medications for OUD varied depending on the level of extra-medical opioid use. Previous research has shown that the gains in HRQoL associated with medications for OUD treatment are modest over the short-term [61] and do not persist over the long-term [62]. Another study found a stronger negative association between increased opioid use (non-medicated) and social domains of HRQoL when participants were enrolled in treatment compared to those who had discontinued treatment [63].

The outputs from this work may also be inadequate for researchers requiring some specific alternative preference-based measure, such as the EQ-5D-5 L, and additional mappings would be required to produce estimates for such a measure. It is important to acknowledge the potential drawbacks of this study in using responses from patients experiencing the key health states of interest. With any health condition, there is potential for patients to adapt to their health state and, consequently, their responses to HRQoL questionnaires may change over time [64, 65]. Whether adaptation should be viewed as a concern is disputed as the argument has been used both for and against the use of experienced responses [66]. Another limitation is that the representation of drug types (i.e. heroin and ‘other opiates’) may be considered outdated given that the illegal supply of opioids is increasingly composed of synthetic variants, such as fentanyl. These new variants have been found to induce different forms of craving and withdrawal compared to heroin, as well as presenting new challenges in treatment with medications for OUD [67,68,69]. This shortcoming points to the need for data collection in patients using synthetic opioids to quantify their impacts on HRQoL. Our study also lacked the data needed to estimate statistical relationships between HSUVs and hepatitis C virus status, a highly prevalent infection among people who inject drugs. Similarly, our study lacked data related to incarceration status, pregnancies, birth outcomes, and hypoxia, which are potentially relevant states in cost-effectiveness models for OUD.

The findings are limited due to the reliance on trial data. Research has shown that participants from diverse or marginalized backgrounds are underrepresented in treatment trials [70]. The relevance of trial results can also be compromised by a ‘comorbidity gap’, when the incidence of comorbidities is underrepresented in trial participants when compared to the real-world population of interest [71]. This issue might explain why the age-specific HSUV predictions in Fig. 2 do not align with the HSUV norms for people aged fifty and older given that comorbidities disproportionately affect the recruitment of older participants into clinical trials [72]. The use of evidence from patient registries has received increased interest as researchers strive to incorporate data with more representative patient populations and routine care pathways [73, 74]. As such, this may represent a practical solution to ensure that future studies include a representative case mix of participants.

Implications for future research

This study provides crucial evidence inputs for the estimation of QALYs in economic evaluations of interventions for OUD in a US setting. Ultimately, these estimates could give rise to different cost-effectiveness results and different policy decisions compared to the previous vignette studies because of the higher values for equivalent health states. Findings from the secondary analysis indicate that the explicit measurement of withdrawal symptoms in cost-effectiveness models might be an important line of pursuit for future research. The notion also relates to a broader research question to understand how measures of anhedonia, which is the inability to experience pleasure and one of the main characteristics of opioid withdrawal, might contribute to the development of evidence-based practices in the treatment in OUD [75]. The standard methodology in previous modeling studies has been to select intermediate endpoints – typically treatment retention and extra-medical drug use, both of which can be commonly found in clinical trials – to estimate the costs and QALYs associated with alternative policies [5, 16, 17]. This pragmatic approach to model development does not guarantee a conceptual model that accurately represents the experiences of people with lived experience of the condition under evaluation, which can lead to a misrepresentation of the key decision points [76]. This finding echoes calls for researchers to actively engage people with lived experience to ensure the models provide an appropriate representation of their “preferences, expectations and expanded definitions of what constitutes “successful” outcomes” [77, 78].

Conclusion

This study presents an analysis of HSUV data from patients experiencing key OUD health states in six clinical trials for cost-effectiveness models of OUD. We observed that HSUVs were negatively associated with increased extra-medical opioid use, drug injecting, an HIV positive diagnosis and increased age. The results yielded HSUVs that were higher than those conducted in vignette studies, where samples of the general population were asked to value descriptions of defined OUD health states.