Depression is associated with considerable impairments in health-related quality-of-life. However, the relationship between different health states related to depression severity and utility scores is unclear. The aim of this study was to evaluate whether utility scores are different for various health states related to depression severity.
We gathered individual participant data from ten randomized controlled trials evaluating depression treatments. The UK EQ-5D and SF-6D tariffs were used to generate utility scores. We defined five health states that were proposed from American Psychiatric Association and National Institute for Clinical Excellence guidelines: remission, minor depression, mild depression, moderate depression, and severe depression. We performed multilevel linear regression analysis.
We included 1629 participants in the analyses. The average EQ-5D utility scores for the five health states were 0.70 (95% CI 0.67–0.73) for remission, 0.62 (95% CI 0.58–0.65) for minor depression, 0.57 (95% CI 0.54–0.61) for mild depression, 0.52 (95%CI 0.49–0.56) for moderate depression, and 0.39 (95% CI 0.35–0.43) for severe depression. In comparison with the EQ-5D, the utility scores based on the SF-6D were similar for remission (EQ-5D = 0.70 vs. SF-6D = 0.69), but higher for severe depression (EQ-5D = 0.39 vs. SF-6D = 0.55).
We observed statistically significant differences in utility scores between depression health states. Individuals with less severe depressive symptoms had on average statistically significant higher utility scores than individuals suffering from more severe depressive symptomatology. In the present study, EQ-5D had a larger range of values as compared to SF-6D.
Depression is a common mental disorder with a 12-month prevalence of 5.3% and a lifetime prevalence of 13.2% . It is expected to rank first in terms of disability-adjusted life-years (DALYs) in high-income countries by 2030 [1–3]. Depression constitutes an enormous societal cost due to increased absenteeism, decreased work performance, and high healthcare utilization of depressed individuals [4, 5]. Moreover, it is related to considerable reductions in health-related quality-of-life (HRQoL) [6, 7]. Impairments in HRQoL are seen in mental, physical, and social functioning and negatively affect various aspects of the individual’s daily life .
HRQoL can be expressed as a utility score that represents the relative societal desirability of a particular health state that is anchored by 0 (death) and 1 (perfect health) . Utility scores are most commonly estimated using an indirect method, meaning that participants fill out a HRQoL questionnaire and then an algorithm is used to convert the participant’s health state into a utility score [10, 11]. These utility scores are often used to generate Quality-Adjusted Life-Years (QALYs) . A QALY is a measure that combines quality and quantity of life lived and is calculated by multiplying the utility score by the amount of time a participant spent in a particular health state. Many national guidelines for economic evaluations, for example in the Netherlands and the United Kingdom, recommend using QALYs in economic evaluations because they allow for comparisons across different treatments and health problems .
Depression severity can be categorized in different health states, namely remission and minor, mild, moderate, and severe depression. The association between health states related to depression severity and utility scores is not yet well researched. Two studies have examined the relationship between utility scores and different health states [12, 13]. They showed that depression has a considerable impact on utility scores, where more severe depression was associated with lower utility scores.
However, the aforementioned studies have several limitations. First, participants in both studies received antidepressant medication [12, 13]. Consequently, the findings may not be generalizable to individuals with depressive symptoms receiving no treatment or other types of treatment such as psychotherapy or combined treatments. Secondly, relatively small sample sizes were used in the analyses (n = 70 and n = 447, respectively) [12, 13]. Thus, these two studies could be underpowered to detect small but statistically significant differences in utility scores between health states. Furthermore, one study used the EuroQoL- 5 Dimensions (EQ-5D)  to estimate utility scores and the other the Short-Form (SF)- 6 Dimensions . However, it has been demonstrated that there are discrepancies between utility scores derived from the EQ-5D and the SF-6D leading to higher EQ-5D utility scores for healthier groups and higher SF-6D utility scores for less healthy groups [16, 17].
A recent meta-analysis aimed to pool the utility scores from different studies for three depression states (i.e., mild, moderate, and severe) . In total, the results from three studies were pooled regarding EQ-5D utilities. The authors indicated that milder depressive symptoms were related to increased utility scores . However, the number of included studies was limited and the statistical heterogeneity was considerable. Thus, the results may have limited generalizability and their validity may not be established.
Given the above, we included a large, representative sample of participants with depression, receiving interventions or being in control groups. Subsequently, we aimed to establish the utility scores, generated separately from EQ-5D and SF-6D tariffs for different health states related to depression severity. Secondly, we aimed to compare utility scores between clinically relevant depression health states. We hypothesized that the more severe health states would be related to lower utility scores and vice versa. The final objective was to investigate for potential differences between EQ-5D and SF-6D utility scores. We hypothesized that EQ-5D utility scores will have a wider range of values compared to the SF-6D [16, 17].
We carried out an individual participant data meta-analysis to estimate utility scores per predefined health state and to compare utility scores between health states. We performed a search in PubMed to identify relevant studies, using terms indicative for depressive symptoms, treatment for depression, quality-of-life, and randomized controlled trials. We did not aim to conduct a systematic review because this was not necessary for answering our research questions.
Two researchers examined the eligibility of the identified studies (SK and JEB). RCTs were eligible if they (a) included participants with a diagnosis of a depressive disorder based on a structured clinical interview, or participants with elevated depressive symptomatology based on a standardized measure of depressive symptom severity, (b) compared a treatment for depression with a control condition (i.e., care as usual or a waiting list group), (c) administered the EQ-5D-3L and/or SF-12 or SF-36 as a measure of HRQoL, (d) included a measure of depressive symptom severity (e.g., PHQ-9), and (e) were conducted in the Netherlands (to facilitate data sharing).
Data extraction and preparation
We contacted the authors of RCTs that satisfied our inclusion criteria and asked them permission to access their primary datasets. The authors signed a data sharing agreement that we provided. Data concerning participant registration number, gender, age, relationship status (not married/divorced/widowed, or married/living together with a partner), treatment group (intervention or control), education level (low, medium, high), comorbidity (study included exclusively participants with depression and another comorbid condition), and HRQoL and depression severity scores for all available measurements (i.e., baseline and follow-ups) were requested from the authors. All acquired data were strictly anonymous and it was not possible to track the identity of any of the participants. After receiving the primary datasets, we combined them in one database. Two researchers extracted the data from the primary datasets independently (SK and BvE or DM).
Utility scores were calculated using the UK EQ-5D and SF-6D tariffs (there are no Dutch tariffs available for SF-6D) [19, 20]. Included studies used different measures to monitor depressive symptom severity (Table 1). We used cut-off scores obtained from the literature for each of these measures to define the participant’s health state. The cut-off scores and the range of the questionnaires are reported in detail in Table 1. In accordance with the American Psychiatric Association (APA)  and National Institute for Clinical Excellence (NICE)  guidelines, we differentiated between five health states: remission (no or minimal depressive symptoms and no specific concern for clinical depression), minor depression (subthreshold/subclinical depression), and mild depression, moderate depression, and severe depression (three different severity levels of clinical depression).
We performed the analyses using the combined database. We used descriptive statistics to describe the demographic characteristics of the participants. To estimate the utility scores for each health state, we used a multilevel linear regression model in which we accounted for observations nested within participants and participants nested within studies (i.e., three-level structure). We used the default maximum-likelihood approach implemented in MLwiN . Separate analyses were carried out for the EQ-5D and SF-6D. The utility scores were the dependent variables, and four dummy variables representing the five health states were the independent variables. Based on the literature, we added the variables comorbidity, gender, age, relationship status, randomization group, and education level to the model to examine possible confounding effects. To determine whether there was confounding, we used the ‘rule of thumb’ of 10% change in the random coefficients between the model without covariates (crude model) and the model with covariates (adjusted model) [24, 25]. We also carried out a linear regression analysis without taking into account the hierarchical structure of the data (‘baseline model’). Statistical significance was set at p < .05.
We performed two sensitivity analyses. First, we repeated the analyses using the Dutch EQ-5D tariffs (there are no Dutch tariffs for SF-6D) , because we wanted to investigate whether our conclusions remain the same when using population preference values from different countries [27, 28]. In the second sensitivity analysis, we included only the baseline measurements from EQ-5D and SF-36 to calculate the mean utility scores for each health state. The main analyses included all the measurements of the participants (i.e., baseline and follow-ups) and, even though we controlled for this in the multilevel analysis, it is possible that it could influence our estimates.
Characteristics of included studies
We included ten studies with 1629 participants. All studies were conducted in the Netherlands and are presented in detail in Table 2. Four of them evaluated psychological treatments as an intervention (i.e., interpersonal psychotherapy, problem solving treatment, and cognitive behavioral therapy), two evaluated collaborative care (i.e., combination of general practitioner, psychiatrist, psychotherapist, and depression care manager), two stepped care (i.e., watchful waiting, activity scheduling, life review and consultation, and general practitioner), one disease management (i.e., general practitioner screening and consultation), and one medication and care as usual (i.e., antidepressants, consultation and information on depression). As a comparator, eight studies included care as usual and two used waiting list groups. Four studies included participants with depression and another comorbid condition.
Depression measures included Center for Epidemiologic Studies Depression Scale (CES-D, n = 3), Montgomery–Åsberg Depression Rating Scale (MADRS, n = 3), Patient Health Questionnaire (PHQ-9, n = 2), Inventory of Depressive Symptomatology Self-Report (IDS-SR, n = 1), and Hospital Anxiety and Depression Scale (HADS-D, n = 1). All studies administered the EQ-5D-3L and five of them also administered the SF-36.
Characteristics of participants
The demographic characteristics of the participants are presented in Table 3. From the 1629 participants, 856 had been randomized to an intervention group and 773 to a control group. Furthermore, 1087 participants were female (67%). The mean age was 56 years (SD = 18) and 720 participants (49%) were married or lived together with a partner (Table 3). Also, 569 participants (35%) had a lower education level (basic education or elementary school), 488 (31%) had an intermediate education level (high school or 12 years of education), and 536 (34%) had a higher education level (education after high school or university level degree).
EQ-5D utility scores
We included 4979 observations in the analyses. Table 4 presents the average utility scores from the adjusted model based on the EQ-5D and the mean differences between the health states. The average utility scores in the adjusted model were 0.70 (95% CI 0.67–0.73) for remission, 0.62 (95% CI 0.58–0.65) for minor depression, 0.57 (95% CI 0.54–0.61) for mild depression, 0.52 (95% CI 0.49–0.56) for moderate depression, and 0.39 (95%CI 0.35–0.43) for severe depression.
The mean utility scores were statistically significantly different between all five health states. The largest mean difference was found between remission and severe depression (−0.34, 95% CI −0.37 to −0.30). The smallest mean difference was found between minor and mild depression (−0.04, 95% CI −0.07 to −0.02). The covariates including comorbidity, age, gender, relationship status, randomization group, and education level were included in the adjusted model but the random coefficients of health states did not change by more than 10% (see Supplementary material, Table S1). Thus, we inferred that these covariates did not confound our estimations.
SF-6D utility scores
We used SF-6D utility scores as the dependent variable and included 1726 observations (Table 4). The average utility scores in the adjusted model were 0.69 (95% CI 0.67–0.71) for remission, 0.63 (95% CI 0.61–0.66) for minor depression, 0.59 (95% CI 0.58–0.62) for mild depression, 0.56 (95% CI 0.54–0.59) for moderate depression, and 0.55 (95% CI 0.53–0.57) for severe depression.
The mean differences in utility scores between the health states were statistically significant, except for the difference between moderate and severe depression (−0.01, 95% CI −0.03–0.00). We did not detect any confounding of our estimations (see Supplementary material, Table S1).
Differences between EQ-5D and SF-6D utility scores
Overall, the mean differences in utility scores between the health states were larger for EQ-5D than for SF-6D. The SF-6D in comparison with the EQ-5D showed a smaller range of utility scores (Table 4). In particular, it generated slightly lower utility scores for participants in less severe health states, and higher scores for more severe health states such as severe depression (EQ-5D = 0.39 vs. SF-6D = 0.55) (Table 4).
We calculated the utility scores using the Dutch EQ-5D tariffs. The average utility scores for the adjusted model were 0.73 (95% CI 0.69–0.77) for remission, 0.63 (95% CI 0.59–0.67) for minor depression, 0.58 (95% CI 0.54–0.62) for mild depression, 0.51 (95% CI 0.47–0.55) for moderate depression, and 0.37 (95%CI 0.33–0.41) for severe depression.
The sensitivity analysis using only baseline scores for EQ-5D (UK tariffs) included 1453 observations. The mean utility scores of the adjusted model were 0.72 (95% CI 0.63–0.80) for remission, 0.62 (95% CI 0.55–0.69) for minor depression, 0.55 (95% CI 0.47–0.62) for mild depression, 0.47 (95% CI 0.40–0.55) for moderate depression, and 0.30 (95% CI 0.23–0.38) for severe depression.
The sensitivity analysis using only baseline scores for SF-6D included 520 observations. The average utility scores of the adjusted model were 0.68 (95% CI 0.65–0.72) for remission, 0.63 (95% CI 0.60–0.63) for minor depression, 0.57 (95% CI 0.54–0.60) for mild depression, 0.55 (95% CI 0.52–0.58) for moderate depression, and 0.54 (95% CI 0.49–0.58) for severe depression.
The present study estimated utility scores derived from the EQ-5D and the SF-6D for five health states related to depression severity using individual participant data from ten clinical trials. The results demonstrated that utility scores differed statistically significant between the health states, and that less severe health states were associated with higher utility scores.
There are some differences between the utility scores that we found for each health state and those reported in the literature. For instance, the average utility score for remission (0.69–0.71) was somewhat lower than those reported in other studies (0.72–0.86) [12, 13]. In addition, the average utility score for severe depression in our study (0.39–0.55) was higher as compared to the previous findings (0.27–0.30) [12, 13]. These differences may be explained by the differences in design and methodology between the present and the previous studies. To illustrate, one of the previous studies evaluated depression severity based on physician’s judgment in combination with the Clinical Global Impression Improvement Scale (CGI-I) , while in the other study the participants evaluated hypothetical health states related to depression severity . Furthermore, these studies were conducted in the US (using SF-6D) and Sweden (using EQ-5D) and used different tariffs to calculate utility scores.
The average utility score for remission found in our study was lower than the average utility score of the general population (between 0.76 and 0.87) [17, 29, 30]. This finding is in line with the literature, indicating that individuals in remission from depression may suffer from residual impairments in HRQoL [31, 32]. Thus, evaluating treatment success based on remission of depression symptoms alone may be too restrictive. Improvements in HRQoL may take longer to occur and should, therefore, be monitored after remission of depressive symptoms as well [31, 33].
There is doubt in the literature regarding the performance of EQ-5D and SF-6D in detecting small but important changes in utility values [34–36]. Although we showed that most of the differences in utility scores between the five health states were statistically significant, it is important to examine whether the detected differences are also clinically relevant . Clinical relevance can be defined as the minimum change in an outcome that is perceived by the individuals as relevant and beneficial and has a notable effect on their daily life . A previous systematic review showed that a clinically relevant change in utility score for the EQ-5D ranged between 0.01 and 0.14 (mean = 0.07) and for the SF-6D between 0.01 and 0.10 (mean = 0.04) . Therefore, most of the mean differences in EQ-5D and SF-6D utility scores between the health states in our study appeared clinically relevant. There was no evidence that the differences between minor and mild, and mild and moderate depression for EQ-5D utility scores were clinically relevant. Similarly, for SF-6D utility scores, the differences between mild and moderate, and moderate and severe depression did not appear clinically relevant.
Our results are in line with the previous findings indicating that the EQ-5D generates higher utility scores than the SF-6D among healthier participants and lower scores for less healthier participants (i.e., severe depression) [17, 40]. This discrepancy has been addressed before [36, 41–43] and has been attributed to the different scoring algorithms, number of possible health states, and the approach through which the utility scores are generated . It is, therefore, important to consider that utility scores for health states related to depression severity are dependent on the measure from which they are generated.
The sensitivity analyses that we conducted demonstrated the robustness of our results. In particular, when we used the Dutch EQ-5D tariffs, the mean utility scores changed slightly, as it was expected. Nevertheless, in line with the main analyses, participants with more severe depressive symptoms had on average lower utility scores. Similarly, when we included only the baseline measurements in our models, the mean utility scores were in accordance with those in the main analyses.
Strengths and limitations
To our knowledge, this is the first study examining the relationship between utility scores and health states related to depression that includes a large sample size providing sufficient statistical power. Furthermore, we performed the analyses using multilevel modeling, which is considered the most appropriate approach to analyze hierarchically structured data and takes into account potential differences between the included studies . We used utility scores generated from both the EQ-5D-3L and the SF-36. Finally, we provided mean utility scores for the five health states related to depression severity that are recommended by APA and NICE [21, 22].
In model-based economic evaluations, health economic models are used to examine the long-term cost-utility of interventions for depression. Utility scores are typically included in these models to calculate QALYs. Previous model-based studies populated their models with utility scores selected from only a limited set of studies . Our study shows some important advantages over previous studies [12, 13] (e.g., large sample size, health states based on national guidelines) and our findings can be used to populate health economic models with more confidence. For instance, we intend to use the results of this study to populate a health economic model examining the cost-effectiveness of a “blended” (face-to-face and Internet-based) treatment for depression, which is part of the E-COMPARED project funded under the Seventh Framework Program .
The present study is not without limitations. We used different measures of depressive symptom severity to define the health states related to depression severity, while cut-off scores for depression were based on the literature . However, some studies in the literature reported different cut-off scores for the same instruments. The combination of different measures and the employment of cut-off scores could potentially lead to overlapping health states. Nevertheless, as reflected by the clinically relevant and statistically significant mean differences in the utility scores, the health states were a reliable representation of depressive symptom severity.
We used tariffs generated from a UK population to calculate the utility scores. It is possible that utility scores would be slightly different if we would have used studies and population preference values from other countries [27, 28]. However, the sensitivity analyses using Dutch tariffs for EQ-5D showed that our findings are robust. Similarly, using the EQ-5D-5L, which was recently introduced, may result in different outcomes . Furthermore, the mean age of our sample was somewhat high (i.e., 56 years old). Older age can be a factor related to lower HRQoL, but we statistically controlled for it without finding any statistically significant associations. In addition, comorbidity is very common for patients with depression and it may be related to impairments in HRQoL [49, 50]. We statistically controlled for comorbidity but we did not find any significant relationship. Finally, comorbidity and age of participants do not seem a threat to the validity of our estimations because the mean utility scores were similar or higher than utility scores reported for participants with depression in other studies.
We demonstrated that there are statistically significant and clinically relevant differences in utility scores between the health states. Particularly, individuals with less severe depressive symptoms had on average higher utility scores than individuals with more severe depressive symptomatology. Considering that individuals in remission from depression had on average lower utility scores than the general population, it is important to take into account HRQoL as an outcome of depression treatments. Differences between EQ-5D and SF-6D utility scores, and particularly the larger range of EQ-5D values, need to be considered for future economic evaluations and health economic models.
Hasin, D. S., Goodwin, R. D., Stinson, F. S., et al. (2005). Epidemiology of major depressive disorder: Results from the national epidemiologic survey on alcoholism and related conditions. Archives of General Psychiatry, 62(10), 1097–1106.
Waraich, P., Goldner, E. M., Somers, J. M., et al. (2004). Prevalence and incidence studies of mood disorders: A systematic review of the literature. Canadian Journal of Psychiatry. Revue Canadienne de Psychiatrie, 49(2), 124–138.
Mathers, C. D., & Loncar, D. (2006). Projections of global mortality and burden of disease from 2002 to 2030. PLoS Medicine, 3(11), e442.
Olesen, J., Gustavsson, A., Svensson, M., et al. (2012). The economic cost of brain disorders in Europe. European Journal of Neurology: The Official Journal of the European Federation of Neurological Societies, 19(1), 155–162.
Wittchen, H. U., Jacobi, F., Rehm, J., et al. (2011). The size and burden of mental disorders and other disorders of the brain in Europe 2010. European Neuropsychopharmacology: The Journal of the European College of Neuropsychopharmacology, 21(9), 655–679.
Saarni, S. I., Suvisaari, J., Sintonen, H., et al. (2007). Impact of psychiatric disorders on health-related quality of life: General population survey. The British Journal of Psychiatry: The Journal of Mental Science, 190, 326–332.
Saarni, S. I., Viertio, S., Perala, J., et al. (2010). Quality of life of people with schizophrenia, bipolar disorder and other psychotic disorders. The British Journal of Psychiatry: The Journal of Mental Science, 197(5), 386–394.
Papakostas, G. I., Petersen, T., Mahal, Y., et al. (2004). Quality of life assessments in major depressive disorder: A review of the literature. General Hospital Psychiatry, 26(1), 13–17.
Torrance, G. W. (1986). Measurement of health state utilities for economic appraisal. Journal of Health Economics, 5(1), 1–30.
Sassi, F. (2006). Calculating QALYs, comparing QALY and DALY calculations. Health Policy and Planning, 21(5), 402–408.
Whitehead, S. J., & Ali, S. (2010). Health outcomes in economic evaluation: The QALY and utilities. British Medical Bulletin, 96, 5–21.
Revicki, D. A., & Wood, M. (1998). Patient-assigned health state utilities for depression-related outcomes: Differences by depression severity and antidepressant medications. Journal of Affective Disorders, 48(1), 25–36.
Sobocki, P., Ekman, M., Agren, H., et al. (2007). Health-related quality of life measured with EQ-5D in patients treated for depression in primary care. Value in Health: The Journal of the International Society for Pharmacoeconomics and Outcomes Research, 10(2), 153–160.
Group, E (1990). EuroQol–a new facility for the measurement of health-related quality of life. Health Policy (Amsterdam, Netherlands), 16(3), 199–208.
Ware, J. E. Jr., & Sherbourne, C. D. (1992). The MOS 36-item short-form health survey (SF-36). I. Conceptual framework and item selection. Medical Care, 30(6), 473–483.
Chen, J., Wong, C. K., McGhee, S. M., et al. (2014). A comparison between the EQ-5D and the SF-6D in patients with chronic obstructive pulmonary disease (COPD). PLoS ONE, 9(11), e112389.
Kontodimopoulos, N., Pappa, E., Papadopoulos, A. A., et al. (2009). Comparing SF-6D and EQ-5D utilities across groups differing in health status. Quality of Life Research: An International Journal of Quality of Life Aspects of Treatment, Care and Rehabilitation, 18(1), 87–97.
Mohiuddin, S., & Payne, K. (2014) Utility values for adults with unipolar depression systematic review and meta-analysis. Medical Decision Making, 34(5) 666–685.
Kind, P., Hardman, G., & Macran, S. (1999). UK population norms for EQ-5D. Vol. 172. Centre for Health Economics, University of York York.
Brazier, J., Usherwood, T., Harper, R., et al. (1998). Deriving a preference-based single index from the UK SF-36 Health Survey. Journal of Clinical Epidemiology, 51(11), 1115–1128.
Rush, A., First, M., & Blacker, D. (2008). American Psychiatric association: Task force for the handbook of psychiatric measures. Handbook of psychiatric measures Arington: American Psychiatric Pub.
National Collaborating Centre for Mental Health. (2010). Depression in adults with a chronic physical health problem. British Psychological Society, UK.
Rasbash, J., C. Charlton, W.J. Browne, et al. (2009). MLwiN Version 2.1, in Centre for Multilevel Modelling: University of Bristol.
Maldonado, G., & Greenland, S. (1993). Simulation study of confounder-selection strategies. American Journal of Epidemiology, 138(11), 923–936.
Grayson, D. (1987). Confounding confounding. American Journal of Epidemiology, 126(3), 546–553.
Lamers, L. M., Stalmeier, P. F., McDonnell, J., et al. (2005). Measuring the quality of life in economic evaluations: The Dutch EQ-5D tariff]. Nederlands Tijdschrift Voor Geneeskunde, 149(28), 1574–1578.
Galante, J., Augustovski, F., Colantonio, L., et al. (2011). Estimation and comparison of EQ-5D health states’ utility weights for pneumococcal and human papillomavirus diseases in Argentina, Chile, and the United Kingdom. Value in Health: The Journal of the International Society for Pharmacoeconomics and Outcomes Research, 14(5 Suppl 1), S60–S64.
Badia, X., Roset, M., Herdman, M., et al. (2001). A comparison of United Kingdom and Spanish general population time trade-off values for EQ-5D health states. Medical Decision Making: An International Journal of the Society for Medical Decision Making, 21(1), 7–16.
Luo, N., Johnson, J. A., Shaw, J. W., et al. (2005). Self-reported health status of the general adult U.S. population as assessed by the EQ-5D and Health Utilities Index. Medical Care, 43(11), 1078–1086.
Prigent, A., Auraaen, A., Kamendje-Tchokobou, B., et al. (2014). Health-related quality of life and utility scores in people with mental disorders: A comparison with the non-mentally ill general population. International Journal of Environmental Research and Public Health, 11(3), 2804–2817.
IsHak, W. W., Greenberg, J. M., Balayan, K., et al. (2011). Quality of life: The ultimate outcome measure of interventions in major depressive disorder. Harvard Review of Psychiatry, 19(5), 229–239.
Zimmerman, M., Chelminski, I., McGlinchey, J. B., et al. (2006). Diagnosing major depressive disorder VI: Performance of an objective test as a diagnostic criterion. The Journal of Nervous and Mental Disease, 194(8), 565–569.
Kolovos, S., Kleiboer, A., & Cuijpers, P. (2016). The effect of psychotherapy for depression on quality of life: A meta-analysis. British Journal of Psychiatry, 209, 1–9.
Brazier, J., Deverill, M., Green, C., et al. (1999). A review of the use of health status measures in economic evaluation. Health Technology Assessment Health Technol Assess (Rockv) Health Technology Assessment (Winchester, England), 3(9), 1–164.
Longworth, L., & Bryan, S. (2003). An empirical comparison of EQ-5D and SF-6D in liver transplant patients. Health Economics, 12(12), 1061–1067.
Brazier, J., Connell, J., Papaioannou, D., et al. (2014). A systematic review, psychometric analysis and qualitative assessment of generic preference-based measures of health in mental health populations and the estimation of mapping functions from widely used specific measures. Health Technology Assessment Health Technol Assess (Rockv) Health Technology Assessment (Winchester, England), 18(34), 1–188 .
Osoba, D., Rodrigues, G., Myles, J., et al. (1998). Interpreting the significance of changes in health-related quality-of-life scores. Journal of Clinical Oncology: Official Journal of the American Society of Clinical Oncology, 16(1), 139–144.
Jaeschke, R., Singer, J., & Guyatt, G. H. (1989). Measurement of health status. Ascertaining the minimal clinically important difference. Controlled Clinical Trials, 10(4), 407–415.
Walters, S. J., & Brazier, J. E. (2005). Comparison of the minimally important difference for two health state utility measures: EQ-5D and SF-6D. Quality of life Research: An International Journal of Quality of Life Aspects of Treatment, Care and Rehabilitation, 14(6), 1523–1532.
Barton, G. R., Sach, T. H., Avery, A. J., et al. (2008). A comparison of the performance of the EQ-5D and SF-6D for individuals aged > or = 45 years. Health Economics, 17(7), 815–832.
Bryan, S., & Longworth, L. (2005). Measuring health-related utility: why the disparity between EQ-5D and SF-6D? The European Journal of Health Economics: HEPAC: Health Economics in Prevention and Care, 6(3), 253–260.
Rowen, D., Brazier, J., & Roberts, J. (2009). Mapping SF-36 onto the EQ-5D index: How reliable is the relationship? Health and Quality of Life Outcomes, 7, 27.
McDonough, C. M., Grove, M. R., Tosteson, T. D., et al. (2005). Comparison of EQ-5D, HUI, and SF-36-derived societal health state values among spine patient outcomes research trial (SPORT) participants. Quality of Life Research: An International Journal of Quality of Life Aspects of Treatment, Care and Rehabilitation, 14(5), 1321–1332.
Gelman, A. (2006). Multilevel (Hierarchical) Modeling: What It Can and Cannot Do. Technometrics, 48(3), 432–435.
Sonntag, M., König, H.-H., & Konnopka, A. (2013). The estimation of utility weights in cost-utility analysis for mental disorders: A systematic review. PharmacoEconomics, 31(12), 1131–1154.
Kemmeren, L. L., van Schaik, D. J., Riper, H., et al. (2016). Effectiveness of blended depression treatment for adults in specialised mental healthcare: Study protocol for a randomised controlled trial. BMC Psychiatry, 16(1), 113.
Cameron, I. M., Cardy, A., Crawford, J. R., et al. (2011). Measuring depression severity in general practice: Discriminatory performance of the PHQ-9, HADS-D, and BDI-II. The British Journal of General Practice: The Journal of the Royal College of General Practitioners, 61(588), e419–e426.
Herdman, M., Gudex, C., Lloyd, A., et al. (2011). Development and preliminary testing of the new five-level version of EQ-5D (EQ-5D-5L). Quality of Life Research, 20(10), 1727–1736.
Patten, S. B., Williams, J. V., Lavorato, D. H., et al. (2008). Major depression as a risk factor for chronic disease incidence: Longitudinal analyses in a general population cohort. General Hospital Psychiatry, 30(5), 407–413.
Moussavi, S., Chatterji, S., Verdes, E., et al. (2007). Depression, chronic diseases, and decrements in health: Results from the World Health Surveys. Lancet, 370(9590), 851–858.
Manea, L., Gilbody, S., & McMillan, D. (2012). Optimal cut-off score for diagnosing depression with the Patient Health Questionnaire (PHQ-9): A meta-analysis. Canadian Medical Association Journal, 184(3), E191–E196.
van Steenbergen-Weijenburg, K. M., de Vroege, L., Ploeger, R. R., et al. (2010). Validation of the PHQ-9 as a screening instrument for depression in diabetes patients in specialized outpatient clinics. BMC Health Services Research, 10, 235.
Rush, A. J., Pincus, H. A., First, M. B., et al. (2000). Handbook of psychiatric measures. Washington, DC: American Psychiatric Association.
Carmody, T., Rush, A. J., Bernstein, I., et al. (2006). The Montgomery Äsberg and the Hamilton ratings of depression: A comparison of measures. European Neuropsychopharmacology, 16(8), 601–611.
Smaar, L. K., & Keefer, L. A. (2011). Measures of depression and depressive symptoms: Beck Depression Inventory-II (BDI-II), Center for Epidemiologic Studies Depression Scale (CES-D), Geriatric Depression Scale (GDS), Hospital Anxiety and Depression Scale (HADS), and Patient Health Questionnaire-9 (PHQ-9). Arthritis Care & Research, 63(S11), S454–S466.
Radloff, L. S. (1977). The CES-D Scale: A self-report depression scale for research in the general population. Applied Psychological Measurement, 1(3), 385–401.
Haringsma, R., Engels, G. I., Beekman, A. T., et al. (2004). The criterion validity of the center for epidemiological studies depression scale (CES-D) in a sample of self-referred elders with depressive symptomatology. International Journal of Geriatric Psychiatry, 19(6), 558–563.
Trivedi, M. H., Rush, A. J., Ibrahim, H. M., et al. (2004). The Inventory of Depressive Symptomatology, Clinician Rating (IDS-C) and Self-Report (IDS-SR), and the Quick Inventory of Depressive Symptomatology, Clinician Rating (QIDS-C) and Self-Report (QIDS-SR) in public sector patients with mood disorders: A psychometric evaluation. Psychol Med, 34(1), 73–82.
Kessler, R. C., Berglund, P., Demler, O., et al. (2003). The epidemiology of major depressive disorder: Results from the National Comorbidity Survey Replication (NCS-R). Jama, 289(23), 3095–3105.
Fava, M., Rush, A. J., Trivedi, M. H., et al. (2003). Background and rationale for the sequenced treatment alternatives to relieve depression (STAR*D) study. Psychiatric Clinics of North America, 26(2), 457–494.
Snaith, R. P. (2003). The hospital anxiety and depression scale. Health and Quality of Life Outcomes, 1(1), 29.
Bosmans, J. E., van Schaik, D. J., Heymans, M. W., et al. (2007). Cost-effectiveness of interpersonal psychotherapy for elderly primary care patients with major depression. International Journal of Technology Assessment in Health Care, 23(4), 480–487.
Bosmans, J. E., Dozeman, E., van Marwijk, H. W., et al. (2014). Cost-effectiveness of a stepped care programme to prevent depression and anxiety in residents in homes for the older people: A randomised controlled trial. International Journal of Geriatric Psychiatry, 29(2), 182–190.
Hermens, M. L., van Hout, H. P., Terluin, B., et al. (2007). Clinical effectiveness of usual care with or without antidepressant medication for primary care patients with minor or mild-major depression: A randomized equivalence trial. BMC Medicine, 5(1), 1–11.
Huijbregts, K. M., de Jong, F. J., van Marwijk, H. W., et al. (2013). A target-driven collaborative care model for Major Depressive Disorder is effective in primary care in the Netherlands. A randomized clinical trial from the depression initiative. Journal of Affective Disorders, 146(3), 328–37.
van Marwijk, H. W., Ader, H., de Haan, M., et al. (2008). Primary care management of major depression in patients aged > or =55 years: Outcome of a randomised clinical trial. British Journal of General Practice, 58(555), 680–687
Schreuders, B., van Marwijk, H., Smit, J., et al. (2007). Primary care patients with mental health problems: Outcome of a randomised clinical trial. British Journal of General Practice, 57(544), 886–891.
Seekles, W., van Straten, A., Beekman, A., et al. (2011). Stepped care treatment for depression and anxiety in primary care. a randomized controlled trial. Trials, 12, 171.
van Steenbergen-Weijenburg, K. M., van der Feltz-Cornelis, C. M., van Benthem, T. B., et al. (2015). Collaborative care voor de behandeling van comorbide depressieve stoornis bij chronisch lichamelijk zieke patiënten op een polikliniek van een algemeen ziekenhuis. Tijdschrift voor Psychiatrie, 57(4), 248–257.
van Straten, A., Cuijpers, P., & Smits, N. (2008). Effectiveness of a web-based self-help intervention for symptoms of depression, anxiety, and stress: Randomized controlled trial. Journal of Medical Internet Research, 10(1), e7.
Warmerdam, L., Smit, F., van Straten, A., et al. (2010). Cost-utility and cost-effectiveness of Internet-based treatment for adults with depressive symptoms: Randomized trial. Journal of Medical Internet Research, 12(5), 40–50.
The current study has been conducted within the E-COMPARED framework. The E-COMPARED project is funded under the Seventh Framework Program. The content of this article reflects only the authors’ views and the European Community is not liable for any use that may be made of the information contained therein.
Conflict of interest
The authors declare that they have no conflict of interest.
Electronic supplementary material
Below is the link to the electronic supplementary material.
About this article
Cite this article
Kolovos, S., Bosmans, J.E., van Dongen, J.M. et al. Utility scores for different health states related to depression: individual participant data analysis. Qual Life Res 26, 1649–1658 (2017). https://doi.org/10.1007/s11136-017-1536-2
- Utility scores
- Multilevel analysis