Background

While the prevalence of mental health conditions is increasing in the UK, there is a dearth of country specific data on the impact of mental health conditions on health-related quality of life (HRQoL). There are a number of European studies that investigate HRQoL decrements as a result of mental health conditions but none of these includes the UK [13]. The focus of many surveys has been epidemiological in nature concentrating on prevalence of mental health conditions. Health state utility values (HSUVs) take the form of an index constructed from an outcome measure normally anchored from zero to one to represent death and full health respectively. They are normally used by decision-makers to evaluate and compare cost-effectiveness of various treatments. HSUVs are particularly scarce for more complex conditions like psychosis, phobia and panic disorder [4] and disutility associated with different levels of severity.

Decision makers such as the National Institute for Health and Care Excellence (NICE) in the UK recommend that the results of economic evaluations in healthcare are presented in terms of quality adjusted life years (QALYs) which are a composite measure of health related quality of life and life expectancy. The most commonly used measures for putting the ‘Q’ into the QALY are generic preference-based measures of health, such as the EQ-5D [5] and SF-6D [6].This paper seeks to address the lack of available HSUVs associated with mental health conditions by providing estimates that can be used in economic evaluation from a representative community based sample.

Another important feature of mental health conditions is the common existence of co-morbidities, arising from other mental health conditions as well as physical ones. Understanding the impact of one condition on HSUVs cannot be done in isolation, but only after controlling for other health conditions. Previous studies of mental health conditions have largely failed to do this. There are several studies in the literature that highlight the lack of consensus on how best to calculate HSUVs [7, 8]. While attempts have been made to calculate HSUVs for co-morbid physical conditions [9], for example by adding or multiplying the effects of separate conditions, there is still overall very little empirical research on co-morbidities for those with mental health conditions.

Aims of the study

This paper has three objectives: (i) to estimate the impact of mental health disorders on quality of life in England (ii) to compare and contrast the impact of physical health and mental health co-morbidities on quality of life (iii) to assess the impact of severity of depression, anxiety, phobia and panic disorders on quality of life.

Specifically we estimate the decrement associated with various mental health disorders using the SF-6D and EQ-5D indices from a representative sample of the UK general population and controlling for a wide range of background variables.

Methods

Data

We used health-related quality of life data and information on health status collected in the Adult Psychiatric Morbidity Survey (APMS) carried out in 2000 and 2007 [10, 11]. This is a rigorously conducted general population survey aiming to provide information on the prevalence of psychiatric conditions among people living in Great Britain, as well as their associated social disabilities and use of services. Each APMS recruited about 8,000 working age adults in private households. APMS is unique in the UK for having data on a broad range of conditions including common mental health disorders like depression, anxiety and obsessive compulsive disorder, psychotic conditions, personality disorders and alcohol and drug dependence. They also contain general health measures including the SF-12 health index, a measure from which the SF-6D preference based utility index can be obtained [12]. There is also information on socio-demographic data, education and employment, income and debt, accommodation and stressful life events. While the 2000 survey covered England, Wales and Scotland, APMS 2007 only interviewed people in England so our analysis uses data for England only.

In both years APMS interviews are conducted in two stages. First, a computer assisted personal interview in the respondents own home covering neurotic symptoms and disorders using the Clinical Interview Schedule Revised (CIS-R) and screening items on personality disorder and psychosis. CIS-R is a structured interview that has been standardised so that it can be administered by social survey interviewers. It enquires about 14 common neurotic symptoms allowing categorisation according to ICD-10 criteria. CIS-R also measures the severity of the condition and is widely used [13, 14]. A second stage sample was chosen comprising respondents who satisfied screening criteria for psychotic and personality disorder. The second stage interviews were conducted by trained psychologists using Schedules for Clinical Assessment in Neuropsychiatry (SCAN) and Structured Clinical Interview (SCID-II). In 2000 (2007) there were 8580 (7461) initial interviews, a response rate of 54 (57)%. At the second stage there were 638 (630) interviews and a response rate of 73 (74)%. Our analysis sample comprises 5688 individuals in 2000 and 5388 individuals in 2007. Data for 2000 and 2007 are pooled.

Measuring HSUVs

Two generic preference-based measures were derived from the SF-12 data: SF-6D and the EQ-5D. The SF-6D health utility index is derived from individual responses to the SF-12, a generic health measure based on items taken from the SF-36 health survey, a standardised questionnaire used to assess patient health [15]. Brazier and Roberts [9] developed a preference based index for the SF-12 using an algorithm estimated from standard gamble valuations of a sample of SF-6D states obtained from members of the UK general population [6]. The index takes values that range from zero (equivalent to dead) to one (full health).

The EQ-5D questionnaire consists of 5 simple questions on mobility, self-care, usual activities, pain and discomfort, anxiety and depression. As raw data was not collected from respondents using the EQ-5D questionnaire, EQ-5D scores have been generated through mapping from SF12 items using a response approach mapping [16]. It is deemed important to present the utilities using the EQ-5D, since the former is the method preferred by NICE in the reference case [17]. In the response mapping approach, multinomial logistic regression is used to estimate the probability that a respondent will select a particular level of response to questions in the EQ-5D, using individual question responses from the SF-12 as predictors. EQ-5D responses are predicted using Monte Carlo simulation methods and UK tariffs are then applied to the raw scores to generate the EQ-5D index [5].

Measuring severity of conditions

It is often useful to know the severity of the condition, as quality of life decrements and health care costs are usually much higher for greater levels of morbidity. Each condition is diagnosed via a set of four questions from the CIS-R. For example in the case of anxiety these are: (i) felt generally anxious/nervous/tense for 4 days or more in the past seven days; (ii) in past seven days anxiety/nervousness/tension has been very unpleasant; (iii) in the past seven days have felt any of the following symptoms when anxious/nervous/tense (Racing heart, sweating or shaking hands, feeling dizzy, difficulty getting one’s breath, dry mouth, butterflies in stomach, nausea or wanting to vomit); (iv) felt anxious/nervous tense for more than three hours in total on any one of the past seven days. Each question scores one if that symptom was present, giving a total anxiety score ranging from zero for no symptoms to four. This score can be used as a measure of severity of the condition. A similar approach was taken with depression, panic and phobia.

Analysis

The basic model to be estimated is:

U i = f M i , P i , X i , ϵ i
(1)

Where U i is health utility for respondent i; M is mental health, P is physical health and X is a set of background characteristics, ϵi is a random error term.

Two separate sets of analyses are carried out with SF-6D and EQ-5D as dependent variables. The initial modelling was undertaken using Ordinary Least Squares (OLS) but this can be criticised. OLS estimates are biased and inconsistent, because both the SF-6D and EQ-5D distributions are skewed and in addition the EQ-5D distribution is truncated with many observations at the upper value of one [18]. To overcome these problems, OLS models for SF-6D are estimated with robust error variance as Breusch-Pagan/Cook-Weisberg tests results suggest heteroskedastic errors, and for the EQ-5D, tobit models are used to deal with the truncated nature of the data.

Mental health (M) is measured by a set of dichotomous variables to represent the presence of specific disorders. Diagnosis of specific disorders were assigned by the Office of National Statistics using answers to various sections of the CIS-R and applying algorithms based on the ICD-10 diagnostic criteria; these disorders are: generalised anxiety disorder (GAD), mixed anxiety depressive disorder (MADD), panic disorder, obsessive compulsive disorder (OCD), phobia, and depression. Psychosis and personality disorder are assigned via the Stage 1 screening questions, alcohol dependence is defined according to the Severity of Alcohol Dependence questionnaire (SAD-Q) and drug dependence defined according to questions used in the US Epidemiologic Catchment Area survey. A variable was generated to represent people who self-report that they have long-term depression lasting for a period of 2 years or over [19]. More detail on definitions for each disorder can be found in the APMS technical reports [10, 11].

Physical health (P) is measured by a set of dichotomous variables denoting the presence of self-reported long-standing health conditions: muscular-skeletal, respiratory, digestive, heart and circulatory, urinary, skin, ear, eye, neoplasm, blood disorder, and infection. The set of background variables (X) include age, marital status, presence of children aged 16 or under in the household, employment status, ethnicity, education and income. Dummy variables for regions and year are also included. A full list of variables and definitions can be obtained from the corresponding author.

Although co-morbidities are an important issue for cost-effectiveness modelling, there is no consensus about the best method to estimate HSUVs for co-morbidities [7]. To allow a flexible approach we have explored all first order interactions between mental health conditions and physical health conditions by estimating models (2) and (3) below:

U i = f M i , P i , X i , I M i , ϵ i
(2)
U i = f M i , P i , X i , I P i , ϵ i
(3)

Where IM is a set of dichotomous variables representing first order interactions between the seven mental health conditions described above and IP is a set of dichotomous variables representing first order interactions between the seven mental health conditions and the seven physical health conditions described above. It was not possible to investigate co-morbidities for personality disorder, psychosis and panic due to the small number of observations. In addition some physical conditions could not be considered due to small sample sizes, including ear complaints, neoplasm, blood disorder and infectious disorder. Data from the APMS was downloaded from the UK Data Archive [20] and STATA 11 was used for the analysis.

Results

Table 1 reports descriptive statistics for the entire sample of 11080 cases. The distribution of the SF-6D health utility index is negatively skewed with a concentration at the higher end of the scale (Additional file 1: Figure S1a). However there is no obvious ceiling or floor effect and only a small proportion of observations take the value of one (less than 5%). While the mean scores for SF-6D (0.791) and EQ-5D (0.808) are very similar, the distributions are very different. The EQ-5D distribution has a ceiling effect with 33% of observations at full health. SF-6D and EQ-5D scores range between 0.35 and 1 and between -0.38 and 1 respectively (Additional file 1: Figure S1b).

Table 1 Descriptive statistics for entire sample

Table 2 shows the descriptive statistics on physical and mental health conditions with mean utility scores. Around 82% of people in this dataset have no mental health disorder, similar to the prevalence rate reported in the APMS reports [10, 11]. The most common problem is Mixed Anxiety Depressive disorder (MADD) found in around 10% of respondents, and this is followed by alcohol dependency, which is here defined as any level of dependency detected by the SADQ, ranging from mild to severe. The SF-6D (EQ-5D) scores for those with no physical health problem and no mental problem are 0.829 (0.851) and 0.827 (0.842) respectively. All the scores for the EQ-5D are marginally higher than the SF-6D. Long-term depression and depression sufferers have the lowest SF-6D (EQ-5D) scores of 0.532 (0.513) and 0.551 (0.537) respectively. According to the neurotic symptoms score, those with more severe symptoms have monotonically lower utility scores as measured by both SF-6D and EQ-5D.

Table 2 Descriptive statistics: utility scores for people with physical and mental health conditions

The results from the OLS and tobit regressions for SF-6D and EQ-5D respectively are shown in Table 3. Marginal effects have been calculated at the means of the explanatory variables for tobit regressions. The highest decrements in HSUVs are observed for depression, MADD, long-term depression and GAD for both SF-6D and EQ-5D. The HSUV decrements in SF-6D (EQ-5D) scores are as follow: depression -0.137 (-0.159); MADD -0.136 (-0.127); long-term depression -0.100 (-0.118); GAD -0.086 (-0.070); phobia -0.083 (-0.081); panic -0.077 (-0.076); OCD -0.057 (-0.034) and; drug dependency -0.027 (-0.011). The three mental health conditions that do not have a statistically significant effect on health utility are psychosis, personality disorder and alcohol dependence. The coefficients for psychosis and personality disorder may be insignificant due to the small numbers of people (81 and 31 respectively) suffering from these conditions. All of the physical health conditions, except eye complaints, are statistically significant at 1% level of significance. In the models for both SF-6D and EQ-5D, socio-demographic characteristics were controlled for. They are not reported in this paper but are available by request from authors.

Table 3 Health decrements in SF-6D and EQ-5D scores

For the OLS regression for SF-6D, the adjusted R2 is 0.41 but the Ramsey Regression Equation Specification Error Test (RESET) [21] and Pregibon [22] link test statistics suggest that the model suffers from omitted variables and misspecification problems respectively. It is worth noting that these tests also revealed misspecification problems with both tobit [23] and Postestimation Generalized Linear Models (PGLM) models [24, 25] which were also explored for the SF-6D (not reported here). The marginal effects of physical and mental health conditions on the SF-6D index are very similar regardless of the estimation procedure and functional form chosen, a finding similar to that of Jones [26] who compared a number of different models for modelling health care cost data. In the case of EQ-5D as the dependent variable, the link test suggests that the model is correctly specified making tobit regression a better model than OLS in this case.

Co-morbid Health Conditions

Table 4 reports the results for the models exploring interactions between mental health conditions. Only the coefficients on the main mental health conditions and the significant (at p = 0.05) interactions are reported. The coefficients on the mental health conditions are different compared to Table 3 but remain statistically significant. The first point to note is that the coefficients on all of the interaction terms are positive implying that having both conditions results in a health utility decrement that is less than the sum of the individual coefficients. For example, having GAD and depression, the decrement to the SF-6D index is -0.107 - 0.182 + 0.122 = -0.167; this is smaller than that suggested by the additive model of -0.289. In this case it is also highlighted that the interaction term more than offsets the main effects of GAD. Similarly using EQ-5D, the decrement associated with GAD and depression is -0.213. Despite the insignificant main effects for alcohol dependence, there are significant positive interactions with GAD, MADD (EQ-5D only) and depression.

Table 4 Utility values for mental health co-morbidities: SF6 and EQ-5D scores

Table 5 reports the results to explore interactions with physical conditions; only for models where the coefficients on the main health conditions and the interactions are significant (at p = 0.05). The coefficients on the mental and physical health conditions are similar to those in Table 3 and are statistically significant. Of the interactions reported, the majority are positive, meaning that the presence of both disorders reduces health utility by a smaller amount than that suggested by the additive model. The only interaction terms that are negative are MADD and respiratory complaints, and MADD and muscular skeletal complications with the latter interaction being insignificant.

Table 5 Utility values for mental health and physical health co-morbidities: SF6 and EQ-5D scores

Severity of mental health condition

We have explored the severity of four conditions: depression (n = 367), anxiety (n = 613), phobia (n = 267) and panic (n = 121). We estimate model (1) and in the vector M, the dichotomous variable for the presence of anxiety (depression) is replaced with a set of four dummy variables indicating the score, compared to a baseline of zero. The results are shown in Table 6; the effect on other coefficients is negligible so only the estimated coefficients on the new set of dummy variables are reported. The results for depression are as expected, the utility decrement increases with the severity of the condition.

Table 6 Impact of severity of Anxiety, Depression, Phobia and Panic on SF-6D and EQ-5D scores

For anxiety, while there is a gradient overall from level 1 to 4, the estimated coefficient for levels 3 and 4 have very similar decrements. In this case, the levels have been aggregated to represent only two severity levels with the more severe level showing a higher SF-6D (EQ-5D) decrement of -0.070 (-0.068) compared with the lower severe level of -0.047 (-0.032).

The severity levels for phobia and panic have also been aggregated to distinguish between two severity levels. In the case of phobia, higher decrements are observed in both SF-6D and EQ-5D scores for the more severe levels. It is noted that the EQ-5D scores are not significant. However, for panic disorder both SF-6D and EQ-5D show a lower decrement for the more severe level and only the SF6D value is significant. This is possibly mainly due the fact that there are only 57 and 64 individuals in the less severe and more severe categories respectively.

Discussion

Our estimates from over 10,000 responses to the APMS 2000 and 2007 show that all but three of the mental health conditions considered here have a statistically significant and relatively large adverse effect on health utility measured by both SF-6D and EQ-5D indices. These effects are larger than for self-reported physical health conditions. Another study also found that HRQoL decrements from mental health conditions are higher than those reported by suffers of chronic physical conditions [1]. Depression, MADD and GAD are the mental health conditions that impact most on health. The three mental health conditions that are not statistically significant are alcohol dependency, personality disorder and psychosis. In considering co-morbidities, we have used interaction terms and in most cases they are significant confirming the fact that we cannot simply add HSUVs decrement of co-morbid health conditions. The impact on HSUVs is generally less than the addition of the separate health conditions. When considering interactions between mental and physical health conditions, the majority are positive, however in one case the interaction is negative and significant suggesting that the presence of both conditions reduces health utility by a larger amount that the additive model would suggest. Brief exploration of the effects of the severity of anxiety and depression on the health utility index, suggest that for depression the results are as expected, the utility decrement increasing with the severity of the condition. However, for anxiety, panic and phobia, only two severity levels were considered.

Utility decrements have been measured using both SF-6D and EQ-5D predicted using a published mapping function. Both measures on the whole depict a similar picture. In general, for most physical and mental health conditions, the decrements shown by SF-6D and EQ-5D are similar and this finding is supported by a study comparing SF-6D and EQ-5D across seven patient groups with physical health conditions [27]. A multi-centre randomised control trial in Netherlands [28] found that EQ-5D utilities led to significant higher health gains even though EQ-5D mean scores were lower than SF-6D scores at baseline. A study on patients with schizophrenia revealed similar EQ-5D and SF-6D mean scores at baseline but higher EQ-5D mean scores higher at follow-up [29].

In model [3], the utility decrements for eye complaints are statistically insignificant. This finding is line with a recent review where it was found that SF-6D and EQ-5D are inadequate measures for such conditions [30]. Similarly, the insignificant coefficient for psychosis is in line with the finding that generic measures are not suitable for use in schizophrenia, a psychotic disorder [31]. An additional reason why personality disorder and psychosis are not statistically significant could be due to relatively small numbers, as less than 1% of the sample has these conditions. For alcohol dependency we explored whether the insignificant result was due to our relatively broad definition of any dependency, by testing the effects of moderate and severe dependence on alcohol (as defined by the SADQ) alone, but these were not statistically significant either. Another study found that alcohol dependence was insignificant but similar to this paper, it was not possible to understand how and why this was the case [1].

The effects of co-morbidities are investigated through interaction terms. For the majority of mental health co-morbidities, the interaction terms are positive and often offset the decrement caused by a single condition. While it is understandable that this may be the case, clinically it is difficult to explain. It may be that patients get better at managing their conditions having been exposed to another mental health condition. In the presence of a mental health condition and a physical health condition, some of the results are very surprising. First, several of the interaction terms as measured by EQ-5D score are statistically insignificant even though the both main effects are statistically significant. These include GAD and muscular/skeletal complaints, GAD and digestive complaint, GAD and urinary complaints, MADD and urinary complaints, MADD and skin complaints, depression and respiratory complaints, phobia and muscular/skeletal complaints. This is not the case when the dependent variable is SF-6D. It may be due to the artefact of the mapping approach. In some cases, the main effects are not statistically significant but the interaction terms are. Examples include alcohol dependency and digestive complaints, where alcohol dependency is insignificant but the interaction term is significant. In this case, it is not clear whether it is acceptable to use the HSUVs thus generated. A possible explanation for the latter result is the fact that alcohol may be used to alleviate the symptoms of GAD and MADD.

Limitations

Overall the models we have estimated have good explanatory power but the RESET test does suggest misspecification problems. We have explored both tobit and generalised linear models for the SF-6D but these were still wrongly specified. The EQ-5D index also suffers from ceiling effects and therefore more sophisticated models may be used to address that. Despite the weaknesses of OLS regressions in the case of SF-6D, it was decided to present the results as they are easy to interpret and in the absence of better estimates, those generated in this paper can be readily used in decision-analytic models.

It is worth stressing here that the data we have used to estimate these models is not based on clinical diagnosis of mental health conditions, but instead on well-established instruments administered by trained interviewers. Furthermore, while data on the mental health conditions are reliable, the data on physical conditions are self-reported and are therefore less reliable since there was no confirmation of the conditions. Finally, the data is limited to those living in private households, though this accounts for the vast majority of people with these conditions.

There may also be a concern as to whether a generic measure of health like the SF-12 adequately assesses the impact of all mental disorders – particularly personality disorders and psychosis. The other limitation is that the EQ-5D estimates are based on a mapping function that been shown to have a tendency to underestimate values at the upper end of the scale and overestimate values at the lower end. Other mapping functions have not overcome this limitation and it may result in an underestimate of the impact of different conditions [32].

Conclusions

Despite these limitations, this study has a number of interesting findings. First, the estimates provided are based on a representative sample of the UK population. Generally HSUV are generated from trial data and there are always questions as to whether they can be generalised and used outside the trial setting. Therefore the decrements associated with the various mental health conditions and co-morbidities in this paper can be readily used in decision-analytic models in assessing cost-effectiveness of health technologies. Second the APMS is regarded as the most reliable available data in the UK in the area of mental health.

There are several implications of this paper. First, background characteristics need to be controlled for when estimating HSUVs. This is very important but often not accounted in single condition studies. As a result the HSUV decrement associated with a particular health condition is possibly higher which, in turn has implications in cost-effectiveness analysis. Second, the paper has also been able to generate estimates for HSUVs associated with long-term depression. Third, the paper shows that simply adding HSUVs for co-morbid health conditions is not appropriate. Fourth, it provides decrements that can be used to reflect severity of the following conditions: depression, anxiety, panic and phobia. These are useful parameters that can be used in decision analytic models where conditions of patients may be shown to progress.