Background

Mental disorder is a main cause of the disease burden in developed countries [1]. Thus, policy makers have an interest in the level of mental health and prevalence of mental health problems in the population. Clinical interviews are the gold standard for measuring psychiatric morbidity, however such interviews are costly. An alternative is self-report questionnaires of mental health. These questionnaires are cheaper and practical for population surveys and they can provide an assessment of all respondents. Questionnaire data on mental health has been shown to be associated with reduced workability [2, 3], lower work performance, and sickness absence [47]. Questionnaires can, however, not find true prevalence but only estimate risk proportions [8].

Several mental health instruments exist and depending on the chosen questionnaire the estimate of the risk proportion of the population may be very different: For example, in Croatia, Rukavinia et al. used the Mental Health Inventory (MHI-5) and estimated that the proportion with ‘psychological distress’ (scored from 0 to 100, cut-off at 52 point) was 34% in the adult population [9]. In Canada the proportion with ‘high psychological distress’ was 38% using the K-10 scale (scored from 0 to 40, cut-off at 9 point) [10]. In Denmark, the risk proportion of ‘all kinds of depression’ was 7.1% using the Major Depression Inventory (MDI) (scored from 0 to 50, cut-off at 20 points) [11].

In the present study, we evaluate the two mental health instruments the Major Depression Inventory (MDI) and the Mental Health Inventory (MHI-5) in regard to the prediction of long-term sickness absence. If the mental health of a person causes him or her to have a long-term sickness absence period from work, it is a severe health problem. Furthermore, the cumulative cost of sickness absence due to mental health problems is an economic burden for society. We therefore consider long-term sickness absence to be a relevant outcome for evaluating the performance of a mental health instrument used in population surveys.

The two questionnaires, the MDI and the MHI-5, have different designs (see Table 1): the MDI questions are a list of specific symptoms of depression; the MHI-5 questions are more generally formulated. Despite the differences, both instruments have been validated as measures of depression [12, 13] and both have been shown to predict sickness absence [4, 5]. In the present study we compare the instruments in the same study in a so-called head-to-head comparison, and we are therefore able to identify the best performing instrument in regard to the prediction of long-term sickness absence. We also divide the population into high risk and low risk groups by several methods for each instrument, and calculate the prediction of long-term sickness absence using different categorisations of the scales.

Table 1 The Mental Health Inventory (MHI-5) and the Major Depression Inventory (MDI)

Methods

Study sample and design

Data was collected as part of the DAnish National working Environment Survey (DANES) from late 2008 until early 2009. The DANES was approved by the Danish Data Protection Agency, journal number: 2008-54-0553. According to Danish law, questionnaire and register based studies do not need approval by ethical and scientific committees.

Questionnaire data was collected from a random sample of the Danish employed population, age 18–59 years. Participants could answer the questionnaire either by internet or by post. Non-responders received two reminders and were finally contacted and invited to participate in a telephone interview. The study contacted 9913 persons, 6531 responded (66%), of which 4919 were employees. We excluded participants with missing values on MHI-5 and MDI (n = 439), with missing response date (n = 150), with a sickness absence spell lasting 4 or more weeks in the preceding 3 months before or overlapping baseline (n = 177), yielding a study sample of 4153 participants. In the multivariable adjusted analyses the sample was, because of missing values to covariates, further reduced to 3713 participants. The 3713 participants are 75% of the employees that returned the questionnaire. If we assume the return rate of the employees is equal to the return rate of the full sample, i.e., 66%, the sample for the final analysis is 50% of the employed people the questionnaire originally was mailed to.

Our long-term sickness absence data came from a national register, the DREAM register [14]. The DREAM register has information on all Danish social transfer payments on a weekly basis. Employers are entitled to compensation from the municipalities if an employee has an absence spell of 22 days or longer.

The major depression inventory

The Major Depression Inventory (MDI) was developed in the late 1990s [12]. It was designed to measure depression symptoms in accordance with the symptom guidelines defined by the WHO classification for unipolar depression (ICD-10) and the American Psychiatric Association classification for major depression (DSM-IV) [12, 15]. The instrument consists of 12 questions and it includes 2 algorithms that classify participants with risk of unipolar depression (mild, moderate or severe) according to the ICD-10 definition or with risk of major depression according to the DSM-IV definition. The instruments can also be used to measure depressive symptoms scored on a scale ranging from 0 to 50, a higher score indicates a higher level of depressive symptoms. Both Bech et al. [12] and Forsell [16] have validated the MDI as a measure of depression using SCAN clinical interviews.

The questions of the MDI are shown in Table 1.

We calculated the MDI’s predictive value for long-term sickness absence when we used the MDI as: (1) a standardized scale score, (2) the scale categorized into five ordinal levels [5], (3) dichotomized into likely depression or not by the cut-off ≥20 in accordance with previous studies [5, 16], likely depression or not based on the DSM-IV algorithm, and likely depression or not based on the ICD-10 algorithm (we pooled the ICD-10 categories mild, moderate and severe) [12].

The mental health inventory

The mental health inventory (MHI-5) is a five-question subscale of the general health measure SF-36 [17]. The MHI-5 includes questions referring to both positive and negative aspects of mental health, and questions referring to both depression and anxiety [18, 19]. Berwick et al. [13], Cuijpers et al. [18] and Rumph et al. [20] have validated the MHI-5 as a measure for depression using clinical interviews as the gold standard. The MHI-5 has also been tested as a measure of anxiety, somatoform disorders, and substance use disorders, but has failed to perform equally well in these tests [18, 20]. The questions of the MHI-5 are shown in Table 1.

We scored the MHI-5 from 0 (poor mental health) to 100 (good mental health) [21]. Several cut-offs for dividing the responders into high risk and low risk group have been suggested for the MHI-5. The cut-off points are derived from ROC-curves or through another minimum misclassification criterion. The most used cut-off point is probably 52 [4, 22], other cut-off points include 54 and 74 [18] and the cut-offs 60, 68 and 76 [23]. We calculated the predictive value for long-term sickness absence using the MHI-5 as: (1) a standardized scale score; (2) categorized into 5 ordinal levels; and (3) dichotomized by four different cut-offs, i.e. the cut-offs 52, 60, 68 and 76.

Long-term sickness absence

Our outcome was time until onset of first episode of long-term sickness absence. In our analysis, long-term sickness absence was defined as minimum 4 weeks of consecutive sickness absence, because 4 weeks are the minimum period that is registered within the weekly based DREAM register. We followed responders in the register from 12 weeks before response date and until 60 weeks after response date. Participants, with long-term sickness absence in the 12 weeks before or overlapping the particular participant’s response date, were excluded from the study. Only sickness absence that comes after the response of the questionnaire is included in our analyses.

Covariates

The following covariates were included in the analyses: Age, gender, family status, smoking, alcohol, body mass index, leisure time physical activity, social class, somatic chronic illness, self-rated health and method of data collection. With the exception of age and gender, which were retrieved from registers, the information on the covariates came from the questionnaire.

We divided family status into four categories: (1) cohabitating, with children living at home, (2) cohabiting, without children living at home, (3) not cohabitating, with children living at home, (4) not cohabiting, without children living at home. Smoking was categorized into four levels; (1) never smoked, (2) ex-smoker, (3) light smoker, i.e., from occasional smoker to 14 cigarettes a day, and (4) heavy smoker ≥15 cigarettes a day. Alcohol intake was categorized into four levels following the guidelines of the Danish National Board of Health: (1) ≤ 7 units weekly, (2) 8–14 units, (3) 15–21 units, (4) > 21 units weekly. Body mass index was categorized into 4 levels according to the WHO categorization: (1) < 18.5 (underweight), (2) 18.5–25 (normal weight), (3) 25–30 (overweight), (4) > 30 (obese). Leisure time physical activity was scored on a scale from 0 to 24 points. Social class was categorized in accordance with the European Socio-economic Classification (ESeC). Somatic chronic illness was constructed as a binary variable (somatic chronic illness or not). Self-rated health was scored on a scale from 1–5 points. The self-rated health question primarily measures physical health [24, 25] and is therefore used together with somatic chronic illness to control for the responders physical health. Method of data collection was included, since responses are influenced by the data collection method [26]. Family status and social status were used as categorical variables in the analyses. All other covariates were assumed to be ordinal or linear.

Statistical analyses

We compared the MDI and MHI-5 in a head-to-head design [27]. We calculated the Pearson correlation between the MDI and the MHI-5 and we compared the distribution of MDI and the MHI-5 in two plots. We calculated the predictive value for long-term sickness absence by Cox’s proportional hazard in both a univariate regression analysis and a multivariate regression analysis. Participants were censored at the date of emigration, death, retirement, maternity leave or end of follow-up (60 weeks after baseline), whichever came first. We examined the proportional hazard assumption by visual inspection of log-log-survival plots. The predictive value of MDI and MHI-5 for long-term sickness absence was calculated with (1) measures dichotomized by different cut-offs, (2) 5-level categorical indexes, and (3) standardized scores. We standardized the two scores of the MDI and MHI-5 using mean = 0 and standard deviation = 1. The standardized MHI-5 score was reversed so high score equalled poor mental health for both scores. In order to evaluate which mental health instrument that had the best prediction of long-term sickness absence, we included both standardized scores in the same regression analysis. If one measure in such an analysis have a substantial better significance level than the other measure, it can be concluded that this measure better capture the variation in the data. To further confirm the result from this analysis, we performed a cross-validation by randomly splitting our sample into two independent samples and then carry out the analysis in each sample.

Results

The MDI

The MDI standardized score, the five-level categorized score, and the three dichotomized scores all predicted long-term sickness absence (see Table 2). The standardized score had the highest significance level. The 5-level categorization of MDI showed a steep increase in HR from the next highest to the highest level. This suggests that the best cut-off point for a dichotomization into a risk group and non-risk group is near this score, i.e., near 20 points (risk group = 7%). The risk group included 90 to 301 persons depending on which dichotomization we used, i.e., the risk group was from 2 to 7% of the employed adult population (see Table 2).

Table 2 The MDI’s and the MHI-5’s predictive value for long-term sickness absence

The MHI-5

The MHI-5 standardized score, the five-level categorized score, and the four dichotomized scores all predicted long-term sickness absence (see Table 2). The standardized score had the highest significance level. The 5-level categorization of MHI-5 showed an increase in HR for each level. The increase between the next highest and the highest level of the categorization was particular steep. This suggests that the best cut-off point for a dichotomization is near 60 points (risk group = 14%). The risk group included from 304 to 1560 persons depending on which dichotomization we used, i.e., from 7 to 38% of the employed adult population.

MDI and MHI-5 compared

The HRs of the standardized scores estimate the increased risk for one standard deviation increase of the standardized score, i.e., the increased risk of sickness absence for a one standard deviation of worse mental health. In the adjusted analyses, the HR of MDI was 1.27 and the HR of MHI-5 was 1.37, both were highly significant (see Table 2). With both instruments included in the same regression analysis, the MHI-5 remained significant and the MDI became non-significant (see Table 2), i.e., the MHI-5 explained a significant part of the variation in long-term sickness absence even when we controlled for the MDI. To confirm the stability of this result, we cross-validated the result by randomly splitting our sample in two independent samples and then carry out the analysis in each sample. Both results supported our initial finding (results not shown). Thus, the MHI-5 explained the variation in long-term sickness absence better than the MDI.

Figure 1 shows the distribution of MDI and MHI-5 scores within the study population. Both distributions are positively skewed, indicating that they distinguish better between degrees of reduced mental health than degrees of good mental health. The MDI score correlated with the MHI-5 score with a Pearson correlation coefficient = −0.69 and p < 0.0001. The correlation is illustrated in Figure 2. The Figure also illustrates that not all participants classified as high risk by the MDI were classified as high risk by the MHI-5 or vice versa. The MDI ‘cut-off 20’ and the MHI-5 ‘cut-off 52’ each classified 7.3% as high risk. However, only 3.8% of the ‘cases’ overlapped.

Figure 1
figure 1

The distribution of MDI and MHI-5 in a random sample of Danish employees.

Figure 2
figure 2

The association between MDI and MHI-5. Bubble size indicates no. of persons with the particular response pattern. The MDI cut-off at 20 and the MHI-5 cut-off at 52 are marked with a line, the resulting cross-hair illustrates that ‘cases’ identified by MDI and MHI-5 is only partly overlapping.

Discussion

Both the standardized MDI score and the standardized MHI-5 score strongly predicted long-term sickness absence. However, the MHI-5 captured more of the variability than the MDI when both instruments were included in the same analysis. The MHI-5 therefore performs better than the MDI as a predictor of long-term sickness absence. Furthermore, the MHI-5 consists of fewer questions than the MDI and is therefore more ‘economic’. Our study can, however, not conclude that MHI-5 is a better measure of depression than the MDI; it can only conclude that the predictive value for long-term sickness absence is higher for the MHI-5 than for the MDI.

The size estimations of the high risk group from the different dichotomizations of the MDI (2 to 7%) were more restrictive than the size estimations of the high risk group from the different dichotomizations of the MHI-5 (7 to 38%). If the MDI only identified people with risk of depression and the MHI-5 identified people with reduced mental health in general, the MDI should indeed identify less people than the MHI-5. This could also explain why the MHI-5 is a better predictor of sickness absence than the MDI. However, studies show that the MHI-5 performs better as a measure of depression than as a measure of anxiety or substance disorder [13, 18, 20, 28]. It is possible that the MHI-5 is more general than the MDI but the MHI-5 is not general enough to show strong validity as a measure of other disorders than mood disorder.

We chose to compare the predictive validity (i.e., the ability to predict a relevant outcome) of the two instruments. A limitation of our study is that only one outcome was tested, time until long-term sickness absence. Other outcomes, such as number of sick days or productivity at the workplace, may have given a different result. The population, in which we test the instruments, may also be of importance for the result. The MDI might perform better than the MHI-5 in a high risk population. The population in our study is a general working population, and only approximately 50% of those the questionnaire was aimed at ended up in the final regression analyses. The final sample may have been particular healthy due to selection bias. Furthermore, long-term sickness absence can be caused by other reasons than reduced mental health. It could, as an example, be due to a somatic illness. We have tried to control for physical health in the analyses, but it may not be sufficient.

Conclusion

The MHI-5 had a higher predictive value for long-term sickness absence than the MDI. In a study where the predictive value for long-term sickness absence is of importance the MHI-5 must be recommended as the best measure of mental health. The size of the high risk group can for the same instrument be very different depending on the choice of cut-off for case-ness, however, the MDI categorized in general fewer persons as ‘cases’ than the MHI-5.