Background

Chronic obstructive pulmonary disease is associated with a variety of comorbidities [1, 2], that are associated with hospital admissions and mortality [3]. One of them is depression [4] which is also linked to the clinical state and course of COPD [5, 6]. A further common factor in both COPD and depression are deteriorations in quality of life [7,8,9,10].

One of the screening tools for depression is the Patient Health Questionnaire Depression Scale 9, a 9-item self-report module containing the 9 signs and symptoms of major depression as delineated in the Diagnostic and Statistical Manual of Mental Disorders (DSM-IV) [11]; a detailed description of the PHQ-9 questionnaire is given in the Additional file 1. Although the questions were constructed to refer to the same psychometric dimension “depression”, they are not synonymous, and some of them appear to be susceptible to impairments common in somatic diseases.

A relationship between biomedical disease conditions and depression scores has been assessed for a number of chronic diseases [12,13,14]. In COPD, many studies evaluated the sum score, and some studies selected single items, such as suicide ideation [15]. The spectrum of alterations common in COPD raises the question whether other items of the PHQ-9 and its sum score are related to different COPD characteristics.

To our knowledge, there is no study addressing this question and quantifying the impact of specific COPD characteristics on depression. This impact could also explain the wide range of prevalence estimates of depression in different COPD populations [16, 17]. Based on these considerations, we analysed the relationship between COPD characteristics and the results of the PHQ-9 using data from the baseline visit of the large German COPD cohort COSYCONET (German COPD and Systemic Consequences - Comorbidities Network) [18]. To elucidate the impact of COPD, we also demonstrated the associations between the single PHQ-9 items with anthropometric data, gender, age, smoking status, comorbidities and COPD characteristics.

Methods

Study population

COSYCONET is a multicentre cohort study focussing on the relationship between the lung disorder and comorbidities. We used the data of the baseline visit comprising 2741 COPD patients; study population, protocol and methods have been described previously [18]. The assessments included functional tests, clinical history, medication, quality of life and the PHQ-9 questionnaire for the screening of depression [18].

Assessments evaluated in the present analysis.

In the present analysis we included all patients of spirometric grades GOLD 1–4 [19], who had complete data of the PHQ-9, spirometric lung function, smoking status (active smoking versus no active smoking), exacerbation history and mMRC. Active smoking was defined as regular or occasional active cigarette smoking within the last 4 weeks prior to the interview. No active smoking included former smokers defined as regular or occasional active cigarette smoking in the past, as well as never smokers defined as no active cigarette smoking over lifetime. For the categorisation into GOLD groups A-D [19] we used the mMRC, which resulted in a more uniform distribution compared to the COPD Assessment Test (CAT), particularly regarding group C [20, 21]. The mMRC was therefore used throughout the study. The definition of exacerbation risk also followed GOLD recommendations [19]. Symptoms were defined via the combined GOLD groups B and D (GOLD BD), and exacerbation risk via the combined GOLD groups C and D (GOLD CD).

Among lung function parameters, forced expiratory volume in one second (FEV1) was used, since all other parameters turned out to be redundant for the purpose of this analysis. Predicted values of FEV1 were taken from the Global Lung Initiative (GLI) [22]. The assessments including patients’ history, lung function, questionnaires regarding the clinical state of COPD (mMRC, CAT), as well as quality of life (EQ VAS) were guided by standard operating procedures [18], and the PHQ-9 was administered following the instructions [23]. Moreover, patients were asked to bring with them all medication taken, which was recorded and evaluated according to substance categories based on the ATC code [21, 24]. The information derived from medication was also used to define a number of comorbidities for which specific medication was available [24]. The diagnosis (named “extended”) comprised patients with a reported doctor’s diagnosis (anamnestic report) as well as patients without report but with disease-specific medication. For psychopharmacological treatment, we additionally evaluated patients with specific medication and the subset of patients with depression-specific medication.

Data analysis

Data and parameter estimates are presented as mean values, standard deviations (SD) or standard errors of mean (SEM), or median values and quartiles (mMRC), or numbers and percentages. The relationship between categorical variables was assessed by contingency tables and chi-square statistics. Regarding continuous parameters, analysis of variance (ANOVA) with post hoc-comparisons according to Duncan was used. The individual items of the PHQ-9 were compared between different conditions, e.g. GOLD groups A to D, by the Kruskal-Wallis-H test. The relationship between the PHQ-9 sum score and influencing factors was quantified by multiple linear regression analysis. Additionally, receiver operator characteristics (ROC) were used to evaluate the relationship of the CAT score and EQ VAS to the PHQ-9 score. Statistical significance was assumed for a p value below 0.05, and all analyses were performed by the software package SPSS statistics 23 (IBM Corp., Armonk, NY, USA).

Results

Study population

Overall, 2291 patients of GOLD grades 1 to 4 were eligible. Of these, 2255 had complete data of the PHQ-9 and GOLD ABCD grouping as well as smoking status and were included into the analysis (Table 1). GOLD grades 1 to 4 were significantly different in the majority of parameters, except for pack years and gender. The distribution of comorbidities over grades and groups is shown in Table 2, indicating that the relative frequency of some comorbid conditions differed across these categories.

Table 1 Patients characteristics
Table 2 Distribution of comorbidities over grades and groups

Relationship between PHQ-9 and COPD severity

Multiple linear regression analysis revealed that the sum score of the PHQ-9 was associated with gender, age, BMI, pack years and smoking status (p < 0.05 each). The different role of the single of the single items of PHQ-9 to gender, age, BMI, pack years and smoking status is illustrated in Additional file 2: Figure S1, indicating different associations of these data with single PHQ-9 items. For example, higher age was associated with a decrease in the PHQ-9 scores for all questions, in contrast to gender, which was related to an increase of PHQ-9 sum score especially for questions 3 and 5. The distribution of the PHQ-9 sum score stratified according to GOLD grades and GOLD groups is shown in Table 1 and illustrated as combined relationship in Fig. 1, indicating a larger variation across groups than across grades. The figure also demonstrates that the differences related to symptoms (GOLD B and D) were larger than those related to exacerbation risk (GOLD C and D). With increasing severity of COPD the proportion of patients showing scores of at least 10, i.e. depression according to PHQ-9 recommendations, increased (Table 1).

Fig. 1
figure 1

Distribution of the PHQ-9 sum score stratified according to GOLD grades 1–4 and groups A-D in terms of mean PHQ-9 sum score and standard deviation. To illustrate the variability across patients, standard deviations have been chosen. The figure illustrates the additivity of symptoms and exacerbation risk regarding the sum score, when comparing the changes from A to B and B to D with those from A to C and B to D

The pattern exhibited in Fig. 1 suggested a dependence of the scores on symptoms and exacerbation risk in an additive manner. To evaluate this hypothesis we performed a two-way ANOVA with “symptoms” and “exacerbations” as binary categorical factors and an interaction term, keeping the confounders age, gender, BMI, smoking status and pack years. Higher symptoms (BD vs AC) led to an average (± SEM) increase by 2.75 ± 0.20 points, and higher exacerbations (CD vs AB) were associated with an average increase by 1.44 ± 0.21 points, without significant interaction (Fig. 2). This additive effect implied an increase by 4.19 points in group D compared to group A.

Fig. 2
figure 2

Effect of symptoms and exacerbations on the PHQ-9 sum score. The figure depicts the amounts of the additive effects, corresponding to a mean increase by 2.75 points in patients with high symptom burden and by 1.44 points in patients with high exacerbation risk

In a next step, we determined the relationship between GOLD groups A to D and the single items of the PHQ-9 (Kruskal-Wallis-H test). Each item showed significant associations with GOLD groups (p < 0.001). Additional file 2: Figure S2 illustrates the homogeneity across items, justifying the analysis of the sum score with regard to groups A to D.

Association of PHQ-9 with functional and clinical state of COPD

The association of the PHQ sum score with GOLD grades 1 to 4 was significant but weak (Table 1). The result for the single items is illustrated in Additional file 2: Figure S3; questions 1, 2, 4, 8 and 9 (p ≤ 0.011 each) were dependent on grades 1 to 4, while other questions were not (Kruskal-Wallis-H test). This demonstrates that the relationship to GOLD grades was less linear than the relationship to GOLD groups.

To elucidate the relation to GOLD grades, a linear regression analysis of the sum score against FEV1, including age, gender, BMI, smoking status and pack years as confounders, was performed. Overall, FEV1 was significantly (p < 0.001) related to the PHQ-9 score (Additional file 3: Table S1) but inspection of Fig. 1 revealed that for each of the GOLD groups A to D the association with FEV1, or equivalently GOLD grades, was weak. This was confirmed by separate analyses for each group A-D, in which FEV1 was not statistically significant. Due to the inconsistency of results, we therefore decided to omit lung function from the further analyses.

An analogous computation using the CAT and EQ VAS scores showed both to be significantly (p < 0.001 each) linked to the PHQ-9 sum score (Additional file 3: Table S2). The close relationship was underlined by ROC analyses, taking a value of PHQ-9 ≥ 10 as indicator for depression as outcome variable. These analyses yielded an area under the curve (AUC) of 0.81 with a cut-off of 20 for CAT, and of 0.75 with cut-off values of about 50 for VAS (total scale 0–100). Based on the very close relationship between the tests, CAT and EQ VAS were also omitted from the further analyses.

Association of PHQ-9 with comorbidities

Using multiple linear regression analysis and again including the confounders gender, age, BMI, smoking status and pack years, we assessed the association of the sum score with the comorbidities listed in Table 2. Asthma, sleep apnoea, gastrointestinal disorders, osteoporosis and arthritis were significantly linked to the sum score (Additional file 3: Table S3). The magnitude of the effects is illustrated in Fig. 3. The result for the single items is given in Additional file 1: Figure S4, showing different associations of comorbidities with single PHQ-9 items. For example, sleep apnoea was predominantly related to questions 4 and 7, whereas arthritis was related to questions 1 and 3.

Fig. 3
figure 3

Effect of comorbidities on the PHQ-9 sum score

The figure illustrates the dependence of the PHQ-9 score on the different comorbidities. Each single comorbidity was associated with an increase in the PHQ-9 sum score and these effects were additive.

Association with the diagnosis of mental disorders and medication

To determine the relationship to the PHQ-9 score, different definitions of mental disorders were used (Table 3), again by regression analyses including age, gender, BMI, smoking status and pack years. All four categorisations were significantly related to the PHQ-9 score (p < 0.001 each). Intake of specific medication for mental disorders was associated with a mean (±SEM) increase by 2.73 ± 0.31 points, intake of antidepressants with an increase by 2.83 ± 0.33 points, and a reported diagnosis (lifetime) with an increase by 3.94 ± 0.24 points; all associations confirm the suitability of the PHQ-9 to assess effects of medication.

Table 3 Different definitions of mental disorders and medication

Summary effect of COPD characteristics on the PHQ-9

The observed associations with COPD characteristics raised the question of their overall impact on the PHQ-9 sum score in each individual patient. To quantify this impact, using either COPD severity (groups) or comorbidities as predictors, the two regression models described above were used. Correspondingly, the reference was either the lowest degree of symptoms and exacerbation risk, i.e. GOLD group A, or the absence of all comorbidities shown in Fig. 3. At the same time each patients’ individual values of age, gender, BMI, smoking status and pack years were maintained to account for these influencing factors. Figure 4 illustrates the results, taking the observed mean (±SEM) value of 6.31 ± 0.10 for comparison and demonstrating that in the hypothetical absence of COPD the estimated mean values would be 4.52 ± 0.10 and 5.23 ± 0.10, respectively. This result illustrates the summary effect of COPD in our population on the PHQ-9 score, corresponding to an increase from 12 to 22% regarding patients with PHQ-9 values ≥10 and based on GOLD A-D. The inclusion of patients with a previous diagnosis of mental disorder or intake of antidepressants did not add incremental information to this association.

Fig. 4
figure 4

Association of the PHQ-9 sum score to different definitions of mental disorders. This figure illustrates the mean PHQ-9 sum score (black bar), the mean PHQ-9 sum score adjusted for the effect of COPD severity (dark grey bar) and the mean PHQ-9 sum score adjusted for the effect of comorbidities (light grey bar) for all patients (n = 2096), for patients with the reported diagnosis of mental disorders (n = 464) and for patients with the intake of anti-depressants (n = 233). The figure shows that, independent of the subpopulation chosen for analysis, the procedure to take into account COPD characteristics via symptoms/exacerbation risk or comorbidities leads to about the same reduction of the observed PHQ-9 sum score. Moreover it illustrates that both, before and after adjustment, patients with antidepressant therapy showed scores between 6.8 and 9.2, which were larger than those reported in the literature in patients without COPD. This suggests that the adjustment did not fully account for all COPD characteristics, considering the fact that the GOLD group A and the absence of comorbidities were the reference

Discussion

In the present study, we analysed the impact of COPD on the PHQ-9 depression score to reveal whether specific COPD characteristics were associated with changes in the score. The characteristics identified as relevant for the PHQ-9 comprised symptoms and exacerbation risk according to GOLD groups A to D, and the comorbidities asthma, sleep apnoea, gastrointestinal disorders, osteoporosis and arthritis. The GOLD groups were related to all single PHQ-9 items in a parallel manner, whereas the spirometric GOLD grades 1–4 were only related to some questions. The different role of the single items was underlined by their associations with anthropometric data and comorbidities. Our observations suggest that on average the chronic disease COPD was associated with PHQ-9 scores elevations by 1–2 score points. Conversely, when hypothetically assuming patients being in GOLD group A, or without the comorbidities analysed, the PHQ-9 sum score was reduced by this amount.

In all analyses age, gender, BMI, smoking status and pack years were carried as covariates to adjust for their influences. Females, younger subjects and active smokers showed higher scores of depression [25], which is in line with previous data [26,27,28]. We additionally found pack years to be related to the sum score, as well as BMI [29]. The associations with age, gender, BMI and smoking were robust throughout all analyses, thereby underlining the validity of the analyses regarding COPD characteristics.

Both categorisations according to GOLD [19] were related to the PHQ-9 sum score but the relationship to groups A-D was much more consistent and stronger than that to grades 1–4. Symptoms and exacerbation risk had completely additive effects not only on the sum score, but on all single items. Overall, in GOLD D compared to A, the PHQ-9 sum score was higher by more than 4 points. The strong effect observed for symptoms is in accordance with published data, despite the fact that these were obtained using previous GOLD definitions [30] or the CAT instead of mMRC score [6]. The average PHQ-9 sum score in our population was about 6.3 points and not far from the cut-off value of ≥10 considered as indicative for depression [11]. In view of this, the magnitude of the effects of COPD on the PHQ-9 score indicate a significant impact on the diagnosis of depression, in accordance with elevated prevalence values in the literature [5].

The mean influence of COPD, based on either symptoms/exacerbations or comorbidities, on the mean PHQ-9 sum score could be quantified as difference between 6.3 versus 4.5 or 5.2 points, respectively (Fig. 4). Formal adjustment of the mean PHQ-9 score for COPD severity or comorbidities led to values that are similar to values reported for the general population [31]. Correspondingly, the prevalence of score values ≥10 decreased from 22.0 to 11.6% or 14.6%, respectively, again being closer to the range of the general population [32, 33]. The effect attributable to COPD was similar in patients with or without the diagnosis of mental disorder or intake of antidepressants, although their baseline values were different. This suggests that COPD severity and comorbidities are associated with a systematic additive effect on the PHQ-9, and the relationship of COPD to the PHQ-9 was not limited to patients with premorbid depressive symptoms. The numerical estimates provided by us (Figs. 2 and 3) might be useful to quantify the effect of individual COPD characteristics on the PHQ-9 score.

Depression is recognized as a major condition among COPD comorbidities [34], but there is a wide range of prevalence estimates [5], suggesting a significant dependence on the study population and possibly the diagnostic tools. The PHQ-9 as a diagnostic screening tool has been validated in the general population [35], as well as cohorts with specific morbidities [12, 14], including COPD. Inspection of its single items raises the question, whether they are sensitive to impairments common in COPD, and indeed some of the items have already been analysed in these patients [15]. There is no study, however, which quantified the effect of COPD characteristics on the PHQ-9 in detail.

For this purpose, we focused on those patients’ characteristics, which are easily available in clinical practice. Lung function was omitted from the final analysis, as the associations were inconsistent and weak compared to the other influencing factors. An important observation was that all effects on the sum score arising from GOLD groups or from comorbidities were additive and that there were no significant interactions. This considerably simplified the estimation of the impact of COPD characteristics (see Figs. 2 and 3). The combination of comorbidities resulted in a slightly lower effect compared to that of symptoms and exacerbation risk (Fig. 4), suggesting additional influencing factors that are covered by GOLD groups, but not the comorbidities chosen. The effect of COPD severity on the PHQ-9 might even be larger as GOLD group A was taken as reference.

The relationship between depression scores and CAT, as a measure of COPD severity, is close, and a cut-off value of 10 for the PHQ-9 corresponds to cut-off values of 19 to 21 for the CAT [6]. We similarly observed the best correspondence for a cut-off value of 20. Interestingly, this value is similar to the cut-off value of 18 that has been proposed as superior to the conventional cut-off value of 10 regarding the categorisation of COPD severity [36]. The predictive power of the CAT score regarding depression seems high, but not high enough to replace a proper depression questionnaire such as the PHQ-9. We found similar results for the visual analogue scale (EQ VAS) of the generic quality of life questionnaire EQ-5D [9, 10]. All of these findings are in line with the expectation that quality of life scores are closely linked to depression.

Based on the patients’ medication and reports we defined several categorical indices of mental disorder, which resulted in prevalence estimates ranging between 10.3 and 24.3% (Table 3). This, however, did not affect our findings regarding the impact of COPD, and the associations found in the total population were also present when excluding patients with antidepressants. The observation that COPD, as common feature of our patients, showed such homogeneous effects, underlines its systematic impact on the PHQ-9 score and the evaluation on depression based on this.

Studies on the effect of antidepressants on the PHQ-9 reported scores of about 17 at baseline and of 5 after treatment [31]. In our study population, patients with antidepressants showed mean scores of 9.2, and of 6.8 or 7.8 when hypothetically taking into account COPD. The decrease in the PHQ-9 score was similar to that in patients without antidepressants but higher than in the total study population, indicating that the antidepressant therapy did not normalize the PHQ-9 score in COPD patients. Although there are no reasons to assume that antidepressants should be less effective in COPD compared to other patients [34], this observation suggest that antidepressant therapy is not fully effective in COPD, possibly due to the impact of continuing symptom burden from respiratory disease.

Limitations

In COSYCONET the presence of mental disorders was asked as lifetime prevalence based on the diagnosis by a physician, and depression was not specifically asked for. However, patients with a diagnosis of depression at the time of the study could be identified by specific medication, and our findings regarding the impact of COPD on the PHQ-9 score were not affected by this. In addition, the effect of respiratory symptoms and exacerbation risk were additive throughout all analyses in a very consistent manner. We cannot exclude selection effects that are unavoidable in a cohort study with comprehensive, demanding assessments, but it seems unlikely that these have affected our results.

The questionnaires used for the diagnosis of depression are not necessarily equivalent [37], and our results strictly apply to the PHQ-9, which refers to the patients’ clinical state in the previous two weeks. This restriction is unlikely to have played a role in our study, as only patients with stable COPD were investigated [18]. The PHQ-9 could be particularly susceptible to COPD severity, as some of its questions, although targeting depression, address symptoms that are also common in COPD. This was reflected in the correlation between the PHQ-9 and the CAT or EQ VAS, which do specifically target depression, as well as the analysis of single PHQ-9 items. It would be of interest to clarify whether other diagnostic tools for depression also reflect COPD characteristics.

Conclusion

In a large cohort of patients with COPD, there were significant associations between parameters describing the patients‘clinical state and the PHQ-9 that is used for depression screening. The average magnitude of the effect of COPD severity was 1–2 points which accounted for an increase by about factor 2 in the proportion of patients showing PHQ-9 values of at least 10. Noteworthy enough, the approach using GOLD 2017 A to D groups allowed to quantify the impact of COPD on the PHQ-9 score in the most consistent manner, both regarding single items and sum score. The estimates for GOLD groups and comorbidities provided by us may be helpful to assess the effect of COPD in each individual patient. In our study cohort, the patients with antidepressant medication showed an average value of about 9.2 points (see Fig. 4) which is markedly higher than the value of 5.8, which has been reported under treatment with antidepressant medication in the literature [31]. The difference of about 3.4 points corresponds to the combined effects, which we attribute to the symptom burden, exacerbation rates and comorbidities (see Figs. 2 and 3). As exacerbations seem to play a minor role for the PHQ-9 score compared to symptoms and the available bronchodilator therapy in COPD primarily addresses symptoms, it seems plausible that a better treatment of symptoms would also lower the PHQ-9 score. Therefore it might be hypothesized that the optimisation of respiratory therapy in COPD could improve the scoring for depression in COPD patients.