Background

Chronic obstructive pulmonary disease (COPD) is a highly prevalent chronic disease, characterized by expiratory airflow limitation, which is not fully reversible and usually progressive[1]. The Global initiative for chronic Obstructive Lung Disease (GOLD) classified the severity of COPD in mild (stage I), moderate (stage II), severe (stage III), or very severe (stage IV) according to the severity of airway obstruction and the presence of respiratory failure[1]. Despite optimal medical treatment, patients experience symptoms such as dyspnea, fatigue, and/or chronic cough[2]. Consequently, patients have a poor exercise capacity and limited performance in daily life activities[3, 4].

Health status can be defined as “the impact of health on a person’s ability to perform and derive fulfillment from the activities of daily life”[5]. A patient’s self-reported health status thus includes health-related quality of life and functional status[5]. Patients suffering from COPD in all GOLD stages report an impaired health status, with worst scores reported by GOLD stage IV patients[68]. Therefore, improvement of health status is an important component in COPD management, including pulmonary rehabilitation[912].

A careful assessment of patient’s health status is necessary to identify targets for (non-) pharmacological interventions, as well as to evaluate the effects of these interventions[13]. Disease-specific health status questionnaires (i.e., related to a specific condition or group) or generic health status questionnaires (i.e., intended for general use, irrespective of the underlying disease) can be used as instruments to assess health status[14]. For instance, the Saint George’s Respiratory Questionnaire (SGRQ) is a commonly used instrument measuring disease-specific health status among patients with COPD[1520], while the European Quality of Life-5 dimensions (EQ-5D)[18, 19, 21], Assessment of Quality of Life Instrument (AQoL)[22, 23], and Medical Outcome Study 36-Item Short-Form Health Survey (SF-36)[15, 17, 2426] are used to assess generic health status.

Previous cross-sectional studies found that disease-specific and generic health status questionnaires are moderately to strongly correlated in patients with COPD[17, 19, 27]. A longitudinal study among patients with newly detected COPD suggested that changes in a disease-specific health status questionnaire (Chronic Respiratory Disease Questionnaire, CRQ) are significantly correlated with the changes in several sections of a generic health status questionnaire (Nottingham Health Profile, NHP)[28]. However, longitudinal studies analyzing the correlations between disease-specific and generic questionnaires at different time points in patients with advanced COPD are lacking. Therefore, the aims of the current longitudinal observational study were: (1) to determine whether and to what extent a disease-specific questionnaire (SGRQ) correlates with generic health status questionnaires (EQ-5D, AQoL and SF-36) at four different time points in patients with advanced COPD; and (2) to determine the correlation between changes in these health status questionnaires during one-year follow-up.

Methods

This study is an analysis of a longitudinal observational study concerning symptoms and care needs of patients with advanced chronic organ failure[29]. Methodological details of this study and cross-sectional data about health status, symptom distress, family caregiving and advance care planning in patients with advanced COPD were published before[2934].

Study population

Patients with advanced COPD were recruited by their chest physician at the outpatient clinic of one academic and two general hospitals in The Netherlands between January 2008 and June 2009. Patients were included if they had a physician-confirmed diagnosis of severe to very severe COPD (GOLD stages III and IV). Patients were excluded if they were not clinically stable for at least four weeks preceding enrolment (no hospital admission or major change in medication, according to the treating chest physician), if their pharmacological therapy was not optimal (according to the current available guidelines) and stable for at least two months preceding enrolment and if patients lived in a nursing home. All participating patients gave written informed consent. The Medical Ethical Committee of the Maastricht University Medical Centre+ (MUMC+), Maastricht, the Netherlands, approved this study (MEC 07-3-054). The study was registered at the Dutch Trial Register (NTR 1552).

Instruments

Patients were visited by a member of the research team in their home environment. The following outcomes were assessed at baseline: demographics, weight and height, smoking status, current self-reported comorbidities (Charlson Comorbidity Index)[35], long-term oxygen therapy (LTOT), self-perceived mobility problems, dependency in personal care, and forced expiratory volume in the first second (FEV1). FEV1 was calculated from the flow-volume curve measured by a handheld pulmonary spirometer[36]. Severity of dyspnoea was measured using the modified Borg scale (range 0 [nothing at all] to 10 [maximal])[37]. Symptoms of anxiety and depression were studied using the Hospital Anxiety and Depression scale (HADS), which is divided into an anxiety subscale (HADS-A) and a depression subscale (HADS-D)[38]. Total scores for each subscale range from 0 (optimal) to 21 points (worst). Patients were visited at baseline, and four, eight, and 12 months after baseline to assess disease-specific and generic health status using the questionnaires as described below.

Disease-specific health status

Disease-specific health status was assessed using the SGRQ which consists of 76 items. The SGRQ provides three domain scores (symptoms, activity and impact) and a total score, ranging from 0 (optimal) to 100 points (worst)[39]. A recall period of three months was used.

Generic health status

Generic health status was assessed using the self-administered questionnaires EQ-5D[21], AQoL[22] and the SF-36[24].

EQ-5D is a five-item questionnaire consisting of several items: mobility, self-care, usual activity, pain/discomfort, and anxiety/depression[21]. Each item has three levels: no problems (1), some problems (2), and extreme problems (3). Higher scores on the several items indicate more severe problems. An index score is provided which ranges from −0.59 (worst) to 1.0 (best)[21]. In addition, patients rated their current health using a visual analogue scale (VAS). VAS score ranges from 0 (death or worst possible health) to 100 (best possible health).

AQoL consists of 15 items divided into five domains: illness, independent living, social relationships, physical senses, and psychological well-being[22]. Total score ranges from −0.04 (worst) to 1.00 (best)[40].

SF-36 consists of 36 items divided into eight domains: physical functioning, role-physical, bodily pain, general health, vitality, social functioning, role-emotional, and mental health. For each domain, scores range from 0 (worst) to 100 points (best)[24]. A Physical Component Summary measure (PCS) and a Mental Component Summary measure (MCS) are provided using norm-based methods with scores from a Dutch general population[41]. These summary measure scores are transformed to create a minimum and maximum possible score of 0 and 100 points. All scores below 50 points can be interpreted as below the general population norm[24].

Statistics

Categorical variables were described as frequencies. Continuous variables were tested for normality and were described as mean and standard deviation (SD). Only patients who completed all four visits were included in the analyses. To compare baseline characteristics between patients who completed the study and those who withdrew from the study, an independent sample t-test or Mann–Whitney U test was used, as appropriate. For categorical variables, a Chi square test was used. Depending on the variable distribution, a paired sample t-test or Wilcoxon Signed-Rank test was performed to compare the baseline scores of the health status questionnaires with the follow up scores at four, eight, and 12 months. To visualize the correlations between the SGRQ total score and the EQ-5D index score, AQoL total score, and SF-36 PCS and MCS score at baseline, scatterplots were constructed. Furthermore, correlations of the changes in SGRQ total score and EQ-5D index score, AQoL total score, and SF-36 PCS and MCS score and the correlations of the changes in generic health status questionnaires between baseline and four months, four and eight months, and eight and 12 months were shown using scatterplots. To illustrate the correlations of the baseline EQ-5D, AQoL, SF-36 PCS and MCS scores and the correlations of the changes between these generic health status questionnaires, scatterplots were constructed. Additionally, bivariate correlations (Pearson’s product–moment correlation) were performed. Strength of correlation was classified as follows: absent (<±0.20), weak (±0.20 to ±0.34), moderate (±0.35 to ±0.50), and strong (> ± 0.50)[19, 42]. The agreement in the direction of change in SGRQ total score and EQ-5D index score, AQoL total score, and SF-36 PCS and MCS score was determined. The four-months changes were classified into: no change, improvement and deterioration. For generic questionnaires, improvement is defined as a change >0 + (1.96*SE of mean change), deterioration is defined as a change <0-(1.96*SE of mean change), otherwise no change. For SGRQ improvement is defined as a change <0-(1.96*SE of mean change), deterioration is defined as a change >0 + (1.96*SE of mean change), otherwise no change. Cohen’s Kappa was used to test agreement. Kappa values were categorized as poor (<0.0), slight (0.00-0.20), fair (0.21-0.40), moderate (0.41-0.60), substantial (0.61-0.80), and almost perfect (0.81-1.00) agreement[43]. A p-value ≤0.05 was interpreted as statistically significant[44]. All statistical analyses were performed using SPSS for Windows, Version 19.0.

Results

Baseline characteristics

In total, 105 COPD patients were included (Table1). Most patients had very severe COPD (GOLD stage IV: n = 77, 73.3%). Fifty-nine percent of the patients used long-term oxygen therapy. Patients were generally slightly overweight, and 85% of the patients experienced mobility problems. Patients who withdrew from the study were more frequently dependent in personal care than patients who completed the study.

Table 1 Baseline patient characteristics

Health status

At baseline, patients generally had an impaired disease-specific and generic health status (Table2). Patients who withdrew from the study reported worse scores on the SGRQ impact domain, several domains of the EQ-5D and AQoL, the EQ-5D index score and AQoL total score compared to patients who completed all four visits.

Table 2 Baseline health status

On average, patients who completed the study (n = 84) reported 1.1 (1.1) exacerbations during 12 months follow-up. Generic health status scores at baseline and 12 months after baseline were comparable between patients who had no exacerbations during 12 months (n = 33) and patients who had one or more exacerbations during 12 months (n = 51) (p >0.05). Patients who experienced one or more exacerbations during 12 months reported a worse SGRQ total score (58.3 (16.3) points) at baseline compared to patients who had no exacerbation during 12 months (50.7 (14.6) points), p = 0.03.

Table3 shows the changes in health status over time. SGRQ total score, EQ-5D index score, and SF-36 PCS and MCS scores remained stable over time. Patients reported worse AQoL total scores at eight and 12 months compared to baseline.

Table 3 Health status at baseline, 4, 8 and 12 months

Correlations

Correlations between SGRQ total scores and the EQ-5D index score, AQOL total score, and SF-36 PCS and MCS scores were weak to strong at the four different time points (Table4).

Table 4 Correlations between SGRQ total score and generic health status questionnaires

Figure1a to1h show the correlations between the SGRQ total scores and EQ-5D index scores, AQoL total scores and SF-36 PCS and MCS scores at all time points, and the correlations between the changes in SGRQ total scores and the changes in EQ-5D index scores, AQoL total scores and SF-36 PCS and MCS scores. At baseline, disease-specific and generic health status questionnaires were moderately to strongly correlated, while the correlations between the changes were weak or absent.

Figure 1
figure 1

Correlation between SGRQ total score and EQ-5D index score (a), AQoL total score (c), SF-36 PCS score (e) and SF-36 MCS score (g) at baseline and correlation between changes between 0–4 months, 4–8 months and 8–12 months of SGRQ total score and changes of EQ-5D index score (b), AQoL total score (d), SF-36 PCS (f) and SF-36 MCS score (h) (all data points together); n = 84; Δ, change in.

Furthermore, at baseline, generic health status questionnaires were moderately to strongly correlated. The correlations between the changes were weak to moderate while the changes in AQoL total score were not correlated with the changes in SF-36 PCS score (Figures2a to2j).

Figure 2
figure 2

Correlation between EQ-5D index score and AQoL total score (a), SF-36 PCS score (c), SF-36 MCS score (e) and correlation between AQoL total score and SF-36 PCS score (g) and SF-36 MCS score (i) at baseline and correlation between changes from 0–4 months, 4–8 months and 8–12 months of EQ-5D index score and changes of AQoL total score (b), SF-36 PCS score (d), SF-36 MCS score (f) and changes of AQoL total score and SF-36 PCS score (h) and SF-36 MCS score (j) (all data points together); n = 84; Δ, change in.

Agreement

Table5 shows the agreement of the direction of changes in SGRQ total scores with the direction of changes in EQ-5D index score, AQoL total score, SF-36 PCS and SF-36 MCS scores. The direction of changes in SGRQ total scores agreed slightly (Kappa ranged from 0.12 to 0.18, p <0.05) with the direction of changes in EQ-5D index score, AQoL total score and SF-36 PCS score.

Table 5 Direction of 4-months changes in SGRQ total score and EQ-5D index score, AQoL total score, SF-36 PCS and SF-36 MCS

The direction of changes in EQ-5D index score agreed fairly with the direction of changes in AQoL total score and SF-36 PCS score (Kappa ranged from 0.21 to 0.34, p <0.05). The direction of changes in AQoL total score agreed moderately with the direction of changes in SF-36 PCS score (К = 0.41, p <0.001). There was no agreement between the direction of changes in SF-36 PCS score and the direction of changes in EQ-5D index score (К = 0.17, p = 0.07) and AQoL total score (К = 0.15, p = 0.17).

Discussion

Key findings

The current study shows that the SGRQ total score and the EQ-5D index score, AQoL total score, and SF-36 PCS score were moderately to strongly correlated at four consecutive time points in patients with advanced COPD, while SGRQ total scores were weakly correlated with SF-36 MCS scores at four, eight and 12 months after baseline. The correlations between changes over time were weak or even absent. This is probably due to a poor or absent agreement in the direction of changes in the different health status questionnaires.

Previous cross-sectional studies found moderate to strong correlations between disease-specific and generic health status questionnaires in patients with COPD[19, 27]. The current longitudinal study extends these findings and shows that SGRQ total scores were moderately to strongly correlated with the EQ-5D index score, AQoL total score, and SF-36 PCS score at four consecutive time points during one year follow-up. SGRQ total scores were weakly correlated with SF-36 MCS scores at four, eight, and 12 months. This can be explained by the fact that the SF-36 MCS focusses on mental aspects of health status, while the SGRQ total score includes several components of disease-specific health status. These findings are in line with Katsura et al. who reported even no correlation (r = 0.19) between the SGRQ total score and SF-36 MCS score in elderly patients with COPD[45].

Generic health status questionnaires were moderately to strongly correlated, while the correlations between the changes were weak to moderate. Pickard et al. found similar data in a cross-sectional study including inpatients and outpatients diagnosed with COPD concerning the EQ-5D index score and SF-36 PCS and MCS score[19]. EQ-5D index score showed the best correlation with AQoL total score, at baseline and between changes over time. Holland et al. compared the AQoL and EQ-5D in elderly patients admitted to an acute hospital and found a comparable correlation of the baseline scores and changes between these two questionnaires as in the present study[46]. Thus, even the generic health status questionnaires lack correlation. Researchers and clinicians should consider carefully which questionnaire is in line with their individual study objectives and study population. To the best of our knowledge, the AQoL has not been used as generic health status questionnaire in patients with COPD before. The current findings may suggest that the AQoL can be used to assess health status in outpatients with advanced COPD. However, future research is needed to support these results.

To the best of our knowledge, this is the first longitudinal study exploring the correlations (at four consecutive time points as well as the changes between these time points) between the SGRQ and the EQ-5D, AQoL, and SF-36 in patients with advanced COPD. Previously, improved health status (measured by SGRQ and SF-36) was found in patients with COPD who followed pulmonary rehabilitation or received home care[4749]. Thus, it seemed reasonable to hypothesize that changes in disease-specific and generic health status questionnaires were somehow correlated. However, in the present study, the correlations between the changes in SGRQ total scores and in AQoL total scores as well as SF-36 MCS scores were absent. The changes in SGRQ total score and EQ-5D index score and SF-36 PCS scores were weakly correlated. These weak or absent correlations can be explained by the fact that the direction of changes in SGRQ total scores and EQ-5D index score, AQoL total scores and SF-36 PCS score just slightly agree. There was even no agreement on the direction of change in SGRQ total score and SF-36 MCS score. The agreement on the direction of change in disease-specific and generic health status was about 45%, while in 25 to 35% of the patients the observed four months changes were in opposite direction (Table5). Ritva et al. compared the changes measured by SGRQ and 15D, a 15 dimensional generic health status questionnaire, among asthmatic patients and concluded that these instruments agreed on direction of change in health status in 64.8% of patients. In 15.8% of the cases the changes were in an opposite direction. In the remaining patients, health status changed according to one questionnaire[50]. However, Ritva et al. assessed changes in health status during three years[50], while we studied changes in health status during one year.

The current results support that healthcare professionals should use disease-specific as well as generic instruments to gain insight into the disease-specific and general impact of the disease and to get an in depth understanding of patients’ health status. Indeed, disease-specific questionnaires include relevant questions related to the patient’s disease and may generally be more sensitive to the disease and small changes in health status[17]. Generic health status questionnaires are intended for general use and are universally applicable: they tend to cover a broad variety of aspects (e.g. functional states, perceptions, social opportunities)[14, 17]. Indeed, disease-specific instruments may be more responsive to change in clinical status or more sensitive in distinguishing patients with different disease severities while generic measures may be more likely to detect unexpected events which are probably not related to the disease and detect effects of diverse aspects on a disease[5]. The current study revealed that patients who experienced one or more exacerbations during 12 months reported a worse SGRQ total score at baseline compared to patients who had no exacerbation during 12 months while there were no significant differences according to the generic measurements. This may illustrate the disease-specific qualities of the SGRQ: the SGRQ may be more sensitive to detect differences in clinical status in patients with respiratory diseases. On the other hand, the AQoL showed changes over one year in total score and several domain scores (e.g., “independent living” or “illness”) while the SGRQ did not show changes over one year. This underlines the fact that generic and disease-specific instruments measure different aspects and cannot replace each other. Combining both types of questionnaires allows healthcare professionals to identify targets for interventions as well as to evaluate the effects of these interventions[13].

Recently, Pakhale et al. constructed a new questionnaire (McGill-COPD Quality of Life Questionnaire) by combining items of a disease-specific questionnaire with qualities of a generic questionnaire to measure health status in COPD patients[51]. Combining items of different measures can reduce patients’ load and increase their attention and participation. But it is still not clear which questionnaires or items in terms of psychometric properties (e.g. validity, reliability, sensitivity to change) are most suitable to use for combination. The McGill-COPD Quality of Life Questionnaire suggests possibilities. However, this questionnaire needs to be validated before it can be used in clinical practice or further research.

Interestingly, participants in the present study reported a better generic health status over time according to the AQoL illness domain, while the AQoL total score showed worse generic health status. This was also true for some EQ-5D domains and the EQ-5D VAS score (Table3). These findings show that domain scores as well as total scores should be examined to achieve a correct understanding of patients’ health status. Furthermore, our results may suggest that the AQoL is more sensitive to detect changes than the SF-36 or SGRQ. However, the current study did not focus on sensitivity of health status questionnaires. A lack of previous studies using the AQoL as generic health status measure underlines the importance of further research in this field.

The present study has some methodological considerations that should be considered in interpreting the results. First, the study population consists of a convenience sample of patients with advanced COPD. Thus, it remains unknown whether these data are applicable for patients with GOLD stage I or II. Second, about 20% of the baseline study sample withdrew from the study during one year follow-up. These patients were more dependent in personal care and reported a worse baseline generic health status than patients who completed the study. It may be possible that changes in disease-specific and generic health status were different among patients who completed the study and patients who withdrew from the study. However, total scores of the disease-specific questionnaire SGRQ were comparable with findings from previous studies[6, 19]. Third, the duration of the follow-up period is limited. However, longitudinal studies among elderly with advanced chronic diseases are scarce and recruitment and sustained participation is challenging in this population[52, 53]. Fourth, ceiling effects and floor effects in longitudinal studies can limit the results because changes cannot be adequately detected. However, the proportion of patients reporting the highest possible score on the disease-specific and generic questionnaires ranged from 1.2-8.3% for all data points, while none of the patients reported the lowest possible score on any of the disease-specific and generic health status questionnaires on any of the data points. Because ceiling effects and floor effects are considered to be present if more than 15% of the subjects score the highest or lowest possible value, respectively[54], the influence of ceiling effects and floor effects seems to be limited in the current study. Fifth, the current findings need to be interpreted in the light of the number of comparisons that were made in the present study. Nonetheless, multiple findings in the same direction, rather than a single statistically significant result, suggest that these are not due to chance alone. Moreover, ‘Bonferoni adjustments are at best, unnecessary and, at worst, deleterious to sound statistical inference’[55].

Conclusion and implications

The current study suggests that health care professionals should use disease-specific as well as generic instruments to gain insight into the disease-specific and general impact of the disease. This is necessary to identify targets for interventions with the aim to improve disease-specific and generic health status and to evaluate the effects of interventions. Future research is required to compare disease-specific and generic health status questionnaires in a broader population. In addition, the possibilities and characteristics of a combined instrument measuring health status in patients with advanced COPD should be studied.