Background

Self-administered questionnaires are required in order to estimate global outcomes of COPD [1]. Available disease-specific Health-Related Quality of Life (HRQoL) measures, mainly the St George’s Respiratory Questionnaire (SGRQ) [2], and the Chronic Respiratory Disease Questionnaire (CRDQ) [3], are reliable and valid, and widely used in clinical trials. There is increasing evidence that HRQoL questionnaires can also be useful in clinical settings [4]; however, existing instruments are lengthy and have complex scoring algorithms, making them poorly suited for routine use in clinical practice and repeated assessment. Most patients require short questionnaires (reducing fatigue, eliminating redundancy of items, and facilitating spontaneous responses).

A standardized, patient-centered assessment instrument covering key aspects of COPD HRQoL facilitates information gathering and improves communication between patient and clinician, particularly for general practitioners. An ideal instrument would identify specific areas of greater severity that would serve as focal points for targeted management or the evaluation of management goals, thereby improving both the process and outcome of care. The instrument must be sensitive enough to measure mild-moderate COPD [5], but also reliable [6, 7], valid for evaluative studies, and useful for determining rehabilitation routines. Previous studies have shown significant changes in quality of life score between three and six months in patients with COPD participating in a rehabilitation program [8] or, after discharge from a rehabilitation program [9, 10].

Recently, short self-administered questionnaires, the COPD Clinical Questionnaire (CCQ) [11], the Short Form Chronic Respiratory Disease Questionnaire (SF-CRQ) [12], the Visual Simplified Respiratory Questionnaire (VSRQ) [13], the COPD Assessment Test (CAT) [14], and the COPD specific HRQoL (VQ11) [15], have been validated with similar psychometric properties (Table  1) and some limitations.

Table 1 Psychometric properties of short HRQoL questionnaires for COPD patients

In practice, the 8-item VSRQ and the 8-item CAT include no subscales. Conversely, the 8-item SF-CRDQ does not provide a total score. The SF-CRDQ, VSRQ and CAT mix answers for frequency and intensity, which could be difficult to distinguish in patients with COPD.

Conceptually, these instruments establish confusions to estimate the impact of COPD between health status and HRQoL. The concept of health status refers to the impact of health on the individual’s ability to perform daily life activities and to benefit from them [16]. HRQoL refers to the three broad dimensions of health: physical (e.g., autonomy, capacity, symptoms), psychological (e.g., pain, self-esteem, and symptoms), and social (e.g., social relationship, family relationship) [17]. Thus, an instrument dedicated to the measure of HRQoL needs to provide 3 dimensions (physical, psychological and social) and an overall total score.

Qualitatively, the SF-CRDQ and the CAT include information on daily symptoms, activity limitation and other physical manifestations of COPD (Table  2). These instruments are specific to the functional and psychological outcomes of COPD (symptoms, function and confidence in living at home), whereas the social effects produced by COPD can only be assessed by HRQoL measures [18]. The VSRQ and the CAT do not include any item on depression, which is prevalent in COPD patients and alters HRQoL [19]. The 10-item CCQ includes one social life question and relates directly to respiratory problems. The VQ11 is a short instrument that was designed to measure the functional, psychological and social aspects of COPD consequences and provide an overall score for specific HRQoL.

Table 2 Item coverage of short HRQoL questionnaires for COPD patients

Psychometrically, for all the above instruments but the CAT and VQ11, the item reduction selection was made by a committee of experts and without the use of statistical analysis such as Confirmatory Factor Analysis. A previous study showed that the VQ11 has good content and internal properties [15]. This study was carried out to verify its construct’s validity and reliability in comparison to other short instruments (Table  1).

Methods

Participants

Participants were recruited from three pulmonary clinics and two medical offices between January 2008 and June 2009 using advertising flyers. To be included in the study, patients had to be aged between 40 and 85 years, and have an incompletely reversible limitation in airflow (forced expiratory volume in 1 s (FEV1) to forced vital capacity (FVC) ratio ≤ 70%) and I to IV GOLD stage (FEV1 < 80% th.). Patients with severe or uncontrolled comorbidities (unstable and/or uncontrolled cardiac disease, terminal disease, dementia, or an uncontrolled psychiatric illness) were excluded. 181 (123 males and 58 females) participated in the study after providing informed written consent. Demographic and clinical characteristics are listed in Table  3. The subjects did not participate in any other research study during this period. This study was approved by the University of Montpellier 1 Ethics Committee and the Regional Ethics Committee (authorization number: A00332-53).

Table 3 Clinical characteristics and HRQoL measures for 181 patients (123 males and 58 females)

Study design

Upon recruitment, the following examinations were performed for each participant: a clinical assessment, the collection of cardio-respiratory family history and number of exacerbations for respiratory (or other) reasons, a respiratory function examination (spirometry, blood gas analysis), an electrocardiogram, and a six-minute walk test (6MWT). Participants were also required to complete the experimental questionnaire, the external validity questionnaires (Medical Outcome Survey Short - SF-36, SGRQ-C, Hospitalization Anxiety Depression Scale - HADS, Physical Self-Worth - PSW) and a datasheet on their socio-cultural situation. Appointments were made with participants who were able and willing to undergo a follow-up assessment one week later. Forty-nine participants completed the VQ11 again to test its reproducibility. To ensure uniform assessments in this multicentric study (Montpellier, Paris, Osséja), the medical and scientific committee of the healthcare network developed recommendations and one teaching session for a standard protocol for instrument use and patient assessment.

Completion of questionnaires

The participants completed the battery of self-administered questionnaires, presented in a randomized order, while resting between physical tests. Participants who took part in both sessions were examined by the same researcher on both occasions.

VQ11

The VQ11 is a brief, self-administered HRQoL questionnaire that was specifically designed to allow individual monitoring of COPD patients over a short-term period. The questionnaire’s preliminary versions were developed according to the standard stages of questionnaire validation [20, 21], including evaluation of content validity, item clarity and construct validity [15]. An initial version of the questionnaire was drawn up by a panel of experts consisting of 20 COPD professionals and 15 patients with different degrees of COPD and different psychosocial levels. The clarity of each item was then tested on 20 patients with different degrees of COPD and different psychosocial levels. After making adjustments to the initial questionnaire, the committee produced an experimental questionnaire with 24 items, covering three theoretical components (functional, psychological and relational) and 11 sub-components. Each sub-component consisted of two or three items. This experimental questionnaire was tested on 166 COPD patients. Confirmatory factor analysis showed that the best model was a two-level hierarchical model with an initial level comprising 11 items (one per sub-component) distributed across three components (functional = 3 items; psychological = 4 items; social = 4 items) and a top level (lower score indicates better HRQoL) combining these three components. Cronbach’s alphas were calculated to test the internal consistency scales. The resulting values were 0.83 for the functional component, 0.69 for the psychological component, 0.57 for the social component and 0.83 for the total scale. Table  4 shows the French version of the VQ11 and a cross-cultural translation produced by three native speakers of English.

Table 4 Content and structure of the VQ11 questionnaire and its cross-cultural translation by three native speakers of English

Other questionnaires

The MOS-SF-36 [22, 23], the SGRQ-C [24], the HADS [25], and the PSW of the French version [26] of the Physical Self-Perception Profile [27] were assessed.

MMRC scale

The degree of dyspnea was measured using the Modified Medical Research Council (MMRC) scale [28], which correlates well with other scales and health status scores [29].

Analyses of respiratory function

The pulmonary function tests (PFTs) included simple screening spirometry, formal lung volume measurement, diffusing capacity for carbon monoxide, and arterial blood gases. We measured the volume-time curve and the flow-volume loop, as well as FVC and FEV1 in order to calculate FEV1/FVC indices [30].

Exercise tolerance

The 6MWT test was performed twice with more than 30 minutes between tests in order to allow heart rate and dyspnea to return to their initial rest values [31]. A dyspnea score was measured on a 10-cm visual analog scale (VAS) before and immediately after the test.

Statistical analyses

CFA was used to confirm the theoretical model found during the internal validation (Statistical Software Mplus 5.1). Fit assessment of the CFA models was based on multiple indicators [3234], including the Chi-square statistic (χ2), comparative fit index (CFI), Tucker-Lewis Index (TLI), root mean square error of approximation (RMSEA), and 90% confidence interval (CI) of the RMSEA. Values greater than 0.90 for CFI and TLI are considered to indicate adequate model fit, although values approaching 0.95 are preferable. Values smaller than 0.08 or 0.06 for the RMSEA indicate acceptable and good model fit, respectively [33, 34]. For the RMSEA 90% CI, values less than 0.05 for the lower bound (left side) and less than 0.08 for the upper bound (right side) or of 0 for the lower bound and less than 0.05 for the upper bound (right side) indicate acceptable and good model fit, respectively [35]. Factor loadings, squared multiple correlations, standard errors and t values were inspected for appropriate sign and/or magnitude.

Concurrent validity was assessed by analyzing Pearson correlation coefficients between the dimensions of the study questionnaire and those of another questionnaire measuring similar concepts. Spearman correlation coefficients were calculated between the VQ11 (total and component scores) and other independent variables. In order to confirm good concurrent validity but no redundancy, the new questionnaire had to show moderate correlation (0.40 to 0.70) with a well-established mea surement tool.

Reliability is the degree to which an instrument is free from random error. It is evaluated by measuring internal consistency reliability and reproducibility. Internal consistency reliability refers to the homogeneity of the items of the scale and was assessed using Cronbach’s alpha. Reproducibility establishes the stability of an instrument over time in a stable population and was tested using a Pearson correlation.

Results

Patient demographics

One hundred eighty-one participants completed the study baseline questionnaires and 49 participants completed the follow-up questionnaires. The mean ± SD and range of the physiological and patient-reported outcomes for the study population are summarized in Table  3.

Factor validity

A CFA supported the validity of a two-level hierarchical model with a three-component initial level and a single top level (Figure  1). The weighted least squares mean- and variance-adjusted χ2 estimator analyzed all the items as categorical variables. Fit indices were acceptable (χ2 = 133.090; df = 24; CFI = 0.910, TLI = 0.955; RMSEA = 0.158). The fit for the one-factor model was as good as the fit for the three-factor model (χ2 = 135.573; df = 25; CFI = 0.909, TLI = 0.956; RMSEA = 0.156), but it was less acceptable with the difftest (difftest = 12.277; df: 3; p = 0.0065).

Figure 1
figure 1

Hierarchical model and indices obtained for the VQ11 using confirmatory factor analysis (n = 181 COPD patients).

Reproducibility was assessed in terms of the correlations between the measures produced by the 49 COPD patients who were reassessed after a one-week period (42 patients under 10% of variation; 7 patients under 20% of variation). Correlation coefficients were 0.76 for the functional component (p < .01), 0.65 for the psychological component (p < .01), 0.73 for the social component (p < .01), and 0.88 for total VQ11 (p < .01).

Reliability

Cronbach’s alphas were 0.80 for the functional component, 0.68 for the psychological component, 0.77 for the social component and 0.89 for the whole scale. Alphas computed using the Spearman-Brown formula were 0.91, 0.81 and 0.87, respectively.

The low correlations between VQ11 and FEV1 (functional component, -0.26; total, -0.19) suggest that there are no significant differences between groups classified by grade of severity and VQ11 score.

Table  5 presents the clinical characteristics and HRQoL measures of the 181 COPD participants according to quartiles of VQ11 score. Associations between increasing VQ11 quartiles and clinical characteristics were found for FEV1, pre-BD FEV1, FEV1/FVC and 6MWT distance (negative association), as well as for BODE score and dyspnea from both MMRC classification and the 6MWT-end measure (positive association). There were no significant associations between VQ11 quartiles and either age or BMI. HRQoL measures from SGRQ were all positively and significantly associated with VQ11 quartiles including SGRQ symptoms, activity, impact and total scores (p < 0.0001).

Table 5 Clinical characteristics and HRQoL measures by quartiles of VQ11 score

Construct validity

The construct validity results are shown in Tables  6 and 7. The VQ11 showed good correlation with SGRQ total scores (0.71), SF-36 component scores (-0.61 for MCS and -0.61 for PCF), HADS total scores (0.61), and physical self-worth (-0.59). SGRQ total scores correlated most strongly with the functional component of the VQ11 (0.66), whereas MCS and HADS scores correlated most strongly with the psychological and social components of the VQ11 (-0.61 and 0.63; -0.60 and 0.61 respectively). Furthermore, we found significant correlations between VQ11 total scores and 6MWT end dyspnea scores, 6MWT distances (in meters and in percentage of the predicted distance) and total BODE scores (0.26, -0.37, 0.51 respectively). These correlations were particularly satisfactory for the functional component of the VQ11 (0.28, -0.42, 0.56 respectively). Correlations with FEV1, FEV1 as a percentage of the predicted value, FEV1/FVC and pack-years for smokers were low.

Table 6 Correlation between VQ11 and other variables of interest
Table 7 Correlation between VQ11 and other questionnaire scores

Discussion

This study examined the validity and reliability of the VQ11, a short, self-administered questionnaire specifically designed for repeated assessment of patients with COPD and for use in routine care. The results show that the VQ11 provides a simple and reliable measure of overall COPD-related HRQoL and physical, psychological, and social components of HRQoL as expected by experts [17].

The hierarchical structure of a preliminary version of the VQ11 had previously been tested on a sample of 166 COPD patients [15]. Therefore, the initial aim of the present study was to use CFA to verify this factorial structure with a new sample of COPD patients. Our findings demonstrated that the higher-order factor model provided a satisfactory fit to the data and a better fit than the alternative models. These results confirm those from a previous study [15]. The model contains physical, psychological, and social components, in line with guidelines for HRQoL questionnaires [17, 22, 34]. Our analysis showed that these components can be examined separately.

The concurrent validity of the VQ11 was confirmed by the correlation between VQ11 scores (for the individual components and for the total score) and the SF-36 or the SGRQ-C. VQ11 total scores strongly correlated with scores for the physical and mental components of the SF-36, and with scores on the eight scales that make up these components. As expected, the highest correlation was for the physical functioning scale [36]. Also as expected, the highest correlations were between corresponding components of the VQ11 and the SF-36 (VQ11-functional scale and the SF-36 physical component and physical functioning scales; VQ11-psychological scale and SF-36 emotional role, energy/vitality, mental health, general health perceptions and mental component scales; VQ11-social scale and SF-36 social functioning scale). By contrast, the correlation between the component and total scores on the VQ11 and the SF-36 bodily pain scale was not particularly strong. However, because the SF-36 was not designed to measure sleep disturbances and respiratory complaints, this relatively weak correlation does not affect the concurrent validity of the VQ11. As expected, VQ11 total scores correlated well with SGRQ-C scores and three specific domains.

As expected also [37], VQ11 total scores correlated well with HADS depression and total scores. The results also supported the weak correlations between airway obstruction and HRQoL [7, 38, 39].

The three components of the VQ11 are disease-specific domains of HRQoL for COPD patients. VQ11 functional scores correlated with dyspnea (MMRC grade and 6MWT distance), BODE index, exercise tolerance (6MWT distance), the activity and impact scores of the SGRQ-C, and the physical functioning and physical components of the SF-36. The items of the VQ11 reflect the main symptoms perceived by patients with COPD, dyspnea [28], physical limitation [24] and fatigue [37, 38, 40].

Significant correlations were found between the psychological component of the VQ11 and the HADS anxiety and depression scales, the SGRQ-C impact scale, the emotional role and mental components of the SF36, and physical self-worth (defined as physical self-esteem). This emphasizes that disease-associated anxiety and depression are important HRQoL factors for COPD patients. The degree of anxiety felt by COPD patients has been shown to be related to their degree of pulmonary dysfunction [41].

The social component of VQ11 correlated with the social functioning and mental components of the SF36. Social relationships are affected by chronic respiratory disease, especially in patients with severe respiratory insufficiency who often depend on close social relationships to manage daily activities [42]. Patients with COPD experience losses in several areas of their lives, and they may feel useless, experience reduced sexual activity, depend on others for their personal care and lose interest in future projects.

The total, functional, psychological, and social components of the VQ11 showed good reliability over a period of one week for patients without clinical change. The correlation coefficient for the psychological component was the lowest. This result can be explained by intra-individual variability in the perception of disease and health status. A recent study noted higher day-to-day instability in self-esteem, which is a major correlate factor of HRQoL for COPD patients compared with healthy adults [41]. Consequently, it would be advisable for clinicians to ask patients to complete the VQ11 every three to six months, in order to assess the stability of patients’ perceptions of their illness, as this perception may be a sign of vulnerability and of the likelihood they will not fully adhere to their treatment.

There were limitations to the study. The sample was relatively homogenous, with all subjects having moderate to severe COPD, the majority being ex-smokers with a significant smoking history. The concept of quality of life and its implications on daily life are different for men and women [21]. We could not validate the new questionnaire separately for men and women due to the small sample size. The VQ11 also needs to be studied in other ethnic populations and cultures for cross-cultural validity. In addition, the responsiveness of the VQ11 to interventions and comparisons with other quality of life questionnaires is also required. Last, using a new patient-reported outcomes questionnaire requires the determination of the minimal clinically important difference [43].

In practice, there are four general benefits to using the VQ11: (1) It helps clinicians quickly detect the worsening of HRQoL in COPD patients. This deterioration can then be explained by the acknowledgment of COPD, poor disease self-management (routine or acute situation), the presence of comorbidities (depression, sleep trouble, metabolic syndrome…) and/or weak support from family and friends. Possible consequences include an increase in the exacerbation risk and aggravation of COPD, the development of health-risk behavior or of a new disease, and/or the deterioration of communication with caregivers, family or friends. (2) The three components of the VQ11 allow caregivers to assess and correct a number of situations: based on a high score on the functional component, informed decisions can be made on therapy, modification of current treatment, new assessment, physiotherapy or comprehensive rehabilitation; with a high score on the psychological component, those decisions can be made regarding new assessment, psychological support, education, or comprehensive rehabilitation; finally, with a high score on the social component, decisions can be made concerning social support, psychological support, education, membership in a health network or patients association. (3) Based on the anticipated validation of a Minimal Clinically Important Difference, the VQ11 can also help monitor the efficacy of an individual’s therapeutic decision. (4) Lastly, the VQ11 provides clinicians with meaningful cues to examine a COPD life consequence more specifically when an answer to an item is more than three. Moreover, the back of the form includes an educational message with a space for drawing and commenting in which individual messages can be exchanged between patients and caregivers, family or friends.

Conclusions

This study showed the validity and reliability of the VQ11, a short, self-administered questionnaire specifically designed for repeated assessment of patients with COPD and for use in routine care. The VQ11 provides clinicians and patients with a simple and reliable measure of overall COPD-related HRQoL. The VQ11 facilitates discussions about the overall consequences of COPD arising from the illness’s physical symptoms and psychological perceptions, observance behaviors, health behaviors, life projects with COPD, and social support. Additional information is needed to provide responsiveness to change at the individual patient level, an essential feature for its use in clinical practice.