Findings

Objectives

Nasal congestion (NC) has been described as one of the most troublesome symptoms for patients with allergic rhinitis (AR) and is associated with poorer sleep, mood, and productivity [1, 2]. A new tool to measure patient experience of NC is the Congestion Quantifier Seven-Item test (CQ7) which was developed recently in the United States [3]. The CQ7 was originally developed as a screening tool to identify patients with NC potentially requiring treatment and the original version was shown to have excellent reliability, validity, sensitivity and specificity, and responsiveness [3, 4]. The objectives of the present study were to assess the reliability, validity, sensitivity and specificity, and responsiveness of a version of the CQ7 for use in Spain.

Cultural adaptation and validation study

The CQ7 was adapted into Spanish for Spain following a process of cultural adaptation based on international recommendations, which included translation into Spanish by two independent translators, back-translation into English, and cognitive debriefing in 10 patients with AR and NC [5]. The psychometric properties of the Spanish version were then tested in an observational, prospective, multicenter study carried out in the Allergology departments of 17 Spanish hospitals. The majority of patients made one study visit but in some centers they made two (baseline and follow-up at one month) to examine test-retest reliability and responsiveness.

The main study group (N = 166) were outpatients with NC and a clinical diagnosis of intermittent or persistent AR as defined in the ARIA (Allergic Rhinitis in Asthma) guidelines [6]. Patients could be treated or untreated for AR and/or NC at the time of inclusion. Control subjects (N = 35) had to be without NC on inclusion and there was no requirement for a diagnosis of AR.

Variables collected at baseline were: age, gender, educational level, time from diagnosis of allergic rhinitis, frequency and duration of nasal symptoms associated with AR, presence of other diseases, treatment for AR, overall NC severity (clinician and patient ratings), and acoustic rhinometry (in selected centres). In acoustic rhinometry testing (SER 2000, Rhinometrics, Lynge, Denmark), nasal volume (V0-7) was assessed from the nostril to 7 cm and minimum cross-sectional area (mCSA) was assessed in both nostrils. Clinicians also completed the Total Symptom Score (TSS) for all patients. The TSS consists of 5 questions measuring AR symptoms and provides an overall score raging from 0 (no symptoms) to 15 (very severe symptoms).

Patients completed the Spanish version of the Congestion Quantifier Seven-Item Test (CQ-7), the Psychological General Well -Being Index (PGWBI) [7], and the Medical Outcomes Study Sleep Scale (MOS Sleep) [8, 9]. The CQ-7 consists of 7 items answered on a scale from 0 (never) to 4 (always) with a total score ranging from 0 (no nasal congestion) to 28 (worst nasal congestion). The overall score is a simple summation of the individual item scores. The time frame for all instruments was the previous week and all had been adapted and validated for use in Spain [10, 11].

Patients who attended the follow-up visit completed a global rating of change item. The latter was used to measure perceptions of change in NC from baseline on a scale with 13 response options ranging from 'A very great deal better' to 'A very great deal worse'.

Ethics approval for the study was provided by the Ethics Committee of the Hospital Clínic in Barcelona and all patients taking part in the study provided written informed consent to participate.

Statistical analysis

The feasibility of the Spanish version of the CQ7 was assessed by examining the proportion of missing responses and the proportion of patients who found the instrument easy to use. The proportion of patients with the worst and best possible scores was calculated to estimate floor and ceiling effects, while internal consistency (reliability) was assessed using Cronbach's alpha coefficient [12]. Test-retest reliability was assessed by computing the intraclass correlation coefficient (ICC) in patients reporting no or only minimal change on the global rating of change item [13]. Convergent validity [13] was tested by analyzing the extent to which CQ7 scores demonstrated logical relationships with other outcomes measures (PGWBI, MOS Sleep, TSS, acoustic rhinometry) and known groups' validity was tested by determining the ability of the instrument to discriminate between groups defined by different categories of severity on the NC severity rating item (according to both patient and clinician overall ratings). T tests and effect sizes were used to analyze the extent of differences between groups. Sensitivity and specificity were evaluated using receiver operating characteristic (ROC) curve analysis to determine whether the questionnaire discriminated between patients with NC and controls. Responsiveness to change was assessed by determining the extent to which the instrument captured change in health status in patients reporting improvement or worsening on the global rating of change item. Change over time was analyzed using t tests and effect sizes. For all analyses, the level of statistical significance was set at 0.05 and all analyses were performed in version 13.0 of SPSS.

Results

A total of 201 individuals participated in the validation study (166 patients with NC and 35 controls without NC). Sample characteristics are shown in Table 1. The study population was relatively young with a mean age of 34.3 years, and a slight predominance of women.

Table 1 Sample characteristics at baseline: controls, patients with nasal congestion, and overall

There were no missing responses on any of the CQ7 items in any of the study visits (see Table 2). The majority of respondents (controls and patients) found the questionnaire 'easy' (33.3%) or 'very easy' (56.2%) to complete. Ceiling and floor effects (1.2% and 0.6%, respectively) were very small in the patient sample. Internal consistency was very satisfactory in the overall sample (Cronbach's alpha of 0.92) and test-retest reliability assessed in patients reporting no or only minimal change in NC at follow-up (n = 24) was also acceptable (ICC of 0.81).

Table 2 Score distributions, internal consistency, and missing responses on the CQ7: overall sample and patient and control groups (baseline visit).

Correlations between the CQ7 and other outcome measures showed the expected patterns (Table 3). The CQ7 score correlated most highly with the TSS (r = 0.60, p < 0.0001), though moderate to high correlations were also seen with the vitality (r = 0.33, p < 0.0001) and general health (r = 0.56, p < 0.0001) dimensions of the PGWBI. Correlations with the MOS Sleep questionnaire were highest for dimensions related with breathing difficulties, i.e. the 'sleep short of breath/headache', 'sleep disturbance' and 'snoring' dimensions (correlations of r = 0.49, 0.47, and 0.35, respectively; p < 0.0001). Correlations with acoustic rhinometry values were generally low, particularly at the first visit.

Table 3 Pearson correlation coefficients at baseline between CQ7, PGWBI (overall and dimensions), MOS Sleep scale (overall and by dimension), acoustic rhinometry results, and Total Symptom Score

The CQ7 discriminated well between groups defined by NC severity (Figure 1). Between-group effect sizes using clinician-rated NC severity ranged from 0.33 to 1.83 which would represent small and large effect sizes, respectively. Similar results were observed using patient self-ratings of overall NC severity.

Figure 1
figure 1

Bar chart. ES: Effect size. Differences in mean score between all categories were statistically significant (p < 0.05; Tukey correction for multiple comparisons) except between Moderate and Severe/Very severe.

The instrument showed good sensitivity and specificity for detecting cases of nasal congestion with an area under the ROC curve over 0.90 (AUC = 0.948, IC95% [0.912 - 0.985]; p < 0.001). The optimum cut-point for discriminating between cases and non-cases on the CQ7 was 7 points, which gave a sensitivity of 94% and a specificity of 85.7%.

In 39 patients (55.7%) who reported improvement on the global rating of change item the between visit difference in mean CQ7 scores was statistically significant (p < 0.001) with an effect size of 1.1, representing a large effect size (Table 4).

Table 4 Change in CQ7 scores after 1 month based on patient global rating of change in nasal congestion

Conclusions

The results of the present study show that the Spanish version of the CQ7 has excellent psychometric properties which were similar to or, in some cases, superior to those shown by the original version. The great majority of patients found the instrument easy to complete which, coupled with the very low rate of missing responses, indicates excellent acceptability. Likewise, the instrument discriminated well between patients defined by level of clinical severity and correlated in the way expected with other outcome measures. Sensitivity and specificity were excellent and the instrument appeared to be very responsive to change.

The results observed here showed that the Spanish version of the instrument had psychometric properties which were similar to those of the original version. That version also had high reliability coefficients (Cronbach's alpha of 0.93 and an ICC of 0.85), discriminated well between patients and controls (AUC of 0.97), and correlated well with the MOS Sleep scale (correlations were slightly stronger than those observed here, ranging from 0.21 to 0.67). The authors of that instrument also found that a cut point of 7 points would optimize sensitivity and specificity [3]. The similarity of the results adds to the robustness of our findings as they are indicative of an instrument that works consistently across these two languages/cultures.

Interestingly, correlations between CQ7 scores and acoustic rhinometry at baseline were non-existent or minimal, while considerably stronger correlations were observed at the second study visit, though these were still low to moderate. Nevertheless, we did not expect a very much stronger correlation as the two indicators measure substantially different things; rhinometry is a biological parameter measuring nasal geometry whereas the CQ7 measures the subjective perception of air through the nasal cavities and the impact of NC on activities. The stronger correlation with the mCSA could suggest that the aspects measured by the CQ7 are more closely related with the sensation of nasal obstruction than with nasal volume.

Study limitations include the small number of respondents in the control group and, in particular, the fact that the control group had a higher proportion of males and was better educated. This might have led to better scores on the CQ7 as education and being male are often associated with higher scores on patient reported outcome measures. The difference in score between the two groups may have been smaller with a larger control group with more similar characteristics to the patient group, though the difference would likely remain substantial. Although the method of assessing test-retest reliability employed here is commonly used in assessing PRO instruments, the small number of patients included in this analysis and the fact that only patients reporting no or minimal change were included may have introduced a selection bias. This characteristic should be tested in larger samples in the future.

Taking into account the study limitations, we nevertheless believe that our findings indicate that the Spanish version of the CQ7 questionnaire is a practical, reliable, and valid screening tool to detect and monitor cases of nasal congestion in allergic rhinitis patients.