Background

Cluster headache (CH) is a primary headache disorder that affects approximately 0.1% of the population [1]. Approximately 85% of individuals afflicted by CH have episodic CH (eCH), defined by the occurrence of remissions lasting at least 3 months between active bouts, while 15% of patients have chronic CH (cCH), characterized by the lack of remissions lasting more than 3 months [2].

Although less frequent than other headache disorders, CH might be associated with a substantial degree of disability due to the recurrent attacks of excruciating pain. The impact of CH on patients’ life can be severe; it is estimated that almost 20% of patients with CH have lost a job due to their headache, while another 8% are out of work or on disability [3]. CH is also frequently associated with comorbidities, mostly psychiatric, such as depression, anxiety, and aggressive behavior [4, 5].

Patient-reported outcomes (PROs) are important tools to assess the impact of headache disorders on the affected individuals. Contrariwise to migraine, in which the importance of PROs has been widely recognized [6], there is still little attention to the PROs for patients with CH. The available CH-specific questionnaires [7, 8] are rather time-consuming and poorly usable in the everyday clinical practice. More in detail, the Cluster Headache Quality of Life Scale is a 28-item scale focusing on quality of life [7], while the Cluster Headache Scales are a 36-item tool assessing psychosocial factors [8]; notably, those scales do not focus on the disability related to headache itself. The Cluster Headache Impact Questionnaire (CHIQ) is a short questionnaire recently developed in the German language and published along with an English translation [9]. The CHIQ consists of eight items rated on a 6-point Likert scale. Each question receives a score from 0 (“Never”) to 5 (“Always”); the total score is calculated by summing up the scores of each question, for a total range from 0 to 40. Questions refer to the last week and assess limitations in physical and mental daily activities, self-injurious behavior, and the impression of being a burden to the patient’s social environment; additional questions assess attack frequency and acute medication intake during the last week [9]. The original version of CHIQ showed good internal consistency and test-retest reliability, and positive correlations with attack frequency and acute medication intake; positive correlations were also found between the CHIQ score and depression, anxiety, and stress scores, while a negative correlation was found with a measure of quality of life [9], demonstrating the ability of CHIQ to perform a thorough clinical assessment of patients with CH.

The aim of this study was to validate the Italian version of the CHIQ to provide a reliable and easy-to-use PRO for patients with CH in clinical practice and research.

Methods

Questionnaire translation

We followed international recommendations for the translation protocol [10] (Fig. 1). The English version of the CHIQ questionnaire was first translated from English to Italian by two independent researchers (AO and RO) and then back translated by a native English speaker (MM) to resolve potential discrepancies between the Italian and English version. Conflicts were resolved by discussion and agreement among all the involved researchers. The final questionnaire is provided as supplementary material.

Fig. 1
figure 1

Translation process

Study design and selection of the validation cohort

To test the applicability of the questionnaire, patients ≥ 18 years old who met ICHD-3 criteria for eCH or cCH [2] and included in the “Italian Headache Registry” (RICe) [11] were recruited between April and December 2022. RICe was approved by the Ethics Committee of the University of Florence (code 14591_oss) and mutually recognized by the other local Institutional Review Boards. The study was conducted according to the guidelines of the Declaration of Helsinki and all patients signed an informed consent before participating in the study.

We included patients seeking care at the 14 headache centers for the first time (1st visit) or for follow-up visit. After obtaining informed consent, patients were asked to complete an online survey conducted using the Research Electronic Data Capture (REDCap) [12] platform combined within the RICe electronic data capture platform. Participants were assigned a personal code in order to match a first and a follow-up survey. The first survey comprised the CHIQ, and the questionnaires to assess psychological dimensions (Depression Anxiety Stress Scale Short Version, DASS-21) [13] and quality of life (Short Form Health Survey, SF-36) [14]. Seven days after the first survey, patients were asked to complete the follow-up survey which included the same questionnaires. Patients with missing data in the CHIQ, DASS-21, or SF-36 questionnaires were excluded.

The validation cohort included patients with active eCH or cCH, while patients with eCH in remission were excluded. Nevertheless, all patients completed the baseline questionnaire; the CHIQ scores of patients with active eCH, cCH, and CH in remission were compared to assess whether the score was able to identify differences across the three groups. Only patients with a stable attack frequency during the week before the baseline and the follow-up assessments (± 2 attacks) were included in the test-retest cohort, similarly to the original CHIQ validation [9].

Statistical analysis

Demographics and CH characteristics are presented as descriptive statistics (mean ± SD, median [interquartile range, IQR] or numbers, and percentages of patients) if not otherwise specified. Between-group comparisons were performed by non-parametric tests (Wilcoxon or Kruskal-Wallis test, as appropriate) given the assumption of non-normal distribution of variables. For internal consistency, Cronbach’s alpha was calculated. A Cronbach alpha > 0.80 was accepted as good [15]. Test-retest reliability was assessed using intraclass correlation coefficients (ICC). Statistical analysis was performed using R, version 4.2.2 (The R Project for Statistical Computing, Vienna, Austria). Significance was accepted at p < 0.05.

Results

During the study period, we evaluated 181 patients with CH of whom 96 (53.0%) had active eCH, 14 (7.7%) cCH, and 71 (39.3%) eCH in remission. The 110 patients with either active eCH or cCH were included in the validation cohort. After 7 days from the baseline questionnaire, 24 (21.8%) of those 110 patients had an attack frequency similar to baseline and were therefore included in the test-retest cohort (Fig. 2).

Fig. 2
figure 2

Flowchart of patient selection

The characteristics of patients with active eCH, cCH, and eCH in remission are reported in Table 1. The 110 patients of the validation cohort had a median number of 8 headache attacks (IQR 3–15) and used a median of 7 analgesics (IQR 2–12) during the week preceding questionnaire assessments.

Table 1 Characteristics of patients included in the study. The validation cohort only included patients with chronic and active episodic cluster headache

In the validation cohort (n = 110), the mean (±SD) global CHIQ score was 24.8 ± 8.3 (range 1–39, possible range 0–40). Visual inspection of the CHIQ score histogram revealed a slight positive skewness (Fig. 3). The Shapiro-Wilk test revealed a slight deviation from normality (W = 0.976; p = 0.044). Internal consistency of the CHIQ was good with Cronbach’s alpha value of 0.891. Cronbach’s alpha did not change after deletion of any of the items (Table 2).

Fig. 3
figure 3

Histogram of Cluster Headache Impact Questionnaire score distribution

Table 2 Results of questionnaire items and Cronbach alpha scores after elimination of each item. Scores are those of patients with active cluster headache (n = 110)

CHIQ score showed a significant positive correlation with anxiety, depression, and stress scores of the DASS-21 scale, while showing a significant negative correlation with quality-of-life domains assessed by the SF36 scale (Table 3). CHIQ score showed a marginally significant correlation with the number of CH attacks during the previous week, but no significant correlation with the frequency of intake of acute medications (Table 3).

Table 3 Correlation between the Italian version of the Cluster Headache Impact Questionnaire and measures of disability, depression, anxiety, stress, and quality of life

Median CHIQ score was 25 (IQR 19–31) in patients with active eCH, 26 (IQR 19–35) in those with cCH, and 21 (IQR 10–27) in those with eCH in remission. The difference across groups was significant (p < 0.001, Kruskal-Wallis test). When comparing groups in couples, the score was similar when comparing patients with active eCH and cCH (p = 0.777; Fig. 4), while it was significantly lower in eCH in remission when compared to either active eCH or cCH.

Fig. 4
figure 4

Comparison of Cluster Headache Impact Questionnaire scores across groups

In the test-retest cohort, median weekly headache attacks remained unchanged from 5 (IQR 4–13) to 5 (IQR 2–9) (p = 0.284) as were the median CHIQ scores: 25 at baseline (IQR 22–33), 23 at the retest session (IQR 11–30) (p = 0.095). The intraclass correlation coefficients were overall poor and ranged from 0.326 and 0.690 (Table 4). Not all coefficients were significant, but the overall score showed a statistically significant test-retest correlation (Table 4).

Table 4 Intraclass correlation coefficients for test-retest reliability for patients with active cluster headache 7 days after the first questionnaire (n = 24)

Discussion

We validated the Italian version of the CHIQ in a nationwide collaboration among 14 headache centers. Our results show good internal consistency, comparable to the original version. Therefore, this version of the questionnaire could be used in clinical and in research settings to evaluate the impact of CH on patients. The availability of a specific PRO for CH will provide clinicians and researchers with an easy-to-use tool that assesses the impact of CH by including additional components beyond the simple quantification of attack frequency and medication use.

Some differences between our validation cohort and the original German cohort are worth mentioning. The first main difference is that we validated our questionnaire in a cohort of patients with CH assessed in headache centers only, while the original CHIQ score was validated in a cohort comprising patients’ associations. Therefore, our cohort might have a higher diagnostic accuracy, as all patients received a diagnosis validated by headache specialists. On the other hand, we reached a lower patient number—mostly referring to patients with cCH—and our results might be less generalizable to the global population of patients with CH. In our cohort, the median number of weekly CH attacks was slightly lower than that of the original validation: 8 headache attacks (IQR 3–15) in our cohort vs 15.2 ± 13.8 attacks in the German cohort. The consumption of analgesics was also lower in our cohort (n. 7, IQR 2–12) vs 13.5 ± 14.2 in the German cohort.

The internal consistency of the questionnaire in our cohort was excellent, which makes it suitable for a wider application in Italy. On the contrary, in the test-retest validation, the overall rate of agreement was poor and not significant for some items (Table 3). In our opinion, this result might be due to the low number of patients who still reported headache at the 7-day follow-up. We asked patients to repeat the questionnaire after a shorter time compared with the original validation (7 instead of 14 days). This short timeframe was chosen as we expected that patients referring to headache centers would receive treatments that would quickly change their headache frequency and associated disability. We cannot exclude that effective treatments might have changed headache-related disability even during this short timeframe, thus affecting the reliability of test-retest. This interpretation is supported by the fact that most patients did not report headache at the moment of the follow-up survey.

An additional difference between our cohort and the original CHIQ validation cohort was that we found no difference in median CHIQ scores between patients with active eCH and those with cCH (Fig. 3). This finding is likely influenced by the low number of patients with cCH. The validity of the Italian version of CHIQ as a tool fit for detecting the impact of CH is however strengthened by the significant difference observed between eCH in remission and both groups of active eCH and cCH.

Similar to the original validation, we found a high correlation between CHIQ scores and measures of depression, anxiety, stress, and quality of life. Rather surprisingly, we found only a marginal correlation between the CHIQ scores and CH frequency and no correlation between the scores and acute medication use. This finding might have important implications for the management of patients with CH as it signals that the information coming from the CHIQ questionnaire could provide more complete information compared with those of simple questions about CH frequency or acute medication use.

We performed a validation study on a nationwide cohort of patients referring to headache centers where headache diagnoses are established by trained professionals, therefore increasing the reliability of data. Patient compliance was complete as all patients who were offered the questionnaire on their clinical visit agreed to participate in the study and completed the questionnaire. On the other hand, our study suffers from some potential limitations. We cannot exclude that patients accessing headache centers were more motivated than those attending other care settings. Therefore, a wider application of the CHIQ questionnaire could be associated with some compliance issues compared with the present cohort. Selection bias cannot be excluded as patients were coming from specialized centers. Additionally, the number of patients with cCH and the size of the test-retest cohort were small. The 7-day timeframe for test-retest was also short, which could expose to a risk of recall bias. The validation cohort included 110 patients with a ratio between patients and questionnaire items of 14:1, which is higher than a widely recommended ratio of 10:1 [16], but lower than the optimal 20:1 ratio [17]. For this reason, the reliability of subgroup analyses for eCH and cCH cannot be granted.

Conclusions

We showed the applicability of the Italian version of the CHIQ in patients with chronic and episodic CH. The questionnaire represents a useful and easy-to-use tool to explore several aspects of disability and quality of life of patients with CH within a limited amount of time. Further large-scale studies are needed to detect relevant cutoffs of the scale associated with clinically relevant differences, to distinguish the disability of patients with eCH and cCH, and to assess the effect of CH treatments on the CHIQ scores as well as on CH frequency and acute medication use.