Abstract
Although health-related quality of life (HRQoL) has developed into a crucial outcome parameter in clinical research, evidence of the EQ-5D-3L validation performance is lacking in patients with spinocerebellar ataxia (SCA) types 1, 2, 3, and 6. The objective of this study is to assess the acceptability, validity, reliability, and responsiveness of the EQ-5D-3L. For n = 842 predominantly European SCA patients of two longitudinal cohort studies, the EQ-5D-3L, PHQ-9 (Patient Health Questionnaire), and ataxia-specific clinical assessments (SARA: Scale for Assessment and Rating of Ataxia; ADL: activities of daily living as part of Friedreich’s Ataxia Rating Scale; INAS: Inventory of Non-Ataxia Signs) were assessed at baseline and multiple annual follow-ups. The EQ-5D-3L was evaluated regarding acceptability, distribution properties, convergent and known-groups validity, test-retest reliability, and effect size measures to analyze health changes. The non-item response was low (EQ-5D-3L index: 0.8%; EQ-VAS: 3.4%). Ceiling effects occurred in 9.9% (EQ-5D-3L) and 3.0% (EQ-VAS) with a mean EQ-5D-3L index of 0.65 ± 0.21. In total, convergent validity showed moderate to strong Spearman’s rho (rs > 0.3) coefficients comparing EQ-5D-3L and EQ-VAS with PHQ-9, SARA, ADL, and INAS. EQ-5D-3L could discriminate between groups of age, SARA, ADL, and INAS. Intra-class correlation coefficients (EQ-5D-3LICC: 0.95/EQ-VASICC: 0.88) and Kappa statistics (range 0.44 to 0.93 for EQ-5D-3L items) indicated tolerable reliability. EQ-5D-3L shows small (effect size < 0.3) to moderate (effect size 0.3–0.59) health changes regarding ataxia severity. The analysis confirms an acceptable, reliable, valid, and responsive recommended EQ-5D-3L in SCA patients, measuring the HRQoL adequately, besides well-established clinical instruments.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Spinocerebellar ataxias (SCAs) comprise a heterogeneous group of autosomal dominantly inherited diseases caused by degeneration of the cerebellum and its connections [1]. SCAs have an estimated global prevalence of 3–4 per 100,000. SCA 1, SCA 2, SCA 3, and SCA 6 account for more than half of the affected families [2, 3].
SCAs commonly manifest in the third and fourth decades of life [4] and are characterized by prominent ataxia often accompanied by additional neurological symptoms. Cognitive impairment and depression may also occur [4]. Patients with SCA have a shorter life expectancy and suffer substantial limitations in their daily activities. The disease-specific functional limitations and the emotional burdens harm the patient’s health-related quality of life (HRQoL) [5].
Measuring HRQoL has become a crucial outcome in patients with chronic conditions to assess their subjective health perspective in clinical trials or to optimize treatment decision-making [6, 7]. Epstein et al. [8] referred in their work about self-rated health in Friedreich ataxia (FA) that HRQoL measures were potentially useful as clinical markers of the ataxia disease status. Only a few previous studies evaluated the impact of SCA on a patient’s HRQoL, revealing a significant decline over time and associations with ataxia severity, functional impairment, pain, depressive symptoms, and fatigue [9,10,11].
However, evidence about the validity of appropriate HRQoL measures in SCAs is missing. The National Institute for Health and Care Excellence (NICE) recommends using the widely used and well-established generic preference-based EQ-5D in rare diseases [12]. Only one study focused on the psychometric properties of the EQ-5D-3L in patients with Friedreich ataxia [13]. Based on a sample of n = 56 patients, the authors rated the EQ-5D-3L as a measure with poor discriminative ability. However, the results of this cross-sectional study with a small sample size remain inconclusive and should be interpreted with caution. Further validation studies are needed to generate more robust results concerning the psychometric performance of HRQoL instruments in SCA.
This analysis aimed to assess the psychometric properties of the EQ-5D-3L in terms of acceptability, validity, reliability, and responsiveness. To this end, EQ-5D-3L from large samples of patients with SCA 1, 2, 3, and 6 were analyzed and followed up annually for several years in different observational multi-center studies. This large data set of a rare disorder allows us to validate the EQ-5D-3L longitudinally on an appropriate sample size for the first time.
Patients and Methods
Study Design, Recruitment, and Sample
We analyzed data from two prospective, longitudinal observational ataxia cohort studies carried out at several European and US study centers: [1] European Spinocerebellar Ataxia Type 3/Machado-Joseph Disease Initiative (ESMI) and [2] European Spinocerebellar Ataxia Registry (EUROSCA). ESMI is an ongoing, longitudinal cohort study of SCA 3 mutation carriers and controls. The cohort started in 2016 and is still ongoing in its 5th year of recruitment. Both recruitment and follow-ups take place in 11 European (majority of study participants) and three US study sites [14]. EUROSCA is a European longitudinal study with people with manifest SCA 1, 2, 3, and 6 carried out in 17 European sites [15, 16]. The study was initiated in 2005, and participants were consecutively recruited and followed up until 2016 [15]. ESMI and EUROSCA were approved by the ethics committees of the participating centers and have been performed in accordance with the ethical standards laid down in the 1964 Declaration of Helsinki and its later amendments. Written informed consent was obtained from all study participants. EUROSCA was registered with a ClinicalTrials.gov number (NCT02440763).
In both studies, clinical visits were scheduled annually within the first 3 years after baseline (T0 = baseline, T1 = follow-up 1, T2 = follow-up 2, T3 = follow-up 3). We checked for time between observations (T0−T1, T1−T2, and T2−T3), providing mean, standard deviation, and interquartile range in supplementary Table 1. Afterwards, participants entered an extension phase in which study visits were scheduled in combination with routine visits resulting in irregular intervals. Therefore, only data from baseline to T3 were included in this analysis. The flow chart in Supplementary Figure 1 illustrates patient selection from both studies.
SCA 1, 2, and 3 present a similar clinical picture with an age onset between 30 and 40 years and frequently more non-ataxia symptoms [16, 17]. Patients suffering from SCA 6 are older (> 60 years old), show a slower disease progression, and have less additional non-ataxia signs, revealing SCA 6 as a purely cerebellar disorder [16, 17]. Based on these aspects, patients of both studies were separated into two groups: patients with SCA 1, 2, or 3 vs. patients with SCA 6.
Measures
In both studies, patients completed a set of socio-demographics (age, sex), age of onset of gait difficulties, clinical, and health-related questionnaires at baseline and every follow-up. The EQ-5D-3L and the Patient Health Questionnaire 9 (PHQ-9) were administered as health-related questionnaires. Rater-based clinical ataxia-specific measures were the Scale for Assessment and Rating of Ataxia (SARA) and the activities of daily living (ADL) as part of Friedreich’s Ataxia Rating Scale and the Inventory of Non-Ataxia Signs (INAS). All measures were administered in one session by clinicians as face-to-face interviews in the respective study center.
EuroQol 5-Dimensional 3-Level—EQ-5D-3L
The EQ-5D consists of a thermometer-like visual analog scale (EQ-VAS) anchored by 0 (worst health) and 100 (best health) to assess health status at the time of assessment and a 5-item descriptive system with five dimensions: mobility, self-care, usual activities, pain/discomfort, and anxiety/depression. Each item has three levels of responses reflecting no, moderate, and extreme problems [17, 18]. The response pattern in these 5 items can be converted into a single index score, anchored between zero (for death) and 1 (perfect health). For our analyses, EQ-5D-3L indices were calculated using the European value set for use in multinational studies [19].
Patient Health Questionnaire 9—PHQ-9
The PHQ-9 is a widely validated self-reported questionnaire designed for use in primary care as a screening instrument for depressive symptoms [20, 21]. The score of the 9-item-questionnaire ranges between 0 and 27 with the following score interpretation: 0 to 4 none to minimal, 5–9 mild, 10–14 moderate, and ≥ 15 severe depression [21].
Scale of the Assessment and Rating of Ataxia—SARA
SARA is a valid and reliable clinical rating of the severity of ataxia symptoms. SARA consists of eight items related to gait, stance, sitting, speech, finger-chase test, nose-finger test, fast alternating movements, and heel-shin test [22]. The sum score ranges between 0 (no ataxia) and 40 (most severe ataxia) [22]. We grouped the sum score into the following stages: SARA scores mild 0–9.5, moderate ≤ 19.5, strong ≤ 29.5, and extreme ≤ 40 [23].
Inventory of Non-Ataxia Signs—INAS
The INAS asks for a total of 30 items that are categorized into 16 non-ataxia signs. INAS count denotes the number of non-ataxia signs, ranging from zero (absence of non-ataxia signs) to a maximum of 16. A higher INAS count can be interpreted as a more complex extracerebellar disorder [24].
Activities of Daily Living (ADL) Part of the Friedreich Ataxia Rating Scale (FARS)
The FARS is a validated rater-based scale [25], comprising a measure of ataxia and ADL subscale and a neurological subscale. The scores of each measure can be added to make a total score ranging from 0 to 36, with higher scores representing more severe impairment. ADL was only assessed in the ESMI study.
Statistical Analyses
Patients’ socio-demographics (gender, age), age of onset of gait, and clinical characteristics (SARA, INAS, and ADL) were analyzed descriptively. Chi-square tests or t-tests have been applied to examine differences between the two genotype groups (SCA 1, 2, and 3 vs. SCA 6).
For the psychometric analysis, the EQ-5D-3L was analyzed in terms of acceptability, distributional properties, convergent validity, known-groups validity, test-retest reliability, and responsiveness to change. No imputation was used to deal with missing data, resulting in complete case analyses.
Completeness of Data
The frequency of missing values in both genotype groups for the EQ-5D-3L index and EQ-VAS was used as an indicator for acceptance.
Distributional Properties
Mean, standard deviation, minimum and maximum, and floor and ceiling effects were assessed for EQ-5D-3L index and EQ-VAS regarding the two genotype groups. Frequencies of responses per EQ-5D-3L item are reported.
Convergent Validity
The association between the EQ-5D-3L dimensions, EQ-5D-3L index, and EQ-VAS with SARA, INAS, ADL, and PHQ-9 were assessed using Spearman’s correlation coefficient. Correlations were interpreted as follows: rsp < 0.3 small, 0.3 ≥ rsp < 0.5 moderate, and rsp ≥ 0.5 high/strong [26, 27]. The sign of the coefficient can be positive as well as negative, indicating the direction of the relationship [28]. We expected higher correlations between EQ-5D dimensions mobility, self-care and usual activity and the SARA score, INAS count and ADL, and the EQ-5D dimensions pain/discomfort and anxiety/depression with the PHQ-9. Lower correlations were expected between mobility, self-care, and usual activity with PHQ-9. For the convergent validity, we recognized all observations from baseline to T3. Furthermore, we created scatter plots with rugs for inspection of these associations for baseline and T3. Analyses were conducted separately for each genotype group.
Known-Groups Validity
Known-groups validity is the ability to distinguish between different health states and disease groups. We compared the EQ-5D-3L index between two groups of ages (≤ 51 years old, > 51 years old), ataxia severity (SARA score: mild 0–9.5, moderate ≤ 19.5, strong ≤ 29.5, extreme ≤ 40) [23], non-ataxia symptoms (INAS: ≤ 4 symptoms, > 4 symptoms), depression (PHQ-9: minimal ≤ 4, moderate ≤ 9) [21], and the EQ-VAS (low ≤ 60; high > 60). We hypothesized that the EQ-5D-3L index would be lower in older patients, patients with a higher SARA score, a higher INAS count, a higher PHQ-9, and a lower EQ-VAS value in both genotype groups.
Test-Retest Reliability
Test-retest reliability is expressed by the ability of a test’s consistency in repeated testing. We tested for consistency of scores over time comparing the EQ-5D-3L’s responses at baseline and T1 among “stable” patients according to their symptom severity. A patient was considered stable if there were no changes in the SARA score and INAS count within 1 year. For the single-item dimensions of EQ-5D-3L, we calculated Cohens Kappa; for EQ-5D-3L index and EQ-VAS, intraclass correlation coefficients (ICC) were reported. Cohens Kappa values > 0.40 [29] and ICCs values > 0.75 [30] indicate acceptable reliability.
Responsiveness
Responsiveness refers to the EQ-5D-3L ability to capture changes in health over time. We analyzed the changes in the data pairs from baseline (T0) to T1, T0 to T2, and T0 to T3. Changes in the EQ-5D-3L index/EQ-VAS were then analyzed separately in subgroups of patients based on the SARA score: group 1 “no changes” in the SARA score (SARA score T0 = T1; T0 = T2, and T0 = T3); group 2: “deteriorated” in the SARA score (SARA score T0 < T1; T0 < T2, and T0 < T3). Differences in the EQ-5D-3L index and EQ-VAS between time points were divided by baseline standard deviation to produce standardized effect size (SES) or by the pooled standard deviation of change to calculate standardized response means (SRM), calculated with paired sample t-tests. We interpreted the effect sizes as < 0.3 small, 0.3 to 0.59 moderate, and ≥ 0.6 large [26] [31]. We expected to observe significant health changes in group 2 and no or minimal changes in group 1. Given the decreasing number of patients over time, we analyzed all patients regardless of genotype groups. SES and SRM are commonly used effect sizes in many validation studies of the EQ-5D [32], supporting the comparison of findings. For the SARA score, SES and SRM were also calculated.
Data were analyzed using IBM SPSS Statistics (Version 21). The R package “ggplot2” was used to create graphs (Version 4.2.1).
Results
Sample
Table 1 presents the characteristics of the sample across both genotype groups. Patients were, on average, 49.9 ± 13.7 years old, and 49% were women.
As known for SCA 6, the age of onset in that group started later in life with an average of 54.3 years compared to SCA 1–3 with 37.5 years. Most patients had a moderate ataxia severity (SARA mean score 14.0) and a mean EQ-5D index of 0.65 in both groups. SCA 6 patients reported lower INAS and a slightly lower PHQ-9.
Completeness of Data and Distributional Properties
Missing values occurred occasionally for the EQ-5D index (0.8%; n=7) and EQ-VAS (3.4%; n=29). Supplementary Figure 2 presents responses to the EQ-5D-3L items. The proportion of patients reporting “no problems” in any dimension was 10.9% for SCA 1–3 and 3.7% for SCA 6 (Supplementary Table 2). More than 50% of patients reported no problems with SC and AD at baseline. Only a small proportion of patients reported extreme health-related problems. Table 1 includes the distributional properties of the EQ-5D-3L index and EQ-VAS. The EQ-5D-3L-derived indices ranged from 0.03 to 1.0, with a ceiling effect of 9.9% of overall patients. Ceiling effects of the EQ-VAS occurred in less than 4% of the total sample.
Convergent Validity
We observed low (r < 0.3) to strong (r ≥ 0.5) Spearman correlations between EQ-5D-3L dimensions and SARA, ADL, INAS, and PHQ-9. For patients with SCA 1–3, SARA, and ADL correlated strongly and statistically significant (p < 0.01) with the physical dimensions (self-care: rSARA = 0.596/rADL = 0.689; usual activity: rSARA = 0.534/rADL = 0.682) of EQ-5D-3L and low with mental items (pain/discomfort: rSARA= 0.120/rADL= 0.280; anxiety/depression: rSARA= 0.193/rADL= 0.231; all correlations with p < 0.01). Contrary to this, PHQ-9 showed moderate to strong statistically significant (p < 0.01) correlations with pain/discomfort (rPHQ-9 = 0.355) and anxiety/depression (rPHQ-9 = 0.530). Findings were similar in SCA 6 patients, even though correlations were constantly lower for this group. With respect to INAS, only in SCA 1–3 moderate correlations with p < 0.01 were seen to mobility (rINAS = 0.410), self-care (rINAS = 0.419), and usual activity (rINAS = 0.431) while only poor correlations were seen for pain/discomfort (rINAS = 0.156) and anxiety/depression (rINAS = 0.146) and all dimensions in SCA 6 (rINAS < 0.3 with p < 0.01 in usual activity, pain/discomfort, and anxiety/depression).
The EQ-5D-3L index showed moderate (rINAS = −0.381 and rPHQ-9 = −0.474) to strong (rSARA = −0.562 and rADL = −0.713) correlations (Table 2). EQ-VAS showed a strong correlation with ADL (rADL = −0.507) and a moderate correlation with SARA (rSARA = −0.396), INAS (rINAS = −0.329) and PHQ-9 (rPHQ-9 = −0.453) in the entire sample (Table 2). When analyzed by genotype groups, EQ-5D-3L was stronger correlated in patients with SCA 1–3. Correlations in this group were highest for ADL (rADL = −0.713), followed by SARA (rSARA = −0.575) and PHQ-9 (rPHQ-9 = −0.493). EQ-VAS correlated moderately with all instruments. In the group of SCA 6 patients, EQ-5D-3L presented lower correlation coefficients consistently with poor correlations for EQ-5D-3L index and INAS (rINAS = −0.230) and EQ-VAS and SARA (rSARA = −0.230) while EQ-VAS correlated strongly with PHQ-9 (rPHQ-9 = −0.517). All correlations were statistically significant. Figure 1 illustrates the correlation between the EQ-5D-3L index and SARA, INAS, PHQ-9, and ADL.
Known-Groups Validity
In SCA 1–3, results fit the assumptions made for differences between known groups according to age, SARA, INAS, PHQ-9, and general health (EQ-VAS). EQ-5D-3L index was lower in older patients, patients with more severe ataxia, with higher INAS, higher likelihood of depression, and lower EQ-VAS (Table 3). In SCA 6, such differences were less pronounced and did not reach significance for age and INAS (Table 3).
Test-Retest Reliability
Among 27 patients classified as stable at 12 months after baseline according to SARA, Kappa statistics of all EQ-5D-3L items was > 0.4, demonstrating acceptable test-retest reliability. ICC calculation of the EQ-5D-3L (0.95) index and the EQ-VAS (0.88) indicated sufficient reliability.
Responsiveness
Clinically stable patients (no change in SARA score) showed no responsiveness for the EQ-5D-3L index and EQ-VAS. However, in the patient group with an increase in the SARA score over time (SARA score deteriorated), both the EQ-5D-3L index and the EQ-VAS differed between baseline and follow-ups with almost none to small effect sizes. SES varied from 0.08 (T0 to T1) to 0.29 (T0 to T3) for the EQ-5D-3L index and 0.17 (T0 to T1) to 0.34 (T0 to T2) for the EQ-VAS. SRM ranged from 0.09 (T0 to T1) to 0.27 (T0 to T2/T3) for the EQ-5D-3L index and 0.15 (T0 to T1) to 0.33 (T0 to T2) for the EQ-VAS, respectively (Table 4). Regarding the solely observation of the SARA score, we found small (T0 to T1: 0.2SES and 0.2SRM) to moderate (T0 to T2: 0.40SES and 0.39SRM; T0 to T3: 0.59SES and 0.56SRM) changes in health for both effect sizes.
Discussion
To the best of our knowledge, this is the first longitudinal study reporting specific evidence on the psychometric properties of the EQ-5D-3L in patients with SCA. Results of this study fill the gap of inconclusive evidence by showing that the EQ-5D-3L demonstrates acceptability, reliability, good discriminative ability, moderate to strong convergent validity, and small (EQ-5D-3L indices) to moderate (EQ-VAS with regard to ataxia severity) responsiveness to health changes in measuring HRQoL among patients with SCA 1, 2, 3, and 6. Our analysis supports the NICE recommendations [12] to gain information about the EQ-5D-3L performance in rare diseases. It provides evidence that the EQ-5D-3L could be an appropriate measure to capture the patients’ HRQoL of SCA patients for use in clinical trials, in health economic evaluations, and in further research understanding HRQoL in people with that rare disease.
Distribution Properties
The extent of non-item response with less than 5% (EQ-5D index = 0.8% and EQ-VAS = 3.4%) is very low and acceptable. Our results are consistent with the findings of other studies regarding item-based ceiling effects. Bolzan et al. [5] analyzed HRQoL in the pre-ataxic phases in a Brazilian sample of SCA 3 patients using the EQ-5D-3L. The results of this study demonstrated the highest proportion of “no problems” in self-care (73.9%) and the lowest in mobility (13%). Responses of our patients are in line with Bolzan et al. [5], occurring with the highest ceiling effects in self-care (only SCA 1–3) and the lowest in mobility (for both groups). Maas et al. [10] analyzed the discordance between the EQ-5D-5L and physician-rated motor symptom severity in early-to-middle-stage SCA 3, revealing a comparable high “no problem” area in SC (70%). Ceiling effects of the EQ-5D-3L index are well known and highly differed in percentage in terms of the analyzed setting [32,33,34]. We detected acceptable ceiling effects of the EQ-5D-3L with 10.9% in SCA 1–3 and 3.7% in SCA 6 patients with an index range of 0.03 to 1.0 (SCA 1–3) and 0.21 to 1.0 (SCA 6). Bolzan et al. [5] showed that SCA patients differed in their HRQoL from healthy individuals, especially in the EQ-5D mobility dimension. Item responses of our sample show nearly identical results of a reduced HRQoL dominantly in the dimension mobility with a proportion of > 70% reporting at least some problem in both SCA groups. However, due to the evidence that the 3L tends to overestimate health problems resulting in a biased index score [35] and missing HRQoL data of SCA patients, we cannot conclusively state that our findings present the full range of health states in SCA patients.
Validity
It is well known that the EQ-5D-3L can be a useful tool to screen symptoms of anxiety and depression compared to the PHQ-9 [36], also evaluated as appropriate for use in SCA [37]. Symptoms of depression are common in SCA patients
[38]. According to our PHQ-9 screening results, no patient in our survey met the criterion for clinically relevant depression. Nevertheless, we found moderate to strong correlations between EQ-5D-3L/EQ-VAS and the PHQ-9, which underlines the possibility of using EQ-5D as a substitute for capturing the patients’ mood.
Several validation studies in different settings and patient populations found at least moderate associations between the EQ-5D and disease-specific instruments and clinical assessments [39,40,41]. Our study also confirmed moderate to strong correlations between EQ-5D-3L-dimensions, EQ-5D-3L index, and EQ-VAS with the disease-specific ratings of SARA, INAS, and ADL. This supports our hypothesis of convergent validity testing in an adequate correlation structure. Including all observations (T0 to T3) in the validity analysis may lead to within-subject effects as individuals with more data points are included. A separate analysis of all observations showed almost equally high correlation coefficients, indicating robust results when all time points (T0 to T3) are considered (Supplementary Table 3). The EQ-5D-3L can also successfully differentiate among different stages of age and health components (SARA, INAS, PHQ-9, EQ-VAS). In addition to clinically relevant parameters in SCA patients, the EQ-5D-3L represented strong convergent validity and was useful in assessing a comprehensive picture of the patient’s health status.
Test-Retest Reliability
A substantial body of literature analyzed the EQ-5D-3L’s reliability. We studied the test-retest reliability of the EQ-5D-3L by calculating Cohen’s Kappa and ICCs for stable patients from baseline to T1. Both coefficients passed the threshold for tolerable reliability (Kappa range: 0.44–0.93; ICC: > 0.8). Our results replicate international findings, including those for rare diseases [39, 42], suggesting that the EQ-5D-3L shows acceptable reproducibility in SCA (Supplementary Table 4). Nevertheless, the results must be interpreted with caution and be seen more as a tendency regarding the small sample size of stable patients (N < 30).
Responsiveness
The EQ-5D-3L’s ability to detect changes in health varies greatly by setting and patient group. Shah et al. [39] reported poor responsiveness of EQ-5D-3L in patients with rare lung diseases; whereas, Golicki et al. [43] documented a moderate to good ability to detect changes in health in stroke patients. Compared to other HRQoL measures, the EQ-5D, especially the three-level version, is less responsive [32]. As EQ-5D indices and EQ-VAS values in patients deteriorated similarly to ataxia severity (SARA score), we can assume responsiveness to changes in ataxia severity in SCA 1–3 and SCA 6. In that case, we have to consider that SCA is a slowly progressive neurodegenerative disease, which means that changes in functional and mental health over the observed time are expectedly small. Therefore, we did not expect to see strong effect sizes for EQ-5D-3L.
Strengths and Limitations
The strength of our study is primarily in the study design and the included large population group. While analyses of rare diseases are often associated with a small number of cases in a cross-sectional design, we analyzed a longitudinal data set with a large sample of n = 842 SCA patients with multiple observations. Nevertheless, we know that the validation of the EQ-5D-3L in SCA patients is confronted with limitations. First of all, the data set did not include additional well-established HRQoL measurements. We therefore could not compare the EQ-5D-3L to other HRQoL instruments with comparable constructs, which limits the generalizability of the results. Another limitation is the design of the measurement time points, defined as a 1-year follow-up. The 1-year distance between the time points can be interpreted as long, not capturing variations in the patient’s HRQoL. The slow progression of SCA diseases and the low clinical and patient-reported changes in health status support the choice of 1-year follow-ups. Our test-retest reliability analysis must be interpreted with caution; it relies on only a small subgroup of subjects classified as clinically stable. To obtain robust results, it would be favorable to administer the EQ-5D-3L, for example, on two consecutive days. However, our data deliver a good estimate of score stability when applied in larger intervals, which is the common clinical setting. There is a probability that combining SCA 1, 2, and 3 into one group will influence the interpretation of the results. Despite this, studies revealed that the clinical picture of these three SCA types is very similar [16, 17], supporting the grouping of these three SCA disorders. Finally, the EQ-5D-3L is increasingly replaced by the 5L version. Several studies reported an improved sensitivity with the 5L version, reducing ceiling and floor effects and improving other psychometric properties [32, 34]. We are conscious that using the 3L version in our survey leads to the risk of a reduced interpretation of the measurement properties. Due to this, further research should evaluate if an extension from three to five levels would influence the performance of the EQ-5D in SCA patients [44].
Conclusion
The psychometric analysis demonstrates that the EQ-5D-3L is an acceptable, valid, and reliable instrument for patients with SCA 1, 2, 3, and 6 and shows overall low responsiveness. Thus, the EQ-5D-3L could be used regularly in economic evaluations in this rare disease area and as an additional instrument to the disease-specific HRQoL measures in SCA clinical research [45]. Nevertheless, further research is needed to evaluate if the 5L version would improve the psychometric properties of the EQ-5D in SCA.
Data Availability
Data is available on request.
References
Sullivan R, Yau WY, O'Connor E, Houlden H. Spinocerebellar ataxia: an update. J Neurol. 2019;266(2):533–44.
Diallo A, Jacobi H. Tezenas du Montcel S, Klockgether T. Natural history of most common spinocerebellar ataxia: a systematic review and meta-analysis. J Neurol. 2021;268(8):2749–56.
Ruano L, Melo C, Silva MC, Coutinho P. The global epidemiology of hereditary ataxia and spastic paraplegia: a systematic review of prevalence studies. Neuroepidemiology. 2014;42(3):174–83.
Jacobi H, du Montcel ST, Bauer P, Giunti P, Cook A, Labrum R, et al. Long-term disease progression in spinocerebellar ataxia types 1, 2, 3, and 6: a longitudinal cohort study. Lancet Neurol. 2015;14(11):1101–8.
Bolzan G, Leotti VB, de Oliveira CM, Ecco G, Cappelli AH, Rocha AG, et al. Quality of life since pre-ataxic phases of spinocerebellar ataxia type 3/Machado-Joseph disease. Cerebellum. 2022;21(2):297–305.
Calvert M, Kyte D, Duffy H, Gheorghe A, Mercieca-Bebber R, Ives J, et al. Patient-reported outcome (PRO) assessment in clinical trials: a systematic review of guidance for trial protocol writers. PLoS One. 2014;9(10):e110216.
Scoggins JF, Patrick DL. The use of patient-reported outcomes instruments in registered clinical trials: evidence from ClinicalTrials.gov. Contemp Clin Trials. 2009;30(4):289–92.
Epstein E, Farmer JM, Tsou A, Perlman S, Subramony SH, Gomez CM, et al. Health related quality of life measures in Friedreich ataxia. J Neurol Sci. 2008;272(1-2):123–8.
Jacobi H, du Montcel ST, Bauer P, Giunti P, Cook A, Labrum R, et al. Long-term evolution of patient-reported outcome measures in spinocerebellar ataxias. J Neurol. 2018;265(9):2040–51.
Maas R, Schutter D, van de Warrenburg BPC. Discordance between patient-reported outcomes and physician-rated motor symptom severity in early-to-middle-stage spinocerebellar ataxia type 3. Cerebellum. 2021;20(6):887–95.
Schmitz-Hübsch T, Coudert M, Giunti P, Globas C, Baliko L, Fancellu R, et al. Self-rated health status in spinocerebellar ataxia--results from a European multicenter study. Mov Disord. 2010;25(5):587–95.
National Institute for Health and Care Excellence. CHTE methods review - health-related quality of life. Task and finish group report. 2020. URL: https://rees-france.com/wp-content/uploads/2020/12/2020-CHTE-2020-Healthrelated-quality-of-life-.pdf. [last visited 01.09.2023]
Riazi A, Cano SJ, Cooper JM, Bradley JL, Schapira AH, Hobart JC. Coordinating outcomes measurement in ataxia research: do some widely used generic rating scales tick the boxes? Mov Disord. 2006;21(9):1396–403.
Hengel H, Martus P, Faber J, Garcia-Moreno H, Solanky N, Giunti P, et al. Characterization of lifestyle in spinocerebellar ataxia type 3 and association with disease severity. Mov Disord. 2022;37(2):405–10.
Diallo A, Jacobi H, Cook A, Labrum R, Durr A, Brice A, et al. Survival in patients with spinocerebellar ataxia types 1, 2, 3, and 6 (EUROSCA): a longitudinal cohort study. Lancet Neurol. 2018;17(4):327–34.
Schmitz-Hübsch T, Coudert M, Bauer P, Giunti P, Globas C, Baliko L, et al. Spinocerebellar ataxia types 1, 2, 3, and 6: disease severity and nonataxia symptoms. Neurology. 2008;71(13):982–9.
Jacobi H, Bauer P, Giunti P, Labrum R, Sweeney MG, Charles P, et al. The natural history of spinocerebellar ataxia type 1, 2, 3, and 6: a 2-year follow-up study. Neurology. 2011;77(11):1035–41.
EuroQol. EuroQol--a new facility for the measurement of health-related quality of life. Health Policy. 1990;16(3):199–208.
Greiner W, Weijnen T, Nieuwenhuizen M, Oppe S, Badia X, Busschbach J, et al. A single European currency for EQ-5D health states. Results from a six-country study. Eur J Health Econ. 2003;4(3):222–31.
Spitzer RL, Kroenke K, Williams JB. Validation and utility of a self-report version of PRIME-MD: the PHQ primary care study. Primary care evaluation of mental disorders. Patient Health Questionnaire. JAMA. 1999;282(18):1737–44.
Kroenke K, Spitzer RL, Williams JB. The PHQ-9: validity of a brief depression severity measure. J Gen Intern Med. 2001;16(9):606–13.
Schmitz-Hübsch T, du Montcel ST, Baliko L, Berciano J, Boesch S, Depondt C, et al. Scale for the assessment and rating of ataxia: development of a new clinical scale. Neurology. 2006;66(11):1717–20.
Teubner-Liepert K. Einflussfaktoren und Prädiktoren von Stürzen bei degenerativer Ataxie. Universität Tübingen; 2019.
Jacobi H, Rakowicz M, Rola R, Fancellu R, Mariotti C, Charles P, et al. Inventory of Non-Ataxia Signs (INAS): validation of a new clinical assessment instrument. Cerebellum. 2013;12(3):418–28.
Fahey MC, Corben L, Collins V, Churchyard AJ, Delatycki MB. How is disease progress in Friedreich’s ataxia best measured? A study of four rating scales. J Neurol Neurosurg Psychiatry. 2007;78(4):411–3.
Cohen J. Statistical power analysis for the behavioral sciences: Routledge; 2013.
Hinkle DE, Wiersma W, Jurs SG. Applied statistics for the behavioral sciences: Houghton Mifflin college division; 2003.
Akoglu H. User’s guide to correlation coefficients. Turk J Emerg Med. 2018;18(3):91–3.
Altman DG. Practical statistics for medical research. CRC Press; 1990.
Koo TK, Li MY. A guideline of selecting and reporting intraclass correlation coefficients for reliability research. J Chiropr Med. 2016;15(2):155–63.
Kazis LE, Anderson JJ, Meenan RF. Effect sizes for interpreting changes in health status. Med Care. 1989;27(3):S178–89.
Buchholz I, Janssen MF, Kohlmann T, Feng YS. A systematic review of studies comparing the measurement properties of the three-level and five-level versions of the EQ-5D. PharmacoEconomics. 2018;36(6):645–61.
Thompson AJ, Turner AJ. A comparison of the EQ-5D-3L and EQ-5D-5L. PharmacoEconomics. 2020;38(6):575–91.
Pickard AS, De Leon MC, Kohlmann T, Cella D, Rosenbloom S. Psychometric comparison of the standard EQ-5D to a 5 level version in cancer patients. Med Care. 2007;45(3):259–63.
Janssen MF, Bonsel GJ, Luo N. Is EQ-5D-5L Better than EQ-5D-3L? A head-to-head comparison of descriptive systems and value sets from seven countries. PharmacoEconomics. 2018;36(6):675–97.
Short H, Al Sayah F, Ohinmaa A, Johnson JA. The performance of the EQ-5D-3L in screening for anxiety and depressive symptoms in hospital and community settings. Health Qual Life Outcomes. 2021;19(1):96.
Schmitz-Hübsch T, Coudert M, Tezenas du Montcel S, Giunti P, Labrum R, Dürr A, et al. Depression comorbidity in spinocerebellar ataxia. Mov Disord. 2011;26(5):870–6.
McMurtray AM, Clark DG, Flood MK, Perlman S, Mendez MF. Depressive and memory symptoms as presenting features of spinocerebellar ataxia. J Neuropsychiatry Clin Neurosci. 2006;18(3):420–2.
Shah A, Ng X, Shah R, Solem C, Wang P, Obradovic M. Psychometric validation of the EQ-5D-3L in patients with nontuberculous mycobacterial (NTM) lung disease caused by mycobacterium avium complex (MAC). Patient Relat Outcome Meas. 2021;12:45–54.
Michalowsky B, Xie F, Kohlmann T, Gräske J, Wübbeler M, Thyrian JR, et al. Acceptability and validity of the EQ-5D in patients living with dementia. Value Health. 2020;23(6):760–7.
Yang Y, Brazier J, Longworth L. EQ-5D in skin conditions: an assessment of validity and responsiveness. Eur J Health Econ. 2015;16(9):927–39.
Pickard AS, Wilke C, Jung E, Patel S, Stavem K, Lee TA. Use of a preference-based measure of health (EQ-5D) in COPD and asthma. Respir Med. 2008;102(4):519–36.
Golicki D, Niewada M, Buczek J, Karlińska A, Kobayashi A, Janssen MF, et al. Validity of EQ-5D-5L in stroke. Qual Life Res Int J Qual Life Asp Treat Care Rehab. 2015;24(4):845–50.
Buchholz M, Weber N, Borel S, Sayah S, Xie F, Schulz JB, et al. Patient-reported, health economic and psychosocial outcomes in patients with Friedreich ataxia (PROFA): protocol of an observational study using momentary data assessments via mobile health app. BMJ Open. 2023;13(8):e075736.
Schmahmann JD, Pierce S, MacMore J, L'Italien GJ. Development and validation of a patient-reported outcome measure of ataxia. Mov Disord. 2021;36(10):2367–77.
Acknowledgements
This publication is an outcome of ESMI, an EU Joint Program—Neurodegenerative Disease Research (JPND) project (see www.jpnd.eu). Members of the EUROSCA and ESMI research study groups. Members of the ESMI study group: Jeanette Hübener-Schmid, Magda Santana, Marcus Grobe-Einsler, Berkan Koyak, Mafalda Raposo, Manuela Lima, Hector Garcia-Moreno, Paola Giunti, Luís Pereira de Almeida, Bart van de Warrenburg, Judith van Gaalen, Dagmar Timmann, Andreas Thieme, Kathrin Reetz, Imis Dogan, Carlo Wilke, Ludger Schöls, Olaf Riess, Matthis Synofzik, Jeroen de Vries, Jon Infante, Oz Gulin, James Joers, Chiadikaobi Onyike, Michal Povazan, Eva-Maria Ratai, Jeremy Schmahmann. Members of the EUROSCA study group: Sophie Tezenas du Montcel, Peter Bauer, Paola Giunti, Arron Cook, Robyn Labrum, Michael H. Parkinson, Alexandra Durr, Alexis Brice, Perrine Charles, Cecilia Marelli, Caterina Mariotti, Lorenzo Nanetti, Marta Panzeri, Maria Rakowicz, Anna Sulek, Anna Sobanska, Ludger Schöls, Holger Hengel, Laszlo Baliko, Bela Melegh, Alessandro Filla, Antonella Antenora, Jon Infante, José Berciano, Bart P. van de Warrenburg, Dagmar Timmann, Sandra Szymanski, Sylvia Boesch, Jun-Suk Kang, Massimo Pandolfo, Jörg B. Schulz, Sonia Molho & Alhassane Diallo.
Funding
Open Access funding enabled and organized by Projekt DEAL. The EuroQol Research Foundation funded this research (Grant No.: 349-RA). The funders have not influenced the conceptualization and conduct of the study and will not have any role in the data analysis and publication of the results.
The project is supported through the following funding organizations under the aegis of JPND: Germany, Federal Ministry of Education and Research (BMBF; funding codes 01ED1602A/B); Netherlands, The Netherlands Organization for Health Research and Development; Portugal, Foundation for Science and Technology and Regional Fund for Science and Technology of the Azores; United Kingdom, Medical Research Council. This project has received funding from the European Union’s Horizon 2020 research and innovation program under Grant Agreement No. 643417.
Author information
Authors and Affiliations
Consortia
Contributions
MB: data quality control, statistical analysis (conceptualization and design), and manuscript (drafting and revision); NW: data preparation, data quality control and data monitoring for analysis, and manuscript (review and critique); AR: manuscript (review and critique); JF: manuscript (review and critique); TSH: manuscript (review and critique); HJ: manuscript (review and critique); FX: manuscript (review and critique); TK: manuscript (review and critique); BM: development of the research project (conception, organization, monitoring), statistical analysis (review and critique), and manuscript (review and critique). All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Ethical Approval and Consent to Participate
ESMI and EUROSCA were approved by the ethics committees of the participating centers and have been performed in accordance with the ethical standards laid down in the 1964 Declaration of Helsinki and its later amendments. Written informed consent was obtained from all study participants.
Conflict of Interest
Feng Xie and Bernhard Michalowsky are members of the EuroQol group. The authors declare no competing interests.
MB reports no disclosures. NW reports no disclosures. AR reports no disclosures. JF was funded as a fellow of the Cluster for Excellence in Clinical Neuroscience of the Hertie Foundation and received funding of the National Ataxia Foundation (NAF). TS-H receives research grants from Celgene/bms and research grant from Hexal AG. HJ reports no disclosures. FX reports no disclosures. TK is receiving research support from the Bundesministerium für Bildung und Forschung (BMBF), the National Institutes of Health (NIH) and Servier. Within the last 24 months, he has received consulting fees from Biogen, UCB, and Vico Therapeutics.
Disclaimer
The authors’ views do not necessarily reflect the views of the EuroQol group. The EuroQol Foundation is not involved in the study’s design, data assessment, analyses, or interpretation of the results.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
ESM 1
(DOCX 210 kb)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Buchholz, M., Weber, N., Rädke, A. et al. Health-Related Quality of Life in Patients with Spinocerebellar Ataxia: a Validation Study of the EQ-5D-3L. Cerebellum (2023). https://doi.org/10.1007/s12311-023-01597-3
Accepted:
Published:
DOI: https://doi.org/10.1007/s12311-023-01597-3