Plain English summary

Breast Cancer is a chronic disease, since it is an ongoing condition that can recur, requires medical treatment and negatively affects health-related quality of life. The difficulty in interpreting quality of life scores prevents their use, but reference norms can help interpreting them. This study provides reference norms of three widespread quality of life measuring instruments (EORTC QLQ-C30, EORTC QLQ-BR23, and EQ-5D-5L), based on non-metastatic breast cancer patients in Spain.

Health-related quality of life questionnaires were administered to 1276 women with breast cancer at diagnosis and 2 years after. Quality of life was worse for women diagnosed at higher stages of cancer, and for those with other medical conditions. For this reason, reference norms at diagnosis and at the 2-year follow-up were provided according to age, number of conditions, and cancer stage. These reference norms can be useful to interpret the quality of life scores of women with non-metastatic breast cancer.

Background

Breast Cancer is a chronic disease [1] since it is an ongoing condition that can recur, requires medical treatment and affects negatively the health-related quality of life (HRQoL) [2], which therefore has become an important outcome in these patients [3,4,5,6].

HRQoL instruments are generic or specific according to their target population, and they can in turn be classified as psychometric profiles or econometric indexes according to their measurement model [7]. Generic instruments are applicable to any population, and are well suited for the comparison among diseases [5, 8, 9], while disease-specific instruments are more responsive in detecting changes over time and differences between groups. The EQ-5D has probably been the most widely used generic econometric instrument [8, 9], and the instruments developed by the European Organization for Research and Treatment of Cancer (EORTC) are the most frequently used cancer-specific ones [10]: the Quality of Life Questionnaire C-30 (EORTC QLQ-C30), common for all tumor locations [11], and the specific module for Breast cancer (EORTC QLQ-BR23) [12].

The difficulty in interpreting HRQoL scores has been identified as one of the main barriers to the widespread use of this type of outcomes [13]. A well-established approach to aid interpretation of scores has been to produce tables of normative data [11, 13, 14]. Reference norms of the EQ-5D-5L based on general population are currently available for several countries [15,16,17,18,19], but not for population with breast cancer. Reference values of EORTC QLQ-C30 and EORTC QLQ-BR23 based on breast cancer patients are available since 2008 [20], and updated for the EORTC QLQ-C30 in 2020 stratifying by age, breast cancer stage (early or metastatic), comorbid conditions, and performance status [21]. However, in this update norms were constructed by pooling samples of multinational randomized clinical trials without providing data at country level, despite the consistent differences between countries [14] and geographic areas [21].

Although there are general population-based reference norms of EORTC QLQ-C30 available at country level [22,23,24,25], there are patient-based reference norms for women with breast cancer and other breast diseases only for Germany [26].

The present study aimed to obtain reference norms of the EORTC QLQ-C30, the EORTC QLQ-BR23, and the EQ-5D-5L, based on a population of Spanish non-metastatic breast cancer patients at diagnosis and 2 years after, according to relevant demographic and clinical characteristics.

Methods

Study population and setting

We analyzed data of patients included in the CaMISS (Spanish abbreviation for Health Services Research in Breast Cancer) prospective observational study [27] including consecutive women aged 18 years or older diagnosed with incident breast cancer in one of the participant Spanish hospitals from April 2013 to May 2015. Exclusion criteria were: diagnosis of sarcoma, lymphoma or inflammatory carcinoma; breast cancer recurrence; terminal illness; or inability to respond to questionnaires for any reason.

Women were selected from the lists for surgery and other oncological treatments through revision of inclusion and exclusion criteria in their medical records. Eligible patients were contacted, informed, and invited to participate by phone, and their written informed consent was requested. Out of the 1629 eligible patients invited to participate in the CaMISS study, 1456 accepted (89%): 1176 from 6 hospitals in the Basque Country, 97 from 2 hospitals in the Canary Islands, 97 from one hospital in Catalonia, and 86 from one hospital in Andalusia. After excluding 75 patients with no tumor stage information, 18 in stage IV and 87 without HRQoL data at diagnosis, 1276 patients were included in the analyses. Participants were followed for 2 years (Median = 1.98 years; P25–P75 = 1.94–2.05), and their re-evaluation was performed from May 2015 to December 2017.

HRQoL questionnaires were administered before surgery or before beginning the neoadjuvant therapy, and 2 years after diagnosis. The first administration between diagnosis and treatment was performed during a hospital visit (43.3%) or through telephone interviews (56.7%). 2 years after diagnosis, HRQoL questionnaires were sent by post mail, with reminders at 2 weeks and at 2 months. In the interval among reminders, non-responders were also telephoned to remind them that a questionnaire had been sent and also to offer them the option of responding over the phone if they preferred (17.1% were administered through telephone interviews). There was one person responsible for data collection per hospital, who were trained for recruitment procedures, extraction of data from medical records and administration of HRQoL questionnaires.

The study was approved by the ethics committees of all participating hospitals and conducted according to the principles expressed in the 2000 revision of the Declaration of Helsinki.

Study variables

Participants' age, date of breast cancer diagnosis, clinical TNM classification, and diagnosis of other diseases were collected from medical records at recruitment. The Charlson Comorbidity Index [28] was constructed with 17 items, scoring from 0 to 6 points. Information about the treatments performed and recurrences suffered during the follow-up were collected from medical records 2 years after diagnosis.

The following socio-demographic variables were self-reported: education level, occupation, social class, and marital status. Social class was based on the Spanish National Classification of Occupations 2011, using a neo-Weberian approach [29]: (I) Large employers (≥ 10 employees) and higher grade professionals or managers; (II) Small employers, lower grade professionals or managers, higher grade technicians, sports professionals and artists; (III) Intermediate occupations (white collar workers); (IV) Lower supervisory and lower technician occupations; (V) Skilled primary workers and semi-skilled workers; and VI) Unskilled workers.

Health-related quality of life measures

HRQoL was assessed using the Spanish validated versions of EORTC QLQ-C30 [30, 31], EORTC QLQ-BR23 [12], and EQ-5D-5L [32].

The EORTC QLQ-C30 [31, 33] is comprised of 30 items that assess five functional scales, eight cancer symptoms, a financial difficulties scale, and a global health status scale. A single higher-order summary score was calculated, using 27 out of the 30 items (excluding global health status and financial impact) [34]. The EORTC QLQ-BR23 [12] comprises 23 items that assess four functional and four symptoms scales. Responses were transformed into scores ranging from 0 to 100, were higher scores indicate greater burden for symptom scales, but better results in functional scales and the EORTC QLQ-C30 summary score [34].

The EQ-5D-5L [32] comprises 5 questions to measure the mobility, self-care, usual activities, pain/discomfort, and anxiety/depression dimensions with a 5 response options Likert scale (from none to extreme problems). The EQ-5D-5L includes also a 6th question to self-rate health (EQ-VAS) with a visual analogue scale ranging from 0 to 100 (best health state imaginable). Applying the Spanish social preferences [35], a health index was obtained ranging from 1 (perfect health) to negative values for those health states considered worse than death. These preference values were obtained using a standardized protocol, combining the techniques of time trade-off and discrete choice [36].

Statistical analysis

The statistical power of the study was estimated as > 0.8, with an alpha risk of 0.05 (Type I error), to detect mean differences of 0.4 standard deviation on HRQoL scores for the comparison with the smallest sample size between women with breast cancer in stage 0 and stage III (N = 116 and 72, respectively). The study power was calculated with R package version 1.3–0 [37].

Characteristics of women who completed the HRQoL telephone interview at diagnosis and after 2 years were compared to women who did not complete the follow-up evaluation, using the Chi square test. HRQoL differences according to age, comorbidity and stage were tested with ANOVA and Tukey post hoc analyses or Chi Square test. Bar chart figures were created showing our results together with general population-based reference norms of the EQ-5D-5L for Spanish women [15], and of the EORTC QLQ-C30 for patients with non-metastatic breast cancer from European and Anglo-Saxon countries [21].

To identify the factors independently associated with HRQoL at diagnosis and 2 years after, linear regression models with the EORTC QLQ-C30 summary score were constructed, as well as censored linear regression models (Tobit) with the EQ-5D-5L index due to the right-skewed distribution. Marginal effects were obtained from the Tobit model as averaged individual marginal effects to restore the original range of the EQ-5D-5L index [38].

Reference norms were estimated for all patients with non-metastatic breast cancer, stratifying by the variables independently associated with HRQoL. We provide the percentage and standard error for the EQ-5D-5L dimensions. For the continuous scores of EQ-5D-5L, EORTC QLQ-C30 and BR-23 we provide the deciles, percentiles 5 and 95, mean, standard deviation (SD), and 95% confidence interval (95% CI). All analyses were performed with IBM SPSS Statistics, version 25.

Results

Sample characteristics at diagnosis

Table 1 shows socio-demographic and clinical characteristics of the sample, as well as treatment and recurrences observed during the 2-year follow-up. Mean age of the 1276 participants at diagnosis was 58 years (SD = 12), most were married or with a couple (67%), and with breast cancer diagnosed in stage I (52%) and II (30%). The treatments adjuvant to surgery most frequently applied were hormonotherapy (85%) and external radiotherapy (83%). The most frequent treatment combinations are: surgery with radiotherapy and hormonotherapy, which was applied to more than 40% of patients, followed by this combination in addition with chemotherapy (18%), and surgery with hormonotherapy in almost 10% of the patients. Patients with other treatments received concomitant medication for side effects or other diseases, such as granulocyte colony-stimulating factor (8 patients), methotrexate or zoledronic acid.

Table 1 Characteristics of women with non-metastatic breast cancer at diagnosis, treatment applied and recurrences detected during the 2-year follow-up

Statistically significant differences were found between the 1108 patients who completed the two HRQoL assessments and the 168 who did not complete the second evaluation. The latter group were younger, more frequently without couple, underwent mastectomy and lymphadenectomy, and had a higher percentage of detected recurrences.

Figure 1 shows that EQ-5D-5L index means for breast cancer patients were significantly lower (worse) than the reference norms based on general population of Spanish women [15] at the two time points, except for patients aged > 65 years. In general, the EORTC QLQ-C30 summary score means for breast cancer patients at diagnosis (Fig. 2) were similar to the reference norms based on European and Anglo-Saxon patients [21] at diagnosis, but significantly lower (worse) at the 2-year follow-up.

Fig. 1
figure 1

EQ-5D-5L Index according to age, tumor stage and Charlson Comorbidity Index: bars represent means and their 95% confidence intervals at diagnosis and at 2-year follow-up. Discontinuous lines represent the EQ-5D-5L index means of reference norms based on general population [15] Spanish women. Y axis represents the EQ-5D-5L index values from 1 (perfect health) to 0 (death). *p value < 0.05 obtained using Tukey post hoc analysis for comparing each category with the reference: aged < 40 years old, tumor stage 0, or Charlson Comorbidity Index 0

Fig. 2
figure 2

EORTC QLQ-C30 summary score according to age, tumor stage and Charlson Comorbidity Index: bars represent means and their 95% confidence intervals at diagnosis and at 2-year follow-up. Discontinuous lines represent the EORTC QLQ-C30 reference norms based on patients with non-metastatic breast cancer at diagnosis [21]. Y axis represents the EORTC QLQ-C30 summary score from 0 (worst) to 100 (best results). *p value < 0.05 obtained using Tukey post hoc analysis for comparing each category with the reference: aged < 40 years old, tumor stage 0, or Charlson Comorbidity Index 0

Table 2 shows statistically significant differences on EQ-5D-5L by age group and comorbidity, both at diagnosis and 2 years after. Only usual activities and pain/discomfort EQ-5D dimensions presented differences by tumor stage at diagnosis; while all the dimensions, the index and the EQ-VAS presented statistically significant differences after 2 years. Similarly, more EORTC QLQ-C30 scales (Table 3) presented statistically significant differences by comorbidity index and tumor stage at 2 years of follow-up than at diagnosis. Some scores of the EORTC BR23 (Table 4) also presented significant differences among groups defined by these variables.

Table 2 EQ-5D-5L results according to age, Charlson Comorbidity Index and stage: frequency and percentage of patients that reported having any problem in each dimension and mean (SD) of EQ-5D-5L index* and EQ-VAS**
Table 3 Mean scores and standard deviations of EORTC QLQ-C30 according to age, Charlson Comorbidity Index, and tumor stage
Table 4 Mean scores and standard deviations of EORTC QLQ-BR-23 according to age, Charlson Comorbidity Index, and stage

Multivariate models of EQ-5D-5L iIndex and EORTC QLQ-C30 summary score at diagnosis and at 2-year follow-up (Table 5) show the independent association of comorbidity and tumor stage with HRQoL. The standardized multivariate regression coefficient of EORTC QLQ-C30 summary score indicates that HRQoL was poorer for women with stage II and III than for those with stage 0 both at diagnosis (− 0.11 and − 0.07, p < 0.05) and at follow-up (− 0.15 and − 0.10, p < 0.01). The EQ-5D-5L index also indicated poorer HRQoL for women with Charlson comorbidity index ≥ 2 than comorbidity 0 at diagnosis (− 0.13, p < 0.001) and at follow-up (− 0.18, p < 0.001).

Table 5 Multivariate models of the EQ-5D-5L index and EORTC QLQ-C30 summary score as dependent variables

The reference norms for EQ-5D-5L (dimensions, index, and EQ-VAS), EORTC QLQ-C30 and BR23 scores at diagnosis and at the 2-year follow-up are provided in Supplementary data, stratified by age, comorbidity index, and tumor stage (Supplementary materials, Tables 1.1.1 to 2.3.8).

Discussion

This is the first study to obtain the Spanish reference norms of the EORTC QLQ-C30, the EORTC QLQ-BR23 and the EQ-5D-5L, based on women with non-metastatic breast cancer. The results obtained indicate that the HRQoL of these women differs according to age, comorbidity, and tumor stage at diagnosis and at 2 years of follow-up and, therefore, reference norms based on patient population stratified by these factors are recommended to interpret HRQoL results.

Our results showing HRQoL differences by age and comorbid conditions are consistent with the available evidence from studies on reference norms of the EQ-5D [15,16,17,18,19, 39] and the EORTC QLQ-C30 [21, 22], and confirm the need to stratify them by these variables. According to the magnitude of the HRQoL differences among stages, we decided to provide reference norms separately for stage 0-I and II-III. Previous EORTC QLQ-C30 reference norms also stratified by tumor stage, but with different aggregations (I-II and III-IV in the first one [20], and I-II-III and IV in the 2020 update [21]), both at diagnosis.

There are important differences in content between EQ-5D-5L and EORTC QLQ-C30, as they target different populations. EORTC QLQ-C30 is specifically focused on patients with cancer and, therefore, it includes domains that are especially relevant for this population, such as insomnia or fatigue. The lack of these domains in EQ-5D-5L has been highlighted in patients with multiple myeloma [40] and leukemia [41]. From a clinical perspective, EORTC instruments can be more useful to identify cancer-specific problems, though utilities cannot be directly obtained from EORTC QLQ-C30 and BR23 since they are psychometric instruments. Utilities can be calculated using algorithms developed through mapping models to EQ-5D [42,43,44]: two developed in patients with metastatic breast cancer (from the EORTC QLQ-C30 [42] or also from the BR-23 [43]) and one specifically developed for HER2-positive patients with recurrent, unresectable, or metastatic breast cancer [44]. These mapping algorithms allow economic evaluations, but administering the utility instrument itself would be the best option: either the EQ–5D–5L or the new breast cancer-specific utility instrument based on the EORTC QLQ-C30 and the BR-45 [45].

The distribution of the EQ–5D–5L index and its five health dimensions shows a marked aggregation of individuals in the best response option (no problems) at diagnosis, which is consistent with the reference norm from Spanish general population [15]. Worse HRQoL in young breast cancer patients than in young general population can be explained by a more drastic HRQoL impact from diagnosis, related to a more advanced stage at diagnosis and worse prognosis. In fact, among the patients aged under 40 years 50% were diagnosed in stage II or III and 6.6% presented metastasis, while among those aged 40–65 years only 33.6% were diagnosed in these stages and 1.7% presented metastasis. This higher HRQoL impact in young breast cancer patients has been previously described in other studies [42, 43].

Women presented worse EORTC QLQ-C30 results at the 2-year follow-up than at diagnosis for both functional and symptom dimensions, except for emotional function, insomnia and appetite loss. Probably these emotional aspects were already impacted just after discovering the cancer diagnosis. These results are consistent with long breast cancer survivors [46], who continue to experience limitations in a variety of HRQoL dimensions, with specific symptoms that persist or even increase over time and HRQoL restrictions not only associated with aging process but also with cancer and/or its treatment.

The women with breast cancer in our study presented similar HRQoL to the non-metastatic patients of the 2020 updated EORTC reference norms [21], which are also based on incident breast cancers cases at diagnosis. However, we obtained better results than the German reference values based on breast cancer patients in routine clinical practice without information about stages [26], and also better than the first EORTC reference norms from 2008 [20], which included patients at diagnosis (17% stage I-II, 14% stage III-IV, 29% not known stage) but also 41% patients recruited with recurrent/metastatic cancer. Our data at 2 years after diagnosis could not be compared with any of the abovementioned reference norms [20, 21, 26] because they did not report any follow-up.

On one hand, we constructed reference norms from a large sample of women with non-metastatic breast cancer consecutively recruited in ten Spanish hospitals, while the 2020 updated EORTC reference norms [21] were based on a polled analysis of randomized clinical trials, usually with restrictive inclusion criteria, which could not be representative of the population of women with breast cancer. On the other hand, our sample was composed mainly by participants from the Basque country, where the project was designed. Due to financial restrictions, the number of hospitals was limited in the other regions in order to recruit around a hundred participants in each one. The results of the comparison among regions followed the well-known geographic pattern of North–South inequalities on the socio-economic indicators, and also on HRQoL with EORTC QLQ-C30 summary means of: 87.0 at diagnosis and 85.5 at 2 years after in the Basque country, 87.3 and 83.6 in Catalonia, and 80.1 and 77.5 in Andalusia (p < 0.001). However, no differences on TNM stage of the breast tumor at diagnosis were found among regions, and the proportion of women diagnosed in the breast screening program in our study is consistent with the 49% reported in another Spanish region [47].

Therefore, as far as we know, this is the first study to provide reference norms based on breast cancer patients at 2 years after diagnosis in the EORTC and EQ–5D–5L, and also the first one specific for Spanish women. Furthermore, it provides values for the EORTC QLQ-BR-23, which was not included in the 2020 updated EORTC reference norms. Although the latter show relevant differences among European regions [21], no values at country level have been provided in these norms. Our study is one of the first, together with the German one [26], providing country-specific breast cancer patients-based reference norms.

As mentioned above, reference norms help to interpret results by comparing them to a control group, in this case Spanish women diagnosed with non-metastatic breast cancer. The difference between the observed score and the reference value provides the individual deviation from the population. For example, the EORCT QLQ-C30 global score of 88 at diagnosis and of 83 2 years later from a 60-year-old woman diagnosed in stage II indicated no deviation from the reference norms for women aged 40–65 years and diagnosed in stage II or III (means of 87 and 83), but these scores indicate a slightly worse HRQoL than expected when compared to the mean of 85, obtained by the whole group aged 40–65 years. This example illustrates why stratification by breast cancer stage could be suitable for the interpretation of HRQoL results.

The main limitation of the study is that CaMISS was not designed specifically to provide HRQoL reference norms, and therefore the recruitment strategy was not addressed to obtaining a sample that were equally representative of all Spanish areas, the North being overrepresented. Second, loss to follow-up is problematic in most cohort studies and often leads to bias. Although it is really small in our sample (13%), it is relevant to consider that women not answering HRQoL questionnaires at 2 years after diagnosis presented a higher rate of recurrences and received more aggressive treatments, which is consistent with results from German and Spanish cohorts of breast cancer patients [48, 49]. Thus, loss to follow-up could have produced and overestimation of HRQoL at the 2-year follow-up. Third, trained interviewers administered the HRQoL questionnaires mainly by telephone interviews before treatment, while almost all patients self-completed them at home, as recommended, 2 years after diagnosis. However, little difference between both administration methods has been reported [50]. Finally, no reference norms had been provided for the new EORTC BR-45 [51], an update of the BR-23 originally developed in 1996.

Having reference norms based on Spanish patients with non-metastatic breast cancer is fundamental, as they will facilitate the assessment of the impact of this disease, monitoring its evolution, comparing the impact of treatments, identifying populations that need special attention, and carrying out comparisons among different countries. These reference norms can be useful to interpret the scores obtained in women with non-metastatic breast cancer, who are the majority in Spain, by comparing them with country-specific reference values for this population.