FormalPara Key Summary Points

Why carry out this study?

The quality of life (QoL) of patients with relapsing–remitting chronic diseases plays an important role in clinical decision-making by physicians and their patients regarding treatment.

Few studies evaluate real-world, patient-reported health outcomes and QoL in follicular lymphoma (FL). These outcomes are important, as prolonged survival in these patients incurs a longer-term disease and treatment burden, as well as possible deterioration in QoL.

This analysis of data from a real-world survey quantified the impact of FL across five European countries and the United States to investigate differences across countries and the effects on patient QoL compared with a non-Hodgkin’s lymphoma (NHL) reference population across treatment lines.

What was learned from the study?

While FL is perceived as an indolent disease, patients with FL who typically experience remissions and relapses throughout their life span have high unmet needs regarding QoL.

In general, patients with FL had worse QoL than those of the NHL reference population, with most aspects of QoL being significantly worse at later lines of therapy compared to earlier lines. Differences were identified across countries, as functioning and, symptoms were worse for patients in Europe (France, Germany, Italy, Spain, the United Kingdom) than the United States.

These real-world findings highlight the need for novel FL therapies that maintain/ improve QoL and alleviate patient burden.

Introduction

The indolent B-cell lymphoproliferative disorder follicular lymphoma (FL) is the second most common type of lymphoma in Western Europe and the United States (US) [1,2], accounting for approximately 30–35% of all non-Hodgkin lymphomas (NHLs) [2,3].

FL is heterogeneous, and typically has a long natural course of remissions and relapses. The disease is therefore generally considered a chronic, life-long condition [1]. The 5-year relative survival rate is 90.6% [4], although is significantly worse for patients with transformation [5]. Most patients with FL will have a prolonged survival with or without treatment [6,7], with overall survival potentially extending beyond 20 years [8]. It is therefore important to consider quality of life (QoL) when deciding a suitable treatment. Symptoms such as fatigue [9,10] can significantly impact QoL in patients with FL [11], and treatment can have an even greater negative impact on the patient than the disease itself [12].

The treatment landscape for NHLs was changed immensely by the introduction of immunotherapies. The anti-CD20 monoclonal antibody rituximab, plus chemotherapy for induction followed by maintenance therapy, has resulted in improved outcomes for patients with FL [13,14,15,16]. More recently, autologous chimeric antigen receptor (CAR) T-cell therapy that targets lymphoma-associated antigens, such as CD19, has demonstrated significant clinical efficacy, and prolonged responses in chemotherapy-refractory patients [17,18,19]. Bispecific antibodies that concurrently bind target tumor cells and immune effector cells have also improved treatment responses in patients with refractory tumors [20,21].

The safety profiles of CAR T cells and bispecific antibodies have generally been acceptable and manageable [20,22], although there is the potential for serious adverse events related to immune system overactivation, including cytokine-release syndrome and immune effector cell-associated neurotoxicity syndrome [20,23].

There remains a paucity of data evaluating real-world, patient-reported outcomes and QoL in FL. Considering the seemingly indolent disease progression of FL due to its chronic relapsing–remitting nature, it is important to understand patient-reported QoL, as disease- and treatment-related symptoms and disease activity have a long-term impact on patients. Patient-reported outcomes (PROs) may also differ according to patients’ disease stage and whether they are in remission. A study in patients with FL found that patients who had relapsed were more likely to experience worse QoL than patients who were newly diagnosed, in remission, or disease-free [12].

To address the information gap, we aimed to quantify the impact of FL on patients’ QoL, and to investigate QoL differences across treatment lines, geographic regions, and countries, using data from a multinational, real-world survey in five European countries and the United States (US).

Methods

Survey Design

Data were drawn from the Adelphi Real World (ARW) FL Wave II Disease Specific Programme™ (DSP™), a cross-sectional survey with elements of retrospective data collection of physicians and their patients in Europe [France, Germany, Italy, Spain, the United Kingdom (UK)] and the US from June 2021 to January 2022. The methodology has been previously described, validated, and demonstrated to be representative and consistent over time [24,25,26,27], and is well-established with surveys in many different disease areas implemented globally [28,29,30,31].

Physicians completed online surveys and patient record forms (PRF) for the next six consecutively consulting patients (one patient on first-line therapy [1L], three patients on second-line therapy [2L], and two patients on third-line therapy and beyond [3L+)]. Each record included details regarding patient demographics and clinical characteristics such as ECOG score and Ann Arbor staging at data collection. Physicians completed the record using a combination of existing patient clinical records, their own judgement and diagnostic/interpretation skills, and information collected during the consultation.

Each patient for whom a physician completed a PRF were then invited to voluntarily fill out a pen-and-paper patient self-completion form (PSC). The PSC included detailed questions on patient demographics, functioning, and symptoms via the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire (EORTC QLQ-C30) [32,33,34].

Data were collected in such a way that patients and physicians could not be identified directly. Physician and patient data were pseudo-anonymized. A code was assigned when data were collected. Upon receipt by ARW, data were pseudo-anonymized again to mitigate against tracing them back to the individual. Data were aggregated before being shared with the subscriber and/or for publication.

Survey Population

A geographically representative sample of physicians were recruited to participate in the DSP. Physicians were eligible to participate in this survey if they were medical oncologists, hematologists, or hem-oncologists. Physicians had to be personally responsible for treatment decisions and management of patients with FL, and for making treatment decisions for at least four patients with FL in a typical month.

Patients were eligible for inclusion if they were over the age of 18 years at the time of data collection, had a physician-confirmed diagnosis of FL, and were receiving active drug treatment for FL at the time of data collection. Patients were excluded if they were participating in a clinical trial. Steroid monotherapy was not considered to be an active drug treatment.

Survey Measures

The EORTC QLQ-C30 questionnaire comprises a global health status/quality of life (GHS/QoL) scale, five functional scales (physical, role, emotional, cognitive, and social functioning), eight symptom scales (fatigue, nausea/vomiting, pain, dyspnea, insomnia, appetite loss, constipation, and diarrhea) and a financial difficulties scale. Scale scores are linearly converted to range from 0 to 100. For the functioning and GHS/QoL scales, higher scores indicate better functioning. For the symptom and financial difficulties scales, higher scores indicate worse symptom burden and greater financial difficulties, respectively [32,34].

Survey Ethics and Consent

This research complied with all relevant market research guidelines and legal obligations, and data were collected according to European Pharmaceutical Marketing Research Association guidelines [35].

The survey was conducted in accordance with the Pearl Institutional Review Board (protocol number 9061). Where patients provided data directly, they signed an informed consent form prior to participation in this study. Each survey was performed in full accordance with relevant legislation at the time of data collection, including the US Health Insurance Portability and Accountability Act 1996 [36] and Health Information Technology for Economic and Clinical Health Act legislation [37].

Physician participation was financially incentivized, with reimbursement upon survey completion according to fair market research rates.

Statistical Analysis

Data were analyzed only where physician- and patient-reported data were matched, i.e., in cases where both PRFs and PSCs, respectively, were completed. Data were analyzed by country (France, Germany, Italy, Spain, UK, US) and region (Europe, US), summarized using descriptive analyses, and compared using bivariate analyses. Means and standard deviations (SD) and medians and interquartile ranges (IQR) were calculated for continuous variables, and frequency and percentages were calculated for categorical variables. Missing data were not imputed; therefore, the base of patients for analysis could vary from variable to variable and was reported separately for each analysis. EORTC QLQ-C30 scores for Europe were compared to the US using a t test. EORTC QLQ-C30 scores for each country were compared to the other countries using a t test and across-all-countries analysis of variance statistical tests were performed. Statistical analysis was performed in Stata 17.

Using a t test, patient EORTC QLQ-C30 scores from our survey were compared against validated EORTC QLQ-C30 NHL reference scores [34], which summarize QoL for patients with NHL. This allows for an understanding of DSP patients with FL relative to a general patient population with NHL. Descriptive and bivariate analyses were performed to compare QoL (per EORTC QLQ-C30) across line of therapy (LOT) (1L vs. 2L, 1L vs. 3L+ , and 2L vs. 3L+). A test for trend across lines of therapy was performed using the Jonckheere–Terpstra test. A p value of < 0.05 was considered statistically significant.

Results

All results discussed below regarding disease and symptom burden are statistically significant (p < 0.05), unless stated otherwise.

Physician and Patient Sample Populations

A total of 251 oncologists/hematologists/hem-oncologists completed PRFs for 1478 eligible patients. A total of 401 patients completed a PSC and were included in the analyses. The top three physician reported guidelines across all markets were National Comprehensive Cancer Network (59%), European Society of Medical Oncology (57%), and American Society of Clinical Oncology (55%) (Table 1).

Table 1 Physician and patient sample populations

Patient Demographics and Clinical Characteristics

Mean (SD) age was 66.0 (9.24) years, 58.1% were male, and 62.8% were retired (Table 2). The mean (SD) age of patients in Europe was 66.0 (9.12) years and in the US was 66.1 (9.73) years. The proportion of males in the European cohort was 55.7% and in the US cohort was 66.7%. In Europe, 8.9% of patients were working full/part-time, whereas, in the US, 25.3% were working full-/part-time.

Table 2 Physician-reported demographic and clinical characteristics of patients with follicular lymphoma

Almost two-thirds (64.8%) of patients with FL had Ann Arbor stage III (41.9%) or stage IV (22.9%) disease at data collection. Ann Arbor stage III and stage IV were reported for 44.3% and 18.5% of patients in Europe, respectively, versus 33.3% and 39.1% in the US. Overall, 20.0% and 61.1% of patients had an Eastern Cooperative Oncology Group Performance Status (ECOG-PS) score of 0 and 1, respectively. Patients’ level of functioning in terms of their ability to care for themself, daily activity, and physical ability varied widely among the six countries. An ECOG-PS score of 0 or 1 was observed in 79.9% of patients in Europe (ranging from 65.0% in Germany to 95.3% in Italy) and in 85.1% in the US. An ECOG-PS score of 2 or 3 was reported for 20.1% of patients in Europe (ranging from 4.9% in France and 4.8% in Italy to 35.0% in Germany) and for 14.9% of patients in the US. A Follicular Lymphoma International Prognostic Index-2 (FLIPI-2) score of 2 (indicating intermediate risk) was observed in 38.5% of patients with FL, whereas 37.0% had a score of 3–5 (indicating high risk). In the US, a higher percentage of patients with FL had a high risk FLIPI-2 score (42.3%) compared to those 36.4% across Europe (ranging from 30.5% in Italy to 42.5% in Germany).

At data collection, 48.4% of patients were receiving 2L treatment and 76.1% were receiving 2L+ treatment. Over one-quarter (27.7%) of patients were receiving 3L+ therapy. The top three treatments received at 1L induction therapy were R-CHOP (rituximab, cyclophosphamide, doxorubicin, vincristine, prednisolone), R-bendamustine (rituximab, bendamustine) and rituximab (60.6%, 10.0%, and 5.5%, respectively). At 2L induction therapy, the top three treatments received were R-bendamustine, O-bendamustine (obinutuzumab and bendamustine), and R-CHOP [25.9%, 18.4%, and 14.1%, respectively]. At 3L induction therapy, the top three treatments received were idelalisib (18.0%), rituximab + lenalidomide (17.1%), and obinutuzumab + lenalidomide (12.6%) (Table 2). Overall, patients in Europe were more likely to be receiving later lines of therapy than in the US, with 80.3% and 60.9% of patients receiving 2L+ treatment, respectively. In the US, 13.2% of patients were receiving CHOP + obinutuzumab (cyclophosphamide, doxorubicin, vincristine, prednisolone) for 2L induction therapy, with 20.8% of patients having received R-bendamustine. In Europe, 27.0% of patients with FL received R-bendamustine for 2L induction therapy with 20.6% having received O-bendamustine.

A total of 59.9% of patients with FL received supportive therapies to treat their FL or any other concomitant conditions. Across Europe, this was 66.9% of patients compared to 34.5% in the US. Granulocyte colony-stimulating factors (G-CSFs) were used by 25.0% of patients with FL. In the US, usage of G-CSFs in patients with FL was 3.3%, compared to 28.1% usage in Europe.

Forty-four percent of patients with FL were diagnosed between 0 and 2 years before the time of data collection and 24.3% of patients were diagnosed between 2 and 4 years before the time of data collection (Table 2). On average (SD), those who were receiving 1L treatment at time of data collection started their treatment 162.3 (200.45) days before the time of data collection. For 2L, this decreased to 145.3 (186.47) days, while for 3L, it decreased to 113.0 (128.00) days before the time of data collection (Table 2).

Disease Burden and Patient-Reported EORTC QLQ-C30

Mean EORTC Scores by Study Regions and Countries

The mean (SD) score for GHS/QoL in the FL DSP population across all markets was 51.8 (20.06), ranging from 50.1 (18.90) in Europe to 57.6 (22.99) in the US (Supplementary Table S1).

US scores for GHS/QoL and all functioning domains were higher and almost all symptom domains (excluding diarrhea) were lower versus all other European countries (Supplementary Table S2). The mean (SD) German GHS/QoL score was lower compared to the other countries [41.3 (13.91) vs. 54.8 (18.00) France, 51.9 (13.39) Italy, 59.6 (22.79) Spain, 55.4 (19.07) UK, and 57.6 (22.99) US; Supplementary Table S2]. The UK and Spain had higher scores for GHS/QoL, physical, role, and social (UK only) functioning compared to all other countries, while Italy had lower scores for emotional, cognitive, and social functioning. In Germany, almost all functioning scores (excluding cognitive functioning) were lower, and almost all symptom scores (excluding constipation) and financial difficulties were higher versus all other countries. France was not significantly different from all other countries in any of the functions, symptoms, or GHS/QoL.

Mean EORTC scores vs the NHL Reference Value Mean Scores

The mean (SD) score for GHS/QoL in the FL DSP population was worse when compared with NHL reference value [51.8 (20.06) vs. 56.1 (27.1); Table 3]. Mean (SD) functioning scores for role, cognitive, and social functioning in FL DSP patients were better than those of the NHL reference values [62.7 (27.55) vs. 57.3 (36.2), 74.4 (22.61) vs. 68.5 (30.8), and 65.8 (26.22) vs. 60.4 (34.1)], respectively. However, interestingly, mean (SD) symptom scores were significantly worse in FL DSP patients compared to the NHL reference values with regard to nausea and vomiting [17.6 (20.13) vs. 10.0 (28.9)], pain [27.9 (25.71) vs. 24.5 (30.1)], dyspnea [26.0 (28.32) vs. 16.9 (24.6)], appetite loss [33.2 (28.23) vs. 30.7 (31.5)], and diarrhea [15.3 (22.37) vs. 9.5 (19.1)].

Table 3 Disease burden of patients with FL: bivariate analysis comparing FL DSP EORTC QLQ-C30 total (all therapy lines) scores with NHL reference valuesa,b

Mean EORTC Scores Across Lines of Therapy

In our study, mean (SD) scores for GHS/QoL [[56.5 (22.21) to 50.4 (20.11)] and for all functioning domains (excluding emotional functioning) worsened from 1 to 3L+ , with the worsening across lines of therapy in physical functioning [77.9 (18.36) to 66.2 (20.70)], role functioning [72.7 (23.59) to 57.1 (27.48)], and social functioning [70.1 (24.89) to 61.0 (28.38)] reaching statistical significance for the trend (Table 4). Mean (SD) scores for fatigue [35.1 (21.53) to 44.7 (22.55)], pain [22.2 (23.40) to 32.1 (26.37)], and dyspnea [20.5 (25.76) to 30.3 (30.14)], all worsened across the lines of therapy from 1 to 3L+, reaching statistical significance for trend (Table 4).

Table 4 Disease burden of patients with FL, across lines of therapy

Pairwise Analysis of Mean EORTC Scores between Lines of Therapy

Mean scores for GHS/QoL and all functioning/symptom scales for all therapy lines are shown in Table 5. Mean (SD) score for GHS/QoL at 2L was 50.4 (18.72) worse than at 1L [56.5 (22.21)], there were no differences between 1 and 3L [50.4 (20.11)] and 2L and 3L. Physical functioning was the only domain to consistently have mean (SD) scores differing significantly between the lines of therapy, with scores at 2L worse than at 1L [71.4 (19.57) vs. 77.9 (18.36)], scores at 3L+ worse than at 2L [66.2 (20.70) vs. 71.4 (19.57)], and scores at 3L+ worse than at 1L [66.2 (20.70) vs. 77.9 (18.36)].

Table 5 Disease burden of patients with FL, by line of therapy: bivariate analysis

Discussion

In this study, we analyzed real-world data from a multinational survey of patients with FL, exploring PROs and QoL in patients receiving 1L, 2L, and 3L+ therapy. We found that GHS/QoL was generally poor among patients with FL, with patients in our study reporting significantly worse mean scores than a reference population of patients with NHL. Similarly, we found most symptom scores were worse in patients with FL compared with those with NHL, including, nausea and vomiting, pain, dyspnea, appetite loss, and diarrhea. The functioning and symptom burden in patients with FL generally worsened with successive LOT.

Other studies in Europe reported similar findings, with a study conducted by Johnsen et al. also reporting poor mean GHS/QoL and citing dyspnea, among other symptoms, as one of the most prevalent symptoms cited by patients with FL [38]. Of note, the mean GHS/QoL we reported here was even lower than was found in the study by Johnsen et al., by around five units across several of the functional and symptom scales. Five units is the minimal clinically important difference for EORTC QLQ-C30 reported in a study across several tumor types [39], suggesting that the patients in our study had a perceptibly worse QoL than those studied by Johnsen et al. A study in Germany focused on the effect of treatment on QoL in patients with FL reported that QoL was low in all patients, but that patients with FL receiving rituximab and chemotherapy had worse QoL than patients receiving high-dose chemotherapy followed by peripheral blood stem cell transplantation [40]. Taken together, these findings suggest that both the burden of the disease and the side effects of treatment can lead to poor QoL in patients with FL, highlighting the need for novel therapies and improvements in supportive care interventions that may maintain or improve QoL.

The recurring nature of FL means that patients are generally treated with multiple LOT during their treatment journey. A large proportion of patients with FL in our study were receiving 2L and 3L+ therapy, with many reporting worse overall GHS/QoL across successive LOT. Similarly, symptom burden in patients with FL was observed to be higher in later LOT on several key scales (fatigue, pain, and dyspnea). Previous studies exploring other patient outcomes have also demonstrated an effect of LOT, finding that progression-free survival and overall survival diminished across LOT [41,42]. Our findings are similar to those in patients with other cancers, with studies on patients with multiple myeloma in Germany and France finding those receiving later LOT had lower QoL compared with patients earlier in their treatment journey [43,44]. There are likely to be multiple reasons for a decrease in QoL with successive LOT, for example, patients on later LOT have likely had their disease and the side effects of their treatment for longer, as well as experiencing disease progression and increase in age over successive therapies, all of which may decrease patients’ overall QoL [43]. One solution to improve QoL for patients on later lines of therapy could be for physicians to prescribe a better supportive therapy in conjunction with chemotherapy. For example, the use of a granulocyte colony-stimulating factor (G-CSF) administered during chemotherapy treatment has been found to reduce the risk of infection, including febrile neutropenia and pneumonia, complications often resulting from chemotherapy [45,46]. As a result of receiving a G-CSF, patients had fewer hospitalizations and fewer days in hospital compared with patients not receiving this therapy [45].

In contrast to our other results on patient QoL across LOT, we found that EORTC QLQ-C30 emotional functioning scores in patients with FL were not significantly different between LOT. A possible explanation for this may be that receiving therapy, irrespective of which line it is, may help to reduce patient anxiety and/or depression and allow them to feel more in control of and hopeful about fighting their disease [6,10]. Improving knowledge of individual patient genetic and epigenetic profiles could help to identify those patients at high risk for transformation and disease progression earlier. This may help to better target therapies to achieve longer remission and reduce the number of LOT patients receive [47], maintaining or improving patients’ QoL.

Our analysis showed that the GHS/QoL of patients with FL differed significantly across countries in numerous EORTC functioning and symptom domains, with patient QoL reported as significantly worse in Europe compared with QoL in the US. We also found differences between the five European countries included in the study, with QoL in Germany reported as significantly lower and QoL in Spain and the UK being significantly higher compared to the other countries examined. Regional and country differences may reflect differences in populations and healthcare systems across countries; however, further research is needed to elucidate the reasons for country differences.

The limitations of DSP surveys have been previously described in other real-world studies using DSP data [25,30,48,49]. Several limitations should be considered in the evaluation of our findings. Patients participating in the surveys may not reflect the general FL population, as a certain number of patients were recruited for each disease severity category to ensure that the sample size in each was sufficient to fulfil the quota of the study. Similarly, we only included physicians in the study who were treating more than four patients monthly, which may have led to our data coming from busier physicians based in a specific environment, such as academic medical centers or busy urban clinics rather than quieter, more rural settings. For some countries in the study, we had relatively lower patient numbers, particularly in France, Italy, and the UK. This may have contributed to the variation in individual country outcomes and differences between countries in patient demographics and clinical characteristics. These differences in sample sizes across countries may also have led to observed differences in QoL across Europe versus the US. The analyses did not control for individual patient characteristics and treatment. It should be noted that the survey did not analyze the therapies received across the different stages of FL therefore future research should investigate this. It should also be noted that the survey did not include patients with FL who had transformed from an indolent form to a more aggressive form. However, transformation rates are low, with estimates of 2–3% of patients annually in the rituximab era [5,50], and future research could benefit from including transformed patients with the FL population. This study only included patients who were actively receiving treatment at the time of data collection, therefore, patients who were on a “watch and wait” period were not included. The minority of patients undergo watch and wait [41]; however, future research could include this patient subset for comparison in their analyses. The data included here were from the six countries that were the focus of this study, and therefore results may not be representative of patients with FL in other countries not included in this study. For better understanding of real-world QoL in patients with FL on a larger scale, future surveys should ensure data are more reflective of epidemiology internationally, beyond the six countries included here. Future research should compare the baseline characteristics of patients who completed versus patients who did not complete the patient self-completion questionnaire, and evaluate the reasons for non-completion to assess any biases that may exist in the results reported in this study.

Data were collected during the COVID-19 pandemic, during which treatment may have been adjusted to accommodate the restrictions imposed on the populations across the different countries. Moreover, QoL and functioning, particularly emotional and social well-being, may have been affected, being influenced by the living environment and access to healthcare during the pandemic. The findings may therefore be different from those if data had been collected pre- and/or post-pandemic.

Conclusions

Although FL is perceived as an indolent disease, patients have high unmet needs in their QoL, based on comparisons with NHL reference values. Furthermore, patient QoL decreases with subsequent LOT and may be worse in Europe than in the US. Maintaining QoL for patients with FL is an important factor in treatment decision-making, given the likelihood of many years of survival and multiple LOT. Our findings help address the patient-reported QoL data gap in FL treatment outcomes, and add to previously published physician-reported efficacy evidence. These findings highlight the need for novel FL therapies and supportive care interventions that increase QoL and alleviate patient burden. In light of the differences from NHL using the general GHS/QoL instrument, EORTC QLQ-C30, there is also the need for a FL-specific instrument for measuring QoL in these patients. Moreover, considering the paucity of data evaluating the patient with FL population, the FL data presented here may provide a benchmark to compare against, and can support future research into the impact of novel agents on clinical trial-assessed QoL.