Background

Rituximab (R) in combination with cyclophosphamide, doxorubicin, vincristine, and prednisone (CHOP) is the current standard of care for diffuse large B cell lymphoma (DLBCL) [1,2,3]. Despite treatment with R-CHOP, many patients’ relapse and outcomes remain poor with salvage therapies [4]. Obinutuzumab (GA101; G) is a fully humanized, glycoengineered, type II anti-CD20 monoclonal antibody which has shown greater direct cell death induction, antibody-dependent cellular cytotoxicity, and antibody-dependent cellular phagocytosis than R [5,6,7,8].

In a Phase II study in patients with advanced DLBCL, G demonstrated promising activity and an acceptable safety profile when combined with CHOP (G-CHOP) as a first-line treatment [9]. GOYA (NCT01287741) was a randomized, open-label, multicenter Phase III study that evaluated the efficacy and safety of G-CHOP compared with R-CHOP in patients with previously untreated DLBCL. Results from the primary analysis (clinical cut-off date, 29 April 2016), which had a median observation period of 29 months, showed that G-CHOP did not significantly improve investigator-assessed progression-free survival (PFS) compared with R-CHOP [10]. Here, we present the updated results from the final analysis of GOYA.

Methods

The study design and methodology of the GOYA study are described in full elsewhere [10]. In brief, patients were included if they were at least 18 years of age and had histologically documented, previously untreated, CD20-positive DLBCL; adequate hematologic function; ≥ 1 bi-dimensionally measurable lesion; an Eastern Cooperative Oncology Group (ECOG) performance status of ≤ 2; and an International Prognostic Index (IPI) risk group of high, high-intermediate, or low-intermediate risk. Low-risk patients with an IPI score of 1 (not due to age alone) or 0 with bulky disease (1 lesion ≥ 7.5 cm) were also eligible. Using stratified permuted block randomization, patients were randomized 1:1 to eight (21 days) cycles of G (1000 mg intravenous injection [IV] on days 1, 8 and 15, cycle 1 and day 1, cycles 2–8) or R (375 mg/m2 IV on day 1 of cycles 1–8) in combination with six or eight cycles of CHOP. The randomization stratification factors were number of planned cycles of CHOP, IPI score, and geographic region. Pre-planned radiotherapy was allowed for bulky or extranodal disease. The primary endpoint was investigator-assessed PFS. Secondary endpoints included independent review committee (IRC)-assessed PFS (primary analysis only); overall survival (OS); complete response (CR) and overall response rate (ORR) according to the modified Cheson 2007 criteria [11] by computed tomography (CT) and CT incorporating positron emission tomography (PET); event-free survival (EFS; defined as the period from the date of randomization until the date of disease progression, relapse, initiation of a new non-protocol-specified anti-lymphoma treatment, or death from any cause); disease-free survival (DFS; defined as the percentage of patients with CR at the end of treatment); time to next anti-lymphoma treatment; and safety. An exploratory endpoint of investigator-assessed PFS according to cell of origin (COO; germinal center B cell [GCB] or activated B cell [ABC]) based on gene expression profiling using the NanoString Research-Use-Only assay (NanoString Technologies, Inc., Seattle, WA) [12, 13] was also analyzed.

Statistical analysis

As previously described, it was planned to enroll approximately 1400 patients over 3 years, which was expected to yield 405 PFS events for the primary analysis [10]. The final analysis was conducted once patients had completed at least 3 years of follow-up. Comparisons between treatment arms for time-to-event endpoints were performed using a stratified two-sided log-rank test (α = 0.05). The analyses stratification factors were number of planned cycles of CHOP (6 or 8) and IPI score. Kaplan–Meier analysis was also used to analyze time-to-event endpoints. Estimates of treatment effect were calculated using Cox proportional hazards regression and are presented as stratified hazard ratios (HR) with 95% confidence intervals (CI).

Results

Patient characteristics and treatment

A total of 1418 patients were enrolled across 207 sites in 29 countries between July 2011 and June 2014. Of these, 1414 patients were included in the final analysis (clinical cut-off date, 31 January 2018); four patients from a single study site were excluded due to serious Good Clinical Practice non-compliance. A total of 704 and 710 patients were included in the G-CHOP and R-CHOP arms, respectively (intent-to-treat population), with 702 and 701 receiving at least one dose of study treatment (safety population). Demographic and baseline characteristics were well balanced between the two groups (Table 1). COO data were available for 933 patients, and the distribution of patients with each DLBCL subtype was similar between treatment arms. In total, the median time from diagnosis to randomization was 24.0 days (range, 1.0–1104.9) (G-CHOP 23.1 days [range, 3.0–1104.9]; R-CHOP: 25.0 days [range, 1.0–264.8]).

Table 1 Baseline patient and disease characteristics (intent-to-treat population)

A total of 117 patients (16.7%) in the G-CHOP arm and 105 patients (15.0%) in the R-CHOP arm discontinued any component of study treatment; 116 patients (16.5%) in the G-CHOP and 102 patients (14.5%) in the R-CHOP arm discontinued antibody treatment. The most common reason for antibody treatment discontinuation was adverse events (AEs), with a higher percentage of patients in the G-CHOP arm than in the R-CHOP arm discontinuing antibody treatment due to an AE (10.4% vs 6.1%) (Fig. 1).

Fig. 1
figure 1

Patient disposition from the final analysis of the GOYA trial. aOverall, 1418 patients were randomized (G-CHOP arm: N = 706 and R-CHOP arm: N = 712) to the study; however, due to a serious Good Clinical Practice non-compliance at a single site, data from all 4 patients enrolled at the site (two patients in each arm) were excluded from the final analysis. bA patient was considered to have completed the treatment if they received all 8 cycles of study treatment. cThe end of the study was defined as the last patient’s last visit and occurred at approximately 6.5 years (78 months) after the first patient was enrolled to allow all patients to have ≥ 3 years of follow-up post-treatment. G-CHOP, obinutuzumab plus cyclophosphamide, doxorubicin, vincristine, and prednisone; R-CHOP, rituximab plus cyclophosphamide, doxorubicin, vincristine, and prednisone

New (unplanned) anti-lymphoma treatment (including systemic new anti-lymphoma treatment, radiotherapy, or surgical procedure) was received by 113 patients (53 and 60 in the G-CHOP and R-CHOP arms, respectively) prior to disease progression and by 261 patients (122 and 139 in the G-CHOP and R-CHOP arm, respectively) after disease progression.

Efficacy

At the time of final analysis (median observation time, 48.0 months; range, 0.1–76.5 months for G-CHOP and 47.4 months; range, 0.1–78.2 months for R-CHOP), 224 (31.8%) patients and 233 (32.8%) patients had experienced an investigator-assessed PFS event in the G-CHOP and R-CHOP arms, respectively (stratified HR 0.94, 95% CI 0.78–1.12; p = 0.48) (Table 2, Fig. 2). Estimated 5-year PFS rates were 63.8% and 62.6% for G-CHOP and R-CHOP, respectively. The results of the secondary endpoints were similar between the two treatment arms (Table 2). In total, 149 (21.2%) patients in the G-CHOP arm and 145 (20.4%) patients in the R-CHOP arm had an OS event. Estimated 5-year OS rates were 77.0% and 77.7% for G-CHOP and R-CHOP, respectively (stratified HR 1.02, 95% CI 0.81–1.29; p = 0.84; Fig. 2). EFS at 5 years was 60.6% and 58.9% in the G-CHOP arm and R-CHOP arm, respectively (stratified HR 0.95, 95% CI 0.80–1.12; p = 0.53). Time to start of new anti-lymphoma treatment was similar between the two groups, with events experienced by 33.8% of patients in the G-CHOP arm and 35.2% of patients in the R-CHOP arm (stratified HR 0.93, 95% CI 0.78–1.12; p = 0.45). The proportion of patients with a CR was similar for the G-CHOP and R-CHOP arms when assessed with CT incorporating PET or CT alone (56.5% vs 59.1% and 35.4% vs 33.9%, respectively). The ORR was also similar between treatment arms with CT incorporating PET or CT alone (77.1% vs 77.6% and 81.4% vs 80.1% for G-CHOP vs R-CHOP, respectively).

Table 2 Summary of efficacy endpoints (intent-to-treat population)
Fig. 2
figure 2

Kaplan–Meier estimates of PFS by treatment group and OS by treatment group. a Investigator-assessed PFS (primary endpoint) by treatment (ITT population), in which no significant difference was found for G-CHOP compared with R-CHOP. b OS by treatment (ITT population), which showed no significant difference in survival between treatment groups. *Stratified by planned number of CHOP cycles and IPI score. CI, confidence interval; G-CHOP, obinutuzumab plus cyclophosphamide, doxorubicin, vincristine, and prednisone; ITT, intent-to-treat; HR, hazard ratio; PFS, progression-free survival; OS, overall survival; R-CHOP, rituximab plus cyclophosphamide, doxorubicin, vincristine, and prednisone

Overall, the results of the PFS subgroup analyses were consistent with PFS in the overall population with no significant difference observed between treatment arms for any subgroup according to stratification factors and baseline characteristics (Fig. 3); although, notably, patients with a high IPI score at baseline trended towards a better response to treatment with R-CHOP compared with G-CHOP (low-intermediate: HR 0.93, 95% CI 0.71–1.23; high-intermediate: HR 0.73, 95% CI 0.53–1.01; high: HR 1.27, 95% CI 0.87–1.86; Fig. 3).

Fig. 3
figure 3

Forest plot of unstratified HRs for investigator-assessed PFS by treatment group and patient subgroup. CHOP, cyclophosphamide, doxorubicin, vincristine, and prednisone; CI, confidence interval; ECOG PS, Eastern Cooperative Oncology Group performance status; G-CHOP, obinutuzumab plus cyclophosphamide, doxorubicin, vincristine, and prednisone; HR, hazard ratio; IPI, International Prognostic Index; KM, Kaplan–Meier; R-CHOP, rituximab plus cyclophosphamide, doxorubicin, vincristine, and prednisone

Kaplan–Meier analysis of PFS according to treatment arm in patients with different COO subtypes (ABC, GCB, and unclassified) is summarized in Fig. 4. The GCB subgroup appeared to be associated with a better PFS compared with ABC and unclassified subgroups (5-year Kaplan–Meier PFS estimates, 71.0%, 54.3%, and 57.9% in the G-CHOP arm and 65.5%, 55.7%, and 62.7% in the R-CHOP arm; Fig. 4).

Fig. 4
figure 4

Kaplan–Meier estimates of PFS by COO status by treatment group. a Investigator-assessed PFS by COO with G-CHOP, where GCB appeared to have a better outcome compared with ABC and unclassified subgroups. b Investigator-assessed PFS by COO with R-CHOP, where GCB also appeared to have a better outcome versus ABC and unclassified subgroups. ABC, activated B cell; COO, cell of origin; GCB, germinal center B cell; G-CHOP, obinutuzumab plus cyclophosphamide, doxorubicin, vincristine, and prednisone; PFS, progression-free survival; OS, overall survival; R-CHOP, rituximab plus cyclophosphamide, doxorubicin, vincristine, and prednisone

No significant reductions in the risk of disease progression with G-CHOP relative to R-CHOP were observed for patients with GCB, ABC, or unclassified DLBCL, although a trend towards benefit with G-CHOP compared to R-CHOP was apparent for the GCB subgroup (stratified HR, GCB 0.80, 95% CI 0.58–1.12; ABC 0.91, 95% CI 0.61–1.36; and unclassified 1.10, 95% CI 0.65–1.88; Fig. 3 and Supplementary Fig 1).

Safety

Safety results from the final analysis are consistent with those reported in the primary analysis, with no new safety signals. Most patients in each treatment arm (97.6% in the G-CHOP arm and 94.0% in the R-CHOP arm) experienced at least one AE (Table 3). The incidence of grade 3–5 AEs was higher in the G-CHOP arm (75.1%) compared with the R-CHOP arm (65.8%). Serious AEs were also more common in the G-CHOP arm (44.4% vs 38.4%).

Table 3 Summary of safety (AEs by preferred term reported by ≥ 5% of patients; safety population)

A higher number of patients in the G-CHOP arm compared with the R-CHOP arm discontinued any component of study treatment due to an AE (12.4% vs 8.3%). Fatal AEs occurred in 43 (6.1%) patients in the G-CHOP arm and 31 (4.4%) patients in the R-CHOP arm, with infections being the most common; in particular, five patients (0.7%) in each arm had a fatal case of pneumonia, and six patients (0.9%) in the G-CHOP arm versus no patients in the R-CHOP arm had a fatal case of septic shock (Table 3 and Supplementary Table 1). Overall, the most common cause of death was disease progression (G-CHOP 12.4% and R-CHOP 13.1%).

In total, 22 (3.1%) patients in the G-CHOP arm and 26 (3.7%) patients in the R-CHOP arm had a second malignancy. The most common of these were prostate cancer (0.4% in each arm), lung adenocarcinoma (0.4% and 0.3% in the G-CHOP and R-CHOP arms, respectively), and breast cancer (0.1% and 0.4% in the G-CHOP and R-CHOP arms, respectively). Prolonged neutropenia (0.3% vs 0.0%) and late-onset neutropenia (8.7% vs 4.9%) occurred at a greater frequency in the G-CHOP arm compared with the R-CHOP arm.

Discussion

This Phase III, open-label, randomized study was designed to compare the efficacy and safety of G-CHOP versus R-CHOP in previously untreated patients with DLBCL. In agreement with the findings of the primary analysis (clinical cut-off date, 29 April 2016) [10], results of this analysis did not demonstrate superiority of G-CHOP compared to R-CHOP.

After a median observation time of 47.7 months, investigator-assessed PFS (study primary endpoint) did not differ significantly between G-CHOP and R-CHOP (stratified HR 0.94, 95% CI 0.78–1.12; p = 0.48), with similar results to those obtained in the primary analysis (stratified HR 0.92, 95% CI 0.76–1.11; p = 0.39). The results of the secondary endpoints were consistent with the primary endpoint and did not show a benefit of G-CHOP over R-CHOP. The 5-year OS rates were similar for G-CHOP and R-CHOP (77.0% vs 77.7%).

G-chemotherapy has previously demonstrated superiority compared with R-chemotherapy in studies of patients with other B cell malignancies, such as follicular lymphoma (FL) and chronic lymphocytic leukemia [14, 15]. The lack of superior efficacy of G-CHOP compared with R-CHOP observed within this study was unexpected.

In the present study, we noted a trend towards a PFS benefit with G-CHOP versus R-CHOP in patients with GCB DLBCL. Like FL, GCB DLBCL is derived from germinal center B cells, and many driver genetic alterations that provide a selective advantage and contribute to cancer development are seen in both lymphomas [16]. This may, in part, explain the trend in benefit observed in the patients with GCB DLBCL, which is consistent with the improvement in outcome seen in patients with FL when treated with G-CHOP versus R-CHOP [14]. Interestingly, we found that the ABC DLBCL subgroup was associated with better PFS (5-year Kaplan–Meier PFS estimates: G-CHOP 54.3% and R-CHOP 55.7%) than reported in retrospective population analyses of previously untreated patients with DLBCL treated with R-CHOP (5-year Kaplan–Meier PFS estimates: R-CHOP 46% [17] and 48% [18]). This result may indicate a selection bias for low-risk patients in the GOYA trial, with very high-risk patients requiring urgent treatment less likely to be enrolled into a prospective trial such as this. It is noteworthy that the total median time from diagnosis to randomization was 24.0 days (range, 1.0–1104.9). While it is unclear how selection bias may have affected the overall result, there was no significant improvement in outcome with G-CHOP in any IPI subgroup.

The frequency and nature of the AEs reported were as expected for the patient population and the treatment regimens being assessed, and no new safety signals were observed during the additional follow-up after the primary analysis. Higher rates of grades 3–5 and serious AEs occurred in the G-CHOP arm compared with the R-CHOP arm; however, these were generally clinically manageable. A similar percentage of deaths occurred in the G-CHOP and R-CHOP arms (21.2% vs 20.1%), and the majority of these fatalities were due to disease progression (12.4% vs 13.1%). There were no unexpected delayed toxicities observed, and a similar number of second malignancies occurred in the G-CHOP and R-CHOP arms (3.1 and 3.7%, respectively). A higher frequency of late-onset neutropenia was observed in the G-CHOP arm compared with the R-CHOP arm (8.7% vs 4.9%, respectively).

Conclusions

In conclusion, the final analysis of this study demonstrated that G-CHOP did not show a PFS benefit over R-CHOP in previously untreated patients with DLBCL, and R-CHOP remains the standard of care in this population. The results of the secondary endpoints were consistent with the primary endpoint. Overall, no unexpected safety findings were observed, and the toxicity of G-CHOP was generally manageable. Exploratory analyses and investigations of biomarkers are ongoing to evaluate whether there may be a role for G in identifiable subgroups of DLBCL.