Background

Lung cancer remains the cancer with the highest incidence and fatality rates worldwide [1]. With the development and clinical application of molecular targeted drugs and immune checkpoint inhibitors, the survival of patients with advanced non-small cell lung cancer (NSCLC) has significantly improved [2, 3]. For patients with oncogenic driver negative non-squamous NSCLC (NS-NSCLC), regimens containing immunotherapy have become a new standard of first-line treatment [4]. Nevertheless, for patients with oncogenic driver (e.g., EGFR, ALK, and ROS1) positive NS-NSCLC whose disease progressed on prior targeted therapy, or patients with oncogenic driver negative NS-NSCLC who cannot tolerate immunotherapy, platinum-based chemotherapy with or without bevacizumab (a monoclonal antibody against vascular endothelial growth factor [VEGF]) remains the recommended first-line or subsequent therapy. Compared with chemotherapy alone, bevacizumab combined with chemotherapy can further prolong progression-free survival (PFS) and overall survival (OS) for patients with NS-NSCLC [5,6,7,8]. However, clinicians are still debating the better choice of first-line chemotherapy regimens (pemetrexed+platinum [PP] versus paclitaxel+carboplatin [PC]) in combination with bevacizumab.

In addition, in classic studies of AVAPERL and PARAMOUNT, advanced NS-NSCLC patients with disease control after 4 to 6 cycles of first-line induction chemotherapy can benefit from continuation maintenance treatment with bevacizumab (B), pemetrexed (Pem) or bevacizumab in combination with pemetrexed (Pem + B). However, PFS benefits with doublet maintenance did not translate into an OS advantage [9, 10]. Since two recent trials (COMPASS and EA5508) presented results on single-agent or doublet maintenance therapy at the 2019 American Society of Clinical Oncology meeting [11, 12], we conducted a meta-analysis of randomized control trials (RCTs) to assess the optimal first-line and maintenance regimens for NS-NSCLC patients who are assumed to be intolerant to immunotherapy, by comparing the efficacy and toxicity of first-line treatment regimens between PP ± B and PC + B, and maintenance treatment regimens between Pem + B, Pem, and B.

Methods

Search strategy

We identified eligible trials by an electronic search of the Cochrane library, PubMed, Embase, and Web of Science databases using the following terms: “non-small cell lung cancer,” “NSCLC,” and “pemetrexed,” “bevacizumab,” “paclitaxel,” and “chemotherapy,” “clinical trials,” and “maintenance treatment.” The search was performed on March 30, 2020. Two independent reviewers screened titles/abstracts and full text articles. The reference lists including related trials and review articles were manually retrieved.

Selection criteria

The inclusion criteria were as follows: (1) RCTs; (2) Studies that recruited untreated advanced NS-NSCLC patients; (3) Studies that compared cisplatin (or carboplatin) plus pemetrexed with or without bevacizumab (PP ± B) to carboplatin plus paclitaxel with bevacizumab (PC + B) as first-line treatment, or compared the combination of pemetrexed and bevacizumab to pemetrexed or bevacizumab monotherapy as maintenance treatment. (4) Studies included at least one of the followings as main outcome: OS, PFS, objective response rate (ORR), or grade ≥ 3 treatment-related adverse events (TRAEs). Studies were excluded if they were repeated published studies, non-RCT studies, or non-first-line therapy studies.

Data collection and quality assessment

Characteristics of trials extracted were: first author’s name, year of publication, patient characteristics, study name, study design and phase, sample size, treatment regimens of the study and control groups, maintenance regimens, and treatment cycles. Endpoints extracted were median PFS (mPFS), median OS (mOS), ORR, and grade ≥ 3 TRAEs. Engauge Digitizer 10.8 software (produced by Mark Mitchell 2014; https://github.com/markummitchell/engauge-digitizer) was used to extract hazard ratio (HR) and 95% confidence intervals (CI), as well as other details (such as numbers at risk) from survival curves if no detailed HR values or numbers at risk were given. Trial quality was assessed with the methods recommended by the Cochrane Collaboration for assessing risk of bias [13]. The criteria used for quality assessment were randomization sequence generation, allocation concealment, blinding of participants and personnel, blinding of outcome assessment, incomplete outcome data, selective reporting, and other biases. Each item was categorized as having high, low, or unclear risk. Sensitivity analysis was performed for the primary outcome with the leave-one-out approach.

Statistical analysis

The meta-analysis was performed using STATA 12.0 (StataCorp, College Station). Analyses were stratified by trial. We compared the efficacy of each treatment regimen during the induction and maintenance phases. The evaluation included OS, PFS, ORR, and TRAEs. OS was evaluated from the beginning of randomized therapy until death due to any cause. PFS was defined as the beginning of randomized therapy until first event (progression or death from any cause). PFS and OS were expressed as HRs. The ORR, PFSR1y, and the rate of grade ≥ 3 TRAEs were expressed as risk ratios (RRs). All p-values were two-sided and were considered statistically significant at the 0.05 level. Heterogeneity was assessed with χ2 test (α = 0.1) and I2 statistics. When statistics heterogeneity did not exist among studies (P > 0.10, I2 < 50%), we used a fixed-effect model; if heterogeneity did exist (P < 0.10, I2 > 50%), we found the cause and changed to a random-effect model.

Results

Characteristics of included trials

Based on the inclusion and exclusion criteria, six RCTs [9, 11, 12, 14,15,16], including 3144 NS-NSCLC patients were included in this meta-analysis. The baseline characteristics of the included studies are in Tables 1 and 2. Among them, three trials [14,15,16] were included in analysis comparing first-line treatment regimens between PP ± B and PC + B. Three other trials [9, 11, 12] were included for analysis to compare maintenance regimens between Pem + B and B. The flow diagram of the literature retrieval and selection is in Fig. 1.

Table 1 Characteristics of the included studies
Table 2 Clinicopathological characteristics of the included patients
Fig. 1
figure 1

Overview of study search and selection. A list of excluded papers after reading titles and abstracts can be found in additional file 1

Comparisons of first-line therapy between PP ± B and PC + B

Three RCTs including 1418 patients were used to compare the efficacy and safety of PP ± B and PC + B [14,15,16], in which PP + B and PP subgroups were compared with PC + B. Indirect comparisons between subgroups of PP + B and PP were also analyzed.

Efficacy

The results of efficacy comparison are in Fig. 2. Compared with PC + B, PP ± B showed a significant benefit in mPFS (HR 0.88; 95% CI, 0.78 to 0.99; P = 0.04) and PFSR1y (RR 0.83; 95% CI, 0.74 to 0.93; P = 0.001), no significant differences were seen in mOS (HR 1.01; 95% CI, 0.89 to 1.14; P = 0.863), and ORR (RR 1.02; 95% CI, 0.92 to 1.15; P = 0.675) between the two groups. We also calculated pooled mPFS and mOS using a weighted average of single study medians because of insufficient data on 95% CI values [17]. For subgroups of PP ± B vs. PC + B, mPFS was 5.77 vs. 5.80 months and mOS was 12.16 vs. 13.04 months.

Fig. 2
figure 2

Efficacy comparison of first-line therapy between PP ± B and PC + B. a. mPFS; b. mOS; c. ORR; d. PFSR1y. Abbreviations: PP ± B, pemetrexed-platinum with or without bevacizumab; PC + B, paclitaxel-carboplatin with bevacizumab; mPFS, median progression-free survival; mOS, median overall survival; ORR, objective response rates; PFSR1y, 1-year PFS rate

In the subgroup analysis, compared with PC + B group, a PP + B group showed improved mPFS (HR 0.83; 95% CI, 0.71 to 0.97) and PFSR1y (RR 0.77; 95% CI, 0.68 to 0.89) (all P < 0.05), but no significant difference in ORR and mOS was observed between the two groups. A PP subgroup showed no advantage compared with a PC + B group for any parameter. Indirect comparisons found no significant differences between PP + B and PP in mPFS (P = 0.36), PFSR1y (P = 0.11), mOS (P = 0.83), or ORR (P = 0.41).

Safety

The most common grade ≥ 3 TRAEs were hematologic toxicities, hypertension, and sensory neuropathy. Compared with PC + B, PP ± B had a significantly higher risk of anemia (RR 1.75; 95% CI, 1.58 to 1.95; P < 0.001) and thrombocytopenia (RR 1.70; 95% CI, 1.47 to 1.96; P < 0.001), but a significantly lower risk of neutropenia (RR 0.67; 95% CI, 0.59 to 0.77; P = 0.000), febrile neutropenia (RR 0.47; 95% CI, 0.25 to 0.90; P = 0.023), and sensory neuropathy (RR 0.21; 95% CI, 0.06 to 0.76; P = 0.017). No significant differences were seen in hypertension (P = 0.117) or drug-related death (P = 0.491) between the two groups (Table 3).

Table 3 Summary of forest plot for TRAEs (PP ± B vs. PC + B)

Comparisons of maintenance treatment between Pem + B, Pem and B

Three RCTs including 1726 patients were used to compare the efficacy and safety of Pem + B and B maintenance [9, 11, 12]. Two RCTs used a continuation maintenance regimen in the study design [9, 11], and one study used continuation and switch maintenance regimens [12]. Indirect comparisons between Pem + B versus Pem maintenance and between Pem versus B maintenance were also analyzed.

Efficacy

The results of efficacy comparison are in Fig. 3. Compared with B alone maintenance, Pem + B maintenance showed significant benefit in mPFS (HR 0.64; 95% CI, 0.57 to 0.72; P < 0.001), PFSR1y (RR 0.70; 95% CI, 0.63 to 0.77; P < 0.001), and mOS (HR 0.88; 95% CI, 0.78 to 1.00; P = 0.05). The mPFS and mOS (calculated using a weighted average of the single study medians) in subgroups Pem + B vs. B were 6.73 vs. 4.03 months and 19.39 vs. 16.36 months, respectively [17]. In subgroup analysis, compared with B maintenance, neither Pem + B continuation maintenance nor Pem + B switch maintenance showed obvious differences in mOS.

Fig. 3
figure 3

Efficacy comparison of maintenance therapy between Pem + B and B. a. mPFS; b. mOS; c. ORR. Abbreviations: Pem + B, pemetrexed and bevacizumab; B, bevacizumab; mPFS, median progression-free survival; mOS, median overall survival; ORR, objective response rates

Indirect comparisons showed that mPFS (P = 0.024) and PFSR1y (odds ratio [OR] 0.57; 95% CI, 0.34 to 0.95; P = 0.03) were significantly improved in a Pem + B maintenance group compared with a Pem maintenance group, but with no significant difference in mOS between the two groups (P = 0.855). Pem maintenance showed no benefit compared with B maintenance through indirect comparison of PFSR1y (OR 1.22; 95% CI, 0.76 to 1.95; P = 0.41).

Safety

The most common grade ≥ 3 TRAEs were hematologic toxicities and hypertension. The risk of anemia (RR 1.75; 95% CI, 1.46 to 2.09; P < 0.001), neutropenia (RR 1.95; 95% CI, 1.80 to 2.12; P < 0.001), or thrombocytopenia (RR 1.88; 95% CI, 1.55 to 2.28; P < 0.001) were significantly higher in a Pem + B maintenance group than in a B alone maintenance group. No significant difference was observed in hypertension (P = 0.864) between the two groups (Table 4).

Table 4 Summary of forest plot for grade 3/4 TRAEs (Pem + B vs. B maintenance)

Quality of included studies and publication bias

The risk of bias assessment of the included RCTs was low and is shown in Table 5; all studies were of high quality. To minimize publication bias, we executed strict inclusion criteria for selected papers and detected publication bias by several methods. No substantial asymmetry was found by visual inspection of the funnel plots. An Egger linear regression test and Begg rank correlation test also found no evidence of publication bias. Sensitivity analyses were conducted on PFS and OS to assess the heterogeneity in the first-line and maintenance phases. No significant heterogeneity in PFS or OS from any study was found.

Table 5 Evaluation of risk of bias in the included studies

Discussion

Chemotherapy combined with immunotherapy has become the current standard care for patients with negative oncogenic drivers regardless of squamous or non-squamous NSCLC or programmed death ligand 1 (PD-L1) expression level [18]. However, chemotherapy plus bevacizumab is still an important first-line treatment option for patients with oncogenic driver negative NS-NSCLC who cannot tolerate immunotherapy, and is also a subsequent treatment for patients with oncogenic driver positive NS-NSCLC whose disease progressed on prior targeted therapy. Our study enhances understanding of the rational option of first-line chemotherapy regimens in combination with bevacizumab and the subsequent optimal maintenance therapy for these advanced NS-NSCLC cases. In view of the inclusion of the latest large-sample RCTs and strict inclusion criteria, this meta-analysis thus answers some controversial questions that have not been solved in previous studies.

One question is which first-line chemotherapy regimen (pemetrexed- versus paclitaxel-based) is a better choice when used in combination with bevacizumab. Bevacizumab combined with platinum-based doublet chemotherapy shows clinical benefits for advanced NS-NSCLC in multiple RCTs, with mPFS of 6.2–9.2 months and mOS of 12.3–24.3 months [5,6,7,8]. A meta-analysis showed comparable efficacy for taxane and non-taxane regimens in combination with bevacizumab for treatment of patients with NS-NSCLC. For taxane and non-taxane groups, respective weighted mOS was 14.4 and 13.7 months (P = 0.5), mPFS was 6.93 and 6.99 months (P = 0.61), and ORR was 41 and 39% (P = 0.65) [19]. Our meta-analysis found that PP ± B had a significant benefit for PFS and PFSR1y, but no difference in OS and ORR between PP ± B and PC + B. For subgroup comparisons with PC + B, PP + B had significant benefits for PFS and PFSR1y, but not OS. The negative OS outcome may be attributed to the subsequent maintenance treatment options. Among three studies included for comparison of first-line treatments, PRONOUNCE and ERACLE studies used Pem alone as maintenance therapy; only the PointBreak study used Pem + B maintenance [14,15,16]. In our meta-analysis, the two groups had different grade 3/4 toxicity profiles. In the PP ± B group, the risk of severe anemia was 1.75 times and the risk of thrombocytopenia was 1.7 times that in the PC + B group. In the PC + B group, the risk of severe sensory neuropathy was 4.76 times and the risk of febrile neutropenia was 2.13 times that in the PP ± B group (Table 3). Since we saw no significant difference in OS between the two groups, the tolerance of patients to different drug toxicities should be considered when choosing first-line chemotherapies. That is, the choice of first-line chemotherapy mainly depends on differences in toxicity profiles.

The second question is which maintenance therapy (B versus Pem + B) is preferred. Maintenance therapy has emerged as a confirmed treatment strategy for advanced NSCLC. For NS-NSCLC patients, Pem + B in combination or as a single drug as a maintenance therapy is shown to be beneficial for survival [9, 10, 14, 20]. Even though Pem + B showed significant benefits in PFS compared to monotherapy B maintenance, four previous studies did not recommend Pem + B as a standard maintenance regimen because of the lack of OS benefits and higher toxicity [9, 11, 12, 14]. Combining two recent RCTs [11, 12], our meta-analysis showed not only an improvement in PFS with Pem + B maintenance, but also a benefit in OS (P = 0.05), and the latter was demonstrated for the first time. The PointBreak study showed a longer OS for Pem + B maintenance than B alone, but that trial could not be included in our meta-analysis, because the timepoint after random assignment was different from those in the other trials [14]. Although the addition of pemetrexed to bevacizumab as a maintenance therapy (Pem + B) can moderately improve survival, we still need to be cautious, as doublet maintenance leads to more toxicities, especially hematological toxicity. In our meta-analysis, the risk of grade 3/4 TRAEs including anemia, thrombocytopenia, and neutropenia were all significantly higher in the Pem + B groups. This may lead to a prolonged treatment interval, poor compliance with maintenance treatment, or even drug-related termination or death. Therefore, we recommend that only patients with NS-NSCLC with controlled disease after 4 to 6 cycles of PP + B induction therapy who have not experienced intolerable toxicity receive Pem + B continuation maintenance therapy whenever possible.

The third question is whether bevacizumab should be added to a PP regimen. Pemetrexed combined with platinum is the preferred frontline chemotherapy for patients with NS-NSCLC in National Comprehensive Cancer Network (NCCN) guidelines [21]. Efficacy of PP + B has been observed in some trials [9, 11, 14, 20], but no direct prospective comparison has been made between PP + B and PP. However, designing prospective trials comparing PP + B and PP seems increasingly infeasible. In both the PRONOUNCE and ERACLE study designs, bevacizumab was added to the PC regimen, but not to the PP regimen. Nevertheless, no significant difference in PFS or OS was observed between PP and PC + B [15, 16]. Our meta-analysis indicated that PP + B significantly prolonged PFS, as compared to PC + B, but no significant differences were seen in any survival data between PP + B and PP by indirect comparisons. However, the strength of the evidence to clarify this issue remains limited.

Currently, pembrolizumab in combination with chemotherapy is the preferred first-line regimen according to NCCN guidelines for patients with oncogenic driver negative NS-NSCLC and without contraindications to PD-1/PD-L1 inhibitors, regardless of PD-L1 expression level. Atezolizumab in combination with chemotherapy and bevacizumab is the other recommended regimen [22]. Interestingly, the chemotherapies in these two regimens differ (carboplatin/cisplatin+pemetrexed, and carboplatin+paclitaxel, respectively). In the future, we should focus on whether bevacizumab is a good partner to combine with chemotherapy and anti-PD-1 immunotherapy (e.g., pembrolizumab) for both first-line and maintenance treatment.

In our meta-analysis, we strictly limited the inclusion criteria to RCTs. However, summary statistics rather than individual patient data were used for each trial, and the studies included were heterogeneous, with varying patient populations and different study designs. For example, EGFR-sensitizing mutation populations were excluded in the COMPASS trial, but not mentioned in the other five trials. This difference may lead to different subsequent line regimens and survival.

Conclusions

On the basis of the answers to these three questions, we have made preliminary recommendations for first-line and maintenance treatment strategies for patients with advanced NS-NSCLC with negative drivers who cannot tolerate immunotherapy: i) PP + B as first-line therapy is as effective as PC + B in patients with advanced NS-NSCLC, and the toxicity profile of the two therapies varies. ii) Addition of pemetrexed to bevacizumab as maintenance therapy significantly improved survival, but led to more toxicity. iii) Patients’ tolerance and toxicity profiles should be considered when choosing treatment regimens. iv) Treatment with PP + B or PC + B followed by Pem + B rather than single-drug B or Pem maintenance might be the best choice under the premise of tolerable toxicity.

In addition, a phase III RCT IMpower-150 found that in a subset of EGFR/ALK-positive advanced NS-NSCLC patients whose disease progressed on prior targeted therapy, adding atezolizumab to PC + B can significantly improve the survival. Although additional studies on quadruple use focused specifically on this subgroup of patients are still needed, at least for now, the above recommendations may also be applicable for patients with positive oncogenic drivers whose disease progressed on prior targeted therapy.