Introduction

Colorectal cancer (CRC) ranks as the fourth most common cause of cancer-related deaths worldwide, resulting in nearly 900,000 fatalities annually [1]. Metastasis is observed in approximately 20–30% of patients upon colorectal cancer diagnosis [2], and about 10–25% of patients develop metachronous metastasis following treatment-oriented surgery [3]. Metastatic colorectal cancer (mCRC) carries a grim prognosis, with a 5-year survival rate of less than 20% [4]. While surgery and local ablation can address some isolated metastases, there is no cure for the majority of CRC patients with multiple systemic metastases, necessitating palliative systemic treatment to extend overall survival. Presently, the National Comprehensive Cancer Network (NCCN) recommends various first-line systemic treatment regimens for mCRC patients [5]. These regimens primarily consist of dual and triple chemotherapy protocols based on 5-fluorouracil (5-FU) or capecitabine in combination with other cytotoxic agents (oxaliplatin or irinotecan), often with the addition of targeted drugs. However, many of these treatments lack direct comparative clinical trials, and some remain controversial in terms of safety. One meta-analysis indicates that FOLFOXIRI (5-fluorouracil plus leucovorin plus oxaliplatin plus irinotecan) + bevacizumab may increase the incidence of neutropenia and diarrhea compared to dual chemotherapy regimens with or without targeted drugs [6], while two other meta-analyses suggest that this regimen does not elevate treatment toxicity [7, 8]. The presence of toxic side effects restricts the clinical application of chemotherapy regimens and targeted drugs, such as hypertension and proteinuria induced by bevacizumab [9, 10], skin toxicity induced by Panitumumab and Cetuximab [11, 12], and hypomagnesemia related to panitumumab [13]. The safety of treatment protocols significantly impacts patients’ prognosis and quality of life. Reducing drug toxicity, mitigating the risk of adverse reactions, and enhancing patients’ quality of life necessitate ongoing exploration and practice by medical professionals.

Network meta-analysis (NMA) is frequently employed to compare differences among multiple treatment regimens, combining direct and indirect comparisons to offer crucial references for clinical drug usage guidance [14]. In prior NMA studies, more emphasis has been placed on evaluating the effectiveness of systemic therapy for colorectal cancer, with less attention directed toward assessing safety. One meta-analysis assessed the safety of 16 first-line systemic treatment regimens for mCRC patients but was limited to an overall analysis of grade 3 or higher adverse events (AEs) and did not specifically analyze the characteristics of individual AEs [8]. If clinicians have early knowledge of potential AEs and corresponding treatment measures associated with each treatment plan, they can choose the most suitable treatment method based on the specific conditions of patients. This approach is beneficial in enhancing the quality of life for patients. Therefore, by utilizing a network meta-analysis method to evaluate the safety of first-line systemic treatment for metastatic colorectal cancer, this study compares and ranks various adverse reactions to provide guidance for clinical decision-making.

Materials and methods

Design and registration

This systematic review study process followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA 2020) statement [15], and the protocol was registered with the International Prospective Registry of Systematic Reviews (PROSPERO: CRD42023462360).

Literature search

Two researchers (YR.Z, PP.M) independently searched for studies published in PubMed, Embase, Web of Science, and the Cochrane Library from the establishment of the database to August 15, 2023. The search terms were designed by combining subject words and free words. The search keywords encompassed colorectal neoplasm, colorectal cancer, metastasis, first-line chemotherapy, FOLFOX (5-fluorouracil plus leucovorin plus oxaliplatin), FOLFIRI (5-fluorouracil plus leucovorin plus irinotecan), CAPOX (capecitabine plus oxaliplatin), FOLFOXIRI, 5-fluorouracil, capecitabine, bevacizumab, cetuximab, panitumumab, and randomization. Specific search strategies are presented in supplementary materials (Appendix 1).

Inclusion and exclusion criteria

The inclusion criteria were as follows:

  1. (a)

    Population: Patients with pathologically confirmed colorectal cancer and distant metastases who have not received any treatment after diagnosis;

  2. (b)

    Interventions: Systemic therapy as first-line treatment for patients with mCRC;

  3. (c)

    Outcomes: Grade ≥ 3 AEs or toxic reactions (according to the Common Toxicity Criteria of the American National Cancer Institute) ;

  4. (d)

    Study type: Randomized controlled trial (RCT) with sample size ≥ 30;

  5. (e)

    Language: English.

Exclusion criteria:

  1. (a)

    Participants: Colorectal cancer patients without metastasis;

  2. (b)

    Interventions: Non-first line systemic treatment regimens (maintenance therapy, radiation therapy, second- or third-line therapy, etc.) ;

  3. (c)

    Outcomes: No relevant outcome indicators or data could not be extracted;

  4. (d)

    Non-RCT trials;

  5. (e)

    Duplicate studies or secondary analyses, in vitro experiments, animal experiments, reviews, letters, guidelines, case reports, pathological mechanisms, conference abstracts, reviews, or systematic reviews.

Literature screening and data extraction

Two researchers (YR.Z, PP.M) independently screened all articles identified by the database search. First, EndNote X 9.0 software was used to remove duplicate literature and exclude case reports, conference abstracts, letters, and review articles. Then, the titles and abstracts of the remaining literature were screened to exclude studies that were not relevant to the topic, and the full texts and supplementary information of the remaining studies were reviewed to identify eligible studies. Finally, data were extracted according to a unified extraction table, which included information such as the author, publication year, study design, study country, study period, number of study sites, follow-up time, total number of samples, and the number of AEs. The two researchers (YR.Z, PP.M) independently cross-checked all included papers and extracted data. Any disputed articles were referred to a third researcher (HMF) for consensus resolution.

Quality assessment

Two independent reviewers (YR.Z and PP.M) assessed the risk of bias using Version 2 of the Cochrane Risk-of-Bias Tool for randomized trials (RoB 2.0) (https://doi.org/10.1136/bmj.l4898). Each study was categorized as “low risk of bias,” “some concerns,” or “high risk of bias” across the following domains: bias arising from the randomization process; bias due to deviations from the intended intervention; bias from missing outcome data; bias in the measurement of outcomes; and bias in the selection of the reported results, including deviations from the registered protocol. We rated trials as having an overall high risk of bias if one or more domains were rated “high risk of bias” and as having an overall low risk of bias if all domains were rated “low risk of bias.”

Statistical analysis

Bayesian random-effects (BRE) models were employed to analyze the effects of interventions and compare their safety. Risk ratio (RR) and its 95% confidence interval (CrI) were used to combine the incidence of AEs, with a 95% CrI excluding 1 representing a statistically significant difference. The Markov chain Monte Carlo method was adopted for modeling, with four Markov chains running simultaneously. The annealing times were set to 20,000, and the modeling was completed after 50,000 simulation iterations [16]. The deviance information criterion (DIC) was used to compare model fit and global consistency. With there was a closed loop in the network, local consistency was analyzed using the node-splitting method [17]. Additionally, interventions were ranked based on the surface under the cumulative rank curve (SUCRA), and league tables were generated to compare differences in effects among interventions [18]. A funnel plot was used to intuitively reflect the heterogeneity among the studies. Analyses were conducted using Stata 15.0 (Stata Corporation, College Station, TX) and R 4.3.1 (R Development Core Team, Vienna, http://www.R-project.org). A difference with a p-value of less than 0.05 was considered statistically significant.

Results

Literature search results

In the initial literature search, 36,127 articles were retrieved, and 11,616 duplicate studies were excluded. Following the inclusion and exclusion criteria, the article titles and abstracts were carefully examined, leading to the exclusion of 24,283 studies, while 228 potential related studies were initially included. After analyzing the full texts and supplementary materials, 53 RCT studies [19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71] were ultimately included for data extraction and statistical analysis. The flowchart of the literature retrieval and screening is presented in Fig. 1.

Fig. 1
figure 1

PRISMA flowchart

Basic characteristics of the included literature

In 53 Phase II/III randomized controlled trials, which included 17,351 mCRC patients, 29 different treatment schemes were used. Among these schemes, more patients received FOLFOX/FOLFIRI + Bevacizumab and CAPOX (Table 1 and Supplementary Table S1). These studies were published between 2000 and 2023. Only two were single-center studies [23, 24], while the rest were multicenter studies.

Table 1 Basic characteristics of included studies

Quality evaluation

42 studies mentioned correct randomization methods, while 11 studies had unclear randomization methods. Among them, 20 studies reported well-established allocation concealment schemes and were rated as low risk; 31 studies had unclear allocation schemes and were categorized as an unknown risk, and 2 studies did not use allocation concealment schemes, resulting in a high risk. In addition, 25 studies did not mention blinding, and 18 studies did not employ a method to establish blinding for practitioners and patients. Lastly, 50 studies did not deviate from established interventions, and 3 studies had an unknown bias. All study outcomes were measured appropriately, and the outcome data were complete. Furthermore, all outcome measures mentioned in the study design were reported (Supplementary Figures S1 and S2).

Network meta-analysis of all AEs

Grade ≥ 3 AEs

Seventeen studies compared the incidence of grade 3 or higher AEs across 15 treatment regimens (Fig. 2A). The RR was 0.38 (95% CrI, 0.16–0.85) for the FULV (5-fluorouracil plus leucovorin) vs. FULV + bevacizumab regimen and 0.27 (95% CrI, 0.01–0.85) for CAPOX vs. FOLFOXIRI + bevacizumab regimen, indicating that the FULV and CAPOX regimens had a lower risk of grade 3 or higher AEs, as shown in Fig. 2B. The top three safety regimens based on SUCRA values were FULV (89.76%), CAPOX (85.29%), and FOLFIRI (76.34%), while the least safe regimen was FOLFOX + panitumumab (19.91%), as illustrated in Supplementary Table S2.

Fig. 2
figure 2

(A) Network plot of Grade ≥ 3 any adverse events. (B) risk ratios (95%CIs) of Grade ≥ 3 any adverse events

Death related to AEs

Eighteen studies compared the risk of death associated with AEs among 14 treatment regimens (Fig. 3A). Based on the safety ranking determined by the SUCRA value (where a higher SUCRA value indicates a lower incidence of AEs and a safer treatment regimen), the top three treatment plans were FOLFIRI + cetuximab (81.72%), FULV + bevacizumab (72.52%), and FOLFOX (68.53%). These results are presented in Supplementary Table S2. However, the league table results indicated that there was no statistically significant difference between these regimens (Fig. 3B).

Fig. 3
figure 3

(A) Network plot of deaths related to adverse events. (B) risk ratios (95%CIs) of deaths related to adverse events

Network meta-analysis on specific AEs

Hematological AEs

Neutropenia

Forty-four studies compared the incidence of neutropenia among 27 treatment regimens (Fig. 4A). The RR was 0.01 (95% CrI, 0–0.26) for CAPOX + cetuximab vs. FOLFOXIRI + cetuximab, 0.09 (95% CrI, 0.01–0.79) for capecitabine vs. CAPIRI (capecitabine plus irinotecan), and 0.1 (95% CrI, 0.05, 0.17) for CAPOX vs. FOLFOX. These results indicate that CAPOX + cetuximab, capecitabine, and CAPOX regimens had a lower risk of neutropenia, as depicted in Supplementary Figure S3. The top three regimens for safety based on SUCRA values were CAPOX + cetuximab (94.08%), capecitabine (89.93%), and CAPOX (83.63%). The least safe regimen was FOLFOXIRI + bevacizumab (6.55%), as shown in Table 2.

Fig. 4
figure 4

Network plot of hematological adverse events. (A) Neutropenia; (B) Leukopenia; (C) Anemia; (D)Thrombocytopenia. Bmab, bevacizumab; Cap, capecitabine; Cet, cetuximab; Oxa, oxaliplatin; Pmab, panitumumab; FU,5-fluorouracil; LV, leucovorin; Iri, irinotecan; CAPOX, capecitabine plus oxaliplatin; CAPIRI, capecitabine plus irinotecan; FOLFOX, 5-fluorouracil plus leucovorin plus oxaliplatin; FOLFOXIRI, 5-fluorouracil plus leucovorin plus oxaliplatin plus irinotecan; FOLFIRI, 5-fluorouracil plus leucovorin plus irinotecan; FUIRI, 5-fluorouracil plus irinotecan; IROX, irinotecan plus oxaliplatin. Each node represented a different treatment and its size depended on the number of patients that is directly examined. The nodes were joined by lines with different thickness which shows whether there was a direct relationship between treatments and the thickness was weighted according to the available direct evidence between them

Table 2 SUCRA value of specific adverse events
Febrile neutropenia

Twenty-seven studies compared the incidence of febrile neutropenia among 19 treatment regimens (Supplementary Figure S4A). The RR was 0.15 (95% CrI, 0.05–0.36) for CAPOX vs. FOLFOXIRI, 28.45 (95% CrI, 1.85–437.2) for FOLFIRI vs. S-1 + oxaliplatin, and 0.11 (95% CrI, 0.02–0.41) for CAPOX + bevacizumab vs. FOLFOXIRI + bevacizumab, indicating a lower risk of febrile neutropenia for CAPOX, S-1 + oxaliplatin, and CAPOX + bevacizumab regimens, as shown in Supplementary Figure S5. The top three regimens for safety, based on SUCRA values, were CAPOX (91%), S-1 + oxaliplatin (87.86%), and CAPOX + bevacizumab (84.48%). The least safe regimen was FOLFIRI + cetuximab (13.54%), as indicated in Table 2.

Leukopenia

Eighteen studies compared the incidence of leukopenia among 19 treatment regimens (Fig. 4B). The RR was 0.01 (95% CrI, 0–0.68) for CAPOX + cetuximab vs. FOLFOXIRI + cetuximab, and 0.05 (95% CrI, 0–0.85) for CAPOX + bevacizumab vs. FOLFIRI + bevacizumab, indicating that CAPOX + cetuximab and CAPOX + bevacizumab had a lower risk of leukopenia, as depicted in Supplementary Figure S6. The top three regimens for safety, based on SUCRA values, were CAPOX + cetuximab (90.41%), CAPIRI + cetuximab (81.34%), and CAPOX + bevacizumab (80.12%). The least safe regimen was IROX (14.49%, irinotecan plus oxaliplatin), as shown in Table 2.

Anemia

Thirty studies compared the incidence of anemia among 22 treatment regimens (Fig. 4C). The top three regimens for safety, based on SUCRA values, were CAPOX + cetuximab (84.12%), CAPOX + bevacizumab (77.94%), and 5-FU + oxaliplatin (72.45%). The least safe regimen was FOLFOXIRI (16.05%), as indicated in Table 2. However, the league table results indicated that there was no statistical difference between the protocols (Supplementary Figure S7).

Thrombocytopenia

Thirty-four studies compared the incidence of thrombocytopenia among 25 treatment regimens (Fig. 4D). The RR was 0.04 (95% CrI, 0–0.39) for CAPIRI + bevacizumab vs. CAPOX + bevacizumab, and 0.05 (95% CrI, 0–0.84) for FOLFIRI + bevacizumab vs. FOLFOXIRI + bevacizumab, indicating that CAPIRI + bevacizumab and FOLFIRI + bevacizumab had a lower risk of thrombocytopenia, as shown in Supplementary Figure S8. The top three regimens for safety, based on SUCRA values, were CAPIRI + bevacizumab (84.43%), FOLFIRI + bevacizumab (79.98%), and CAPIRI + cetuximab (78.26%). The least safe regimen was S-1 + oxaliplatin (15.73%), as indicated in Table 2.

Gastrointestinal AEs

Diarrhea

Fifty-one studies compared the incidence of diarrhea among 29 treatment regimens (Fig. 5A). The RR was 0.15 (95% CrI, 0.02–0.95) for FOLFOX + bevacizumab vs. FOLFOXIRI regimen, 20.1 (95% CrI, 1.42–279.88) for CAPIRI vs. FULV + bevacizumab regimen, and 5.19 (95% CrI, 1.01–27.99) for CAPOX vs. FULV regimen. These results indicate that FOLFOX + bevacizumab, FULV + bevacizumab, and FULV regimens had a lower risk of diarrhea, as shown in Supplementary Figure S9. The top three safety regimens, based on SUCRA values, were FOLFOX + bevacizumab (83.63%), FULV + bevacizumab (82.64%), and FULV (82.28%). The least safe regimen was CAPIRI (16.85%), as illustrated in Table 2.

Fig. 5
figure 5

Network plot of Digestive adverse events. (A) Diarrhea; (B) Nausea; (C) Vomiting; (D) Mucositis/stomatitis. Bmab, bevacizumab. Cap, capecitabine; Cet, cetuximab; Oxa, oxaliplatin; Pmab, panitumumab; FU,5-fluorouracil; LV, leucovorin; Iri, irinotecan; CAPOX, capecitabine plus oxaliplatin; CAPIRI, capecitabine plus irinotecan; FOLFOX, 5-fluorouracil plus leucovorin plus oxaliplatin; FOLFOXIRI, 5-fluorouracil plus leucovorin plus oxaliplatin plus irinotecan; FOLFIRI, 5-fluorouracil plus leucovorin plus irinotecan; FUIRI, 5-fluorouracil plus irinotecan; IROX, irinotecan plus oxaliplatin. Each node represented a different treatment and its size depended on the number of patients that is directly examined. The nodes were joined by lines with different thickness which shows whether there was a direct relationship between treatments and the thickness was weighted according to the available direct evidence between them

Nausea

Thirty-one studies compared the incidence of nausea among 24 treatment regimens (Fig. 5B). The RR was 0.02 (95% CrI, 0–0.47) for FOLFIRI + cetuximab vs. FOLFOXIRI + cetuximab regimen, 0.03 (95% CrI, 0–0.48) for CAPOX vs. FOLFOXIRI regimen, and 39.22 (95% CrI, 1.54–2529.6) for FOLFOXIRI vs. S-1 + oxaliplatin. These results indicated that FOLFIRI + cetuximab, CAPOX, and S-1 + oxaliplatin regimens had a lower risk of nausea, as shown in Supplementary Figure S10. The top three safety regimens, based on SUCRA values, were FOLFIRI + cetuximab (85.81%), CAPOX (79.95%), and S-1 + oxaliplatin (78.5%). The least safe regimen was S-1 + oxaliplatin + LV (2.59%), as illustrated in Table 2.

Vomiting

Thirty-one studies compared the incidence of vomiting among 25 treatment regimens (Fig. 5C). The RR was 42.05 (95% CrI, 3.95–566.39) for IROX vs. S-1 + oxaliplatin, 0 (95% CrI, 0–0.51) for CAPIRI + cetuximab vs. FOLFOXIRI + cetuximab, and 0.06 (95% CrI, 0–0.85) for CAPOX vs. CAPOX + bevacizumab regimens. These results indicate that S-1 + oxaliplatin, CAPIRI + cetuximab, and CAPOX regimens had a lower risk of vomiting, as shown in Supplementary Figure S11. The top three regimens for safety, based on SUCRA values, were S-1 + oxaliplatin (90.57%), CAPIRI + cetuximab (85.73%), and CAPOX (81.54%). The least safe regimen was FOLFOXIRI + cetuximab (8.98%), as illustrated in Table 2.

Mucositis/stomatitis

Forty studies compared the incidence of mucositis or stomatitis among 28 treatment regimens (Fig. 5D). The RR was 0.06 (95% CrI, 0–0.62) for CAPOX vs. FOLFOX + cetuximab regimen and 0.14 (95% CrI, 0.03–0.55) for FOLFOX vs. FOLFOX + panitumumab. These results indicate a lower risk of mucositis or stomatitis for CAPOX and FOLFOX regimens, as shown in Supplementary Figure S12. The top three treatment regimens for safety, based on SUCRA values, were CAPOX (76.72%), Cap + bevacizumab (73.45%), and FOLFIRI + cetuximab (71.48%). The least safe regimen was FOLFOX + Cetuximab (16.71%), as illustrated in Table 2.

Anorexia

Ten studies compared the incidence of anorexia among 8 treatment regimens (Supplementary Figure S4B). The top three treatment regimens for safety, based on SUCRA values, were FOLFOX + bevacizumab (73.69%), CAPOX + bevacizumab (66.91%), and FOLFIRI + panitumumab (66.26%). The least safe regimen was S-1 + Iri + bevacizumab (15.18%), as illustrated in Table 2. However, the league table results showed that there was no statistical difference between the protocols (Supplementary Figure S13).

Neurological AEs

Peripheral sensory neuropathy

Twelve studies compared the risk of peripheral sensory neuropathy among 10 treatment regimens (Fig. 6A). The RR was 0.02 (95% CrI, 0–0.32) for CAPIRI + bevacizumab vs. CAPOX + bevacizumab, 0.02 (95% CrI, 0–0.16) for FOLFIRI + bevacizumab vs. FOLFOX + bevacizumab, and 0.04 (95% CrI, 0–0.38) for FOLFOXIRI + bevacizumab. These results indicate a lower risk of peripheral sensory neuropathy for CAPIRI + bevacizumab and FOLFIRI + bevacizumab regimens, as shown in Supplementary Figure S14. The top three treatment regimens for safety, based on SUCRA values, were CAPIRI + bevacizumab (88.87%), FOLFIRI + bevacizumab (86.12%), and FOLFIRI + cetuximab (82.62%). The lowest-ranked regimen was S-1 + oxaliplatin + LV (2.98%), as illustrated in Table 2.

Fig. 6
figure 6

Network plot of Neurological adverse events. (A) Peripheral sensory neuropathy; (B) Fatigue; (C) Hypertension; (D) Thromboembolic events. Bmab, bevacizumab. Cap, capecitabine; Cet, cetuximab; Oxa, oxaliplatin; Pmab, panitumumab; FU,5-fluorouracil; LV, leucovorin; Iri, irinotecan; CAPOX, capecitabine plus oxaliplatin; CAPIRI, capecitabine plus irinotecan; FOLFOX, 5-fluorouracil plus leucovorin plus oxaliplatin; FOLFOXIRI, 5-fluorouracil plus leucovorin plus oxaliplatin plus irinotecan; FOLFIRI, 5-fluorouracil plus leucovorin plus irinotecan; FUIRI, 5-fluorouracil plus irinotecan; IROX, irinotecan plus oxaliplatin. Each node represented a different treatment and its size depended on the number of patients that is directly examined. The nodes were joined by lines with different thickness which shows whether there was a direct relationship between treatments and the thickness was weighted according to the available direct evidence between them

Fatigue

Twenty-two studies compared the incidence of fatigue among 20 treatment regimens (Fig. 6B). CAPOX was associated with a lower risk of fatigue compared to CAPOX + Cetuximab (RR = 0.13, 95% CI: 0.02–0.83), as shown in Supplementary Figure S15. The top three regimens for safety, based on SUCRA values, were FOLFOXIRI + cetuximab (72.03%), CAPOX + bevacizumab (71.17%), and CAPOX (68.53%). The last-ranked regimen was CAPOX + Cetuximab (15.49%), as illustrated in Table 2.

Circulatory AEs

Hypertension

Nineteen studies compared the incidence of hypertension among 13 treatment regimens (Fig. 6C). Compared with FOLFOX + cetuximab (RR = 15.8, 95% CI: 2.05-436.84) and FOLFOX + panitumumab (RR = 9.65, 95% CI: 1.14, 323.19), FOLFOX + bevacizumab had a higher risk of hypertension. Compared with FOLFIRI + Bevacizumab (RR = 0.17, 95% CI: 0.06, 0.41) and FOLFIRI + cetuximab (RR = 0.16, 95% CI: 0.04, 0.66), FOLFIRI had a lower risk of hypertension. The results are shown in Supplementary Figure S16. The top three treatment regimens for safety, based on SUCRA values, were FOLFOX + cetuximab (88.06%), FOLFOX + panitumumab (80.96%), and FOLFIRI (80.54%). The last-ranked regimen was FOLFOXIRI + Bevacizumab (13.64%), as illustrated in Table 2.

Thromboembolic events

Eight studies compared the incidence of thromboembolic events among 7 treatment regimens (Fig. 6D). The top three treatment regimens for safety, based on SUCRA values, were FOLFOX (89.2%), FULV (74.78%), and FOLFOX + bevacizumab (61.62%), with the last regimen being FULV + bevacizumab (22.23%), as illustrated in Table 2. However, the league table results indicated no statistical difference between the protocols (Supplementary Figure S17).

Consistency and publication bias assessment

DIC was employed to compare the consistency model with the inconsistency model. All closed-loop models exhibited variation values of less than 5, indicating good consistency, as indicated by DIC. The local inconsistency test between direct evidence and indirect evidence revealed local inconsistency between CAPIRI + Bevacizumab and CAPOX + Bevacizumab (P = 0.01) and FOLFIRI + Bevacizumab (P = 0.01) in terms of thrombocytopenia, as shown in Supplementary Table S3. Regarding the assessment of publication bias, no evidence of publication bias was observed in the comparison-adjusted funnel plots (Supplementary Figures S18 and S19).

Discussion

In this study, we utilized the NMA method to compare and rank first-line systemic treatment regimens for mCRC patients, analyzing the specific AEs associated with each regimen. This approach aims to identify treatment regimens with high safety profiles and clarify the toxicity characteristics of each treatment regimen. Such findings are essential for offering guidance in clinical decision-making and hold significant practical values in enhancing the treatment outcomes and quality of life for mCRC patients.

Based on the NMA results, the CAPOX regimen ranked first in the SUCRA comprehensive rankings of AEs. It may currently be the safest regimen for first-line systemic treatment of mCRC patients. However, the analysis of specific AEs revealed that the CAPOX regimen was associated with thrombocytopenia and diarrhea. Degirmencioglu et al. [72] also showed that the CAPOX regimen was inferior to the FOLFOX regimen in terms of disease progression, metastasis, and mortality in patients with colorectal cancer. There was no significant difference between the two regimens in overall survival. An observational study from India indicated that the CAPOX regimen was more effective but was prone to grade 3/4 blood and gastrointestinal toxicities such as thrombocytopenia (11.7%) and diarrhea (5.5%), consistent with our findings [73]. Additionally, CAPOX regimens were well-tolerated in febrile neutropenia, neutropenia, and leukopenia. A meta-analysis concluded that oxaliplatin-based regimens were better tolerated regarding leukopenia and febrile neutropenia compared to irinotecan-based regimens [74]. Two RCTs [50, 58] also demonstrated that neutropenia and leukopenia were less likely to occur with CAPOX than FOLFOX. S-1 is an oral fluoropyrimidine chemotherapy agent designed to enhance the antitumor activity of 5-fluorouracil while reducing its severe gastrointestinal toxicity, including nausea, vomiting, stomatitis, and diarrhea [75]. Our study also found that S-1 or S-1 + Oxaliplatin regimens were well-tolerated in terms of gastrointestinal response.

FULV secured the second position in the SUCRA comprehensive rankings for AEs, indicating good safety. When comparing single-drug chemotherapy with combined chemotherapy or chemotherapy combined with targeted drugs, it becomes apparent that while they significantly enhance the anti-tumor effect, they also increase the occurrence of side effects. In a detailed analysis of specific AEs, both the FOLFOX and FOLFOX + bevacizumab regimens were associated with neutropenia and peripheral sensory neuropathy. Furthermore, the FOLFOX + bevacizumab regimens were linked to hypertension. FOLFOX exhibited a lower risk of hypertension compared to FOLFOX + bevacizumab (RR = 0.22 (95% CrI: 0.05–0.73). This finding was consistent with results from two RCTs [23, 60], which demonstrated that FOLFOX + bevacizumab was more likely to result in hypertension when directly compared with FOLFOX. This aligns with previous studies, which identified hypertension as the most common side effect of bevacizumab [76,77,78]. The FOLFOX + panitumumab and FOLFOX + cetuximab regimens were associated with multiple AEs. A meta-analysis assessing the safety of panitumumab in CRC patients revealed a higher frequency of grade 3/4 AEs in the panitumumab group when compared to the control group (RR = 1.17, 95% CI: 1.08–1.27, P = 0.0001) [79]. Concerning the toxic side effects of cetuximab, a separate meta-analysis suggested that cetuximab was linked to an increased risk of leukopenia, neutropenia, and anemia events in patients with colorectal cancer [80]. Another meta-analysis reported that cetuximab carried a greater risk of skin diseases compared to bevacizumab [81]. One study demonstrated a significantly higher incidence of hand and foot skin reactions (P = 0.02) in patients treated with CAPIRI + bevacizumab compared with patients treated with FOLFIRI + bevacizumab [47] .In addition, two RCTs also demonstrated that FOLFOX + cetuximab was associated with a higher incidence of grade 3 or higher AEs when compared to FOLFOX with or without bevacizumab [28, 58].

Irinotecan-based regimens, specifically FOLFIRI + bevacizumab and CAPIRI + bevacizumab, were well tolerated with regards to thrombocytopenia and peripheral sensory neuropathy. A meta-analysis indicated that irinotecan-based regimens were associated with a lower risk of peripheral sensory neuropathy and thrombocytopenia compared to oxaliplatin-based regimens [74]. Additionally, two RCTs provided evidence that CAPIRI + bevacizumab was less likely to cause peripheral sensory neuropathy and thrombocytopenia compared to CAPOX + bevacizumab [32, 42]. An RCT study demonstrated that FOLFIRI + bevacizumab was directly linked to a reduction in peripheral sensory neuropathy when compared with FOLFOX + bevacizumab [36]. According to the SUCRA ranking, the FOLFOXIRI scheme was more likely to result in grade 3 or higher AEs compared to FOLFIRI. Consistently, a meta-analysis showed that the incidence of neutropenia, anemia, diarrhea, stomatitis, and neuropathy in the FOLFOXIRI group was significantly higher than that in the FOLFIRI group [81], which aligns with the results of this study. Furthermore, three RCTs [26, 30, 40] also indicated that FOLFOXIRI + bevacizumab might increase the risk of grade 3 or higher AEs when directly compared with FOLFOX/FOLFIRI + bevacizumab.

As far as we know, this study possesses several notable advantages. Firstly, it employed network meta-analysis to directly and indirectly compare multiple interventions, addressing the limitation of traditional meta-analysis that only allows the analysis of two directly compared interventions. Secondly, this study evaluated the risk of AEs across different treatment regimens, providing a clearer understanding of the toxicity profile associated with each regimen. Thirdly, in comparison to other NMAs assessing the safety of first-line systemic treatment for mCRC [8], this study included and analyzed a more extensive range of treatment regimens and outcome indicators. It also incorporated and assessed the latest relevant studies, offering a robust reference for clinical treatment regimen selection. However, this study has acknowledged limitations. Firstly, due to the absence of head-to-head trials, some treatment options were excluded from the network for different outcomes. Secondly, our study did not analyze results according to the RAS and BRAF status or left/right status. This was primarily because among the 53 studies included, only two RCT studies [19, 22] compared treatment outcomes for left and right colon cancer, and 13 RCT studies mentioned the RAS and BRAF status. Consequently, we cannot fully evaluate all treatment options for mCRC based on RAS and BRAF status or left/right status. Thirdly, the different dosage forms of treatment regimens may affect the final assessment. Fourthly, for four outcome measures (i.e., death related to AEs, anemia, anorexia, and thromboembolic event), ranking guidance may be limited due to the absence of statistical differences between treatment regimens. Fifthly, there was local inconsistency in the indicator of thrombocytopenia. After a thorough examination and analysis of the original text, it was discovered that the number of responders in the CAPIRI + Bevacizumab regimen was frequently recorded as 0 for this indicator, potentially contributing to outcome inconsistency. Sixth, during literature screening, we only included studies where the language was English, which may have increased the bias of the results.

Conclusion

Based on SUCRA rankings, the CAPOX regimen is most likely to secure the top position in terms of safety, while the FOLFOXIRI + panitumumab regimen is most likely to rank last. In the analysis of specific AEs, The CAPOX regimen, whether combined with or without targeted drugs (bevacizumab and cetuximab), is associated with a reduced risk of neutropenia and febrile neutropenia, as well as an increased risk of thrombocytopenia and diarrhea. The FOLFOX regimen, with or without bevacizumab, is linked to an increased incidence of neutropenia and peripheral sensory neuropathy. The FOLFIRI/CAPIRI + bevacizumab regimen is connected to a decrease in the risk of peripheral sensory neuropathy. S-1 and S-1 + oxaliplatin are well tolerated with respect to gastrointestinal reactions. The FOLFOXIRI regimen, whether combined with or without targeted drugs, is associated with various AEs.

In summary, the CAPOX regimen appears to be the safest option among the first-line systemic treatment regimens for mCRC patients, while the FOLFOXIRI + panitumumab regimen may be associated with a higher incidence of grade 3 or higher AEs. More pharmacological and clinical trials are required to optimize first-line systemic regimens, enhancing efficacy and minimizing toxicity. Additionally, further well-designed, high-quality RCTs are anticipated to validate our findings.