Introduction

Although treatments for breast cancer, which is one of the most common cancers diagnosed in women [1], are continuously progressing, chemotherapy still plays a critical role. As early as 1995, a study showed that chemotherapy dose had a marked influence on the treatment effect [2], and further studies found that a sufficient dosage and course of chemotherapy could significantly improve the relapse-free survival and overall survival of patients with breast cancer [3, 4]. However, the toxicities caused by chemotherapy can obviously affect patients’ tolerance and limit the sustained relative dose intensity (RDI) of chemotherapy, which can reduce the dosage and/or the course of chemotherapy, resulting in reducing effectiveness. Among the toxicities of chemotherapy, myelosuppression is a frequent reason for dose limitation and dose reduction in patients with cancer [5]. Moreover, once severe bone marrow suppression occurs, febrile neutropenia (FN) can be life-threatening, with a high risk of mortality. Therefore, clinical researchers have been exploring effective therapeutic drugs to alleviate chemotherapy-induced neutropenia (CIN) and improve patients’ tolerance of chemotherapy.

Granulocyte colony-stimulating factor (G-CSF) is one of the many cytokines that can simulate the proliferation and differentiation of neutrophil precursors and enhance the function of mature neutrophils [6]. As a result, since 1991, the first G-CSF (filgrastim) was approved for use in patients with cancer being treated with chemotherapy [7]. G-CSF-type therapeutic agents were widely used in the clinic to reduce CIN and were accepted in different clinical guidelines for patients receiving high FN chemotherapy regimens [8, 9]. However, because of its primary clearance through the kidney, human G-CSF shows a short half-life, requiring daily administration intravenously or subcutaneously, which means patients suffer increased rates of injection site infection and display reduced tolerance [10, 11]. To extend the half-life, polyethylene glycol (PEG) was introduced into G-CSF (PEGylated G-CSF), which changed the drugs’ method of clearance, resulting in a decrease in the systemic clearance of the PEGylated form and a single dose showed efficacy comparable to that of daily injections of normal G-CSF [12, 13]. Compared with the G-CSF, PEGylated G-CSF has benefits in terms of fewer injections, better compliance, and decreased burden for healthcare professionals and patients in various tumors or non-Hodgkin’s lymphoma (NHL) [14]. However, Hendler et al. [15, 16] found that both forms of G-CSF could reduce the rate of FN and improve the safety and cost of chemotherapy but found no difference between them, especially in patients with breast cancer. Many studies have shown direct comparative data between PEGylated G-CSF and G-CSF; however, there was no agreement on whether PEGylated G-CSF was better than G-CSF in terms of safety and effectiveness in patients with breast cancer [13, 17]. Therefore, we conducted a systematic review and meta-analysis to capture the available data on the effectiveness and safety of PEGylated G-CSF and G-CSF to compare their prophylaxis of CIN and FN during chemotherapy for breast cancer.

Methods

Eligibility criteria and study selection

Eligible studies must only compare PEGylated G-CSF and G-CSF to improve CIN in patients with breast cancer and all the studies must be the randomized controlled trials (RCTs), with or without a blinding method. Only peer-reviewed articles with full text available were included. PEGylated C-CSF includes different long-acting PEG conjugates of a short-acting G-CSF, which could increase the plasma half-life. G-CSF refers to all kinds of G-CSF used in chemotherapy. Studies comparing PEGylated G-CSF and G-CSF for other types of cancers or non-RCTs were excluded. Duplicate studies and those with incomplete data were excluded, and when publications were duplicates, the latest studies were included. Studies were restricted to those in the English language and those enrolling adult patients.

Information sources and search strategy

Relevant studies were searched in the PubMed, Embase, Cochrane Library, and Web of Science databases, from the date of inception of the databases to December 2019. Two people independently searched the databases with combined Medical Subject Headings (MeSH) and unlimited words. The search words were as follows: breast cancer, breast tumor/tumor, breast carcinoma, malignant neoplasm of breast, mammary cancer/tumor/carcinoma; PEGylated granulocyte colony stimulating, PEGylated G-CSF, PEG-rhG-CSF, PEG-G-CSF, pegfilgrastim, long-acting G-CSF; granulocyte colony-stimulating factor, G-CSF, rhG-CSF, filgrastim, Neupogen, r-metHuG-CSF, Lenograstim, Zarxio, and so on; and the study types were searched using “randomized controlled trial,” “RCT,” and “randomized/randomized.” We also screened all of the cited articles and related articles and contacted the authors by email if the articles were not available in full text.

Data collection process

All relevant published data were extracted. Papers were excluded if they did not report neutropenia-related outcomes. If the study used many different doses of PEGylated G-CSF, we collected the data related to the recognized effective dose, such as 6 mg or 100 μg/kg; however, if they disclosed data for both 6 mg and 100 μg/kg doses, we preferentially acquired the data for 6 mg. Authors of eligible articles were contacted by email if verification of their data was necessary. Data were collected by two different reviewers independently; any errors were checked by a third reviewer.

Data items

All the data from the included articles were checked to prevent the inclusion of duplicates. Study characteristics that were considered important were design, chemotherapy regimen, sample size, the time of blood collection, and drug interventions (dose). The clinical outcome features were extracted; primary outcomes included incidence or/and duration of grade ≥ 3 neutropenia in cycle 1. The secondary outcomes were the incidence of FN, the time to absolute neutrophil count (ANC) recovery, and adverse events (AEs). Skeletal and/or muscle pain AEs were highlighted for analysis.

Quality assessment in individual studies

The quality of the studies was assessed by two reviewers independently. For the RCTs, the Cochrane Collaboration tool was used to assess the risk of bias [18]. Any discrepancies in interpretation were resolved by consensus. The Cochrane Collaboration tool assesses six quality items: selection bias, performance bias, detection bias, attrition bias, reporting bias, and other bias. Based on the reviewers’ judgments, every accepted article was evaluated as having a “low,” “high,” or “unclear” risk of bias.

Data analysis

Statistical analysis was performed using the Cochrane Collaboration’s RevMan software, version 5.2 (Cochrane Collaboration, Oxford, UK). For dichotomous data, the comparative differences between the two groups are shown as risk ratios (RRs). For continuous data, the outcomes of the two groups were expressed as weighted mean differences (WMDs) with 95% confidence intervals (CIs). Tests for heterogeneity were performed using Cochran’s Q statistic and the I2 test, with P < 0.05 indicating significant heterogeneity. If the test for heterogeneity was statistically significant, then a random effects model was used; otherwise, the meta-analysis was performed using a fixed effects model.

Results

Study selection and study characteristics

A flowchart detailing the process of study identification and selection following the PRISMA statement is shown in Fig. 1. There were 404 records after duplicates were removed, and after screening the titles and abstracts for relevance, 26 articles were assessed further for eligibility. Among the 26 articles, 17 articles were excluded: 7 articles were only an abstract, 1 article was not in English, 4 articles were only related to long-acting G-CSF, 3 articles included patients with other types of cancer, and 2 articles were not RCTs. Thus, nine studies [19,20,21,22,23,24,25,26,27] were ultimately included in our present systematic review. Two of the identified papers were by the same author [19, 20]; therefore, we examined the data carefully. We found that the two papers included different samples and were from different publication years; therefore, the data of both papers could be used in the analysis and both papers were accepted. The characteristics of these nine studies are summarized in Table 1, while the outcomes of the nine studies are summarized in Table 2.

Fig. 1
figure 1

Study flow diagram, inclusions, and exclusions following the PRISMA statement

Table 1 Characteristics of the studies included in meta-analysis
Table 2 The summary of studies’ outcomes

Risk of bias within the studies

The risks of bias of all studies were assessed using the Cochrane Collaboration’s tool. All the studies randomly assigned the research objects and the outcomes were not affected by the blinded method. The included RCTs had a moderate risk of bias (Fig. 2). The publication bias was evaluated using a funnel plot [28].

Fig. 2
figure 2

Risk of bias graph: judgments about the risk of bias item presented in all included studies (a) and for each included study (b)

Results of individual studies and synthesis of the results

The incidence of grade ≥ 3 neutropenia

Only five studies [21, 22, 24,25,26] provided information on the incidence of grade ≥ 3 neutropenia. When we input all the data from these studies into the RevMan software, we found that the heterogeneity among the five studies was significant (P = 0.003, I2 = 75%); therefore, sensitivity analysis of the five trials was performed. The analysis showed that the results were not substantially influenced by the heterogeneity. Therefore, we used the random effects model to analyze the final outcome, in which we did not observe a significant difference in the incidence of grade ≥ 3 neutropenia between the two groups (RR = 0.87; 95% CI 0.69 to 1.1; P = 0.26) (Fig. 3a).

Fig. 3
figure 3

Forest plot comparing the results of effectiveness between the two groups. a Incidence of grade ≥ 3 neutropenia; b duration of grade ≥ 3 neutropenia; c duration of grade 4 neutropenia; d incidence of febrile neutropenia (FN); e the time to ANC recovery

The duration of grade ≥ 3 neutropenia

Four studies [21, 22, 25, 26] provided results for the duration of grade ≥ 3 neutropenia. The heterogeneity among the studies was significant (P < 0.00001, I2 = 94%); however, none of the studies in our analysis significantly influenced the heterogeneity, so we used the random effects model to analyze these data. The outcome showed that the duration of grade ≥ 3 neutropenia between the two groups was not significantly different (WMD = − 1.04, 95% CI − 2.30 to − 0.23, P = 0.11) (Fig. 3b).

The duration of grade 4 neutropenia

Five studies [19,20,21,22, 27] reported the duration of grade 4 neutropenia. There was significant heterogeneity among the studies (P = 0.01, I2 = 69%), necessitating the use of the random effects model. The results showed that the use of PEGylated G-CSF did not result in a significantly shorter duration of grade 4 neutropenia compared with that of G-CSF (WMD = − 0.32, 95% CI 0.67 to 003, P = 0.007). When we removed Wang’s study [22] from the analysis, the heterogeneity decreased (P = 0.12, I2 = 49%); however, the data from the Wang’s data did not influence the final result (P > 0.05). Thus, although the included studies showed that the use of PEGylated G-CSF resulted a slight improvement in grade 4 neutropenia compared with that of G-CSF, the difference was not significant (Fig. 3c).

The incidence of FN

Only one study lacked data on this AE [20]; therefore, the remaining eight trials were used to compare the incidence of FN between the two groups. There was no significant heterogeneity among these studies (P = 0.44, I2 = 0%); therefore, we used the fixed effects model to analyze the results. There was no significant difference between the two groups (RR = 0.80; 95% CI 0.50 to 1.28; P = 0.35) (Fig. 3D); thus, PEGylated G-CSF had no obvious advantage in improving the FN of CIN in patients with breast cancer.

The time to ANC recovery

Five studies [19, 20, 24, 26, 27] reported information concerning the time to ANC recovery. However, the study by Holmes et al. [27] only described the median time of ANC recovery; therefore, we excluded that article from this part of the meta-analysis. The pooled analysis of the results showed no significant heterogeneity (P = 0.83, I2 = 0%). Using a fixed effects model, the pooled results showed that there was no significant difference between two groups for the time to ANC recovery (WMD = − 0.14, 95% CI − 0.50 to 0.22, P = 0.45) (Fig. 3e).

The grade 4 AEs of the two groups

Seven studies [19, 21, 22, 24,25,26,27] reported the specific numbers of grade 4 AEs. The study by Huang et al. [24] reported no grade 4 AEs; therefore, that study was not included in the analysis. Using a fixed effects model, the pooled results showed that the grade 4 AEs of the two groups were not significantly different (RR = 0.82; 95% CI 0.57 to 1.18; P = 0.29), without significant heterogeneity (P = 0.17, I2 = 36%) (Fig. 4a).

Fig. 4
figure 4

Forest plot comparing the results of safety between the two groups. a grade 4 adverse (AEs); b incidence of skeletal and/or muscle pain

Skeletal and/or muscle pain

All nine studies [19,20,21,22,23,24,25,26,27] provided information about this AE. The pooled analysis showed significant heterogeneity among the studies (P = 0.02, I2 = 56%); therefore, a random effects model was used for the analysis. The results indicated no significant difference between the two groups (RR = 0.77; 95% CI 0.46 to 1.29; P = 0.32) (Fig. 4B). Meanwhile, we also performed a sensitivity analysis of these studies and found that when we removed Xie’s study [25], the heterogeneity decreased (P = 0.10, I2 = 42%); however, the pooled results still showed that there was no difference between the groups in terms of skeletal and/or muscle pain using a fixed effects model (RR = 0.96; 95% CI 0.72 to 1.27; P = 0.76). Thus, the use of PEGylated G-CSF did not increase the incidence of skeletal and/or muscle pain significantly compared with that of G-CSF.

Publication bias

All nine studies [19,20,21,22,23,24,25,26,27] provided information about skeletal and/or muscle pain; therefore, we created a funnel plot based on this result, which showed that only one study was outside of the 95% CI, while the rest of the studies lay inside, showing a low distribution around the verticals and indicating an acceptable level of publication bias for these studies (Fig. 5).

Fig. 5
figure 5

Funnel plot illustrating the meta-analysis of skeletal and/or muscle pain

Discussion

CIN is one of the most common side effects of chemotherapy, which usually leads to chemotherapy dose reduction and/or treatment delay, which might compromise the effectiveness of chemotherapy [29]. The clinical use of G-CSF has greatly improved the safety of chemotherapy. The prophylactic use of G-CSF in chemotherapy significantly reduced the incidence of FN from 24 to 7–16% [19, 30, 31]. Breast cancer has different molecular subtypes, and chemosensitivity might be associated with dose intensity: The higher the relative dose intensity, the higher the response rate of patients with breast cancer, especially those with luminal subtypes [32, 33]. Dose-dense chemotherapy regimens in breast cancer induce a higher rate of CIN, necessitating the daily injection of G-CSF [12, 34]. The emergence of PEGylated G-CSF has greatly reduced the number of injections required, making it more convenient for the patients, especially those who live far away from the hospital. Both types of G-CSF are effective, and Cornes et al. [17] also performed a systematic review of short- versus long-acting G-CSF, but they showed that the two kinds of G-CSF had no difference of effectiveness included different types of cancer. Therefore, because the safety and efficacy of PEGylated G-CSF compared with that of G-CSF for CIN of breast cancer have not been conclusively determined, we conducted an up-to-date systematic review based on RCTs to provide the current best evidence on this topic.

Lambertini et al. [21, 35] suggested that compared with G-CSF, PEGylated G-CSF showed superior efficacy, and for high FN chemotherapy regimens, such as AT (anthracyclines-taxane), resulted in a shorter duration of CIN. However, the results of the present study revealed no significant difference in the efficacy indicators between the two drugs (P > 0.05). We did not find a significant difference for the incidence/duration of grade ≥ 3 neutropenia between the two groups (P > 0.05), and although in most of the studies the PEGylated G-CSF slightly improved grade 4 neutropenia, the difference was not significant for the duration of grade 4 neutropenia (P = 0.007). All the included studies were RCTs, in which the short-acting G-CSF was used according to recommended guidelines, requiring an average of more than six injections to support ANC recovery to > 2.0 × 109/L or up to 10 days. Our study also showed that when the short-acting G-CSF was used according to the guidelines, the use of PEGylated G-CSF did not result in a shorter time to ANC recovery (P = 0.45). However, in the articles [36,37,38] that showed evidence for greater efficacy of PEGylated G-CSF, this may have been the result of the underdosing of short-acting G-CSF in the non-RCTs, which was possibly closer to the real-world situations [17]. In the real world, though more regular visits to the hospital to receive a daily injection may be a source of better psychosocial support, which also would increase the pain of hypodermic injection and the burden to patients, resulting in missed appointments and patients not receiving a sufficient dose of G-CSF [17, 39]. And, in clinical practice, some physicians had shown that using PEGylated G-CSF could result in significant time savings and the pain decrease for patients by reducing the number of injections, which could improve the quality of life (QOL) and treatment compliance for patients [40,41,42].

FN is considered a severe complication, which could be life-threatening and is associated with a high risk of mortality. Some studies [27, 43,44,45] showed that the incidence of FN might be significantly different in cycles 2 to 4 and that PEGylated G-CSF could perform better than G-CSF in support of patients’ recovery from FN. However, our pooled results from eight studies [19, 21,22,23,24,25,26,27] showed that PEGylated G-CSF had no obvious advantage over G-CSF in improving FN of CIN in patients with breast cancer (P = 0.35). This was possible because we only included the data of the first cycle and all the patients in the control groups received sufficient doses of short-acting G-CSF. Meanwhile, we only included patients with breast cancer, who perhaps had a relatively better prognosis compared with that of other tumors.

AEs occurred in most of the included patients; however, the majority of them were related to chemotherapy and were not serious. Ashrafi reported higher numbers of AEs induced by PEGylated G-CSF [23]; however, seven studies [19, 21, 22, 24,25,26,27] reported the specific numbers of grade 4 AEs and the final result showed no significant difference between the two groups for grade 4 AEs (P = 0.29), suggesting that PEGylated G-CSF is just as safe as G-CSF. None of the articles reported unexpected AEs related to either drug, such as spleen rupture. The most common adverse reaction induced by PEGylated G-CSF is skeletal and/or muscle pain [23, 46]. However, the results of the present study showed that PEGylated G-CSF did not significantly increase the incidence of skeletal and/or muscle pain (P = 0.32), and the articles that did report this AE noted only mild to moderate, rather than unbearable, pain. Xu et al. [21] also found that a fixed dosage of PEGylated G-CSF did not influence the effectiveness or increase the related AEs in over- or underweight patients. Meanwhile, Shi et al. [47] found that PEGylated G-CSF did not increase the AEs related to different chemotherapy. Thus, we concluded that PEGylated G-CSF was as safe as G-CSF to treat CIN in patients with breast cancer but did not show an obvious superiority.

Several limitations were associated with the included randomized studies. First, the main limitation was that the included studies were all RCTS with a relatively small size and lacked real-world data, which might have decreased the reliability and could cause bias. Second, different studies used different outcome measurements with different blood collection times, and some FN events might not have been captured accurately because some studies had not made a distinction between FN and FN-related endpoints. Third, PEGylated G-CSF was delivered at two recommended doses and as different types (original or biosimilar), and the included studies did not use the same dose and type of PEGylated G-CSF, although the studies had confirmed that the two doses and types of PEGylated G-CSF were similarly effective [14, 19, 21, 22, 25, 48], which could have led to bias.

In conclusion, when short-acting G-CSF is used at a sufficient dose according to the recommended guidelines, PEGylated G-CSF did not show an obvious advantage over G-CSF in the chemotherapy of patients with breast cancer. However, PEGylated G-CSF facilitates out-patient chemotherapy, improves the patients’ QOL by decreasing the numbers of injections, and reduces the number of hospital visits, especially in the time of COVID-19 pandemic, which could decrease the incidence of COVID-19 infection. In addition, PEGylated G-CSF did not increase the incidence of AEs. Thus, for the chemotherapy of breast cancer, PEGylated G-CSF has no obvious superiority in terms of effectiveness and safety over G-CSF, whereas, in actual clinical practice, it is more convenient.