Introduction

Gastric cancer is the fifth most common cancer and the third leading cause of cancer death worldwide [1,2,3]. Although its incidence and mortality have declined steadily in recent years, an estimated 1,000,000 patients were newly diagnosed and more than 783,000 patients died from gastric cancer in 2018 [1]. More worryingly, this trend has shown signs of change: a recent study demonstrated that rising rates of gastric cancer among people younger than 50 years might reverse the overall decline in incidence [4, 5].

Open gastrectomy (OG) has long been the mainstay of curative treatment for gastric cancer. In 1994, Kitano first described the efficacy of laparoscopic gastrectomy (LG) for early-stage carcinoma in the antrum of the stomach [6]. Since then, LG for gastric cancer has developed rapidly and gained popularity over the past decades because of its minimal invasiveness, lower blood loss, shorter duration of analgesic use, and quicker recovery [7,8,9,10]. Another benefit of laparoscopic surgery is the magnified view of the surgical field, which could help surgeons dissect lymph nodes more meticulously, a step that is important to the patient’s prognosis [11]. However, previous studies showed that fewer lymph nodes were harvested from gastric cancer patients during LG than during OG [12, 13]. In addition, as with all laparoscopic procedures, port-site metastases and seeding during LG were considered inevitable because of intra-abdominal hyperpressure and adherence to laparoscopic instruments [14,15,16,17]. Moreover, although some studies have compared secondary outcomes between the LG and OG groups, the lack of long-term oncological outcomes such as recurrence and mortality hinders full support for LG as a valid procedure [18,19,20]. Therefore, whether LG is superior to OG for gastric cancer patients is still debated.

The aim of this meta-analysis was to identify and analyze randomized controlled trials (RCTs) in order to compare the primary and secondary outcomes of LG versus OG. Subgroup analyses were conducted on the primary outcomes, which are key surgical and prognostic outcomes and may be influenced by tumor stage and gastrectomy type. Sensitivity analysis was performed to validate the stability of the conclusions under different effect models.

Methods

Search strategy

Two authors independently searched PubMed, Embase, the Cochrane Library, WANFANG, and the China National Knowledge Infrastructure (CNKI) up to Nov. 25, 2018. The following combined search terms were used: (“Abdominal neoplasms” OR “Intestinal neoplasms” OR “Stomach neoplasms”) AND “Laparoscopy” AND “Gastrectomy” AND “Clinical trials” [21]. Details of the search strategies can be found in Additional file 1: Table S1.

Selection criteria

Studies were selected based on the following inclusion criteria: (1) study design, RCT in English or Chinese (animal studies, observational studies, basic research, retrospective studies, case-control studies, quasi-randomized studies, case reports, and cohort studies were excluded); (2) participants, gastric cancer patients undergoing gastrectomy; (3) interventions, surgical operation comparing LG with OG; and (4) outcomes, primary outcomes and secondary outcomes. Primary outcomes were (1) number of lymph nodes harvested during surgery, (2) severe complications, (3) short-term and long-term recurrence, and (4) short-term and long-term mortality. Secondary outcomes were (5) operative time, (6) intraoperative blood loss, (7) measures of earlier postoperative recovery (analgesic administration, time to first flatus, first ambulation and first oral intake, hospital stay), (8) blood transfusion (number, quantity), and (9) total complications. If there were two or more studies from the same authors or institutions, only the study with the largest sample size was chosen. Studies were excluded if the full text of the trial was not available or if they did not fulfill the inclusion criteria.
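The rule for overlapping reports (keep only the largest study from the same authors or institution) can be sketched as follows. This is a minimal illustration, not the authors' actual workflow: the `Trial` record, the `group` key (a label a reviewer would assign to reports judged to come from the same authors or institution), and all example values are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Trial:
    label: str        # hypothetical identifier, e.g. first author and year
    group: str        # hypothetical key: same authors/institution share a group
    sample_size: int  # total number of randomized patients


def keep_largest_per_group(trials):
    """Return one trial per group: the one with the largest sample size."""
    best = {}
    for trial in trials:
        current = best.get(trial.group)
        if current is None or trial.sample_size > current.sample_size:
            best[trial.group] = trial
    return list(best.values())


# Hypothetical candidate reports (not taken from the included trials).
candidates = [
    Trial("Report A (interim)", "center-1", 120),
    Trial("Report A (final)", "center-1", 340),
    Trial("Report B", "center-2", 96),
]
selected = keep_largest_per_group(candidates)
selected_labels = sorted(t.label for t in selected)
```

In practice the grouping step is a reviewer judgment; the code only makes the tie-breaking criterion explicit.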

Data extraction and quality assessment

The records from the initial search were screened by two authors to exclude duplicate and irrelevant studies. The following data were extracted: first author, publication date, country of origin, study period, tumor stage, gastrectomy type, lymph-node dissection, number of OG and LG cases, characteristics of the study population (including sex and age), follow-up, and primary and secondary outcomes (number of lymph nodes harvested during surgery, severe complications, recurrence, and mortality; operative time, blood loss, indicators of earlier postoperative recovery [analgesic administration, first flatus, first ambulation, oral intake, hospital stay], blood transfusion [number, quantity], and total complications). Any discrepancies were resolved by discussion. Study quality was assessed using an adaptation of the criteria in the Cochrane Handbook for Systematic Reviews of Interventions, covering the following domains: random sequence generation, allocation concealment, blinding of participants and personnel, blinding of outcome assessment, incomplete outcome data, selective reporting, and other bias.

Statistical analysis

I2 and the P value were used to evaluate statistical heterogeneity. A fixed-effects model was adopted when there was no significant heterogeneity (I2 ≤ 50% and P ≥ 0.1), while a random-effects model was employed in all other instances (I2 > 50% or P < 0.1) [22,23,24]. The risk ratio (RR) with 95% confidence interval (CI) was calculated for binary outcomes, the mean difference (MD) or the standardized mean difference (SMD) with 95% CI for continuous outcomes, and the hazard ratio (HR) for time-to-event outcomes. Subgroup analyses based on tumor stage and the type of gastrectomy were performed to evaluate the primary outcomes. Sensitivity analysis was used to explore the consistency of the conclusions between fixed- and random-effects models. Publication bias was evaluated by Egger’s test. If publication bias was confirmed, Duval’s trim-and-fill method was applied to adjust for it. All statistical calculations were performed with Review Manager 5.3 (Cochrane Collaboration, Copenhagen) and STATA software (Version 12.0; STATA Corporation, College Station, TX, USA). A P value less than 0.05 was considered statistically significant.
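The model-selection rule above can be illustrated with a short sketch. It assumes generic inverse-variance pooling, Cochran's Q for the heterogeneity P value, and the DerSimonian-Laird estimator for the between-study variance; the exact estimators used by Review Manager and STATA for each outcome may differ. The function names and all input values are hypothetical.

```python
import math


def chi2_sf(x, df):
    """Upper-tail probability of a chi-square variable with integer df >= 1."""
    y = x / 2.0
    if df % 2 == 0:
        a, q = 1.0, math.exp(-y)               # Q(1, y)
    else:
        a, q = 0.5, math.erfc(math.sqrt(y))    # Q(1/2, y)
    while a < df / 2.0:
        # Recurrence Q(a + 1, y) = Q(a, y) + y^a e^(-y) / Gamma(a + 1)
        q += y ** a * math.exp(-y) / math.gamma(a + 1.0)
        a += 1.0
    return q


def pool(effects, ses):
    """Pool study effects (needs >= 2 studies) and apply the I2 / P rule."""
    w = [1.0 / se ** 2 for se in ses]          # inverse-variance weights
    sw = sum(w)
    fixed = sum(wi * yi for wi, yi in zip(w, effects)) / sw
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, effects))
    df = len(effects) - 1
    i2 = max(0.0, (q - df) / q) * 100.0 if q > 0 else 0.0
    p_het = chi2_sf(q, df)
    # DerSimonian-Laird between-study variance (an assumption of this sketch)
    c = sw - sum(wi ** 2 for wi in w) / sw
    tau2 = max(0.0, (q - df) / c)
    w_re = [1.0 / (se ** 2 + tau2) for se in ses]
    random = sum(wi * yi for wi, yi in zip(w_re, effects)) / sum(w_re)
    use_fixed = i2 <= 50.0 and p_het >= 0.1    # rule stated in the text
    estimate = fixed if use_fixed else random
    se = math.sqrt(1.0 / (sw if use_fixed else sum(w_re)))
    return {
        "model": "fixed" if use_fixed else "random",
        "estimate": estimate,
        "ci": (estimate - 1.96 * se, estimate + 1.96 * se),
        "i2": i2,
        "p_het": p_het,
    }


# Hypothetical mean differences and standard errors, for illustration only.
homogeneous = pool([-1.0, -1.0, -1.0], [0.5, 0.5, 0.5])
heterogeneous = pool([0.0, 5.0, 10.0], [0.5, 0.5, 0.5])
```

With identical study effects the rule selects the fixed-effects model; with widely scattered effects it switches to the random-effects model, whose wider confidence interval reflects the between-study variance.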

Results

Search results and studies characteristics

Our search initially yielded 5725 records, of which 1197 were excluded as duplicates. After reviewing the titles and abstracts, we excluded a further 4480 records, leaving 48 studies. Full-text review excluded another 31 studies (original data unavailable [n = 3], duplicated data [n = 8], reviews and meta-analyses [n = 11], retrospective and cohort studies [n = 4], quasi-randomized studies [n = 2], and studies not reporting our outcomes of interest [n = 3]). Finally, seventeen RCTs were included in our analysis [11, 25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40] (Fig. 1).
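The selection counts reported above can be checked with simple arithmetic. The sketch below uses only the numbers given in the text; the variable names are illustrative.

```python
# Counts as reported in the study selection paragraph.
identified = 5725
duplicates = 1197
excluded_on_title_abstract = 4480
full_text_exclusions = {
    "original data unavailable": 3,
    "duplicated data": 8,
    "review or meta-analysis": 11,
    "retrospective or cohort study": 4,
    "quasi-randomized study": 2,
    "outcomes not of interest": 3,
}

after_deduplication = identified - duplicates
full_text_assessed = after_deduplication - excluded_on_title_abstract
included = full_text_assessed - sum(full_text_exclusions.values())

print(after_deduplication, full_text_assessed, included)
```

The listed full-text exclusion reasons add up to the 31 excluded studies, and the remaining count matches the seventeen included RCTs.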

Fig. 1 Flowchart of literature search and study selection process

Characteristics of the seventeen eligible RCTs are presented in Table 1. These RCTs were published between 2002 and 2018 and involved 5204 patients (50.3% of whom underwent LG). Within each study, there were no differences in the demographic and clinicopathological characteristics of patients between the LG and OG groups. Eight trials were conducted in China [25,26,27, 29, 35, 37, 39, 40], five in Japan [28, 31, 32, 36, 38], three in Korea [11, 33, 34], and one in Italy [30]. Early gastric cancer (EGC) patients were included in six studies [28, 32, 33, 36, 38, 39], and advanced gastric cancer (AGC) patients were enrolled in another six trials [25, 29, 34, 35, 37, 40]. Distal gastrectomy was adopted in nine trials [26, 28, 30, 32,33,34, 36, 38, 40]. The methodological quality assessment for each risk-of-bias item in each included trial is shown in Fig. 2.

Table 1 Baseline characteristics of studies included in the meta-analysis

Fig. 2 Risk of bias. a Risk of bias graph. b Risk of bias summary

Primary outcomes

Sixteen trials reported the number of lymph nodes harvested during surgery. However, in Kim’s trial, the extent of lymphadenectomy differed significantly between the groups at baseline (P = 0.002). More patients underwent D2 lymphadenectomy in the OG group than in the LG group, which could introduce a substantial bias in the number of lymph nodes harvested during surgery [11]. Therefore, we excluded this trial from this analysis. The pooled data showed no difference between the two groups in the number of lymph nodes harvested during surgery, with moderate heterogeneity under the random-effects model (MD = − 0.72, 95% CI = [− 1.50, 0.07], P = 0.07) (Fig. 3a).

Fig. 3 Forest plots comparing the laparoscopic gastrectomy (LG) and open gastrectomy (OG) groups on primary outcomes. a The number of lymph nodes harvested during surgery. b Severe complications. c Long-term recurrence. d Short-term mortality. e Long-term mortality

Severe complications were defined as complications of grade III or higher according to the Common Terminology Criteria for Adverse Events (CTCAE) ver. 4.0 or the Clavien-Dindo classification. Fourteen trials reported severe complications. The fixed-effects model showed no difference between the two groups, without statistically significant heterogeneity (RR = 0.90, 95% CI = [0.65, 1.26], P = 0.55) (Fig. 3b).

Short-term recurrence was defined as local recurrence, surgical recurrence, or distant metastasis occurring within 6 months after surgery. Four trials reported short-term recurrence, and no patient in either group had a recurrence. Therefore, although an effect estimate could not be calculated, these trials showed no difference in short-term recurrence between the LG and OG groups. Seven trials reported long-term recurrence, which was defined as recurrence beyond 6 months after surgery. The fixed-effects model showed no difference between the two groups, without heterogeneity (HR = 0.99, 95% CI = [0.78, 1.26], P = 0.93) (Fig. 3c).

Fifteen trials reported short-term mortality, which was defined as death in hospital or within 1 month after surgery. The fixed-effects model showed no difference between the two groups, without statistically significant heterogeneity (RR = 1.50, 95% CI = [0.52, 4.35], P = 0.45) (Fig. 3d). Nine trials reported long-term mortality, which was defined as death out of hospital and beyond 1 month after the operation. The fixed-effects model showed no difference between the two groups, without heterogeneity (HR = 1.03, 95% CI = [0.80, 1.32], P = 0.82) (Fig. 3e).
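The relation between a pooled estimate, its 95% CI, and the reported P value can be sketched as follows. This is an illustrative check rather than the software's own computation: it assumes a normal approximation, a CI that is symmetric on the analysis scale (the log scale for RR and HR), and it works from rounded published values, so small discrepancies are expected. The function name `p_from_ci` is hypothetical.

```python
import math

Z_95 = 1.959964  # two-sided 95% normal quantile


def p_from_ci(estimate, lower, upper, ratio=False):
    """Approximate z statistic and two-sided P value from an estimate and 95% CI.

    Set ratio=True for RR/HR, which are analysed on the log scale.
    """
    if ratio:
        estimate, lower, upper = (math.log(v) for v in (estimate, lower, upper))
    se = (upper - lower) / (2.0 * Z_95)     # CI half-width divided by z quantile
    z = estimate / se
    p = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided normal tail probability
    return z, p


# Values reported in the text for the two mortality outcomes.
z_short, p_short = p_from_ci(1.50, 0.52, 4.35, ratio=True)  # short-term, RR
z_long, p_long = p_from_ci(1.03, 0.80, 1.32, ratio=True)    # long-term, HR
print(round(p_short, 2), round(p_long, 2))
```

The same function applies to mean differences with `ratio=False`, since those are pooled on their natural scale.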

Secondary outcomes

Compared with the OG group, the LG group had a longer operative time (MD = 58.80 min, 95% CI = [45.80, 71.81], P < 0.001), less intraoperative blood loss (MD = − 54.93 ml, 95% CI = [− 81.60, − 28.26], P < 0.001), shorter times to first flatus (MD = − 0.58 days, 95% CI = [− 0.79, − 0.37], P < 0.001), first ambulation (MD = − 0.50 days, 95% CI = [− 0.90, − 0.09], P = 0.02), and first oral intake (MD = − 0.64 days, 95% CI = [− 1.24, − 0.03], P = 0.04), and a shorter hospital stay (MD = − 1.37 days, 95% CI = [− 2.05, − 0.70], P < 0.001), with significant heterogeneity under random-effects models (Fig. 4a–e, Fig. 5a).

Fig. 4 Forest plots comparing the LG and OG groups on secondary outcomes. a Operative time. b Intraoperative blood loss. c Time to first flatus. d Time to first ambulation. e Time to first oral intake

Fig. 5 Forest plot between the LG and OG group on secondary outcomes. a Hospital stay. b The number of patients who need blood transfusion. c The quantity of blood transfusion. d The frequency of analgesic administration. e The duration of analgesic administration. f Total complications

There were no differences in the number of patients who needed blood transfusion (RR = 0.77, 95% CI = [0.57, 1.05], P = 0.1) or in the quantity of blood transfusion (SMD = 0.06, 95% CI = [− 0.27, 0.38], P = 0.74) under a fixed-effects model with no heterogeneity (Fig. 5b, c). The fixed-effects models also showed that analgesic administration was less frequent and of shorter duration in the LG group than in the OG group, with no heterogeneity (frequency: MD = − 1.73, 95% CI = [− 2.21, − 1.24], P < 0.001; I2 = 0, P = 0.42; duration: MD = − 1.26, 95% CI = [− 1.40, − 1.12], P < 0.001; I2 = 0, P = 0.57) (Fig. 5d, e).

Total complications were defined as complications that occurred during the same hospitalization or within 30 days after the operation. Sixteen trials reported total complications. The fixed-effects model showed that patients in the LG group experienced fewer total complications after surgery than those in the OG group (RR = 0.81, 95% CI = [0.71, 0.93], P = 0.003), without statistically significant heterogeneity (Fig. 5f).

Subgroup analysis

Primary outcomes consisted of lymph nodes harvested during surgery, severe complications, short- and long-term recurrence, and mortality. Because the primary outcomes are the key surgical and prognostic markers, we conducted subgroup analyses of these indicators, stratified by cancer stage (early gastric cancer and advanced gastric cancer) and by type of gastrectomy (distal gastrectomy). The subgroup analyses showed no difference in lymph nodes harvested during surgery, severe complications, recurrence, or mortality between the two groups. Detailed results are shown in Tables 2 and 3.

Table 2 Subgroup analysis of laparoscopic versus open gastrectomy stratified by different tumor stage
Table 3 Subgroup analysis of laparoscopic versus open gastrectomy stratified by different type of gastrectomy

Sensitivity analysis and publication bias

Sensitivity analysis is an analytic procedure used to explore sources of uncertainty in pooled results. We tested each comparison with both fixed- and random-effects models and arrived at consistent conclusions (data not shown). Egger’s test was conducted for each comparison to evaluate publication bias. Publication bias was detected in the number of lymph nodes harvested during surgery, the duration of analgesic administration, and the time to first flatus (Table 4); however, when the trim-and-fill method was applied, no trials were trimmed for the number of lymph nodes harvested or the duration of analgesic administration. For the time to first flatus, after one trial was filled, the revised result remained consistent under the random-effects model (MD = − 0.61 days, 95% CI = [− 0.82, − 0.41], P < 0.001) and the fixed-effects model (MD = − 0.81 days, 95% CI = [− 0.86, − 0.76], P < 0.001), indicating that publication bias did not alter the conclusion for this comparison. The filled funnel plot is shown in Fig. 6.
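For readers unfamiliar with Egger’s test, the underlying regression can be sketched in a few lines. This is a minimal illustration of the classical formulation (standardized effect regressed on precision, with the intercept measuring funnel-plot asymmetry), not a reproduction of the STATA routine used here; the function name and the input data are hypothetical.

```python
import math


def egger_intercept(effects, ses):
    """Ordinary least squares of (effect / SE) on (1 / SE).

    Returns the intercept and its standard error; an intercept far from
    zero relative to its standard error suggests small-study effects.
    """
    x = [1.0 / se for se in ses]                  # precision
    y = [e / se for e, se in zip(effects, ses)]   # standardized effect
    n = len(x)
    if n < 3:
        raise ValueError("at least three studies are needed")
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    resid_ss = sum((yi - intercept - slope * xi) ** 2 for xi, yi in zip(x, y))
    s2 = resid_ss / (n - 2)                       # residual variance
    se_intercept = math.sqrt(s2 * (1.0 / n + x_bar ** 2 / sxx))
    return intercept, se_intercept


# Hypothetical data with a built-in small-study effect: less precise studies
# (larger SE) report more extreme effects, so the intercept is about -1.
hyp_ses = [0.1, 0.2, 0.25, 0.4, 0.5]
hyp_effects = [-0.5 - 1.0 * se for se in hyp_ses]
intercept, se_intercept = egger_intercept(hyp_effects, hyp_ses)
```

The test statistic is the intercept divided by its standard error, compared against a t distribution with n − 2 degrees of freedom; that last step is omitted here to keep the sketch free of external dependencies.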

Table 4 Publication bias by Egger’s test

Fig. 6 Filled funnel plot with pseudo 95% confidence limits on time to first flatus

Discussion

Although several meta-analyses have compared the safety and efficacy of LG and OG for gastric cancer patients, concerns remain about the number of lymph nodes harvested during surgery and the long-term outcomes [12, 13, 18,19,20]. In our meta-analysis, we summarized the primary and secondary outcomes of LG versus OG for gastric cancer patients. After an extensive search of the literature, 17 RCTs were identified and included.

The primary outcomes are key surgical and prognostic indicators: the number of lymph nodes harvested during surgery, severe complications, recurrence, and mortality. For the number of lymph nodes harvested during surgery, we excluded Kim’s trial because the extent of lymphadenectomy differed significantly between the groups. In the OG group, 390 patients underwent D2 lymphadenectomy and 216 underwent D1 lymphadenectomy, whereas in the LG group 360 and 284 patients underwent D2 and D1 lymphadenectomy, respectively (P = 0.004). Kim et al. also acknowledged that this bias could be the reason that more lymph nodes were dissected in the OG group than in the LG group [11]. Therefore, it was necessary to exclude the trial from the pooled analysis of the number of lymph nodes dissected during surgery. The pooled data demonstrated no statistically significant differences in primary outcomes between the LG and OG groups. Subgroup analyses stratified by cancer stage and type of gastrectomy were conducted to check the sensitivity and stability of the results. The conclusions were consistent, which suggests that the efficacy of LG is comparable to that of OG for gastric cancer patients.
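The baseline imbalance in lymphadenectomy extent can be examined directly from the counts quoted above. The sketch below assumes a Pearson chi-square test without continuity correction on the 2 × 2 table; the test actually used in the original trial is not stated in this text, so the resulting P value need not match a reported one exactly. The function name is hypothetical.

```python
import math


def pearson_chi2_2x2(a, b, c, d):
    """Pearson chi-square (1 df, no continuity correction) for [[a, b], [c, d]]."""
    n = a + b + c + d
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # For 1 degree of freedom the upper tail equals erfc(sqrt(chi2 / 2)).
    p = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p


# Rows: OG, LG. Columns: D2, D1 lymphadenectomy (counts from the text).
chi2_stat, p_value = pearson_chi2_2x2(390, 216, 360, 284)
og_d2_share = 390 / (390 + 216)
lg_d2_share = 360 / (360 + 284)
print(round(og_d2_share, 3), round(lg_d2_share, 3), round(chi2_stat, 2), p_value)
```

The proportions make the direction of the imbalance explicit: a larger share of OG patients underwent D2 dissection, which is the mechanism proposed for the higher lymph node yield in that group.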

The secondary outcomes consisted of operative time, intraoperative blood loss, blood transfusion (number, quantity), measures of earlier postoperative recovery (analgesic administration, time to first flatus, first ambulation and first oral intake, and hospital stay), and total complications. The pooled data showed no differences between the two groups in the number of patients who needed transfusions or in the quantity of blood transfused. Patients in the LG group required a longer operative time than those in the OG group. However, compared with patients in the OG group, patients in the LG group lost less blood during the operation; had fewer total complications; required less analgesic administration; had shorter times to first flatus, first ambulation, and first oral intake; and had a shorter hospital stay. This suggests that LG has a safety advantage over OG for gastric cancer patients.

To check the stability of our results, we conducted a sensitivity analysis. We tested each comparison with both fixed- and random-effects models, and the conclusions were unchanged. Egger’s test showed that publication bias existed in the number of lymph nodes harvested during surgery, the duration of analgesic administration, and the time to first flatus. The conclusions remained consistent after Duval’s trim-and-fill method was applied, which suggests that our results were stable and reliable.

Nevertheless, this meta-analysis has some limitations. First, all of these RCTs have a high or unclear risk of bias in blinding because of medical ethics. Second, heterogeneity exists in operative time, blood loss, analgesic administration, hospital stay, and time to first flatus, ambulation, and oral intake. Finally, limited data were available to compare hospital costs and health-related quality of life, which are also important to patients when choosing the method of operation [26, 39, 40].

Conclusion

Our analysis indicates that LG was comparable to OG in the primary outcomes and had some advantages in the secondary outcomes. This suggests that LG is superior to OG for gastric cancer patients.