Laparoscopic liver resection is considered a feasible alternative to the open approach. Minimally invasive techniques have been widely used in the treatment of benign diseases such as hydatid cysts, hepatolithiasis, hemangiomas, focal nodular hyperplasia, and hepatic adenomas [1,2,3]. However, an increasing number of case series of malignant lesions scheduled for the laparoscopic approach have been published in the literature [4]. For instance, in other types of surgery laparoscopic access has been shown to be not inferior to an open approach in terms of oncological outcomes [5,6,7]. Most importantly, there are major well-known advantages related to the minimally invasive approach: less postoperative pain, lower morbidity, faster recovery, and better quality of life [8, 9].

Despite laparoscopy gaining popularity, its clinical utility and complexity in liver surgery, especially extensive liver resection, is still a subject of thorough analysis and discussion [10, 11]. It is often questioned whether reduced complications are enough to outweigh the benefits of open surgery such as the less steep learning curve and potentially shorter operative time. There is, however, evidence supporting non-inferior outcomes, primarily overall survival (OS) and disease-free survival (DFS).

Moreover, operations for hepatocellular carcinoma (HCC) in patients with liver cirrhosis and portal hypertension are considered difficult and may be associated with relatively high morbidity [12]. It has been proved that patients with liver cirrhosis have worse overall outcomes and a higher perioperative complication rate [13, 14].

So far only few meta-analyses comparing laparoscopic and open liver resections for HCC have been performed, and have not taken into consideration variances of the techniques such as pure laparoscopic and hand-assisted. This may create potential bias when drawing conclusions [15,16,17,18]. In addition, these reviews do not cover the many large-scale studies published in recent years.

To our best knowledge, this is the first systematic review and meta-analysis to evaluate exclusively pure laparoscopic liver resection (LLR) compared with open liver resection (OLR) for HCC.

The aim of this study was to evaluate different aspects of LLR, including its safety (morbidity and mortality), difficulty (operative time and blood loss), and clinical utility (long-term survival) in comparison with OLR.

Materials and methods

Search strategy

Our literature search included EMBASE, MEDLINE, Web of Science, and The Cochrane Library databases. The search terms used were “laparoscopy,” “pure laparoscopic,” “minimally invasive,” “liver resection,” “hepatectomy,” “hepatocellular carcinoma,” and their combinations with Boolean “AND” and “OR” operators. There were no date restrictions and only full texts in English were included. Our last search was performed on December 1, 2017. The full search strategy is available in Supplementary File 1. The systematic review was registered and its protocol published in the International Prospective Register of Systematic Reviews (PROSPERO) under registration number CRD42018084576.

Study selection

Results of the initial search were screened independently by two teams with three reviewers in each team. Studies containing data comparing morbidity between patients undergoing pure laparoscopic and open liver resection for HCC were considered eligible for inclusion. All studies describing hand-assisted or hybrid resections (without subgroup data on pure laparoscopic resections), national registries, reviews, and animal studies were excluded. Both non-randomized and randomized studies were eligible as long as they matched the inclusion criteria.

Data extraction and outcome measures

Outcomes of this systematic review were overall morbidity, major morbidity, specific complications (bile leak, abscesses, cardiopulmonary), blood loss, surgical site infection rate, conversion rate, operative time, reoperation and readmission rates, R0 resection rate, length of hospital stay, and 3- and 5-year OS and DFS rates. Data on type of study, number of patients enrolled, patients’ age and sex, tumor size, types of surgery, and liver function status (cirrhosis, Child scale) were also extracted. Major morbidity was extracted when stated or—if the Clavien–Dindo scale was used—complications rated as Clavien–Dindo grade 3 and higher were considered major.

Statistical analysis

The analysis was performed using RevMan 5.3 (freeware from The Cochrane Collaboration) and R version 3.4.3 with meta package [19]. Statistical heterogeneity and inconsistency were measured using Cochran’s Q test and I2, respectively. Qualitative outcomes from individual studies were analyzed to assess individual and pooled risk ratios (RR) with pertinent 95% confidence intervals (CI) favoring pure laparoscopic over open liver resection for HCC and by means of the random-effects method. When appropriate, mean and standard deviation (SD) were calculated from medians and interquartile ranges using a method proposed by Hozo et al. [20]. Weighted mean differences with 95% CI are presented for quantitative variables using the inverse variance random-effects method. Statistical significance was observed at the two-tailed 0.05 level for hypothesis and 0.10 level for heterogeneity testing, while unadjusted p values were reported accordingly. To help with clinical interpretation of heterogeneity, we computed prediction intervals (PIs), as suggested by IntHout et al. [21], with the meta R package utilizing the approach of Higgins [22].

Quality assessment

The quality of non-randomized studies was evaluated with the Newcastle–Ottawa Scale (NOS), which consists of three factors: patient selection, comparability of the study groups, and assessment of outcomes. A score ranging from 0 to 9 is assigned to each study, and those that achieve a score of 6 or greater are considered of high quality. The Cochrane risk of bias tool was used to assess the quality of the included randomized controlled trials. We used funnel plots and Egger’s test with meta-regression model to explore possible publication bias [23]. In cases of funnel plot asymmetry, the trim-and-fill method was applied to estimate the cause of asymmetry and correct it [24].

This review was performed strictly following Preferred Reporting Items for Systematic Reviews (PRISMA) guidelines [25] and the MOOSE consensus statement [26].

Results

Study identification

The initial search yielded 3852 articles. After removing 1767 duplicates, 2085 studies were screened by their titles and abstracts for further analysis. Since 1624 did not match the review criteria, 461 full-text articles were screened for eligibility and of these, 418 were later excluded. The PRISMA flowchart and reasons for study exclusion are shown in Fig. 1.

Fig. 1
figure 1

PRISMA flowchart of the study

Characteristics of included studies

The characteristics of a total of 5100 patients from 43 studies included in the meta-analysis are specified in Table 1 [27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69]. The only randomized controlled trial was conducted by Jiang et al. [48].

Table 1 Included studies sorted by year descending

Hospital volume

We estimated the volume of hospitals where studies were performed. Some institutes were noticeably very high-volume centers with almost 3000 cases in 6 years [39], while others performed as few as 60 procedures in 5 years [61].

Study quality

In all included studies, their quality was rated as high (≥ 6 by assessment using the NOS scale), and the risk of bias of the included randomized controlled trial was low according to Cochrane criteria.

Type of surgery

27 studies reported data on types of resections performed, including number of hemihepatectomies, although the reporting style and detail varied between articles.

Liver function

With respect to cirrhosis in patients, 27 studies reported data. Of these, 11 included only patients with cirrhosis. In total, 1065 out of 1257 (84.73%) and 1831 out of 2150 (85.16%) patients with cirrhosis were reported in LLR and OLR groups, respectively. Meanwhile, 33 studies reported data on patients’ Child–Pugh score, with 14 trials analyzing only subjects with Child–Pugh grade A.

Tumor size

35 manuscripts reported on tumor size. There is a noticeable trend of submitting patients with larger lesions to undergo OLR, leading to a potential yet incomputable bias. Pooled estimate analysis showed a significant trend toward smaller tumor sizes in LLR (mean difference − 0.26, 95% CI − 0.42 to − 0.10, p for effect < 0.001). However, the data are highly heterogeneous (I2 = 79%, p < 0.001).

Pringle maneuver

14 articles reported on use of the Pringle maneuver. 163 of 584 (27.91%) LLR patients and 250 of 865 (28.90%) OLR patients underwent this technique during surgery.

Overall morbidity

All studies reported on overall morbidity. The pooled analysis (Fig. 2A) indicates that LLR is connected with reduced overall morbidity (15.59%) compared with OLR (29.88%): RR = 0.53, 95% CI 0.47–0.60, p for overall effect < 0.00001, p for heterogeneity 0.29, I2 = 10%. The funnel plot is asymmetric and Egger’s regression test result significant, both of which indicate potential publication bias. Trim-and-fill analysis was performed and 10 studies that are mirror images of most outlying studies were filled in, [27, 34, 43, 44, 47, 48, 51, 62,63,64] as seen in Fig. 2B. Nevertheless, after this evaluation, the significance of the results remained unchanged, with RR = 0.57, 95% CI 0.49–0.65, and PI = 0.34–0.95.

Fig. 2
figure 2

A Pooled estimates of overall morbidity for pure laparoscopic versus open liver resection for hepatocellular carcinoma. CI confidence interval, df degrees of freedom, MH Mantel–Haenszel. B Funnel plot for results from all studies after trim-and-fill analysis for overall morbidity. White dots represent filled-in studies

Major morbidity and mortality

Major morbidity was reported in 32 studies (n = 4080 patients). Results in only two of them [30, 42] were significantly different, and pooled RRs favor LLR (3.76%) over OLR (8.69%), with RR = 0.48, 95% CI 0.36–0.63, p for effect < 0.00001, p for heterogeneity 0.99, I2 = 0%, and PI virtually equal to CI (Fig. 3A). Thirty-two studies (n = 3657 patients) reported data on mortality. Mortality rate in LLR was 1.58% versus 2.96% in OLR. Pooled analysis showed that mortality was also not significantly different between open and pure laparoscopic groups: RR = 0.64, 95% CI 0.42–1.00, p for effect 0.05, and I2 = 0%. PI was slightly wider than CI, being 0.40–1.04 (Fig. 3B).

Fig. 3
figure 3

Pooled estimates of major morbidity (A) and mortality (B) for pure laparoscopic versus open liver resection for hepatocellular carcinoma. CI confidence interval, df degrees of freedom, MH Mantel–Haenszel

Complications

Bile leak

Bile leak rate was reported in 29 studies (n = 3831 patients). There were no significant differences between groups, with rates of 1.70% in the LLR group and 2.33% in the OLR group: RR = 0.77, 95% CI 0.48–1.24, p for effect 0.28, and I2 = 0% (Fig. 4A).

Fig. 4
figure 4

Pooled estimates of bile leak (A), abscesses (B), and pulmonary complications (C) for pure laparoscopic versus open liver resection for hepatocellular carcinoma. CI confidence interval, df degrees of freedom, MH Mantel–Haenszel

Abscesses

Fifteen trials (n = 2034 patients) reported on abscess occurrence. One out of 780 (0.13%) patients in the LLR group and 14 of 1254 (1.12%) in the OLR group had abscesses. Pooled analysis showed no statistical differences between these groups: RR = 0.40, 95% CI 0.14–1.18 p = 0.10, I2 = 0% (Fig. 4B).

Pulmonary complications

Twenty-eight studies (n = 3343 patients) reported the occurrence of pulmonary complications: 5.01% of patients undergoing LLR and 10.03% in the OLR had this morbidity, its rate being significantly reduced in the LLR group: RR = 0.58, 95% CI 0.44–0.75 (PI 0.43–0.77), p for effect < 0.0001, and I2 = 0% (Fig. 4C).

Blood loss

Data on blood loss were reported in 34 studies (n = 4116 patients). The heterogeneity for this outcome was very high, I2 = 94%. Sensitivity analysis did not find any potential sources of heterogeneity. For this reason, we decided not to pool the results.

Operative time

Operative time was reported in 43 studies (n = 5100 patients). We did not pool the results because of the very high heterogeneity (p < 0.0001, I2 = 91%). Sensitivity analysis did not find specific studies that caused these results.

Length of hospital stay

Length of hospital stay was reported in 42 studies (n = 5032 patients). However, heterogeneity was high (I2 = 85%) and its source was not revealed by sensitivity analysis. Thus, no pooling was performed.

Survival

Three-year OS was reported in 21 studies (n = 2950 patients), while 18 (n = 2467 patients) trials reported 5-year OS. There were no significant variations among the analyzed groups: the LLR group had OS rates of 83.72% and 68.97% in 3 and 5 years, respectively. The OS rate in patients undergoing OLR was 80.82% in 3 years and 68.12% in 5 years. For 3-year OS, RR = 0.86, 95% CI 0.72–1.02, p = 0.08, and I2 = 6% (Fig. 5A). For 5-year OS, RR = 0.94, 95% CI 0.82–1.09, p = 0.41, and I2 = 27% (Fig. 5B).

Fig. 5
figure 5

Pooled estimates of 3-year (A) and 5-year (B) overall survival for pure laparoscopic vs. open liver resection for hepatocellular carcinoma. CI confidence interval, df degrees of freedom, MH Mantel–Haenszel

Nineteen studies (n = 2836 patients) reported on 3-year DFS. Thirteen articles (n = 1829 patients) provided information on 5-year DFS. Pooled analysis showed no differences between groups for either outcome. The 3-year DFS rate was 59.45% in the LLR group and 59.05% in the OLR group, while the 5-year DFS rate was 46.57% for LLR patients and 44.84% for OLR patients. For 3-year DFS, RR = 1.02, 95% CI 0.90–1.15, and p = 0.81. We analyzed sources of moderate heterogeneity (I2 = 47%), and in sensitivity analysis we found two studies that affected the results [42, 64]. After their removal from the meta-analysis, pooling confirmed previous findings, with RR = 1.01 (95% CI 0.93–1.10) and virtually no heterogeneity (I2 = 0%) (Fig. 6A). For 5-year DFS, RR = 0.97, 95% CI 0.90–1.05, p = 0.46, and I2 = 0% (Fig. 6B).

Fig. 6
figure 6

Pooled estimates of 3-year (A) and 5-year (B) disease-free survival for pure laparoscopic versus open liver resection for hepatocellular carcinoma. Forest plot presents studies after sensitivity analysis. CI confidence interval, df degrees of freedom, MH Mantel–Haenszel

Discussion

Summary of findings

There is growing evidence supporting the feasibility of laparoscopic liver resection for HCC, and its safety is confirmed in our meta-analysis. This review, including over 5000 patients, shows that pure laparoscopy significantly reduces morbidity while at the same time delivering survival comparable with that of open surgery. Because of the very high heterogeneity, it is not possible to definitively assess the differences in blood loss and operative time. Although the quality of all included studies was assessed using standardized tools as high, all but one are non-randomized, which may introduce selection bias. Moreover, there were differences in tumor size and in the use of the Pringle maneuver between LLR and OLR groups, which may cause serious bias and troublesome interpretation of results.

In addition, we did not include three studies because of the language limitations. However, based on abstract screening, number of cases in them was relatively small (less than 50 cases). Therefore, it is very unlikely that their inclusion would alter the final results.

We realize that our review is not the first to be conducted on this topic. However, previously published meta-analyses on liver resections for HCC either did not take the type of laparoscopic technique into consideration [16,17,18] or performed a subgroup analysis that mistakenly assigned trials with hybrid resections to the LRR group as in Sotiropoulos et al. [15]. This might have introduced a major methodological bias to previous studies. In addition, more recent large-scale trials are not included in previous reviews. Moreover, several meta-analyses, including recently published one by Goh et al., were limited to cirrhotic patients, which does not allow to draw conclusions with wide clinical applicability of laparoscopy [70].

These facts prompted us to revisit this topic and follow strict methodological and surgical criteria to obtain the best available evidence. Additionally, we used PIs to interpret whether the results would be applicable in different clinical settings.

When defining the aim of our study, we strived to tackle the issue on different levels in terms of LLR safety (by analyzing morbidity, mortality, and specific complications), difficulty (operative time and blood loss), and its long-term results (OS and DFS).

LLR safety

Overall morbidity is crucial in our review for assessment of the method’s safety. Pooled analysis confirmed the benefits of laparoscopy with low heterogeneity and its demonstrable effect in different settings. It is worth noting that only one study, by Hu et al. [61], reported higher overall morbidity in LLR. Also, common general (i.e., pulmonary) complications were less likely to occur in the LLR patients, while procedure-associated complications (bile leak, abscesses) did not differ compared with OLR.

Nevertheless, results varied between studies: some showed an overall complication rate as high as 20.31% for LLR [42] while others reported no morbidity at all among 50 patients undergoing LLR [48]. Such discrepancies can be explained to some extent by hospital volume or surgeon experience, as discussed further in the limitations of our review below, but it seems that the definitions and reporting of specific complications are not standardized, which may result in significant discrepancies between included studies and eventual bias.

Pooled analysis also showed no differences in mortality (1.58% in LLR vs. 2.96% in OLR; RR = 0.64; 95% CI 0.42–1.00), which generally is a relatively rare complication in liver resections for HCC. Only one study by Xiang et al. [42] had a higher mortality rate in the LLR group, but it is important to note that this was based on one death in both groups: 1 out of 128 in LLR and 1 out of 208 in OLR.

Difficulty of LLR and OLR

Laparoscopy in surgery is sometimes disparaged as being more complex, supposedly because of the steep learning curve, longer operation times, and greater blood loss [71]. However, it has been shown that more experienced surgeons have, in fact, shorter median operative times as well as reduced blood loss and conversion rates in liver surgery [72]. Many parameters, such as hospital volume, are difficult to compare, which directly affects the experience and subsequent intraoperative results. Some studies point out that an increase in operative time may be a result of the learning curve and should improve in the future [69]. Others, however, claim to have lowered it, as evidenced by a reduced conversion rate [36, 49]. This learning curve effect is nearly impossible to include in an analysis. These study limitations may be one reason why the legitimacy of LLR is often challenged in liver surgery. In our meta-analysis, we decided not to perform a pooled analysis of operative time, blood loss, and length of stay for reasons of significant heterogeneity. Even if pooling was possible, there is a potential bias because of highly variable operative techniques between centers. Most studies did not thoroughly describe the type of surgical devices and techniques of parenchyma transection, which influences the total blood loss. Another difference is the rate of use of the Pringle maneuver.

Usually laparoscopy is also associated with extended duration of surgery, but there is a possibility of patient selection bias reflecting surgeons’ preference to submit more complex cases to OLR. However, recent analyses showed that in liver resections a trend toward shorter operative time in laparoscopy may in fact be non-significant [73, 74]. Types of surgical instruments used in LLR may also affect the operative time [75].

The very high heterogeneity of these results (I2 = 94% for blood loss, I2 = 91% for operative time, and I2 = 85% for length of stay) does not allow us to draw definitive conclusions, and it would seem that non-randomized trials may not be able to resolve this issue.

Long-term results

The meta-analysis confirmed previous findings [17, 18] that LLR does not differ from OLR in terms of OS and DFS. This has to be juxtaposed with the clear benefits of LLR safety as well as its vague yet potentially greater difficulty. Our meta-analysis of more than 5000 cases points out the weakness of non-randomized trials that do not allow for unequivocal conclusions. All studies but one, by Jiang and Cao [48], presented non-randomized groups. This, in our opinion, is a massive drawback that must be taken into account when discussing the data. We interpret our results as a plea for well-designed multicenter trials analyzing the type of surgery, complexity of the procedure, surgeon’s experience, and hospital volume.

Due to inconsistent reporting in included manuscripts, we did not analyze recurrence-free survival separately from DFS. Although a few publications did indeed evaluate recurrences in LLR versus OLR, mean follow-up time varies significantly between studies, making it impossible to pool results without bias.

A very important bias results from patient allocation. There are indisputable differences between LLR and OLR patients regarding the tumor size, cirrhosis, or, intraoperatively, use of the Pringle maneuver. This limitation can only be overcome by randomized controlled trials. Moreover, center volumes highly differ, as does surgeons’ experience. The latter was not reported in our included studies despite it being an extremely important factor in complex liver surgeries. In addition, use of the Pringle maneuver reported in the studies varies and resulted in very high heterogeneity. Finally, we did not analyze the potential impact of surgical technique and its differences between studies. Neither were perioperative care protocols considered, despite their application having been shown to be beneficial in many surgical disciplines [76, 77].

Conclusions

This systematic review with a meta-analysis, thus far the most comprehensive analysis comparing pure LLR with OLR for HCC, reveals major flaws in the available literature. The results indicate that LLR is safe in different clinical settings as it may be associated with reduced overall morbidity and non-procedure-specific complications, and no negative influence on mortality as well as OS and DFS. However, these results are based on non-randomized trials comparing heterogeneous groups of patients, thus introducing confounding variables from the outset.

In our opinion, therefore, there is no need for further non-randomized trials proving the feasibility and safety of LLR. This is a plea for large, multicenter, well-designed randomized controlled trials that can overcome the weaknesses of the available evidence.