Oncological esophagectomy is a key component of curative treatment for resectable esophageal cancer and can be performed as a combination of open, laparoscopic/thoracoscopic and robot-assisted surgery [1,2,3]. Over the years, minimally invasive esophagectomy and especially robot-assisted minimally invasive esophagectomy (RAMIE) have gained in popularity, possibly allowing technical and postoperative advantages [4,5,6,7,8]. To date, research on the added value of a robotic system for esophagectomy has mainly focused on the thoracic phase while the added value of the robotic system during the abdominal phase has rarely been studied [9]. Most surgeons perform the abdominal phase laparoscopically or via laparotomy [10]. Moreover, the decision to be operated via a fully robotic or only partially robotic approach may depend on multiple patient characteristics such as body mass index and comorbidities. A recent review elaborated on the effects of robot assistance during the abdominal phase of RAMIE suggesting its non-inferiority compared to conventional laparoscopic abdominal approaches [11]. However, it remains unclear how the robot-assisted abdominal phase during RAMIE relates to laparoscopy regarding oncological safety and perioperative complications. Therefore, the aim of this study was to compare hybrid laparoscopic RAMIE to full RAMIE for patients with esophageal cancer in an international propensity-score matched cohort study.

Materials and methods

Data were acquired from the prospectively maintained database from the Upper GI International Robotic Association (UGIRA) [12]. The UGIRA group was initiated in 2017 as a worldwide group investigating robotic surgery in upper gastrointestinal cancer and provides data on perioperative care of patients who underwent robotic esophagogastric surgery. Participating centers can provide data for the UGIRA database without a minimum number of robotic procedures. This design was intentionally chosen to also compare the first robot-assisted operations which might be under the influence of a learning curve. The registry consists of 23 participating centers and the UGIRA study group has a central institutional review board approval at the University Medical Center of Utrecht (17/837). For each participating center local ethical approval was either obtained or waived by the local ethical committee. The research proposal was reviewed by the scientific committee of UGIRA and was approved. This paper follows to the STROBE guidelines for observational cohort studies [13].

Patients, procedures and tumor entity

All patients of the UGIRA group who underwent full RAMIE and hybrid laparoscopic RAMIE for esophageal cancer between 2017 and 2021 were included. Full RAMIE consists of both a robot-assisted abdominal phase and a robot-assisted thoracic phase while the hybrid laparoscopic group consists of a laparoscopic abdominal phase and a robot-assisted thoracic phase. In this study, only procedures with curative intention and intrathoracic anastomosis (Ivor-Lewis) were included. One essential inclusion criterion defined adenocarcinoma or squamous cell carcinoma as acceptable tumor entities based on the preoperative histology during primary diagnosis. However, final histopathology could eventually differentiate other tumor entities such as mixed adenoneuroendocrine carcinoma (MANEC), poorly differentiated carcinoma or small cell carcinoma as well (named as other tumor entities).

Outcomes

The primary endpoint was postoperative complications according to Clavien Dindo grade 3a or higher. Secondary endpoints were intraoperative adverse events, in-hospital mortality, postoperative complications, and oncological outcomes including radical resection (R0) rates and lymph node yield. The eighth TNM edition was used for both clinical and pathological TNM stage. The definition of the College of American Pathologists was used for radical resection (i.e., no tumor cells within the resection margins).

Statistical analysis

Data management, missingness imputation and propensity-score matching (PSM) were all realized via Python 3.9 [14] within the integrated development environment of Visual Studio Code (Version 1.59). Patients who underwent full RAMIE were compared to patients who underwent hybrid laparoscopic RAMIE. To account for missing data, case-specific and variable-specific missingness of more than 25% was excluded. Eventually, the overall rate of missing data in the whole dataset was calculated as 2.0% which is widely accepted as a legitimate threshold for imputation. We performed multiple imputations with n = 1000 iterations via Iterative Imputer from Sci–kit learn [15]. After multiple imputation, a propensity-score matching analysis was performed via the Python package pymatch (adapted from the R package Matching [16]) to reduce the effect of known confounders to a minimum. As potential confounders, all variables were utilized which were available before surgery and which were considered as potentially relevant for the decision to either belong to the full RAMIE or hybrid laparoscopic RAMIE group. Through logistic regression, a propensity-score was calculated for each patient based on the selected characteristics displayed in Table 1. Matched study groups were created using nearest-neighbor one-to-one matching without replacement. A threshold of 0.001 was calculated to prevent poor matches after optimizing the threshold and simultaneous maximization of retained proportion according to the overlap of both groups (demonstrated in Fig. 1). After matching, the further comparison between full RAMIE and hybrid laparoscopic RAMIE was performed using Chi2-square tests for binary data, Mann–Whitney U test for ordinal data and student’s t-test for continuous data. A p-value of less than 0.05 was considered as statistically significant. StataSE Version 15.0 (by StataCorp LLC, College Station, TX) was eventually used for final statistical analysis after matching.

Table 1 Preoperative variables used for propensity-score matching
Fig. 1
figure 1

Overlap of data points with propensity-score plotted against data availability (see A). The augmentation of the PSM threshold does not necessarily lead to higher case numbers which is why a threshold of 0.001 was chosen with a retained proportion of > 80% (see B)

Results

Study population

A total of 807 patients underwent Ivor-Lewis esophagectomy and were included for propensity-score matching. Table 1 summarizes all preoperative variables which were used for logistic regression to achieve propensity-score matching. Several parameters such as ASA score (p < 0.0001), year of esophagectomy (p = 0.0002) and history of malignant disease (p = 0.021) were significantly different between groups before matching. Figure 1A shows the overlap of patients in both groups (full RAMIE versus hybrid laparoscopic RAMIE) with their propensity-score plotted on the x-axis. To maximize data similarity, propensity-score matching was eventually performed with an average accuracy of the score of 65.11% based on the selected preoperative variables. In Fig. 1B, the threshold is depicted in relation to the retained proportion of cases. Finally, 296 patients were matched for each group. The last two columns of Table 1 demonstrate the frequency distributions and test statistics of both matched groups with all parameters not showing any significant differences.

Hybrid laparoscopic RAMIE versus full RAMIE

Table 2 demonstrates all outcome variables and test statistics for both full RAMIE and hybrid laparoscopic RAMIE. Figure 2 shows all continuous outcome variables represented as box plots.

Table 2 Intraoperative and postoperative outcome variables with according test statistics after propensity-score matching of both groups hybrid laparoscopic RAMIE and full RAMIE
Fig. 2
figure 2

Box plots for continuous outcome parameters: Significant differences were found for length of stay on ICU (see C, p = 0.0005) and for total length of in-hospital stay (see D, p < 0.0001). Box plots for A total intraoperative blood loss, B total operational time, C length of stay on ICU, D length of in-hospital stay, E total lymphnode yield, F number of positive lymphnodes. ICU intensive care unit

Intraoperative parameters such as blood loss (p = 0.6967) and operational time (p = 0.1032) were not significantly different between both groups. Median intraoperative blood loss was measured as 200 ml for hybrid laparoscopic RAMIE and as 197 ml for full RAMIE. Mean operational time was averaged 430.3 min for hybrid laparoscopic RAMIE compared to 417.7 min for full RAMIE. Significant differences could be found for the length of stay (LOS) on intensive care unit (median LOS of 3 days for hybrid laparoscopic RAMIE versus 2 days for full RAMIE, p = 0.0005) and total in-hospital stay (median LOS of 15 days versus 12 days, p < 0.0001) (Fig. 3)

Fig. 3
figure 3

Bar graphs of binary outcome parameters: Significant differences were found for complications according to Clavien Dindo grade 3a or higher (p < 0.001) and anastomotic leakage (p = 0.001). RAMIE robot-assisted minimally invasive esophagectomy

Oncological outcome parameters such as radical resection (R0) rates (95.6% for hybrid laparoscopic RAMIE versus 96.3% for full RAMIE, p = 0.8526) and total lymph node yield (mean 30.4 for hybrid laparoscopic RAMIE versus 29.5 for full RAMIE, p = 0.3834) were comparable between both groups. Likewise, the number of positive lymph nodes in the final histopathology did not differ between both groups (median 0 for both RAMIE groups, p = 0.6122). The conversion rate to open surgery during the abdominal phase was 2.4% in the hybrid laparoscopic RAMIE group compared to 1.7% in the full RAMIE group (p = 0.560). During the thoracic phase open surgery occurred in 1.4% of hybrid laparoscopic RAMIE cases and in 2.7% of full RAMIE cases (p = 0.243).

Postoperative complications with Clavien Dindo grade 3a or higher appeared more frequently in the hybrid laparoscopic RAMIE group (45.3% versus 26.0%, p < 0.001). This is confirmed via the U-test for the most severe Clavien Dindo grade reported for the individual patients (p < 0.0001). The overall postoperative complication rate was also higher in the hybrid laparoscopic RAMIE group (65.2% versus 48.3%, p < 0.001), including specific complications such as anastomotic leakage (28.0% versus 16.6%, p = 0.001). Readmission rates either to intensive care unit (17.5% for hybrid laparoscopic RAMIE versus 11.2% for full RAMIE, p = 0.071) or to hospital (9.8% versus 13.2%, p = 0.241) did not differ significantly between both groups. The rate of hospital-acquired pneumonia after surgery also did not differ between both groups (15.9% for hybrid laparoscopic RAMIE versus 16.6% for full RAMIE, p = 0.824).

Discussion

This propensity-score matched analysis compared hybrid laparoscopic RAMIE to full RAMIE and suggests that full RAMIE may be superior in terms of overall postoperative complications according to Clavien Dindo grade 3a or higher. A significantly lower percentage of anastomotic leakage was observed after full RAMIE as opposed to the hybrid laparoscopic RAMIE group. In addition, the length of in-hospital stay after full RAMIE was significantly shorter than after hybrid laparoscopic RAMIE. Oncological outcomes (such as radical resection rates or lymph node yield) and intraoperative parameters including operation time were equal for both procedures.

To date, only few studies have focused specifically on the abdominal phase by comparing full RAMIE with hybrid laparoscopic RAMIE [17,18,19,20]. For instance, a retrospective multicenter study by Grimminger et al. compared 175 full RAMIE procedures to 67 hybrid (either laparoscopic or open laparotomy) RAMIE procedures and demonstrated that full RAMIE was associated with significantly lower postoperative complications including anastomotic leakage and respiratory failure [20]. Since there is not much evidence in the current literature, it is necessary to reflect on potential benefits of the robotic abdominal approach. Thus, shorter operation times after full RAMIE and a more precise dissection and reduction in surgical trauma of the gastric conduit could theoretically lead to less complications such as anastomotic leakage of the esophagogastrostomy [20,21,22]. On the other hand, financial expenses of a robotic system and its maintenance are often debated. It has been shown that hybrid laparoscopic RAMIE can be performed with comparable costs in comparison to full RAMIE in the setting of a high-volume European medical center [23]. If robotic assistance does truly lead to a decrease in postoperative complications, it is thinkable that costs could be saved on the long run regarding avoidable time and resources during intensive care and postoperative course [24].

Concerning the limitations of this study, it is to state that the retrospective design based on the UGIRA database may not respect standardized operational steps of the participating centers (such as the implementation of a feeding jejunostomy during the abdominal phase). Similarly, the acquisition of data regarding abdominal lymph node yield and operational time during the abdominal phase is heterogeneously available with a significant missingness due to different approaches by the centers. As another important limitation, it is necessary to discuss a potential learning curve effect leading to the concordant result that a robot-assisted abdominal phase might be superior to laparoscopy during RAMIE. It is very likely that a learning curve effect is involved in the hybrid laparoscopic RAMIE group. A robotic system is generally implemented in the thoracic phase at first place, and after completing the learning curve for the thoracic phase the robotic system may also be implemented for the abdominal phase. Hence, it may be possible that the full RAMIE cases included in this analysis were more frequently performed by a team that has more robotic experience. Consequently, it may be possible that the hybrid laparoscopic RAMIE group consists of procedures performed by surgeons who are undergoing the learning curve for RAMIE. According to the current literature, the learning curve for RAMIE is generally completed after 45–70 cases with the possibility of being shortened by following a structured training pathway that involves proctoring, and modular approaches may help to further reduce time to proficiency [25,26,27,28]. On the other hand, there is also a learning curve for the robot-assisted abdominal phase, although only few studies have dealt with this question and allegedly found a plateau phase after 14–22 cases [29, 30]. Moreover, the learning curve for non-robotic total MIE has also been reported to be relatively high with 119 cases [31]. The effect of the learning curve may be significant for the results of the presented study since the UGIRA registry holds data from centers that are yet in their learning curve. Anyhow, in order to solely compare the robot-assisted abdominal phase to laparoscopy a cohort without a learning curve effect is needed. In this way, only cases after completion of the learning curve for both the thoracic as well as the abdominal phase could be included in a follow-up study. Finally, the significance of a learning curve effect during the abdominal phase has to be elucidated especially in this setting of two cavity surgery where the thoracic phase is performed robotically in any case.

A strength of this study is the fact that it includes a large and international multicenter cohort representing the real practice of specialized hospitals. The UGIRA study group offers the unique opportunity to conduct comparative studies based on standardized procedures and a rigorous selection of participating medical centers. This study also features a strong methodology with a state-of-the-art statistical implementation for data handling, missingness imputation and propensity-score matching.

The current study showed that the use of a robotic system in the abdominal phase during RAMIE achieves comparably good postoperative outcomes. The study suggests that the implementation of a robotic system during the abdominal phase is safe without compromising histopathological results. In the future, it is inevitable to perform prospective and randomized studies investigating whether full RAMIE is truly superior to hybrid laparoscopic RAMIE regarding complications and long-term expenses.