Introduction

Lymph nodes located in the upper mediastinum—especially along the bilateral recurrent laryngeal nerve (RLN)—are common site of early metastatic spread in patients with upper and middle third esophageal squamous cell carcinoma (ESCC) [1].

Although a thorough RLN lymph node dissection (LND) allows a better staging, the routine application of RLN LND in esophageal cancer surgery remains uncommon. This is not only due to its unconfirmed therapeutic value but also due to the high morbidity that frequently accompanies RLN LND [4, 5]. Damage to the RLN causes a hoarse voice and swallowing difficulties. Coughing and airway clearance are also impaired by the inability to perform an adequate Valsalva maneuver, which in turn significantly increases the postoperative pulmonary complication rates [6].

Over the last two decades, video-assisted thoracoscopic esophagectomy (VATE) has gained increasing popularity owing to its capacity to provide an improved magnification and precise tissue dissection [9]. In experienced hands, VATE may result in a significant reduction of postoperative mortality and morbidity compared with open surgery [9]. However, a safe performance of RLN LND through VATE remains challenging. The limitations imposed by two-dimensional vision, instrument rigidity, and limited surgical spaces represent the main obstacles to achieving this goal. In addition, the potential occurrence of direct mechanical or thermal damage to the RLN has led to unchanged RLN palsy rates even when RLN LND is performed through VATE [12].

In recent years, robotic surgical systems incorporating three-dimensional stereoscopic vision and arms with wrist-like joints have allowed a meticulous dissection even in the limited mediastinum space. Despite the growing interest in robot-assisted thoracoscopic esophagectomy (RATE) [13], few published studies have directly compared its results with those of VATE [16, 17]. Moreover, the question as to how RATE compares with VATE for RLN LND has received scarce attention. Because our thoracic surgery center is witnessing this technology shift, we designed the current propensity-matched study to directly compare RATE with VATE in ESCC patients, with a special focus on the adequacy of LND along the RLN.

Materials and methods

Patients

This retrospective analysis was performed under the approval of our institutional review board. Consecutive ESCC patients who had undergone McKeown minimally invasive thoracoscopic esophagectomy and bilateral RLN LND between January 2013 and May 2016 were deemed eligible. Robot-assisted surgery was made available to our center as of October 2014, and it was subsequently offered to all patients. Patients who agreed to undergo a partially insured operation with the da Vinci Si HD Surgical System (Intuitive Surgical Inc., Sunnyvale, CA, USA) were included in the RATE group. All of the remaining patients who refused the partially insured use of RATE underwent VATE under complete health insurance coverage (VATE group). Pretreatment staging was based on the results of chest and abdomen CT scans, PET imaging, and endoscopic ultrasound. Patient staging was performed according to the American Joint Committee on Cancer (AJCC) staging criteria, seventh edition (2010). The severity of comorbidities was determined with the Charlson comorbidity index (CCI) [18]. The total score ranges between 0 and 37, with a score of zero indicating that no comorbidities are found.

Indications for neoadjuvant therapy

When clinically staged as T2N0M0 or higher, patients were offered neoadjuvant chemoradiotherapy (nCRT). It consisted of either a combination of cisplatin plus 5-fluorouracil or carboplatin plus paclitaxel given with concurrent radiation therapy (at a dose of 30 Gy before 2014 and 41.4–45 Gy thereafter). At 6–8 weeks after completion of nCRT, patients underwent surgical resection. Our standard surgical approach consisted of transthoracic esophagectomy followed by the creation of a gastric tube (either through laparotomy or laparoscopy) and a cervical esophagogastric anastomosis via left neck incision. All ESCC patients underwent a total two-field LND (which included the lower mediastinal/upper abdominal and bilateral RLN lymph nodes). However, neck LND was selectively performed only in patients who had evidence of lymph node involvement on preoperative CT or PET imaging. Patients were kept in an intensive care unit for at least one night after surgery and transferred thereafter to their appropriate ward. Jejunostomy tube feeding was started 24 h after surgery. A leak test was performed on the seventh postoperative day.

Thoracic procedure in the semiprone position

The patient was placed in the left semiprone position with single-lung ventilation. An artificial pneumothorax using CO2 at a pressure of 6–8 mmHg was created. As far as RATE was concerned, five trocars were positioned at the following sites: (1) a 12-mm trocar at the sixth intercostal space (ICS) along the posterior axillary line for a 30-degree angled thoracoscope; (2) a 8-mm trocar at the fourth ICS along the mid axillary line medially to the scapula for the right robotic arm (R, Fig. 1a); (3) a 8-mm trocar at the ninth ICS along the posterior axillary line for the first left robotic arm (L1, Fig. 1a); (4) a 8-mm trocar at the ninth ICS along the mid point between the mid-scapular line and the paraspinal line for the second left robotic arm (L2, Fig. 1a); and (5) a 12-mm trocar at the seventh or eighth ICS along the mid axillary line for an accessory port. A da Vinci Si robotic cart (Intuitive Surgical Inc.) was docked at 45-degree in a counterclockwise direction from the patient craniocaudal axis. With regard to VATE, four trocars were positioned (Fig. 1b). The observation port lied at the posterior axillary line in the eighth or ninth ICS, whereas the main working port was located at the midaxillary line in the fourth intercostal space. The secondary working ports were positioned at the scapular line and the midaxillary line in the seventh ICS. The surgeon and the assistant stood on the right side of the patient, with a high-resolution screen being located on the opposite side.

Fig. 1
figure 1

a Trocar arrangement during RATE performed with the patient lying in the semiprone position. R right robotic arm; L1 first left robotic arm; L2 second left robotic arm. b Trocar arrangement during VATE performed with the patient lying in the semiprone position

Dissection along the bilateral recurrent laryngeal nerve

After incision of the mediastinal pleura, the layer between the esophagus and the thoracic duct was dissected to ensure the maximum mobilization of the posterior part of the esophagus. The mediastinal pleura was then incised along the course of the right vagus nerve until the lower margin of the subclavian artery to identify the right RLN. Left RLN LND was started by exposing the cartilage portion of the trachea. The esophagus was then retracted toward the dorsal side by the third robotic arm (or by surgeon’s left hand in the case of VATE), while the trachea was pushed aside by an assistant to improve operative exposure. The soft tissue between the left side trachea and the esophagus (which included the left RLN and lymph nodes) was carefully dissected. Complete lymphadenectomy was subsequently performed by careful skeletonization of the left RLN. Video clips 1 and 2 demonstrate the key steps of lymphadenectomy along the bilateral RLN during RATE and VATE, respectively.

Gastric tube formation and cervical anastomosis

The mobilization of the stomach was performed under laparoscopy. All of the left gastric, paraesophageal, and splenic lymph nodes were removed. Mobilization of the stomach was conducted on the right gastroepiploic and right gastric arteries. A 4-cm gastric tube was prepared extracorporeally by making a 5-cm incision in the midline.

After construction of the gastric tube, a left cervical incision was performed and the cervical esophagus was identified. After completion of a stapled cervical esophagogastric anastomosis, a drain was placed and all of the incisions were closed.

Definitions

Perioperative complications were defined according to the system proposed by the Esophagectomy Complications Consensus Group [19]. Vocal cord function was assessed by an otolaryngologist using a flexible laryngoscope 2 weeks after surgery even when overt hoarseness was not evident. RLN palsy was further classified by site (unilateral vs bilateral) and type of treatment required (type I: no therapy required; type II: injury requiring elective surgical procedure; type III: injury requiring acute surgical intervention) [19]. Thirty- and 90-day mortality were defined as any death within 30 or 90 days, respectively, after the date of surgery. In-hospital mortality was defined as death occurring at any time during the postoperative hospital stay. Thirty-day readmission was defined as any inpatient admission to our institution within 30 days of discharge from a postoperative stay.

Statistical analysis

Patient characteristics, perioperative results, and surgical performances were analyzed with the Chi-square test, Fisher’s exact test, Student’s t test, and the Mann–Whitney U test, as appropriate. To control for potential confounding factors, patients were matched according to six variables (age, sex, CCI, clinical stage, use of CRT, and RLN nodal findings on preoperative CT imaging). Propensity scores for all patients were estimated by using a multiple logistic regression. Two comparable treatment groups were identified using a 1:1 match ratio based on 8 digits of the estimated propensity score. Data were analyzed using SAS version 9.3 (SAS Institute Inc., Cary, NC, USA) and SPSS version 20.0 (SPSS Inc., Chicago, IL, USA). p values < 0.05 (two tailed) were considered statistically significant.

Results

Patient characteristics

Between January 2013 and May 2016, we identified 141 ESCC patients who underwent McKeown esophagectomy accompanied by bilateral RLN LND with a minimally invasive approach (Fig. 2). Of them, 37 were treated with RATE and 104 received VATE. After propensity matching, a total of 34 patient pairs were included in the study. Table 1 shows the general characteristics of the study patients before and after propensity matching. Notably, patients in the RATE group were older and had a higher CCI before the matching procedure. Moreover, a higher proportion of patients in the VATE group were treated with CRT before surgery.

Fig. 2
figure 2

Flow of the patients through the study

Table 1 Demographic and clinical data of the study patients before and after propensity matching

Propensity-matched analysis

After propensity matching, the general characteristics of the RATE and VATE groups were well balanced. Notably, no significant intergroup differences were evident even for unmatched clinical variables (e.g., tumor location and tumor length). Details on the surgical quality before and after propensity matching are summarized in Table 2. No conversion to open thoracotomy occurred in either group. The thoracic operating time in the RATE group was longer than in the VATE group, albeit not statistically significant. Similarly, intraoperative blood loss and the need of blood transfusions did not show significant intergroup differences. The mean number of dissected nodes was similar in the two study groups, the only exception being the left RLN area. Specifically, the mean number of nodes removed from this region was 5.32 in the RATE group and 3.38 in patients who received VATE (p = 0.007).

Table 2 Surgical quality in the RATE and VATE groups

Table 3 shows the subgroup analysis of surgical quality in the RATE and VATE groups stratified according to the use of preoperative CRT (yes vs no). Although preoperative CRT significantly reduced the mean number of LND (39.75 vs 31.36, respectively, p < 0.001), RATE produced a higher lymph node yield along the left RLN compared with VATE (regardless of CRT use or not).

Table 3 Subgroup analysis of surgical quality in the RATE and VATE groups stratified according to the use of preoperative CRT (yes vs no)

With regard to morbidity after RLN LND, 17 (25%) patients suffered from RLN palsy; of them, 15 and 2 cases had left-sided and right-sided lesions, respectively. Most patients (13/17, 76.5%) with RLN palsy had self-limiting clinical manifestations that required no therapy (type I). Four patients with type II RLN palsy required elective office-based intracordal hyaluronate injections after discharge [20]. The short-term perioperative outcomes of the two study groups before and after propensity matching are depicted in Table 4. Besides RLN palsy, no significant differences were also observed in terms of pulmonary complications, anastomotic leaks, 30-day/90-day/in-hospital mortality, and 30-day readmission rates between the two groups.

Table 4 Short-term perioperative outcomes in the RATE and VATE groups

Discussion

The results of our study demonstrate that RATE allows achieving a higher mean number of dissected nodes along the RLN area (especially in the left side) compared with VATE, without increasing RLN palsy and pulmonary complications rates. Besides the case series reported by Suda et al. [16], this is the second study that directly compared McKeown RATE versus VATE. Notably, our current report includes a larger number of cases and is the first to apply a propensity-matched study design. We specifically performed a careful matching of several potential confounders that could affect both lymph node yields and postoperative outcomes.

Complete left RLN LND required the complete removal of soft tissue and lymph nodes located both ventrally and dorsally to the RLN (from the aortic arch level to the cervical part). Specifically, the dissection of nodes located ventrally to the left RLN is frequently the most challenging part of the surgical procedure. Video clip 2 illustrates the most common difficulties that may be encountered during left RLN LND performed through VATE. The limited flexibility of the instrument tip precludes a tension-free dissection of lymph nodes located between the trachea and the left RLN. Notably, RLN injury can be caused by contusions, excessive stretching, and thermal damage occurring during manipulation. In order to improve the operative exposure and reduce RLN injury, several novel approaches—including esophageal isolation with loops of umbilical tapes (with the goal of suspending the esophagus and ensure the maximum exposure of the left RLN) or the replacement of a double-lumen tube with a single-lumen tube coupled with CO2 insufflation (with the goal of reducing the trachea rigidity elicited by the double-lumen tube and facilitate a better countertraction)—have been recently proposed during VATE. [21, 22]. Unfortunately, and despite all of these technical improvement, left RLN LND remains difficult to be achieved through VATE. In this scenario, experienced assistants are strictly required to ensure a high-quality dissection. We believe that RATE offers at least three advantages as far as the dissection of the left RLN lymph nodes is concerned. First, the three-dimensional self-controlled magnified view provided by the robotic camera allows a better visualization of the upper mediastinum compared with a traditional two-dimensional monitor. Owing to its ability to offer an adequate depth perception, this feature significantly facilitates left RLN LND. Second, the use of the third robot arm (controlled by the operator) allows achieving an excellent operating exposure through the application of stable and self-controllable tractions and countertractions on the esophagus and trachea. With the help of a third robotic arm, the surgeon can perform left RLN LND while maintaining the full use of the primary “working” arms, ultimately ensuring a safer and easier approach to nodal dissection. All of the “self-controlled” features offered by the robotic system may ultimately abrogate the need for experienced surgical assistants (who are strictly required when VATE is performed). Third, RATE facilitates a meticulous tension-free dissection even in a limited anatomical space; this is achieved both by tremor filtering and the use of instruments with articulating function that allows obtaining seven degrees of freedom. It can be anticipated that all of these features may decrease the rates of RLN injury.

During the RATE procedure, we decided to place the third robotic arm on the surgeon’s leftmost side to facilitate the esophageal traction. Under these circumstances, the third robotic arm may come into collision with either the second robotic arm or the patient’s spine; a category of patients at special risk of developing this accident during the upper mediastinum dissection are those with a deep-seated esophagus (i.e., an esophagus located in the left side of mediastinum). When these patients are operated, it is advisable to shift the R port and the camera port (as reported in Fig. 1a) downward of one intercostal space; the third robotic arm (L2) should then be shifted to the rightmost side at the third intercostal space. Figure 3 illustrates the details of trocar design.

Fig. 3
figure 3

Trocar arrangement during RATE performed in patients with a deep-seated esophagus (i.e., an esophagus located in the left side of mediastinum). R1 first right robotic arm; R2 second right robotic arm; L left robotic arm

An important observation from our study is that a better operating exposure is not paralleled by a shorter thoracic operating time, with RATE time being 30 min longer than that of VATE (even though the difference was not statistically significant). One potential explanation for the prolonged operating time is the higher surgeon confidence provided by the robotic system, which can ultimately lead to the willingness of achieving a complete RLN LND. Another potential cause underlying this phenomenon is the inclusion of cases operated when RATE was still in its development stage (i.e., with surgeons operating during the early phase of their learning curves). Further studies specifically aimed at analyzing the surgical learning curves are needed to address this issue in an appropriate manner.

Our findings should be interpreted in the context of several caveats. First, our study shares the limitations of small-sized, retrospective investigations. We acknowledge the low number of participants and the reduced statistical power. Second, our results are limited by the short follow-up time. Although a higher number of dissected nodes along the left RLN was achieved in the RATE group, the question as to whether a more extensive RLN LND could result in a significant survival advantage clearly requires a long-term follow-up analysis. Third, the use of RATE was driven by the patient’s own willingness to undergo a partially insured robot-assisted operation. Despite the use of propensity matching, we cannot rule out the presence of potential selection biases. Consequently, prospective randomized studies are necessary to confirm and expand our findings.

Conclusions

Compared with VATE, RATE resulted in a higher lymph node yield along the left RLN without increasing morbidity.