Background

Oesophagectomies are considered major and complex surgery with significant postoperative pain and high postoperative complication rates which can decrease long-term survival (Booka et al. 2018). Epidural analgesia is often considered the gold standard form of postoperative analgesia for this surgery (Low et al. 2018). Epidurals have been shown to reduce postoperative pain scores and some postoperative complications such as respiratory failure in major abdominal surgery but with uncertain replication in oesophageal cancer surgery (Pirie et al. 2020; Rigg et al. 2002). Pulmonary complications are of interest as they are one of the most common postoperative complications, especially in high-risk open abdominal surgery; they predict long- and short-term health outcomes, admission to critical care and hospital length of stay (Booka et al. 2018; Odor et al. 2020).

Although epidural analgesia may potentially improve outcomes, contraindications include patient refusal, anticoagulant use or a patient’s pre-existing anatomical or neurological issues. Epidurals are associated with complications such as urinary retention, hypotension and partial or complete failure and rarer complications such as neurological damage (8.2–17.4 cases of permanent nerve damage per 100,000 patients receiving epidural analgesia) (Cook et al. 2009). Postoperative management of epidural analgesia also represents a higher resource requirement (Holtz et al. 2022). Recent evidence has suggested minimally invasive oesophagectomy surgery is gaining in popularity compared to open oesophagectomy surgery (Mann et al. 2020). Existing evidence from colorectal surgical data shows epidural analgesia achieves superior pain relief compared to opioid analgesia for open surgery, but not for less invasive (laparoscopic) surgery (Borzellino et al. 2024; Turi et al. 2024). Therefore, as minimally invasive oesophagectomy surgery increases in frequency, epidural analgesia may in turn become less beneficial. Finally, other regional techniques such as paravertebral and erector spinae catheters have been gaining favour in recent years, having many of the benefits of epidural analgesia but with a more favourable side effect profile, although randomised clinical trials are lacking (Feenstra et al. 2023). Many of these factors may result in a reduction in the use of epidural analgesia for postoperative pain management (Pirie et al. 2020).

Two previous meta-analyses compared analgesic techniques in oesophagectomies in 2017 and 2018 but found a paucity of prospective trials to compare. Regarding epidural analgesia versus intravenous opioid analgesia, Visser et al. (2017) observed no significant difference in pain scores at 24 and 48 h postoperatively, and Hughes et al. (2018) observed no significant difference in rest pain postoperatively (Visser et al. 2017; Hughes et al. 2018). Both reviews concluding that no benefit could be shown by epidurals regarding postoperative pulmonary complications (PPCs). Since these reviews, further relevant randomised trials have been published (Xu et al. 2023; Zhu et al. 2020; Li et al. 2019; Wang et al. 2019).

During the completion of this review, a network meta-analysis evaluating analgesic strategies post-oseophagectomy by Ramjit et al. (2024) was published showing an increase in postoperative forced vital capacity; a reduction in pain scores, opioid consumption, intensive care unit stay and time to extubation in thoracic epidural analgesia (TEA) versus systemic opioids (Ramjit et al. 2024). The review did not find enough data to analyse morbidity including postoperative pulmonary complications.

Our primary aim was to evaluate whether TEA reduced respiratory morbidity versus other analgesic techniques following oesophagectomy surgery, with a secondary objective to compare analgesic outcomes.

Methods

This review protocol was prospectively registered on PROSPERO (CRD42023484720) and followed guidance from the Preferred Reporting Items for Systematic reviews and Meta-Analysis (PRISMA) statement 2020 (Page et al. 2020).

The review question was as follows: “In adult patients undergoing elective oesophagectomy, does thoracic epidural analgesia influence postoperative pulmonary complications in comparison to other analgesic techniques?” We used the framework of PICOS (Population, Intervention, Comparison, Outcomes and Study design). Participants included adult patients undergoing elective oesophagectomy. Thoracic epidural analgesia was the comparator group. The intervention groups included any other form of analgesia such as intravenous opioids or other regional techniques. Study design was restricted to randomised clinical trials only.

Primary outcome

The primary outcome was a composite of postoperative pulmonary complications (PPCs), including respiratory infection, respiratory failure and atelectasis within 30 days of surgery. Standard diagnostic criteria were based upon the European Perioperative Clinical Outcomes (EPCO) consensus statement (Jammer et al. 2015). However, as the review search dates included a period before the most recent consensus definitions of PPCs, we categorised explicit descriptions of PPCs in each trial according to closeness of match to the EPCO definitions. Where a composite PPC was not reported, we contacted corresponding authors via email to request additional information, including primary data.

Secondary outcomes

Secondary outcomes included resting and dynamic 24- and 48-h pain scores measured with a 100-mm visual analogue scale (VAS), technical failure, postoperative nausea and vomiting (PONV) and length of hospital stay. Pain not only is important for humane reasons but also it slows progress towards enhanced recovery targets and can lead to further postoperative complications (Low et al. 2018). Technical failure was assessed because epidurals have a high rate of failure (27–32%) (Hermanides et al. 2012). This is at odds with the most common intervention group of intravenous opioid analgesia, which has virtually no failure rate. Technical failure can be defined as insufficient epidural analgesia which requires removal or switch of analgesic regimen and also includes accidental catheter dislodgement (Hermanides et al. 2012). PONV is included as it is a well-known side effect of opioid analgesia; it can be defined as the 24-h incidence of postoperative nausea or vomiting, as this is commonly reported in trials and the most clinically relevant time interval (Dolin and Cashman 2005). Finally, length of hospital stay is a well-established and important perioperative outcome, measured in time (hours) from admission to discharge.

Search strategy

We searched PubMed, Embase, and CENTRAL databases, using a combination of relevant keywords and medical subject heading terms for oesophagectomy surgery and epidural analgesia. Search limits were applied to restrict results to RCTs published from 1 January 2013 to 31 December 2023. We included all randomised controlled trials of adult (age ≥ 18 years) patients undergoing elective oesophagectomy surgery in which one group received postoperative TEA. Intraoperative TEA was not included as all patients undergo general anaesthesia; thus, pain control and its sequelae are more relevant for the postoperative period. The full search strategy is detailed in Appendix. No language restrictions were placed on eligible studies.

Study selection

After de-duplication, the primary author screened titles and abstracts against the inclusion criteria to identify potentially relevant papers. One researcher was used at this stage who erred on the side of over-inclusion. The second stage involved full-text review of all potentially eligible studies by two authors and recording the reason for the exclusion of a paper (Fig. 1).

Fig. 1
figure 1

PRISMA flow diagram

Data extraction

One author extracted data from the selected publications using a pre-piloted data abstract tool. All data was checked by a second reviewer. Information included is as per the study characteristic tables below. Data not reported in the studies was recorded as “NR” (not reported), and non-applicable data was recorded as “N/A”. WebPlotDigitizer was used to estimate these numerical scores and standard deviations from graphical data (Rohatgi 2022). Risk-of-bias assessment was completed after data extraction. Two authors individually evaluated the methodological quality of all articles using the Cochrane risk-of-bias assessment tool version 2 (Higgins et al. 2023).

Data synthesis

Meta-analysis was performed on any primary or secondary outcome included by more than one study. For the dichotomous outcomes of PPCs and PONV, incidences of outcomes per group were extracted from each study to allow a pooled meta-analysis of risk ratio estimates with 95% confidence intervals. For the continuous outcomes of 24- and 48-h pain scores for resting and dynamic pain, mean scores and standard deviations for both groups in each study were extracted to allow a pooled meta-analysis of mean difference estimates with 95% confidence intervals.

Where results were presented in included trials as mixed data of median (IQR) and mean (SD), we converted to mean and standard deviation throughout, to enable pooled comparison. Standard deviations for pain scores were imputed for one trial that did not report, by combining the mean standard deviations of other trials (Fares et al. 2014). One trial measured postoperative pain scores twice a day; these morning and afternoon scores were combined to give a single mean and standard deviation for each day (Flisberg et al. 2001). Two trials had four groups of participants we combined into two groups according to the method of postoperative analgesia (Zhu et al. 2020; Li et al. 2019). Standard deviations were computed using an online calculator (StatsToDo) decomposing the mean and standard deviations of two groups into one single group (CombineMeanSD. 2023; Altman 2000).

All meta-analyses were performed using RevMan (Cochrane and Collaboration 2024). Inverse variance was used as the statistical method for both dichotomous and continuous outcomes. Statistical heterogeneity was assessed by using both the I2 and χ2 tests. A random effects model was adopted due to the clinical and methodological diversity between trials. Formal meta-analyses were not possible for other outcomes including technical failure and length of hospital stay due to insufficient data; therefore, this data is presented in tabulated form and/or narratively appraised.

Results

Description of included studies

A total of 330 publications were found over three databases. After filtering for eligibility criteria, 10 randomised trials with 741 patients over 5 countries were included (Table 1).

Table 1 Study information

The details of inclusion and exclusion criteria varied between trials, with many excluding comorbid patients based on ASA grade or individual diseases. Only one trial used a laparoscopic technique for surgery; all others were open surgery. Three trials recorded PPCs, and only two trials recorded length of hospital stay and PONV. However, all trials recorded some form of pain score at 24 and 48 h (Table 2).

Table 2 Inclusion and exclusion criteria and primary and secondary trial outcomes matching systematic review outcomes

Patient characteristics included a mean age of 59.9 years, mean BMI of 22.7 and a proportion of 75.3% males (Table 3). All trials compared TEA to intravenous opioids, with only one trial including a third group of paravertebral and transversus abdominis plane blocks. Regional and intravenous drug regimens varied (Table 4). PPCs, rest and dynamic pain scores are displayed in Tables 5, 6 and 7 respectively and PONV and length of hospital stay in Tables 8 and 9.

Table 3 Patient characteristics
Table 4 Analgesic regimens
Table 5 Postoperative pulmonary complications (PPCs)
Table 6 Pain scores at rest (+ / − standard deviations)
Table 7 Pain scores on movement (+ / − standard deviations)
Table 8 Postoperative nausea and vomiting
Table 9 Length of hospital stay

Regarding overall risk of bias, seven trials were judged as having some concerns, with three being judged as high risk (Fig. 2). Some trials did not mention their randomisation or allocation concealment method. Nearly all trials did not blind their assessors, accounting for zero trials able to be judged as a low risk of bias overall.

Fig. 2
figure 2

Risk-of-bias summary

Postoperative pulmonary complications (PPCs)

Regarding the primary outcome of PPCs within 30 days of surgery, one trial (Xu et al. 2023) used the EPCO definition (Jammer et al. 2015). Another trial (Maghsoudloo et al. 2023) did not state a definition, and the third trial (Fares et al. 2014) reported a composite of PPCs: individual incidences of pneumonia, pleural effusions and ARDS (acute respiratory distress syndrome). These were all included in the meta-analysis of PPCs. None of these trials specified a time limit for recording these complications. The meta-analysis (Fig. 3) suggests TEA may reduce the risk of a composite of PPCs (RR 3.88; 95% CI 1.98–7.61), although this is of lower certainty due to some risk of bias and differences in definition of composites of PPCs. The I2 and χ2 tests suggest little heterogeneity, although there is some uncertainty of these values as the number of studies and sample sizes is small.

Fig. 3
figure 3

Meta-analysis of postoperative pulmonary complications (PPCs)

Postoperative pain scores

Some studies reported only pain scores (Zhu et al. 2020; Wang et al. 2019; Maghsoudloo et al. 2023; Wang et al. 2017; Yokoyama et al. 2005; Liu and Wang 2015) and did not specify rest and dynamic pain scores. In this case, these pain scores were included in the meta-analyses for rest pain scores. Nearly all pain scores were stated as the VAS (visual analogue score), apart from one study which used the box scale and was still included in the pain meta-analyses (Yokoyama et al. 2005). Some dynamic pain scores did not record their dynamic activity or had different activities across trials. All but three trials did not report numerical data for their pain scores and only had graphical data (Li et al. 2019; Wang et al. 2019; Maghsoudloo et al. 2023).

Rest pain was meta-analysed at 24 and 48 h (Fig. 4a and b). All trials but one favoured TEA. Both meta-analyses suggest a significant reduction in pain scores in the TEA group regarding rest pain at 24 (MD 9.02; 95% CI 5.88–12.17) and 48 h (MD 8.64; 95% CI 5.91–11.37).

Fig. 4
figure 4

a Meta-analysis of pain scores at rest at 24 h. b Meta-analysis of pain scores at rest at 48 h. c Meta-analysis of dynamic pain scores at 24 h. d Meta-analysis of dynamic pain scores at 48 h

Four of 10 trials measured dynamic pain scores at 24 and 48 h; the meta-analyses were displayed in Fig. 4c and d. The summary measures again suggested a significant reduction in pain scores in the TEA group at 24 (MD 14.96; 95% CI 5.46–24.46) and 48 h (MD 16.60; 95% CI 8.72–24.47), with a larger effect but wider confidence interval in comparison to the rest pain meta-analyses. The mean difference (MD) is measured in millimetre of the 0–100-mm visual analogue score (VAS). All pain score meta-analyses suggest considerable heterogeneity when assessing their I2 and χ2 tests.

Postoperative nausea and vomiting

Regarding PONV, only three trials reported data, no definitions or time limits were given by any trial and two trials recorded nausea and vomiting as a single event (Xu et al. 2023; Li et al. 2019). One trial recorded them as two separate events (Wang et al. 2017), which were combined into a single event by addition, in order to include it in the meta-analysis (Fig. 5). This meta-analysis did not suggest any difference in reduction of PONV rates by either method of analgesia and should be considered of low certainty.

Fig. 5
figure 5

Meta-analysis of postoperative nausea and vomiting (PONV)

Other outcomes

There was not enough data to allow meta-analysis of length of hospital stay. Only two trials recorded total hospital stay, both without standard deviations and measuring time to the nearest day (Xu et al. 2023; Flisberg et al. 2001). Other trials’ measurements varied including critical care unit stay, postoperative care unit stay and pre- and postoperative hospital stay with different units of measuring time.

Only one trial recorded technical failure of epidurals (Flisberg et al. 2001). This trial recorded 4 failures of epidural analgesia in 18 participants (22%), and the definition of failure and causes was not recorded. Assuming the definition of failure was similar to other trials, this would be in keeping with failure rates reported in the current literature (27–32%) (Hermanides et al. 2012).

Discussion

This meta-analysis provides evidence for a significant reduction in a composite of PPCs in patients receiving TEA compared with IV opioids. However, overall certainty of evidence is low. Two of the three trials assessing PPCs did not adhere to a standardised definition, and one trial measured composites of PPCs such as pneumonia, atelectasis or pleural effusion. A limitation of composite measurements is the lack of clarity as to which component is different. The solution to this is to use core outcome sets (such as those from the StEP — COMPAC group) with standardised endpoints, creating more comparable data for future meta-analyses (Myles et al. 2016; Boney et al. 2022). Also, this result was on the basis of a small number of small trials, with one trial being judged as a high risk of bias.

Regarding non-epidural regional techniques, we found only one trial directly comparing these to thoracic epidural analgesia. Xu et al. (2023) had 3 groups with 56 patients in each group and showed a similar 5 and 7 PPCs in its TEA group versus its PVB/TAP group, contrasting to the larger 17 PPCs in its IV group (Xu et al. 2023). Therefore, this combined single-shot paravertebral and transversus abdominus plane block technique (PVB/TAP) shows promise for the future but would benefit from a larger body of evidence. A disadvantage is that this technique would require expertise and time for two separate procedures; an advantage would be no requirement for running a postoperative neuraxial local anaesthetic infusion with its associated risks.

Regarding the secondary outcome of pain scores, this review shows a significant reduction in pain scores for patients receiving TEA compared to intravenous analgesia. This significance is displayed at 24 and 48 h postoperatively, at rest and during dynamic movement. This is in contrast to the two previous systematic reviews in 2017 and 2018 which did not show a significant difference in pain scores, probably due to a paucity of data at that time (Visser et al. 2017; Hughes et al. 2018). But this is in agreement with a 2024 network meta-analysis of 14 trials, which also suggests a statistically significant reduction in pain scores with epidural versus systemic opioids (Ramjit et al. 2024). However, our results may only be clinically significant for the dynamic pain scores, which have a MD above that of the 10-mm minimum clinically important difference suggested in the literature (Myles et al. 2017). The single non-epidural regional technique in our review (PVB/TAP group) showed higher pain scores in its trial than the TEA group but lower pain scores than the IV group, with varying levels of significance, and poorer pain control as time progressed (Xu et al. 2023).

We were unable to compare patient-controlled epidural analgesia (PCEA) versus continuous epidural analgesia in this review, as only two trials ran continuous epidural regimens (Maghsoudloo et al. 2023), (Fares et al. 2014). There was also a large amount of methodological diversity within trials that allowed epidural boluses, some being clinician bolus only (not PCEA) (Flisberg et al. 2001; Yokoyama et al. 2005). Bolus volumes varied, there were differences in concentration and type of local anaesthetic and some trials had unspecified lockout times (Flisberg et al. 2001; Wang et al. 2017). Epidurals were sited at different thoracic vertebral levels with one trial siting two thoracic epidurals (Yokoyama et al. 2005).

Other limitations include the small numbers of participants in some trials with no large clinical effectiveness trials. Small RCTs can overestimate treatment effects in the real world (Dechartres et al. 2024). There was methodological diversity in the inclusion and exclusion criteria, populations were from different countries and there were differences in surgical approaches. Nearly all included studies used an open approach (large abdominal incision), with one study using a laparoscopic approach. Although there is a lack of evidence regarding analgesic strategies in laparoscopic surgery, omitting this study does not significantly change the results of the meta-analyses. These differences may have contributed to the high statistical heterogeneity in the pain meta-analyses.

Trials had little data on morbidity which was also poorly defined, and many did not assess pain after 48 h. However, within the 48-h postsurgical timeframe, pain scores were well reported, with nearly all trials using a visual analogue scale (VAS) at standardised time intervals. Pain measurement with the VAS is a validated, subjective measure in acute pain which is well understood by patients (Delgado et al. 2024; Haefeli and Elfering 2006). Its measurement at rest and movement at 24 h is also considered a key patient-reported outcome measure (Myles et al. 2018).

Regarding the limitations of this systematic review process, we did not search for trials with non-epidural regional techniques, unless they included epidural analgesia as a comparator group. This was to avoid the bias associated with indirect comparisons within a network meta-analysis (Feenstra et al. 2023). We searched three large databases, but others were omitted. Regarding the data extraction process, the conversion of graphical data to numerical data using online software was required, which does not have perfect accuracy. One trial did not report its standard deviation for pain scores, and this was imputed in order for inclusion in the meta-analyses (Fares et al. 2014). Lastly, a lack of data for less invasive surgical techniques and non-epidural regional techniques is a limitation of this review. However, these surgical techniques are not yet developed at many centres, are not available for all levels of disease progression and have not yet shown clear short- and long-term benefits (Jebril et al. 2024).

This review has the benefit of including recent evidence and being restricted to oesophagectomy patients only, who have very specific analgesic requirements, compared to older systematic reviews which included data from non-randomised trials (Visser et al. 2017) and data from (non-oesophagectomy) gastric surgery patients (Hughes et al. 2018). It is the first systematic review to interrogate respiratory outcomes, albeit a composite, and it is the first to show evidence to support a reduction in a composite of PPCs with TEA versus intravenous analgesia. It also supports the recent network meta-analysis by Ramjit et al., by showing a reduction in pain scores with TEA versus intravenous analgesia (Ramjit et al. 2024). Contextualising this, our systematic review can support clinicians in utilising thoracic epidural analgesia for reducing pain and PPCs. It can aid discussions in preoperative counselling, shared decision-making and during perioperative risk stratification and planning of postoperative care.

Conclusions

This meta-analysis provides evidence that TEA should currently remain the gold standard analgesic technique for reducing pain after elective oesophagectomy. It is also the first review to provide evidence that TEA reduces a composite of PPCs following oesophagectomy surgery, although this conclusion is of low certainty. Future trials are needed to compare TEA administration techniques, including PCEA. Non-epidural regional analgesic techniques should also be considered for future research. Trials must include more recent laparoscopic and minimally invasive surgical approaches, since the benefit and risk profile of TEA may not be generalised to these patient groups. Appropriate powering to detect clinical effectiveness is required, as is the use of core outcome sets with standardised endpoints (Myles et al. 2016).