In Western countries, the standard treatment of locally advanced esophageal cancer consists of neoadjuvant chemo(radio)therapy (nCRT) followed by esophagectomy.1 Definitive chemoradiotherapy (dCRT) is an alternative treatment with an option for salvage esophagectomy in patients with residual or recurrent disease.2 Alternative treatment strategies are being explored that also try to minimize the role of surgery.3,4,5,6 Active surveillance after nCRT refers to an experimental approach in which patients with a complete response after nCRT are closely monitored for cancer regrowth instead of planned esophagectomy. Non-inferiority of overall survival at 2 years for active surveillance compared with planned surgery was demonstrated in the SANO trial.7

Given the complexity and evolving nature of esophageal cancer treatment, it is important to establish consensus on the definitions and semantics of the multimodal treatment modalities, including, but not limited to, active surveillance. For example, the term ‘salvage surgery’ is not uniformly defined in the literature and this limits the comparison of studies on dCRT.8,9,10,11 A unified terminology and conceptual framework are essential for precise communication, comparison and interpretation of outcomes, and future research collaboration. Organ-sparing treatments have already been implemented for prostate and rectal cancer, and consensus on outcome measures and definitions of terms used in clinical practice has been established.12,13

We conducted a Delphi study aimed at achieving expert consensus on the semantics of esophageal cancer treatment, including nCRT followed by resection or active surveillance and dCRT. The results of this study may help to frame further research and clinical practice guidelines on multimodal esophageal cancer treatment.

Materials and Methods

This Delphi consensus study was conducted between June 2022 and June 2023. The Delphi method is a structured iterative survey-based technique to facilitate collective decision making and capture the breadth of expert opinions in a systematic manner.14 This study has not been submitted for review to the Medical Ethical Committee as it did not involve patients or privacy-sensitive data.

Participants

Through email invitations, an international expert panel of surgeons, medical oncologists, gastroenterologists, and radiation oncologists were invited to participate in this study. All specialists practiced in a hospital providing active surveillance for esophageal cancer, and published studies on active surveillance. The study coordinators composed of two surgeons (BW, SL), one medical oncologist (BM), one gastroenterologist (MS), and one radiation oncologist (JN) from the Erasmus MC Cancer Institute. The international panel comprised a total of 49 participants representing three continents, including Europe, Asia, and the USA (Table 1).

Table 1 Characteristics of the participating experts

Literature Review

A systematic search was performed by an independent librarian in the Embase, Medline, Cochrane and Web of Science databases to identify commonly used terminology and definitions in esophageal cancer treatment (Online Resource Table S1).

Treatment terminologies for esophageal cancer were categorized into three groups: nCRT followed by surgery, nCRT followed by active surveillance, and dCRT with the possibility of subsequent resection.

Delphi Survey Round 1 (June 2022)

Participants were requested to rate and comment on open-ended questions. They were given the opportunity to add terminology or leave comments for the coordinating group. Surveys were distributed via email using Castor Electronic Data Capture (EDC) version 2023.3.0.1 (Castor, Amsterdam, The Netherlands), and an automatic reminder was sent after a 2-week period to participants who had not responded.15 The survey was open for response for 12 weeks from the date of invitation and the results were discussed by the study coordinators of the Erasmus MC.

Delphi Survey Round 2 (December 2022)

Participants were asked to express their agreement or disagreement with 33 statements in an online survey using Castor EDC. These statements were formulated by the study coordinators based on the findings from the initial Delphi survey round. The results of the first round were shared with the participants using text or images. Statements that fell beyond the study's scope, or did not align with the respondents' initial interest were not further questioned in the study. Participants had the opportunity to provide comments or add free text at the end of the second survey. The results of the second survey were then discussed with the study coordinators after a period of 8 weeks.

Online Consensus Meeting (March 2023)

One week prior to the scheduled online consensus meeting, a list of items that reached consensus in both the first and second Delphi surveys was shared, as well as items on which no consensus was reached. The 1-h online meeting was hosted using Microsoft Teams Version 1.6.00 (Microsoft Corporation, Redmond, WA, USA). During this meeting, participants were presented with 10 additional statements or definitions pertaining to the treatment of esophageal cancer. A poll was utilized to gather the participants' agreement or disagreement on these statements, allowing for subsequent discussions among the participants. Following the conclusion of the meeting, the results were reviewed and discussed by the study coordinators.

Delphi Survey Round 4 (May 2023)

A final online survey was conducted using Castor EDC. Participants were asked to express their agreement or disagreement on the remaining statements, which were formulated based on the outcomes and discussions from the online consensus meeting. The results of the previous round were shared with participants using text or images. The survey remained open for 8 weeks. Responses were discussed by the study coordinators with the aim to generate a final report.

Statistical Analysis

Prior to start of the Delphi consensus study, the Erasmus MC panel established the definition of consensus as having an agreement threshold of ≥75%. This threshold aligns with the commonly reported median of 75% utilized in current literature as a widely accepted criterion for consensus.16 The statements and definitions that achieved consensus were analyzed using descriptive statistics and were presented as frequencies expressed in percentages (%).

Results

Literature View

The search yielded a total of 645 articles. Following a thorough evaluation of titles and abstracts (by researcher CZ), 101 articles were selected for extracting terminology as they discussed terminology relevant to esophageal cancer treatment. Articles that did not meet these criteria were excluded. The extracted terms were reviewed and supplemented with suggestions from the Erasmus MC panel. A final list of 38 terms was compiled, for which consensus was sought among the (inter)national participants.

Participants

A total of 72 national and international experts were invited to participate in the Delphi consensus study, of which 49 participated in the first round (response rate 68%) and 44 in the second round (response rate 61%). The 45 participants who responded to the first and second rounds were invited to the online consensus meeting (21 attended, response rate 47%), and 39 of 45 (87%) experts participated in the fourth round (Fig. 1).

Fig. 1
figure 1

Overview of response to each Delphi round

An overview of the statements on which consensus was reached (87.1%) is shown in Table 2. The results for each Delphi round are shown in Online Resource Tables S25.

Table 2 An overview of the statements on which consensus was reached

Neoadjuvant Chemoradiotherapy Plus Surgery

The consensus among participants was that the main goal of nCRT is to downstage the tumor and to facilitate a radical (R0) resection, with the potential added benefit of treating micrometastases at an early stage. The recommended chemotherapy of the nCRT regimen was a combination of a platinum with a taxane, a platinum in combination with a 5-fluorouracil derivate, or a platinum combined with vinorelbine. Radiotherapy was defined as a total dose of 41.4 Gy. Surgery that is part of trimodality treatment and follows nCRT should be defined as planned esophagectomy.

Neoadjuvant Chemoradiotherapy Followed by Observation

An observation strategy after achieving a clinical complete response following nCRT should be defined as active surveillance. The main goal of this strategy is to omit surgery in patients with a persistent complete remission or to postpone surgery until the presence of (loco)regional tumor is detected in the absence of distant metastases. Response to therapy can be assessed through response evaluations. Participants recommended planning the first response evaluation 6–8 weeks after the completion of nCRT. Recommended tests include a diagnostic computed tomography (CT) scan, 18F-fluorodeoxyglucose positron emission tomography (18F-FDG PET)/CT scan, 18F-FDG PET/CT combined with a diagnostic CT scan, upper endoscopy with (bite-on-bite) biopsies, and endoscopic ultrasonography with fine-needle aspiration of suspicious lymph nodes. The Delphi study did not delve into the sequence of these tests.

If no tumor is detected during active surveillance, it should be classified as a clinically complete response, while it should be called residual disease/cancer/tumor if a tumor did not (completely) respond to nCRT. If surgery is indicated during active surveillance (irrespective of the interval after nCRT), the consensus was to defined it as esophagectomy after surveillance.

Disease Categories

Participants reached consensus on the definitions of different disease categories. Local disease refers to a tumor confined to the esophagus, while regional disease involves regional lymph nodes. Locoregional disease is characterized by the presence of tumor in both the esophagus and the regional lymph nodes. Distant metastases are defined as the presence of metastases in distant organs or non-regional lymph nodes that occur during active surveillance but outside the time-window to surgery. If metastases (distant) occur after nCRT but before the normal time window for surgery (i.e. 10–12 weeks after nCRT), they should be called interval metastases.

Definitive Chemoradiotherapy (With or Without Esophagectomy)

The main goal of dCRT is to cure esophageal cancer without performing surgery. It was widely agreed that the indications for dCRT included T4b carcinoma, patients with cervical (proximal) esophageal tumors (distance from upper esophageal sphincter not further defined), patients who are unfit for surgery, and patients who refuse surgical intervention. The recommended dCRT regimen consists of a combination of a platinum with a taxane, a platinum in combination with a 5-fluorouracil derivate, or a platinum combined with vinorelbine, combined with a total dose of 50.4 Gy radiotherapy. When surgery is performed in the case of recurrent disease following dCRT, it should be called salvage esophagectomy.

No consensus was reached on the optimal approach for follow-up after dCRT in various clinical scenarios.

Discussion

This international Delphi study led to a consensus on several aspects of esophageal cancer treatment. Consensus on statements was categorized according to different treatment modalities that focused on the main goal and regimen of therapy, evaluation of response to therapy, including diagnostic tests, and the definitions of different disease categories. Despite consensus being reached for 27 items (87%), the international expert panel could not agree on the optimal follow-up strategy after dCRT.

The consensus statements from this study do not represent the definite or sole truth. They offer a valuable and all-encompassing summary of the consensus reached. For example, consensus was reached on the need for diagnostic tests during response evaluations; however, the value of conducting an 18F-FDG PET/CT in combination with a diagnostic CT scan for each response evaluation (consensus 75%) is debatable. Some participants argued that a diagnostic CT scan is only of value when residual tumor has been confirmed and surgical resection is planned. On the contrary, the added value of a diagnostic CT scan can be questioned in patients with a complete response on the 18F-FDG PET/CT and endoscopy. We recommend to carefully consider its use given the associated costs and limited information when esophagectomy after surveillance is not indicated. Furthermore, new diagnostic tests may become available over time, potentially requiring a re-evaluation of these study findings. Variation in recommended diagnostic tests may also exist between countries due to costs/resources or because specific diagnostic tests are not yet available.

The efficacy and safety of active surveillance as an alternative to esophagectomy after nCRT has been shown at 2 years of follow-up and we are waiting for the long-term survival results.4,5,6,7 Smaller retrospective studies exploring the feasibility and safety of active surveillance after nCRT have shown similar outcomes.17,18,19,20,21 A recent systematic review of individual patient data also showed similar overall survival in patients who underwent active surveillance compared with patients with planned esophagectomy after nCRT.22 Considering these studies, and in anticipation of the long-term results of the SANO trial, we feel that active surveillance is an alternative treatment strategy that could be offered. This is supported by the outcome of a discrete choice experiment where patients were willing to accept a 16% lower 5-year overall survival rate if the possibility of esophagectomy after surveillance could be reduced to 30%.23,24 Even if active surveillance does not demonstrate to be equivalent overall survival after long-term follow-up results are known, it can remain a viable option for patients who prefer a non-surgical approach considering the associated risks of planned esophagectomy, including altered quality of life after surgery. Therefore, consensus on definitions and terminology as outlined in this Delphi study is needed.

The terminology used for surgery following a prolonged period after nCRT varies. Several studies refer to this as ‘salvage’ surgery.25,26 In alignment with the results of this Delphi study, the term 'salvage surgery’ should be used when esophagectomy is performed for recurrent disease after dCRT (radiation dose ≥50.4 Gy) where surgery is not part of the planned treatment regimen given a patient’s fitness or disease characteristics. When disease recurs, restaging should be performed, and only in a minority of patients may salvage esophagectomy be feasible and offer a chance for cure. This differs from esophagectomy within active surveillance after nCRT. Chemoradiotherapy dose differs and, as such, perioperative morbidity and mortality. In active surveillance, all patients should be eligible for surgery as the goal is not to prevent surgery but only to operate when cancer recurs. As the SANO trial shows, esophagectomy after surveillance is safe and results are similar to planned esophagectomy.7 The experts from this Delphi study therefore agreed that surgery performed after nCRT in the case of residual cancer detected during active surveillance should be referred to as 'esophagectomy after surveillance'. When esophagectomy follows nCRT, irrespective of response to treatment and outside an active surveillance strategy, it should be defined as 'planned esophagectomy'.

The large number of experts across three continents who actively participated is a strength of this study and increases its robustness and external validity.27 Second, the addition of an online consensus meeting between participants facilitated real-time interactions and dynamic discussions, enabling participants to engage in the exchanges of ideas, leading to quicker consensus building. Additionally, the use of virtual meeting tools enhanced flexibility, accommodating varied scheduled time zones and ensuring maximum participation from participants across the world.

There are a few limitations to consider. The study coordinators of the Erasmus MC made diligent efforts to invite international participants who were actively involved in providing active surveillance for esophageal cancer. We felt that these clinicians should be involved in this consensus document as they best understand the semantics of the multimodal treatment strategies for esophageal cancer. Nevertheless, only a few participants had a large experience with this treatment, which may have introduced some degree of selection bias. Efforts were made to invite participants from various regions and disciplines to ensure a diverse and representative panel. Second, while the consensus threshold of ≥75% aligns with commonly reported criteria, it is important to realize that this is a somewhat arbitrary threshold. Nevertheless, this threshold has been widely accepted in the literature and is consistent with other Delphi studies.

Conclusion

This Delphi consensus study successfully attained a consensus among international experts on different facets of esophageal cancer treatment. The insights obtained from this study can promote the correct interpretation of study results and enhance the comparison of study results. It is hoped that consistent terminology will minimize misunderstandings, streamline data consolidation, and facilitate collaboration among healthcare providers from diverse geographical regions.