Background

Esophageal cancer is one of the common causes of cancer mortality worldwide [1]. In contrast to the Western world, where adenocarcinoma is the common histology, squamous cell carcinoma (SqCC) is the predominant histology in Asia [2]. For locally advanced esophageal SqCC (LA-ESqCC), neoadjuvant concurrent chemoradiotherapy (nCCRT) is one of the standard-of-care options [3,4,5,6].

However, the optimal interval between nCCRT and esophagectomy is debated in the literature [7]. In clinical practice, some interval is needed for patients to recover from the side effects of nCCRT, but delayed surgery might lead to tumor growth. In the experience of nCCRT for rectal cancer, a randomized controlled trial (RCT) reported that prolongation was associated with a higher pathological complete response (pCR) [8], a well-known good prognostic factor [9]. In contrast, another RCT reported that prolongation led to a similar pCR but a higher morbidity [10].

Regarding nCCRT for esophageal cancer, a systematic review of non-RCTs published in 2018 reported that a long interval (> 7–8 weeks, vs ≤ 7–8 weeks) was associated with higher pCR rates but worse overall survival (OS), both with statistical significance [7]. However, all Asian studies included in this study were based on patients treated almost a decade ago. In addition, the results of individual studies included in this systematic review were variable. Given the abovementioned geographic variation, controversy in this topic, and lack of RCTs, we aimed to compare the OS of LA-ESqCC treated with nCCRT and esophagectomy in modern Asia with either long or short intervals via a population-based propensity-score-matched analysis.

Methods

Data source

The Health and Welfare Data Science Center (HWDC), Ministry of Health and Welfare, database is a set of databases providing complete information regarding the Taiwan Cancer Registry (TCR) (data until 2015), the death registry (data until December 31, 2017), and reimbursement data from the National Health Insurance (NHI) (data until December 31, 2016) for the whole Taiwan population, and it is provided by the Bureau of National Health Insurance [11]. The quality of the TCR was reported in 2019 [12]. The NHI research database has also been used in many population-based studies. All of the HWDC data with personal information were deidentified.

Study population and design

The study flow chart, as suggested in the STROBE statement [13], is depicted in Fig. 1. In this retrospective cohort study, we used the HWDC database to identify LA-ESqCC patients who were diagnosed from 2011 to 2015 and treated with nCCRT (radiotherapy 40–50.4 Gy at a dose per fraction of 1.8–2 Gy) and esophagectomy. nCCRT was defined as concurrent systemic and locoregional therapy with preoperative radiotherapy per the TCR record. Patients with other cancer(s) were excluded. The date of diagnosis was used as the index date. We determined the explanatory variable of interest [interval between nCCRT and esophagectomy (long interval (8–12 weeks) vs short interval (4–7 weeks))] based on the cancer registry data; the primary outcome of interest [OS] and other supplementary outcomes [pCR, 30 and 90 day mortality (since surgery), incidence of local regional recurrence (ILRR), and esophageal cancer mortality (IECM)] were extracted from the TCR or determined via linkage with the death registry. OS was calculated from the date of diagnosis to the date of death or December 31, 2017 (censoring date of the death registry). We also considered other covariables [see the next section] to adjust for potential nonrandomized treatment selection and then constructed a propensity-score (PS)-matched sample (1:1 paired matching) to evaluate the effectiveness of the interval between nCCRT and esophagectomy.

Fig. 1
figure 1

STROBE study flowchart and the number of individuals at each stage of the study. 1: We only included those treated (class 1–2) by any single institution to ensure data consistency. 2: Clinical stage II–III, by the 7th American Joint Committee on Cancer staging. 3: Without missing information in the TCR and death registry

Other explanatory covariables

We identified patient demographic factors [age, gender, residency region], patient characteristics [drinking, betel nut chewing, smoking, body mass index (BMI)], disease characteristics [tumor size, tumor differentiation, tumor location, clinical T-stage and N-stage], diagnosis method [use of positron emission tomography (PET)], and treatment characteristics [number of lymph nodes removed, radiotherapy (RT) delivery method, RT dose] as potential confounders based on our experiences in clinical practice and modified from our TCR/NHI related study [6]. These covariables were defined as follows. Patient residency was classified as northern Taiwan or elsewhere. The drinking, betel nut chewing, smoking, and use of PET variables were classified as yes or no. The number of lymph nodes was classified as < 15 or ≥ 15. Tumor size was dichotomized by tumors having a diameter ≤ 5 or > 5 cm. Tumor differentiation was classified as well/moderately differentiated or poorly/undifferentiated. Tumor location was classified as cervical or not. Clinical stage was classified as T1–T2 vs T3–T4 for T-stage and negative vs positive for N-stage. RT delivery was classified as image-guided radiotherapy (IGRT) or non-IGRT.

Statistical analyses

In the primary analysis (PA), we used the propensity score method as advocated in the literature to balance the measured potential confounders [14, 15]. We used a logistic regression model based on all covariables [see the above subsection “Other explanatory covariables”] to evaluate the probability with a long interval [vs a short interval]. Patients were matched on the logit of the propensity score using a caliper of 0.25 standard deviations of the logit of the propensity score via a greedy match algorithm as used in the literature [16]. The standardized difference (SDif) was used to assess the balance of the covariates [17, 18]. We used a robust variance estimator to compare the hazard ratio (HR) of death between PS-matched groups during the entire follow-up period [15] and evaluated the effect of potential unmeasured confounding factor(s) via the E value [19]. Binary outcomes (pCR) within the matched pairs were compared using McNemar’s test. We adopted the subdistribution HR via the clustered Fine–Gray model to evaluate ILRR and IECM [20]. Because of the vague [7–8 weeks] cutoff point used in the recent systematic review [7], we used alternative definitions [(1) 4–8 weeks vs 8–12 weeks; (2) 4–7 weeks vs 7–12 weeks] for the interval between nCCRT and esophagectomy to compare the OS as the first and second supplementary analyses (SA-1, SA-2) via separate PS matching. In the third SA (SA-3), we considered additional covariables [including site patient volume [21, 22] plus number of positive lymph node] and outcome [R0 resection], by constructing another PS-matched population for comparison. Although optimal interval was not specified in the recent treatment guideline [3], 4~6 weeks were commonly used in the RCT [23, 24]. Therefore, we performed the fourth SA (SA-4) by constructing additional PS-matched population to only compare 4~6 weeks vs 6~8 weeks. SAS v.9.4 software (SAS Institute, Cary, NC, USA) was used for statistical analyses.

Results

Study population

As shown in Fig. 1, we identified 160 eligible PS-matched patients treated with nCCRT and esophagectomy between 2011 and 2015 from 7908 esophageal cancer patients (65% locally advanced) as our primary study population and divided them into two groups [long interval group (n = 80) vs short interval group (n = 80)]. All covariates were balanced [SDif < 0.25] after matching (Table 1), though some were not balanced before matching.

Table 1 Patient characteristics of the study population in the primary analysis

Primary analysis

After a median follow-up of 30 months [range 4–81] (median 41 and range 24–81 for the survivors), 83 deaths were recorded (39 and 44 in the short and long interval groups, respectively). The Kaplan–Meier OS curve is shown in Fig. 2. The 1/2/3/4/5-year OS rates [in %] for the short and long interval groups were 89/83, 68/59, 56/51, 45/39, 45/35, respectively. There was no significant difference for HR [1.22; 95% confidence interval (95% CI) 0.78–1.91, P = 0.39] when the long interval group was compared to the short interval group. Our result may be due to an unmeasured confounding variable associated with both treatment selection and survival by a risk ratio of 1.56 [E value] fold each, but weaker confounding could not do so. The results of the HR for ILRR (HR = 1.44, P = 0.29) and IECM (HR = 1.18, P = 0.48) were similar. The pCR rates (55% vs 54% for the short vs long interval groups, P = 1), 30-day mortality (P = 0.06, exact numbers not reported per HWDC policy due to few events), and 90-day mortality (4% vs 9%, P = 0.19) were also not significantly different between the two groups.

Fig. 2
figure 2

Kaplan–Meier overall survival curve (in years) in the primary analysis

Supplementary analysis (SA)

When alternative definitions of the interval between nCCRT and esophagectomy were used, we were still able to construct balanced study populations (Table 2). The results were not significantly different [SA-1: HR for death 1.08, P = 0.71; SA-2: HR for death 1.32, P = 0.10]. In SA-3, we constructed another balanced study population (Table 3) and found that the results were not significantly different [HR for death 1.22, P = 0.35]. There were also no statistically significant differences in the distribution of R0 resection [P = 0.07, exact proportion not reported per HWDC policy due to the small number of events]. In SA-4, we constructed additional balanced study population (Table 4) and found that the results were not significantly different [HR for death 1.01, P = 0.98].

Table 2 Patient characteristics of the study population in the first and second supplementary analyses
Table 3 Patient characteristics of the study population in the third supplementary analysis
Table 4 Patient characteristics of the study population in the 4th supplementary analysis

Discussion

In our analysis of LA-ESqCC treated with nCCRT and esophagectomy in this population-based study from modern Asia, we found that OS was similar for long and short intervals between nCCRT and esophagectomy.

We searched the literature up to May 2019 by using the same strategy as used in the recent systematic review [7] to see if there were other modern studies and found two population-based studies from North America and another two single-institution studies from Asia [25,26,27,28]. Azab et al. utilized the American National Cancer Database (NCDB) to identify more than 5000 patients (81% adenocarcinoma) and found that SqCC groups had similar OS across interval lengths [25]. Franko and McAvoy used the same NCDB specifically for SqCC and found that OS was not affected by the interval length [26]. Furukawa et al. identified 134 patients from a Japanese hospital and reported that OS survival rates did not significantly differ between the two groups (≤ 8 or > 8 weeks) [27]. Roh et al. identified 348 Korean patients and found no significant difference in OS between the groups [P = 0.101] [28]. Our results were similar to the results of these four studies in that the OS between different interval length groups was similar.

However, there were inherent limitations in our analysis. As in all nonrandomized studies, our results were prone to potential unmeasured confounder(s), although we used PS matching to balance observed covariables. There was a risk of unmeasured confounders (such as surgical techniques or systemic therapy details), so we reported the E value, as suggested in the literature [19]. For example, a transthoracic approach has been reported to lead to a trend of favorable long-term outcomes [29] and taxane has been used in modern neoadjuvant trials with excellent results [23]. Besides, the importance of the anastomotic sites or the surgical fields was debated in the literatures [30]. However, these factors were not considered in our study due to the data not being available. Some potential pathological factors like extranodal extension, perineural invasion, or lymphovascular invasion were also not included due to the same data limitation. Therefore, phase III RCTs are needed to clarify the findings from our study and other studies. However, when we searched the clinical trial registry [https://clinicaltrials.gov/] in March 2019 using the keywords “esophagectomy | Interventional Studies | Esophagus Cancer | Phase 3”, we found no relevant studies. Therefore, we believe that our study provides useful information until higher-level data are available.

Conclusions

In this population-based study from modern Asia, we found that for LA-ESqCC patients treated with nCCRT and esophagectomy, OS was similar for long and short intervals between nCCRT and esophagectomy. Randomized controlled trials are needed to clarify this finding.