Background

Esophageal cancer (EC) ranks seventh in cancer incidence and sixth in mortality worldwide [1]. Surgery is usually the cornerstone for curative treatment. Yet, while cancer of the upper third of the esophagus at the cervical and upper thoracic region is relatively rare (accounting for no more than 10% all cases) [2], radical esophagectomy is often precluded. This is because upper third EC, especially cervical, is histologically almost entirely squamous cell, with a high risk of local invasion of adjacent anatomical structures.

The guideline of the National Comprehensive Cancer Network (NCCN) provides no consensus regarding the optimal treatment of upper EC, although definitive radiotherapy is recommended for cervical EC [3]. Unfortunately, upper third EC has a worse prognosis than middle or lower third tumors, with an estimated 5-year overall survival rate of 33% [4].

Prognosis for EC has traditionally relied on tumor-node-metastasis (TNM) staging [4, 5], but there is accumulating evidence that primary gross tumor volume (GTVp) is particularly prognostic and may guide clinical treatment [6]. With advances in diagnostic imaging and therapeutic technology, GTVp has the potential to become a powerful predictor of patient survival. In 2006, Créhange et al. [7] reported that patients with EC had a worse prognosis if the GTVp was 100 cm3 or larger. In 2015, it was demonstrated that primary GTVp was not only a significant predictor of improved survival, but also more powerful than TNM staging as a prognostic factor [8]. In 2019, Lv et al. [9] showed that relative tumor volume was an independent prognostic factor in non-resectable esophageal squamous cell carcinoma (ESCC), and GTV for lymph nodes was an independent indicator of distant metastasis-free survival. In the same year, Chen et al. [10] also reported that GTVp was an independent factor that affected overall survival in ESCC, which could be combined with N stage to improve prediction. Hence, GTVp is an important prognostic factor that may provide new insight for personalized treatment strategies in EC.

Due to the rarity of upper ESCC, prognostic factors remain unclear [11]. Based on the above studies, it is speculated that GTVp may also be prognostic for survival in upper ESCC, and help guide treatment. The current treatment modalities for upper ESCC are mainly radiotherapy, surgery, or radiotherapy in combination with surgery. In addition, some studies have shown that concurrent chemoradiotherapy was effective against upper third ESCC [11, 12], or that upper third EC was especially sensitive to preoperative chemoradiotherapy, which significantly improved survival [13]. Surgery has even been demonstrated recently as better than radiotherapy for therapeutic effect in upper third ESCC [14]. Nevertheless, GTVp was not examined in such studies, and further investigation is warranted concerning optimal therapeutic schedules based on GTVp for upper ESCC.

The current study explored the prognostic value of GTVp in upper ESCC, and the value of GTVp for strategizing individualized treatment. As such, this study provides a clinical reference for individualized treatment of patients with upper ESCC.

Materials and methods

Patients and pretreatment assessment

This retrospective study was performed with the approval of Fujian Medical University Cancer Hospital Institutional Board Review (No. SQ2020–063-01).

For inclusion in the present analysis, patients fulfilled the following criteria: given no prior treatment; with pathologically proven ESCC without distant metastases; the tumor located at the upper thoracic region (as defined by the NCCN guideline [3]); and designated ≤3 by Eastern Cooperative Oncology Group scoring. The pretreatment assessment of all patients was performed according to our institutional protocol [10]. Clinical TNM staging was in accordance with the eighth edition of the Union for International Cancer Control/American Joint Committee on Cancer (UICC/AJCC) staging criteria for EC. Potential subjects were excluded if they had recurrent disease; secondary malignancies; or any tumor located in the cervical, middle, or lower third thoracic esophagus.

Finally, 568 patients with upper ESCC were enrolled in this analysis. Patients were apportioned to the following groups, based on treatment given: definitive radiotherapy/concurrent chemotherapy (RT); radical surgery (S); or pre- or postoperative radiotherapy as well as radical surgery (S + RT). The R, S, and R + S groups consisted of 238, 216, and 114 patients, respectively. Patients in each group may or may not have also received pre- or postoperative chemotherapy, and neoadjuvant or adjuvant chemotherapy of radiotherapy.

GTVp measurement

Chest computed tomography images before treatment were transmitted to a 3D treatment-planning system. Primary target lesions, excluding lymph nodes, were independently outlined by 2 thoracic radiotherapists, each therapist with at least 10 years of clinical practice in the treatment of EC. Any discrepancies were resolved by consensus. The 3D images were automatically reestablished and GTVp measurements were automatically computed by the system.

Treatments

Surgical treatment consisted of radical resection of local tumor and regional lymph nodes, in accordance with institutional procedures described previously [15]. Radiotherapy technology included 3-dimensional conformal radiation therapy or intensity modulated radiation therapy. The definitive [4], pre- [10], and postoperative [16] radiotherapy treatments, including the targets, target dose, and dose limitations to organs at risk have been described as cited. The median radiation dose to the targets for definitive, pre-, and postoperative radiotherapy were, respectively, 61.5 (range, 50.0 to 67.2), 40.0 (36.0 to 50.0), and 50.0 (40.0 to 63.0) Gy with conventional fractions.

Chemotherapy treatments were platinum-based, and administered to 370 (65.1%) of the patients, of whom 113 (30.5%), 163 (44.1%), and 94 (25.4%) were in the S, RT, and S + RT groups, respectively.

Evaluation of treatment adverse effects

The main observational indexes of complications associated with radical surgery were: pneumonia, anastomotic leakage, wound infection, chylothorax, respiratory failure, anastomotic stenosis, and cardiac insufficiency. The main observational indexes of acute toxicities of radiotherapy were acute radiation-induced esophagitis and radiation-induced pneumonitis, which were evaluated based on the criteria of the Radiation Therapy Oncology Group [17].

Surveillance

The surveillance schedule for patients was performed as in our previous study [10]. In short, patients were followed-up every 3 months for the first 2 years; every 6 months for years 3 to 5; and annually thereafter. The final follow-up date was 19 December 2019. The median surveillance time was 41.5 months. The primary endpoint was overall survival (OS), which was calculated from the start of diagnosis to the date of death or last follow-up. The secondary endpoint was progression-free survival (PFS), defined as the date of diagnosis to the date of death, local and/or regional relapse, distant metastasis, or last follow-up.

Statistical analyses

The optimal cut-off of GTVp indicating a survival difference was evaluated using a multivariate Cox proportional hazards regression model, with penalized spline (P-spline) regression, which enabled a nonlinear association between GTVp and OS or PFS. P-spline was performed using the smoothHR package in R, version 3.5.2 (http://www.r-project.org/) [18]. Other data were analyzed using SPSS, version 24.0 (IBM, Armonk, NY, USA).

The clinical features and adverse effects of treatment of the different subgroups were compared with Pearson’s chi-squared or Fisher’s exact tests. OS and PFS rates were calculated using the Kaplan-Meier method, and subgroups were compared with the log-rank test. Multivariate analyses of clinical features were performed to determine independent prognostic factors for OS and PFS, via Cox proportional hazards regression. To adjust selection and confounding bias, the baseline characteristics for comparing different subgroups were compensated by using propensity score matching (PSM) analyses. The unbalanced factors were modulated by PSM with a match tolerance of 0.1. A 2-sided probability value P ≤ 0.05 indicated a statistically significant difference.

Results

Patient characteristics and treatment failure patterns

Among the 568 patients, there were 397 and 177 men and women, respectively (ratio 2.2:1), with a median age of 60 years (range, 38 to 90 y). There were 238, 216, and 114 in the RT, S, and RT + S treatment groups. Additional clinical details are summarized in Table 1, including clinical T (cT), clinical N (cN), and clinical TNM (cTNM) stages; lymph node metastasis (LNM); tumor length; and chemotherapy.

Table 1 Patient Characteristics by Treatment Group

For the entire cohort, the 5-year OS and PFS rates were 44.6 and 40.2%, respectively. The 5-year OS rates of the S, RT, and RT + S groups were 51.7, 34.2, and 51.0%; the 5-year PFS rates were 46.3, 29.0, and 49.6%. Ultimately, 319 patients died of the following: 105 of distant metastasis; 134 of local or regional relapse; and 11 and 69 of other medical or unknown causes. There were 148 patients who experienced tumor relapse (43, 86, and 19 of local, regional, or combined relapse). Overall, 113 patients developed distant metastasis.

Volume-dependent effect of GTVp on OS and PFS

To quantify the effect and optimal cutoff value of GTVp on OS and PFS, a multivariate Cox regression analysis was conducted using P-splines in smoothHR of the R software. The model indicated that the risk (InHR) of abbreviated OS and PFS increased sharply when GTVp was more than 30 cm3 (Fig. 1A, B). These findings confirmed a GTVp-dependent effect, and a GTVp of 30 cm3 was an optimal cut-off volume for survival difference. Consequently, all patients could be stratified into 2 subgroups based on a GTVp of less or greater than 30 cm3.

Fig. 1
figure 1

Optimal cutoff of GTV-p in 568 patients with upper ESCC. The effects of GTV-p on (A) OS and (B) PFS are modeled with a P-spline expansion, with GTV-p as a continuous variable. Estimated logarithm HRs (solid lines) with 95% confidence intervals (shaded area) for the association of GTV-p with OS or PFS according to the degrees of freedom in multivariate additive Cox in smoothHR - Optimal extended Cox - type additive hazard regression unadjusted model

Prognostic value of GTVp

Overall, the average GTVp was 27.30 (1.85 to 238.99) cm3; specifically, 396 and 172 patients had, respectively, GTVp < 30 cm3 and ≥ 30 cm3. The prognostic value of GTVp was determined via univariate and multivariate analyses. The univariate analyses showed that patients with GTVp ≥30 cm3 had markedly poorer 5-year OS and PFS than did those with GTVp < 30 cm3 (Fig. 2A, B). Other prognostic factors for OS and PFS were: gender; cT, cN, and cTNM stage; LNM; and tumor length (Table 2). The multivariate Cox regression analysis confirmed the following as independent prognostic factors for OS and PFS: GTVp, gender, cT stage, and LNM. In addition, N stage was an independent prognostic indicator for PFS.

Fig. 2
figure 2

Overall survival rates of patients with GTV-p < 30 cm3 or GTV-p ≥ 30 cm3. Without PSM: (A) OS; (B) PFS. With PSM: (C) OS; (D) PFS

Table 2 Univariate and multivariate analyses of factors influencing OS and PFS

Various clinical baseline characteristics were significantly different between the patients with GTVp < 30 cm3 and ≥ 30 cm3 (Supplemental Table 1), and PSM was employed to calibrate the potential indication bias between the 2 groups. After matching, the only significantly different clinical features were gender and tumor length, and 132 patients were matched in each group; GTVp remained a prognostic risk for OS and PFS (Fig. 2C, D). Taken together, these findings suggest that GTVp can independently predict OS and PFS.

Prognostic value of treatment modalities stratified by GTVp

As the GTVp of the 3 treatment groups differed so greatly, the possibility of selection bias was considered (Supplemental Table 1). Hence, it was necessary to explore the prognostic effect of treatment, stratified by GTVp. For GTVp < 30 cm3, the OS curves of the RT, S, and S + RT groups appeared to be identical and did not differ significantly (Fig. 3A). The PFS curves of the 3 groups were comparable, expect with a border line difference between the RT and S groups (P = 0.053; Fig. 3B).

Fig. 3
figure 3

Survival rates of patients with GTV-p < 30 cm3. Without PSM among S, RT, and S + RT groups: (A) OS. (B) PFS. With PSM between S group and RT group: (C) OS. (D) PFS

The PSM analysis revealed no marked difference among the various clinical features of the 79 patients in the RT group and the 79 patients in the S group (Supplemental Table 2). It was observed that the OS and PFS curves of the RT and S groups overlapped, respectively (all P > 0.05; Fig. 3C, D).

For GTVp ≥30 cm3, the OS curves of the S + RT and RT groups were significantly better than that of the S group; the 5-year OS rates of the S + RT and RT groups were 45.4 and 21.1%, respectively, while that of the S group was 14.1% (P = 0.009 and 0.031; Fig. 4A). Although the 5-year OS rate of the S + RT group was better than that of the RT, the difference did not reach statistical difference (P = 0.169). In addition, the S + RT group showed markedly better distinction of PFS curves compared with the RT and S groups with the 5-year PFS being 42.8% cf. 19.7 and 42.8% cf. 14.3% (P = 0.003 and P = 0.002, respectively), whereas the PFS curves of the RT and S groups almost overlapped (P = 0.504; Fig. 4A, B). The PFS curve of the S + RT group was significantly better than that of the RT or S groups; the 5-year PFS rates of the S + RT group was 42.8%, whilst that of the RT and S groups were 21.1 and 14.1% (P = 0.003 and 0.002). The PFS curves of the RT and S groups almost overlapped (P = 0.504, Fig. 4A, B).

Fig. 4
figure 4

Survival rates of patients with GTV-p ≥ 30 cm3. Without PSM among S, RT, and S + RT groups: (A) OS. (B) PFS. With PSM between R group and R + S group: (C) OS. (D) PFS

To compensate for the unbalanced baseline characteristics, 27 patients in the S + RT group and 27 in the RT group were selected for comparison of their PFS (Supplemental Table 3). The PFS of the 2 groups remained statistically similar (Fig. 4C). Nevertheless, a borderline significance of difference was confirmed between the overall S + RT and RT groups (P = 0.053, Fig. 4D).

Taken together, the S group experienced the worst OS, while the PFS of the S + RT was significantly better that of the RT group for patients with GTVp ≥30 cm3.

Adverse effects of treatment modalities

The common complications associated with radical surgery were pneumonia (19%) and anastomotic leakage (6.3%; Table 3). Rare complications included wound infection (0.9%), chylothorax (1.7%), respiratory failure (1.1%), anastomotic stenosis (1.4%), and cardiac insufficiency (0.6%). The rates of complications of the S + RT and S groups were similar.

Table 3 Surgical complications of the S and S + RT groups

The common side effects of radiotherapy were acute radiation-induced esophagitis (49.7%) and radiation-induced pneumonitis (15.1%); most side effects were complication of grades I -II (Table 4). The rate of acute radiation-induced esophagitis in the S + RT group (64.4%) was markedly lower than that of the RT group (22.0%, P < 0.001).

Table 4 Radiation toxicity of the S and S + RT groups

Discussion

To our knowledge, the present study comprised the largest cohort of patients with upper ESCC to investigate the significance of GTVp for prognosis and indication of treatment. This study found that a GTVp of 30 cm3 was the optimal cut-off for differences in survival, and GTVp ≥30 cm3 was an independent negative risk factor of OS and PFS. For GTVp < 30 cm3, no significant survival differences were observed among the 3 treatment groups (S, RT, and S + RT). For patients with GTVp ≥30 cm3, the S + RT and RT groups both experienced significantly better OS than did the S group, while the PFS of the S + RT group was superior to that of the RT. In addition, the S + RT group had a significantly lower rate of radiation side effects compared with the RT group. These findings indicate that GTVp was predictive for prognosis and treatment in patients with upper ESCC.

In this study, we firstly confirmed the GTVp-dependent effect on survival, and determined that 30 cm3 was the optimal cut-off point using P-spline regression analyses. GTVp was then found to be an independent prognostic factor for OS and PFS. Recently, several studies have reported a close association between survival outcome and tumor volume in EC [19, 20], but the optimal cut-off points of GTVp varied among studies. Créhange et al. [7] initially found that the optimal cut-off point for differences in OS was 100 cm3. Chen et al. [21] reported that 20 cm3 and 40 cm3 were useful cut-off points for OS differences, based on clinical experience. Using receiver operating characteristic (ROC) curve analysis, Chen et al. [22] identified 39.41 cm3 as a cut-off to predict survival differences, while Chen et al. [10] suggested a cut-off of 28 cm3. The above postulated cut-off values were based either on clinical experience or ROC analysis, and limited by considering only associations between the continuous variable and survival endpoints. In the present study, multivariate Cox regression analysis using P-splines in smoothHR was conducted to determine the optimal cutoff value, which should be more valid because it is based on an association between the continuous variable and survival rates.

To further verify the prognostic value of GTVp, PSM analysis was employed to calibrate the potential indication bias between patients with GTVp < 30 cm3 and those with GTVp ≥30 cm3. It was observed that patients with GTVp ≥30 cm3 indeed experienced poorer OS and PFS rates. In addition, multivariate Cox proportional hazards regression showed that GTVp (HR [hazard ratio] for OS 1.682; HR for PFS 1.673) was more prognostic than T stage (HR for OS 1.245; HR for PFS 1.214). Therefore, GTVp is an important risk factor that independently influences patient outcomes, outside of clinical TNM stage.

An increasing number of studies have indicated that TNM stage may not be sufficient to determine prognosis, as it does not consider GTV that differs between T or N stage [10, 22]. Larger GTVp is understood to represent greater tumor load, a higher percentage of tumor radioresistant hypoxic cells and clonogenic cells, and greater risk to organs surrounding the tumor, which results in poor survival [21]. Furthermore, GTVp should be included in the staging system as an indicator of patients’ prognosis [9, 10]. Taken together, GTVp showed a powerful prognostic value that may make up for the deficiency of the TNM stage, and should be part of a personalized treatment strategy for upper ESCC.

As GTVp in the present study was markedly different among the S, RT, and S + RT groups, the 3 treatment groups were further stratified by GTVp to explore prognostic effect. For GTVp < 30 cm3, there were no significant differences in OS and PFS among the treatments. This may be because of the lighter tumor burden and less tumor infiltration to adjacent anatomical structures, and GTVp < 30 cm3 may result in a significantly better prognosis irrespective of the therapeutic methods. It is noteworthy that 215 (54.3%) and 80 (20.2%) patients, respectively, were included in the S and S + RT groups (Supplemental Table 1), which suggests that most of the patients with GTVp < 30 cm3 were considered viable candidates for radical resection. Even for the 101 (25.5%) patients who did not undergo surgery, definitive radiotherapy was able to achieve good efficacy, because radioresistant hypoxic and clonogenic tumor cells were negligible when the GTVp is less than 30 cm3. Therefore, for patients with GTVp < 30 cm3, we suggest that surgery, or radiotherapy plus surgery, is preferred if the tumors are considered resectable, whereas definitive radiotherapy is best if the tumors are non-resectable.

For GTVp ≥30 cm3, the S group had the worst 5-year OS rate. The extent of tumor local invasion was larger, and most of these patients (81.4%) were staged as locoregional advanced disease. Thus, surgery may not be sufficient to control tumor development when GTVp is ≥30 cm3, and poor OS is the result.

Furthermore, the S + RT group had better 5-year PFS rates than did the RT group. This may be because, first, a more advanced N stage may have been a feature of the RT group. Although the imbalance in clinical characteristics between the RT and S + RT groups was compensated for by PSM, more numbers of positive locoregional nodal metastases may be detected in the RT group if it was given lymph nodal pathological examination. There is surgical data that the rates of positive locoregional nodal metastases in ESCC increase with each T stage, from a low of 31% for T1b, up to 100% for T4 [23].

Secondly, the RT group may have had lower PFS because these patients were more prone to develop cervical lymph node metastases than the S + RT group. It was reported that the most common nodal metastases in upper ESCC are cervical (49.5%) [15]. In addition, patients who received definitive chemoradiotherapy experienced more cervical lymph node recurrences compared with those given radical esophagectomy for ESCC [24], which may be reflected the poorer 5-year PFS rates in the RT group of the present study.

Finally, perhaps the S + RT group had better 5-year PFS rates than the RT because the addition of radiotherapy to radical surgery made loco-regional recurrence more unlikely. Some studies showed that neoadjuvant chemoradiotherapy prior to surgery could result in lower risk of local recurrence and higher disease-free survival in patients with locally advanced ESCC [25, 26]. Consequently, in the present study, the addition of radiotherapy to surgery was associated with preferable survival benefits in patients with GTVp ≥30 cm3.

The S + RT group had no more surgical complications than did the S group. Postoperative complications that have been confirmed negative prognostic factors in EC include pneumonia, pyothorax, and chylothorax, and preoperative chemoradiotherapy was even reported to reduce the negative effect of postoperative complications on patient outcome [27]. In the present study, most radiotherapy treatments when given were delivered postoperatively (16.7% preoperatively), and the S + RT group did not have a significant rate of postoperative complications. Furthermore, the S + RT group showed a significantly lower rate of radiation side effects compared with the RT group, especially for acute radiation-induced esophagitis. This may be because the radiation dose delivered during definitive radiotherapy was much higher than that of either the preoperative or postoperative radiotherapy.

Notably, there were some limitations in this study. Although the cohort was large, it was retrospective, with inevitable selection and confounding bias. A prospective, multicenter study is warranted. Furthermore, this study did not consider the GTV of lymph nodes, which may also influence survival [9]. Because the selection of treatment was primarily based on T stage for non-metastatic ESCC as recommended by the NCCN guideline [3], we emphasized the prognostic and predictive effects of GTVp. Yet, the effects of the GTV of lymph nodes for upper ESCC are worthy of investigation. Finally, this analysis did not integrate other independent prognostic factors with GTVp, such as gender, cT stage, and LNM, which may improve the accuracy of prognosis and choice of treatment. Thus, a nomogram to predict the survival prognosis of patients with upper ESCC is needed.

In conclusion, GTVp is prognostic for OS and PFS in patients with upper ESCC. For GTVp < 30 cm3, no significant survival differences were observed among the RT, S, and S + RT treatment arms. For ESCC GTVp ≥30 cm3, radiotherapy plus surgery was the most effective treatment. These findings may help clinicians strategize individualized treatment for patients with upper ESCC.