Introduction

Esophageal cancer is the seventh most frequently diagnosed cancer and the sixth leading cause of cancer-related deaths in the world, representing a major global health problem [1, 2]. Combined modality therapy has been shown to improve survival in patients with loco-regional disease compared to surgery alone, since a complete (R0) tumor resection cannot be achieved in about 30% (T3)–50% (T4) of cases [3]. Surgery with perioperative treatment is the gold standard treatment for resectable esophageal cancer, and exclusive chemoradiotherapy (CRT) is an alternative option for squamous cell carcinoma [3]. Despite recent improvements in the available treatment modalities, overall survival (OS) remains poor, with a five-year rate of about 15–20% [4]. A better stratification of these patients based on pre-treatment predictive and prognostic factors could optimize treatment strategies. Clinical factors with negative impact on OS include the following: malnutrition, obesity, male gender, cardiological comorbidities, and low socio-economic status [5]. Clinical stage and lymph node involvement are the main tumor-related factors influencing the prognosis of esophageal cancer [6, 7]. Five-year net survival for patients with esophageal cancer shows a difference between Stages 2 and 3, ranging from 30% at Stage 2 to 16% at Stage 3 [8]. Nodal invasion, quantified by absolute number (≥ 4) or ratio of positive nodes ≥ 0.2 compared to all examined lymph nodes, is an independent prognostic factor for OS [5, 7]. Five-year survival rates of 63% for pN0 and 30% for pN + lesions are reported [6, 7]. Moreover, the analysis of markers for treatment response is an important prognostic factor. Positron emission tomography (PET) with 2-deoxy-2-[fluorine-18]-D-glucose (18F-FDG) fused with computed tomography (18FDG-PET/CT) has an emerging role in tumor staging, radiotherapy treatment volume delineation, response assessment and follow-up for several tumors [9, 10]. In esophageal cancer, 18F-FDG PET/CT may lead to a change in treatment management in up to a third of patients, representing the most accurate imaging modality for detecting distant metastases [11,12,13]. It is still matter of debate if metabolic parameters obtained by PET/CT could have a predictive and prognostic role [14,15,16,17,18,19,20]. Some of these parameters, such as maximum standard uptake value (SUVmax), SUVmean, SUVpeaks, tumor functional longitudinal length, metabolic tumor volume (MTV) and total glycolysis (TLG) have been proposed as prognostic indicators in esophageal and other solid tumors [19, 21,22,23].

The aim of the present study is to evaluate the role of 18F-FDG PET/CT as possible predictive and prognostic factor in a series of patients with esophageal carcinoma, treated with combined modality treatments.

Materials and methods

Clinical records of patients with histological diagnosis of esophageal carcinoma referred to the Division of Radiation Oncology of the University Hospital “Maggiore della Carità” in Novara, Italy, from 2012 to 2020, were collected. The study was approved by the local review board following the rules of our Institution. All the procedures followed were in accordance with the ethical standards of the responsible committee on human experimentation and with the Helsinki Declaration of 1964 and all subsequent revisions. All patients were informed about the possible use of their documentation in the research study and gave their informed consent prior to their inclusion in the study. Inclusion criteria were as follows: biopsy-proven esophageal carcinoma, age ≥ 18 years, treatment with CRT (neoadjuvant or definitive), and pre-treatment staging including esophagogastroduodenoscopy (EGDS), computed tomography (CT), 18F-FDG PET/CT, ultrasound endoscopy (EUS), and bronchoscopy when indicated. Staging was defined according to the American Joint Committee on Cancer (AJCC) TNM classification 8th ed. [24]. All cases were discussed within a multidisciplinary tumor board comprising pathologists, radiologists, gastroenterologists, surgeons, and radiation and medical oncologists. All patients were treated with combined CRT.

For radiotherapy purposes, each patient had a pre-treatment planning spiral CT (Aquilion Large Bore, Toshiba, Japan). A radiation oncologist with expertise in gastro-intestinal cancers delineated the clinical target volume. Radiation therapy was delivered with a linear accelerator using a 6MV photon beam. Concomitant chemotherapy regimens included weekly carboplatin + paclitaxel, as per the CROSS trial [25] or other platinum-based regimens (cisplatin + 5-fluorouracil, carboplatin monotherapy, FOLFOX). All patients were further assessed at 4–6 weeks after the end of treatment, with EGDS and CT. Those, who underwent neoadjuvant CRT and were deemed unsuitable for surgery after restaging, were candidates for radiation boosts.

Pre-treatment 18F-FDG PET/CT was performed with the same PET/CT scanner (Biograph 16 Hirez, Siemens, Germany) and evaluated both automatically and manually to homogenize all the acquired parameters. PET was performed by OSEM3D reconstruction with 48 equivalent iterations (2i × 24 s), Gaussian smoothing filter 4 mm, FOV 168 × 168, voxel size 4.8 × 4.8 mm, slice thickness 2 mm and emission scan duration from 2 to 3 min per bed position. CT was performed with 100 kV, 36 mAs, slice thickness of 5 mm and FOV 512 × 512. CT images were used for both attenuation correction of PET data and localization of pathological FDG uptake. CT scan was performed without administration of intravenous contrast with a low-dose protocol for CT acquisition. The blood glucose levels of all patients were measured before the injection of 18F-FDG and were < 190 mg/dL. The injected activity of FDG was 3 MBq/Kg and the mean time between injection and acquisition was 81 min (range 57–118 min). For each primary tumor, the following parameters were analyzed: SUVmax, SUVmean, MTV and TLG according to the European Agency of Nuclear Medicine (EANM) guidelines [26]. SUVmax was calculated to determine the 18F-FDG PET activity and recorded using a volumetric region of interest (VOI), positioned around the pathological 18F-FDG uptake in the attenuation-corrected images. Each VOI was checked visually to exclude areas of physiological uptake. MTV was defined as the volume of hypermetabolic tissue with a threshold higher than 41 and 50% of the maximum pixel value in the primary tumor [26]. TLG was defined as the SUVmean multiplied by the MTV. Data were collected to evaluate metabolic parameters detected by 18F-FDG PET/CT prior to CRT in relation with treatment response, assessed with post-treatment imaging (CT and EGDS) and OS, disease-free survival (DFS) and loco-regional control (LRC). In addition to PET parameters, other potential prognostic factors were evaluated as: patient’s gender, primary tumor site, histological type, surgical intervention, margin status (R0 vs. R1/R2), and type of chemotherapy. Post-CRT response assessment on restaging was based on the following criteria: (a) the absence of neoplastic lesions within the radiation field was considered a complete response (CR), (b) a > 30% reduction in the diameters of the neoplastic lesion in the various projections was considered a partial response (PR), and (c) the progression or < 30% diameter reduction for neoplastic lesion was deemed as a non-response (NR). Patients with PR and CR were considered as responders, and those with NR as non-responders. During follow-up, patients underwent the same exams as post-treatment assessment every 6 months for the first 3 years and then every year. Diagnosis of recurrent tumor and distant metastasis was based on clinical and radiological evidence of tumor relapse.

Statistical analysis

All semiquantitative PET imaging parameters (SUVmax, SUVmean, MTV, and TLG) calculated with isocontours of 41 and 50% were expressed as mean ± standard deviation (SD), median and range. Their normal distribution was assessed by means of the Shapiro–Wilk test. Intra-observer reproducibility was assessed by calculating the intraclass correlation coefficient in the semiquantitative parameters of the PET imaging determined by the same operator in two successive trials on the same dataset.

Two-side Student’s t tests for unpaired data (for normally distributed variables) or Mann–Whitney U test (for non-normally distributed variables) were used to compare the semiquantitative parameters of the clinical variables categorized as follows: patient’s gender (male vs. female), T-site (3rd upper vs. 3rd middle–lower), lymph node metastases (N + vs. N0), histological type (squamous cell carcinoma vs. adenocarcinoma). Student’s t test or Mann–Whitney U test were also used as appropriate to compare PET parameters and treatment response, divided into responders (CR + PR) and non-responders (NR).

The correlation of metabolic characteristics with patient’s age and longitudinal extension of the primary tumor was assessed using the Pearson’s correlation coefficient. Only statistically significant correlations were noted in the Tables. To evaluate the association between SUV, MTV, TLG and survival, the median value of every metabolic parameter was used as the cut-off. OS, DFS and LR control rates were calculated using Kaplan–Meier analysis stratified according to cut-off value (median value) and compared using the log-rank test. DFS was defined as the time after treatment during which no sign of any tumor relapse was observed, and LRC was defined as the time after treatment without local or regional tumor relapse. Both were calculated from the end of treatment. Follow-up time was analyzed from the last day of CRT to the date of the last follow-up, recurrence, or death. The same tests were used to compare survival (OS and DFS) among clinical variables categorized as follows: gender (male vs. female), primary tumor site (3rd upper vs. 3rd middle–lower), histological type (squamous cell carcinoma vs. adenocarcinoma), surgery (Yes vs. No and R0 vs. R1/R2) and type of chemotherapy (CROSS trial vs. cisplatin-based). Yates’s chi-squared test was used to evaluate any possible associations between surgery and PET parameters. All statistical analyses were performed using Statistica 6.0 (Stasoft Inc., Tulsa, OK, USA) and statistical significance was set at p value ≤ 0.05.

Results

Forty-three patients meeting the inclusion criteria were enrolled in the current study. Most presented with locally advanced stage (90.7% T3-T4; 83.7% N +). Clinical characteristics of the patients are listed in Table 1. They were treated with external beam RT to a median total dose of 44.0 Gy (range 44.0–59.4 Gy, including boost), with standard daily fractionation (1.8–2 Gy). Seven patients were treated with 3-dimension-conformal RT (3D-CRT), 19 with static intensity modulated radiotherapy (IMRT) and 17 with volumetric modulated arc therapy (VMAT). Radiotherapy was delivered with neoadjuvant intent in 31 patients and definitive in 12. Eighteen patients received concomitant chemotherapy with weekly carboplatin + paclitaxel and 12 received only platinum-based regimens. Treatment modalities are reported in Table 2. According to the assessment criteria, eight patients obtained CR (18.6%), 22 PR (51.2%), and 13 patients NR (30.2%). Twenty of 31 patients treated with neoadjuvant intent were eligible for surgery (6–8 weeks after CRT). Six of the 20 patients (30%) who underwent surgery showed a pathological CR (ypCR), while only two of the 23 non-surgical patients (8.7%) showed a clinical CR complete response (ycCR). The remaining 11/31 patients were not eligible for surgery due to local progression and infiltration of vascular structures in five patients and progression outside the radiotherapy volume in four patients (two at the level of celiac lymph nodes and two with metastases to liver, bone or lung). Intra-observer reproducibility over two trials on the same data showed intraclass correlation coefficients ranging from R = 1 (SUVmax) to R = 0.98 (SUVmean) demonstrating the excellent reliability of this semiautomatic technique.

Table 1 Patients’ clinical characteristics
Table 2 Patients’ treatments

The mean values and SD of SUVmax, SUVmean, MTV, and TLG of the baseline [18F] FDG-PET/CT are reported in Table 3, together with the result of the Shapiro–Wilk test for the normality of the distribution.

Table 3 18F-FDG PET parameters of the primary tumor

The associations between metabolic characteristics and tumor-related variables are summarized in Tables 4, 5. SUVmax (p = 0.03), SUVmean50 (p = 0.03) and SUVmean41 (p = 0.04) values were significantly higher in N + than in N0 cases. SUVmax (p = 0.05), SUVmean50 (p = 0.04) and SUVmean41 (p = 0.03) values were significantly higher in the squamous cell carcinomas than in the adenocarcinomas. MTVs and TLGs values (p < 0.01) were found to be directly proportional to the longitudinal extension of the primary tumor, but, in this regard, both were calculated using parameters which derived from the tumor size. Conversely, PET parameters were not influenced by patient age or tumor location in the upper, mid or lower esophagus.

Table 4 Metabolic characteristics in relation to patient and tumor-associated variables (Student’s t test)
Table 5 Correlation of metabolic characteristics with patient’s age and longitudinal extension of the primary tumor

No significant difference was found between 18F-FDG PET/CT parameters between responders (CR + PR, N = 30) and non-responders (NR, N = 13). No metabolic parameters with a statistically significant predictive value of response emerged from our analysis (Table 6). After a median follow-up time of 7 months (range 1–98 months), the 3-year OS, DFS, and LRC rates were 32, 35, and 32%, respectively. Of the 30 responding patients (CR + PR) after CRT, 19 (63.3%) developed loco-regional recurrence or distance relapse. At the time of analysis, 32 patients have died (74.4%), 25 of them (78.1%) with progressive disease. Patients with MTV50 < 8.1 cm3 (p = 0.04) and MTV41 < 11.3 cm3 (p = 0.04) were correlated to a better OS, in both cases with a 3-year OS rate of 33 vs. 20% (Figs. 1, 2, Table 7). No statistical significance was found amongst the other PET parameters (SUVmax, SUVmean, and TLG) and OS. Among the other potential prognostic factors, surgery (Yes vs. No) and radicality of resection (R0 vs. R1 + R2) were found to be correlated with better OS, (p = 0.00074 and p = 0.00006, respectively) (Fig. 3 and Fig. 4).

Table 6 18F-FDG PET parameters and response assessment (Student’s t test)
Fig. 1
figure 1

Log-rank test between OS and MTV50 (< 8.1 cm3 vs. ≥ 8.1 cm3)

Fig. 2
figure 2

Log-rank test between OS and MTV41 (< 11.3 cm3 vs. ≥ 11.3 cm3)

Table 7 18F-FDG PET parameters in relation to OS, DFS and LR control (log-rank test)
Fig. 3
figure 3

Log-rank test between OS and surgery (Yes vs. No)

Fig. 4
figure 4

Log-rank test between OS and surgical radicality (R0 vs. ≠ R0)

To investigate whether surgery influenced the finding of statistical significance of MTV on OS, we analyzed the association between surgery and MTV and found no correlation for these two variables (Yates’s chi-squared test = 2.79, p = 0.01).

Discussion

Despite the improvements in the multimodality treatment of esophageal cancer, the prognosis of this tumor is still poor. Prediction of patient prognosis and survival in the preoperative setting could guide the choice of personalized neoadjuvant/adjuvant treatment strategies. In this regard, metabolic imaging plays a key role in the staging of esophageal cancer and could provide predictive and/or prognostic information. More specifically, SUV max and analysis of tumor response before and after CT/CRT, together with MTV and TLG were suggested as prognostic parameters [19, 20]. Currently, there is no accepted threshold for SUVmax and different definitions of MTV have been used, making it difficult to compare the studies and evaluate the usefulness of MTV [5]. These contrasting results could be related to the inhomogeneity of the 18F-FDG tumoral uptake pattern that is associated not only with increased metabolism, but also with several other parameters such as perfusion, cell proliferation and density, and tissue hypoxia. As a matter of fact, SUVmax reflects only the most active part of the tumor and does not necessarily correlate with the entire tumor burden [27]. Our analysis suggests the MTV to be a prognostic statistically significant parameter. In terms of OS, high values of MTV50 (p = 0.04, cut-off 8.0 cm3) and MTV41 (p = 0.04, cut-off 11.3 cm3) correlate with a worse prognosis (3-year survival: 33 vs. 20%). These cut-off levels could be considered the optimal ones emerging from our experience. These results are in line with other data reported in the literature. A study by Chhabra et al. in 51 patients found a correlation between pre-treatment MTV and prognosis in terms of OS, DFS, and LRC; in particular, high values of MTV with cut-off 19 cm3 correlated with a worse prognosis [18]. A similar analysis was conducted by Hoseok et al. and Li et al. obtaining a cut-off value for MTV of 1.0 cm3 and 10.5 cm3, respectively [19, 20].

Literature data indicate TLG also as a prognostic factor: TLG is the product of the MTV and SUVmean and represents both the degree of 18F-FDG uptake and the size of the metabolically active mass [15, 28]. Our data suggest a correlation trend between prognosis in terms of OS and DFS and TLG50 and TLG41 (p = 0.08 and p = 0.09, respectively). The differences between studies evaluating SUV, MTV, and TLG parameters could be explained by different factors. In most cases, these are retrospective studies with limited case series, hampering accurate comparisons. Moreover, differences in SUV, MTV, and TLG between different tumor histologies are not known in esophageal cancer and may affect the different prognostic roles of MTV and TLG [15]. We did not identify significant predictive parameters; only a trend was found according to the achievement of a complete or partial response (CR + PR) to CRT; responding patients showed lower but non-significant MTV values than those in the non-responder group (mean MTV50 of the two groups: 10.4 cm3 vs. 18.4 cm3, p = 0.10). This fact could be related to an inhomogeneous distribution of the cases with unbalanced patients’ characteristics in the analyzed subgroups (responders vs. non-responders) and to the relatively low number of cases in the non-responder group. On the other hand, only few literature data suggest a predictive role for metabolic parameters based only on pre-treatment PET/CT, without a comparison of post-CRT PET/CT. Levine et al. in a series of 64 patients with esophageal carcinoma reported a possible role of SUVmax as a predictor; in this case, patients with pre-treatment SUVmax > 15 responded better to treatment [14]. The identification of predictive parameters of response to therapy could come from studies comparing systematically the metabolic parameters both before and after CRT [29]. Another well-known prognostic factor is surgery: in our series, patients eligible for surgery showed better survival in terms of both OS (survival 3-year 48 vs. 10%, p = 0.00074), and DFS (3-year survival 56 vs. 10%, p = 0.0014). The same applies to those patients who showed a pathological complete response, in terms of both OS (3-year survival of 59 vs. 10%, p = 0.00006) and DFS (3-year survival of 67 vs. 18%, p = 0.0001). Of note, surgery did not influence the value of MTV as a prognosticator for OS as shown in our analysis by Yates's chi-squared test. The main limitations of our study are related to its retrospective nature, the relatively limited number of patients and the lack of data regarding post CRT PET/CT response.

Conclusions

The data here presented suggest a prognostic value for preoperative MTV, independent of the use of surgery, and support the hypothesis that patients with lower preoperative MTV values could have a better prognosis even after radical CRT regimens. These findings need to be validated within prospective studies on a larger cohort of patients, to investigate more in-depth prognostic factors to improve treatment strategies in esophageal cancer. Further investigations by omics studies could open the way for more customized treatment, considering tumor signatures related to response to specific treatment approaches.