Introduction

Esophageal cancer remains an important cancer worldwide and is responsible for over 600,000 new cases in 2020 and an estimated 544,000 deaths, ranking seventh for incidence and sixth for mortality globally [1]. Esophageal squamous cell cancer (ESCC), as the dominant histological subtype, comprises over 90% of all esophageal cancer cases in the high-risk areas such as East and Central Asia [2, 3].

Local-regional staging is essential for decision making and prognostication of esophageal carcinoma [1, 4]. Important findings were, first, that staging accuracy of current imaging modalities is still relatively inadequate [5,6,7]. Second, clinical staging, currently based largely on imaging, cannot predict the survival as accurately as pathologic staging, which overestimated the survival of early-stage tumor and underrated the survival of advanced-stage tumor [8]. Therefore, new tools for accurate clinical staging have to be developed to facilitate precision cancer care.

Radiomics enable non-invasive decoding of clinical staging in various cancers including ESCC [9,10,11]. However, previous radiomics studies mainly focused on the intratumoral region alone, whereas there is paucity of data evaluating the potential value of peritumoral radiomics features for predicting ESCC staging. Recently, peritumoral radiomics features have been shown to be predictive in treatment response assessment in ESCC [12]. Therefore, the primary objective of the study was to assess the ability of intratumoral and peritumoral radiomics in predicting local-regional staging in ESCC. The secondary objectives were to investigate associations between radiomics-based staging phenotype and patient survival.

Materials and methods

This retrospective study was approved by the ethics committee of our institution (no. ky-2023-109) without the requirement of written informed consent.

Patients

Esophageal cancer patients who underwent radical esophagostomy and regional lymphadenectomy (i.e., pathological T1-4aN1-2M0) in our institution from January 2012 to September 2016 were retrospectively recruited. Inclusion criteria were as follows: (1) ESCC confirmed histologically and (2) standard contrast-enhanced computed tomography (CT) performed within 2 weeks before surgery. Exclusion criteria included (1) preoperative anticancer therapy; (2) concurrence other malignant tumors; (3) missing clinicopathological data (preoperative blood-routine characteristics, pathological data for definite TNM stage, etc.); (4) uninterpretable enhanced CT images; and (5) Postoperative following up time < 1 year. One hundred twenty-two patients who met the criteria were allocated randomly to the training cohort (n = 93) and internal validation cohort (n = 29) in a 3:1 ratio. Patient enrollment pathway is shown in Fig. 1. These patients were previously reported as part of a radiomics study [13]. Yet, the current study exploits a different study purpose, methodology, and results as compared with the prior publication. Whereas previous study dealt with intratumoral radiomics and focused on prediction of lymph node (LN) metastasis, the present study evaluates the diagnostic performance of intratumoral and peritumoral radiomics in predicting local-regional staging and prognostic value of radiomics-predicted Tumor-Node-Metastasis (TNM) stage.

Fig. 1
figure 1

Flow diagram of patient enrollment, eligibility, and exclusion criteria. ESCC esophageal squamous cell carcinoma, CT computed tomography

Clinicopathological characteristics

Clinical information, including demographic data (age, sex), laboratory test (serum albumin, fibrinogen, blood-routine characteristics), and histopathological reports (tumor site, grade, Tumor stage, Node stage) were collected from electronic medical record databases. Neutrophil-to-lymphocyte ratio (NLR) and platelet-to-lymphocyte ratio (PLR) were calculated based on neutrophil, lymphocyte, and platelet count within 2 weeks before surgery. The threshold values for serum albumin and fibrinogen used here were 35 g/L and 4 g/dL, respectively. The pathologic stage was defined according to the Union for International Cancer Control TNM staging system (8th edition) [8, 14]. Stage I and II were classified as the early-stage, and stage III and IV the late-stage.

Follow‑up strategy

Patients were followed up every 3 months for the 1st year after surgery, every 6 months for thereafter. At each outpatient visit, thoraco-abdominal CT scans, brain magnetic resonance images or brain CT scans and bone scans were routinely performed to detect any evidence of recurrence. The recurrence date was recorded as the date when the aforementioned scans first showed signs of recurrence. Recurrence-free survival (RFS) was defined as the duration from the date of surgery to the first radiographic detection of recurrence, death, or the last follow-up was set as the end point.

CT acquisition

All patients underwent contrast-enhanced chest CT using a 64-slice LightSpeed VCT (GE Healthcare), which was performed in the axial plane with 5-mm-thick sections.

Details on the imaging protocols are shown in Supplementary S1 (online). Arterial-phase CT images, as the optimal one for visualization of esophageal cancer, were retrieved from picture archiving and communication system (Carestream, Canada) for tumor annotation [15]. CT-reported lymph node (LN) status was assessed in consensus on the pretreatment CT by two radiologists (Z.T. and R.M., with 12 and 6 years of clinical experience in esophageal imaging, respectively). LN short-axis diameter greater than 10 mm was defined as a radiological positive nodal status [16].

Tumor segmentation and feature extraction

Tumor segmentation and radiomic features extraction were performed by using the open software 3D Slicer (version 4.10.2, http://www.slicer.org). The intratumoral three-dimensional regions of interests (ROIs) covering the whole tumor in all patients were manually delineated slice by slice on the CT images by the one investigator (R.M), who was blinded to pathological TNM stage. After intratumoral segmentation, peritumoral masks with a radial distance of 3 mm were automatically created using morphologic outward dilation by 2 mm and inside erosion by 1 mm of the tumor boundaries. Airway, lung, left atrium, aorta, vertebrae, and azygos were manually excluded (Fig. 2). To evaluate the interreader agreement, an independent investigator (X.T) also placed three-dimensional ROIs of the intratumoral and peritumoral areas in a randomly selected subset of 30 patients.

Fig. 2
figure 2

Lesion segmentation for radiomics analysis. First, A Region of interest was manually segmented in axial view to obtain intratumoral mask, B then the peritumoral masks with a radial distance of 3 mm were semiautomatically generated using morphologic outward dilation by 2 mm and inside erosion by 1 mm of the tumor boundaries, with airway, lung, left atrium, aorta, vertebrae, and azygos excluded manually. C, D Three-dimensional view of the intratumoral and peritumoral volumes of interest

Before features extraction, image normalization was performed by remapping the histogram to fit within μ ± 3σ (μ: mean gray level within the VOI; σ: gray level standard deviation). For each ROI, 1223 quantitative features were extracted, including 14 shape features, 234 first-order features, and 975 second-order texture features derived from gray level dependence matrix (GLDM), gray level co-occurrence matrix (GLCM), gray level run length matrix (GLRLM), gray level size zone matrix (GLSZM), and neighboring gray tone difference matrix (NGTDM).

Construction of a TNM-related radiomic signature

The reproducibility of features was calculated by using intra-class correlation coefficients (ICC). After being normalized using the z score standardization method, all radiomics features were filtered using the criteria of ICC ≥ 0.80 and correlation coefficient ≥ 0.90. The remaining features were then input into the least absolute shrinkage and selection operator (LASSO) logistic regression model to avoid overfitting and construct radiomic signature. The output of LASSO model was converted into a probability score, namely the radiomic score, indicating the individual relative risk for high pathologic tumor stage.

Statistical analysis

All statistical analyses were performed using R version 4.2.1 (The R Foundation). A two-tailed P value less than 0.05 was considered as statistical significance. The clinicopathological characteristics between two datasets were compared with Chi-Square, t test or Mann–Whitney U test, where appropriate. The discrimination performance of the radiomic signature was quantified by the area under curve (AUC) value in the primary training set and internally validated in the independent test set. To explore the prognostic value of radiomics-predicted TNM stage, the optimum threshold of the radiomic score was determined using the surv_cutpoint function of survival R package. Accordingly, the patients were divided into low- and high-risk groups in the entire cohort, for which the survival outcomes were compared with Kaplan–Meier analysis and the 2-sided log-rank tests. Univariable and multivariable Cox regression analyses were conducted to analyze the relationship between radiomics-predicted TNM stage and RFS.

Results

Patient characteristics

After applying the exclusion criteria, 122 patients (mean age, 57 years; 27 women) with ESCC were included for radiomics signature training, internal validation, and survival comparison analysis. The details of clinical-pathologic characteristics are shown in Table 1. High TNM stage in training set and test set was 55.2% and 54.8%, respectively. The median follow-up time was 47.6 months (interquartile range 39.8-63.2). The median RFS duration for the entire cohort was 21.9 months, with 67 of 122 (54.9%) patients experiencing recurrence after complete surgical resection (median RFS time: 19.2 and 22.1 months for training set and test set, respectively). No significant differences in clinicopathological characteristics between two datasets justified their use as training and test sets (see Table 1).

Table 1 Baseline characteristics of patients in training and test sets

Radiomics features selection and signature construction

A total of 2446 features were extracted from two ROIs per patient. After the reproducibility and correlation analysis, a list of 122 features was retained with 2324 ineligible features excluded. Next, five radiomics features, including one intratumoral feature and four peritumoral features, were selected by the LASSO algorithm (Fig. 3) and quantitatively integrated into the radiomics signature. The radiomic score was computed as follows: radiomic score = 0.22885113 − (0.13596893 × Intratumoral_wavelet-LHH.glcm.DifferenceAverage) − (0.02591146 × Peritumoral_log-sigma-1-5-mm-3D.gldm.SmallDependenceLowGrayLevelEmphasis) + (0.08734137 × Peritumoral_wavelet-HLH.firstorderKurtosis) + (0.08575472 × Peritumoral_wavelet-HHH.glszm.SizeZoneNonUniformity) + (0.05001215 × Peritumoral_original.glcmInverseVariance).

Fig. 3
figure 3

Selection of TNM-associated radiomics features via least absolute shrinkage and selection operator algorithm. Top figure shows the coefficient profiles of 122 radiomics features against the log (λ), bottom figure the cross-validation curve. Pink dotted vertical lines were drawn at the optimal value by using fivefold cross-validation and the 1 standard error of the minimum criteria. Five nonzero coefficients were selected. TNM tumor-node-metastasis

Validating the radiomics signature

The receiver operating characteristic curves analysis indicated that the radiomics signature exhibited AUCs of 0.823 (95% CI 0.739–0.906) in the training set, and 0.813 (95% CI 0.632–0.993) in the test set (Fig. 4).

Fig. 4
figure 4

Receiver operating characteristic curves to predict TNM stage for the radiomics signature in the training and test sets. The radiomics signature incorporating one intratumoral and four peritumoral features distinguished tumor stage with an area under curve (AUC) of 0.823 in the trainset, with similar performance in the testset (AUC 0.813)

Survival risk stratification based on the radiomics signature

To explore the relationship between radiomics-predicted TNM stage and RFS, the entire cohort was clustered into low-risk (radiomic score < 0.17) and high-risk (radiomic score ≥ 0.17) groups according to the optimal cutoff value of the radiomic score determined by the surv_cutpoint function of survival R package (Fig. 5). The distribution of the radiomic score indicated that patients with low scores were commonly associated with favorable RFS, while those with high scores showed an increasing frequency of recurrence (log-rank P = 0.004, Fig. 6), with a hazard ratio of 0.47 (95% CI 0.27–0.83, P = 0.009). Multivariable Cox regression analysis revealed that radiomics-predicted TNM stage was an independent preoperative factor for RFS (hazard ratio, 0.32 [95% CI 0.17, 0.64]; P = 007) following curative-intent resection of ESCC, as was CT-reported LN status (hazard ratio, 1.95 [95% CI 1.13, 3.35]; P = 016) (Table 2).

Fig. 5
figure 5

The classification of radiomics-predicted TNM stage status was derived using the optimal threshold of radiomic score (0.17) determined by the surv_cutpoint function of the R package survminer

Fig. 6
figure 6

Kaplan–Meier survival analyses stratified by radiomics-predicted TNM stage in the entire cohort (n = 122). Recurrence-free survival was significantly worse in patients at high risk compared with those at low risk. Shaded areas represent 95% confidence intervals

Table 2 Uni- and multivariable cox regression analysis of predictors of recurrence-free survival

Discussion

In this study, we developed and validated a radiomics signature for the preoperative prediction of TNM stage in patients with resectable ESCC. The radiomics signature, which combined one intratumoral feature and four peritumoral features, performed well in distinguishing TNM stage with an AUC of 0.823 in the training set, and showed similar discrimination on internal validation (AUC 0.813). The comparable performance implied that the radiomics signature was a robust imaging biomarker in predicting tumor stage. In addition, the staging phenotype predicted by radiomics emerged as an independent preoperative predictor of RFS, thereby providing potentially prognostic information for medical decision making. To our knowledge, this study is the first exploring the combined predictive value of intratumoral and peritumoral radiomics features for tumor stage in ESCC patients, and also the first investigating the prognostic value of radiomics-based staging phenotype.

Pretreatment prediction of TNM stage is important for risk stratification and individualized therapy [4, 17]. Patients with late-stage ESCC are likely to be offered neoadjuvant chemoradiation in hopes of improving survival. Yet, clinical staging based chiefly on imaging modalities is still relatively inaccurate and more precise clinical staging tools are needed [8]. Prior studies supported the potential use of radiomics as a useful tool for tumor stage prediction in different clinical settings [9, 11, 18]. Wu et al. developed a radiomics signature to predict TNM stage of ESCC, resulting in an AUC of 0.762 at internal validation [11]. Unlike our signature that is based on radiomics features extracted from intratumoral and peritumoral areas, Wu et al. extracted and analyzed intratumoral features alone. Our radiomics signature demonstrated a slightly better performance (AUCs: 0.813 vs 0.762), although a head-to-head comparison is needed. One possible explanation was that peritumoral regions may potentially contain complementary predictive information, as aggressiveness is a hallmark of cancer and peritumoral invasion is related to tumor stage [19,20,21,22]. Therefore, a combination of intratumoral and peritumoral features, which potentially capture the intratumoral heterogeneity and peritumoral microenvironment simultaneously, could enhance the predictive ability of radiomics in tumor staging in patients with ESCC. Since the developed radiomics signature can reliably predict tumor stage preoperatively, it may contribute to improve selection of ESCC patients most likely to benefit from neoadjuvant therapy while sparing others from the toxic effects of the treatment.

Opposed to previous studies [11, 23, 24] that showed the usefulness of shape-based features (such as tumor volume, tumor length) for assessment of TNM stage, our signature did not include any shape features, likely because shape features do not reflect comprehensively tumor heterogeneity and aggressiveness, thus are less predictive of TNM stage.

Another critical result found in our study was that the radiomics-based staging phenotype was independently associated with RFS, implying its prognostic relevance for ESCC patients. More frequent follow-ups and more positive tailored therapy may be needed for high-risk patients probed by the proposed radiomics signature.

The following study limitations merit consideration. First, the retrospective study may induce inevitable selection bias. Second, we collected patient data from single center. Future multicenter external validation is warranted to validate the generalization of the proposed radiomics signature. Third, manual volumetric segmentation was time-consuming and labor-intensive. Automatic annotation is required to simplify the process in the future. Fourth, we considered the handcrafted radiomic features alone in this study. The role of deep learning features has not been explored. An integrated analysis of the handcrafted and deep learning features may potentially improve predictive performance [25, 26]. Finally, clinical data (such as endoscopic results) for preoperative stage were unavailable in all patients in this retrospective study, so we were unable to compare the efficacy of our radiomics signature with that of clinical stage.

In conclusion, the proposed radiomic signature incorporates features of intratumoral and peritumoral regions, allowing the non-invasive evaluation of tumor stage in ESCC and potentially predicting prognosis.