Introduction

Malignant esophageal fistula (MEF) is a serious complication of advanced esophageal cancer and is defined as a fistula caused by malignancies [1]. The two most common types are esophageal mediastinal fistula and esophageal respiratory fistula. Esophageal aortic fistula was rarely reported, because these patients often die of sudden massive hemorrhage and cannot be diagnosed before death [2]. In untreated EC patients, the incidence of esophageal fistula ranges from 5 to 15% [3], while radiotherapy patients have a range of 5.6% to 33% [4,5,6,7]. MEF has a very poor prognosis; patients frequently die within 3 months due to nutritional failure, pneumonia, mediastinal abscess or large vascular injury [8]. Patients with MEF must be treated with a nutrition tube, gastrostomy, or esophageal stent to prevent digestive fluid leakage into the trachea or mediastinum, all of which significantly reduce quality of life. Therefore, early identification of patients at risk of MEF enables early intervention, which improves patient outcomes.

However, few studies on MEF risk factors have been conducted, the majority of which were retrospective analyses of small samples, and only a few risk factors had been confirmed. T4 stage (invasion of adjacent organs) had been a known risk factor for MEF. Because the esophagus is a thin-walled organ with a thickness of less than 5 mm, tumors can grow through the entire thickness of the esophageal wall, which is the primary cause of MEF [3]. It is reported that the incidence of esophageal fistula is 10–22% in T4 patients with radical concurrent chemoradiotherapy, which is significantly higher than in non-T4 patients [9, 10]. In a retrospective study, ECOG PS, BMI, T4, N2/3 and re-radiotherapy were identified as independent factors, with a C-index of a nomogram incorporating the factors of 0.764 in an external validation cohort [11]. Another retrospective study revealed quantitative pretreatment CT analysis has excellent performance for predicting fistula formation in esophageal cancer patients [12].

Recently, radiomics has proven to be a useful tool for identifying the biological behavior of tumors [13]. Its central concept is to convert images into digital information that can be mined, extracted, and analyzed, as well as to predict target associations and build model [14, 15]. Iwashita et al. reported CT radiomics and clinical data might help determine survival outcomes in patients with esophageal cancer treated with radical radiotherapy [16]. Yang et al. found that radiomics can be used for chemoradiation outcome prediction in esophageal cancer [17]. Furthermore, some research has demonstrated that radiomics of tumor target areas can be utilized to predict tumor stage, differentiation, and pathological type [18, 19]. The present study attempt to develop a joint prediction nomogram by combining clinical factors and pre-treatment CT radiomics to accurately predict MEF. It will help clinicians identify EC patients at high risk of having an esophageal fistula.

Materials and methods

Patients

The Shandong Cancer Hospital ethics committee approved this retrospective study (Approval No. 2021003193), and informed consent was waived.

Inclusion criteria were: (a) Esophageal carcinoma proved by pathology; (b) Chest enhanced CT examination before treatment; (c) Patients treated with chemotherapy or chemoradiotherapy. Exclusion criteria: (a) Esophageal fistula appeared prior to treatment; (b) Anastomotic fistula or other medical injuries caused esophageal fistula.

A total of 122 patients treated at Shandong Cancer Hospital from October 2018 to September 2020 were included in the study. Fifty-three patients with MEF were identified, while 69 patients did not have a MEF within their survival period.

Malignant esophageal fistula must be confirmed by endoscopy or esophagography. All cases were divided into a training set and a validation set in a 7:3 ratio using a random algorithm.

Data collection and variable definition

Clinical data such as gender, age, tumor location, TNM stage, radiotherapy, chemoradiotherapy, chemotherapy regimen, and stenosis were obtained using the hospital information system (HIS). To obtain enhanced chest CT images, the picture archiving and communication system (PACS) was used. The image format was digital imaging and communications in medicine (DICOM).

Stenosis was defined as the inability to pass an endoscope through and/or significantly impairing eating (semi-fluid or fluid diet). The chemotherapy regimens mentioned in the article were first-line treatments.

Clinical predictors and clinical nomogram

Logistic univariate and multivariate regression analyses were used to identify clinical independent predictors and create a clinical nomogram using the variables with P < 0.25 in multivariate analyses.

Tumor segmentation

The region of interest (ROI) was defined as primary tumors, which included lesions with esophageal wall thickening > 5 mm or lumen occlusion diameter > 10 mm but did not include intraluminal gas or oral contrast agent. A radiation oncologist with 10 years of experience manually delineated the ROIs, which were then checked by a radiologist. Both doctors were unaware of the patient’s clinical information. 3D Slicer (version: 4.10.2), an open source software platform for medical image processing and visualization, was used to perform all delineation tasks.

Feature extraction and selection

Radiomic features were extracted automatically from each contoured ROI using Pyradiosity, an open source Python package available at https://pyradiomics.readthedocs.io/en/latest/.

The Lasso-logistic regression algorithm was used to screen radiomic features related to the presence of MEF. The Pearson correlation test was used to examine the multicollinearity.

Radiomics model construction

Based on the screened features, the logistic regression algorithm generated the radiomics model.

Radiomics-clinical model construction

Two steps were included in radiomics-clinical model construction. First, radiomic signature (radscore) was calculated by adding all filtered eigenvalues multiplied by the corresponding coefficients. The Wilcoxon rank sum (Mann–Whitney) test was applied to the radscores of the training and validation sets to ensure that there was no difference between the two groups. The flowchart for radiomic signature is presented in Fig. 1. Second, the logistic regression algorithm was used to fit radiomic signature and clinical independent risk factors.

Fig. 1
figure 1

Flowchart of developing a radiomic signature. Following ROI segmentation, 851 radiomic features were extracted from each ROI. The Lasso algorithm selected ten features that had the best correlation with the occurrence of esophageal fistula and a radiomic signature were created. The box chart revealed that there was no significant difference between the training and validation sets, but there was a difference between patients with and without fistula

Validation and comparison

In the validation set, the prediction efficiency of each model was evaluated, including discrimination (AUC/ROC), calibration (calibration curve), goodness of fit (AIC), reclassification ability (NRI, IDI), and clinical benefit (DCA curve). The Delong test was used to compare model discrimination abilities.

Statistical analysis

Stata 15.0 software (Stata Corp, www.stata.com) was used for statistical analysis. The chi-square test was used to compare categorical variables; the Wilcoxon rank sum test was used to compare continuous variables; all tests were two-sided, and P < 0.05 was considered statistically significant. R software (version 3.5.3, https://www.r-project.org/) was used to select radiomic features and build models. The R program packages used in this study are listed in the table below.

Algorithm

Packages

Version

Lasso

glmnet

4.0-1

Logistic, nomogram, calibration

Rms

6.0-1

ROC/AUC

ROCR, pROC

1.0-11, 1.16.2

NRI/IDI

PredictABEL, nricens

1.2-4, 1.6

Correlation test

corrplot

0.84

DCA

Rmda

1.6

Results

Patients

This study enrolled 122 patients with esophageal cancer, including 56 patients with MEF. Squamous cell carcinoma was the pathological type of all patients. The time between the occurrence of an esophageal fistula and diagnosis of EC ranged from 1 to 24 months, with an average of 7.87 ± 8.78 months. Esophageal mediastinal fistula occurred in 22 patients, esophageal tracheal fistula in 30 patients, and esophageal pulmonary fistula in 1 patient. Sixty-nine control cases were hospitalized at the same time and had no esophageal fistula within the survival period or 2 years after diagnosis, with a median follow-up time of 30 months (IQR: 16, 38). Table1 shows the baseline characteristics of enrolled patients.

Table 1 Clinical characteristics of patients included in the analysis

In a 7:3 split, all patients were assigned to a training set and a validation set. There were 86 patients in the training set, with 38 (44.2%) having MEF; 36 patients in the validation set, with 15 (41.7%) having MEF. No statistically significant differences existed in baseline clinical characteristics between the two groups (Table 1).

Clinical predictors and clinical nomogram

The clinical prediction model was built using stenosis (P = 0.01), gender (P = 0.23), and T stage (P = 0.11), all of which were screened by logistic regression algorithm (Table 2). The model predicted esophageal fistula with an AUC = 0.691 (95% CI 0.582–0.799) in the training set, and 0.640 (95% CI 0.453–0.827) in the validation set. The AIC value was 115.8. Figure 2 depicts a clinical nomogram and its calibration curve. Each predictor in the nomogram was assigned a score on a point scale. By adding up the total scores projected in the bottom scale, we could estimate the probability of MEF.

Table 2 Univariate regression analysis and multivariate regression analysis
Fig. 2
figure 2

A Nomogram developed by clinical risk factors. B Calibration curve plotted for clinical prediction model

Radiomic features

A total of 851 radiomic features were extracted from each ROI, and all features can be divided into 4 categories. The ‘shape’ category had 14 features that indicate the shape and size of regions of interest (ROIs) in 2D and 3D spaces. The ‘first order’ represents statistical eigenvalues of voxel intensity, encompassing mean, maximum, and minimum values. Textural features such as Glcm (GrayLevelCooccurenceMatrix), Glrlm (GrayLevelRunLengthMatrix), Glszm (GrayLevelSizeZoneMatrix), Gldm (GrayLevelDependenceMatrix), and Ngtdm (NeighbouringGrayToneDifferenceMatrix) were computed from multiple statistical matrices and characterized the organization Wavelet-based features were defined as first-order and texture features obtained from eight wavelet decompositions of the original CT images. As a result, the total number of radiomic features can be estimated as 14 + (18 + 24 + 16 + 16 + 14 + 5) + (18 + 24 + 16 + 16 + 14 + 5)8 = 851.

Ten features that were closely related to the presence of an MEF were chosen using Lasso-logistic algorithm (Fig. 1). The selected features and their coefficients are shown in Table 3. There was no significant multicollinearity between the features chosen (Pearson correlation coefficients were all less than 0.9).

Table 3 Radiomic features selected and their coefficients

Radiomics prediction model

The radiomics prediction model was built based on the selected radiomic features to compare with the clinical and radiomics-clinical models. In validation set, the AUC of radiomics model was 0.692 (95% CI 0.516–0.868) and AIC was 111.4.

Radiomics-clinical nomogram model

The formula for calculating radiomic signature (radscore) was as follows:

$$\begin{aligned} {\text{Radscore}} & = {\text{original}}\_{\text{gldm}}\_{\text{DependenceEntropy}} \times 0.29509193 \\ & \quad + {\text{wavelet}}.{\text{LLH}}\_{\text{firstorder}}\_{\text{Mean}} \times - 0.87347923 \\ & \quad + {\text{wavelet}}.{\text{LLH}}\_{\text{firstorder}}\_{\text{Skewness}} \times - 0.01889205 \\ & \quad + {\text{wavelet}}.{\text{LHH}}\_{\text{firstorder}}\_{\text{Skewness}} \times 0.32706898 \\ & \quad + {\text{wavelet}}.{\text{LHH}}\_{\text{glcm}}\_{\text{Idn}} \times 0.72705274 \\ & \quad + {\text{wavelet}}.{\text{HLL}}\_{\text{firstorder}}\_{\text{Skewnes}} \times 1.02717885 \\ & \quad + {\text{wavelet}}.{\text{LLL}}\_{\text{firstorder}}\_{\text{Maximum}} \times 0.74987525 \\ & \quad + {\text{wavelet}}.{\text{LLL}}\_{\text{firstorder}}\_{\text{Range}} \times 0.17126651 \\ & \quad + {\text{wavelet}}.{\text{LLL}}\_{\text{glszm}}\_{\text{GrayLevelNonUniformityNormalized}} \times - 0.26262914 \\ & \quad + {\text{wavelet}}.{\text{LLL}}\_{\text{glszm}}\_{\text{ZoneEntropy}} \times 0.30172464. \\ \end{aligned}$$

In the training set, patients without MEF had a radscore of 0.873 ± 0.372, while patients with MEF had a radscore of 1.275 ± 0.381. Wilcoxon rank sum test revealed a significant difference (P = 0.000). In the validation set, patients with MEF had a radscore of 1.125 ± 0.186, while those without had a radscore of 0.857 ± 0.2187, a significant difference (P = 0.000). In the training and validation sets, there was no significant difference between patients with MEF (P = 0.278), and those without (P = 0.6019). The multivariate logistic regression analysis including radscore, gender, T stage and stenosis revealed that stenosis (P = 0.023) and radscore (P = 0.001) were the independent risk factors (Table 2).

A radiomics-clinical nomogram prediction model (Fig. 3A) was developed using these two variables as inputs, which predicted esophageal fistula with an AUC = 0.782 (95% CI 0.684–0.8796) in the training set and 0.867 (95% CI 0.7461–0.987) in the validation set, with an AIC = 101.1. The calibration curve revealed that the radiomics-clinical nomogram’s predicted results were in good agreement with the actual observations (Fig. 3B).

Fig. 3
figure 3

A Radiomics-clinical nomogram. B Calibration curve plotted for radiomics-clinical prediction model

Model comparison

Figure 4A displays the discrimination ability (AUC values) of the clinical model, radiomics model, and radiomics-clinical model, revealing that the Joint model outperformed the other two models (Delong test, P < 0.05). The net benefit of the radiomics-clinical prediction model under each threshold probability was greater than that of the clinical model and radiomics model, according to the decision curves (Fig. 4B). In comparison to the clinical model, the NRI of radiomics-clinical model was 23.6% (95% CI 0.153–0.614) (1000 iterations), and IDI was 0.125 (95% CI 0.040–0.210), which was statistically significant (P = 0.004).

Fig. 4
figure 4

The model’s performance was compared using the A ROC curve (discrimination), B DCA curve (clinical benefit). Radiomics-clinical prediction model outperformed the other two

Discussion

Malignant esophageal fistula is a serious complication of esophageal cancer that significantly reduces the patient’s survival time and quality of life. However, there is currently a scarcity of effective predictive methods. Clinical prediction models and quantitative-CT-based prediction models with AUC of 0.805 and 0.841, respectively, have been reported in previous research [11, 12]. In present study, we built the radiomics-clinical model, clinical model, and radiomics model all at the same time and compared their predictive abilities. We discovered that the radiomics-clinical model performed significantly better than the other two models.

The radiomics-clinical model developed by fitting stenosis with radiomic signature using logistic regression algorithm had an excellent prediction efficiency with an AUC of 0.867 (95% CI 0.7461–0.987). The discrimination ability was significantly higher than that of the clinical (AUC = 0.640) and radiomics models (AUC = 0.692). In terms of goodness of fit, the AIC of the radiomics-clinical model was 101.1, which was lower than the AICs of the clinical model (AIC = 115.8) and the radiomics model (AIC = 111.4), indicating a better fitting performance.

T4 indicates that esophageal cancer has infiltrated the entire esophageal layer and has invaded surrounding organs [20]. Esophageal cancer is prone to spontaneous necrosis, and radiotherapy or chemotherapy can also promote necrosis, resulting in an esophageal fistula, particularly in tumors sensitive to radiation or chemotherapy [21]. However, whether radiotherapy is the cause of esophageal fistula remains debatable. Some researchers believe there is no evidence that radiotherapy increases the incidence of esophageal fistula because these patients with tracheoesophageal fistula after radiotherapy may have fistula in any case, but radiotherapy causes it to occur earlier [3]. Other researchers believe that radiotherapy causes tumor tissue to shrink while inhibiting normal tissue repair, which is a major cause of MEF [22, 23].

Esophageal stenosis is another clinically independent predictor. Takahiro et al. [9] discovered that stenosis was the only clinical risk factor for MEF, and that the risk of esophageal fistula in patients with stenosis was twice that of patients without stenosis, which was consistent with our findings. T4 and stenosis were also independent risk factors for esophageal fistula in patients treated by concurrent radiotherapy and chemotherapy [24]. However, there is no universally accepted definition of esophageal stenosis, which is usually determined based on the patient’s clinical symptoms and/or the findings of an endoscopic examination [25, 26].

In this study, male patients outnumbered female patients by 6.625:1, and female patients were less likely than male patients to develop an esophageal fistula. Guan et al. discovered that gender was an independent risk factor for malignant esophageal fistula [27]. Previous research has found that gender influenced the prognosis of esophageal cancer [28]. Estrogen regulates metabolism and organ response after injury [29]. As a result, women have natural advantages in injury recovery, which may account for gender differences in the occurrence and prognosis of MEF.

The clinical prediction model’s discrimination was unsatisfactory. To improve model performance, we extracted radiomic features from the primary tumor’s pre-treatment CT, screened out the feature set most relevant to the occurrence of MEF using lasso logistic regression and cross validation, and built a combined model with clinical risk factors, which significantly improved prediction efficiency. The biological behavior of esophageal cancer cells is a significant cause of fistula [30], but clinical risk factors cannot adequately reflect these characteristics. Medical images of tumors are the external manifestation of gene phenotype, and radiomics can convert images into digital data that can be mined, extracted, and analyzed [31]. Radiomics had been used in tumor studies to predict pathological type, stage, curative effect, and prognosis [32,33,34], and the derived radiogenomics can analyze tumor heterogeneity at the gene level [35, 36]. In our study, the potential 851 candidate radiomics features were eventually reduced to ten potential predictors by the LASSO method for further integration to form the radiomics signature, which contains useful biological information.

Because previous research has shown that arterial phase can better visualize esophageal tumors, which were chosen as the research object in this study [37].

This research had some advantages: first, prior to treatment, all of the clinical characteristics and medical images were gathered, which was helpful in developing the treatment strategy. Second, the nomogram prediction model showed the weight of each parameter on the outcome in a more intuitive way, making it more practical for clinical use. There were some limitations: because none of the patients in this group were surgically treated, only a few patients provided data on tumor differentiation, and this variable could not be assessed. To avoid bias, the total dose of radiotherapy was not included in the analysis. This was due to the fact that eight patients developed esophageal fistulas during radiotherapy, and their irradiation dose was 2–40 Gy. Furthermore, the modeling and validation were all completed in a single center. In future work, we will include data from multiple centers for external validation to improve the model’s generalization.

Conclusion

We developed and validated the first radiomics-clinical nomogram which could assist clinicians in identifying patients at high risk of malignant esophageal fistula.