1 Introduction

Bladder cancer (BCa) is one of the most common malignant tumors in humans and the most common among all urological tumors [1]. Based on whether it invades the bladder muscle layer, BCa can be divided into muscle invasive bladder cancer (MIBC) and non-muscle invasive bladder cancer (NMIBC) [2, 3]. Radical cystectomy (RC) is one of the main treatments for nonmetastatic muscle invasive bladder cancer (NMMIBC) or high risk non-muscle invasive bladder cancer (HRNMIBC) [4]. Generally, the prognosis of patients with MIBC after RC is generally worse than that of patients with NMIBC. Moreover, there are significant individual differences in the prognosis of RC for BCa of the same tumor stage determined by tumor heterogeneity [5]. Studies have shown that, despite RC treatment, local or metastatic recurrence rates of patients with BCa are 22–47% [6]. In these patients, the prognosis is often very unsatisfactory, and early and accurate prediction of recurrence risk and individualized treatment and follow-up management should be improved [7].

Computed tomography (CT) is the most commonly used preoperative evaluation in patients with BCa to localize the tumor and determine its number, size, degree of invasion of surrounding tissue, and presence of lymph node metastases or distant metastases. However, tumor heterogeneity cannot be reliably assessed visually. Radiomics refers to the extraction of many medical image texture features from raw image data and converting them into quantitative, high dimensional data [8]. Recent literature has shown that the radiomics features of CT or magnetic resonance imaging (MRI) have a significant impact on predicting the pathologic grade, lymph node metastasis, and overall survival of BCa [9,10,11]. At present, scholars have begun to explore the prediction of recurrence of BCa after surgery but have not yet elaborated on the prognosis of patients after RC for BCa [12, 13]. The purpose of this paper was to establish a nomogram related to radiomics to further explore the prediction of the risk of local and metastatic recurrence after RC for BCa.

Based on these findings, we hypothesized the following: (1) the nomogram can extract high throughput radiomics features that are highly correlated with BCa recurrence from raw CT images; (2) the nomogram model established by radiomics characteristics combined with important clinical factors can accurately and efficiently predict the recurrence of BCa.

2 Materials and methods

2.1 Patients

The retrospective study was approved by an institutional ethics review board and exempted patients from the need for informed consent, the approval numbers and dates of the Institutional Review Borad are (B) KY2022238 and December 30, 2022, respectively. In total, 134 patients who underwent RC at the Southwest Hospital of Army Medical University from June 2014 to June 2019 and were diagnosed with postoperative pathological confirmation of BCa were retrospectively enrolled. The inclusion criteria were as follows: (1) underwent RC and was pathologically confirmed as BCa; (2) CT scan was performed 1 month before RC, and the images were clear; and (3) complete clinicopathological data were available. The exclusion criteria were as follows: (1) CT scan of poor quality, mainly manifested as unclear display of bladder tumors caused by artifacts or other factors; (2) other treatment, including chemotherapy or immunotherapy, prior to CT scan; and (3) incomplete clinicopathological data or loss to follow-up.

2.2 Research method

2.2.1 Clinicopathological data and follow-up contents

Information of patients in the First Affiliated Hospital of Army Medical University from June 2014 to June 2019 was retrospectively collected, and the clinical data mainly included gender, age, tumor number, tumor size and grade, pathologic T stage, lymph node stage and histology type. All patients were followed every 3 months for the first year postoperatively and then every 6 months in years 2 and 3. The most important postoperative follow-up test to determine if BCa has returned is a CT scan. If the examination results show that the bladder tumor reappears in the urethra and pelvic cavity after RC; Or distant sites, such as the lung, bone, brain or liver, are considered for local or metastatic recurrence of BCa. Relapse-free survival (RFS) was recorded. After 3 years of follow-up, 24 cases of BCa recurrence were confirmed, which were defined as the recurrence group, and the rest were classified as the no recurrence group.

2.2.2 CT image acquisition

Prior to RC, all patients had a CT examination (Siemens Healthineers Somatom Sensation 64). The CT scan parameters were as follows: tube voltage, 120 kV; automatic tube current; detector collimation, 64 × 0.6 mm; matrix, 512 × 512; and rack rotation time, 500 ms. The intravenous contrast agent iohexol (350 mg I/ml) was scanned by intravenous injection of the elbow at an injection rate of 3–3.5 ml/s, followed by a flushing tube with 20 ml of saline. After collecting the localization images, arterial and venous images were obtained at 25 s and 60 s, respectively, and arterial phase images with a thickness of 1.5 mm were selected for feature extraction.

2.2.3 Region of interest (ROI) delineation and radiomics feature extraction

The first senior reader in genitouriturological imaging (patients whose clinicopathological data and follow-up results were unknown) reviewed all CT scans and manually cut and mapped the ROI of the entire tumor layer by layer using the medical imaging software 3D Slicer (version 4.13.0 http://www.slicer.org). Another reader with extensive experience in urogenital imaging (who was also unaware of the clinicopathological data and follow-up results of the patients) reviewed all the ROIs assigned by the first reader and randomly selected 45 lesions for resegmentation. Python (version 3.7.1) and Jupyter Notebook (version 5.7.4) were combined to perform feature extraction on ROIs that had already been cut. The main steps were as follows:

  • 1. Image preprocessing: The plug in Pyradiomics (version 3.0.1) was used to resample all images, the parameters binWidth were set to 25, the parameters resampledPixelSpacing were set to 1.0 mm, Then, the image nonlinear intensity transformation and wavelet transform are performed on the original CT image to reduce computational interference and improve feature recognition ability.

  • 2. Feature extraction and calculation: Based on the original CT image and processing, 1316 radiomics features were extracted from each ROI: firstorder features that describe single pixels of the image; shape features that describe the geometric characteristics of the ROI; and texture features that reflect the homogeneous phenomenon of vision in the image, including 252 firstorder features, 14 shape features, 336 gray-level cooccurrence matrix (GLCM) features, 224 gray-level run length matrix (GLRLM) features, 224 gray-level size zone matrix (GLSZM) features, 70 neighborhood gray tone difference matrix (NGTDM) features, and 196 gray-level dependence matrix (GLDM) features.

2.2.4 Local and metastatic recurrence-related feature selection and radiomics scoring model establishment

  • 1. Feature selection: First, to evaluate the repeatability of radiomics feature extraction, we calculated the interclass correlation coefficient (ICC) of the feature on the basis of the previous two experts' work and selected features with ICC>0.75. Subsequently, in training set, to maintain the comparability of different features and reduce the imbalance in the importance of each feature caused by the differences in the mean and variance of each feature, we normalized the features of the Z score: z= xi-μ/δ. Finally, to reduce the difficulty of model learning and data noise in the later stage, the coefficient of association and analysis of variance were used for feature selection. After removing other redundant and useless features, the features of p<0.05 were retained as an effective factor for the next feature selection.

  • 2. Radiomics Scoring Model Establishment: To solve the overfitting problem of high dimensional data, improve the performance of the model, and select the most effective prognostic features, the LASSO regression algorithm was used to select the deviation of the measurement index, and the optimal feature was selected after 10-fold cross verification. In the training group, a logistic regression model with radiomics scoring was constructed based on the selected radiomics features to obtain the regression coefficients. Then, a linear combination of these selected features and their corresponding regression coefficients was used to calculate the Rad-Score for each patient. Then, validation was performed in the validation group. The score value was used for subsequent analysis. The study design for radiomics is shown in Figure 1.

Fig. 1
figure 1

Workflow for extraction and screening of radiomic features

2.2.5 Establishment of radiomics-clinical model and efficacy assessment

First, univariate analysis was carried out, and the statistically significant variable P<0.05 was retained as a clinical risk factor, combined with Rad-Score and clinical variables. The multivariate logistic regression model was used for analysis, and P <0.05 was further determined to be an independent risk factor. Then, the radiomics clinical nomogram predictive type related to BCa recurrence was established. The training set and validation set calibration curves and receiver operating characteristic curves (ROC) were used to evaluate the performance. Finally, according to the median recurrence risk score of the line chart model training dataset, Decision curve analyses (DCA) was established to further evaluate the clinical benefit of the radiomics-clinical model compared with the radiomics model and the clinical model alone.

2.3 Statistical analysis

SPSS 26.0 software (IBM) was used for univariate and multivariate analyses to study the clinical risk factors and independent risk factors among the cohort. A t test was used for measurement data, a chi-square test was used for counting data, and single factor analysis of variance was used for preliminary feature screening. In this study, R statistical software version 4.2.2 (https://www.r-project.org/) was used for ICC calculation, model building and performance evaluation, and survival curve drawing. The R packages used included “glmnet,” “rms,” “pROC,” “rmda,” and “survival.” P<0.05 indicates statistical significance.

3 Results

3.1 Patient characteristics

Overall, 134 patients with BCa were enrolled in this study. All patients underwent RC and pelvic lymph node dissection followed by urinary diversion (Briker or orthotropic ileal bladder surgery). Among them, 119 were males, and 15 were females, with a mean age of 64.4 ± 9.7 years. During the 3-year follow-up, patients were divided into a recurrence group (n=24) and a no recurrence group (n=110) according to whether they relapsed. The median RFS of patients with relapse was 12 months. Comparisons of gender, age, tumor number, tumor size, grade, pathologic T stage, lymph node stage and histology type between the two groups are shown in Table 1.

Table 1 Demographics and Clinical Characteristics of Patients in the training set and validation set

3.2 Selection of radiomics features and establishment of radiomic model

In this study, 1316 radiomics features were extracted, among which the average features of ICC>0.75 showed good agreement between the two groups of observers, and 1076 features of ICC>0.75 prepared the following feature selection. The correlation threshold of 0.9 was selected for correlation coefficient analysis, and the 13 best features with a P value of <0.05 were then screened out by analysis of variance. Each feature data point was standardized, and 8 features were screened out after 10-fold cross verification (Fig. 2a, b) using LASSO regression algorithm. Finally, they were established as logistic regression models for radiomics. The model showed an AUC of 0.777 (95% CI 0.666–0.887) for the training set and 0.772 (95% CI 0.634–0.911) for the validation set (Fig. 2c, d). Based on the independent variable coefficients and cutoff values of the radiomics model, the regression coefficients were obtained. In the training set, the 8 retained features were linearly combined by the corresponding coefficient weighting to obtain the calculation formula of Rad-Score, which could quantify the radiomic features and predict the recurrence risk.

Fig. 2
figure 2

a The LASSO model is selected by tenfold cross verification based on minimum criteria; b LASSO coefficient distribution map for 8 texture features; c and d are ROC curves of the training set and validation set radiomic features

3.3 Establishment of a radiomic-clinical model and its efficacy assessment

Univariate and multivariate analyses were performed based on the Rad-Score and important clinical factors of the patients in the training set, in which tumor size, pathologic T stage, lymph node stage and Rad-Score were identified as clinical risk factors (P <0.05). Meanwhile, in the multivariate analysis based on clinical risk factors, pathologic T stage (OR 0.176, 95% CI 0.036–0.653 P=0.015) and Rad-score (OR 2.933, 95% CI 1.336–7.951 P=0.018) were still identified as independent prognostic factors, as shown in Table 2. Combined with the Rad-score and pathologic T stage, a radiomics-clinical nomogram model for predicting local or metastatic recurrence after RC in BCa patients was established (Fig. 3a). The ROC curves of the model showed that the AUCs of the training set and the validation set were 0.840 (95% CI 0.743-0.937) and 0.883 (95% CI, 0.777-0.989), respectively, the accuracies were 0.817 and 0.878 (Fig. 3b, c), respectively, and the calibration curves of the training set and the validation set performed well, showing that the model can predict the recurrence rate of BCa within 3 years after RC in good agreement with the actual situation (Fig. 3d, e). According to the nomogram, if the patient is pathologic T stage≥2 and the Rad-Score is -2, the scores will be close to 17.5 and 45, respectively, and the “total score” is 62.5 points, which corresponds to a “recurrence risk” of approximately 0.18 points. The patients were divided into a low risk recurrence group and a high risk recurrence group according to the median recurrence risk in the training set (0.12). The decision curves of different models in the cohort are shown in Fig. 3f, DCA shows that the clinical utility performance of the radiomics-clinical model is higher than that of the clinical model and the radiomics model. As shown in Fig. 4, the performance of recurrence risk stratification was quantitatively evaluated, and the comprehensive prediction performance of the radiomics-clinical model was significantly better than that of the clinical model and the radiomics model, as shown in Table 3. Based on nomogram recurrence risk stratification, Kaplan-Meier (KM) plots of RFS in the first 3 years after RC in the low risk recurrence group and high risk recurrence group were plotted, as shown in Fig. 5.

Table 2 Univariate and multivariate regression analyses of the indicators for recurrence prediction in the training set
Fig. 3
figure 3

a Nomogram constructed based on pathologic T stage and Rad-Score; ROC curves and calibration curves of the radiomics-clinical model in training b, d and validation c, e sets; f DCA for the radiomics-clinical model nomogram

Fig. 4
figure 4

ROC curves and chi-square plots of different risks and models in training a, c and validation b, d sets

Table 3 Different models evaluated the diagnostic performance of relapse risk stratification
Fig. 5
figure 5

Kaplan–Meier plots of the RFS for training set (a) and validation set (b)

4 Discussion

Recurrence after RC is one of the main factors affecting the prognosis of BCa [14]. At present, the most important method for diagnosing BCa is cystoscopy. Imaging plays an irreplaceable role in the early diagnosis and evaluation of treatment effects in patients and has obvious noninvasive advantages over endoscopy [15,16,17,18]. Pathological grade is still the most critical factor affecting the treatment and prognosis of BCa, but with the rise of radiomics applications in oncology [19], extracting high throughput information based on raw images may be able to quantify tumor heterogeneity earlier [20, 21]. Therefore, noninvasive tools for predicting recurrence after RC for BCa appear to be valuable in clinical decision-making.

To date, several relevant studies have reported the application value of CT radiomics in predicting the prognosis after BCa surgery, among which Piotr Woznicki et al. found that radiomics features based on preoperative CT scans had prognostic value in predicting overall survival before RC in a study of 301 BCa patients who underwent RC and pelvic lymphadenectomy. Among them, the AUC of the clinical model was 0.761, and the AUC of the radiomics model was 0.771, which suggests that the predictive performance of the radiomics model is comparable to that of the verified clinical parameters [11]. Qian et al. reported that radiomics features extracted from multistage CT images combined with important clinicopathological risk factors can predict the recurrence of BCa 2 years after surgery, but the recurrence of BCa after RC was not included separately in the systematic study [12, 13]. Therefore, whether CT radiomics combined with clinical factors can predict local recurrence or metastatic recurrence after RC for BCa needs to be further verified.

In this study, a radiomics-clinical nomogram model based on preoperative CT extraction of high throughput radiomics features and important clinical risk factors was initially explored for individualized BCa recurrence risk stratification after RC.

Among the important clinical factors selected, our univariate analysis showed that tumor size, pathologic T stage, and lymph node stage were strongly associated with tumor recurrence after RC in patients with BCa and that these factors influenced the prognosis of patients, consistent with previous studies [22, 23]. However, tumors are heterogeneous in individual patients, and the TNM staging system used to predict long term survival is not entirely accurate [24,25,26]. Interestingly, based on high throughput data normalization and the lasso regression algorithm, eight radiomics texture features with the highest correlation with tumor recurrence after RC in patients with BCa were finally included, including one firstorder feature, four GLSZM features, one GLCM feature, one GLDM feature and one GLRLM feature. In recent years, they have been extensively studied [27, 28]. Preliminary evidence suggests that these features may be a good characterization of histoheterogeneity and histopathological differences among patients with BCa [29]. Multivariate analysis combining tumor size, pathologic T stage, lymph node stage and Rad-Score showed that pathological T stage and Rad-Score were independent risk factors for tumor recurrence after RC. Based on both, we developed a radiomics-clinical nomogram model for recurrence risk stratification. The results showed that the AUC (95% CI) of the training and validation sets of the nomogram model was 0.840 (0.743–0.937) and 0.883 (0.777–0.989), respectively, which was significantly higher than that of the radiomics model, indicating that the radiomics- clinical model was superior to the radiomics model in predicting recurrence after RC in patients with BCa. The calibration plot confirmed that the predicted performance is reliable and valid. DCA also clearly showed that, when the risk threshold is greater than 0.1, the composite model has more net benefits than the clinical model alone or the radiomics model.

According to the nomogram, when pathologic T stage was less than 2 and Rad-Score was -0.5717195401, the minimum value reaching the predicted value was less than 0.1. When pathologic T stage was more than 2, the Rad-Score was 2.336809971, the maximum predicted value is greater than 0.9. The recurrence risk group was divided according to the median predicted value of the training set's nomogram model, that is, the median recurrence risk score. the comprehensive comparison of various factors showed that the radiomics-clinical model is still better than the other two models. Chi-square test results showed a close correlation between recurrence risk stratification and true recurrence.

KM map analysis showed that RFS of patients with different recurrence risk groups were also different. The performance of radiomics-clinical nomogram is valuable for the estimation of RFS. In summary, the nomogram scoring system of this study may serve as a more effective tool to accurately and rapidly assess the risk of recurrence after RC through simple computational methods, enabling early individualized treatment of BCa patients.

Our current research still has some unanswered questions. First, this was a single center study with a limited sample size, which requires additional multicenter data samples and external validation. Second, due to the incompleteness of the data in the single institutional database, possible predictors, such as tumor location and radical urinary diversion, were not included in this study, and subsequent studies can further analyze these in depth. In addition, biomarkers, such as WDR72 and methylation of LMX1A, have been reported to be closely associated with BCa recurrence [30, 31]. These factors were not included in this study, and we will try to build more efficient predictive models by combining clinical factors, radiomics, and important biomarkers. What's more, by repeatedly modeling and using multiple data dimensionality reduction methods to reduce overfitting, it is difficult to find an absolute subset of the selected features because it relies on the segmentation of the training and test sets. We believe that future research may be able to establish a variety of models and provide the more reliable solution for clinical diagnosis and treatment after multiple evaluations. As a result, this study still needs to be further improved before clinical application [32].

In conclusion, this study combined clinical factors and radiomics characteristics of CT to construct a composite model to predict local or metastatic recurrence of BCa after RC. These preliminary results showed that the model has good efficacy in stratifying recurrence risk, which may provide some help for the individualized treatment of BCa patients.