1 Introduction

With a global incidence rate of 13.5% and a mortality rate of 6.7%, PCa ranks as the fifth leading cause of death in males worldwide [1, 2]. Imaging techniques have gone a long way in recent years, and this has major implications on PCa diagnosis, detection, screening, staging of the illness, response to therapy and individualized prediction of oncological outcomes [3,4,5]. Radiology has revolutionized the cancer diagnostic process. Imaging diagnostics provide the noninvasive and cost-effective method for scalable detection, localization, and monitoring of cancer. Imaging modalities outside radiography, such as skin pictures and colonoscopy films, are also used for screening and diagnosis [6].In the past, qualitative evaluation of imaging modalities was prioritized, but as artificial intelligence (AI) techniques for analyzing radiological characteristics have advanced, the emphasis of radiomics has shifted to quantitative data measurement from these images [7]. The therapeutic relevance of "radiomics" is also being explored and has the propensity to be extended to clinical applications by merging image characteristics and phenotypes with gene and protein signatures [8, 9].

The nuclear DNA binding protein Ki-67 is a standard for tumor staging present in all vertebrates [10]. Several nuclear cell cycle study investigations have shown that Ki-67 is only present in actively dividing (G1, S, and G2) cells and not in resting cells (G0). Therefore, Ki-67 might function as a cell proliferation marker in several malignancies [11]. Immunohistochemistry (IHC) evaluation of paraffin-embedded tissue sections for Ki-67 labeling index and percent positive frequency are predictive of clinical outcomes for individual patients. Generally speaking, a poor prognosis might be expected in clinical situations when the Ki-67 score is high [12, 13]. To aid in the prognostic assessment of individuals with PCa, prostate biopsy samples are graded according to the Gleason system [14]. A higher Gleason score (GS) indicates a more aggressive form of PCa and a worse prognosis for the patient. Pathological ratings vary from 2 to 10, with higher values suggesting more dangers and higher mortality [15]. The GSs score reflects the extent of infiltration and invasion of PCa and is a crucial reference for clinical urologists, becoming part of the TNM or Whitmore-Jewett PCa staging system to provide a prognosis. According to Mayo’s research, using GS, perineural invasion, and Ki-67 expression together is the most accurate way to predict long-term outcomes from PCa [16]. Given its cheap cost, fast evaluation, and high predictive value, Matthew.T et al. suggested that the addition of Ki-67 expression to perineural invasion and GS at biopsy should be regarded as a benchmark against which all novel biomarkers are evaluated before their introduction into clinical practice [16]. However, Ki-67 expression and GS of PCa can only be acquired by transrectal ultrasound-guided prostate biopsy (TRUS-Bx) or radical prostatectomy highlighting the need for a non-invasive and more reliable means of prognosis.

Magnetic resonance imaging (MRI) is a valuable technique for PCa radiomics research because it can identify clinical PCas, conduct lesion-targeted biopsies with high sensitivity and specificity, and provide high-quality pictures [17]. Radiomics is a method that extracts quantitative features from medical images to reveal tumor characteristics and heterogeneity. MRI-based radiomics is a helpful tool for PCa diagnosis because of its ability to predict PCa features such as GS, Prostate Imaging Reporting and Data System version 2.1 (PIRADS v2), risk class, and automated prostate segmentation inside a completely automated quality control system [18, 19]. Compared to imaging characteristics derived by subjective judgment, radiomics is more objective and may extract high-dimensional imaging features that are not visible to the human and may be connected with intratumor heterogeneity [20]. T2-WI may reveal the architecture of the prostate, as well as the lesion's location, size, shape, and extent, as well as the existence of perineural invasion or seminal vesicle invasion [21]. In general, PCa has a low signal on T2-WI, although cancer foci in the transitional zone may be difficult to differentiate from benign hyperplastic lesions. DWI may reflect the diffusion of water molecules in the tissue, and prostate cancer often exhibits a strong signal on DWI due to its high cell density and poor diffusivity of water molecules [21]. DWI may increase the detection rate and localization precision of prostate cancer, particularly in the transitional and central zones [22]. ADC is a quantitative parameter of DWI that reflects the diffusion coefficient of water molecules in tissues. ADC is inversely correlated to DWI, that is, the lower the ADC, the higher the DWI signal. ADC can help eliminate the effect of T2 signal enhancement (T2 shine-through) on DWI and improve the assessment of the malignancy degree (Gleason score) of prostate cancer [23]. It is suggested that the T2-WI, DWI, and ADC-based radiomics features can indicate the biological behavior and heterogeneity on the onset of tumor and may facilitate the application of Ki-67 expression and GS prediction in PCa.

Radiomics has been applied to predict Ki-67 expression in bladder cancer using MRI-based features [24], such as T2-weighted (T2-WI) and dynamic contrast-enhancement images. In addition, Ahmad C. et al. predicted GS of PCa patients using radiomics which included T2-WI and apparent diffusion coefficient (ADC, computed from diffusion-weighted imaging) scans of 99 PCa patients [25]. This study showed random forest (RF) classifier model was used to predict GS groups and identify the most important signature among the 41 radiomic features. It concluded that the radiomic features could be used as a non-invasive biomarker to predict the GS of PCa patients [25]. Yanjie Zhao et al. investigated whether radiological and radiomic variables from magnetic resonance imaging might predict a Ki-67 proliferative index in meningioma patients using a machine learning (ML) model. The prediction model might help treat meningioma patients by predicting their Ki-67 score. AUCs of 0.837 and 0.700 showed good prediction models [26].

To better utilize the predictive power of radiomics, we developed a radiomics prediction model through a multicenter collaboration. we adopted the radiomics to extract 318 features from T2-WI, DWI, and ADC images and used SVM and LASSO algorithms to construct radiomics signatures to preoperatively predict the Ki-67 expression and GS status and investigate their prognostic value in PCa.

2 Materials and methods

With the approval of the local institution's ethical committee, retrospective clinical-pathologic data on patients with PCa was acquired from electronic medical systems in the three centers (Center A, Gansu Provincial Hospital; Center B, The 940 Hospital of Joint Logistics Support Force of Chinese PLA; Center C, Second People's Hospital of Gansu Province). This retrospective study was also approved by the Ethics Committee of the Gansu Provincial Geriatrics Association (2022-61) and the requirement for informed consent was waived. Furthermore, our research program was patterned around a local institution’s AI model.

2.1 Participants

In our multicenter retrospective cohort analysis, 574 PCa patients with a main pathology diagnosis, between January 1 2017, and January 1, 2022, were included based on the inclusion and exclusion criteria. The inclusion criteria included the following: (a) PCa was diagnosed with histopathology and IHC; (b) All MRI scans were performed within 30 days of PCa diagnosis to exclude the confounding effects of medications on the measurements; (c) Completed prognosis information; (d) No missing information of Ki-67 and GS. The exclusion criteria included the following: (a) Inability to identify the location of the tumor in MRI (Figure S1); (b) Chemotherapy and radiotherapy were performed before pelvic MRI; (c) Lesions for which the borders were difficult to determine; (d) Incomplete clinical information.

We gathered the data from a total of 574 PCa-diagnosed patients based on the aforementioned inclusion criteria. From January 1 2017 to January 1, 2022, data from a total of 399 Center A patients were inducted into the training cohort. Furthermore, from August 2018 to May 2020, data from 97 patients with PCa from Center B and data from 78 patients from Center C were included in a testing cohort. Figure 1 depicted the flowchart for patient recruiting (Fig. 1).

Fig. 1
figure 1

Patient selection flow chart. SVM support vector machine, LASSO Least absolute shrinkage and selection operator

2.2 Prostate tumor segmentation

One radiologist (with over 5-year experience in pelvic MRI reading) and one urologist (with over 30-year experience in prostate MRI reading) segmented the region of interest (ROI) (Figure S2). If there were any objections to the findings, they would be debated until an agreement was established. Two radiologists were unaware of the prognostic information, Ki-67 expression, and GS throughout this procedure (Fig. 2). For each PCa patient, the boundaries of the tumor were drawn on each slice on the T2WI images, DWI images, and ADC images using ITK-SNAP software (version 4.0.0; http://itk-snap.org). When a patient had numerous tumors, the largest lesion was segmented for feature extraction. Volumes of interest (VOI) was created by stacking each patient's ROIs. To prevent data heterogeneity, Digital Imaging and Communications in Medicine (DICOM) which is the standard file format for the management of medical imaging information and related data were normalized and resampled to the same resolution (1 mm × 1 mm × 1 mm) before radiomics feature extraction. Patients were randomly divided into training and test sets at a ratio of 7:3.

Fig. 2
figure 2

Workflow for the radiomics process. a MR segmentation of the region of interest (ROIs) on T2-weighted images (T2WI), DWI, ADC. b Extraction of radiomics features within the defined volumes of interest (VOIs) to quality global features, texture features, and nontexture features. c Radiomics feature construction using the least absolute shrinkage and selection operator (LASSO) regression. d Predictive performance assessment with respect to discrimination, calibration, and decision curve analysis. d Predictive performance assessment with respect to discrimination, decision curve analysis, K-M

2.3 Feature extraction

After segmentation was complete, the PyRadiomics program (http://www.radiomics.io/pyradiomics.html) was used for lesion feature extraction. For feature extraction, we employed FeAture Explorer (FAE), a visualization program (Figure S3). It uses Python, NumPy, pandas, and scikit-learning modules. FAE can extract image characteristics, preprocess the feature matrix, create alternative models automatically, and assess them with standard clinical statistics [27]. First-order, form, and texture features make up the raw feature classes. Due to a single MRI sequence containing 107 features, mp-MRI produced 321 features in total for this study. In total 321 radiomics features were recovered from the axial T2WI, DWI, and ADC images. All patients' radiomics characteristics were normalized using the Z-score [(x –μ)/σ]. x is the radiomics feature value, is the mean of the feature values, and is the standard deviation of the feature values and was computed using the training set.

2.4 Feature selection

Z-score was used to standardize the dataset. Normalization is the process of subtracting the mean value and dividing the standard deviation for each characteristic. In the course of developing the histology classification model for this research, we up-sampled the positive and negative samples to reduce the imbalance in the classification training set data. To decrease the dimensions of the row space of the feature matrix, we used a Pearson Correlation Coefficient (PCC) for every pair of features [26]. If the PCC was more than 0.9, we randomly eliminated one of them. Multiple iterative calculations enable classifiers to determine the optimal feature combination by evaluating the importance of features and determining the optimal feature combination. Therefore, recursive feature elimination (RFE) based on LASSO and SVM was used to find the optimal feature combination based on accuracy step by step. To determine the hyper-parameter (e.g., the number of features) of model, we applied cross-validation with fivefold on the training data set. The hyper-parameters were set according to the model performance on the validation data set.

2.5 Evaluation of Ki-67 and Gleason score

After surgical resection or TRUS-Bx, IHC was performed on PCa samples for assessment of the Ki-67 and GS within 7 days. Ki-67 expression was detected using Anti-Ki-67 Rabbit pAb (GB111499) according to the manufacturer's procedure. Two independent pathologists who were blind to prognostic and clinical information assessed the immunoreactivity for Ki-67 based on the proportion of Ki-67-positive cells among 1000 cells randomly picked from each section. Similarly, GS was based on the morphology of cancer cells in Hematoxylin and eosin stain (HE) prostate puncture biopsies or resection specimens. After analyzing the tissue samples, the pathologist then assigned a grade to the observed patterns of the tumor specimen:(a) Primary grade—assigned to the dominant pattern of the tumor (has to be greater than 50% of the total pattern seen). (b)Secondary grade assigned to the next-most frequent pattern (has to be less than 50%, but at least 5% of the pattern of total cancer observed). The pathologist then summed the pattern number of the primary and secondary grades to obtain the final GS. The GS in this study were all total scores. Based on studies relating to the prognosis of PCa [1, 27,28,29,30,31], the following criteria were applied: high Ki-67 expression group (> 10% cells stained) and low Ki-67 (≤ 10% cells stained) expression group; high GS group (> 7) and low GS group (≤ 7).

2.6 Model evaluation

Calculated the radiomics score(rad-score) for each sample in the classification model [28, 29]. Applied the sigmoidal function to convert the rad-scores into the probability of high Ki-67 expression and high GS scores P (The value range was 0 ~ 1). The receiver operating characteristic (ROC) curves and the area under the curve measured the diagnostic efficacy of the models (AUC). Based on the truncation value that maximizes the Youden index value to calculate the diagnostic compliance, sensitivity, and specificity of the model. Moreover, the study of decision curves (DCA) demonstrated the clinical net gains offered by the prediction model.

2.7 Statistical analysis

SPSS 23.0 (SPSS, Armonk, NY, USA) and R statistical software (version 3.6.1 R, https://www.r-project.org/) were used for statistical analysis. The R modules utilized in this investigation were indicated in the Supplemental information (Tables 1, 2). The clinical features of the training and validation sets were compared using the appropriate Student's t-test, Chi-square test, or Mann–Whitney U test. The Kaplan–Meier and log-rank tests were conducted across two groups, which were determined by the IHC-based Ki-67 expression status and the radiomics-predicted Ki-67 expression status, respectively. All tests were 2-tailed, and a P < 0.05 was considered as statistically significant.

Table 1 Clinical characteristics and Ki-67 in the training cohort and testing cohort
Table 2 Clinical characteristics and GS in the training cohort and testing cohort

3 Result

3.1 Clinical characteristics

This multicenter retrospective analysis comprised 574 participants with a mean age of 72 ± 7 years and a range of 45–90 years. Following the study's inclusion and exclusion criteria, we eliminated 404 participants and included information from 170 patients. There were 98 cases with GS > 7 and 72 cases with GS ≤ 7 included.

After eliminating 54 samples with missing Ki-67 expression information from the enrolled data, a total of 115 samples were used in the development of the Ki-67 prediction model, comprising 68 instances with KI-67 > 10% and 47 cases with Ki-67 10%. Age, bone metastasis, BMI, and T-stage statistics were not significantly different between the two groups (Tables 1, 2). Ki-67 and GS are independent risk factors affecting the prognosis of prostate cancer patients (Tables S1, S2).

3.2 Feature selection and construction of radiomics signatures

The results of feature extraction and construction of the prediction model are as follows:

  1. (a)

    We performed feature extraction after feature outlining of the incorporated MRI images. Types of features extracted included the shape, gray level co-occurrence matrix (GLCM), histogram (intensity-based or first-order), gray level size zone matrix (GLSZM), and level run length matrix (GLRLM). 321 features were extracted from each PCa ROI.

  2. (b)

    A total of 321 features were retrieved from the three levels of T2-WI, DWI, and ADC, and after deleting individual faulty fields, 318 features were used in the subsequent analysis. In the process of building the histological classification model in this study, to eliminate the imbalance of the classification training set data (48/33 ratio of high expression to low expression cases), we balanced the positive and negative samples by up-sampling.

  3. (c)

    The Z-score was used to normalize the feature matrix, and due to the high spatial dimensionality of the extracted features, PCC was used to dimensionally reduce the data. The feature vectors of the transformed feature matrix were independent of each other, and the RFE algorithm was used to perform feature selection and rank the features before building the model. The top 15 features were chosen as the best feature subset for the Ki-67 classification model, while the top 9 features were chosen for the GS classification model (Fig. 3). Feature-chosen heat maps. The maps' hues indicated the relative importance of the highlighted elements. Positive cases, or those with elevated Ki-67 expression, were often a deeper hue. This exhibited the features' capacity to differentiate patients based on their biological properties (Fig. 4).

  4. (d)

    SVM and LASSO were chosen as the images used by the classifier to construct the classification models for Ki-67 and GS, respectively. For imaging radiomics modeling and testing, the corresponding samples were separated into training and test sets (81/34, 119/51) in a 7:3 ratio. To define the model parameters, a 5-folder cross-validation was performed to test the model's classification performance, and the optimal model parameters were ultimately chosen. The final Ki-67 and GS classification models for PCa were developed by linearly combining the best features with their respective coefficients.

Fig. 3
figure 3

The inbuilt feature importance in the SVM (a) and LASSO (b) the development of the LASSO in training set (c)

Fig. 4
figure 4

Heat maps of the selected features. The color of the maps represented the value of the selected features. The color of positive cases or high Ki-67 expression cases was generally darker. This proved the ability of the features themselves to distinguish the biological characteristics of patients

3.3 Performance of prediction models

The optimal classification model for the Ki-67 employs SVM as the classifier, and its AUCs on the training and validation sets were 0.884 and 0.793, respectively (Fig. 5a, b). The GS optimal classification model employs LASSO as the classifier, and its AUCs on the training and validation sets were 0.827 and 0.813 respectively (Fig. 5c, d) (Tables 3, 4). In models, the estimated probabilities for each sample were considerably greater for the high-expression and high-risk groups in the training and validation sets than for the low-expression and low-risk groups (Figs. 6, 7). DCA demonstrated that the imaging histology characteristics had a positive net clinical effect in both datasets (Fig. 8).

Fig. 5
figure 5

The performance of models. The ROC curves of SVM in training sets (a) and validation sets (b).The ROC curves of Lasso in training sets (c) and validation sets (d). ROC receiver operating curve, AUC area under the ROC curve

Table 3 Clinical statistics (GS) in the diagnosis
Table 4 Clinical statistics (Ki-67) in the diagnosis
Fig. 6
figure 6

Violin plot. Violin plot of probability of SVM model based on training set (a) and validation set (b); Violin plot of probabilities of a LASSO model based on training set (c) and validation set (d)

Fig. 7
figure 7

The predicted value for each sample in the models. The predicted value for each sample in the training set (a) and validation set (b) (SVM); The predicted value for each sample in the training set (c) and validation set (d) (Lasso)

Fig. 8
figure 8

Decision Curve Analyses. DCA for radiomics signature in SVM-training (a) and validation sets (b); DCA for radiomics signature in Lasso-training (b) and validation sets (d); DCA: decision curve analyses

3.4 Prognosis

3.4.1 Relationship between Ki-67 expression status and OS

  1. (a)

    In the Ki-67 classification model study, a total of 55 patients (47%) experienced tumor progression-related events. OS was significantly lower in IHC-Ki-67-high-expression patients than in IHC-Ki-67-low expression patients within both the training and validation groups (Fig. 9a, b; P < 0.05)

  2. (b)

    The SVM model's relationship between OS and Ki-67 expression predicted by Radiomics showed that PCa patients with Rad-high-Ki-67-expression had significantly lower OS than patients with Rad-low-Ki-67-expression in both the training and validation groups (Fig. 9c, d; P < 0.05).

Fig. 9
figure 9

Kaplan–Meier OS Curves. Prognostic value of the IHC-based Ki-67 expression status and radiomics-predicted Ki-67 expression status based on the SVM model. Kaplan–Meier OS curves for patients grouped by IHC-based Ki-67 expression status in training sets (a) and validation sets (b). Kaplan–Meier OS curves for patients grouped by radiomics-predicted Ki-67 expression status in training sets (c) and validation sets (d). Prognostic value of the Pathology-based gleason and radiomics-predicted gleason based on the SVM model. Kaplan–Meier OS curves for patients grouped by Pathology-based gleason in training sets (e) and validation sets (f). Kaplan–Meier OS curves for patients grouped by radiomics-predicted gleason in training sets (g) and validation sets (h). IHC immunohistochemistry, OS disease-free survival

3.4.2 Relationship between Gleason score and OS

  1. (a)

    In the GS classification model study, a total of 86 patients (51%) experienced tumor progression-related events. OS was significantly lower in patients with IHC-GS-high-expression than in patients with IHC-GS-low expression in both the training and validation groups (Fig. 9e, f; P < 0.05)

  2. (b)

    The SVM model's relationship between OS and GS predicted by Radiomics showed that PCa patients with Rad-high-GS-expression significantly lower OS than Rad-low-GS-expression in both the training and validation group (Fig. 9 g, h; P < 0.05)

4 Discussion

In this multicenter retrospective study, we constructed and validated MRI-based radiomics signatures for the prediction of Ki-67 expression status and GS in PCa. To remove the imbalance of the training data set, we used up-sampling by repeating random cases to make positive/negative samples balance. We applied the normalization on the feature matrix. Without data balancing, the predictive performance of radiomics signatures was inadequate, with obviously low specificity. After data balancing by the up-sampling, the synthesized performance of radiomics signatures was further improved, indicating that data balancing contributes to constructing more powerful prediction models. For the Ki-67 classifier, we settled on a set of 15 features, while for the GS classifier, we narrowed it down to a set of 9. Prediction of Ki-67 expression status and the GS was best for the LASSO and SVM in this investigation, and DCA showed that this model produced good clinical net benefit. Therefore, radiomics based on MRI might aid in predicting the Ki-67 expression and the GS in PCa.

Due to the rising uses of radiomics in oncology and its efficacy in differentiating malignancies, which is difficult with conventional radiologic interpretations, the use of radiomics is interesting [30,31,32]. Radiomics illuminates the invisible link between diseased outcomes and extracted variables, hence facilitating clinical decision-making [20]. Radiomics postulates that the most important information is included in the spatial distribution of voxel intensities [28]. Indeed, our preliminary results demonstrated that four dominant radiomics categories from T2-WI and DWI of the selected texture 318 features could quantify intratumor heterogeneity, and that these categories were consistent with those from prior studies that reported the relevant features that are predictors for staging PCa aggressiveness [29, 33]. With the use of more potent algorithmic models, the top radiomic characteristics presented in this research may aid in giving more accurate predictions. Moreover, by exploring similarities or differences in tumor heterogeneity genes at the microscopic level, several key factors are provided for the development of improved radiomics models prediction for hallmarks of PCa, proliferation, and aggressiveness.

Computer-aided diagnostics (CAD) is a system that employs ML algorithms to analyze imaging and/or non-imaging medical data and assess the patient’s condition, which can be used to assist physicians in decision-making [34]. RF, SVM, and LASSO algorithms are the CAD tools commonly used to classify resampled instances to analyze medical images from picture databases [35]. In this work, we tested SVM, LDA, RF, and LASSO to find the best model, and finally selected SVM and LASSO for predicting Ki-67 expression and GS in the data based on their higher predictive accuracy. This was also an exploration of the application of CAD. As clinical urologists, we are more interested in the algorithmic maturity of ML or DL, as well as the scope and applicability of clinical applications. The SVM and LASSO models that we have chosen from the results of the existing studies have different application characteristics in the CAD of PCa. SVM has the best accuracy in distinguishing GS in image augmentation [36], the ability to handle high-dimensional data, and good generalization ability [37, 38]. However, it also has drawbacks such as high training and testing costs, sensitivity to parameters, and difficulty in interpretation [39]. On the other hand, the advantage of LASSO is that it can extract features and regularize radiomics data simultaneously to prevent overfitting, and it is suitable for correlated high-dimensional data [40, 41]. However, it may lose some important features and be sensitive to outliers. In contrast, direct comparisons between different ML or DL algorithms are scarce in terms of available research. Most of the above-mentioned studies did not compare their classifiers, so a conclusion about the best algorithm classifier to be used for the study of radiomics in PCa cannot be drawn at this time.

Our predictive models built on machine learning SVM and LR-Lasso classifiers are suitable for processing small sample data with high classification accuracy and are particularly suitable for solving dichotomous classification problems. The models are relatively simple and easy to understand and implement but may require fine-grained feature engineering and tuning of parameters. At the same time, the data requirements are relatively relaxed and can handle problems such as classification, regression, and anomaly detection, provided that the data quality is good. In contrast, deep learning models, such as VGG and Resnet models [42, 43], are generally used for image classification and target detection tasks in large-scale datasets and usually require more computational resources and time for multi-classification problems [44]. At the same time, their requirements for data quality are higher and require normalization and pre-processing, which may otherwise have an impact on the effectiveness of the models. In conclusion, machine learning models and deep learning models each have their own characteristics and are suitable for different scenarios and problems [45]. The correct choice of the appropriate algorithm requires a comprehensive consideration of the actual task and dataset characteristics, including data volume, data nature, classification accuracy requirements, and computational resources, to achieve the best classification results [46].

Our results corresponded with the prognosis of PCa patients has been the final argument. In previous retrospective clinical research, Ki-67 and GS showed a strong association with the prognosis of patients with PCa [10, 47]. Our findings are identical to those of previous research, but we emphasize the non-invasive, repeatable, and user-friendly character of radiomics. In this study, we aimed to investigate the association among PCa patient Ki-67 expression status, the GS, and survival outcomes (5-year OS) predicted by radiomics using the LASSO and SVM. Our model predicted high Ki-67 expression, high GS in PCa patients with considerably reduced Ki-67 expression (Fig. 9c, d), and low GS in patients with OS (Fig. 9g, h), as shown by the data. The foregoing findings may imply that the model we built may directly predict the prognosis of patients using MRI data, therefore providing urologists with more important information for clinical decision-making.

The present study has some limitations, partly due to the exploratory nature of assessing the feasibility of the new application domain. (a) First, this is a retrospective multicenter analysis with a relatively small sample size. A prospective multicenter study with a larger sample size is needed to validate our results for future clinical applications. (b) The complementary role of radiomics signatures in the established predictive model is worth exploring, which will be our future direction to provide a more robust model. (c)In this study, we classified patients with a GS (Gleason score) of 4 + 3 as low risk. However, it is important to note that the prognosis for these patients is significantly worse compared to those with a GS of 3 + 4, placing them in the high-risk group from a clinical perspective. This discrepancy can be attributed mainly to our limited sample size. If we had categorized these patients as high risk, the disparity in the number of positive and negative samples (83/36) would have been too substantial, significantly impacting the reliability of our results. (d) Although the findings are promising and the models performed well, the inherent uncertainty should be considered before applying this information in clinical settings.

5 Conclusion

Initial findings indicate that radiomic features derived from MRI imaging are a reliable tool for predicting Ki-67 overexpression and high GS. The radiomic properties performed well when examined for discrimination, calibration, and clinical utility. The proposed noninvasive technique has the potential to enhance clinical decision making for individuals with PCa.