Comparative assessment of the capability of machine learning-based radiomic models for predicting omental metastasis in locally advanced gastric cancer

Wu, Ahao; Luo, Lianghua; Zeng, Qingwen; Wu, Changlei; Shu, Xufeng; Huang, Pang; Wang, Zhonghao; Hu, Tengcheng; Feng, Zongfeng; Tu, Yi; Zhu, Yanyan; Cao, Yi; Li, Zhengrong

doi:10.1038/s41598-024-66979-x

Comparative assessment of the capability of machine learning-based radiomic models for predicting omental metastasis in locally advanced gastric cancer

Article
Open access
Published: 13 July 2024

Volume 14, article number 16208, (2024)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Comparative assessment of the capability of machine learning-based radiomic models for predicting omental metastasis in locally advanced gastric cancer

Download PDF

Ahao Wu^1,3^na1,
Lianghua Luo^1,7^na1,
Qingwen Zeng¹,
Changlei Wu²,
Xufeng Shu¹,
Pang Huang¹,
Zhonghao Wang¹,
Tengcheng Hu¹,
Zongfeng Feng¹,
Yi Tu⁴,
Yanyan Zhu⁵,
Yi Cao¹ &
…
Zhengrong Li^1,6

307 Accesses
Explore all metrics

Abstract

The study aims to investigate the predictive capability of machine learning algorithms for omental metastasis in locally advanced gastric cancer (LAGC) and to compare the performance metrics of various machine learning predictive models. A retrospective collection of 478 pathologically confirmed LAGC patients was undertaken, encompassing both clinical features and arterial phase computed tomography images. Radiomic features were extracted using 3D Slicer software. Clinical and radiomic features were further filtered through lasso regression. Selected clinical and radiomic features were used to construct omental metastasis predictive models using support vector machine (SVM), decision tree (DT), random forest (RF), K-nearest neighbors (KNN), and logistic regression (LR). The models’ performance metrics included accuracy, area under the curve (AUC) of the receiver operating characteristic curve, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV). In the training cohort, the RF predictive model surpassed LR, SVM, DT, and KNN in terms of accuracy, AUC, sensitivity, specificity, PPV, and NPV. Compared to the other four predictive models, the RF model significantly improved PPV. In the test cohort, all five machine learning predictive models exhibited lower PPVs. The DT model demonstrated the most significant variation in performance metrics relative to the other models, with a sensitivity of 0.231 and specificity of 0.990. The LR-based predictive model had the lowest PPV at 0.210, compared to the other four models. In the external validation cohort, the performance metrics of the predictive models were generally consistent with those in the test cohort. The LR-based model for predicting omental metastasis exhibited a lower PPV. Among the machine learning algorithms, the RF predictive model demonstrated higher accuracy and improved PPV relative to LR, SVM, KNN, and DT models.

CT radiomics nomogram for the preoperative prediction of lymph node metastasis in gastric cancer

Article 29 August 2019

Development and validation of a CT radiomics and clinical feature model to predict omental metastases for locally advanced gastric cancer

Article Open access 25 May 2023

Computed Tomography-Based Deep Learning Nomogram Can Accurately Predict Lymph Node Metastasis in Gastric Cancer

Article Open access 31 July 2022

Introduction

Gastric cancer is one of the most common malignant tumors worldwide. The incidence and mortality rates of gastric cancer rank fifth and fourth globally, and in China, they rank third¹. In recent years, treatment options for gastric cancer have expanded significantly^2,3,4,5,6, including surgery, chemotherapy, targeted therapy and immunotherapy^7,8,9,10,11; but radical surgery remains the primary treatment modality for locally advanced gastric cancer (LAGC)^12,13. For LAGC patients without omental metastasis, radical gastrectomy with omentum preservation offers advantages such as shorter operating time, less intraoperative blood loss, and fewer postoperative complications^14,15. Therefore, accurate preoperative staging of gastric cancer patients is of great importance for individualized treatment.

Radiomics has been increasingly applied in gastric cancer research in recent years. Its principle involves converting medical images into numerical variables for statistical analysis¹⁶. Radiomics plays a crucial role in predicting gastric cancer lymph node metastasis, N staging, neoadjuvant chemotherapy efficacy, postoperative local recurrence, and long-term survival^17,18,19. Additionally, at the molecular level, radiomics has been employed to investigate immune cell infiltration and assess the sensitivity of immune therapy²⁰. The application of radiomics has improved preoperative staging for gastric cancer patients, contributing significantly to individualized treatment. However, even when using the same radiomic techniques, the accuracy of predictions varies across studies due to the diverse predictive methods employed by researchers. For example, in predicting gastric cancer lymph node metastasis using computed tomography (CT) radiomic features, the use of different predictive models can result in substantial variations in prediction accuracy^17,21. Consequently, there is a need to further evaluate the accuracy among various predictive models to identify the optimal model.

With the development of artificial intelligence, machine learning algorithms have been progressively developed and widely applied in the field of gastric cancer. Examples include constructing gastric cancer survival and neoadjuvant chemotherapy benefit prediction models using support vector machine (SVM)²²; predicting early gastric cancer patients’ lymph node metastasis status using decision tree (DT) models²³; predicting Chinese gastric cancer patients’ HER2 status using random forest (RF) models²⁴; constructing artificial neural network models for screening and diagnosing non-invasive gastric cancer²⁵; and applying K-nearest neighbors (KNN) algorithms to classify gastric cancer lymph node metastasis²⁶. Although the development of machine learning algorithms has significantly enhanced the accuracy of predictive models in tumor diagnosis, adjuvant therapy benefits, and survival prognosis, the predictive abilities of different machine learning models for the same diagnostic prediction are inconsistent²⁷. The selection of a suitable machine learning prediction model requires further research.

Our previous study on constructing LAGC omental metastasis prediction models demonstrated that although the predictive model for omental metastasis constructed using logistic regression (LR) had relatively high accuracy and area under the curve (AUC), the positive predictive value (PPV) was less than satisfactory²⁸. This trend was particularly evident in unbalanced binary classification studies^27,28,29. Therefore, the objective of this study is to further investigate the predictive capabilities of machine learning algorithms in LAGC omental metastasis and compare the performance of various machine learning predictive models.

Materials and methods

Patients and study design

A retrospective collection of 478 pathologically confirmed LAGC patients was conducted from April 2019 to July 2022 at the First Affiliated Hospital of Nanchang University and the Second Affiliated Hospital of Nanchang University, including clinical features and arterial phase CT images. Of these, 374 patients from the First Affiliated Hospital of Nanchang University were randomly divided into a training cohort and a test cohort at a 7:3 ratio. Additionally, 104 patients from the Second Affiliated Hospital of Nanchang University were used as a validation cohort. Inclusion criteria were as follows: (1) enhanced abdominal CT examination within 2 weeks before surgery; (2) LAGC patients underwent radical gastrectomy with omentectomy; (3) histologically confirmed primary gastric adenocarcinoma; (4) postoperative pathology confirmed as T3/T4 stage. Exclusion criteria were: (1) patients with incomplete clinical data; (2) postoperative pathology confirmed as T1/T2 stage; (3) CT images with artifacts or inadequate inflation, insufficient image quality for diagnosis; (4) neoadjuvant chemoradiotherapy before surgery. The study design flowchart is shown in Fig. 1. The Medical Research Ethics Board of the First Affiliated Hospital of Nanchang University, China, waived the informed consent in the main manuscript (The Ethics Board approval number: (2022)CDYFYYLK(08-007)). All methods were performed in accordance with the relevant guidelines and regulations.

Radiomic feature extraction and selection

Arterial phase CT images were downloaded from the imaging center of the First Affiliated Hospital and the Second Affiliated Hospital of Nanchang University and saved in DICOM format. Tumor segmentation was performed manually by two radiologists with more than 5 years of experience in abdominal radiology using 3D Slicer software (version 4.11). The segmented image files were saved in Nrrd format. Pyradiomics, a plugin in 3D Slicer, was used to extract features from all patients’ Nrrd format files. Feature selection was performed using the least absolute shrinkage and selection operator (LASSO) regression method in R software (version 4.13). The “glmnet” package was downloaded in R software. To construct the optimal LASSO regression model, we generated 200 lambda values ranging from 0 to 0.05. The lambda value corresponding to the minimum mean square error of the model is the optimal lambda. The selected features can be obtained by inputting the optimal lambda into the glmnet function.

SVM

Support vector machine (SVM) is a powerful classifier. Its principle is to create a decision boundary between two classes and then predict labels using one or more feature vectors. This decision boundary is often a hyperplane whose direction is as far away as possible from the data points closest to each class. These closest points are called support vectors. The “e1071” package was downloaded in R language. The SVM model performed hyperparameter optimization to determine the optimal cost and gamma values and construct the best SVM model.

DT and RF

We used the “rpart” package in R language for decision analysis. DT classification proceeds from the root to the leaves, using statistical measures such as the Gini index and gain ratio to describe the purity and determine the optimal split. When there is only one class of samples in the child node, the Gini index is 0, and the child node stops splitting. We first constructed an original decision tree using all the independent variables in the training cohort and then built the optimal DT based on the best node for tree post-pruning. RF is a collection of DTs that individually serve as weak classifiers but together form a robust prediction. This method randomly extracts bootstrap samples and features to build each classification and regression tree. Hyperparameters in the study included the number of trees and leaf nodes in the forest. The optimal model was selected by hyperparameter optimization.

KNN

The “knn” package was downloaded in R language, and the LAGC omental metastasis diagnostic model was constructed using the training cohort data. If the test sample is closest in feature space to k determined samples, it is considered to belong to the same class as these k samples. Too large or too small k values affect the accuracy of classification. Therefore, we performed hyperparameter optimization by setting the k-value range from 0 to 100, to construct the optimal model.

LR

The “rms” package was downloaded in R language. Stepwise regression using forward, backward, and forward–backward methods was applied to construct the LR diagnostic model in the training cohort. The best diagnostic model was selected based on the Akaike information criterion (AIC) size. To make the evaluated models more accurate and reliable, all diagnostic models underwent four-fold cross-validation. The training cohort data were divided into four parts, with one part used as the validation set. After four tests, changing the validation set each time, the average of the four model results was taken as the final result. On the other hand, the test cohort served as internal data to evaluate the reliability of the model, and an external validation cohort was used to evaluate the model. Model evaluation metrics included area under the curve (AUC) of the receiver operating characteristic (ROC) curve, accuracy, sensitivity, specificity, PPV, and negative predictive value (NPV).

Statistical analysis

Statistical analysis was performed using SPSS software (version 26.0) and R software (version 4.13). Count data were expressed as frequencies, and differences between count data were compared using the chi-square test. Normally distributed measurement data were expressed as mean ± SD and compared using the t-test for differences between normally distributed variables. Non-parametric tests were used for differences analysis of non-normal distribution and heterogeneous variance measurement data. Differences between multiple groups were compared using one-way analysis of variance. A p-value < 0.05 was considered statistically significant.

Ethics approval and informed consent

This study was approved by the Ethics Committee of the First Affiliated Hospital of Nanchang University [Ethics Committee Approval No.: (2022)CDYFYYLK(08-007)]. As this study was retrospective, the Ethics Committee of the First Affiliated Hospital of Nanchang University waived the informed consent for this study. All methods were performed according to relevant guidelines and regulations.

Results

Clinical features

Among the 478 eligible patients, 419 patients had no omental metastasis, and 59 patients had omental metastasis, with an occurrence rate of 14.1%. The training cohort consisted of 262 patients: 189 males and 79 females; 229 had no omental metastasis, and 33 had omental metastases. The test cohort consisted of 112 individuals, 83 males, and 29 females; 13 of them had omental metastases, and 99 did not. The validation cohort consisted of 104 patients, 75 males, and 29 females; 13 patients had omental metastases, and the remaining 91 patients did not. The clinical features of the training, test, and validation cohorts showed no significant statistical differences (Table 1), ensuring the reliability of the results obtained from the test and validation cohorts.

Table 1 LAGC patient characteristics in the training, test, and validation cohorts.

Full size table

Radiomic feature and clinical feature selection

We extracted 864 radiomic features from the arterial phase CT images. Detailed radiomic features are shown in Additional File 1. Features with an intra-class correlation coefficient (ICC) greater than 0.75 were considered stable features, and 548 radiomic features were ultimately selected. The ICC values for radiomic features are detailed in Additional File 2. The radiomic features of the training cohort were analyzed using LASSO regression. As the lambda value increased, the absolute value of the feature coefficients gradually decreased and eventually approached 0 (Fig. 2A). As the lambda value increased, the bias percentage first gradually decreased and then gradually increased. The optimal lambda value corresponds to the minimum bias percentage (Fig. 2B). The selected radiomic features based on the optimal lambda value were diagnostics image original mean (DIOM), original shape maximum 2D diameter slice (OSMDS), original shape maximum 3D diameter (OSMD), original first order kurtosis (OFK), wavelet LH first order kurtosis (WLFK), and wavelet HLH Gldm large dependence high gray level emphasis (WHGL). Similarly, the clinical features of the training cohort were subjected to LASSO regression to select distinct clinical features. The clinical features selected based on the optimal lambda value were CA125 and clinical N staging (Fig. 2C, D).

SVM-based predictive model

In the omental metastasis prediction model constructed using SVM, we determined that the model had the highest accuracy when the vector number was 7 (Fig. 3A). The multidimensional data of LAGC patients were converted into two-dimensional data, and omental metastasis and non-omental metastasis groups had a more distinct concentrated distribution in the two-dimensional space (Fig. 3B). In the training cohort, the AUC value of the predictive model was 0.844; sensitivity and specificity were 0.849 and 0.704, respectively; PPV and NPV were 0.292 and 0.790, respectively (Fig. 3C). In the test cohort, the AUC of the predictive model was 0.735; sensitivity and specificity were 0.769 and 0.622, respectively; PPV and NPV were 0.212 and 0.953, respectively (Fig. 3D). In the validation cohort, the AUC of the predictive model was 0.741; sensitivity and specificity were 0.833 and 0.756, respectively; PPV and NPV were 0.385 and 0.974, respectively (Fig. 3E).

DT-based predictive model

In DT, we determined the number of tree split nodes to be 3 (Fig. 4A). The variable features of the constructed prediction model were ranked according to their importance, with the top six features being clinical N staging, CA125, DIOM, WLFK, OFK, and OSMD, respectively (Fig. 4B). Based on the DT split node number of 3, we selected the top three important features to construct the DT prediction model (Fig. 4C). In the training cohort, the AUC of the predictive model was 0.759; sensitivity and specificity were 0.606 and 0.883, respectively; PPV and NPV were 0.426 and 0.940, respectively (Fig. 4D). In the test cohort, the AUC of the predictive model was 0.624; sensitivity and specificity were 0.231 and 0.990, respectively; PPV and NPV were 0.227 and 0.910, respectively (Fig. 4E). In the validation cohort, the AUC of the predictive model was 0.658; sensitivity and specificity were 0.400 and 0.917, respectively; PPV and NPV were 0.400 and 0.917, respectively (Fig. 4F).

RF-based predictive model

In the RF predictive model, when the number of trees in the model is 5, the out-of-bag (OOB) error reaches the minimum value of 0.122 (Fig. 5A). We then determined the tree split nodes, and when the number of split nodes is 6, the error reaches the minimum value of 0.318 (Fig. 5B). Additionally, we conducted feature importance analysis, and among the eight selected features, OSMD, OSMDS, and N staging played important roles in prediction accuracy. N staging, OSMDS, and OFK played important roles in reducing the Gini coefficient of the predictive model (Fig. 5C). In the training cohort, the AUC value of the predictive model was 0.995, sensitivity and specificity were 0.970 and 0.965, respectively; PPV and NPV were 0.800 and 0.995, respectively (Fig. 5D). In the test cohort, the AUC value of the predictive model was 0.750, sensitivity and specificity were 0.769 and 0.663, respectively; PPV and NPV were 0.233 and 0.956, respectively (Fig. 5E). In the validation cohort, the AUC value of the predictive model was 0.808, sensitivity and specificity were 0.750 and 0.800, respectively; PPV and NPV were 0.308 and 0.964, respectively (Fig. 5F).

KNN-based predictive model

Through the hyperparameter optimization of the KNN function, we found that the best kernel function for the predictive model was “triangular,” and the best k-value was 14 (Fig. 6A). We constructed the predictive model based on these conditions. In the training cohort, the AUC value of the predictive model was 0.759, sensitivity and specificity were 0.714 and 0.827, respectively; PPV and NPV were 0.370 and 0.952, respectively (Fig. 6C). In the test cohort, the AUC value of the predictive model was 0.797, sensitivity and specificity were 0.909 and 0.598, respectively; PPV and NPV were 0.227 and 0.980, respectively (Fig. 6D). In the validation cohort, the AUC value of the predictive model was 0.611, sensitivity and specificity were 0.500 and 0.852, respectively; PPV and NPV were 0.333 and 0.920, respectively (Fig. 6E).

LR-based predictive model

Through LR analysis, we found that when the predictive model consists of N staging, CA125, DIOM, and WHGL, the regression model has an optimal fitting state, with an AIC value of 155.24. When the predictive model removes these features or adds other features, the AIC value will increase (Fig. 6B). In the training cohort, the AUC value of the regression predictive model was 0.864, sensitivity and specificity were 0.879 and 0.774, respectively; PPV and NPV were 0.358 and 0.978, respectively (Fig. 6F). In the test cohort, the AUC value of the predictive model was 0.808, sensitivity and specificity were 0.615 and 0.888, respectively; PPV and NPV were 0.210 and 0.941, respectively (Fig. 6G). In the validation cohort, the AUC value of the predictive model was 0.782, sensitivity and specificity were 0.692 and 0.846, respectively; PPV and NPV were 0.250 and 0.953, respectively (Fig. 6H).

Comparison of predictive abilities of various models

The predictive abilities of omental metastasis models in LAGC constructed using various machine learning methods are shown in Table 2. In the training cohort (Fig. 7A), the RF predictive model had better accuracy, AUC, sensitivity, specificity, PPV, and NPV compared to LR, SVM, DT, and KNN. The RF predictive model achieved a significant improvement in PPV compared to the other four predictive models. The DT predictive model had a lower sensitivity compared to the other four predictive models, with a sensitivity of only 0.606. In the test cohort (Fig. 7B), all five machine learning-constructed predictive models had a relatively low PPV. The evaluation indicators of the DT predictive model were more significantly different compared to the other four predictive models, with a sensitivity of 0.231 and specificity of 0.990 for the DT predictive model. The LR-constructed predictive model had the lowest PPV of 0.210 compared to the other four predictive models. In the external validation cohort (Fig. 7C), the evaluation indicator results of the predictive models were generally similar to those in the test cohort.

Table 2 A comparison of omental metastatic models’ prediction ability.

Full size table

Discussion

Previous prospective OMEGA studies have shown that radical surgery for LAGC can preserve omentum without tumor metastasis³⁰. A phase II randomized controlled study demonstrated that omentum-preserving radical surgery for advanced gastric cancer has advantages such as shorter operation time and less bleeding³¹. After propensity score matching, RI et al.³² collected 263 cases of advanced gastric cancer radical surgery patients in each group, and there were no significant differences in overall survival and tumor recurrence between the omentum preservation group and the omentum resection group. Therefore, precise preoperative assessment of omental metastasis in advanced gastric cancer is of great importance for the selection of surgical methods. Our previous study constructed an LR predictive model for LAGC omental metastasis based on radiomic features and clinical features²⁸. The LR predictive model had an AUC of 0.871, sensitivity of 0.828, specificity of 0.864, PPV of 0.455, NPV of 0.974, and accuracy of 0.860. The inclusion of radiomic features significantly improved the accuracy of the omental metastasis prediction model. Although our LR predictive model had relatively high accuracy, the PPV was only 0.455, meaning that half of the patients we predicted to have omental metastasis were false positives. To this end, we further explored the predictive models of omental metastasis constructed by LR, SVM, DT, RF, and KNN machine learning methods and compared their predictive performance.

Our study results showed that in the training cohort, the overall performance indicators of the RF predictive model were superior to LR, SVM, DT, and KNN. In particular, the PPV of the RF predictive model showed significant improvement compared to the LR predictive model. Although the improvement in PPV for the RF predictive model in the test and validation cohorts was not as substantial as in the training cohort, the PPV of the RF predictive model was still better than that of the LR predictive model. In this study, the PPV of the LR predictive model was 0.358. In other binary classification studies with unbalanced samples, the LR predictive model also had a low PPV. In the prediction study of pathological complete response (PCR) after neoadjuvant chemoradiotherapy in locally advanced rectal cancer (LARC) patients, the PPV of the LR predictive model constructed by Huh et al. was only 0.255³³. In a study predicting PCR of rectal cancer using magnetic resonance imaging radiomic features, the performance indicators of the RF predictive model were significantly better than those of SVM, KNN, and LR predictive models³⁴. In their study, the PPV of the RF predictive model was 0.759, and that of the LR predictive model was 0.354. The better performance of the RF predictive model compared to the LR predictive model may be because traditional risk prediction models are based on regression models, such as LASSO regression and logistic regression. These methods have certain limitations, as they cannot analyze nonlinear relationships and are susceptible to variable selection and confounding factors^35,36. Machine learning methods provide algorithms for understanding patterns in large, complex, and heterogeneous data. Recursive partitioning, especially random forests, can handle a large number of predictive variables even in the presence of complex interactions^37,38.

Our research also found that not all machine learning predictive models have good performance in predicting omental metastasis in LAGC. While the DT predictive model demonstrated relatively high accuracy in the training, test, and validation cohorts, its sensitivity was the worst compared to LR, SVM, RF, and KNN predictive models. In the training cohort, the sensitivity of the DT predictive model was 0.606, while in the test cohort, it was 0.231, and in the validation cohort, it was 0.400. Although the DT predictive model had high accuracy, the misdiagnosis rate of omental metastasis-positive patients in the test and validation cohorts was greater than 50%. DT analysis has been proven to be useful in the clinical treatment decision-making process for disease diagnosis and survival prediction^39,40. In many DT predictive model studies, researchers only evaluated the AUC value and accuracy of the model, without further analysis of sensitivity and specificity. The reason for the low sensitivity of the DT predictive model in this study may be due to the unbalanced distribution of binary classification samples, with omental metastasis patients accounting for only 14.1% of all patients. Furthermore, the DT model is a simple and easy-to-use non-parametric classifier that does not consider the interaction between variables during the tree formation process and only considers single variables for each screening⁴¹.

The overall performance metrics of the SVM predictive model were slightly lower than those of the LR predictive model in the training cohort. In the test and validation cohorts, the evaluation metrics of the SVM and LR predictive models differed, with the LR predictive model having higher AUC and specificity than the SVM predictive model, while the SVM predictive model had higher sensitivity and PPV than the LR predictive model. The performance metrics of the KNN predictive model were generally similar to those of the LR predictive model in the training and test cohorts. However, in the validation cohort, the KNN predictive model had poorer AUC and sensitivity than the LR predictive model, with an AUC value of 0.611 and a sensitivity of 0.500. In Huang et al.’s²⁷ study using clinical features to construct LR models for predicting PCR in locally advanced rectal cancer, the PPV of the LR predictive model was 0.390, significantly lower than KNN, SVM, and other machine learning predictive models. In contrast, in Wei et al.’s³⁴ study using radiomic features to construct machine learning models for predicting PCR in rectal cancer, the LR predictive model had a higher PPV than KNN and SVM machine learning predictive models. The considerable differences between KNN, SVM predictive models, and the LR predictive model may be influenced by the original data.

In this study, we divided the data from our center into a training cohort and a test cohort, while data from other hospitals were used as an external validation cohort. There were no statistically significant differences in the basic demographic data between the cohorts, and the performance evaluation metrics of each machine learning model were generally consistent across the cohorts, ensuring the reliability and stability of each predictive model. Of course, there are certain limitations to our study. Firstly, our validation datasets were all from the same region, lacking international datasets for external validation. Secondly, although the RF predictive model improved the PPV, there was a significant decrease in the PPV of the predictive models in the test and validation cohorts. Lastly, the sample size of this study was very modest compared to studies with large samples to construct prediction models. A bigger sample of LAGC patients would be required to confirm the findings of this investigation in the future. For constructing predictive models with unbalanced binary classification samples, further research and exploration are needed to improve the PPV of the models, whether by finding new predictive models or identifying new clinical or radiomic features.

Conclusion

In constructing predictive models for LAGC omental metastasis using unbalanced binary classification samples, the LR-based model has the disadvantage of low PPV and high false-positive rates. Among the machine learning algorithms, the RF predictive model, compared to LR, SVM, and KNN models, demonstrates higher accuracy while improving the PPV of the predictive model and reducing the false-positive rate. This makes the application of the model in the clinical treatment decision-making process more practical.

Data availability

All data supporting the findings of this study are available within the paper and its Supplementary Information.

Abbreviations

LAGC:: Locally advanced gastric cancer
CT:: Computed tomography
SVM:: Support vector machine
DT:: Decision tree
RF:: Random forest
KNN:: K-nearest neighbors
LR:: Logistic regression
AUC:: Area under the curve
ROC:: Receiver operating characteristic
PPV:: Positive predictive value
NPV:: Negative predictive value
LASSO:: Least absolute shrinkage and selection operator
AIC:: Akaike information criterion
ICC:: Intra-class correlation coefficient
DIOM:: Diagnostics image original mean
OSMDS:: Original shape maximum 2D diameter slice
OSMD:: Original shape maximum 3D diameter
OFK:: Original first-order kurtosis
WLFK:: Wavelet LH first order kurtosis
WHGL:: Wavelet HLH Gldm large dependence high gray level emphasis
OOB:: Out-of-bag
PCR:: Pathological complete response
LARC:: Locally advanced rectal cancer

References

Sung, H. et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 71(3), 209–249 (2021).
Article PubMed Google Scholar
Kim, S. T. et al. Comprehensive molecular characterization of clinical responses to PD-1 inhibition in metastatic gastric cancer. Nat. Med. 24(9), 1449–1458 (2018).
Article CAS PubMed Google Scholar
Li, S. et al. Neoadjuvant therapy with immune checkpoint blockade, antiangiogenesis, and chemotherapy for locally advanced gastric cancer. Nat. Commun. 14(1), 8 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Anderson, E. et al. A comparison of clinicopathologic outcomes across neoadjuvant and adjuvant treatment modalities in resectable gastric cancer. JAMA Netw. Open 4(12), e2138432 (2021).
Article PubMed PubMed Central Google Scholar
Shitara, K. et al. Trastuzumab deruxtecan in previously treated HER2-positive gastric cancer. N. Engl. J. Med. 382(25), 2419–2430 (2020).
Article CAS PubMed Google Scholar
Scott, L. J. Apatinib: A review in advanced gastric cancer and other advanced cancers. Drugs 78(7), 747–758 (2018).
Article CAS PubMed Google Scholar
Ricci, A. D., Rizzo, A. & Brandi, G. DNA damage response alterations in gastric cancer: Knocking down a new wall. Future Oncol. 17(8), 865–868 (2021).
Article CAS PubMed Google Scholar
Guven, D. C. et al. The association between albumin levels and survival in patients treated with immune checkpoint inhibitors: A systematic review and meta-analysis. Front. Mol. Biosci. 9, 1039121 (2022).
Article CAS PubMed PubMed Central Google Scholar
Rizzo, A. et al. Third- and later-line treatment in advanced or metastatic gastric cancer: A systematic review and meta-analysis. Future Oncol. 16(2), 4409–4418 (2020).
Article CAS PubMed Google Scholar
Viscardi, G. et al. Comparative assessment of early mortality risk upon immune checkpoint inhibitors alone or in combination with other agents across solid malignancies: A systematic review and meta-analysis. Eur. J. Cancer 177, 175–185 (2022).
Article CAS PubMed Google Scholar
Santoni, M. et al. The impact of gender on the efficacy of immune checkpoint inhibitors in cancer patients: The MOUSEION-01 study. Crit. Rev. Oncol. Hematol. 170, 103596 (2022).
Article PubMed Google Scholar
Huang, C. et al. Laparoscopic vs open distal gastrectomy for locally advanced gastric cancer: Five-year outcomes from the CLASS-01 randomized clinical trial. JAMA Surg. 157(1), 9–17 (2022).
Article PubMed Google Scholar
Hyung, W. J. et al. Long-term outcomes of laparoscopic distal gastrectomy for locally advanced gastric cancer: The KLASS-02-RCT randomized clinical trial. J. Clin. Oncol. Off. J. Am. Soc. Clin. Oncol. 38(28), 3304–3313 (2020).
Article Google Scholar
Chen, M., He, F. Q., Liao, M. S., Yang, C. & Chen, X. D. Gastrectomy with omentum preservation versus gastrectomy with omentectomy for locally advanced gastric cancer: A systematic review and meta-analysis. Int. J. Surg. 96, 106176 (2021).
Article PubMed Google Scholar
Lin, H. W., Loh, E. W., Shen, S. C. & Tam, K. W. Gastrectomy with or without omentectomy for gastric cancer: A systematic review and meta-analysis. Surgery 171(5), 1281–1289 (2022).
Article PubMed Google Scholar
Lambin, P. et al. Radiomics: The bridge between medical imaging and personalized medicine. Nat. Rev. Clin. Oncol. 14(12), 749–762 (2017).
Article PubMed Google Scholar
Wang, Y. et al. CT radiomics nomogram for the preoperative prediction of lymph node metastasis in gastric cancer. Eur. Radiol. 30(2), 976–986 (2020).
Article PubMed Google Scholar
Cui, Y. et al. A CT-based deep learning radiomics nomogram for predicting the response to neoadjuvant chemotherapy in patients with locally advanced gastric cancer: A multicenter cohort study. EClinicalMedicine 46, 101348 (2022).
Article PubMed PubMed Central Google Scholar
Huang, L. et al. Computed tomography-based radiomics nomogram: Potential to predict local recurrence of gastric cancer after radical resection. Front. Oncol. 11, 638362 (2021).
Article PubMed PubMed Central Google Scholar
Sun, R. et al. A radiomics approach to assess tumour-infiltrating CD8 cells and response to anti-PD-1 or anti-PD-L1 immunotherapy: An imaging biomarker, retrospective multicohort study. Lancet Oncol. 19(9), 1180–1191 (2018).
Article CAS PubMed Google Scholar
Jiang, Y. et al. Radiomics signature on computed tomography imaging: Association with lymph node metastasis in patients with gastric cancer. Front. Oncol. 9, 340 (2019).
Article PubMed PubMed Central Google Scholar
Jiang, Y. et al. Immunomarker support vector machine classifier for prediction of gastric cancer survival and adjuvant chemotherapeutic benefit. Clin. Cancer Res. Off. J. Am. Assoc. Cancer Res. 24(22), 5574–5584 (2018).
Article CAS Google Scholar
Zhao, L. et al. Using nomogram, decision tree, and deep learning models to predict lymph node metastasis in patients with early gastric cancer: A multi-cohort study. Am. J. Cancer Res. 13(1), 204–215 (2023).
PubMed PubMed Central Google Scholar
Tian, S. et al. Prediction of HER2 status via random forest in 3257 Chinese patients with gastric cancer. Clin. Exp. Med. 23(8), 5015–5024 (2023).
Article CAS PubMed Google Scholar
Fan, Z., Guo, Y., Gu, X., Huang, R. & Miao, W. Development and validation of an artificial neural network model for non-invasive gastric cancer screening and diagnosis. Sci. Rep. 12(1), 21795 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Li, C. et al. Using the K-nearest neighbor algorithm for the classification of lymph node metastasis in gastric cancer. Comput. Math. Methods Med. 2012, 876545 (2012).
Article PubMed PubMed Central Google Scholar
Huang, C. M. et al. Machine learning for predicting pathological complete response in patients with locally advanced rectal cancer after neoadjuvant chemoradiotherapy. Sci. Rep. 10(1), 12555 (2020).
Article ADS PubMed PubMed Central Google Scholar
Wu, A. et al. Development and validation of a CT radiomics and clinical feature model to predict omental metastases for locally advanced gastric cancer. Sci. Rep. 13(1), 8442 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Peng, J. et al. Develop and validate a radiomics space-time model to predict the pathological complete response in patients undergoing neoadjuvant treatment of rectal cancer: An artificial intelligence model study based on machine learning. BMC Cancer 23(1), 365 (2023).
Article PubMed PubMed Central Google Scholar
Jongerius, E. J. et al. Role of omentectomy as part of radical surgery for gastric cancer. Br. J. Surg. 103(11), 1497–1503 (2016).
Article CAS PubMed Google Scholar
Murakami, H. et al. Short-term outcomes from a randomized screening phase II non-inferiority trial comparing omentectomy and omentum preservation for locally advanced gastric cancer: The TOP-G trial. World J. Surg. 45(6), 1803–1811 (2021).
Article PubMed Google Scholar
Ri, M. et al. Gastrectomy with or without omentectomy for cT3-4 gastric cancer: A multicentre cohort study. Br. J. Surg. 107(12), 1640–1647 (2020).
Article CAS PubMed Google Scholar
Huh, J. W., Kim, H. R. & Kim, Y. J. Clinical prediction of pathological complete response after preoperative chemoradiotherapy for rectal cancer. Dis. Colon Rectum 56(6), 698–703 (2013).
Article PubMed Google Scholar
Wei, Q. et al. External validation and comparison of MR-based radiomics models for predicting pathological complete response in locally advanced rectal cancer: A two-centre, multi-vendor study. Eur. Radiol. 33(3), 1906–1917 (2023).
Article PubMed Google Scholar
Núñez, E., Steyerberg, E. W. & Núñez, J. Regression modeling strategies. Rev. Esp. Cardiol. 64(6), 501–507 (2011).
Article PubMed Google Scholar
Strobl, C., Malley, J. & Tutz, G. An introduction to recursive partitioning: Rationale, application, and characteristics of classification and regression trees, bagging, and random forests. Psychol. Methods 14(4), 323–348 (2009).
Article PubMed PubMed Central Google Scholar
Ho, D. S. W., Schierding, W., Wake, M., Saffery, R. & O’Sullivan, J. Machine learning SNP based prediction for precision medicine. Front. Genet. 10, 267 (2019).
Article CAS PubMed PubMed Central Google Scholar
Sapir-Pichhadze, R. & Kaplan, B. Seeing the forest for the trees: Random forest models for predicting survival in kidney transplant recipients. Transplantation 104(5), 905–906 (2020).
Article PubMed Google Scholar
Lorenzo, D. et al. Prognostic factors and decision tree for long-term survival in metastatic uveal melanoma. Cancer Res. Treat. 50(4), 1130–1139 (2018).
Article PubMed Google Scholar
Romero, M. P. et al. Decision tree machine learning applied to bovine tuberculosis risk factors to aid disease control decision making. Prev. Vet. Med. 175, 104860 (2020).
Article PubMed Google Scholar
Luo, X., Wen, X., Zhou, M., Abusorrah, A. & Huang, L. Decision-tree-initialized dendritic neuron model for fast and accurate data classification. IEEE Trans. Neural Netw. Learn. Syst. 33(9), 4173–4183 (2022).
Article MathSciNet PubMed Google Scholar

Download references

Acknowledgements

We acknowledge the First Affiliated Hospital of Nanchang University and the Second Affiliated Hospital of Nanchang University for providing the CT images.

Funding

This work was supported by the leading scientist Project of the Jiangxi Science and Technology Department (20213BCJL22050), Jiangxi Provincial Natural Science Foundation (20224ACB206029), and Jiangxi Provincial Natural Science Foundation (20224BAB206063).

Author information

These authors contributed equally: Ahao Wu and Lianghua Luo.

Authors and Affiliations

Department of Digestive Surgery, Digestive Disease Hospital, The First Affiliated Hospital of Nanchang University, Nanchang, 330006, Jiangxi Province, China
Ahao Wu, Lianghua Luo, Qingwen Zeng, Xufeng Shu, Pang Huang, Zhonghao Wang, Tengcheng Hu, Zongfeng Feng, Yi Cao & Zhengrong Li
Department of Gastrointestinal Surgery, The Second Affiliated Hospital, Nanchang University, Nanchang, 330006, Jiangxi Province, China
Changlei Wu
Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, China
Ahao Wu
Department of Pathology, The First Affiliated Hospital of Nanchang University, Nanchang, 330006, Jiangxi Province, China
Yi Tu
Department of Radiology, The First Affiliated Hospital, Nanchang University, Nanchang, 330006, Jiangxi Province, China
Yanyan Zhu
Department of Digestive Surgery, Digestive Disease Hospital, The Third Affiliated Hospital of Nanchang University, Nanchang, 330006, Jiangxi Province, China
Zhengrong Li
General Surgery Department of Jiangxi Provincial People’s Hospital, Nanchang, 330006, Jiangxi Province, China
Lianghua Luo

Authors

Ahao Wu
View author publications
You can also search for this author in PubMed Google Scholar
Lianghua Luo
View author publications
You can also search for this author in PubMed Google Scholar
Qingwen Zeng
View author publications
You can also search for this author in PubMed Google Scholar
Changlei Wu
View author publications
You can also search for this author in PubMed Google Scholar
Xufeng Shu
View author publications
You can also search for this author in PubMed Google Scholar
Pang Huang
View author publications
You can also search for this author in PubMed Google Scholar
Zhonghao Wang
View author publications
You can also search for this author in PubMed Google Scholar
Tengcheng Hu
View author publications
You can also search for this author in PubMed Google Scholar
Zongfeng Feng
View author publications
You can also search for this author in PubMed Google Scholar
Yi Tu
View author publications
You can also search for this author in PubMed Google Scholar
Yanyan Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Yi Cao
View author publications
You can also search for this author in PubMed Google Scholar
Zhengrong Li
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.H.W. and L.H.L. conceived the project and wrote the manuscript. Y.Y.Z. and C.L.W. drew the ROI of CT images. X.F.S. and Z.F.F. participated in data analysis. Z.H.W. and Y.T. participated in the discussion and language editing. C.T.H. and Y.H. collected validation group data. Y.C. and Z.R.L. reviewed the manuscript. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Yi Cao or Zhengrong Li.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wu, A., Luo, L., Zeng, Q. et al. Comparative assessment of the capability of machine learning-based radiomic models for predicting omental metastasis in locally advanced gastric cancer. Sci Rep 14, 16208 (2024). https://doi.org/10.1038/s41598-024-66979-x

Download citation

Received: 16 May 2024
Accepted: 06 July 2024
Published: 13 July 2024
DOI: https://doi.org/10.1038/s41598-024-66979-x
Springer Nature Limited

Comparative assessment of the capability of machine learning-based radiomic models for predicting omental metastasis in locally advanced gastric cancer

Abstract

Similar content being viewed by others

CT radiomics nomogram for the preoperative prediction of lymph node metastasis in gastric cancer

Development and validation of a CT radiomics and clinical feature model to predict omental metastases for locally advanced gastric cancer

Computed Tomography-Based Deep Learning Nomogram Can Accurately Predict Lymph Node Metastasis in Gastric Cancer

Introduction

Materials and methods

Patients and study design

Radiomic feature extraction and selection

SVM

DT and RF

KNN

LR

Statistical analysis

Ethics approval and informed consent

Results

Clinical features

Radiomic feature and clinical feature selection

SVM-based predictive model

DT-based predictive model

RF-based predictive model

KNN-based predictive model

LR-based predictive model

Comparison of predictive abilities of various models

Discussion

Conclusion

Data availability

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation