Identification of histopathological classification and establishment of prognostic indicators of gastric adenocarcinoma based on deep learning algorithm

Wang, Zhihui; Peng, Hui; Wan, Jie; Song, Anping

doi:10.1007/s00795-024-00399-8

Original Paper
Open access
Published: 01 August 2024

Medical Molecular Morphology (2024)

Zhihui Wang¹,
Hui Peng²,
Jie Wan³ &
…
Anping Song^2,4

Abstract

The aim of this study was to establish a deep learning (DL) model that predicts the pathological type of gastric adenocarcinoma from whole-slide images (WSIs). We downloaded 356 histopathological images of patients with gastric adenocarcinoma (STAD) from The Cancer Genome Atlas database and randomly divided them into a training set, a validation set and a test set (8:1:1). Additionally, 80 H&E-stained WSIs of STAD were collected for external validation. The CLAM tool was used to tile the WSIs and to build the DL model, which achieved an accuracy of over 90% in identifying histopathological subtypes. External validation showed that the model retained a degree of generalization ability. Moreover, DL features were extracted from the model to investigate differences in immune infiltration and patient prognosis between the two subtypes. The DL model can accurately predict the pathological classification of STAD patients and may provide a reference for clinical diagnosis. The nomogram combining the DL-signature, gene-signature and clinical features can be used as a prognostic classifier for clinical decision-making and treatment.

Introduction

Gastric adenocarcinoma (STAD) accounts for more than 95% of all gastric malignancies and is a leading cause of cancer-related death [1, 2]. According to the Global Cancer Statistics Report 2020, there were about 1.07 million new cases of STAD, accounting for 5.6% of all new cancer cases and ranking fifth in incidence, and about 769,000 deaths, accounting for 7.7% of all cancer deaths and ranking fourth in mortality [3]. Patients with early gastric cancer who undergo radical surgery and subsequent chemotherapy have a 5-year survival rate of 90% after surgery [4]. However, more than half of STAD patients are initially diagnosed at an advanced stage, and the 5-year overall survival (OS) rate of STAD is less than 30% [5]. The tumor microenvironment (TME) denotes the non-cancerous cells and components present in the tumor, including the molecules they produce and release. As an important component of the TME, tumor-infiltrating immune cells (TIIC) are associated with the promotion or inhibition of tumor growth [6, 7]. Therefore, early detection and appropriate treatment are important ways to reduce the mortality of STAD patients, while understanding the degree of immune cell infiltration in different subtypes of gastric cancer can help guide the administration of relevant immunotherapy.

The spatial characteristics of different tissues in histopathological images play an important role in the diagnosis and prognosis of cancer [8,9,10]. Traditionally, pathologists have identified and distinguished different pathological types of STAD by visual examination of hematoxylin and eosin (H&E)-stained histopathologic sections. However, this method is labor-intensive, tedious, and time-consuming, and diagnostic accuracy is negatively affected by the acute shortage of pathologists and heavy diagnostic workloads [11]. To overcome these limitations, researchers have turned to deep learning (DL) based approaches that use neural networks to automate and enhance the analysis of histopathology images [12,13,14]. CLAM (Clustering-constrained Attention Multiple Instance Learning) is a DL-based weakly supervised method that uses attention-based learning to automatically identify subregions of high diagnostic value and so classify the whole slide, while also using instance-level clustering over the representative regions identified to constrain and refine the feature space [15]. In practical applications, CLAM has shown strong performance, particularly in medical image analysis. For instance, in cancer detection and classification tasks on pathology images, CLAM identifies cancerous regions by incorporating clustering constraints, enhancing diagnostic accuracy and efficiency [16, 17].

Recently, DL methods based on histopathological images have shown great potential for the rapid detection of adenocarcinoma in gastric biopsy and resection specimens. They exhibit high sensitivity and specificity and could benefit future diagnostic pathology workflows. In addition, they enable accurate segmentation of STAD regions, which supports further analysis and translational research [18, 19]. Studies have shown that DL algorithms applied to H&E-stained slides can predict microsatellite instability in STAD as well as specific mutations in other cancers [20, 21]. Iizuka et al. used convolutional neural networks (CNNs) and recurrent neural networks (RNNs) to classify biopsy histopathology WSIs of the stomach with an accuracy of more than 90% [22]. Huang et al. designed a CNN-based model, Gastro-MIL, for accurate diagnosis of STAD directly from digital H&E-stained images. The model has an accuracy of 92%, which is comparable to the discrimination ability of professional pathologists [23]. These DL methods hold promise for improving the accuracy and efficiency of STAD diagnosis and classification. Automated analysis of H&E-stained histopathological images can reduce interobserver variability and provide more objective and reproducible results. In addition, it may reveal novel features of specific pathological subtypes, thereby contributing to the development of more targeted therapeutic strategies.

In this study, we created a DL model to classify adenocarcinomas and mucinous adenocarcinomas from histopathological images to support conventional histopathology diagnosis by expert pathologists. The TCGA-STAD cohort was used for training, followed by validation using an external independent dataset to further illustrate the generalization ability of the model. In addition, STAD patients were successfully classified into two subtypes with different molecular characteristics based on DL features combined with transcriptome datasets, which further explored the pathogenic mechanism at the genome scale.

Materials and methods

Patient cohorts

Clinical characteristics and mRNA sequencing data of 356 STAD patients were acquired from The Cancer Genome Atlas (TCGA) (https://portal.gdc.cancer.gov/). The 356 WSIs of STAD obtained from the TCGA-STAD cohort comprised 322 cases of adenocarcinoma and 34 cases of mucinous adenocarcinoma. Scanned slides with extensive labeling over the tissue area, damaged slides, and slides that did not contain tumor were excluded, and only one sample was selected per patient. In addition, 80 H&E-stained WSIs of STAD were obtained from Shanghai Zhuoli Biotech Company (Shanghai, China) and used for external validation. Ethical approval for the validation cohort was obtained from the Tongxu County People's Hospital, Henan, China. The TCGA database is publicly available for research and therefore does not require ethical approval.

DL feature extraction and selection

Because WSIs are very large (typically 100,000 × 80,000 pixels), each WSI was cropped into many patches. Tissue regions were exhaustively split into non-overlapping patches of 256 × 256 pixels at 20× magnification using the OpenSlide library in Python. Feature vectors were extracted by feeding the 256 × 256 patches to a modified ResNet50 model pre-trained on ImageNet. Each patch was output as a 1024-dimensional feature vector by applying adaptive mean-spatial pooling after the third residual block of the ResNet50 model.
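
The tiling step can be illustrated with a minimal Python sketch. It only enumerates the top-left corners of non-overlapping 256 × 256 patches and filters them with a caller-supplied tissue test. The function names `patch_grid` and `tissue_patches` and the toy tissue rule are assumptions made for illustration and are not taken from the CLAM code. In a real pipeline each coordinate would be handed to a slide reader such as OpenSlide's `read_region`.

```python
from typing import Callable, Iterator, List, Tuple

PATCH_SIZE = 256  # patch edge length in pixels, as stated in the text


def patch_grid(width: int, height: int, patch: int = PATCH_SIZE) -> Iterator[Tuple[int, int]]:
    """Yield top-left (x, y) corners of non-overlapping patches that fit fully inside the slide."""
    for y in range(0, height - patch + 1, patch):
        for x in range(0, width - patch + 1, patch):
            yield x, y


def tissue_patches(
    width: int,
    height: int,
    is_tissue: Callable[[int, int], bool],
    patch: int = PATCH_SIZE,
) -> List[Tuple[int, int]]:
    """Keep only patches whose centre pixel is tissue according to `is_tissue` (hypothetical rule)."""
    half = patch // 2
    return [
        (x, y)
        for x, y in patch_grid(width, height, patch)
        if is_tissue(x + half, y + half)
    ]


# Toy slide of 1024 x 768 pixels in which only the left half counts as tissue.
coords = tissue_patches(1024, 768, lambda cx, cy: cx < 512)
print(len(coords), "tissue patches, first corner at", coords[0])
```

Under these assumptions a 100,000 × 80,000 pixel slide yields at most 390 × 312 = 121,680 candidate patches before tissue filtering.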

DL models

The 356 WSIs of STAD were randomly divided into a training set (80%), a validation set (10%) and a test set (10%), and CLAM was used to build a DL model for histopathological typing of STAD; its robustness was further estimated in the external validation set. Training is weakly supervised: each WSI is assigned a slide-level label indicating whether it belongs to adenocarcinoma or mucinous adenocarcinoma. Throughout training and inference, the model uses an attention-based pooling function to aggregate patch-level features into slide-level representations for classification. The model examines and ranks each patch within the tissue regions of the WSI, assigning each patch an attention score that reflects its contribution to the slide-level classification of a specific category. By leveraging attention-based learning, the model can identify and aggregate regions of high diagnostic significance, thereby providing a slide-level classification for each WSI. Training was performed using a tenfold Monte Carlo cross-validation strategy. Performance was further assessed using the area under the curve (AUC) of the receiver operating characteristic (ROC) curve.
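
Attention-based pooling can be sketched in a few lines of plain Python. The example below uses the simple, non-gated attention form (a score w · tanh(V h) per patch, followed by a softmax), whereas CLAM itself uses a gated variant with one attention branch per class. The parameters here are random and untrained, and all names and toy dimensions are assumptions chosen only for illustration.

```python
import math
import random
from typing import List, Sequence, Tuple


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of raw scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(
    patches: Sequence[Sequence[float]],  # N patch feature vectors of length D
    V: Sequence[Sequence[float]],        # H x D projection matrix
    w: Sequence[float],                  # length-H scoring vector
) -> Tuple[List[float], List[float]]:
    """Return (attention weights, slide-level vector) for one slide."""
    raw = []
    for h in patches:
        hidden = [math.tanh(sum(v * x for v, x in zip(row, h))) for row in V]
        raw.append(sum(wj * zj for wj, zj in zip(w, hidden)))
    weights = softmax(raw)
    dim = len(patches[0])
    # The slide representation is the attention-weighted sum of patch features.
    slide = [sum(a * h[d] for a, h in zip(weights, patches)) for d in range(dim)]
    return weights, slide


rng = random.Random(0)
n_patches, feat_dim, hidden_dim = 5, 8, 4  # toy sizes; the study uses 1024-dimensional features
patches = [[rng.gauss(0, 1) for _ in range(feat_dim)] for _ in range(n_patches)]
V = [[rng.gauss(0, 0.5) for _ in range(feat_dim)] for _ in range(hidden_dim)]
w = [rng.gauss(0, 0.5) for _ in range(hidden_dim)]

attention, slide_vector = attention_pool(patches, V, w)
print("attention weights:", [round(a, 3) for a in attention])
```

The weights sum to one, so each can be read as the share of the slide-level representation contributed by that patch.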

Attention map generation

CLAM can generate interpretable heat maps that enable an intuitive analysis of the relative contribution of each tissue region to the model prediction for each WSI [15]. These heat maps give pathologists insight into the histological and cytological features that carry high predictive value. To capture the relative importance of different regions of a slide for the model's slide-level prediction, we calculated and saved the unnormalized attention scores of all patches extracted from the slide, using the attention branch corresponding to the model's predicted category. The attention score learned by CLAM for each patch was converted into a percentile. For each WSI, the percentiles were then normalized to [0, 1], with 1 being the most predictive and 0 the least informative. The normalized scores were converted to RGB colors using a color map and overlaid on their respective spatial locations in the slide, so that areas of high attention are displayed in red and areas of low attention in blue.
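
The conversion from raw attention scores to heat-map colors can be sketched as follows. This is a minimal illustration, assuming no tied scores and a plain linear blue-to-red ramp; the function names and the color ramp are hypothetical and do not reproduce the color map used by CLAM.

```python
from typing import List, Sequence, Tuple


def percentile_scores(raw: Sequence[float]) -> List[float]:
    """Map raw attention scores to percentile ranks scaled to [0, 1] (ties not handled)."""
    n = len(raw)
    if n == 1:
        return [1.0]
    order = sorted(range(n), key=lambda i: raw[i])
    scaled = [0.0] * n
    for rank, idx in enumerate(order):
        scaled[idx] = rank / (n - 1)  # lowest score -> 0.0, highest score -> 1.0
    return scaled


def to_rgb(score: float) -> Tuple[int, int, int]:
    """Linear ramp from blue (score 0) to red (score 1)."""
    return (round(255 * score), 0, round(255 * (1.0 - score)))


raw_attention = [-2.1, 0.3, 5.0, 0.9]  # toy unnormalized scores for four patches
normalized = percentile_scores(raw_attention)
colors = [to_rgb(s) for s in normalized]
for score, color in zip(normalized, colors):
    print(f"{score:.2f} -> {color}")
```

Because percentiles depend only on rank, a single extreme raw score cannot wash out the contrast among the remaining patches.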

Unsupervised cluster of DL features

Least absolute shrinkage and selection operator (LASSO) analysis was used to select the most useful DL features among the 1024 features, and the optimal value of the penalty parameter λ was determined by tenfold cross-validation. The importance of each feature was evaluated by its weight coefficient: the larger the absolute value of the coefficient, the more important the feature. To gain more insight into the molecular mechanism of STAD, we performed an unsupervised clustering analysis to identify subgroups with similar patterns based on DL features, using the k-means algorithm in the R package "ConsensusClusterPlus". Kaplan–Meier (K-M) curves were used to compare the prognosis of subgroups defined by DL features.
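
How a LASSO penalty selects features and ranks them by absolute coefficient can be shown with a small coordinate-descent sketch. It fits a linear model without intercept on synthetic data with a fixed λ; the outcome model, the data, the value of λ and the function names are all assumptions for illustration, and the sketch does not reproduce the cross-validated procedure used in the study.

```python
import random
from typing import List, Sequence


def soft_threshold(z: float, lam: float) -> float:
    """Shrink z toward zero by lam; values within [-lam, lam] become exactly zero."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0


def lasso_cd(X: Sequence[Sequence[float]], y: Sequence[float], lam: float, n_iter: int = 200) -> List[float]:
    """Minimise (1/2n)*||y - X b||^2 + lam*||b||_1 by cyclic coordinate descent."""
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    resid = list(y)  # residual y - X b, with b initialised to zero
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            # Correlation of feature j with the partial residual that excludes feature j.
            rho = sum(X[i][j] * (resid[i] + X[i][j] * beta[j]) for i in range(n)) / n
            new = soft_threshold(rho, lam) / col_sq[j]
            delta = new - beta[j]
            if delta != 0.0:
                for i in range(n):
                    resid[i] -= X[i][j] * delta
                beta[j] = new
    return beta


rng = random.Random(0)
n, p = 100, 8
X = [[rng.gauss(0, 1) for _ in range(p)] for _ in range(n)]
# Synthetic outcome that depends only on features 0 and 3.
y = [2.0 * row[0] - 1.5 * row[3] + rng.gauss(0, 0.1) for row in X]

beta = lasso_cd(X, y, lam=0.3)
# Rank the retained features by absolute coefficient, largest first.
selected = sorted((j for j in range(p) if beta[j] != 0.0), key=lambda j: -abs(beta[j]))
print("selected features:", selected)
```

In this toy setting the penalty sets the coefficients of the uninformative features to exactly zero, which is the property that allows LASSO to reduce a large feature set to a short list.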

Transcriptome analysis of different histopathological subtypes

Identification of differentially expressed genes (DEGs)

The “limma” package in R was used to screen DEGs between histopathological subtypes. Genes with an adjusted p value < 0.05 and |log2 fold change| > 0.5 were considered differentially expressed. The “ggplot2” package in R was used to draw the volcano plot for the two groups. The significant DEGs were further screened by LASSO regression analysis.
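
The screening rule can be written out explicitly. The sketch below assumes that the adjusted p value is a Benjamini-Hochberg adjustment, which the text does not state, and applies the two thresholds to made-up values; gene names, p values and fold changes are hypothetical.

```python
from typing import List, Sequence


def bh_adjust(pvals: Sequence[float]) -> List[float]:
    """Benjamini-Hochberg adjusted p values, returned in the input order."""
    n = len(pvals)
    order = sorted(range(n), key=lambda i: pvals[i], reverse=True)
    adjusted = [0.0] * n
    running_min = 1.0
    for k, idx in enumerate(order):
        rank = n - k  # 1-based rank of this p value in ascending order
        running_min = min(running_min, pvals[idx] * n / rank)
        adjusted[idx] = running_min
    return adjusted


genes = ["GENE_A", "GENE_B", "GENE_C", "GENE_D"]  # hypothetical identifiers
pvals = [0.001, 0.02, 0.03, 0.5]
log2fc = [1.2, -0.8, 0.3, 2.0]

adj = bh_adjust(pvals)
# Keep genes passing both thresholds from the text: adjusted p < 0.05 and |log2FC| > 0.5.
degs = [g for g, q, fc in zip(genes, adj, log2fc) if q < 0.05 and abs(fc) > 0.5]
print(degs)
```

GENE_C is dropped because its fold change is too small and GENE_D because its adjusted p value is too large, so both criteria are needed to reproduce the filter.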

Functional enrichment analysis

Based on the Gene Ontology (GO) database and the Kyoto Encyclopedia of Genes and Genomes (KEGG) Pathway database, the “clusterProfiler” R package was used to perform functional enrichment analysis of the DEGs between pathological subtypes. Terms with a p value of less than 0.05 were considered significantly enriched.

Estimation of the immune cell infiltration

An immune-related gene set was obtained from the GeneCards database (Supplementary Table S1) and intersected with the DEGs. Single-sample gene set enrichment analysis (ssGSEA) was used to estimate the relative abundance of different immune cell types in each sample. The correlations between the scores of the immune cell types were then calculated, and differences in immune scores and immune checkpoint genes between subtypes were tested with the Wilcoxon test.

Identification of histopathologic DL-signature and gene‑signature

The “limma” R package was used to screen the DEGs between adenocarcinoma and mucinous adenocarcinoma in the TCGA-STAD cohort, and these DEGs were intersected with the immune-related DEGs identified between the DL-derived subtypes to select the candidate signature genes. Multivariate Cox modeling was used to create the gene signature, and a risk score was computed with the "ggrisk" package as a linear combination of the gene expression levels weighted by the regression coefficients (α) from the multivariate Cox regression model. Patients were divided into a high-risk group and a low-risk group according to the median risk score, and K-M survival curves were drawn to compare the survival of the two groups. Pathological features were derived from the DL features identified by the LASSO penalty model. A nomogram combining the DL signature, the gene signature and clinical characteristics was constructed to predict OS.
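
The risk score and the median split amount to simple arithmetic, sketched below. The coefficients, gene labels and expression values are invented for illustration, and assigning scores equal to the median to the low-risk group is an assumption, since the text does not say how ties are handled.

```python
from statistics import median
from typing import Dict

# Hypothetical Cox regression coefficients (alpha) for a two-gene signature.
coefficients: Dict[str, float] = {"gene_1": -0.4, "gene_2": 0.6}

# Hypothetical expression levels per patient.
expression: Dict[str, Dict[str, float]] = {
    "patient_1": {"gene_1": 2.0, "gene_2": 1.0},
    "patient_2": {"gene_1": 0.5, "gene_2": 3.0},
    "patient_3": {"gene_1": 1.0, "gene_2": 1.0},
    "patient_4": {"gene_1": 3.0, "gene_2": 0.5},
}


def risk_score(expr: Dict[str, float], coefs: Dict[str, float]) -> float:
    """Linear combination of expression levels weighted by the Cox coefficients."""
    return sum(coefs[gene] * expr[gene] for gene in coefs)


scores = {pid: risk_score(expr, coefficients) for pid, expr in expression.items()}
cutoff = median(scores.values())
# Scores strictly above the median are labelled high risk (tie rule is an assumption).
groups = {pid: ("high" if s > cutoff else "low") for pid, s in scores.items()}

for pid in expression:
    print(pid, round(scores[pid], 2), groups[pid])
```

A positive coefficient raises the score as expression increases and a negative one lowers it, so the sign of each α indicates whether the gene acts as a risk or a protective factor in the fitted model.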

Statistical analysis

All analyses were performed with R (version 4.3.1) or Python (version 3.7.12). The versions of the Python libraries and R packages used are in Supplementary Table S2. The Wilcoxon test was used to analyze the differences between the two groups. Correlations between variables were determined using Pearson's analysis. Survival analysis was conducted using the “survival” R package, and the log-rank test was performed with the “survdiff” function. All statistical tests were considered significant with p < 0.05.
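
The survival curves compared in this study rest on the Kaplan–Meier product-limit estimator, which can be sketched without any library. The analysis itself was run with the “survival” R package; the function below and its toy follow-up data are illustrative assumptions only.

```python
from typing import List, Sequence, Tuple


def kaplan_meier(times: Sequence[float], events: Sequence[int]) -> List[Tuple[float, float]]:
    """Return (time, survival probability) at each distinct event time.

    events[i] is 1 if the event (death) was observed and 0 if the patient was censored.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        removed = 0
        # Collect all patients whose follow-up ends at time t.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths:
            survival *= 1.0 - deaths / at_risk
            curve.append((t, survival))
        at_risk -= removed
    return curve


# Toy follow-up in months: three observed deaths and two censored patients.
follow_up = [5, 8, 8, 12, 20]
observed = [1, 1, 0, 1, 0]
curve = kaplan_meier(follow_up, observed)
for t, s in curve:
    print(f"t = {t}: S(t) = {s:.2f}")
```

Censored patients leave the risk set without lowering the curve, which is why the estimator can use patients with incomplete follow-up.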

Results

Performance of the histopathological classifier

A pathology-based DL model was developed in the TCGA-STAD cohort (split 8:1:1 into training, validation and test sets). Tenfold Monte Carlo cross-validation was used to evaluate the classification performance of CLAM in this diagnostic task. Each model performed well in recognizing adenocarcinoma and mucinous adenocarcinoma, with an average AUC of 0.90 (maximum 0.97, minimum 0.78) (Fig. 1A). The confusion matrix showed that adenocarcinoma was correctly recognized in 95.65% of cases and mucinous adenocarcinoma in 85.29% of cases (Fig. 1B). To visualize and interpret the relative importance of each region in the WSIs, we converted the attention scores of the model's predicted categories to percentiles, normalized them and mapped them to the original slides to generate attention heat maps. Two representative heat maps providing patch-level predictions for adenocarcinoma and mucinous adenocarcinoma, respectively, are shown in Fig. 1C-D.
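
The evaluation scheme, repeated random 8:1:1 splits with one AUC per split, can be sketched as follows. The splits here are not stratified by class and the seed, function names and toy scores are assumptions; the sketch shows the mechanics only and does not reproduce the reported AUC values.

```python
import random
from typing import Iterator, List, Sequence, Tuple


def monte_carlo_splits(
    n_items: int, n_repeats: int = 10, train_frac: float = 0.8, val_frac: float = 0.1, seed: int = 0
) -> Iterator[Tuple[List[int], List[int], List[int]]]:
    """Yield (train, validation, test) index lists from independent random shuffles."""
    rng = random.Random(seed)
    n_train = int(n_items * train_frac)
    n_val = int(n_items * val_frac)
    for _ in range(n_repeats):
        idx = list(range(n_items))
        rng.shuffle(idx)
        yield idx[:n_train], idx[n_train:n_train + n_val], idx[n_train + n_val:]


def auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the fraction of positive/negative pairs ranked correctly (ties count half)."""
    pos = [s for l, s in zip(labels, scores) if l == 1]
    neg = [s for l, s in zip(labels, scores) if l == 0]
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


splits = list(monte_carlo_splits(356))
train, val, test = splits[0]
print(len(splits), "splits of sizes", len(train), len(val), len(test))

# Toy labels (1 = mucinous adenocarcinoma) and predicted scores for four slides.
toy_auc = auc([1, 1, 0, 0], [0.9, 0.4, 0.5, 0.1])
print("toy AUC:", toy_auc)
```

With 356 slides this split rule gives 284 training, 35 validation and 37 test slides per repeat, and the reported average AUC corresponds to the mean over the ten repeats.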

External validation model

DL systems are prone to overfitting the data they are trained on, so we used H&E-stained sections from 80 STAD cases (59 adenocarcinomas, 21 mucinous adenocarcinomas) for external validation. It is worth noting that the external validation dataset and the TCGA-STAD cohort differ substantially in both patient ethnicity and slide preparation techniques. On this external cohort the model achieved an AUC of 0.78, suggesting that it retains a degree of generalization ability (Fig. 2A). The confusion matrix showed that adenocarcinoma was identified with a success rate of 71.19% and mucinous adenocarcinoma with a success rate of 85% (Fig. 2B). In addition, to further validate the reliability of the results, we invited pathologists to review the attention heat maps generated by CLAM. The attention heat maps of pathological tissue sections of adenocarcinomas and mucinous adenocarcinomas in the validation cohort are shown in Fig. 2C-D, with the red areas corresponding to tumor regions.

Unsupervised cluster of DL features

From the 1024 DL features, we identified 12 features by LASSO-penalized feature selection; their relative importance is shown in Fig. 3A-C. The 12 DL features were then used to identify the key clusters in the TCGA-STAD cohort. Unsupervised clustering (k = 2) identified two stable subtypes: cluster 1 (236 STAD patients) and cluster 2 (120 STAD patients). Cluster 1 contained 223 cases of adenocarcinoma and 13 cases of mucinous adenocarcinoma, while cluster 2 contained 99 cases of adenocarcinoma and 21 cases of mucinous adenocarcinoma. We then created a comprehensive heat map to show associations between subtypes and clinical features. The results showed that patients with mucinous adenocarcinoma were mainly concentrated in cluster 2 (Fig. 3D). Meanwhile, the K-M curve showed that the survival probability of STAD patients in cluster 2 was lower than that in cluster 1 (P < 0.043) (Fig. 3E), suggesting that the prognosis of mucinous adenocarcinoma is worse. In addition, the boxplots showed that the distribution of all 12 DL features differed significantly between cluster 1 and cluster 2 (Fig. 3F).
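
The cluster composition reported above can be checked with a few lines of arithmetic. The sketch uses only the counts given in the text; the variable names are arbitrary.

```python
# Counts per cluster as reported in the text.
clusters = {
    "cluster 1": {"adenocarcinoma": 223, "mucinous adenocarcinoma": 13},
    "cluster 2": {"adenocarcinoma": 99, "mucinous adenocarcinoma": 21},
}

cluster_sizes = {name: sum(counts.values()) for name, counts in clusters.items()}
subtype_totals = {
    subtype: sum(counts[subtype] for counts in clusters.values())
    for subtype in ("adenocarcinoma", "mucinous adenocarcinoma")
}

# Share of mucinous adenocarcinoma within each cluster.
mucinous_share = {
    name: counts["mucinous adenocarcinoma"] / cluster_sizes[name]
    for name, counts in clusters.items()
}
# Share of all mucinous cases that fall in cluster 2.
mucinous_in_cluster_2 = (
    clusters["cluster 2"]["mucinous adenocarcinoma"] / subtype_totals["mucinous adenocarcinoma"]
)

print(cluster_sizes, subtype_totals)
print({name: f"{share:.1%}" for name, share in mucinous_share.items()})
print(f"{mucinous_in_cluster_2:.1%} of mucinous cases are in cluster 2")
```

The counts are consistent with the cohort description (322 adenocarcinomas and 34 mucinous adenocarcinomas, 356 in total). Mucinous cases make up 17.5% of cluster 2 against about 5.5% of cluster 1, and 21 of the 34 mucinous cases (about 62%) fall in cluster 2.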

Functional enrichment analysis of DEGs in different histopathological subtypes

We screened 1287 DEGs between cluster 1 and cluster 2 in TCGA-STAD using the R package “limma” (P < 0.05, |log2FC| > 0.5) (Fig. 4A). A total of 145 genes were further selected by LASSO regression with tenfold cross-validation (Fig. 4B-C). GO and KEGG analyses were conducted on the 145 DEGs to identify the biological functions and signaling pathways that might play an important role in STAD. The results showed that the DEGs were mainly enriched in pathways such as protein digestion and absorption and enteric nervous system development (Fig. 4D-E).

Landscape of immune characteristics in histopathological subtypes

The immune-related gene set obtained from GeneCards was intersected with the 145 genes (Supplementary Table S3), yielding 10 up-regulated immune-related genes (GCG, HLA-DRB5, UCN3, EDN2, PI3, SST, MAPT, BMP3, CMA1, LCN6) and 9 down-regulated immune-related genes (WFIKKN1, QRFP, TAFA1, TRHR, IL13, MIA, INSL4, PLA2G2A, IL9) (Fig. 5A-B, Supplementary Table S4). The Gene Set Enrichment Analysis (GSEA) cellular immunity database was used to evaluate the level of immune cell infiltration in each sample from the gene expression values, and the Wilcoxon test was used to test the difference in immune cell infiltration between cluster 1 and cluster 2. The results showed significant differences between the two clusters in the infiltration abundance of activated CD4 T cells, CD56dim natural killer cells, activated CD8 T cells, memory B cells, and type 2 T helper cells (P < 0.05) (Fig. 5C). Further analysis revealed that the expression of the immune-related DEGs in STAD patients showed a strong positive correlation with the abundance of macrophages and mast cells (Fig. 5D-E).

Construction of histopathologic DL-signature and gene-signature

To construct the histopathological gene signature of STAD, we first classified patients in the TCGA-STAD cohort into two subtypes, mucinous adenocarcinoma and adenocarcinoma, and obtained 232 up-regulated genes and 1329 down-regulated genes by differential analysis (P < 0.05, |log2FC| > 1). These genes were intersected with the 19 immune-related genes obtained from the pathological feature analysis, yielding one up-regulated gene, BMP3, and one down-regulated gene, MIA (Fig. 6A-B). Multivariate Cox analysis was used to construct the gene signature, and the distribution of risk scores, survival status, and relative gene scores for patients in the TCGA-STAD cohort were displayed as heat maps (Fig. 6C). K-M survival curves for the high-risk and low-risk groups showed that the survival rate of the high-risk group was significantly lower than that of the low-risk group (P < 0.001) (Fig. 6D). Meanwhile, we constructed a DL-signature based on the relative scores of the 12 DL features in the TCGA-STAD cohort (Fig. 6E). The K-M survival curve showed that the high-risk group had a worse prognosis (P < 0.001) (Fig. 6F). Both the gene and the pathological risk scores had a significant effect on survival. In addition, the forest plot showed that the pathological and gene feature models we constructed were superior to the traditional clinical features (Fig. 6G). Multivariate Cox regression showed that pathological features could be used as independent prognostic predictors of STAD. To provide a comprehensive and accurate approach to prognostic prediction, a nomogram was created using the histopathological DL-signature, the gene-signature, and clinical variables of patients from the TCGA-STAD cohort (Fig. 6H-J). The nomogram can predict 3-year and 5-year OS, which improves the practical value of the histopathology-related features.

Discussion

Digital pathology can provide valuable information for clinical decision-making and help pathologists to classify histopathological images [24, 25]. Importantly, DL applied to histopathological images has also shown good performance in predicting tumor prognosis [26, 27]. In this study, we trained a DL model to classify STAD histopathology sections, and the results show that the model performs well in identifying histopathological subtypes (mucinous adenocarcinoma, adenocarcinoma). The AUC of the model reached 0.90, with high accuracy in the identification of both adenocarcinoma and mucinous adenocarcinoma, suggesting that DL algorithms are an effective means of assisting pathologists in determining pathological classification. To further test this approach, we evaluated the model in an external validation set. The AUC in the validation cohort was 0.78, indicating that the model has a degree of generalization ability, which supports the application of the DL model to STAD pathological classification. Notably, although the results in the validation cohort are of considerable value, there remains a clear gap compared with the results in the training cohort. We speculate that this gap may stem from two causes. First, the training cohort was in svs format, while the validation cohort was in ndpi format. Different formats may employ distinct compression algorithms, color spaces, or image qualities, which could affect the accuracy of feature extraction and hence model performance. Second, the differing image sizes in the training and validation cohorts could also contribute to the performance gap.

We divided the TCGA-STAD cohort into two subtypes based on DL features, cluster 1 and cluster 2. The prognosis of the two subtypes differed significantly, suggesting that the classification is valid and may be useful for assessing the clinical relevance of these subtypes to treatment responsiveness. We also found that mucinous adenocarcinoma was mainly concentrated in cluster 2 and had a poor prognosis, which is consistent with previous reports [28]. In addition, we found that DEGs were mainly enriched in signaling pathways such as protein digestion and absorption and enteric nervous system development. Changes in proteolytic enzyme activity and in the expression of transporters involved in protein absorption can affect the integrity of the gastric epithelium and the immune response, thereby promoting tumor growth and progression. It is noteworthy that a high degree of immune infiltration is present in cluster 2. CD4+/CD8+ T cells have been reported to partially reflect the infiltration of lymphocytes in gastric cancer tissues, predict the response to immunotherapy to a certain extent, and ultimately affect the tumor progression and survival of gastric cancer [29,30,31]. CD8+ T cells were associated with improved OS in patients with gastric cancer [32], while high infiltration of CD4+ T cells was correlated with worse OS [33].

Increasingly, researchers are recognizing the cellular properties of the tumor microenvironment (TME), particularly those of immune cells. The tumor immune microenvironment (TIME) plays a crucial role in tumor progression, invasion, metastasis, immune evasion, and treatment resistance [34, 35]. The stomach has strongly acidic conditions and a unique endocrine system, which makes the TIME of STAD distinctive. Tumors use diverse mechanisms to evade immune surveillance [36], including enhancing negative immunomodulatory processes and altering antigen presentation. Populations of immune cells, including tumor-associated macrophages, lymphocytes, tumor-associated neutrophils, T cells, and natural killer cells, play key roles in STAD. Therefore, a better understanding of the TIME is essential for identifying new targets and improving the clinical efficacy of STAD treatment. The immune-related genes and DEGs were intersected to construct a gene signature. The gene signature contains two genes (BMP3, MIA), of which BMP3 has been reported to inhibit the proliferation of STAD by regulating the cell cycle [37], while the mechanism of action of MIA in STAD is not clear. Meanwhile, a DL signature was constructed based on DL features. The risk scores of both the gene signature and the DL signature were significantly associated with survival, and multivariate Cox regression further showed that the DL-signature could serve as an independent prognostic factor. Finally, the nomogram combining the DL-signature and the gene signature provides a reference for clinical diagnosis.

Although our study made good progress, some limitations should be noted. First, although the DL model was trained and externally validated, the validation cohort was small, and larger samples and specific patient cohorts are needed to evaluate the generalizability of the model in clinical diagnosis. Second, the model cannot completely replace pathological classification by pathologists, who usually also need to take clinical factors into account. In addition, although our study combines transcriptomic analysis to enhance the interpretability of DL-based histopathological classification, the analysis can be deepened, and prognosis prediction for STAD patients could be further improved in the future by incorporating radiomics data.

Conclusions

In summary, we created a DL model for histopathological typing and demonstrated that the model recognizes pathological types with high accuracy. This can assist pathologists in making clinical diagnoses. In addition, a nomogram was built by combining the DL-signature, the gene-signature and clinical features, which can be used as a prognostic classifier for clinical decision-making, individual prognosis and treatment.

Data Availability

The WSIs, clinical characteristics and mRNA sequencing data of STAD patients were obtained from the TCGA database (https://portal.gdc.cancer.gov/). The validation data set used in developing the deep-learning model is not publicly available because of patient privacy and health information restrictions.

References

Smyth EC, Nilsson M, Grabsch HI, van Grieken NC, Lordick F (2020) Gastric cancer. Lancet 396(10251):635–648. https://doi.org/10.1016/s0140-6736(20)31288-5
Article CAS PubMed Google Scholar
Comprehensive molecular characterization of gastric adenocarcinoma. Nature. Sep 11 2014;513(7517):202-9. https://doi.org/10.1038/nature13480
Sung H, Ferlay J, Siegel RL et al (2021) Global Cancer Statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 71(3):209–249. https://doi.org/10.3322/caac.21660
Article CAS PubMed Google Scholar
Shafabakhsh R, Yousefi B, Asemi Z, Nikfar B, Mansournia MA, Hallajzadeh J (2020) Chitosan: a compound for drug delivery system in gastric cancer-a review. Carbohydr Polym 242:116403. https://doi.org/10.1016/j.carbpol.2020.116403
Article CAS PubMed Google Scholar
Siegel RL, Miller KD, Jemal A (2015) Cancer statistics, 2015. CA Cancer J Clin Jan-Feb 65(1):5–29. https://doi.org/10.3322/caac.21254
Article Google Scholar
Chen H, Carrot-Zhang J, Zhao Y et al (2019) Genomic and immune profiling of pre-invasive lung adenocarcinoma. Nat Commun 10(1):5472. https://doi.org/10.1038/s41467-019-13460-3
Article CAS PubMed PubMed Central Google Scholar
Xiao Y, Yu D (2021) Tumor microenvironment as a therapeutic target in cancer. Pharmacol Ther 221:107753. https://doi.org/10.1016/j.pharmthera.2020.107753
Article CAS PubMed Google Scholar
Lu Z, Zhan X, Wu Y et al (2021) BrcaSeg: a deep learning approach for tissue quantification and genomic correlations of histopathological images. Genom Proteom Bioinform 19(6):1032–1042. https://doi.org/10.1016/j.gpb.2020.06.026
Article Google Scholar
Luo X, Zang X, Yang L et al (2017) Comprehensive computational pathological image analysis predicts lung cancer prognosis. J Thorac Oncol 12(3):501–509. https://doi.org/10.1016/j.jtho.2016.10.017
Article PubMed Google Scholar
Ji MY, Yuan L, Jiang XD et al (2019) Nuclear shape, architecture and orientation features from H&E images are able to predict recurrence in node-negative gastric adenocarcinoma. J Transl Med 17(1):92. https://doi.org/10.1186/s12967-019-1839-x
Article PubMed PubMed Central Google Scholar
Metter DM, Colgan TJ, Leung ST, Timmons CF, Park JY (2019) Trends in the US and Canadian pathologist workforces from 2007 to 2017. JAMA Netw Open 2(5):e194337. https://doi.org/10.1001/jamanetworkopen.2019.4337
Article PubMed PubMed Central Google Scholar
Abu Haeyeh Y, Ghazal M, El-Baz A, Talaat IM (2022) Development and evaluation of a novel deep-learning-based framework for the classification of renal histopathology images. Bioengineering (Basel). https://doi.org/10.3390/bioengineering9090423
Article PubMed Google Scholar
Brendel M, Getseva V, Assaad MA et al (2022) Weakly-supervised tumor purity prediction from frozen H&E stained slides. EBioMedicine 80:104067. https://doi.org/10.1016/j.ebiom.2022.104067
Article CAS PubMed PubMed Central Google Scholar
Kather JN, Heij LR, Grabsch HI et al (2020) Pan-cancer image-based detection of clinically actionable genetic alterations. Nat Cancer 1(8):789–799. https://doi.org/10.1038/s43018-020-0087-6
Lu MY, Williamson DFK, Chen TY, Chen RJ, Barbieri M, Mahmood F (2021) Data-efficient and weakly supervised computational pathology on whole-slide images. Nat Biomed Eng 5(6):555–570. https://doi.org/10.1038/s41551-020-00682-w
Nero C, Boldrini L, Lenkowicz J et al (2022) Deep-learning to predict BRCA mutation and survival from digital H&E slides of epithelial ovarian cancer. Int J Mol Sci. https://doi.org/10.3390/ijms231911326
Wang CW, Muzakky H, Lee YC, Lin YJ, Chao TK (2023) Annotation-free deep learning-based prediction of thyroid molecular cancer biomarker BRAF (V600E) from cytological slides. Int J Mol Sci. https://doi.org/10.3390/ijms24032521
Sharma H, Zerbe N, Klempert I, Hellwich O, Hufnagl P (2017) Deep convolutional neural networks for automatic classification of gastric carcinoma using whole slide images in digital histopathology. Comput Med Imaging Graph 61:2–13. https://doi.org/10.1016/j.compmedimag.2017.06.001
Yoshida H, Shimazu T, Kiyuna T et al (2018) Automated histological classification of whole-slide images of gastric biopsy specimens. Gastric Cancer 21(2):249–257. https://doi.org/10.1007/s10120-017-0731-8
Kather JN, Pearson AT, Halama N et al (2019) Deep learning can predict microsatellite instability directly from histology in gastrointestinal cancer. Nat Med 25(7):1054–1056. https://doi.org/10.1038/s41591-019-0462-y
Coudray N, Ocampo PS, Sakellaropoulos T et al (2018) Classification and mutation prediction from non-small cell lung cancer histopathology images using deep learning. Nat Med 24(10):1559–1567. https://doi.org/10.1038/s41591-018-0177-5
Iizuka O, Kanavati F, Kato K, Rambeau M, Arihiro K, Tsuneki M (2020) Deep learning models for histopathological classification of gastric and colonic epithelial tumours. Sci Rep 10(1):1504. https://doi.org/10.1038/s41598-020-58467-9
Huang B, Tian S, Zhan N et al (2021) Accurate diagnosis and prognosis prediction of gastric cancer using deep learning on digital pathological images: a retrospective multicentre study. EBioMedicine 73:103631. https://doi.org/10.1016/j.ebiom.2021.103631
Schaumberg AJ, Juarez-Nicanor WC, Choudhury SJ et al (2020) Interpretable multimodal deep learning for real-time pan-tissue pan-disease pathology search on social media. Mod Pathol 33(11):2169–2185. https://doi.org/10.1038/s41379-020-0540-1
Hekler A, Utikal JS, Enk AH et al (2019) Pathologist-level classification of histopathological melanoma images with deep neural networks. Eur J Cancer 115:79–83. https://doi.org/10.1016/j.ejca.2019.04.021
Liu X, Zhang D, Liu Z et al (2021) Deep learning radiomics-based prediction of distant metastasis in patients with locally advanced rectal cancer after neoadjuvant chemoradiotherapy: a multicentre study. EBioMedicine 69:103442. https://doi.org/10.1016/j.ebiom.2021.103442
Liu S, Sun W, Yang S et al (2022) Deep learning radiomic nomogram to predict recurrence in soft tissue sarcoma: a multi-institutional study. Eur Radiol 32(2):793–805. https://doi.org/10.1007/s00330-021-08221-0
Rokutan H, Hosoda F, Hama N et al (2016) Comprehensive mutation profiling of mucinous gastric carcinoma. J Pathol 240(2):137–148. https://doi.org/10.1002/path.4761
Li F, Sun Y, Huang J, Xu W, Liu J, Yuan Z (2019) CD4/CD8+ T cells, DC subsets, Foxp3, and IDO expression are predictive indictors of gastric cancer prognosis. Cancer Med 8(17):7330–7344. https://doi.org/10.1002/cam4.2596
Zurlo IV, Schino M, Strippoli A et al (2022) Predictive value of NLR, TILs (CD4+/CD8+) and PD-L1 expression for prognosis and response to preoperative chemotherapy in gastric cancer. Cancer Immunol Immunother 71(1):45–55. https://doi.org/10.1007/s00262-021-02960-1
Xu S, Zhu Q, Wu L et al (2023) Association of the CD4(+)/CD8(+) ratio with response to PD-1 inhibitor-based combination therapy and dermatological toxicities in patients with advanced gastric and esophageal cancer. Int Immunopharmacol 123:110642. https://doi.org/10.1016/j.intimp.2023.110642
Choo J, Kua LF, Soe MY et al (2023) Clinical relevance of PD-1 positive CD8 T-cells in gastric cancer. Gastric Cancer 26(3):393–404. https://doi.org/10.1007/s10120-023-01364-7
You Q, Fang T, Yin X et al (2021) Serum CD4 is associated with the infiltration of CD4(+)T cells in the tumor microenvironment of gastric cancer. J Immunol Res 2021:6539702. https://doi.org/10.1155/2021/6539702
de Visser KE, Joyce JA (2023) The evolving tumor microenvironment: from cancer initiation to metastatic outgrowth. Cancer Cell 41(3):374–403. https://doi.org/10.1016/j.ccell.2023.02.016
Kumar V, Ramnarayanan K, Sundar R et al (2022) Single-cell atlas of lineage states, tumor microenvironment, and subtype-specific expression programs in gastric cancer. Cancer Discov 12(3):670–691. https://doi.org/10.1158/2159-8290.Cd-21-0683
Fan X, Jin J, Yan L, Liu L, Li Q, Xu Y (2020) The impaired anti-tumoral effect of immune surveillance cells in the immune microenvironment of gastric cancer. Clin Immunol 219:108551. https://doi.org/10.1016/j.clim.2020.108551
Sun Z, Liu C, Jiang WG, Ye L (2020) Deregulated bone morphogenetic proteins and their receptors are associated with disease progression of gastric cancer. Comput Struct Biotechnol J 18:177–188. https://doi.org/10.1016/j.csbj.2019.12.014

Funding

No funding was received.

Author information

Authors and Affiliations

Department of Ultrasound Imaging, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430101, Hubei, China
Zhihui Wang
Department of Oncology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430101, Hubei, China
Hui Peng & Anping Song
Department of Pathology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430101, Hubei, China
Jie Wan
Department of Oncology, Tongji Hospital Sino-French New City Branch, Caidian District, No.288 Xintian Avenue, Wuhan, 430101, Hubei, China
Anping Song

Contributions

Zhihui Wang: data analysis and writing of the original draft. Hui Peng: data analysis, software, and methodology. Jie Wan: data analysis and review of results. Anping Song: conception of the study, review and editing of the manuscript, and supervision. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Anping Song.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Ethics approval and consent to participate

The data in this study were obtained from the public TCGA database, and all methods were carried out in accordance with relevant guidelines and regulations. External validation data were obtained from Shanghai Zhuoli Biotech Company (Shanghai, China), and ethical approval was obtained from the ethics committee of Tongxu County People's Hospital, Henan, China (approval number: ZL-XP201402). All patients provided written informed consent.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (XLSX 174 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Wang, Z., Peng, H., Wan, J. et al. Identification of histopathological classification and establishment of prognostic indicators of gastric adenocarcinoma based on deep learning algorithm. Med Mol Morphol (2024). https://doi.org/10.1007/s00795-024-00399-8

Received: 11 March 2024
Accepted: 15 July 2024
Published: 01 August 2024
DOI: https://doi.org/10.1007/s00795-024-00399-8
