Exploring TCGA database for identification of potential prognostic genes in stomach adenocarcinoma

Zhou, Lin; Huang, Wei; Yu, He-Fen; Feng, Ya-Juan; Teng, Xu

doi:10.1186/s12935-020-01351-3

Exploring TCGA database for identification of potential prognostic genes in stomach adenocarcinoma

Primary research
Open access
Published: 23 June 2020

Volume 20, article number 264, (2020)
Cite this article

Download PDF

You have full access to this open access article

Cancer Cell International Aims and scope Submit manuscript

Exploring TCGA database for identification of potential prognostic genes in stomach adenocarcinoma

Download PDF

Lin Zhou¹,
Wei Huang²,
He-Fen Yu²,
Ya-Juan Feng¹ &
…
Xu Teng ORCID: orcid.org/0000-0003-3555-5834²

7762 Accesses
24 Citations
1 Altmetric
Explore all metrics

Abstract

Background

Stomach adenocarcinoma (STAD) is the fifth most prevalent cancer in the world and ranks third among cancer-related deaths worldwide. The tumour microenvironment (TME) plays an important role in tumorigenesis, development, and metastasis. Hence, we calculated the immune and stromal scores to find the potential prognosis-related genes in STAD using bioinformatics analysis.

Methods

The ESTIMATE algorithm was used to calculate the immune/stromal scores of the STAD samples. Functional enrichment analysis, protein–protein interaction (PPI) network analysis, and overall survival analysis were then performed on differential genes. And we validated these genes using data from the Gene Expression Omnibus database. Finally, we used the Human Protein Atlas (HPA) databases to verify these genes at the protein levels by IHC.

Results

Data analysis revealed correlation between stromal/immune scores and the TNM staging system. The top 10 core genes extracted from the PPI network, and primarily involved in immune responses, extracellular matrix, and cell adhesion. There are 31 genes have been validated with poor prognosis and 16 genes were upregulated in tumour tissues compared with normal tissues at the protein level.

Conclusions

In summary, we identified genes associated with the tumour microenvironment with prognostic implications in STAD, which may become potential therapeutic markers leading to better clinical outcomes.

Identification of significant genes associated with prognosis of gastric cancer by bioinformatics analysis

Article Open access 26 December 2022

Prediction and therapeutic targeting of the tumor microenvironment-associated gene CTSK in gastric cancer

Article Open access 06 November 2023

Identification of tumor microenvironment-related prognostic genes in colorectal cancer based on bioinformatic methods

Article Open access 22 July 2021

Background

Stomach adenocarcinoma (STAD) is ranked as the fifth-most commonly diagnosed cancer and the third leading cause of cancer-related deaths worldwide as per the statistics of GLOBOCAN 2018 [1], with an estimated 679,100 new cases and 498,000 deaths occurring in China in 2015 [2]. Around 90–95% of all stomach cancers are adenocarcinoma. They are subdivided into cardia and non-cardia gastric cancers, respectively, based on whether the tumour is located near the gastro-oesophageal junction (cardia) or away from it [3]. Worldwide, the incidence rate of STAD is the highest in Asia, and among the Asian countries, China has the maximum incidence rate of STAD, accounting for 49.9% of global STAD cases [4]. In European countries, the 5-year survival rate varies from ~ 10 to 30% [5], and in China, from 30.2 to 35.9% [6]. To better understand the impact on tumour genetic composition of clinical outcomes, genome-wide gene expression repertoires, such as The Cancer Genome Atlas (TCGA) have been established to explore and discover large cohorts around the world [7]. Although extensive research has been conducted on the mechanism of the occurrence and development of STAD, the aetiology and pathogenesis of STAD still remain to be elucidated [8]. Hence, considering the high morbidity and mortality of STAD, it is essential to explore molecular markers that have a prognostic value of influencing the immune response from STAD patients.

The cells within the tumour microenvironment (TME) are an important component of the tumour tissue. An increasing number of evidences has elucidated the clinic pathological significance of TME in the prediction of treatment effects [9, 10]. The TME is the cellular milieu where the tumour is located. It consists of immune cells, mesenchymal cells, endothelial cells, along with inflammatory mediators and extracellular matrix (ECM) molecules [11, 12]. The cells and molecules in the TME are in a dynamic process, reflecting the evolutionary nature of cancer and jointly promoting immune escape, growth, and metastasis of tumours [13, 14]. Immune cells and stromal cells are the two main types of non-tumour components and considered to be of great value in the diagnosis and prognosis of tumours [7]. Therefore, understanding the molecular composition and function of TME are essential for the effective management of cancer progression and immune response in STAD. With the advent of the era of big data biology, bioinformatics analysis of large amounts of data has been made possible through a combination of biology, computer science, and information technology [15]. Its rapid development provides researchers with a more user-friendly and convenient platform to guide the implementation of basic experiments [16]. In 2013, Yoshihara et al. designed an algorithm called ESTIMATE to estimate stromal cells and immune cells in malignant tumour tissues with expression data. In this algorithm, the authors obtained immune and stromal scores to predict the TME by calculating the expression characteristics of specific molecular biomarkers in immune and stromal cells [17]. In addition, they used estimate scores to comprehensively evaluate the immune and stromal scores. In recent years, the ESTIMATE algorithm has been reported to be applied to glioblastoma [7], clear cell renal cell carcinoma [18], and colon cancer [19], thereby demonstrating the validity of this big-data based algorithm. However, there is no detailed analysis of the immune, stromal, and estimate scores of stomach adenocarcinoma.

In this study, the TME-related genes were obtained from the stomach adenocarcinoma datasets in TCGA database and the ESTIMATE algorithm was used to analyse the corresponding immune/stromal/estimate scores. Some core genes were obtained through the analysis of functional annotations and gene networks. Finally, the STAD dataset from the Gene Expression Omnibus (GEO) database was used to validate the acquired core genes, revealing their potential roles in the treatment of STAD.

Methods

Raw data

RNA-seq data for STAD patients were downloaded from TCGA database (https://tcga-data.nci.nih.gov/tcga/), the gene expression profile was measured experimentally using the Illumina HiSeq2000 RNA Sequencing platform by the University of North Carolina TCGA genome characterization centre. Clinical data such as age, TNM staging, gender, survival-time, and status were also downloaded from the cBioportal website (http://www.cbioportal.org/). We calculated the stromal/immune/estimate scores of the samples using the ESTIMATE algorithm (https://r-forge.r-project.org). The GSE84433 dataset from the GEO database was used for validation. The Human Protein Atlas (http://www.proteinatlas.org) was used to validate the immunohistochemistry of genes with prognostic values. Direct links to the immunohistochemistry images from the Human Protein Atlas are provided in the following:

Gene	Normal	Tumor
BCHE	https://www.proteinatlas.org/ENSG00000114200-BCHE/tissue/stomach#img	https://www.proteinatlas.org/ENSG00000114200-BCHE/pathology/stomach+cancer#img
CNN1	https://www.proteinatlas.org/ENSG00000130176-CNN1/tissue/stomach#img	https://www.proteinatlas.org/ENSG00000130176-CNN1/pathology/stomach+cancer#img
CPED1	https://www.proteinatlas.org/ENSG00000106034-CPED1/tissue/stomach#img	https://www.proteinatlas.org/ENSG00000106034-CPED1/pathology/stomach+cancer#img
CYP1B1	https://www.proteinatlas.org/ENSG00000138061-CYP1B1/tissue/stomach#img	https://www.proteinatlas.org/ENSG00000138061-CYP1B1/pathology/stomach+cancer#img
SELP	https://www.proteinatlas.org/ENSG00000174175-SELP/tissue/stomach#img	https://www.proteinatlas.org/ENSG00000174175-SELP/pathology/stomach+cancer#img
VIP	https://www.proteinatlas.org/ENSG00000146469-VIP/tissue/stomach#img	https://www.proteinatlas.org/ENSG00000146469-VIP/pathology/stomach+cancer#img

Differential expression analysis

Differential expression analysis was performed on the count matrix of the sample using the R package, DESeq2. The screening conditions for the differential genes were: Fold Change > | ±1.5|, adjusted p-values (padj) < 0.05. Heat maps of differential genes were drawn using the R-package, pheatmap.

Enrichment analysis and PPI network

The Database for Annotation, Visualization and Integrated Discovery (DAVID) tool [20] was used to perform functional enrichment analysis of the differentially expressed genes (DEGs), the corresponding biological processes (BP), cell components (CC), and molecular functions (MF) were identified using Gene Ontology (GO) and the signalling pathways involved were identified using the Kyoto Encyclopedia of Genes and Genomes (KEGG). The protein–protein interaction (PPI) network was constructed using the Search Tool for the Retrieval of Interacting Genes (STRING) database [21], and the core genes were identified using the CytoHubba plug-in in Cytoscape software [22]. Module analysis for the detection of interaction networks was performed using the Molecular Complex Detection (MCODE) plug-in in the Cytoscape platform.

Survival analysis

The survival curve is shown using the Kaplan–Meier curve, which is drawn using the R packages survival and survminer. This relationship was verified using a log-rank test. These analyses illustrate the relationship between differential genes and overall patient survival.

Results

Stromal and immune scores are associated with the TNM staging system and survival prognosis

We downloaded the RNA-seq gene expression matrix and clinical information data from 380 patients with STAD from TCGA database. Of all the samples, 63.9% were male and 36.1% were female, and 62.9% were white, 19.7% Asian, 14.2% were not reported, while others were black or African–American. The proportion of patients with T1–T2 and T3–T4 was 27.7% (n = 108) and 72.3% (n = 272), respectively. The proportion of patients with N0–N1 and N2–N3 was 58.4% (n = 221) and 41.6% (n = 159), respectively. Patients with M0 and M1 reached 92.7% (n = 343) and 7.3% (n = 37), respectively. The stromal scores and immune scores of all samples were obtained by ESTIMATE algorithm, and the score ranges were − 1832.01 to 2038.29 and − 1541.7 to 2619.69, respectively. From the aspect of tumour infiltration depth (T), the median stromal and immune scores in T1 stage are the lowest, and the order for stromal median score from T2 to T4 is: T2 < T3 < T4, order of immune scores is: T3 < T2 < T4, but the difference between them is not obvious (Fig. 1a). From the aspect of lymph node staging (N), the relationships between the median stromal and immune scores of the four stages were similar (without statistical significance) namely: N3 > N1 > N2 > N0, N1 > N3 > N2 > N0 (Fig. 1b). Finally, in terms of distant metastasis (M), the stromal and immune median scores are in the same order: M1 > M0, without statistical significance (Fig. 1c). From these data, it can be seen that there is an intense correlation between stromal/immune/estimate scores and the TNM staging system.

To analyse the potential relationship between the stromal/immune/estimate scores and the overall survival of the samples, we divided all samples into high and low score groups based on the positive/negative stromal/immune/estimate scores. The Kaplan–Meier survival curve showed that the high score group of the stromal scores has a lower survival rate than the low score group (Fig. 1d, p = 0.0032 in log-rank test). Similar phenomena were observed in the high and low score groups of the immune/estimate scores (Fig. 1e, p = 0.0505 in log-rank test, Fig. 1f, p = 0.0359 in log-rank).

Differential expression and enrichment analysis of STAD cases based on stromal and immune scores

To reveal the relationship between the stromal and/or immune scores and the gene expression profile of the samples, we performed differential analysis of all RNA-seq data from 380 STAD cases in TCGA database. The heat map of the high/low scores of the stromal/immune scores revealed differential gene expression profiles between the samples, in which 772 up-regulated genes and 211 down-regulated genes (fold change > |± 1.5|, padj < 0.05) were obtained based on the difference in stromal scores, simultaneously, 1182 up-regulated genes and 434 down-regulated genes (fold change > |± 1.5|, padj < 0.05) were obtained based on the differential analysis of immune scores (Fig. 2a). As can be seen from the Venn diagram (Fig. 2b), there are 245 identical up-regulated genes and 103 identical down-regulated genes (Additional file 1: Table S1).

We performed functional enrichment analysis on the obtained 348 differential genes (245 up-regulated genes and 103 down-regulated genes), including GO: BP, GO: CC, GO: MF, and KEGG pathway analysis. Sorting by − Log10 (p-value), we list the top 10 terms of each section. GO functions are mainly enriched in inflammatory and immune responses, extracellular matrices, and heparin binding (Fig. 2c–e), while KEGG pathways are mainly enriched in neuroactive ligand-receptor interaction and insulin secretion (Fig. 2f).

Survival analysis of differential genes

To analyse the potential role of differential genes in the overall survival of STAD patients, we downloaded STAD clinical data and gene expression data from the cBioportal website. Among all the differential genes, the high expression of 82 genes was associated with poor overall survival, and low expression of 9 genes (Fig. 3, p < 0.05) showed good overall survival (Additional file 2: Table S2).

PPI network analysis of genes with prognostic values

To analyse the interrelationship between genes with prognostic value, we used the STRING network tool to construct a PPI network of genes with prognostic values. Core genetic analysis of the PPI network was performed using the CytoHubba plugin in Cytoscape software. The top 10 core genes extracted from the PPI network are: CADM3, CARTPT, KCNA1, ADCYAP1R1, GPR88, SPARCL1, GFRA2, VIP, ACKR1, and MYL9, the core score is up to 47,964 (Additional file 3: Table S3), the module contains 62 nodes and 317 edges (Fig. 4a). At the same time, we used Cytoscape’s MCODE plug-into perform a modular analysis of the differential genes. CARPTT and SYNPO2 modules were identified through module analysis. The CARTPT module contains 17 points and 66 edges, while the SYNPO2 module contains 9 points and 21 edges. In the CARTPT module, CARTPT, KCNA1, and SPARCL1 have higher degree values, while in the SYNPO2 module, SYNPO2 and FLNC have higher degree values (Fig. 4b).

We performed functional enrichment analysis on the genes mined by the PPI network module. There are four terms of the biological process: regulation of heart/muscle contraction, regulation of ryanodine-sensitive calcium-release channel activity, neuronal action potential, G-protein coupled receptor signalling pathway (Fig. 4c). There are six terms of the cellular component: Z disc, actin cytoskeleton, cytoskeleton, fascia adherents, focal adhesion, and plasma membrane (Fig. 4d). The molecular function contains seven terms: cytoskeletal protein binding, receptor activity, muscle alpha-actinin binding, protein binding, neuropeptide hormone activity, structural constituent of muscle, and actin binding (Fig. 4e). There are four terms of the KEGG pathway: cAMP signalling pathway, focal adhesion, arrhythmogenic right ventricular cardiomyopathy (ARVC), and dilated cardiomyopathy (Fig. 4f).

Using the GEO database to verify genes with prognostic values

To reveal whether the differential genes from TCGA database have an equal prognostic value in other STAD cases, we downloaded the GSE84433 expression dataset and clinical data from the GEO database, which contained 357 samples. A total of 31 genes (Additional file 4: Table S4) with high expression and poor prognosis were verified (Fig. 5), and 16 genes have not been reported to be associated with a poor prognosis for STAD (Table 1). These genes may be potential genes for poor prognosis of STAD and may provide some reference value for the treatment of STAD in the future.

Table 1 Genes that influence the overall survival of STAD in both TCGA and GEO

Full size table

Prognostic gene validation using clinical tissue samples

To further confirm the reliability of the obtained genes with prognostic values, we used IHC to detect the protein expression of 31 genes in normal tissues and tumour tissues. The results showed that compared with normal tissues, sixteen proteins (ANGPTL1, BCHE, BNC2, CHRNA3, CNN1, CPED1, CYP1B1, FHL1, MYL9, PDLIM3, PRG4, RNF150, SELP, SPARCL1, SYNC, VIP) were significantly overexpressed in tumour tissues (Fig. 6, p < 0.05). The IHC map of the other 10 genes is shown in Additional file 5: Figure S1.

Discussion

In this paper, the ESTIMATE algorithm [17] was used to calculate the stromal/immune scores of RNA-seq data of STAD in TCGA database. Subsequently, the STAD samples were divided into high and low score groups according to the stromal/immune scores. Through differential and downstream analysis, 82 high expression genes related to poor prognosis and 9 low expression genes related to good prognosis were obtained. Then, 31 of the above genes were verified by the data in the GEO database, and 15 genes have been reported to be linked to the overall survival in STAD.

First, we obtained 245 up-regulated genes and 103 down-regulated genes by analysing the differences between high- and low-score group samples. Following functional enrichment analysis of the up- and down-regulated genes, it was found that many genes are associated with the TME, which is consistent with previous studies reporting the important roles of immune cells and stromal cells in the TME [38,39,40,41]. A total of 10 core genes were extracted from the PPI network, and a module analysis was performed. The functional enrichment analysis of these genes revealed them to be mainly related to the immune and inflammatory response.

Next, we performed survival analysis on the differential genes. High expression of 91 genes was linked to poor overall survival. The above genes were validated using data from stomach adenocarcinoma patients in the GEO database, and 31 genes with poor prognosis were obtained, which could be used as potential biomarkers for future treatment of STAD. Combined with the PPI network, we focused on the two genes CARTPT and SPARCL1, which have the highest degree value. The full name of CARTPT is CART prepropeptide, and this gene encodes a preproprotein that is proteolytically processed to generate multiple biologically active peptides. These peptides play a role in appetite, energy balance, maintenance of body weight, reward and addiction, and the stress response. Expression of a similar gene transcript in rodents is up-regulated following administration of cocaine and amphetamine. Mutations in this gene are associated with susceptibility to obesity in humans [42, 43]. In the gastrointestinal mucosa, CART expression was mainly identified in gastrin-producing G cells, but the physiological function of CART in gastrointestinal endocrine cells has not been elucidated [44, 45]. SPARC1 is a member of the SPARC family, a member of the extracellular matrix glycoprotein, and is involved in many physiological functions [33]. It has been shown to be down-regulated in a variety of cancers and can be used as a negative regulator of cell growth and proliferation. With the promotion of invasion and tumour formation, changes in SPARC expression are associated with disease progression and poor prognosis [46, 47].

In addition, we performed IHC analysis on prognostic genes, and further confirmed gene expression patterns at the protein level based on the human protein map. The results showed that these 16 genes were highly expressed in STAD, suggesting that most of these genes may play a carcinogenic role in STAD.

There have been many experimental studies on the correlation between gene expression and survival of STAD patients, but the size of the subjects is generally small, lacking a more comprehensive analysis of STAD and its microenvironment [48, 49]. With the rapid development of sequencing technology, more and more tumour databases have been developed, such as TCGA, GEO, and can be used free of charge [50, 51]. They facilitate the large-scale and comprehensive analysis of data. The TME plays an important role in the development of tumours and affects their occurrence, growth, and metastasis [52,53,54,55]. Based on immunological and stromal cell analysis of STAD samples, we obtained TME-related genes with prognostic value, providing potential value for future treatment of STAD.

Conclusions

In summary, the ESTIMATE algorithm was used to obtain immune/stromal scores for SA samples in TCGA database, which in turn yielded some prognostic genes associated with the TME. These genes were validated using data from the GEO database and may help outline the prognosis of STAD patients. Among them, the unreported genes could become potential biomarkers for STAD. In addition, research on the prognostic role of the overall gene set may provide significant information on their clinical applicability. Finally, further research on these genes may provide new insights into the TME in STAD with the potential of yielding better clinical outcomes.

Availability of data and materials

The data that support the findings of this study are available in TCGA at https://portal.gdc.cancer.gov, reference number TCGA-STAD, and GEO dataset [GSE84433].

Abbreviations

STAD:: Stomach adenocarcinoma
TCGA:: The Cancer Genome Atlas
TME:: Tumour microenvironment
ECM:: Extracellular matrix
GEO:: Gene Expression Omnibus
DAVID:: Database for Annotation, Visualization and Integrated Discovery
DEGs:: Differentially expressed genes
BP:: Biological processes
CC:: Cell components
MF:: Molecular functions
GO:: Gene Ontology
KEGG:: Kyoto Encyclopedia of Genes and Genomes
PPI:: Protein–protein interaction
STRING:: Search Tool for the Retrieval of Interacting Genes
MCODE:: Molecular Complex Detection

References

Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68(6):394–424.
Article PubMed Google Scholar
Siegel RL, Miller KD, Jemal A. Cancer statistics, 2019. CA Cancer J Clin. 2019;69(1):7–34.
Article PubMed Google Scholar
Van Cutsem E, Sagaert X, Topal B, Haustermans K, Prenen H. Gastric cancer. Lancet. 2016;388(10060):2654–64.
Article CAS PubMed Google Scholar
Zhu YH, Jeong S, Wu M, Jin ZY, Zhou JY, Han RQ, et al. Dietary intake of fatty acids, total cholesterol, and stomach cancer in a Chinese population. Nutrients. 2019;11(8):1730.
Article CAS PubMed Central Google Scholar
Parkin DM, Bray F, Ferlay J, Pisani P. Global cancer statistics, 2002. CA Cancer J Clin. 2005;55(2):74–108.
Article PubMed Google Scholar
Allemani C, Matsuda T, Di Carlo V, Harewood R, Matz M, Nikšić M, et al. Global surveillance of trends in cancer survival 2000–14 (CONCORD-3): analysis of individual records for 37 513 025 patients diagnosed with one of 18 cancers from 322 population-based registries in 71 countries. Lancet. 2018;391(10125):1023–75.
Article PubMed PubMed Central Google Scholar
Jia D, Li S, Li D, Xue H, Yang D, Liu Y. Mining TCGA database for genes of prognostic value in glioblastoma microenvironment. Aging (Albany NY). 2018;10(4):592–605.
Article CAS PubMed PubMed Central Google Scholar
Kankeu Fonkoua L, Yee NS. Molecular characterization of gastric carcinoma: therapeutic implications for biomarkers and targets. Biomedicines. 2018;6(1):32.
Article PubMed Central CAS Google Scholar
Zeng D, Li M, Zhou R, Zhang J, Sun H, Shi M, et al. Tumor microenvironment characterization in gastric cancer identifies prognostic and immunotherapeutically relevant gene signatures. Cancer Immunol Res. 2019;7(5):737–50.
Article CAS PubMed Google Scholar
Binnewies M, Roberts EW, Kersten K, Chan V, Fearon DF, Merad M, et al. Understanding the tumor immune microenvironment (TIME) for effective therapy. Nat Med. 2018;24(5):541–50.
Article CAS PubMed PubMed Central Google Scholar
Hanahan D, Weinberg RA. The hallmarks of cancer. Cell. 2000;100(1):57–70.
Article CAS PubMed Google Scholar
Hanahan D, Coussens LM. Accessories to the crime: functions of cells recruited to the tumor microenvironment. Cancer Cell. 2012;21(3):309–22.
Article CAS PubMed Google Scholar
Jiang X, Wang J, Deng X, Xiong F, Ge J, Xiang B, et al. Role of the tumor microenvironment in PD-L1/PD-1-mediated tumor immune escape. Mol Cancer. 2019;18(1):10.
Article PubMed PubMed Central Google Scholar
Ren Q, Zhu P, Zhang H, Ye T, Liu D, Gong Z, et al. Identification and validation of stromal-tumor microenvironment-based subtypes tightly associated with PD-1/PD-L1 immunotherapy and outcomes in patients with gastric cancer. Cancer Cell Int. 2020;20:92.
Article PubMed PubMed Central Google Scholar
Yin Z, Lan H, Tan G, Lu M, Vasilakos AV, Liu W. Computing platforms for big biological data analytics: perspectives and challenges. Comput Struct Biotechnol J. 2017;15:403–11.
Article PubMed PubMed Central Google Scholar
He KY, Ge D, He MM. Big data analytics for genomic medicine. Int J Mol Sci. 2017;18(2):412.
Article PubMed Central CAS Google Scholar
Yoshihara K, Shahmoradgoli M, Martínez E, Vegesna R, Kim H, Torres-Garcia W, et al. Inferring tumour purity and stromal and immune cell admixture from expression data. Nat Commun. 2013;4(1):1–11.
Article CAS Google Scholar
Xu WH, Xu Y, Wang J, Wan FN, Wang HK, Cao DL, et al. Prognostic value and immune infiltration of novel signatures in clear cell renal cell carcinoma microenvironment. Aging (Albany NY). 2019;11(17):6999–7020.
Article CAS PubMed PubMed Central Google Scholar
Alonso MH, Ausso S, Lopez-Doriga A, Cordero D, Guino E, Sole X, et al. Comprehensive analysis of copy number aberrations in microsatellite stable colon cancer in view of stromal component. Br J Cancer. 2017;117(3):421–31.
Article CAS PubMed PubMed Central Google Scholar
da Huang W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4(1):44–57.
Article CAS Google Scholar
Szklarczyk D, Franceschini A, Wyder S, Forslund K, Heller D, Huerta-Cepas J, et al. STRING v10: protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res. 2015;43(Database issue):D447–52.
Article CAS PubMed Google Scholar
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11):2498–504.
Article CAS PubMed PubMed Central Google Scholar
Wu H, Wang Y, Wang S, Jiang M, Wang C, Fu W, et al. Is susceptibility locus for lung cancer in the 15q25 nicotinic acetylcholine receptor gene cluster CHRNA5-A3-B4 associated with risk of gastric cancer? Med Oncol. 2013;30(2):576.
Article PubMed CAS Google Scholar
Hiraki M, Kitajima Y, Koga Y, Tanaka T, Nakamura J, Hashiguchi K, et al. Aberrant gene methylation is a biomarker for the detection of cancer cells in peritoneal wash samples from advanced gastric cancer patients. Ann Surg Oncol. 2011;18(10):3013–9.
Article PubMed Google Scholar
Shi J, Zhang G, Yao D, Liu W, Wang N, Ji M, et al. Prognostic significance of aberrant gene methylation in gastric cancer. Am J Cancer Res. 2012;2(1):116–29.
CAS PubMed Google Scholar
Lan X, Xing J, Gao H, Li S, Quan L, Jiang Y, et al. Decreased expression of selenoproteins as a poor prognosticator of gastric cancer in humans. Biol Trace Elem Res. 2017;178(1):22–8.
Article CAS PubMed Google Scholar
Jiang H, Gu J, Du J, Qi X, Qian C, Fei BJMMR. A 21-gene support vector machine classifier and a 10-gene risk score system constructed for patients with gastric cancer. Mol Med Rep. 2020;21(1):347–59.
CAS PubMed Google Scholar
Zhang C, Liang Y, Ma MH, Wu KZ, Dai DQ. KRT15, INHBA, MATN3, and AGT are aberrantly methylated and differentially expressed in gastric cancer and associated with prognosis. Pathol Res Pract. 2019;215(5):893–9.
Article CAS PubMed Google Scholar
Niu G, Yang Y, Ren J, Song T, Hu Z, Chen L, et al. Overexpression of CPXM2 predicts an unfavorable prognosis and promotes the proliferation and migration of gastric cancer. Oncol Rep. 2019;42(4):1283–94.
CAS PubMed PubMed Central Google Scholar
Wang Z, Chen G, Wang Q, Lu W, Xu M. Identification and validation of a prognostic 9-genes expression signature for gastric cancer. Oncotarget. 2017;8(43):73826–36.
Article PubMed PubMed Central Google Scholar
Zhang J, Liu X, Yu G, Liu L, Wang J, Chen X, et al. UBE2C is a potential biomarker of intestinal-type gastric cancer with chromosomal instability. Front Pharmacol. 2018;9:847.
Article PubMed PubMed Central CAS Google Scholar
Wang H, Duan XL, Qi XL, Meng L, Xu YS, Wu T, et al. Concurrent hypermethylation of SFRP2 and DKK2 activates the Wnt/beta-catenin pathway and is associated with poor prognosis in patients with gastric cancer. Mol Cells. 2017;40(1):45–53.
Article PubMed PubMed Central CAS Google Scholar
Li P, Qian J, Yu G, Chen Y, Liu K, Li J, et al. Down-regulated SPARCL1 is associated with clinical significance in human gastric cancer. J Surg Oncol. 2012;105(1):31–7.
Article CAS PubMed Google Scholar
Deng X, Xiao Q, Liu F, Zheng C. A gene expression-based risk model reveals prognosis of gastric cancer. PeerJ. 2018;6:e4204.
Article PubMed PubMed Central CAS Google Scholar
Sun C, Yuan Q, Wu D, Meng X, Wang B. Identification of core genes and outcome in gastric cancer using bioinformatics analysis. Oncotarget. 2017;8(41):70271–80.
Article PubMed PubMed Central Google Scholar
Xu Y, Liu Z, Guo K. Expression of FHL1 in gastric cancer tissue and its correlation with the invasion and metastasis of gastric cancer. Mol Cell Biochem. 2012;363(1–2):93–9.
Article CAS PubMed Google Scholar
Bai Z, Ye Y, Liang B, Xu F, Zhang H, Zhang Y, et al. Proteomics-based identification of a group of apoptosis-related proteins and biomarkers in gastric cancer. Int J Oncol. 2011;38(2):375–83.
CAS PubMed Google Scholar
Kim JW, Nam KH, Ahn SH, Park DJ, Kim HH, Kim SH, et al. Prognostic implications of immunosuppressive protein expression in tumors as well as immune cell infiltration within the tumor microenvironment in gastric cancer. Gastric Cancer. 2016;19(1):42–52.
Article CAS PubMed Google Scholar
Bussard KM, Mutkus L, Stumpf K, Gomez-Manzano C, Marini FC. Tumor-associated stromal cells as key contributors to the tumor microenvironment. Breast Cancer Res. 2016;18(1):84.
Article PubMed PubMed Central CAS Google Scholar
Choi Y, Kim JW, Nam KH, Han SH, Kim JW, Ahn SH, et al. Systemic inflammation is associated with the density of immune cells in the tumor microenvironment of gastric cancer. Gastric Cancer. 2017;20(4):602–11.
Article CAS PubMed Google Scholar
Yu PC, Long D, Liao CC, Zhang S. Association between density of tumor-infiltrating lymphocytes and prognoses of patients with gastric cancer. Medicine (Baltimore). 2018;97(27):e11387.
Article PubMed PubMed Central Google Scholar
Lau J, Herzog H. CART in the regulation of appetite and energy homeostasis. Front Neurosci. 2014;8:313.
Article PubMed PubMed Central Google Scholar
Cutcutache I, Wu AY, Suzuki Y, McPherson JR, Lei Z, Deng N, et al. Abundant copy-number loss of CYCLOPS and STOP genes in gastric adenocarcinoma. Gastric Cancer. 2016;19(2):453–65.
Article CAS PubMed Google Scholar
Ekblad E, Kuhar M, Wierup N, Sundler F. Cocaine- and amphetamine-regulated transcript: distribution and function in rat gastrointestinal tract. Neurogastroenterol Motil. 2003;15(5):545–57.
Article CAS PubMed Google Scholar
Volkoff H, Peter RE. Characterization of two forms of cocaine- and amphetamine-regulated transcript (CART) peptide precursors in goldfish: molecular cloning and distribution, modulation of expression by nutritional status, and interactions with leptin. Endocrinology. 2001;142(12):5076–88.
Article CAS PubMed Google Scholar
Gagliardi F, Narayanan A, Mortini P. SPARCL1 a novel player in cancer biology. Crit Rev Oncol Hematol. 2017;109:63–8.
Article PubMed Google Scholar
Li T, Liu X, Yang A, Fu W, Yin F, Zeng X. Associations of tumor suppressor SPARCL1 with cancer progression and prognosis. Oncol Lett. 2017;14(3):2603–10.
Article PubMed PubMed Central CAS Google Scholar
Wang Q, Hu B, Hu X, Kim H, Squatrito M, Scarpace L, et al. Tumor evolution of glioma-intrinsic gene expression subtypes associates with immunological changes in the microenvironment. Cancer Cell. 2018;33(1):152.
Article CAS PubMed PubMed Central Google Scholar
Shigemori T, Toiyama Y, Okugawa Y, Yamamoto A, Yin C, Narumi A, et al. Soluble PD-L1 expression in circulation as a predictive marker for recurrence and prognosis in gastric cancer: direct comparison of the clinical burden between tissue and serum PD-L1 expression. Ann Surg Oncol. 2019;26(3):876–83.
Article PubMed Google Scholar
Li J, Lu Y, Akbani R, Ju Z, Roebuck PL, Liu W, et al. TCPA: a resource for cancer functional proteomics data. Nat Methods. 2013;10(11):1046–7.
Article CAS PubMed PubMed Central Google Scholar
Chen J, Kong Y, Weng S, Dong C, Zhu L, Yang Z, et al. Outcomes of surgery for gastric cancer with distant metastases: a retrospective study from the SEER database. Oncotarget. 2017;8(3):4342–51.
Article PubMed Google Scholar
Thakkar S, Sharma D, Kalia K, Tekade RK. Tumor microenvironment targeted nanotherapeutics for cancer therapy and diagnosis: a review. Acta Biomater. 2019;101:43–68.
Article PubMed CAS Google Scholar
Taube JM, Galon J, Sholl LM, Rodig SJ, Cottrell TR, Giraldo NA, et al. Implications of the tumor immune microenvironment for staging and therapeutics. Mod Pathol. 2018;31(2):214–34.
Article CAS PubMed Google Scholar
Petitprez F, Vano YA, Becht E, Giraldo NA, de Reynies A, Sautes-Fridman C, et al. Transcriptomic analysis of the tumor microenvironment to guide prognosis and immunotherapies. Cancer Immunol Immunother. 2018;67(6):981–8.
Article PubMed Google Scholar
Wiggins JM, Opoku-Acheampong AB, Baumfalk DR, Siemann DW, Behnke BJ. Exercise and the tumor microenvironment: potential therapeutic implications. Exerc Sport Sci Rev. 2018;46(1):56–64.
Article PubMed Google Scholar

Download references

Acknowledgements

The author thanks Professor Xu Teng and Professor Ya-Juan Feng for reviewing and editing the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (Grant Nos. 81902960, 81672726), the Natural Science Foundation of Beijing (Grant No. 7204241).

Author information

Authors and Affiliations

School of Information Science and Technology, University of Science and Technology of China, Hefei, 230026, Anhui, China
Lin Zhou & Ya-Juan Feng
Beijing Key Laboratory for Tumor Invasion and Metastasis, Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Capital Medical University, Beijing, 100069, China
Wei Huang, He-Fen Yu & Xu Teng

Authors

Lin Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Wei Huang
View author publications
You can also search for this author in PubMed Google Scholar
He-Fen Yu
View author publications
You can also search for this author in PubMed Google Scholar
Ya-Juan Feng
View author publications
You can also search for this author in PubMed Google Scholar
Xu Teng
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization, LZ, XT; Methodology, LZ, WH; Investigation, LZ, YH; writing—original draft, LZ; writing—review & editing, LZ, YF, XT; funding acquisition, XT; resources, XT; supervision, YF, XT. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Ya-Juan Feng or Xu Teng.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1: Table S1.

Up and down-regulate differential genes.

Additional file 2: Table S2.

Genes related to survival in STAD.

Additional file 3: Table S3.

Top 10 core genes from PPI network.

Additional file 4: Table S4.

Genes related to survival that have been verified in gastric cancer samples from the GEO database.

Additional file 5: Figure S1.

IHC analysis of 10 other genes.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Zhou, L., Huang, W., Yu, HF. et al. Exploring TCGA database for identification of potential prognostic genes in stomach adenocarcinoma. Cancer Cell Int 20, 264 (2020). https://doi.org/10.1186/s12935-020-01351-3

Download citation

Received: 29 March 2020
Accepted: 15 June 2020
Published: 23 June 2020
DOI: https://doi.org/10.1186/s12935-020-01351-3

Exploring TCGA database for identification of potential prognostic genes in stomach adenocarcinoma

Abstract

Background

Methods

Results

Conclusions

Similar content being viewed by others

Background

Methods

Raw data

Differential expression analysis

Enrichment analysis and PPI network

Survival analysis

Results

Stromal and immune scores are associated with the TNM staging system and survival prognosis

Differential expression and enrichment analysis of STAD cases based on stromal and immune scores

Survival analysis of differential genes

PPI network analysis of genes with prognostic values

Using the GEO database to verify genes with prognostic values

Prognostic gene validation using clinical tissue samples

Discussion

Conclusions

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation