Molecular subtyping of glioblastoma based on immune-related genes for prognosis

Chen, Xueran; Fan, Xiaoqing; Zhao, Chenggang; Zhao, Zhiyang; Hu, Lizhu; Wang, Delong; Wang, Ruiting; Fang, Zhiyou

doi:10.1038/s41598-020-72488-4

Molecular subtyping of glioblastoma based on immune-related genes for prognosis

Article
Open access
Published: 23 September 2020

Volume 10, article number 15495, (2020)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Molecular subtyping of glioblastoma based on immune-related genes for prognosis

Download PDF

Xueran Chen^1,2^na1,
Xiaoqing Fan^3,4^na1,
Chenggang Zhao^1,5,
Zhiyang Zhao^1,5,
Lizhu Hu^1,5,
Delong Wang^3,4,
Ruiting Wang^3,4 &
…
Zhiyou Fang^1,2

3127 Accesses
15 Citations
Explore all metrics

Abstract

Glioblastoma (GBM) is associated with an increasing mortality and morbidity and is considered as an aggressive brain tumor. Recently, extensive studies have been carried out to examine the molecular biology of GBM, and the progression of GBM has been suggested to be correlated with the tumor immunophenotype in a variety of studies. Samples in the current study were extracted from the ImmPort and TCGA databases to identify immune-related genes affecting GBM prognosis. A total of 92 immune-related genes displaying a significant correlation with prognosis were mined, and a shrinkage estimate was conducted on them. Among them, the 14 most representative genes showed a marked correlation with patient prognosis, and LASSO and stepwise regression analysis was carried out to further identify the genes for the construction of a predictive GBM prognosis model. Then, samples in training and test cohorts were incorporated into the model and divided to evaluate the efficiency, stability, and accuracy of the model to predict and classify the prognosis of patients and to identify the relevant immune features according to the median value of RiskScore (namely, Risk-H and Risk-L). In addition, the constructed model was able to instruct clinicians in diagnosis and prognosis prediction for various immunophenotypes.

Identification of glioblastoma immune subtypes and immune landscape based on a large cohort

Article Open access 19 August 2021

A 1p/19q Codeletion-Associated Immune Signature for Predicting Lower Grade Glioma Prognosis

Article 07 September 2020

Optimized risk stratification strategy for glioma patients based on the feature genes of poor immune cell infiltration patterns

Article 03 August 2023

Introduction

Glioblastoma (GBM), an aggressive primary malignancy in the central nervous system, has a median survival time of 12–15 months and a 5-year survival rate of < 5%^1,2. According to the Clinical Practice Guideline formulated by the American National Comprehensive Cancer Network, chemotherapy is still the preferred choice for stage III–IV GBM³. Currently, there are various chemotherapy regimens, but some patients do not benefit from chemotherapy, and imaging examination could be applied to examine cancer development. In addition, the distinct long-term clinical outcomes may be detected based on tumor heterogeneities among these cases with the same pathological subtype⁴. However, some problems remain to be solved: how to assess tumor heterogeneities prior to treatment for these cases in a non-invasive or less traumatic way, estimate the risk of cancer progression, evaluate tumor response to chemotherapy in individual patients, and to estimate the different long-time overall survival (OS) among groups with different cancer heterogeneities⁵.

Currently, an immune disorder that can promote tumor genesis has been recognized as the enabling feature in the glioma genesis process⁶. Glioma cells can remarkably induce an immune response; in some cases, they can subjugate such a response to establish an appropriate microenvironment to promote their development⁷. Standard treatment cannot achieve a satisfying effect; thus, immunotherapy is being intensively investigated as an additional method⁸. Meanwhile, some parameters related to immunity have been reported to predict the disease prognosis, which has highlighted the significance of different immune states in identifying glioma outcomes^9,10. Nonetheless, immune phenotypes in a glioma microenvironment, together with their relationship with prognosis, are rarely examined systemically.

Biomarkers are able to accurately estimate disease prognosis and patient survival, which are thereby valuable for decision-making in clinical GBM treatment^11,12. Recently, an increasing number of studies have suggested that the expression patterns of genes can predict and classify the survival outcomes of GBM patients¹³. Nonetheless, this proposal has still not been identified as a clinical routine practice, which may be related to the lack of evidence, small sample size, and tremendous data fitting in most studies. Consequently, use of large-scale databases that are accessible to the public and involve the expression patterns of genes, like TCGA, makes it possible to identify the most reliable biomarkers to predict and classify GBM prognosis. In this study, a model to predict the prognosis of GBM was constructed and verified based on immune-related genes, according to the clinical characteristics of patients extracted from the ImmPort and TCGA databases. Our results can help clinicians evaluate the efficacy, predict the disease prognosis, and select the suitable GBM treatment.

Results

Mining of specific immune-related genes based on GBM patient survival and prognostic outcomes

At first, related data were collected based on the ImmPort and TCGA databases, followed by a pre-processing. Then, all immune-related genes and survival data were analyzed using the univariate Cox proportional hazards regression model based on the R survival package coxph function, with the significance level set at p < 0.05 (Supplementary Table S1). Finally, 92 prognosis-specific immune-related genes were mined. The association between the p values for these 92 genes and expression intensities (log2(EXP)), together with hazard ratios (HRs), is presented in Fig. 1A,B.

Altogether, 92 immune-related genes were identified, but most of them were not suitable for clinical detection. Therefore, the number of immune-related genes was reduced, while a high accuracy was maintained. Consequently, these 92 genes were narrowed down using a least absolute shrinkage and selection operator (LASSO) regression, to decrease the number of genes recruited into this risk model. The LASSO algorithm, a biased estimate used for processing multicollinearity data, can predict and select variables, and overcome the multicollinearity problem in regression analysis. Here, R package glmnet was utilized for LASSO regression analysis. The variation trajectory of each independent variable was assessed, as presented in Fig. 1C, which indicated that most independent parameters had coefficients of about zero with a gradual lambda increase. Moreover, the model was also established by means of a tenfold cross-validation. Figure 1D displays the confidence interval of each lambda, which reveals that the optimal model was acquired when the lambda was 0.04456. Therefore, this model was selected as the final model, involving 34 immune-related genes (Supplementary Table S2). Moreover, MASS R package was used for stepwise regression analysis, according to Akaike data standards, and 14 genes were used for the risk model construction (Supplementary Tables S3 and S4). The formula is presented in the “Methods” section.

Construction of the model to predict prognosis for GBM patients

Then, all samples in the training set were substituted into the formula to calculate the RiskScore value. The median RiskScore value was used as the threshold to classify patients into high- (Risk-H) and low-risk (Risk-L) groups. Receiver operating characteristic (ROC) analysis was also performed for prognosis classification according to the RiskScore value. The OS of all samples was 1–3 years (Supplementary Fig. S1). As a result (Fig. 2A), the model prediction efficiency for 1–3-year OS was examined, and the average area under the curve (AUC) was as high as 0.793. Moreover, Fig. 2B shows the sample distribution in Risk-H and Risk-L groups for various OS, suggesting no statistically significant differences in 0- and 1-year sample sizes between the two groups. Moreover, the 1.5-year sample size of Risk-H group was remarkably decreased compared with that of Risk-L group, which was more obvious with the OS extension (Fig. 2C). We next extracted the gene expression profile for the clustering analysis using log10 for all expression values. We also used the hierarchical clustering method to calculate the Euclidean distance between different features. Figure 2D shows the results of sample clustering of the training set. As expected, the above-mentioned 14 genes were markedly clustered into high and low expression groups, respectively, and the training set samples were also divided into two groups. Additionally, the RiskScore values between these two subclasses were compared (Fig. 2E).

To validate the model reliability, the expression patterns of the above 14 genes were extracted based on test cohort and substituted into the validation model. Meanwhile, the RiskScore values of all samples were also computed, and the test set data were also used to evaluate the model efficacy to predict the OS at 1–3 years, as presented in Supplementary Fig. S2, which displays the sample distribution in Risk-H and Risk-L groups at various OS. The difference in the distribution of 0–1-year sample size between the two groups was not statistically significant. Moreover, the 2-year sample size in the Risk-H group was also notably decreased compared with that in the Risk-L group, which was even obvious with the OS extension (Supplementary Fig. S2). Supplementary Figure S2 shows the results of sample clustering of the test cohort, as well as the different RiskScore values between these two subgroups.

Moreover, we retrieved the GSE74187 data set with prognosis follow-up information from the GEO database. The expression matrix of these 14 genes was extracted from the expression profile and the risk score of each sample was calculated using the same method. We evaluated the ROC risk score analysis, which indicated that the average AUC at 1, 2, and 3 years was 0.83 (Supplementary Fig. S3A). According to the median of the high-risk group, the prognosis was significantly worse than that of the low-risk group (Supplementary Fig. S3B), which was consistent with the training and test sets.

In addition, the expression patterns of 14 genes extracted based on all the above 523 samples were substituted into the model to calculate the RiskScore values to validate the model reliability and stability (Supplementary Fig. S4), which exhibits the results of sample clustering and different RiskScore values between these two subgroups. Overall, the RiskScore model established based on the expression patterns of 14 immune-related genes presented favorable accuracy and stability to identify immunity-related features.

Finally, we plotted the Kaplan–Meier survival curves of the Risk-H and Risk-L groups based on the 14-gene-based risk model in the training (n = 261) and test cohorts (n = 262), and in all the samples (n = 523), separately, as shown in Fig. 3A–C (p < 0.0001, p < 0.001, and p < 0.0001, respectively).

Functional annotations of immune-related genes and enrichment of signaling pathways specific to prognosis

All the above 14 gene families were first annotated according to the human gene classification in the HGNC database (Supplementary Table S5). All were significantly enriched in galanin receptors and endothelin receptor gene families (p < 0.05). Additionally, the clusterProfile in the R package was used for the enrichment analysis on the 14 genes. Supplementary Fig. S5 shows the results of the GO enrichment analysis and Supplementary Table S6 shows the related data, which indicated that most genes were enriched to distinct immune-related signaling pathways and biological processes.

The R package GSVA ssGSEA function was used for KEGG functional enrichment. Associations with the RiskScore values were examined based on the pathway enrichment scores among the different samples to obtain a total of 21 KEGG-related pathways (Supplementary Table S7–S9). These 21 pathways were chosen for clustering analysis in accordance with the sample enrichment results from the training cohort (Fig. 4A). Additionally, the relationship between the enrichment score and the RiskScore value was examined by selecting the two major pathways with the highest GSEA enrichment scores (e.g., vascular smooth muscle contraction and the JAK-STAT signaling pathway). The sample distribution in the two groups was also explored. We found that the pathway enrichment scores were different in the Risk-H relative to the Risk-L group (Fig. 4B,C).

Relationships of the RiskScore values with the clinical characteristics of samples

Subsequently, the associations between various parameters (such as neoadjuvant, sex, and age) and the RiskScore value were examined (Fig. 5A–C). Clearly, other features were not related to the RiskScore value (p > 0.05), except for age, and the constructed RiskScore model was dependent on patient age.

At last, the RiskScore values combined with the clinical characteristics were used to construct the nomogram model. Use of a nomogram, an approach to intuitively and effectively present risk model results, is convenient for predicting patient outcomes. Specifically, the straight-line length in a nomogram represents the effects of different parameters and their significance on the outcome. Here, a nomogram was constructed to combine the RiskScore, age, neoadjuvant, and sex, respectively, as displayed in Fig. 5D. RiskScore characteristics showed an obvious association with the greatest influence on predicting the survival rate, indicating that the 14-gene-based risk model had a superb prognosis prediction ability.

The forest plot was established based on the clinical characteristics and RiskScore. In Fig. 5E, the HRs for RiskScore were approximately 1.4 (p < 0.05).

Indeed, we also had analyzed the relationship between the expression level of 36 immune-checkpoint genes and RiskScore (Supplementary Table S10). In addition to eight genes, including PDCD1 and CTLA4, the expression levels of other 26 genes showed a positive correlation with RiskScore, suggesting that the constructed model was able to instruct clinicians in diagnosing and predicting the prognosis for various immunophenotypes.

Practical application of the prediction model for GBM patients

According to the prognostic prediction model, we analyzed the clinical follow-up data of these 24 GBM patients, which were divided into Risk-H and Risk-L groups (n = 12, each), based on the median RiskScore value. There was an inverse correlation between the RiskScore value and OS (p = 0.0392) (Fig. 6A), with an AUC of 0.7465 (Fig. 6B).

CD4+CD25+ regulatory T cells (Tregs) play an important role in anti-tumor immune responses, and a poor prognosis and declining survival rates are closely related with high Treg expression in cancer patients^14,15. Consistent with these, the RiskScore value showed a negative relationship with CD3+CD4+/CD3+CD8+ (r = − 0.9635, p < 0.0001; Fig. 6C), but a positive relationship with CD4+CD25+ Tregs percentage (r = 0.5167, p = 0.0116; Fig. 6D). Notably, PD-L1 or PD-L2 immunohistochemical (IHC) analysis results showed that the IHC score was positively correlated with the RiskScore (Fig. 6E,F).

Taken together, we concluded that this prognostic predictor showed great promise in clinical practice application.

Discussion

Currently, GBM treatments include surgery alone for an early-stage disease and adjuvant radio/chemotherapy plus surgical resection for an advanced stage. However, surgical resection cannot provide a satisfactory effect because cancer cells may have invaded the local adjacent tissues or developed metastasis¹⁶. Moreover, it is still controversial whether systemic adjuvant therapy can be prescribed following surgery owing to tumor heterogeneity or potential adverse effects¹⁷. Consequently, it is important to mine the potential biomarkers to predict GBM prognosis; this way, high-risk GBM cases can benefit from early adjuvant therapy. This can also assist in the clinical management of individual patients and thereby accurately distinguish patients that can be completely treated using adjuvant treatment from those that can avoid treatment and the possible chemotherapeutics-derived toxicity¹⁸. In the current work, a candidate signature was examined as a reliable method to predict GBM prognosis.

Due to the emerging next-generation sequencing techniques, a number of candidate biomarkers for the diagnosis and prognosis prediction of GBM were identified, which makes it possible to more specifically classify and more accurately predict GBM outcomes¹⁹. Several molecular markers, such as isocitrate dehydrogenase, O6-methylguanine DNA methyltransferase, phosphatase and tensin homolog, and epidermal growth factor receptor, are conventionally examined in clinical GBM cases^20,21. These molecular markers facilitate targeted anti-GBM treatments and individualized therapeutic methods. Nonetheless, GBM has a dismal prognosis, so new treatment strategies and molecular biomarkers are urgently needed to illustrate the underlying GBM mechanisms and improve the OS of patients.

Limited clinical data and fresh tumor specimens symbolizing transitional steps from tumor initiation to progression are important barriers to improving clinical outcomes in GBM patients. Methylation-based subtypes that predict GBM patient survival have been reported. Notably, the methylation levels of different subgroups could reflect different molecular genetic features^22,23. More and more attention has been paid to the relationship between the immune system and malignancy progression and pathogenesis, which contribute to GBM treatment, thereby promoting the development of anti-tumor treatments. CD68+ and CD163+ cells were the most abundant populations in GBM, and the percentage of CD163+ cells correlated with a poorer prognosis. Mesenchymal GBMs displayed the highest percentages of microglia, macrophage, and lymphocyte infiltration²⁴. Wild-type and the mesenchymal subtype, IDH1, in GBM presented strong immunosuppressive microenvironments, while tumors of mutated IDH1 and TCGA proneural subtypes exhibited a significantly less immunosuppressive state²⁵. Regarding tumor origin (namely, the immune system), the approach of regulating and killing cancer cells by modulating the immune system and promoting anti-cancer immunity in the tumor microenvironment is novel. Therefore, screening of novel significant prognosis-specific immune-related genes is meaningful for predicting disease prognosis and identifying novel therapeutic targets. Some researchers have reported gene expression-based immunoprofiling of GBM using TCGA data. For example, Arivazhagan et al. reported a 14-gene expression signature that predicted survival in GBM patients. A network analysis specifically revealed inflammatory response pathway activation in the high-risk group²⁶. Zhang et al. showed that samples with high tumor microenvironment (TME) scores were characterized by immune activation, TGF pathway activation, and high expression of immune checkpoint genes, while those with low TME scores were characterized by a high-frequency of IDH1 and MET mutations²⁷. Zhang et al. identified six immune-related genes (CANX, HSPA1B, KLRC2, PSMC6, RFXAP, and TAP1) as risk signatures. Importantly, Kaplan–Meier and ROC curves, as well as risk plotting, verified their performance in TCGA and CGGA datasets²⁸. Zhang et al. observed that a high immune score was associated with low methylation and copy number variation levels, a high expression of immunosuppressive markers (CD27, PDL1 and CTLA4), and a shorter recurrence-free survival²⁹. Here, GBM classification based on the prognosis-specific and immune-related signature could precisely estimate the clinical outcomes and identify those with a high or low risk of postoperative recurrence. Notably, PD-L1 or PD-L2 IHC analysis results showed that the IHC score was positively correlated with the RiskScore. Moreover, the RiskScore value showed a negative relationship with CD3+CD4+/CD3+CD8+, but a positive relationship with CD4+CD25+ Tregs percentage.

Here, 14 prognosis-specific immune-related genes were mined by big data mining, TCGA and ImmPort database sorting, and statistical analyses. Two key points must be cautiously taken into consideration to ensure the prognosis model validity: clinical utility and transport capability in different cohorts. Typically, our constructed prognosis model is better than other prognosis models for GBM that were not duplicated in GBM-independent cohorts. Additionally, our validation set was a multi-institutional cohort involving cases from different hospitals, which suggests that our constructed GBM model is applicable to different clinical settings and patient types. Afterwards, the 14-gene-based model was constructed for prognosis prediction, and RiskScore values for all cases were also computed. Then, the model was applied for prediction and validation. The prognosis model was established based on the expression patterns of specific immune-related genes, and it could classify patients at a certain clinical stage into various subgroups, according to the estimated survival outcomes.

Nine of these 14 genes were previously suggested to be involved in malignant transformation, pathogenesis, progression, and immune microenvironment of GBM, including S100A9, HSPA1A, GALR2, EDNRB, IL13RA2, ELN, NR1D1, HDGF, and MET^{30,31,32,33,34,35}. They were markedly correlated with patient survival and prognosis, which means that our bioinformatic mining displayed a high reliability and accuracy. However, the relationship of the other two genes (namely, CLRF1 and GRAP2) with GBM is not validated in a clinical or basic study, and we are interested in this topic. CRLF1 is verified to be involved in regulating malignant cancer cell proliferation and invasion, which can affect signaling pathways (such as MAPK/ERK and Akt/PI3K) and modulate the immune and nervous systems maturity during fetal development^36,37. GRAP2 is also found to be a candidate tumor suppressor, and it is recognized to be a prognosis prediction marker for different types of cancers, which can regulate tumor cell sensitivity to immunotherapy^38,39.

In conclusion, our results assist in identifying novel biomarkers for predicting the clinical prognosis of GBM. Additionally, the 14-gene-based risk model can provide a variety of targets for an accurate GBM treatment, and it can also help classify GBM patients according to the molecular subtypes. In addition, the constructed model may be used to instruct clinicians in the medication, prognosis prediction, and diagnosis of GBM patients with various immunophenotypes.

Methods

GBM tissue specimens were collected from 24 patients (ages 42–75) who underwent curative resection for glioma with informed consent between 2017 and 2019 at Hefei Cancer Hospital, Chinese Academy of Sciences (CAS), with Institutional Review Board approval. All methods were performed in accordance with the relevant guidelines and regulations, as stated in relevant sections below.

Pre-processing of original sample data and preliminary selection of immune-related genes in GBM

The up-to-date clinical follow-up information was extracted from TCGA GDC API. Altogether, 539 RNA-Seq data samples were mined (as displayed in Supplementary Table S11), and 529 of them were tumor tissues. Additionally, the immune-related gene set involving 1811 genes was also acquired based on the ImmPort database⁴⁰ (Supplementary Table S12).

At first, 529 tumor tissues were subjected to a pro-processing (Supplementary Table S13), and 523 of them involving 1,108 genes were used for further model analysis. Supplementary Table S14 presents the clinical characteristics of samples. Afterwards, these 523 samples were classified into training and test sets, respectively. Random grouping with replacement was carried out 100 times on all samples to remove the influence of random allocation bias on model stability. The training (n = 261) and test set (n = 262) samples are displayed in Supplementary Tables S15 and 16, respectively. The eventual data of training and test set samples are shown in Supplementary Table S14. Differences between the two sets were not statistically significant, indicating a reasonable sample grouping.

Univariate survival analysis for immune-related samples in training set

The univariate Cox proportional hazards regression model was utilized to analyze the immune-related genes and the survival data using the survival coxph function⁴¹ of R package. A p < 0.05 was regarded to be statistically significant.

Screening of immune-related genes specific to GBM prognosis, and establishment of the model to predict prognosis

At first, the R package MASS and glmnet functions were used for stepwise and LASSO regression analysis⁴², and the risk model was established based on specific immune-related genes, as displayed below:

$$ RiskScore = EDNRA \times -0.325652748 + HSPA1A \times -0.312268258 + S100A9 \times 0.17460672 + PI15 \times -1.128026913 + EDNRB \times -0.199258031 + GALR2 \times -1.690737959 + NR1D1 \times 0.367374589 + FGF14 \times 0.184640626 + ELN \times 0.258161826 + IL13RA2 \times 0.081069744 + MET \times 0.172446326 + HDGF \times -0.342300085 + GRAP2 \times -0.863180168 + CRLF1 \times -0.138403709 $$

(1)

Afterwards, related gene expression patterns were selected based on training and test sets, which were then substituted into the constructed model to calculate the RiskScore values in each sample. The median RiskScore value was utilized as the threshold to classify samples as belonging to the high- (Risk-H) or low-risk (Risk-L) group. Finally, the accuracy, stability, and efficiency of the model to predict and classify GBM prognosis were evaluated through gene clustering, ROC, and KM analyses.

Signaling pathway enrichment and functional annotations for immune-related genes specific to immunity

Finally, 14 genes were screened and the corresponding gene families were annotated in accordance with the human gene classification in the HGNC database⁴³. Moreover, GO enrichment analyses were carried out using these 14 prognosis-specific immune-related genes and clusterProfile⁴⁴ of R package.

Relationships of RiskScore with the signaling pathways and clinical characteristics of samples

At first, the R package GSVA⁴⁵ ssGSEA function was utilized to evaluate the score of KEGG enrichment analysis. At the same time, the relationship of RiskScore was computed, and later, clustering analysis was performed based on the pathway enrichment score for all samples. Then, the relationships of related factors (like neoadjuvant, sex, and age) with the RiskScore were determined. Finally, the nomogram model was established, and related clinical characteristics and RiskScore values were used to draw the forest plot, and the relationships between RiskScore and clinical characteristics with patient survival were examined.

Phenotyping of peripheral T cells and IHC staining for GBM tissue microarray analysis

Peripheral blood samples from 24 GBM patients undergoing curative resection with informed consent between 2017 and 2019 at Hefei Cancer Hospital, Chinese Academy of Sciences (Anhui, China), were stained with the following sets of monoclonal antibodies (BD Biosciences; San Jose, CA, USA): CD3-PE (clone SP34), CD4-APC-Cy7 (clone SK3), CD8-PerCP (clone SK1), and CD25-FITC (clone MA251), and analyzed on Cytomics FC500 Flow Cytometer CXP with the CXP analysis software (Beckman Coulter Inc.). Twenty-four GBM tissues were placed on a tissue microarray and stained with anti-PD-L1 (clone E1L3N) and anti-PD-L2 (clone D7U8C) antibodies (Cell Signaling Technology; Danvers, MA, USA) , and visualized using the KF-PRO Digital Slide Scanning System (Kongfong Biotech International Co., LTD; Ningbo, China).

Statistical methods

The TCGA dataset was randomly divided into training and test cohorts in a 1:1 ratio. Samples in the training set were analyzed to identify the potential prognosis-predicting genes and validated in both the test and the whole sets. First, the relationships between the expression of immune-related genes and patient OS were evaluated using the univariate Cox proportional hazards regression analysis. Typically, genes with a p < 0.05 through log rank test were selected to be the candidate variables. Later, the number of candidate genes was decreased based on the LASSO-Cox method, and later, immune-related genes showing the greatest significance were chosen for constructing the RiskScore model to predict prognosis. The RiskScore model could be calculated as follows:

$$Riskscore= \sum_{i=0}^{n}{\upbeta }_{i} \times {\upchi }_{i} $$

(2)

where β_i indicates the coefficient, and χ_i represents the gene expression level (fpkm) of each gene. The RiskScore model was calculated for all patients, who were then divided into low- or high-risk groups according to the median RiskScore value in the training set. Patients in the low-risk group had a lower risk of OS, while those in the high-risk group had a higher risk of OS. Then, the difference in OS between these two groups was calculated based on the Kaplan–Meier survival curve. The specificity and sensitivity of the model in diagnosis and prognosis prediction were evaluated according to the areas under the ROC curve. A two-tailed p < 0.05 was deemed to indicate statistical significance. The Bio-conductor and R software (version 3.5.0) were utilized for all statistical analyses.

Ethics approval and consent to participate

This study was reviewed and approved by the Institutional Review Board of the Cancer Hospital of Hefei Institutes of Physical Science, CAS, and written informed consent was obtained from patients based on the Declaration of Helsinki.

Abbreviations

AUC:: Area under the curve
GO:: Gene ontology
GBM:: Glioblastoma
HRs:: Hazard ratios
HGNC:: HUGO Gene Nomenclature Committee
IDH1:: Isocitrate dehydrogenase (NADP(+)) 1
KEGG:: Kyoto Encyclopedia of Genes and Genomes
LASSO:: Least absolute shrinkage and selection operator
MET:: MET proto-oncogene, receptor tyrosine kinase
OS:: Overall survival
ROC:: Receiver operating characteristic
TGF:: Transforming growth factor

References

Wen, P. Y. & Kesari, S. Malignant gliomas in adults. N. Engl. J. Med. 359(5), 492–507. https://doi.org/10.1056/NEJMra0708126 (2008).
Article PubMed CAS Google Scholar
Ostrom, Q. T. et al. CBTRUS statistical report: Primary brain and central nervous system tumors diagnosed in the United States in 2006–2010. Neuro Oncol. 15 Suppl 2, ii1–ii56. https://doi.org/10.1093/neuonc/not151 (2013).
Article PubMed Google Scholar
Nabors, L. B. et al. NCCN guidelines insights: central nervous system cancers, version 1.2017. J. Natl. Compr. Canc. Netw. 15(11), 1331–1345. https://doi.org/10.6004/jnccn.2017.0166 (2017).
Article PubMed Google Scholar
Cheng, W. et al. Bioinformatic profiling identifies an immune-related risk signature for glioblastoma. Neurology 86(24), 2226–2234. https://doi.org/10.1212/WNL.0000000000002770 (2016).
Article PubMed CAS Google Scholar
Verhaak, R. G. et al. Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1. Cancer Cell 17(1), 98–110. https://doi.org/10.1016/j.ccr.2009.12.020 (2010).
Article PubMed PubMed Central CAS Google Scholar
Kim, R., Emi, M. & Tanabe, K. Cancer immunoediting from immune surveillance to immune escape. Immunology 121(1), 1–14. https://doi.org/10.1111/j.1365-2567.2007.02587.x (2007).
Article PubMed PubMed Central CAS Google Scholar
Silver, D. J., Sinyuk, M., Vogelbaum, M. A., Ahluwalia, M. S. & Lathia, J. D. The intersection of cancer, cancer stem cells, and the immune system: therapeutic opportunities. Neuro Oncol. 18(2), 153–159. https://doi.org/10.1093/neuonc/nov157 (2016).
Article PubMed Google Scholar
Finocchiaro, G. & Pellegatta, S. Immunotherapy for glioma: getting closer to the clinical arena?. Curr. Opin. Neurol. 24(6), 641–647. https://doi.org/10.1097/WCO.0b013e32834cbb17 (2011).
Article PubMed Google Scholar
Han, S. et al. Tumour-infiltrating CD4(+) and CD8(+) lymphocytes as predictors of clinical outcome in glioma. Br. J. Cancer 110(10), 2560–2568. https://doi.org/10.1038/bjc.2014.162 (2014).
Article PubMed PubMed Central CAS Google Scholar
Han, S. et al. Pre-treatment neutrophil-to-lymphocyte ratio is associated with neutrophil and T-cell infiltration and predicts clinical outcome in patients with glioblastoma. BMC Cancer 15, 617. https://doi.org/10.1186/s12885-015-1629-7 (2015).
Article PubMed PubMed Central CAS Google Scholar
Aldape, K., Zadeh, G., Mansouri, S., Reifenberger, G. & von Deimling, A. Glioblastoma: pathology, molecular mechanisms and markers. Acta Neuropathol. 129(6), 829–848. https://doi.org/10.1007/s00401-015-1432-1 (2015).
Article PubMed CAS Google Scholar
Szopa, W., Burley, T. A., Kramer-Marek, G. & Kaspera, W. Diagnostic and therapeutic biomarkers in glioblastoma: current status and future perspectives. Biomed. Res. Int. 2017, 8013575. https://doi.org/10.1155/2017/8013575 (2017).
Article PubMed PubMed Central CAS Google Scholar
Bao, Z. S. et al. Prognostic value of a nine-gene signature in glioma patients based on mRNA expression profiling. CNS Neurosci. Ther. 20(2), 112–118. https://doi.org/10.1111/cns.12171 (2014).
Article PubMed CAS Google Scholar
Gajewski, T. F. Identifying and overcoming immune resistance mechanisms in the melanoma tumor microenvironment. Clin. Cancer Res. 12(7 Pt 2), 2326s–2330s. https://doi.org/10.1158/1078-0432.ccr-05-2517 (2006).
Article PubMed CAS Google Scholar
Shevach, E. M. CD4+ CD25+ suppressor T cells: more questions than answers. Nat. Rev. Immunol. 2(6), 389–400. https://doi.org/10.1038/nri821 (2002).
Article PubMed CAS Google Scholar
Cunha, M. Maldaun MVC (2019) Metastasis from glioblastoma multiforme: a meta-analysis. Rev. Assoc. Med. Bras. 65(3), 424–433. https://doi.org/10.1590/1806-9282.65.3.424 (1992).
Article Google Scholar
Abdul, K. U. et al. WINDOW consortium: a path towards increased therapy efficacy against glioblastoma. Drug Resist. Updates 40, 17–24. https://doi.org/10.1016/j.drup.2018.10.001 (2018).
Article Google Scholar
Harrison, R. A. & de Groot, J. F. Treatment of glioblastoma in the elderly. Drugs Aging 35(8), 707–718. https://doi.org/10.1007/s40266-018-0568-9 (2018).
Article PubMed CAS Google Scholar
Huang, J. et al. Immune checkpoint in glioblastoma: promising and challenging. Front. Pharmacol. 8, 242. https://doi.org/10.3389/fphar.2017.00242 (2017).
Article PubMed PubMed Central CAS Google Scholar
Ayoub, Z. et al. Prognostic significance of O6-methylguanine-DNA-methyltransferase (MGMT) promoter methylation and isocitrate dehydrogenase-1 (IDH-1) mutation in glioblastoma multiforme patients: a single-center experience in the Middle East region. Clin. Neurol. Neurosurg. 182, 92–97. https://doi.org/10.1016/j.clineuro.2019.04.008 (2019).
Article PubMed Google Scholar
Chamberlain, M. C. & Sanson, M. Combined analysis of TERT, EGFR, and IDH status defines distinct prognostic glioblastoma classes. Neurology 84(19), 2007. https://doi.org/10.1212/WNL.0000000000001625 (2015).
Article PubMed Google Scholar
Ma, H. et al. Specific glioblastoma multiforme prognostic-subtype distinctions based on DNA methylation patterns. Cancer Gene Ther. https://doi.org/10.1038/s41417-019-0142-6 (2019).
Article PubMed Google Scholar
Tang, Y., Qing, C., Wang, J. & Zeng, Z. DNA methylation-based diagnostic and prognostic biomarkers for glioblastoma. Cell Transplant. 29, 963689720933241. https://doi.org/10.1177/0963689720933241 (2020).
Article PubMed Google Scholar
Martinez-Lage, M. et al. Immune landscapes associated with different glioblastoma molecular subtypes. Acta Neuropathol. Commun 7(1), 203. https://doi.org/10.1186/s40478-019-0803-6 (2019).
Article PubMed PubMed Central CAS Google Scholar
Zhang, C., Li, J., Wang, H. & Song, S. W. Identification of a five B cell-associated gene prognostic and predictive signature for advanced glioma patients harboring immunosuppressive subtype preference. Oncotarget 7(45), 73971–73983. https://doi.org/10.18632/oncotarget.12605 (2016).
Article PubMed PubMed Central Google Scholar
Arimappamagan, A. et al. A fourteen gene GBM prognostic signature identifies association of immune response pathway and mesenchymal subtype with high risk group. PLoS ONE 8(4), e62042. https://doi.org/10.1371/journal.pone.0062042 (2013).
Article ADS PubMed PubMed Central CAS Google Scholar
Zhang, J., Xiao, X., Zhang, X. & Hua, W. Tumor Microenvironment Characterization in Glioblastoma Identifies Prognostic and Immunotherapeutically Relevant Gene Signatures. J. Mol. Neurosci. 70(5), 738–750. https://doi.org/10.1007/s12031-020-01484-0 (2020).
Article PubMed CAS Google Scholar
Zhang, M., Wang, X., Chen, X., Zhang, Q. & Hong, J. Novel immune-related gene signature for risk stratification and prognosis of survival in lower-grade glioma. Front. Genet. 11, 363. https://doi.org/10.3389/fgene.2020.00363 (2020).
Article PubMed PubMed Central Google Scholar
Zhang, B., Shen, R., Cheng, S. & Feng, L. Immune microenvironments differ in immune characteristics and outcome of glioblastoma multiforme. Cancer Med. 8(6), 2897–2907. https://doi.org/10.1002/cam4.2192 (2019).
Article PubMed PubMed Central CAS Google Scholar
Meshalkina, D. A. et al. Knock-down of Hdj2/DNAJA1 co-chaperone results in an unexpected burst of tumorigenicity of C6 glioblastoma cells. Oncotarget 7(16), 22050–22063. https://doi.org/10.18632/oncotarget.7872 (2016).
Article PubMed PubMed Central Google Scholar
Liu, Y. et al. Autocrine endothelin-3/endothelin receptor B signaling maintains cellular and molecular properties of glioblastoma stem cells. Mol. Cancer Res. 9(12), 1668–1685. https://doi.org/10.1158/1541-7786.MCR-10-0563 (2011).
Article PubMed PubMed Central CAS Google Scholar
Han, J. & Puri, R. K. Analysis of the cancer genome atlas (TCGA) database identifies an inverse relationship between interleukin-13 receptor alpha1 and alpha2 gene expression and poor prognosis and drug resistance in subjects with glioblastoma multiforme. J. Neurooncol. 136(3), 463–474. https://doi.org/10.1007/s11060-017-2680-9 (2018).
Article PubMed CAS Google Scholar
Shin, J. et al. Restoration of miR-29b exerts anti-cancer effects on glioblastoma. Cancer Cell Int. 17, 104. https://doi.org/10.1186/s12935-017-0476-9 (2017).
Article PubMed PubMed Central CAS Google Scholar
Thirant, C. et al. Differential proteomic analysis of human glioblastoma and neural stem cells reveals HDGF as a novel angiogenic secreted factor. Stem Cells 30(5), 845–853. https://doi.org/10.1002/stem.1062 (2012).
Article PubMed CAS Google Scholar
De Bacco, F. et al. The MET oncogene is a functional marker of a glioblastoma stem cell subtype. Cancer Res. 72(17), 4537–4550. https://doi.org/10.1158/0008-5472.CAN-11-3490 (2012).
Article PubMed CAS Google Scholar
Sims, N. A. Cardiotrophin-like cytokine factor 1 (CLCF1) and neuropoietin (NP) signalling and their roles in development, adulthood, cancer and degenerative disorders. Cytokine Growth Factor Rev. 26(5), 517–522. https://doi.org/10.1016/j.cytogfr.2015.07.014 (2015).
Article PubMed CAS Google Scholar
Yu, S. T. et al. CRLF1 promotes malignant phenotypes of papillary thyroid carcinoma by activating the MAPK/ERK and PI3K/AKT pathways. Cell Death Dis. 9(3), 371. https://doi.org/10.1038/s41419-018-0352-0 (2018).
Article PubMed PubMed Central CAS Google Scholar
Lee, I., Yeom, S. Y., Lee, S. J., Kang, W. K. & Park, C. A novel senescence-evasion mechanism involving Grap2 and Cyclin D interacting protein inactivation by Ras associated with diabetes in cancer cells under doxorubicin treatment. Cancer Res. 70(11), 4357–4365. https://doi.org/10.1158/0008-5472.CAN-09-3791 (2010).
Article PubMed CAS Google Scholar
Chen, K. Y., Chen, C. C., Tseng, Y. L., Chang, Y. C. & Chang, M. C. GCIP functions as a tumor suppressor in non-small cell lung cancer by suppressing Id1-mediated tumor promotion. Oncotarget 5(13), 5017–5028. https://doi.org/10.18632/oncotarget.2075 (2014).
Article PubMed PubMed Central Google Scholar
Bhattacharya, S. et al. ImmPort, toward repurposing of open access immunological assay data for translational and clinical research. Sci. Data 5, 180015. https://doi.org/10.1038/sdata.2018.15 (2018).
Article PubMed PubMed Central CAS Google Scholar
Liang, R. et al. A comprehensive analysis of prognosis prediction models based on pathwaylevel, genelevel and clinical information for glioblastoma. Int. J. Mol. Med. 42(4), 1837–1846. https://doi.org/10.3892/ijmm.2018.3765 (2018).
Article PubMed PubMed Central CAS Google Scholar
Hou, J. Y., Wang, Y. G., Ma, S. J., Yang, B. Y. & Li, Q. P. Identification of a prognostic 5-Gene expression signature for gastric cancer. J. Cancer Res. Clin. Oncol. 143(4), 619–629. https://doi.org/10.1007/s00432-016-2324-z (2017).
Article PubMed CAS Google Scholar
Braschi, B. et al. Genenamesorg: the HGNC and VGNC resources in 2019. Nucleic Acids Res. 47(D1), D786–D792. https://doi.org/10.1093/nar/gky930 (2019).
Article PubMed CAS Google Scholar
Xu, Z., Wang, C., Xiang, X., Li, J. & Huang, J. Characterization of mRNA expression and endogenous RNA profiles in bladder cancer based on the cancer genome atlas (TCGA) database. Med. Sci. Monit. 25, 3041–3060. https://doi.org/10.12659/MSM.915487 (2019).
Article PubMed PubMed Central CAS Google Scholar
Hanzelmann, S., Castelo, R. & Guinney, J. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinform. 14, 7. https://doi.org/10.1186/1471-2105-14-7 (2013).
Article Google Scholar

Download references

Funding

This research was supported by the National Natural Science Foundation of China (81872066, 31571433, 81773131 and 81972635), the Innovative Program of Development Foundation of Hefei Center for Physical Science and Technology (2018CXFX004 and 2017FXCX008), and Youth Innovation Promotion Association of Chinese Academy of Sciences (2018487).

Author information

These authors contributed equally: Xueran Chen and Xiaoqing Fan.

Authors and Affiliations

Anhui Province Key Laboratory of Medical Physics and Technology, Institute of Health and Medical Technology, Hefei Institutes of Physical Science, Chinese Academy of Sciences, No. 350, Shushan Hu Road, Hefei, 230031, Anhui, China
Xueran Chen, Chenggang Zhao, Zhiyang Zhao, Lizhu Hu & Zhiyou Fang
Department of Molecular Pathology, Hefei Cancer Hospital, Chinese Academy of Sciences, No. 350, Shushan Hu Road, Hefei, 230031, Anhui, China
Xueran Chen & Zhiyou Fang
The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China (USTC), No. 17, Lujiang Road, Hefei, 230001, Anhui, China
Xiaoqing Fan, Delong Wang & Ruiting Wang
Department of Anesthesiology, Anhui Provincial Hospital, No. 17, Lujiang Road, Hefei, 230001, Anhui, China
Xiaoqing Fan, Delong Wang & Ruiting Wang
University of Science and Technology of China, No. 96, Jin Zhai Road, Hefei, 230026, Anhui, China
Chenggang Zhao, Zhiyang Zhao & Lizhu Hu

Authors

Xueran Chen
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoqing Fan
View author publications
You can also search for this author in PubMed Google Scholar
Chenggang Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Zhiyang Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Lizhu Hu
View author publications
You can also search for this author in PubMed Google Scholar
Delong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Ruiting Wang
View author publications
You can also search for this author in PubMed Google Scholar
Zhiyou Fang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X.R.C. and Z.Y.F.: conceived and designed the experiments. C.G.Z., Z.Y.Z., L.Z.H. .and D.L.W.: collected the data, prepared Figs. 1, 2 and 3. X.R.C., and X.Q.F.: phenotyping of peripheral T cells and IHC staining for GBM tissue microarray analysis, performed the analysis, prepared Figs. 4, 5, 6. X.R.C., R.T.W. and Z.Y.F.: participated in the discussion of the algorithm. X.R.C., X.Q.F. and C.G.Z.: prepared and edited the manuscript. All authors have read and approved the final manuscript.

Corresponding author

Correspondence to Xueran Chen.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary figures.

Supplementary tables.

Supplementary legends.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Chen, X., Fan, X., Zhao, C. et al. Molecular subtyping of glioblastoma based on immune-related genes for prognosis. Sci Rep 10, 15495 (2020). https://doi.org/10.1038/s41598-020-72488-4

Download citation

Received: 24 December 2019
Accepted: 02 September 2020
Published: 23 September 2020
DOI: https://doi.org/10.1038/s41598-020-72488-4
Springer Nature Limited

This article is cited by

Radiated glioblastoma cell-derived exosomal circ_0012381 induce M2 polarization of microglia to promote the growth of glioblastoma by CCL2/CCR2 axis
- Chunzhi Zhang
- Yuan Zhou
- Hu Wang
Journal of Translational Medicine (2022)
Glioma: molecular signature and crossroads with tumor microenvironment
- Lennart Barthel
- Martin Hadamitzky
- Susann Hetze
Cancer and Metastasis Reviews (2022)

Molecular subtyping of glioblastoma based on immune-related genes for prognosis

Abstract

Similar content being viewed by others

Introduction

Results

Mining of specific immune-related genes based on GBM patient survival and prognostic outcomes

Construction of the model to predict prognosis for GBM patients

Functional annotations of immune-related genes and enrichment of signaling pathways specific to prognosis

Relationships of the RiskScore values with the clinical characteristics of samples

Practical application of the prediction model for GBM patients

Discussion

Methods

Pre-processing of original sample data and preliminary selection of immune-related genes in GBM

Univariate survival analysis for immune-related samples in training set

Screening of immune-related genes specific to GBM prognosis, and establishment of the model to predict prognosis

Signaling pathway enrichment and functional annotations for immune-related genes specific to immunity

Relationships of RiskScore with the signaling pathways and clinical characteristics of samples

Phenotyping of peripheral T cells and IHC staining for GBM tissue microarray analysis

Statistical methods

Ethics approval and consent to participate

Abbreviations

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation