Meta-analysis of whole-genome gene expression datasets assessing the effects of IDH1 and IDH2 mutations in isogenic disease models

Schulten, Hans-Juergen; Al-Adwani, Fatima; Saddeq, Haneen A. Bin; Alkhatabi, Heba; Alganmi, Nofe; Karim, Sajjad; Hussein, Deema; Al-Ghamdi, Khalid B.; Jamal, Awatif; Al-Maghrabi, Jaudah; Al-Qahtani, Mohammed H.

doi:10.1038/s41598-021-04214-7

Meta-analysis of whole-genome gene expression datasets assessing the effects of IDH1 and IDH2 mutations in isogenic disease models

Article
Open access
Published: 07 January 2022

Volume 12, article number 57, (2022)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Meta-analysis of whole-genome gene expression datasets assessing the effects of IDH1 and IDH2 mutations in isogenic disease models

Download PDF

Hans-Juergen Schulten¹,
Fatima Al-Adwani¹,
Haneen A. Bin Saddeq²,
Heba Alkhatabi^1,2,
Nofe Alganmi^1,3,
Sajjad Karim¹,
Deema Hussein⁴,
Khalid B. Al-Ghamdi⁵,
Awatif Jamal⁶,
Jaudah Al-Maghrabi^6,7 &
…
Mohammed H. Al-Qahtani¹

2857 Accesses
2 Altmetric
Explore all metrics

Abstract

Mutations in isocitrate dehydrogenase 1 (IDH1) and IDH2 are oncogenic drivers to a variable extent in several tumors, including gliomas, acute myeloid leukemia (AML), cholangiocarcinoma, melanoma, and thyroid carcinoma. The pathobiological effects of these mutations vary considerably, impeding the identification of common expression profiles. We performed an expression meta-analysis between IDH-mutant (IDH^mut) and IDH-wild-type (IDH^wt) conditions in six human and mouse isogenic disease models. The datasets included colon cancer cells, glioma cells, heart tissue, hepatoblasts, and neural stem cells. Among differentially expressed genes (DEGs), serine protease 23 (PRSS23) was upregulated in four datasets, i.e., in human colon carcinoma cells, mouse heart tissue, mouse neural stem cells, and human glioma cells. Carbonic anhydrase 2 (CA2) and prolyl 3-hydroxylase 2 (P3H2) were upregulated in three datasets, and SOX2 overlapping transcript (SOX2-OT) was downregulated in three datasets. The most significantly overrepresented protein class was termed intercellular signal molecules. An additional DEG set contained genes that were both up- and downregulated in different datasets and included oxidases and extracellular matrix structural proteins as the most significantly overrepresented protein classes. In conclusion, this meta-analysis provides a comprehensive overview of the expression effects of IDH mutations shared between different isogenic disease models. The generated dataset includes biomarkers, e.g., PRSS23 that may gain relevance for further research or clinical applications in IDH^mut tumors.

A comprehensive analysis of SLC25A1 expression and its oncogenic role in pan-cancer

Article Open access 19 November 2023

Multi-omics data integration reveals novel drug targets in hepatocellular carcinoma

Article Open access 04 August 2021

An integrative pan-cancer investigation reveals common genetic and transcriptional alterations of AMPK pathway genes as important predictors of clinical outcomes across major cancer types

Article Open access 17 August 2020

Introduction

Isocitrate dehydrogenases (IDHs) consist of three isozymes, i.e., IDH1, IDH2, and IDH3, which are key metabolic enzymes catalyzing the conversion of isocitrate to α-ketoglutarate (α-KG) via oxidative decarboxylation. IDH1 is located in the cytosol and peroxisomes, whereas IDH2 and IDH3 are located in the mitochondria. As components of the citrate acid cycle (CAC), IDH1 and IDH2 use NADP + as a coenzyme, whereas IDH3 uses NAD + as a coenzyme. The generated NADPH and NADH are reducing equivalents necessary for diverse metabolic and physiological processes.

Recurrent IDH1 mutations affecting codon R132 were initially identified in glioblastoma multiforme (GBM), where the mutation showed a significant prevalence in secondary GBM¹. Subsequently, IDH2 mutations affecting codon R172, which is homologous to IDH1 R132, were detected in WHO grade II and III astrocytomas and oligodendrogliomas as well as in secondary GBM². Virtually all IDH1 mutations in gliomas affect codon R132, which in the vast majority (> 85%) is a heterozygous missense mutation of arginine to histidine (R132H)³. Other less frequent IDH1 R132 mutations leading to different amino acid replacements, including R132C, R132G, R132G, and R132L, have been described in a number of solid and hematopoietic neoplasms and related pathogenic processes^4,5. In anaplastic thyroid carcinoma, IDH1 mutations are relatively common and affect the highly conserved residue G123^6,7. In acute myeloid leukemia (AML), IDH2 mutations are more prevalent than IDH1 mutations and usually affect codon R140. No oncogenic IDH3 mutations have been reported so far.

The oncogenic capacity of IDH1/2 mutations is conferred by a catalytically active dimer, most likely consisting of an IDH-mutant (IDH^mut) and an IDH-wild-type (IDH^wt) heterodimer, which reduces α-KG to D-2-hydroxyglutarate (D-2HG)^8,9. D-2HG is an oncometabolite that induces diverse metabolic and cellular effects, e.g., affecting CAC, inhibiting α-KG-dependent enzymes, such as histone and DNA demethylases, and blocking transcriptionally regulated cellular differentiation^4,10,11,12. In particular, a DNA methylation profile is induced that varies between different IDH^mut tumor types. For example, gliomas exhibit a DNA methylation profile, referred to as a glioma cytosine-phosphate-guanine (CpG) island methylator phenotype (G-CIMP), which differs from those in AML, cholangiocarcinoma, and melanoma^13,14,15. The discrepancies observed in the DNA methylation profiles of the investigated tumor types are also observed in the transcriptional profiles impeding the ability to assess common effects of IDH mutations on the transcriptome. Research on epigenetic and transcriptional effects of IDH mutations in cancer is ongoing, e.g., a recent study reported that transcriptional alterations in IDH1^mut glioma samples are primarily caused by chromatin-based DNA methylation-independent mechanisms¹⁶.

IDH mutations represent a valuable target for cancer treatment because they are commonly associated with early oncogenesis and are retained through later cancer stages. However, the success of therapy strategies varies between different IDH^mut tumor types, and alternative treatment options, such as the application of glutaminase inhibitors, are assessed^17,18,19. Isogenic disease models have become a valuable method in cancer research and drug discovery for studying the effects of a particular gene mutation in comparison to otherwise genetically identical cells²⁰. In particular, isogenic disease models have been repeatedly used to determine the transcriptional effects of IDH mutations under nearly unbiased conditions. We therefore performed a meta-analysis on datasets that compared expression profiles between IDH^mut and IDH^wt isogenic disease models with the aim of identifying biomarkers that have prospects for research or clinical applications.

Results

Compilation of datasets

The meta-analysis included six studies that were extracted from a database search and that compared the expression profiles between IDH1/2^mut and IDH1/2^wt conditions in isogenic disease models (Table 1). Four studies utilized microarrays, one used BeadChips, and one used RNA-sequencing (RNA-seq) to generate sets of differentially expressed genes (DEGs). In two studies, expression experiments were performed using human cell lines. In four studies, microarray expression experiments were performed using mouse cells/cell lines, mouse tissues or tumors. DEGs were determined based on a false discovery rate (FDR)-adjusted p-value ≤ 0.05 and a fold change (FC) ≥ 1.5.

Table 1 Expression studies on isogenic disease models included in the meta-analysis.

Full size table

Genes either up- or downregulated in the meta-analysis dataset

The number of DEGs in the individual datasets and the proportion of DEGs that were either up- or downregulated between at least two datasets varied considerably between the studies (Table 1, Fig. 1a). For example, in relation to the number of DEGs in individual datasets, mouse glioma cells shared proportionally fewer genes while mouse neural stem cells shared proportionally more genes with the common DEG set. The shared dataset comprised 111 DEGs, of which 49% were upregulated and 51% were downregulated (Table 2). Serine protease 23 (PRSS23) was upregulated in four datasets, i.e., human colon carcinoma cells, mouse heart tissue, mouse neural stem cells, and human glioma cells. Carbonic anhydrase 2 (CA2), and prolyl 3-hydroxylase 2 (P3H2) were upregulated in three datasets, whereas SOX2 overlapping transcript (SOX2-OT) was downregulated in three datasets.

Table 2 Meta-analysis DEG sets compiled from individual datasets of isogenic disease models.

Full size table

Ontology and pathway analysis of genes either up- or downregulated in the meta-analysis dataset

The most significantly overrepresented gene ontology (GO) annotations in the DEG set included diverse morphogenic and developmental processes, extracellular matrix and organelle components, and molecular activities in the categories of biological process, cellular component, and molecular function, respectively (Fig. 2a). The most significantly overrepresented protein class (p = 2.04 × 10^–3) was related to intercellular signal molecules (Fig. 2a) and included BDNF, BMP4, RCAN1, SEMA7A, STC1, TGFB2, and WNT7A, all of which were comparably upregulated under IDH1/2^mut conditions. The most significantly overrepresented pathways included extracellular matrix organization, collagen biosynthesis and modifying enzymes, and collagen formation (Table 3). The most significantly associated networks were related to various diseases, conditions, and cellular functions (Table 3). The top three networks were assembled with molecular relationship factors and displayed as a merged network (Fig. 3). Further interpretation of the DEG set was performed with the upstream regulator analysis tool (Supplementary Fig. 1). Activated upstream regulators that were predicted to be most significantly associated with the DEG set comprised chorionic gonadotropin (CG) complex, cytokine WNT3A, transmembrane receptor IL10RA, and transcription factor TP53. The transporter APOE and cytokine IFNG were predicted to be the most significantly inhibited upstream regulators.

Table 3 Top pathways and networks compiled from the meta-analysis DEG sets.

Full size table

Genes both up- and downregulated in the meta-analysis dataset

An additionally shared DEG set comprised 98 genes that were both up- and downregulated in two or three individual datasets (Fig. 1b; Table 2). Genes both up- and downregulated in three datasets included armadillo like helical domain containing 4 (ARMH4), cellular communication network factor 2 (CCN2), erythrocyte membrane protein band 4.1 like 4B (EPB41L4B), fibulin 1 (FBLN1), fibronectin 1 (FN1), G protein-coupled receptor class C group 5 member B (GPRC5B), serine protease 35 (PRSS35), serpin family E member 1 (SERPINE1), solute carrier family 16 member 2 (SLC16A2), secreted phosphoprotein 1 (SPP1), and synaptopodin (SYNPO).

Ontology and pathway analysis of genes both up- and downregulated in the meta-analysis dataset

The most significantly overrepresented GO annotations in the DEG set included diverse morphogenic and developmental processes, extracellular matrix components, and various binding properties in the categories of biological process, cellular component, and molecular function, respectively (Fig. 2b). The most significantly overrepresented protein classes included oxidases (p = 3.36 × 10^–4), comprising PRODH, LOX, QSOX1, and QSOX2, and extracellular matrix structural proteins (p = 5.36 × 10^–4) comprising COL4A1, COL4A2, COL8A1, and FBN1 (Fig. 2b). The most significantly overrepresented pathways included integrin cell surface interactions, extracellular matrix organization, and post-translational protein phosphorylation (Table. 3). The most significantly associated networks were related to various developmental processes, diseases, conditions, and cellular functions (Table 3). The top three networks were assembled with molecular relationship factors and displayed as a merged network (Fig. 4).

Discussion

In this meta-analysis, we compared the expression profiles of different IDH^mut vs. IDH^wt isogenic disease models to provide an overview of the nearly unbiased expression effects and the corresponding biological interpretations caused by the oncometabolite 2-HG. Although the statistical power of the IDH^mut vs. IDH^wt isogenic cell model datasets is generally lower than that of larger datasets generated in clinical tumor cases, the number of DEGs in proportion to the sample size is seemingly higher in isogenic cell models²¹. One likely explanation for this fact is that individual expression profiles vary considerably within IDH^mut tumors, similar to as in other tumors, limiting the capacity to generate common expression profiles. However, in our meta-analysis, only a relatively low number of DEGs were shared between individual datasets, which can be attributed to the fact that different cancer and non-cancer isogenic disease models and experimental conditions were used as briefly outlined as follows: Using colon carcinoma cells, in which IDH1/2 mutations were inserted via a recombinant adeno-associated virus vector methodology, an epithelial-mesenchymal transition (EMT)-like phenotype and changes in gene expression and cell morphology were observed²². In transgenic mouse models with conditional IDH2^mut coding sequences, activation of IDH2^mut expression at five weeks of age produced D-2HG leading to cardiomyopathy and neurodegeneration²³. In hepatoblasts, isolated from mouse embryos at E14, a doxycycline-inducible system led to IDH1/2^mut gene expression²⁴. The IDH1/2^mut hepatoblasts, which were cultured on collagen-coated plates, were refractory to differentiation. In neural stem cells derived from the cortex of mouse embryos at E14.5, Idh1^mut expression was induced via adenoviral-Cre-recombinase transduction²⁵. In these cells, neuronal lineage differentiation was blocked, although differentiation-promoting culture conditions were utilized. Employing a mouse model that is susceptible to the development of gliomas, p53-deficient cells with vector-integrated IDH1^mut genes and cells containing a PDGF expression vector were coinjected into mice. The induced PDGF-driven gliomas showed reduced immune infiltration in comparison to the corresponding IDH1^wt glioma mouse model²¹. In an in vitro study, glioma cells were infected with lentivirus IDH1^mut coding sequences²⁶. Doxycycline-induced IDH1^mut gene expression resulted in enhanced cell motility and morphological changes. The heterogeneity between the six isogenic disease models is exemplarily demonstrated by the diverse classification of the top pathways that were derived from the DEGs of each of the disease models (Supplementary Fig. 2).

The serine protease PRSS23 exhibits low tissue specificity in humans with the highest expression levels in female genital tract tissue and smooth muscle²⁷. Studies in mice reported that PRSS23 is variably expressed in the preimplantation uterus and is possibly involved in tissue remodeling in the ovary^28,29. The expression of PRSS23 has been detected in nuclei and extracellular vesicular exosomes where the protease is a component of the human secretome³⁰. Exosomal PRSS23 is, e.g., involved in cardiovascular disease where the protease likely mediates Snail/alpha‐smooth muscle actin signalling³¹. In cancer, PRSS23 is implicated in tumor progression, and it was identified in a systematic network survey of a meta-analysis of breast cancer microarray expression data as one of six genes involved in acquired lapatinib resistance³². Promoter studies in breast cancer cells indicated that PRSS23 is upregulated by estrogen receptor 1 (ESR1) and that its upregulated expression contributes to cell proliferation³³. shRNA-mediated knockdown of PRSS23 in a gastric cancer xenograft mouse model resulted in a decrease in tumor volume and tumor weight³⁴. Further in vitro experiments revealed that PRSS23 knockdown in gastric cancer cells apparently affected EIF2 pathway molecules. Based on a microarray study, PRSS23 was included in a gene classifier set that could discriminate papillary thyroid carcinoma from normal thyroid samples³⁵. In head and neck, renal, and pancreatic cancer, PRSS23 expression is significantly associated with an unfavorable prognosis³⁰. An epigenome-wide association study found, among several other DNA methylation sites, a significant association between changes of DNA methylation of DNA methylation sites at the PRSS23 gene and having a smoking habit but found no significant association with risk for lung cancer³⁶. The BioGRID database currently curates about 50 PRSS23 interactors, among which actin and actin-related proteins constitute the most overrepresented PANTHER protein class (p-value = 3 × 10^–3) (Supplementary Fig. 3).

Cytosolic CA2 is the physiologically predominant CA isoform and is known to interact with various acid/base transporters³⁷. These interactions are predicted to promote high glycolytic activity and cell proliferation in tumors. In lung cancer xenograft mouse models, shRNA-mediated knockdown of CA2 impaired tumor cell proliferation and angiogenesis and induced apoptosis³⁸. Pharmacological studies exploring CA2 inhibitors are pursued to develop therapeutic options for the treatment of various conditions including cancer³⁹. P3H family members consist of three isoenzymes in vertebrates. From a knockout study on P3H2 in a mouse embryonal carcinoma cell line, it can be presumed that the enzyme is the major posttranslational modifier of type IV collagen with 3-hydroxyproline, which is of significance for interactions of type IV collagen with other molecules⁴⁰. High P3H2 expression in different parts of the CNS, gastrointestinal tract, and some other tissues has been reported; however, the enzyme exhibits no prognostic significance in cancer and reveals only weak-to-moderate staining in most cancer tissues³⁰. The long non-coding RNA (lncRNA) SOX2-OT consists of several splice variants. SOX2, located in an intron of SOX2-OT, is transcribed in the same orientation as SOX2-OT and both are intensely expressed in embryonic stem cells⁴¹. SOX2-OT is implicated in neuronal and tumor development and progression. A meta-analysis of cancer datasets indicated that cancers with elevated SOX2-OT expression are significantly associated with unfavorable prognostic factors⁴². In two cervical cancer cell lines, a SOX2-OT transcript variant promoted cell growth, migration and invasion of the cells, indicating that the lncRNA may constitute a practical biomarker for cervical cancer⁴³. However, lower expression of SOX2-OT was observed in gastric tumors compared to matched normal gastric samples, and lower expression was observed in high-grade rather than low-grade gastric tumors⁴⁴.

Furthermore, we assessed the similarity of expression profiles between the either up- or downregulated gene set from our meta-analysis with expression profiles of two publicly accessible datasets of low grade gliomas and chondrosarcomas, enabling us to compare IDH^mut with IDH^wt cancer samples^45,46,47. The Venn diagram demonstrates that only a few DEGs are shared between our meta-analysis dataset and both clinical datasets (Supplementary Fig. 4). One likely explanation for this fact is that primary expression effects of an IDH mutation that emerge over days or weeks are measured in isogenic disease models, whereas clinical IDH^mut tumors evolve over months or years and acquire multiple other genomic alterations before they become clinically evident.

In summary, we generated a set of DEGs and biomarkers associated with IDH^mut status in isogenic disease models. Extracellular proteins and intercellular signaling are among the notable features of IDH^mut conditions. Biomarkers associated with various IDH^mut conditions, including the less characterized protease PRSS23, have considerable prospects for further research or clinical applications of IDH^mut cancers and related diseases.

Methods

Compilation of datasets from IDH1/2^mut vs. IDH1/2^wt isogenic disease models

Using the search term IDH to query the Gene Expression Omnibus (GEO), we designated 114 case series, out of which we identified seven whole-genome gene expression datasets derived from human and mouse isogenic disease models that compared IDH1/2^mut with IDH1/2^wt samples⁴⁸. One dataset without publication reference with detailed information was deselected. We then selected the remaining six studies for further analysis. These studies contained at least biologically IDH1/2^mut triplicates and biologically IDH1/2^wt duplicates and the datasets of each of the studies were sufficiently significant to compile a DEG set based on an FDR-adjusted p-value ≤ 0.05 and an FC ≥ 1.5. In studies that employed an isogenic disease model with different IDH1/2 mutations, the raw datasets of the different IDH1/2 mutations were pooled and processed as a single IDH1/2 mutation dataset. The generated meta-analysis dataset includes GEO submissions GSE41802²², GSE54838²³, GSE57002²⁴, GSE88828²⁵, GSE96979²¹, and GSE147223²⁶. Using the same above-quoted search strategy, no additional datasets were identified in another publicly accessible repository for high-throughput functional genomics experiments⁴⁹. The database repositories were essentially interrogated in November 2020.

Generation of DEG sets

For four microarray GEO datasets, the binary CEL files comprising the intensity calculations were imported into Transcriptome Analysis Console (TAC) version 4.0.2.15 (Thermo Fisher Scientific, Waltham, MA). TAC includes the LIMMA (linear models for microarray data) statistical package from Bioconductor⁵⁰. The binary CEL files were normalized in TAC and files of differentially expressed probe sets were compiled using eBayes correction in ANOVA. For the study utilizing the expression BeadChips, the normalized dataset was analyzed using the NetworkAnalyst 3.0 platform, which employs LIMMA statistics to generate differentially expressed probe sets⁵¹. For genes with more than one probe set in a dataset, the probe set with the highest FC was selected for further analysis; however, genes, with both significantly up- and downregulated probe sets in the same dataset, were excluded from further analysis. For the RNA-seq dataset, the publicly accessible Sequence Read Archive (SRA) datasets were downloaded from the NCBI resource⁵². We aligned the RNA-seq reads to the human reference genome assembly GRCh37 (hg19), using STAR aligner⁵³. Then, the R package DESeq2 was used to normalize count data, remove outliers, determine filtering thresholds, and find genes that were significantly differentially expressed between the IDH1^mut and IDH1^wt groups⁵⁴. Computation of the RNA-seq dataset was supported by the University High Performance Computing (Aziz Supercomputer) Center (http://hpc.kau.edu.sa). Mouse Genome Informatics (MGI), Ensembl release 101, BioMart software, and HUGO Gene Nomenclature Committee (HGNC) resources were employed to update gene IDs and/or convert mouse gene IDs to human gene IDs^55,56,57,58. To illustrate intersecting and non-intersecting genes between the either up- or downregulated gene set of our meta-analysis and external datasets, a web-based Venn diagram tool was employed (http://bioinformatics.psb.ugent.be/webtools/Venn/).

Ontology and pathway analysis

For further analysis of DEGs, which were based on an FDR-adjusted p-value ≤ 0.05 and an FC ≥ 1.5, the statistical overrepresentation test of the GO program PANTHER v. 16.0 was employed to interrogate annotation datasets in the categories of biological process, cellular component, molecular function, protein classes, and Reactome pathways⁵⁹. The PANTHER protein class ontology comprises commonly used classes of protein functions. The Reactome pathway analysis specifies the biological relationships between interacting molecules such as nucleic acids, proteins, and compounds. For all annotation datasets, a Fisher’s exact test p-value < 0.05 indicated statistical significance. The BioGRID build 4.1 database was queried for protein interactors⁶⁰. BioGRID curates protein, genetic and chemical interactions from various biomedical studies and datasets. The Ingenuity Pathway Analysis (IPA) software v. 68,752,261 (Qiagen, Hilden, Germany) was employed for further multifactorial interpretation of the gene sets. IPA utilizes the curated Ingenuity knowledge base as a reference dataset to interfere molecular relationships. Fisher’s exact test p-values indicated the significance of associations between analyzed dataset molecules and functional frameworks prebuilt or generated de novo by IPA. The molecule activity predictor was applied to predict expression effects/coherence of the expression effects of a molecule on other network molecules. Direct molecular relationships were used to survey the significance of fit, indicated as a score value, between molecules of uploaded gene sets and networks associated with specific functions or diseases. Direct and indirect molecular relationships were used for upstream regulator network analysis to investigate how upstream regulators affect differences in target gene expression. A z-score value indicates the activation/inhibition state of an upstream regulator.

Data availability

The raw datasets analyzed in the study are available at the GEO repository.

Abbreviations

α-KG :: α-Ketoglutarate
AML:: Acute myeloid leukemia
ANOVA:: Analysis of variance
CAC:: Citrate acid cycle
CEL file:: Affymetrix probe results file
CpG :: Cytosine-phosphate-guanine
DEG:: Differentially expressed gene
EMT:: Epithelial-to-mesenchymal transition
FC:: Fold change
FDR:: False discovery rate
GBM:: Glioblastoma multiforme
G-CIMP:: Glioma CpG island methylator phenotype
GEO:: Gene Expression Omnibus
GO:: Gene ontology
HG:: Hydroxyglutarate
IDH:: Isocitrate dehydrogenase
IPA:: Ingenuity Pathway Analysis
LIMMA:: Linear models for microarray data
lncRNA:: Long non-coding RNA
mut:: Mutant
RNA-seq:: RNA-sequencing
SRA:: Sequence Read Archive
TAC:: Transcriptome Analysis Console
wt:: Wild-type

References

Parsons, D. W. et al. An integrated genomic analysis of human glioblastoma multiforme. Science (New York, N. Y.) 321, 1807–1812. https://doi.org/10.1126/science.1164382 (2008).
Article ADS CAS Google Scholar
Yan, H. et al. IDH1 and IDH2 mutations in gliomas. N. Engl. J. Med. 360, 765–773. https://doi.org/10.1056/NEJMoa0808710 (2009).
Article CAS PubMed PubMed Central Google Scholar
Watanabe, T., Nobusawa, S., Kleihues, P. & Ohgaki, H. IDH1 mutations are early events in the development of astrocytomas and oligodendrogliomas. Am. J. Pathol. 174, 1149–1153. https://doi.org/10.2353/ajpath.2009.080958 (2009).
Article CAS PubMed PubMed Central Google Scholar
Dang, L., Yen, K. & Attar, E. C. IDH mutations in cancer and progress toward development of targeted therapeutics. Ann. Oncol. Off. J. Eur. Soc. Med. Oncol. 27, 599–608. https://doi.org/10.1093/annonc/mdw013 (2016).
Article CAS Google Scholar
Hartman, D. J. et al. Isocitrate dehydrogenase-1 is mutated in inflammatory bowel disease-associated intestinal adenocarcinoma with low-grade tubuloglandular histology but not in sporadic intestinal adenocarcinoma. Am. J. Surg. Pathol. 38, 1147–1156. https://doi.org/10.1097/pas.0000000000000239 (2014).
Article PubMed Google Scholar
Murugan, A. K., Bojdani, E. & Xing, M. Identification and functional characterization of isocitrate dehydrogenase 1 (IDH1) mutations in thyroid cancer. Biochem. Biophys. Res. Commun. 393, 555–559. https://doi.org/10.1016/j.bbrc.2010.02.095 (2010).
Article CAS PubMed PubMed Central Google Scholar
Xing, M. Molecular pathogenesis and mechanisms of thyroid cancer. Nat. Rev. Cancer 13, 184–199. https://doi.org/10.1038/nrc3431 (2013).
Article CAS PubMed PubMed Central Google Scholar
Golub, D. et al. Mutant isocitrate dehydrogenase inhibitors as targeted cancer therapeutics. Front. Oncol. 9, 417. https://doi.org/10.3389/fonc.2019.00417 (2019).
Article PubMed PubMed Central Google Scholar
Han, S. et al. IDH mutation in glioma: Molecular mechanisms and potential therapeutic targets. Br. J. Cancer 122, 1580–1589. https://doi.org/10.1038/s41416-020-0814-x (2020).
Article PubMed PubMed Central Google Scholar
Rakheja, D., Medeiros, L. J., Bevan, S. & Chen, W. The emerging role of d-2-hydroxyglutarate as an oncometabolite in hematolymphoid and central nervous system neoplasms. Front. Oncol. 3, 169. https://doi.org/10.3389/fonc.2013.00169 (2013).
Article PubMed PubMed Central Google Scholar
Schvartzman, J. M., Reuter, V. P., Koche, R. P. & Thompson, C. B. 2-hydroxyglutarate inhibits MyoD-mediated differentiation by preventing H3K9 demethylation. Proc. Natl. Acad. Sci. U.S.A. 116, 12851–12856. https://doi.org/10.1073/pnas.1817662116 (2019).
Article CAS PubMed PubMed Central Google Scholar
Reitman, Z. J. et al. Profiling the effects of isocitrate dehydrogenase 1 and 2 mutations on the cellular metabolome. Proc. Natl. Acad. Sci. U.S.A. 108, 3270–3275. https://doi.org/10.1073/pnas.1019393108 (2011).
Article ADS PubMed PubMed Central Google Scholar
Unruh, D. et al. Methylation and transcription patterns are distinct in IDH mutant gliomas compared to other IDH mutant cancers. Sci. Rep. 9, 8946. https://doi.org/10.1038/s41598-019-45346-1 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Turcan, S. et al. IDH1 mutation is sufficient to establish the glioma hypermethylator phenotype. Nature 483, 479–483. https://doi.org/10.1038/nature10866 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Malta, T. M. et al. Glioma CpG island methylator phenotype (G-CIMP): Biological and clinical implications. Neuro Oncol. 20, 608–620. https://doi.org/10.1093/neuonc/nox183 (2018).
Article CAS PubMed Google Scholar
Court, F. et al. Transcriptional alterations in glioma result primarily from DNA methylation-independent mechanisms. Genome Res. 29, 1605–1621. https://doi.org/10.1101/gr.249219.119 (2019).
Article CAS PubMed PubMed Central Google Scholar
Waitkus, M. S., Diplas, B. H. & Yan, H. Biological role and therapeutic potential of IDH mutations in cancer. Cancer Cell 34, 186–195. https://doi.org/10.1016/j.ccell.2018.04.011 (2018).
Article CAS PubMed PubMed Central Google Scholar
Seltzer, M. J. et al. Inhibition of glutaminase preferentially slows growth of glioma cells with mutant IDH1. Can. Res. 70, 8981–8987. https://doi.org/10.1158/0008-5472.Can-10-1666 (2010).
Article CAS Google Scholar
Kaminska, B., Czapski, B., Guzik, R., Król, S. K. & Gielniewski, B. Consequences of IDH1/2 mutations in gliomas and an assessment of inhibitors targeting mutated IDH proteins. Molecules (Basel, Switzerland) 24, 968. https://doi.org/10.3390/molecules24050968 (2019).
Article CAS Google Scholar
Torrance, C. J., Agrawal, V., Vogelstein, B. & Kinzler, K. W. Use of isogenic human cancer cells for high-throughput screening and drug discovery. Nat. Biotechnol. 19, 940–945. https://doi.org/10.1038/nbt1001-940 (2001).
Article CAS PubMed Google Scholar
Amankulor, N. M. et al. Mutant IDH1 regulates the tumor-associated immune system in gliomas. Genes Dev. 31, 774–786. https://doi.org/10.1101/gad.294991.116 (2017).
Article CAS PubMed PubMed Central Google Scholar
Grassian, A. R. et al. Isocitrate dehydrogenase (IDH) mutations promote a reversible ZEB1/microRNA (miR)-200-dependent epithelial-mesenchymal transition (EMT). J. Biol. Chem. 287, 42180–42194. https://doi.org/10.1074/jbc.M112.417832 (2012).
Article CAS PubMed PubMed Central Google Scholar
Akbay, E. A. et al. D-2-hydroxyglutarate produced by mutant IDH2 causes cardiomyopathy and neurodegeneration in mice. Genes Dev. 28, 479–490. https://doi.org/10.1101/gad.231233.113 (2014).
Article CAS PubMed PubMed Central Google Scholar
Saha, S. K. et al. Mutant IDH inhibits HNF-4α to block hepatocyte differentiation and promote biliary cancer. Nature 513, 110–114. https://doi.org/10.1038/nature13441 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Pirozzi, C. J. et al. Mutant IDH1 disrupts the mouse subventricular zone and alters brain tumor progression. Mol. Cancer Res. MCR 15, 507–520. https://doi.org/10.1158/1541-7786.Mcr-16-0485 (2017).
Article CAS PubMed Google Scholar
Liu, Y. et al. mTORC2/Rac1 pathway predisposes cancer aggressiveness in IDH1-mutated glioma. Cancers 12, 787 (2020).
Article CAS PubMed Central Google Scholar
Thul, P. J. et al. A subcellular map of the human proteome. Science (New York, N. Y.) https://doi.org/10.1126/science.aal3321 (2017).
Article Google Scholar
Diao, H., Xiao, S., Li, R., Zhao, F. & Ye, X. Distinct spatiotemporal expression of serine proteases Prss23 and Prss35 in periimplantation mouse uterus and dispensable function of Prss35 in fertility. PLoS ONE 8, e56757. https://doi.org/10.1371/journal.pone.0056757 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Wahlberg, P., Nylander, A., Ahlskog, N., Liu, K. & Ny, T. Expression and localization of the serine proteases high-temperature requirement factor A1, serine protease 23, and serine protease 35 in the mouse ovary. Endocrinology 149, 5070–5077. https://doi.org/10.1210/en.2007-1736 (2008).
Article CAS PubMed Google Scholar
Uhlen, M. et al. A pathology atlas of the human cancer transcriptome. Science 357, 2507. https://doi.org/10.1126/science.aan2507 (2017).
Article CAS Google Scholar
Wang, Z. et al. Exosomal microRNA-1246 from human umbilical cord mesenchymal stem cells potentiates myocardial angiogenesis in chronic heart failure. Cell Biol. Int. https://doi.org/10.1002/cbin.11664 (2021).
Article PubMed PubMed Central Google Scholar
Lee, Y. S. et al. Identification of novel therapeutic target genes in acquired lapatinib-resistant breast cancer by integrative meta-analysis. Tumour Biol. J. Int. Soc. Oncodevelop. Biol. Med. 37, 2285–2297. https://doi.org/10.1007/s13277-015-4033-7 (2016).
Article CAS Google Scholar
Chan, H. S. et al. Serine protease PRSS23 is upregulated by estrogen receptor α and associated with proliferation of breast cancer cells. PLoS ONE 7, e30397. https://doi.org/10.1371/journal.pone.0030397 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Han, B. et al. PRSS23 knockdown inhibits gastric tumorigenesis through EIF2 signaling. Pharmacol. Res. 142, 50–57. https://doi.org/10.1016/j.phrs.2019.02.008 (2019).
Article CAS PubMed Google Scholar
Jarzab, B. et al. Gene expression profile of papillary thyroid cancer: Sources of variability and diagnostic implications. Can. Res. 65, 1587–1597. https://doi.org/10.1158/0008-5472.Can-04-3078 (2005).
Article CAS Google Scholar
Sun, Y. Q. et al. Assessing the role of genome-wide DNA methylation between smoking and risk of lung cancer using repeated measurements: The HUNT study. Int. J. Epidemiol. https://doi.org/10.1093/ije/dyab044 (2021).
Article PubMed PubMed Central Google Scholar
Becker, H. M. & Deitmer, J. W. Transport metabolons and acid/base balance in tumor cells. Cancers 12, 899. https://doi.org/10.3390/cancers12040899 (2020).
Article CAS PubMed Central Google Scholar
Zhou, Y., Mokhtari, R. B., Pan, J., Cutz, E. & Yeger, H. Carbonic anhydrase II mediates malignant behavior of pulmonary neuroendocrine tumors. Am. J. Respir. Cell Mol. Biol. 52, 183–192. https://doi.org/10.1165/rcmb.2014-0054OC (2015).
Article PubMed Google Scholar
Akin, S. et al. Synthesis of 1,2,4-triazole-5-on derivatives and determination of carbonic anhydrase II isoenzyme inhibition effects. Bioorg. Chem. 83, 170–179. https://doi.org/10.1016/j.bioorg.2018.10.042 (2019).
Article CAS PubMed Google Scholar
Montgomery, N. T., Zientek, K. D., Pokidysheva, E. N. & Bächinger, H. P. Post-translational modification of type IV collagen with 3-hydroxyproline affects its interactions with glycoprotein VI and nidogens 1 and 2. J. Biol. Chem. 293, 5987–5999. https://doi.org/10.1074/jbc.RA117.000406 (2018).
Article CAS PubMed PubMed Central Google Scholar
Shahryari, A., Jazi, M. S., Samaei, N. M. & Mowla, S. J. Long non-coding RNA SOX2OT: Expression signature, splicing patterns, and emerging roles in pluripotency and tumorigenesis. Front. Genet. 6, 196. https://doi.org/10.3389/fgene.2015.00196 (2015).
Article CAS PubMed PubMed Central Google Scholar
Li, Y. et al. Clinicopathological implication of long non-coding RNAs SOX2 overlapping transcript and its potential target gene network in various cancers. Front. Genet. 10, 1375. https://doi.org/10.3389/fgene.2019.01375 (2019).
Article CAS PubMed Google Scholar
Chang, X., Zhang, H., Yang, Q. & Pang, L. LncRNA SOX2OT affects cervical cancer cell growth, migration and invasion by regulating SOX2. Cell Cycle (Georgetown, Tex.) 19, 1391–1403. https://doi.org/10.1080/15384101.2020.1750812 (2020).
Article CAS Google Scholar
Farhangian, P., Jahandoost, S., Mowla, S. J. & Khalili, M. Differential expression of long non-coding RNA SOX2OT in gastric adenocarcinoma. Cancer Biomarkers Sect. A Dis. Markers 23, 221–225. https://doi.org/10.3233/cbm-181325 (2018).
Article CAS Google Scholar
Nicolle, R. et al. Integrated molecular characterization of chondrosarcoma reveals critical determinants of disease progression. Nat. Commun. 10, 4622. https://doi.org/10.1038/s41467-019-12525-7 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Ceccarelli, M. et al. Molecular profiling reveals biologically discrete subsets and pathways of progression in diffuse glioma. Cell 164, 550–563. https://doi.org/10.1016/j.cell.2015.12.028 (2016).
Article CAS PubMed PubMed Central Google Scholar
Grossman, R. L. et al. Toward a shared vision for cancer genomic data. N. Engl. J. Med. 375, 1109–1112. https://doi.org/10.1056/NEJMp1607591 (2016).
Article PubMed PubMed Central Google Scholar
Barrett, T. et al. NCBI GEO: Archive for functional genomics data sets—update. Nucleic Acids Res. 41, D991–D995. https://doi.org/10.1093/nar/gks1193 (2012).
Article CAS PubMed PubMed Central Google Scholar
Athar, A. et al. ArrayExpress update—From bulk to single-cell expression data. Nucleic Acids Res. 47, D711–D715. https://doi.org/10.1093/nar/gky964 (2018).
Article CAS PubMed Central Google Scholar
Ritchie, M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43, e47. https://doi.org/10.1093/nar/gkv007 (2015).
Article CAS PubMed PubMed Central Google Scholar
Zhou, G. et al. NetworkAnalyst 3.0: A visual analytics platform for comprehensive gene expression profiling and meta-analysis. Nucleic Acids Res. 47, W234-w241. https://doi.org/10.1093/nar/gkz240 (2019).
Article CAS PubMed PubMed Central Google Scholar
Leinonen, R., Sugawara, H. & Shumway, M. The sequence read archive. Nucleic Acids Res. 39, D19-21. https://doi.org/10.1093/nar/gkq1019 (2011).
Article CAS PubMed Google Scholar
Dobin, A. et al. STAR: Ultrafast universal RNA-seq aligner. Bioinformatics (Oxford, England) 29, 15–21. https://doi.org/10.1093/bioinformatics/bts635 (2013).
Article CAS Google Scholar
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550. https://doi.org/10.1186/s13059-014-0550-8 (2014).
Article CAS PubMed PubMed Central Google Scholar
Yates, A. D. et al. Ensembl 2020. Nucleic Acids Res. 48, D682-d688. https://doi.org/10.1093/nar/gkz966 (2020).
Article CAS PubMed Google Scholar
Bult, C. J., Blake, J. A., Smith, C. L., Kadin, J. A. & Richardson, J. E. Mouse Genome Database (MGD) 2019. Nucleic Acids Res. 47, D801-d806. https://doi.org/10.1093/nar/gky1056 (2019).
Article CAS PubMed Google Scholar
Smedley, D. et al. The BioMart community portal: An innovative alternative to large, centralized data repositories. Nucleic Acids Res. 43, W589–W598. https://doi.org/10.1093/nar/gkv350 (2015).
Article CAS PubMed PubMed Central Google Scholar
Braschi, B. et al. Genenames.org: the HGNC and VGNC resources in 2019. Nucleic Acids Res. 47, 786–792. https://doi.org/10.1093/nar/gky930 (2019).
Article CAS Google Scholar
Mi, H. et al. PANTHER version 16: A revised family classification, tree-based classification tool, enhancer regions and extensive API. Nucleic Acids Res. 49, D394-d403. https://doi.org/10.1093/nar/gkaa1106 (2021).
Article CAS PubMed Google Scholar
Oughtred, R. et al. The BioGRID interaction database: 2019 update. Nucleic Acids Res. 47, D529-d541. https://doi.org/10.1093/nar/gky1079 (2019).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

H.J.S. (PI), K.B.G., A.J., and J.M. (Co-Is) were supported by the National Plan for Science, Technology and Innovation (MAARIFAH)—King Abdulaziz City for Science and Technology—The Kingdom of Saudi Arabia—award number: 13-BIO2289-03. The authors also acknowledge with thanks the Science and Technology Unit, King Abdulaziz University for technical support.

Author information

Authors and Affiliations

Center of Excellence in Genomic Medicine Research, Department of Medical Laboratory Technology, Faculty of Applied Medical Science, King Abdulaziz University, P.O. Box 80216, Jeddah, 21589, Saudi Arabia
Hans-Juergen Schulten, Fatima Al-Adwani, Heba Alkhatabi, Nofe Alganmi, Sajjad Karim & Mohammed H. Al-Qahtani
Department of Medical Laboratory Technology, Faculty of Applied Medical Science, King Abdulaziz University, Jeddah, Saudi Arabia
Haneen A. Bin Saddeq & Heba Alkhatabi
Department of Computer Science, King Abdulaziz University, Jeddah, Saudi Arabia
Nofe Alganmi
King Fahad Medical Research Center, Department of Medical Laboratory Technology, Faculty of Applied Medical Science, King Abdulaziz University, Jeddah, Saudi Arabia
Deema Hussein
Department of Otolaryngology, Head and Neck Surgery, Faculty of Medicine, King Abdulaziz University, Jeddah, Saudi Arabia
Khalid B. Al-Ghamdi
Department of Pathology, Faculty of Medicine, King Abdulaziz University, Jeddah, Saudi Arabia
Awatif Jamal & Jaudah Al-Maghrabi
Department of Pathology, King Faisal Specialist Hospital and Research Center, Jeddah, Saudi Arabia
Jaudah Al-Maghrabi

Authors

Hans-Juergen Schulten
View author publications
You can also search for this author in PubMed Google Scholar
Fatima Al-Adwani
View author publications
You can also search for this author in PubMed Google Scholar
Haneen A. Bin Saddeq
View author publications
You can also search for this author in PubMed Google Scholar
Heba Alkhatabi
View author publications
You can also search for this author in PubMed Google Scholar
Nofe Alganmi
View author publications
You can also search for this author in PubMed Google Scholar
Sajjad Karim
View author publications
You can also search for this author in PubMed Google Scholar
Deema Hussein
View author publications
You can also search for this author in PubMed Google Scholar
Khalid B. Al-Ghamdi
View author publications
You can also search for this author in PubMed Google Scholar
Awatif Jamal
View author publications
You can also search for this author in PubMed Google Scholar
Jaudah Al-Maghrabi
View author publications
You can also search for this author in PubMed Google Scholar
Mohammed H. Al-Qahtani
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.J.S, F.A., H.A.S., and H.K. conceived and designed the study. H.J.S., N.G., S.K., and D.H. analyzed the raw data and prepared the figures. H.J.S., K.B.G., A.J., and J.M. interpreted the data. H.J.S. and M.H.Q. drafted and revised the manuscript. All authors reviewed the manuscript.

Corresponding author

Correspondence to Hans-Juergen Schulten.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Figures.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Schulten, HJ., Al-Adwani, F., Saddeq, H.A.B. et al. Meta-analysis of whole-genome gene expression datasets assessing the effects of IDH1 and IDH2 mutations in isogenic disease models. Sci Rep 12, 57 (2022). https://doi.org/10.1038/s41598-021-04214-7

Download citation

Received: 22 December 2020
Accepted: 16 December 2021
Published: 07 January 2022
DOI: https://doi.org/10.1038/s41598-021-04214-7
Springer Nature Limited

Meta-analysis of whole-genome gene expression datasets assessing the effects of IDH1 and IDH2 mutations in isogenic disease models

Abstract

Similar content being viewed by others

A comprehensive analysis of SLC25A1 expression and its oncogenic role in pan-cancer

Multi-omics data integration reveals novel drug targets in hepatocellular carcinoma

An integrative pan-cancer investigation reveals common genetic and transcriptional alterations of AMPK pathway genes as important predictors of clinical outcomes across major cancer types

Introduction