Abstract
DNA methylation pattern in oral squamous cell carcinoma (OSCC) remains poorly described. This study aimed to perform a genome-wide integrated analysis of the transcriptome and methylome and assess the efficacy of their prognostic signature model in patients with OSCC. We analyzed transcriptome and methylome data from 391 OSCC samples and 41 adjacent normal samples. A total of 8074 differentially expressed genes (DEGs) and 10,084 differentially expressed CpGs (DMCpGs) were identified. Then 241 DEGs with DMCpGs were identified. According to the prognostic analysis, the prognostic signature of methylation-related differentially expressed genes (mrDEGPS) was established. mrDEGPS consisted of seven prognostic methylation-related genes, including ESRRG, CCNA1, SLC20A1, COL6A6, FCGBP, CDKN2A, and ZNF43. mrDEGPS was a significant stratification factor of survival (P < 0.00001) irrespective of the clinical stage. The immune effector components, including B cells, CD4+ T cells, and CD8+ T cells, were decreased in the tumor environment of patients with high mrDEGPS. Immune checkpoint expressions, including CTLA-4, PD-1, LAG3, LGALS9, HAVCR2, and TIGHT, were comprehensively elevated (P < 0.001). The estimated half-maximal inhibitory concentration difference between low- and high-risk patients was inconsistent among chemotherapeutic drugs. In conclusion, the transcriptome–methylome interaction pattern in OSCC is complex. mrDEGPS can predict patient survival and responses to immunotherapy and chemotherapy and facilitate clinical decision-making in patients with OSCC.
Similar content being viewed by others
Introduction
Oral squamous cell carcinoma (OSCC) is one of the most common malignancies in the head and neck region, which impairs the quality of life1,2. Over the past 30 years, the age-standardized incidence rate was 6.2 and 3.6 per 100,000 for males and females, respectively, and the age-standardized death rate was 3.3 and 1.6 per 100,000 for males and females, respectively3,4. However, the 5-year survival rate after surgery or chemoradiotherapy was only 64.4%, according to the 8th edition of the American Joint Committee on Cancer (AJCC), with age of the patient and stage of OSCC as independent prognostic factors5. For early stage OSCC, the survival rate did not significantly increase (69.7%), which indicated the difficulty of survival modification in patients with OSCC6. With the inclusion of the depth of invasion and extranodal extension, the 8th edition AJCC staging exhibited superior performance in stratifying the survival of patients with OSCC than that by the former edition7. However, stratifying the patient survival remained significantly challenging.
Recently, it has become evident that epigenetic mis-programming constitutes a core component of cancer initiation and progression8. Currently, DNA methylation remains the main epigenetic marker that can be measured reliably using genome-wide studies in large numbers of samples9. DNA methylation occurs almost exclusively in CpG dinucleotides. The CpG dinucleotides tend to cluster in regions called CpG islands (CGI), while most tissue-specific differentially methylated regions appear outside of CGIs10. A comprehensive study of the profiles of different healthy individuals and tissue types enables the estimation of variance of each CpG site in the methylome11. The integrated analysis of methylome and clinical data made prognostic classification feasible based on methylome analysis in colorectal cancer, hepatocellular carcinoma, and leukemia12,13,14.
Moreover, there may be a link between DNA methylation and the tumor microenvironment15. Infiltrating immune cells are important participants in the tumor microenvironment (TME); they are involved in proliferation, signal maintenance, cell death resistance, invasion, metastasis, and angiogenesis16. Tumor-associated macrophages support disease progression and resistance to therapy by providing malignant cells with trophic and nutritional support17. Altered signals from tumor cells produce a suppressive tumor microenvironment for enrichment of inhibitory cells18. DNA methylation may participate in the changes of infiltrating immune cells, reflecting a specific immune response to the cancer cell19. Immune TME (TIME) is associated with therapy responsiveness of the immune-checkpoint blockade20.
Previous studies have preliminarily depicted the roles of DNA methylation in gene expression, patient prognosis, and immune markers in head and neck cancers21,22,23. However, a comprehensive analysis of transcriptome–methylome interactions and clinical characteristics may be required to mine the potential of methylation orchestration in the management of OSCC. Therefore, this study aimed to perform a genome-wide integrated analysis of the transcriptome and methylome, depict the complex pattern between methylation and gene expression, identify methylation-related differentially expressed genes (mrDEGs), and assess the efficacy of their prognostic signature model in predicting patient survival, TIME alterations, and responses to immunotherapy and chemotherapy in patients with OSCC.
Results
Patterns of DMCpGs and differential gene expression (DEG)
A flowchart of the study is shown in Fig. 1. A total of 8074 DEGs and 10,084 DMCpGs were identified. Clustering based on the DEGs or DMCpGs revealed two distinctive sample clusters, indicating the possibility to distinguish between OSCC samples and adjacent tissue samples (Fig. 2A,B). Of the 8074 DEGs, 4327 were upregulated and 3747 were downregulated (Fig. 2C). Of the 10,084 DMCpGs, 5937 were hypermethylated DMCpGs (HyperCpGs) and 4147 were hypomethylated DMCpGs (HypoCpGs). Both HyperCpGs and HypoCpGs were correlated with the upregulation and downregulation of genes (Fig. 2D). These results indicate that the regulation of gene expression by methylation may be complex and multimodal.
Relationship between distribution pattern of methylation alterations and gene expression
DMCpGs across different genomic regions were not randomly distributed across the genome. Most DMCpGs are found in the gene body and intergenic regions (IGR). HyperCpGs had a higher proportion located 200 bp upstream of transcriptional start sites (TSS200), 5′-untranslated regions (UTR), and 1st exon, whereas HypoCpGs were located in the gene body and intergenic regions (Fig. 3A). The results indicated that there was a high density of methylated CpGs in several kb regions upstream and downstream of the transcriptional start sites (TSS). The distribution pattern around the CGI differed significantly between HyperCpGs and HypoCpGs. The distribution of HyperCpGs was significantly enriched within CGI (62.9%), whereas HypoCpGs were mostly enriched in the open sea regions (72.6%) (Fig. 3B), indicating that CpG methylation in CGI and non-CGI was potentially functional in gene expression. To further integrate the distribution pattern around TSS, CGI, and gene expression, we plotted the CGI and non-CGI methylation levels of every gene within four expression quartiles grouped by distance to the TSS (Figs. 3C and S1). Then, to check the potential bias owning to unpaired samples, we divided the samples into paired normal samples (pNormal), paired Tumor samples (pTumor) and unpaired single Tumor samples (sTumor) and found that the methylation pattern of pTumor was similar to that of sTumor instead of pNormal, showing the robustness of results (Fig. S2). HypoCpGs proximal to the TSS (approximately ± 1 kb) were observed in both OSCC and adjacent samples in the highly expressed CGI genes. CGI CpGs with low gene expression levels exhibited a higher number of HyperCpGs around TSSs in OSCC samples than in adjacent tissues, indicating that hypermethylation-induced silencing of tumor suppressor genes was more evident in transcriptionally silent genes with CGI. Moreover, HypoCpGs were observed in non-CGI CpGs with low gene expression and in those with high gene expression away from the TSS, suggesting a potential role of non-CGI HypoCpGs in the regulation of oncogenes and tumor-suppressor genes.
Correlation between methylation levels and gene expressions
We recognized that HyperCpGs and HypoCpGs might function differently in genes with CGI and non-CGI CpGs in OSCC samples. To further understand the correlations between CpG alterations and gene expression, we defined CpGs as positively-correlated CpGs (PosCpGs), wherein HyperCpGs and HypoCpGs were positively correlated with upregulation and downregulation of gene expression, respectively, and vice versa, as negatively-correlated CpGs (NegCpGs). We plotted the correlation between methylation and gene expression levels grouped by the genomic distribution of methylated CpGs. NegCpGs were highly concentrated around CGIs and within the promoter, while PosCpGs were highly distributed within the gene body, and the majority of CpGs were open sea (Fig. 4A). The correlation modes were gradient from CGI to the open sea. The results indicated that the correlation between methylation and gene expression levels was diverse but organized across the genomic distribution. However, not all correlations between methylation and gene expression levels were statistically significant. We further divided the correlation into significantly positive, significantly negative, and non-significant. Significant associations were observed more frequently in the DMCpGs than in the non-DMCpGs. CGI CpGs located in the promoter tended to have significantly negative associations with gene expression, whereas CpGs away from CGI and located in the gene body had significantly positive associations (Fig. 4B).
Screening of prognostic DEGs with significant methylation correlation
To further screen the CpGs significantly correlated with gene expression, we intersected DMCpGs with PosCpGs or NegCpGs and obtained 522 and 384 DMCpGs that exhibited significant positive and negative correlations with gene expression, respectively (Fig. 5A). We then intersected DEGs with genes containing selected PosCpGs or NegCpGs and found 121 DEGs with positively related DMCpGs, 96 DEGs with negatively related DMCpGs, and 24 DEGs with both positively and negatively related DMCpGs (Fig. 5B). The intersection results further indicate the complexity of methylation regulation. It was difficult to predict the final gene expression, particularly when multiple methylation sites were altered. Here, we defined DEGs with significantly correlated DMCpGs as methylation related DEGs (mrDEGs). To determine the prognostic value of the 241 mrDEGs in OSCC, a multivariate Cox regression analysis was performed with a cutoff of P < 0.01. We found that three mrDEGs (ESRRG, CCNA1, and SLC20A1) were associated with a poor prognosis and significantly increased hazard ratio (HR), whereas five mrDEGs (COL6A6, FCGBP, CDKN2A, MEI1, and ZNF43) served as protective genes with HR < 1 (Fig. 5C). However, MEI1 did not reach a significant level in the Least Absolute Shrinkage and Selection Operator (LASSO) and was thus abandoned.
Establishment and validation of the prognostic signature
We then built an mrDEGs predictive signature (mrDEGPS) using seven survival-relevant mrDEGs. The mrDEGPS for each patient was calculated using the following formula:
We divided the patients with OSCC into low- and high-risk groups (Fig. 5D). The expressions of ESRRG, CCNA1, and SLC20A1 were comparatively higher in the high-risk group than in the other groups (Fig. 5E). Prognosis comparison showed that low-risk patients had significantly higher overall survival (OS) (P = 4.587e−10), disease-specific survival (DSS) (P = 1.28e−07), and progression-free survival (PFS) (P = 9.588e−6) than that in high-risk patients (Fig. 5F). The validation analysis in a small sample size (n = 97) demonstrated that patients in the high-risk group had poorer overall survival (P = 3.098e−2) than those patients in the low-risk group (Fig. 5G). The area under the curve (AUC) of 1-year survival was 0.714 and 0.702 in the training and validation datasets, respectively. Compared with the clinical stage, mrDEGPS displayed superior predictive performance.
Association between mrDEGPS with clinical features and human papillomavirus (HPV)
Next, we explored the association between the mrDEGPS and clinical features (Fig. 5H). We found that the mrDEGPS did not differ between patients with different age groups, gender, alcohol consumption, and surgical margin status (P > 0.05). Notably, there was no difference in mrDEGPS in the different clinical stages and histopathological grades (P > 0.05), implying the absence of linear correlation between mrDEGPS and these traditional survival stratification factors; thus, mrDEGPS was a potentially independent prognostic factor. We also found that the mrDEGPS was significantly associated with lymphovascular and perineural invasion. As both lymphovascular and perineural invasion were independent prognostic factors24, this result was consistent with mrDEGPS as a prognostic factor. Moreover, the human papillomavirus (HPV) infection was associated with higher mrDEGPS (P < 0.001), indicating that HPV might partially promote epigenetic alterations. We further included patients with HPV status not available (NA) in the analysis and found no differences in mrDEGPS between HPV (NA), HPV ( +), or HPV (−) (Fig. 5I), which indicated that the mixed population neutralized the difference between HPV ( +) and HPV (−).
Association between mrDEGPS and TIME
Since the selected mrDEGs were relevant to the immune response, we hypothesized that mrDEGPS might have the capacity to identify alterations in TIME. The results showed that the high-risk group samples had reduced proportions of exhausted T cells, type 1 regulatory T cells, follicular T-helper cells, dendritic cells, B cells, CD4+ T cells, and CD8+ T cells, and increased proportions of naive CD4+ T cells, naïve CD8+ T cells, induced regulatory T cells, T-helper 2 cells, effector memory T cells, natural killer T cells, natural killer cells, and neutrophils. (Figs. 6A and S3). It has been reported that DCs, CD8+ T cells, and CD4+ T cells display a beneficial effect on survival, while neutrophils, NK cells, and Tem cells display harmful effects in breast cancer25. Our results implied that high-risk patients might have an altered survival–harmful TIME. To further characterize the potential signaling pathways involved in the influence of mrDEGPS, gene set enrichment analysis (GSEA) was performed to enrich the Kyoto Encyclopedia of Genes and Genomes26 pathways ranked by gene correlation values with mrDEGPS. Half of the enriched pathways were associated with immune processes like “primary immunodeficiency,” “allograft rejection,” “autoimmune thyroid disease,” “intestinal immune network for IgA production,” “T cell receptor signaling pathway,” “antigen processing and presentation,” “natural killer cell-mediated cytotoxicity,” and “B cell receptor signaling pathway” (adi. P < 0.01) (Fig. 6B).
Prediction of immunotherapy and chemotherapy by mrDEGPS
T-cell receptor signaling is the essential basis for immunotherapy and may participate in chemotherapy resistance27,28. Current immunotherapy is mainly achieved by antibody blocking of CTLA-4 or PD-1 pathway29. Studies indicate that LAG3, LGALS9, HAVCR2, and TIGHT could be the next-generation immunotherapy checkpoints30,31,32,33. We found that all these immunotherapy checkpoints were downregulated in high-risk patients (P < 0.001) (Fig. 6C). However, there was no significant difference in PD-L1, indicating that mrDEGPS was correlated with TIME rather than with tumor cells (Fig. S4). The mrDEGPS was higher in patients with a response (P = 0.00005) and the response rate was significantly lower in high-risk patients (P = 0.0024) (Fig. 6D). These results indicate that mrDEGPS low-risk patients might benefit from immunotherapy. For chemotherapy, the estimated half-maximal inhibitory concentration difference between low- and high-risk patients was inconsistent among drugs, which was not significant for gefitinib, lower in low-risk patients for rapamycin, and lower in high-risk patients for cisplatin, docetaxel, sorafenib, erlotinib, and gemcitabine (Figs. 6E and S5). We observed that the result might provide a reference for chemotherapeutic drug selection for individuals, and thus facilitated the survival of patients with OSCC. In summary, mrDEGPS screened from complex transcriptome–methylome interactions could facilitate the prediction of survival and immunotherapeutic efficacy in patients with OSCC (Fig. 7).
Discussion
Several studies have demonstrated a strong relationship between epigenetic and genetic aberrations in tumorigenesis34. It is commonly believed that epigenetic changes, such as DNA methylation, can drive abnormal gene expression of crucial genes involved in the development and progression of cancer, including head and neck cancer35. Hypermethylation of tumor suppressor genes and hypomethylation of proto-oncogenes at the promoter sites are associated with carcinogenesis and progression of OSCC36,37. For example, several studies have suggested that hypermethylation of PAX1 and ZNF582 genes is associated with aggressive progression and poor survival38,39,40.
Although the effect of promoter methylation changes has been studied extensively, increasing evidence from genome-wide methylome studies suggests that the methylation patterns are complex and cancer-type-specific41,42,43. The term “CpG island methylator phenotype” has been repeatedly used over decades to describe widespread CpG island promoter methylation44,45. However, only around 4–8% CGIs exhibit tissue-specific methylation, while approximately 70% of annotated gene promoters are associated with a specific CGI46. Therefore, there must be undisclosed cancer-type-specific methylation outside CGIs. Our results showed that CpGs away from islands, denoted as open sea CpGs, may facilitate gene expression of oncogenes in OSCC.
Despite CGI features, cumulative evidence indicates that the transcriptome–methylome interaction is not restricted to promoters and TSS47. In contrast to the repression of promoter methylation on expression, gene body methylation orchestrates transcription in a complex pattern, which is in contrast to the repression of promoter methylation. Approximately half of CGIs in mammalian genomes are not associated with a known gene promoter and are referred to as orphan CGIs48. Orphan CGIs that do not map to promoters of any protein-coding or non-coding transcripts but possess chromatin and transcriptional markers may reflect enhancer activity49. Orphan CGIs display most of the evolutionarily conserved methylation differences among tissues, indicating their possible role in tissue specification50. Our results indicated that CpGs located in the gene body might have a positive association with gene expression, suggesting that most tissue-specific methylation CGIs are not located at promoter regions in OSCC46. These findings may potentially facilitate research on aberrant methylation of these rarely investigated regions.
Another hypothesis regarding the non-promoter DMCpGs is that most genes have two or more TSSs; therefore downstream TSS are probably within the bodies of the transcriptional units of the alternative upstream promoters51. A large-scale full-length cDNA analysis to explore the budding yeast transcriptome showed that alternative promoters could be located at CGIs or non-CGIs, and combinations of an upstream non-CGI and a downstream CGI, or vice versa, may also occur. It is well known that alternative promoters contribute to context-specific isoform expression and the regulation of isoform diversity52. However, it is difficult to interpret the link between expression and methylation in genes with multiple TSSs. Probes that are used to measure expression detect the output of all alternative promoters; however, only a single isoform might be dominant in OSCC cells. Methylation of a downstream promoter blocks transcription from that promoter, which would allow the elongation of a transcript emanating from an upstream promoter, thereby leading to an apparent discordance between methylation and expression.
Currently, most methylome studies use the term “methylation-driven gene”53,54. However, we believe that the expression “driven” may be misleading. Considering the complex methylation patterns of alternative promoters, we conservatively referred to methylation-driven DEGs as mrDEGs. The mrDEGs were simultaneously hypermethylated and hypomethylated at different sites. There is a high possibility that CGI HyperCpGs function synergistically with open sea HypoCpGs. However, due to the limitation of the technique, it was challenging to deduce whether the gene expression difference was the counterbalance effect of antagonistic methylation alterations. Therefore, mrDEGs may be a suitable terminology for DEGs with differentiated methylation.
The mrDEGPS model was established using seven prognostic mrDEGs (ESRRG, CCNA1, SLC20A1, COL6A6, FCGBP, CDKN2A, and ZNF43). Methylation of ESRRG and CCNA has been investigated in head and neck cancers and is associated with poor prognosis55,56,57. The prognostic significance of CDKN2A and ZNF43 promoter hypermethylation has also been confirmed in colorectal cancers58,59. To the best of our knowledge, polymorphisms of COL6A6 and SLC201A have been preliminarily investigated in other diseases, and their methylation, possibly related to the polymorphism, is unknown60,61.
In our study, we found that mrDEGPS could serve as an independent prognostic predictor. MrDEGs, including FCGBP, COL6A6, CDKN2A, and CCNA1 correlated with immune cell response, T-cell phenotype modulation, and treatment response to doxorubicin62,63,64,65, whereas ESRRG expression was negatively correlated with CD4+ T cell activation. The role of mrDEGPS in TIME, immunotherapy, and chemotherapy responses was further investigated. Activated immune cell clusters, including CD8+ effector cells, exhibited lower infiltration in high-risk OSCC samples, indicating that the mechanism of prognostic function of mrDEGPS might be attributed to TIME alterations.
In addition, HPV may significantly affect the methylation levels of prognostic mrDEGs. Differences in methylation in HPV-driven cases were revealed in a previous study of the OSCC epigenetic landscape66. For example, DNA methylation of IDO1 in head and neck squamous cell carcinomas (HNSC) correlates with HPV status and survival67. Methylation markers for diagnosis of OSCC were independent of HPV infection68. It could be speculated that HPV correlated with partially methylated gene expression, whereas mrDEGs were fitted in the prognosis model rather than a diagnosis model. However, it is paradoxical that low-risk samples have a higher HPV infection rate since HPV is a well-known poor prognostic factor. Further studies are required to elucidate the role of HPV in DNA methylation.
However, this study had some limitations. Firstly, the validation set was limited. However, considering the cancer-specific property of epigenetic modulation, it might not be reasonable to validate the model in other datasets (for example, a dataset of HNSC included samples other than OSCC). Second, the prediction reliability of the score model was not verified in our clinical observations owing to limited resources. In addition, the cohorts lacked partial clinicopathological data, and ethnic differences existed among the groups. However, we collected the best available matched data. These results may be further confirmed by genomic big data. As mrDEGPS is a prognostic model irrespective of the clinical stage, an improved novel staging system could be explored by combining genomic and clinical risk factors. Third, this study selected adjacent normal tissue as control. The control may be affected by field cancerization, leading to phenotypes different from true healthy tissue. However, we could not change the TCGA and other dataset as authors of secondary analysis. Lastly, immunotherapy and chemotherapy were more frequently prescribed for patients with late-stage OSCC. Further validation should be performed using data from patients with stage III/IV disease. Big data has evolved as the ubiquitous watchword of medical innovation69. We believe that the prevalence of medical big data can mitigate these limitations.
In conclusion, the transcriptome–methylome interaction pattern in OSCC is complex. Moreover, mrDEGPS could predict patient survival and responses to immunotherapy and chemotherapy and facilitate clinical decision-making in patients with OSCC.
Materials and methods
Data acquisition and preprocessing
We downloaded DNA methylation (528 HNSC samples and 50 normal samples), RNA sequencing (500 HNSC samples and 44 normal samples), and clinical information data from The Cancer Genome Atlas (TCGA) HNSC cohort from UCSC-Xena (https://xenabrowser.net/) on August 28, 2019. Excluding non-OSCC samples and samples with unavailable methylome or transcriptome data, a TCGA-OSCC cohort of 391 OSCC samples and 41 adjacent normal samples was enrolled in this analysis. We downloaded GSE41613, comprising 96 OSCC samples, from the Gene Expression Omnibus (GEO) database (https://www.ncbi.nlm.nih.gov/geo/). All data were normalized in the R computing environment (version 4.1.0) using the DESeq2 or limma packages (https://www.r-project.org). The data were utilized according to the data access policies of TCGA and GEO. All the studies were performed in accordance with the Declaration of Helsinki.
Analysis of DNA methylation pattern in OSCC and its regulation on mRNA transcription
Gene expression in TCGA-OSCC cohort was correlated with the normalized beta value of HM450K probes using Spearman’s rank correlation followed by false discovery rate (FDR) correction using the FDR method. Methylation-gene relations with |correlation value|> 0.3 and FDR < 0.05 were considered significant, and kernel density estimation was used to plot the distribution of significant probes around the TSS ± 5 kb. The methylation level of each probe was measured by the beta value, which ranged from 0 to 1 (representing unmethylated to fully methylated levels, respectively). Similarly, we removed probes with missing beta values in ≥ 5% samples. The remaining probes with missing beta values were imputed using the k-nearest neighbors algorithm.
We mapped the probes to the promoter regions of the genes, which were defined as − 1.5 to 0 kb regions around the TSS. Next, the DNA methylation level of a gene was defined as the average beta value of the probes that mapped to its promoter region. Finally, samples with paired mRNA expression and DNA methylation profiles were analyzed, which involved 17,481 genes and 384 samples, to determine DMCpGs and DEGs. The Spearman rank correlation of CpG methylation levels with related gene expression was analyzed, and significant associations were based on the criteria of |correlation value|> 0.3 and FDR < 0.05.
Identification and validation of mrDEGPS
Kaplan–Meier analysis was used to evaluate the relationship between the genes and survival of patients with OSCC. The LASSO binary logistic regression model and multivariate Cox regression were adopted after primary filtration. The linear combination of the regression coefficient β derived from the multivariate Cox regression model and multiplied by the corresponding mRNA levels generated a prognostic signature. The mrDEGPS for each patient was calculated using the following risk score formula:
Patients were divided into high- and low-risk groups by setting the median risk score as the cut-off value. The OS, DSS, and PFS of the two groups were calculated using the Kaplan–Meier method with the log-rank test. Receiver operating characteristic (ROC) curves were generated to assess the predictive performance of the prognostic model. The expression patterns of genes in this prognostic model were visualized using the “pheatmap” package. In the validation analysis, we verified the Kaplan–Meier plot and ROC test in GEO using another OSCC cohort, GSE41613. The clinical features of the patients with low or high mrDEGPS were analyzed and compared.
Analysis of correlation between the survival risk model and tumor immune infiltration
ImmuCellAI (http://bioinfo.life.hust.edu.cn/ImmuCellAI#!/) was utilized to analyze the fraction of 24 types of immune cells in high- and low-risk samples with OSCC70. These immune cells included 18 subtypes of T cells, namely CD4+ T cells; CD8+ T cells; naive CD4+ T cells; naive CD8+ T cells; cytotoxic T cells; exhausted T cells; type 1 regulatory T cells; natural regulatory T cells; induced regulatory T cells; T-helper 1, 2, and 17 cells; follicular T-helper cells; central memory T cells; effector memory T cells; natural killer T cells; mucosal-associated invariant T cells; and gamma-delta T cells, as well as six other types of immune cells including B cells, natural killer cells, monocytes, macrophages, neutrophils, and dendritic cells. Immune Cell Abundance Identifier (ImmuCellAI) built an immune cell‐based support vector machine model for the prediction of immunotherapy response (AUC: 0.80–0.91), and the model was used to estimate the response results of OSCC.
GSEA was performed using the R package “clusterprofiler” to determine the enrichment of previously defined biological processes in the ranked correlated gene with risk score using RNA-seq data from TCGA-OSCC cohort. The raw count data of gene expression from the TCGA-OSCC cohort were normalized using the variance stabilizing transformation function in R package “DESeq2,” and then submitted to ImmuCellAI (http://bioinfo.life.hust.edu.cn/web/ImmuCellAI/) to estimate the abundance of immune cells, particularly the T-cell proportions, and to predict the response of immune-checkpoint inhibitor treatments.
Data availability
The datasets analysed during the current study are available in the The Cancer Genome Atlas (TCGA) Head and Neck Squamous Cell Carcinoma (HNSC) cohort from UCSC-Xena (https://xenabrowser.net/) dated August 28th, 2019, and a Gene Expression Omnibus dataset (GSE41613, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE41613).
References
Wang, S., Yin, S., Zhang, Z. L., Su, X. & Xu, Z. F. Quality of life after oral cancer resection and free flap reconstruction. J. Oral Maxillofac. Surg.: Off. J. Am. Assoc. Oral Maxillofac. Surg. 77, 1724–1732. https://doi.org/10.1016/j.joms.2019.02.029 (2019).
Gellrich, N. C., Handschel, J., Holtmann, H. & Krüskemper, G. Oral cancer malnutrition impacts weight and quality of life. Nutrients 7, 2145–2160. https://doi.org/10.3390/nu7042145 (2015).
Fitzmaurice, C. et al. Global, regional, and national cancer incidence, mortality, years of life lost, years lived with disability, and disability-adjusted life-years for 29 cancer groups, 1990–017: A systematic analysis for the global burden of disease study. JAMA Oncol. 5, 1749–1768. https://doi.org/10.1001/jamaoncol.2019.2996 (2019).
Ren, Z. H., Hu, C. Y., He, H. R., Li, Y. J. & Lyu, J. Global and regional burdens of oral cancer from 1990 to 2017: Results from the global burden of disease study. Cancer Commun. (Lond., Engl.) 40, 81–92. https://doi.org/10.1002/cac2.12009 (2020).
Zanoni, D. K. et al. Survival outcomes after treatment of cancer of the oral cavity (1985–2015). Oral Oncol. 90, 115–121. https://doi.org/10.1016/j.oraloncology.2019.02.001 (2019).
Luryi, A. L. et al. Treatment factors associated with survival in early-stage oral cavity cancer: Analysis of 6830 cases from the national cancer data base. JAMA Otolaryngol—Head Neck Surg. 141, 593–598. https://doi.org/10.1001/jamaoto.2015.0719 (2015).
Moeckelmann, N. et al. Prognostic implications of the 8th edition American joint committee on cancer (AJCC) staging system in oral cavity squamous cell carcinoma. Oral Oncol. 85, 82–86. https://doi.org/10.1016/j.oraloncology.2018.08.013 (2018).
Widschwendter, M. et al. Epigenome-based cancer risk prediction: Rationale, opportunities and challenges. Nat. Rev. Clin. Oncol. 15, 292–309. https://doi.org/10.1038/nrclinonc.2018.30 (2018).
Teschendorff, A. E. & Relton, C. L. Statistical and integrative system-level analysis of DNA methylation data. Nat. Rev. Genet. 19, 129–147. https://doi.org/10.1038/nrg.2017.86 (2018).
Portela, A. & Esteller, M. Epigenetic modifications and human disease. Nat. Biotechnol. 28, 1057–1068. https://doi.org/10.1038/nbt.1685 (2010).
Heyn, H. & Esteller, M. DNA methylation profiling in the clinic: Applications and challenges. Nat. Rev. Genet. 13, 679–692. https://doi.org/10.1038/nrg3270 (2012).
Hinoue, T. et al. Genome-scale analysis of aberrant DNA methylation in colorectal cancer. Genome Res. 22, 271–282. https://doi.org/10.1101/gr.117523.110 (2012).
Villanueva, A. et al. DNA methylation-based prognosis and epidrivers in hepatocellular carcinoma. Hepatology (Baltimore, Md.) 61, 1945–1956. https://doi.org/10.1002/hep.27732 (2015).
Jiang, H. et al. DNA methylation markers in the diagnosis and prognosis of common leukemias. Signal Transduct. Target. Ther. 5, 3. https://doi.org/10.1038/s41392-019-0090-5 (2020).
Klughammer, J. et al. The DNA methylation landscape of glioblastoma disease progression shows extensive heterogeneity in time and space. Nat. Med. 24, 1611–1624. https://doi.org/10.1038/s41591-018-0156-x (2018).
Hanahan, D. & Coussens, L. M. Accessories to the crime: Functions of cells recruited to the tumor microenvironment. Cancer Cell 21, 309–322. https://doi.org/10.1016/j.ccr.2012.02.022 (2012).
Vitale, I., Manic, G., Coussens, L. M., Kroemer, G. & Galluzzi, L. Macrophages and metabolism in the tumor microenvironment. Cell Metab. 30, 36–50. https://doi.org/10.1016/j.cmet.2019.06.001 (2019).
Anderson, K. G., Stromnes, I. M. & Greenberg, P. D. Obstacles posed by the tumor microenvironment to T cell activity: A case for synergistic therapies. Cancer Cell 31, 311–325. https://doi.org/10.1016/j.ccell.2017.02.008 (2017).
Langevin, S. M. et al. Peripheral blood DNA methylation profiles are indicative of head and neck squamous cell carcinoma: An epigenome-wide association study. Epigenetics 7, 291–299. https://doi.org/10.4161/epi.7.3.19134 (2012).
Binnewies, M. et al. Understanding the tumor immune microenvironment (TIME) for effective therapy. Nat. Med. 24, 541–550. https://doi.org/10.1038/s41591-018-0014-x (2018).
Basu, B. et al. Genome-wide DNA methylation profile identified a unique set of differentially methylated immune genes in oral squamous cell carcinoma patients in India. Clin. Epigenetics 9, 13. https://doi.org/10.1186/s13148-017-0314-x (2017).
Das, D. et al. Epigenomic dysregulation-mediated alterations of key biological pathways and tumor immune evasion are hallmarks of gingivo-buccal oral cancer. Clin. Epigenetics 11, 178. https://doi.org/10.1186/s13148-019-0782-2 (2019).
Zhu, Q., Tian, G. & Gao, J. Construction of prognostic risk prediction model of oral squamous cell carcinoma based on co-methylated genes. Int. J. Mol. Med. 44, 787–796. https://doi.org/10.3892/ijmm.2019.4243 (2019).
Chang, W. C. et al. A histopathological evaluation and potential prognostic implications of oral squamous cell carcinoma with adverse features. Oral Oncol. 95, 65–73. https://doi.org/10.1016/j.oraloncology.2019.06.012 (2019).
Cao, R., Yuan, L., Ma, B., Wang, G. & Tian, Y. Immune-related long non-coding RNA signature identified prognosis and immunotherapeutic efficiency in bladder cancer (BLCA). Cancer Cell Int. 20, 276. https://doi.org/10.1186/s12935-020-01362-0 (2020).
Kanehisa, M. & Goto, S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30. https://doi.org/10.1093/nar/28.1.27 (2000).
Wang, T. et al. T-cell receptor signaling activates an ITK/NF-κB/GATA-3 axis in T-cell lymphomas facilitating resistance to chemotherapy. Clin. Cancer Res.: Off. J. Am. Assoc. Cancer Res. 23, 2506–2515. https://doi.org/10.1158/1078-0432.Ccr-16-1996 (2017).
Fesnak, A. D., June, C. H. & Levine, B. L. Engineered T cells: The promise and challenges of cancer immunotherapy. Nat. Rev. Cancer 16, 566–581. https://doi.org/10.1038/nrc.2016.97 (2016).
Ribas, A. & Wolchok, J. D. Cancer immunotherapy using checkpoint blockade. Science (New York, N.Y.) 359, 1350–1355. https://doi.org/10.1126/science.aar4060 (2018).
Andrews, L. P., Marciscano, A. E., Drake, C. G. & Vignali, D. A. LAG3 (CD223) as a cancer immunotherapy target. Immunol. Rev. 276, 80–96. https://doi.org/10.1111/imr.12519 (2017).
Daley, D. et al. Dectin 1 activation on macrophages by galectin 9 promotes pancreatic carcinoma and peritumoral immune tolerance. Nat. Med. 23, 556–567. https://doi.org/10.1038/nm.4314 (2017).
Das, M., Zhu, C. & Kuchroo, V. K. Tim-3 and its role in regulating anti-tumor immunity. Immunol. Rev. 276, 97–111. https://doi.org/10.1111/imr.12520 (2017).
Chauvin, J. M. & Zarour, H. M. TIGIT in cancer immunotherapy. J. Immunother. Cancer https://doi.org/10.1136/jitc-2020-000957 (2020).
You, J. S. & Jones, P. A. Cancer genetics and epigenetics: Two sides of the same coin?. Cancer Cell 22, 9–20. https://doi.org/10.1016/j.ccr.2012.06.008 (2012).
Worsham, M. J. et al. Delineating an epigenetic continuum in head and neck cancer. Cancer Lett. 342, 178–184. https://doi.org/10.1016/j.canlet.2012.02.018 (2014).
Milutin Gašperov, N. et al. DNA methylome distinguishes head and neck cancer from potentially malignant oral lesions and healthy oral mucosa. Int. J. Mol. Sci. https://doi.org/10.3390/ijms21186853 (2020).
Ghantous, Y., Nashef, A. & Abu-Elnaaj, I. Epigenetic alterations associated with the overall survival and recurrence free survival among oral squamous cell carcinoma patients. J. Clin. Med. https://doi.org/10.3390/jcm9041035 (2020).
Sun, R. et al. Evaluation of DNA methylation in matched oral swab and tissue specimens from Chinese patients with oral squamous cell carcinoma. Int. J. Oral Maxillofac. Surg. https://doi.org/10.1016/j.ijom.2020.05.022 (2020).
Cheng, S. J. et al. Hypermethylated ZNF582 and PAX1 genes in oral scrapings collected from cancer-adjacent normal oral mucosal sites are associated with aggressive progression and poor prognosis of oral cancer. Oral Oncol. 75, 169–177. https://doi.org/10.1016/j.oraloncology.2017.11.013 (2017).
Cheng, S. J. et al. Hypermethylated ZNF582 and PAX1 genes in mouth rinse samples as biomarkers for oral dysplasia and oral cancer detection. Head Neck 40, 355–368. https://doi.org/10.1002/hed.24958 (2018).
Hoadley, K. A. et al. Cell-of-origin patterns dominate the molecular classification of 10,000 tumors from 33 types of cancer. Cell 173, 291-304.e296. https://doi.org/10.1016/j.cell.2018.03.022 (2018).
Gao, H. et al. Genome-wide DNA methylome analysis reveals methylation subtypes with different clinical outcomes for acute myeloid leukemia patients. Cancer Med. 9, 6296–6305. https://doi.org/10.1002/cam4.3291 (2020).
Yang, X., Gao, L. & Zhang, S. Comparative pan-cancer DNA methylation analysis reveals cancer common and specific patterns. Brief. Bioinform. 18, 761–773. https://doi.org/10.1093/bib/bbw063 (2017).
Toyota, M. et al. CpG island methylator phenotype in colorectal cancer. Proc. Natl. Acad. Sci. U.S.A. 96, 8681–8686. https://doi.org/10.1073/pnas.96.15.8681 (1999).
Hughes, L. A. et al. The CpG island methylator phenotype: what’s in a name?. Can. Res. 73, 5858–5868. https://doi.org/10.1158/0008-5472.Can-12-4306 (2013).
Hoivik, E. A. et al. DNA methylation of alternative promoters directs tissue specific expression of Epac2 isoforms. PLoS ONE 8, e67925. https://doi.org/10.1371/journal.pone.0067925 (2013).
Ando, M. et al. Chromatin dysregulation and DNA methylation at transcription start sites associated with transcriptional repression in cancers. Nat. Commun. 10, 2188. https://doi.org/10.1038/s41467-019-09937-w (2019).
Illingworth, R. S. et al. Orphan CpG islands identify numerous conserved promoters in the mammalian genome. PLoS Genet. 6, e1001134. https://doi.org/10.1371/journal.pgen.1001134 (2010).
Bae, M. G., Kim, J. Y. & Choi, J. K. Frequent hypermethylation of orphan CpG islands with enhancer activity in cancer. BMC Med. Genom. 9(1), 38. https://doi.org/10.1186/s12920-016-0198-1 (2016).
Illingworth, R. et al. A novel CpG island set identifies tissue-specific methylation at developmental gene loci. PLoS Biol. 6, e22. https://doi.org/10.1371/journal.pbio.0060022 (2008).
Carninci, P. et al. Genome-wide analysis of mammalian promoter architecture and evolution. Nat. Genet. 38, 626–635. https://doi.org/10.1038/ng1789 (2006).
Demircioğlu, D. et al. A pan-cancer transcriptome analysis reveals pervasive regulation through alternative promoters. Cell 178, 1465-1477.e1417. https://doi.org/10.1016/j.cell.2019.08.018 (2019).
Long, J. et al. DNA methylation-driven genes for constructing diagnostic, prognostic, and recurrence models for hepatocellular carcinoma. Theranostics 9, 7251–7267. https://doi.org/10.7150/thno.31155 (2019).
Larmonie, N. S. D. et al. MN1 overexpression is driven by loss of DNMT3B methylation activity in inv(16) pediatric AML. Oncogene 37, 107–115. https://doi.org/10.1038/onc.2017.293 (2018).
Shen, Z. et al. ESRRG promoter hypermethylation as a diagnostic and prognostic biomarker in laryngeal squamous cell carcinoma. J. Clin. Lab. Anal. 33, e22899. https://doi.org/10.1002/jcla.22899 (2019).
Virani, S. et al. NDN and CD1A are novel prognostic methylation markers in patients with head and neck squamous carcinomas. BMC Cancer 15, 825. https://doi.org/10.1186/s12885-015-1806-8 (2015).
Weiss, D., Basel, T., Sachse, F., Braeuninger, A. & Rudack, C. Promoter methylation of cyclin A1 is associated with human papillomavirus 16 induced head and neck squamous cell carcinoma independently of p53 mutation. Mol. Carcinog. 50, 680–688. https://doi.org/10.1002/mc.20798 (2011).
Xing, X. et al. The prognostic value of CDKN2A hypermethylation in colorectal cancer: A meta-analysis. Br. J. Cancer 108, 2542–2548. https://doi.org/10.1038/bjc.2013.251 (2013).
Kel, A. et al. Walking pathways with positive feedback loops reveal DNA methylation biomarkers of colorectal cancer. BMC Bioinform. 20, 119. https://doi.org/10.1186/s12859-019-2687-7 (2019).
Heo, W. I. et al. Identification of novel candidate variants including COL6A6 polymorphisms in early-onset atopic dermatitis using whole-exome sequencing. BMC Med. Genet. 18, 8. https://doi.org/10.1186/s12881-017-0368-9 (2017).
Meyers, K. J. et al. Genetic variations associated with echocardiographic left ventricular traits in hypertensive blacks. Hypertension (Dallas, Tex.: 1979) 49, 992–999. https://doi.org/10.1161/hypertensionaha.106.081265 (2007).
Pacifico, R. & Davis, R. L. Transcriptome sequencing implicates dorsal striatum-specific gene network, immune response and energy metabolism pathways in bipolar disorder. Mol. Psychiatr. 22, 441–449. https://doi.org/10.1038/mp.2016.94 (2017).
Gari, M. A. et al. Identification of novel genetic variations affecting osteoarthritis patients. BMC Med. Genet. 17, 68. https://doi.org/10.1186/s12881-016-0330-2 (2016).
VinuÉ, Á. et al. Changes in CDKN2A/2B expression associate with T-cell phenotype modulation in atherosclerosis and type 2 diabetes mellitus. Trans. Res.: J. Lab. Clin. Med. 203, 31–48. https://doi.org/10.1016/j.trsl.2018.08.003 (2019).
Klajic, J. et al. DNA methylation status of key cell-cycle regulators such as CDKNA2/p16 and CCNA1 correlates with treatment response to doxorubicin and 5-fluorouracil in locally advanced breast tumors. Clin. Cancer Res.: Off. J. Am. Assoc. Cancer Res. 20, 6357–6366. https://doi.org/10.1158/1078-0432.Ccr-14-0297 (2014).
Jithesh, P. V. et al. The epigenetic landscape of oral squamous cell carcinoma. Br. J. Cancer 108, 370–379. https://doi.org/10.1038/bjc.2012.568 (2013).
Sailer, V. et al. DNA methylation of indoleamine 2,3-dioxygenase 1 (IDO1) in head and neck squamous cell carcinomas correlates with IDO1 expression, HPV status, patients’ survival, immune cell infiltrates, mutational load, and interferon γ signature. EBioMedicine 48, 341–352. https://doi.org/10.1016/j.ebiom.2019.09.038 (2019).
Arantes, L. M. et al. Validation of methylation markers for diagnosis of oral cavity cancer. Eur. J. Cancer (Oxford, England: 1990) 51, 632–641. https://doi.org/10.1016/j.ejca.2015.01.060 (2015).
Price, W. N. 2nd. & Cohen, I. G. Privacy in the age of medical big data. Nat. Med. 25, 37–43. https://doi.org/10.1038/s41591-018-0272-7 (2019).
Miao, Y. R. et al. ImmuCellAI: A unique method for comprehensive T-cell subsets abundance prediction and its application in cancer immunotherapy. Adv. Sci. (Weinh., Baden-Wurtt., Ger.) 7, 1902880. https://doi.org/10.1002/advs.201902880 (2020).
Acknowledgements
This work was supported by Natural Science Foundation of Shandong Province [Grant No. ZR2018PH023 to X.M.], Projects of Medical and Health Technology Development Program in Shandong Province [Grant No. 2017WS231 to X.M.], Scientific Initiation Funding of Binzhou Medical University [Grant No. BY2015KYQD24 to X.M.]. We also appreciated Dr. Xia Guo (Ph.D., Associate Professor, College of Foreign Languages and Cultures, Sichuan University) for her kind assistance in the English writing.
Author information
Authors and Affiliations
Contributions
All the authors contributed the concept. C.S., S.L., and Y.C designed the study, developed methodology, and acquired, analyzed, and interpreted data; Y.C. wrote the manuscript. C.S., S.L., X.T., C.M., R.W, X.M. and X.W. revised the manuscript. X.W. and Y.C. supervised the study.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Shi, C., Liu, S., Tian, X. et al. Prognostic and therapeutic prediction by screening signature combinations from transcriptome–methylome interactions in oral squamous cell carcinoma. Sci Rep 12, 11400 (2022). https://doi.org/10.1038/s41598-022-15534-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-022-15534-7
- Springer Nature Limited