Abstract
Background
Recent research reported that mononuclear phagocyte system (MPS) can contribute to immune defense but the classification of head and neck squamous cell carcinoma (HNSCC) patients based on MPS-related multi-omics features using machine learning lacked.
Methods
In this study, we obtain marker genes for MPS through differential analysis at the single-cell level and utilize “similarity network fusion” and “MoCluster” algorithms to cluster patients’ multi-omics features. Subsequently, based on the corresponding clinical information, we investigate the prognosis, drugs, immunotherapy, and biological differences between the subtypes. A total of 848 patients have been included in this study, and the results obtained from the training set can be verified by two independent validation sets using “the nearest template prediction”.
Results
We identified two subtypes of HNSCC based on MPS-related multi-omics features, with CS2 exhibiting better predictive prognosis and drug response. CS2 represented better xenobiotic metabolism and higher levels of T and B cell infiltration, while the biological functions of CS1 were mainly enriched in coagulation function, extracellular matrix, and the JAK-STAT signaling pathway. Furthermore, we established a novel and stable classifier called “getMPsub” to classify HNSCC patients, demonstrating good consistency in the same training set. External validation sets classified by “getMPsub” also illustrated similar differences between the two subtypes.
Conclusions
Our study identified two HNSCC subtypes by machine learning and explored their biological difference. Notably, we constructed a robust classifier that presented an excellent classifying prediction, providing new insight into the precision medicine of HNSCC.
Avoid common mistakes on your manuscript.
Introduction
Head and neck cancer (HNSC) is the sixth most common malignant tumor in the world, with head and neck squamous cell carcinoma (HNSCC) accounting for the majority (Sung et al. 2021). Although some progress has been made in the treatment of HNSCC recently, many patients experience significant declines in swallowing or speech, and the 5-year total mortality rate remains at 60% (Murdoch 2007). Immunotherapy has become an emerging cancer treatment modality that regulates the immune system to fight against tumor cells and mitigates resistance to other treatment modalities, thus having a significant impact on the survival and quality of life of cancer patients (Chen and Mellman 2013). Preclinical data indicate that HNSCC is a deeply immunosuppressive disease characterized by abnormal secretion of pro-inflammatory cytokines and dysfunction of immune effector cells (Gavrielatou et al. 2020). Recent clinical practices have shown that the use of monoclonal antibodies to inhibit the PD-1/L1 checkpoint demonstrates good efficacy in the treatment of various cancer types including HNSCC. Combination therapies using checkpoint inhibitors with radiotherapy and/or chemotherapy, cytokine-based and/or adoptive T-cell therapies have also presented some effectiveness (Wallis et al. 2015). However, only a small number of HNSCC patients indeed have benefited from the widely used immunotherapy strategies in clinical practice (Seiwert et al. 2016), so it is increasingly important to discover new biomarkers for the personalized treatment of patients.
The mononuclear phagocyte system (MPS) is an important component of the body’s immune defense (Zhang and Zhang 2020; Ren et al. 2021) and the primary executor of nanoparticle clearance. Previous researches have shown that MPS is the first and foremost significant obstacle blocking drug carriers to target sites after entering the body, especially in terms of clearing the majority of circulating nanomaterials (Lu et al. 2023). MPS, consisting of monocytes, macrophages, and dendritic cells (DCs), play a role in innate immunity through pathogen sensing and phagocytosis and serve as a cellular component in adaptive immunity by presenting antigens to T cells (Geissmann et al. 2010). Monocytes represent immune effector cells, equipped with chemokine receptors and adhesion receptors, producing inflammatory cytokines, phagocytosing cells, and toxic molecules (Geissmann et al. 2010). They can differentiate into inflammatory DCs or macrophages during inflammation but may be less efficient in the steady state (Auffray et al. 2009). Macrophages are phagocytic cells that can eliminate malignant cells by phagocytosis or by producing soluble factors to induce tumor cell apoptosis. In addition to their direct cytotoxic abilities, macrophages play an important role in regulating the progression of tumors through mechanisms such as angiogenesis, fibrosis, and immune surveillance (Long and Beatty 2013). The secretion products of pDC which is a subtype of DC have immunogenic and tolerogenic functions in tumor immunity (Mitchell et al. 2018; Koucký et al. 2019).
MPS is a part of the tumor immune microenvironment. Previous studies have shown that the proportion of MPS in HNSCC patients varies individually and is usually associated with patient survival and other phenotypes (Balm et al. 1982, 1984a, b). However, in the exploration of biomarkers for HNSCC, previous research has not yet focused on this important component of the immune system. Instead, they have paid more attention to biomarkers associated with several hot topics, such as PDL-1 expression (Dong et al. 2002; Freeman et al. 2000; Topalian et al. 2012), tumor mutational burden/neo-antigens (Charoentong et al. 2017), interferon-γ gene signature (Woo et al. 2015), and tumor microenvironment (Ager and May 2015). There seems to be no previous study using MPS to search for biomarkers and to accurately classify HNSCC patients based on these biomarkers, to achieve further precision treatment. Therefore, differentiated patient classification based on MPS biomarkers is feasible and can provide some references for future personalized treatment of HNSCC.
Based on biomarkers of MPS obtained from single-cell sequencing analysis, this study aimed to recognize HNSCC subtypes with distinct overall survival, drug, and immunotherapy responses. Notably, we can not only consider the impact of gene expression, but also the factor including gene methylation and mutation to have a more comprehensive analysis while classifying HNSCC patients. In addition, to make our research results potentially useful in practice, we constructed a robust classifier based on genes with specific expression in subtypes, which can classify patients even with only gene expression data and had a certain degree of accuracy. The classifier now has been uploaded to GitHub (https://github.com/CQMUZC/getMPsub).
Materials and methods
Data source
A single-cell RNA sequencing (scRNA-seq) profile, GSE195832, was obtained from the GEO database (https://www.ncbi.nlm.nih.gov/geo). Considering the aim to explore the marker genes from tumor-infiltrating MPS, four raw scRNA-seq samples, GSM5851565, GSM5851567, GSM5851569, and GSM5851571, were included in our analysis. TCGA HNSCC multi-omics feature regarding RNA-seq, methylation, and mutation were attained as the training dataset from UCSC Xena (https://xenabrowser.net/) to form the specific HNSCC subtype. GSE41613 and GSE65858 were applied as the validation datasets to confirm whether the subtypes had strong universality and to verify certain effectiveness of the classifier.
Single-cell analysis
We first paid attention to the global quality of the mixed data including Mitochondrial percentage, the count of expression in samples per gene, and the count of gene expression per sample. Only cells with a mitochondrial percentage below 25% and gene expression numbers between 600 and 7500 will be included in the analysis. In addition, only genes expressed in more than 1000 cells will be considered. Given the batch effect from different samples, we employed “harmony” reduction to diminish the system error (Korsunsky et al. 2019; Azizi et al. 2018). We performed standardization and normalization on the preprocessed expression matrix so that gene expression levels of each cell can be compared and further analyzed. Principal component analysis (PCA) was adopted to reduce the data noise and uniform manifold approximation and projection (UMAP) was utilized to further depict cell clusters clearly. Given the results of “clustree” and the specific biological targets (Zappia et al. 2018), cells were clustered by integrated results from graph-based clustering and shared nearest-neighbor clustering. Then we annotated cell clusters and verified each other by two methods, which used the R package “SingleR” based on “celldex” to auto-annotate cells and some marker genes to manually annotate. According to the results, R function “FindAllMarkers” was applied to identify differentially expressed genes (DEGs) between each cell cluster and others. Through all the above analysis, we determined MPS and detected their marker genes to deepen our understanding. To confirm the identity of these marker genes, Enrichr (https://maayanlab.cloud/Enrichr/) was adopted to identify which cells will be enriched by these markers. And KEGG pathway enrichment analysis was employed to represent their biological function.
Clustering analysis in multi-omics features
MPS-related muti-omics were identified by the intersection of gene variables and markers from the single-cell analysis. R package “MOVICS” was utilized to characterize the HNSCC subtypes by unsupervised clustering (Lu et al. 2021). In the beginning, R function “getElites” filtered features that met some stringent requirements, in which “Cox” was used for RNA-seq and methylation while “freq” was for binary omics data. Then the optimal number of clusters was acquired referring to Cluster Prediction Index (CPI) and Gaps-statistics (Chalise and Fridley 2017). Considering the silhouette score and the final survival difference, SNF (Wang et al. 2014) and MoCluster (Meng et al. 2016) performed the consensus clustering and recognized the HNSCC subtypes. The overall nominal P-value was calculated by log-rank test and Kaplan–Meier (KM) Curve was printed to show the HNSCC subtypes’ survival difference. Finally, based on the specific expression genes of two subtypes, we aimed to develop a classifier that can predict the subtypes of other HNSCC patients using only RNA-seq.
Drug sensitivity
Genomics of Drug Sensitivity in Cancer (GDSC) are a public database containing cancer cells’ drug sensitivity and molecular markers corresponding to the applied drugs. Among patient subtypes, we considered the differences in four small molecule compounds, Paclitaxel, 5-Fluorouracil, Erlotinib, and Pazopanib. We tested the differential drug response of two clusters to nanomedicines, because Abraxane, a nano-subtype of paclitaxel, was included in the GDSC drug database. 5-Fluorouracil, as the main clinical treatment for HNSCC, was utilized to test whether the subtype had a distinct drug sensitivity for conventional treatment methods. Erlotinib and Pazopanib were utilized to test whether there was a response to specific targets, EGFR and CSF1R. Given the effect of these drugs used in combination with radiotherapy, we subsequently test the differential drug response of patients who had records of radiation therapy in two clusters. Independent-samples t-test was performed to determine the differences in two clusters. Kruskal–Wallis rank sum test was performed for multiple subtypes.
Tumor immune microenvironment
“CIBERSORT” algorithm (https://cibersort.stanford.edu/) evaluated the infiltration degree of 22 immune cells between the two subtypes. EPIC can analyze the expression matrix to determine the infiltration proportions of eight types of immune cells, including B cells, cancer-associated fibroblasts (CAFs), CD4+ T cells, CD8+ T cells, endothelial cells, macrophages, and NK cells, which were all important components of the immune microenvironment. TIMER utilized a deconvolution algorithm to infer the abundance of tumor-infiltrating immune cells from gene expression profiles. The immune-infiltrating situation of different subtypes can be corroborated given the results of the above algorithms. Additionally, “TIDE” algorithm was employed to evaluate the potential clinical efficacy of immunotherapy in different subtypes and reflected the underlying ability of tumors to escape the immune system. The TIDE score evaluated the potential response to immune checkpoint therapy, with a higher score indicating a poorer response to this treatment and may require alternative therapies. The Exclusion score evaluated the degree of infiltration of immune-suppressive cells in the tumor microenvironment. The higher score indicated a more severe infiltration of immune-suppressive cells and a poorer response to immune checkpoint therapy. The Dysfunction score evaluated the functional state of immune cells in the tumor microenvironment. The higher score demonstrated that the function of immune cells was suppressed in the tumor microenvironment, leading to a poorer response to immune checkpoint therapy. Besides, the higher MSI score corresponded to a higher level of immune cell infiltration and stronger immune response.
MPS-related analysis
The abundance of MPS including macrophages, DCs, and monocytes was calculated by “IOBR” (Zeng et al. 2021). Some targets, CSF1R, TLR8, EFGR, CXCR4, ABCA1, MGFE8, CD47, and CX3CL1, related to tumor-associated macrophagocytes (TAMs), immune therapy in HNSCC, and efferocytosis were detected to explore whether subtypes expressed differently.
Functional analysis
GO was a database established by the Gene Ontology Consortium, aimed at describing gene and protein function. Through GO enrichment analysis, this study can roughly annotate genes and classify them according to biological processes, molecular function, and cellular component. KEGG was a database that systematically analyzed gene function, links genomic information and functional information. The Hallmark gene set was a collection of genes developed jointly by the Human Cell Atlas and the Genomics Institute of the Novartis Research Foundation. The gene set was generated from cell-type-specific genomic expression data and contained gene markers for multiple tissue and cell types, which can be utilized to identify and analyze differences between different cell types or states. In this study, GSEA enrichment analysis was performed using the three different reference gene sets of GO, KEGG, and Hallmark to validate the specific biological differences exhibited by two subtypes. Pathways enriched in two or more reference gene sets were considered to represent unique biological functions specific to the subtype.
Integrated metabolism analysis
In addition to representing the immune microenvironment of subtypes, the R package “IOBR” had been used to evaluate metabolic differences between two subtypes. This study assessed the metabolic levels of subtypes from three perspectives: metabolism, fatty acid metabolism, and cholesterol metabolism, aiming to explore the relevant differences.
The integrated analysis of copy number variation
Considering subtype-specific mutation might be promising as therapeutic target, this study compared the mutational frequency among different subtypes. R package “MOVICS” offered two functions to measure genomic alterations potentially affecting immunotherapy, namely the quantification of total mutation burden (TMB) and fraction genome altered (FGA). In addition, the function “compFGA” calculated and compared not only FGA but also computed specific gain (FGG) or loss (FGL) per sample within each subtype. To measure the consistency of current subtypes with other pre-existing classifications, “MOVICS” offered the function “compAgree” to generate alluvial diagram, visualizing the consistency of two evaluation phenotypes with the current subtype as a reference.
Statistical analysis
All the data processing and analyses were executed in R software (Version 4.2.2). t-Test and Wilcoxon test were utilized to compare the differences between quantitative variables while Chi-square test was employed in qualitative variables. Spearman correlation test was utilized to explore the relationships between variables. The Kappa coefficient was utilized to measure the level of agreement between classifier results and actual classifications. A KAPPA value below 0.4 indicated poor agreement, 0.4–0.6 indicated moderate agreement, 0.6–0.8 indicated good agreement, and above 0.8 indicated excellent agreement. P < 0.05 was considered statistically significant in the whole process.
Results
Data processing
The main process of this study, including the analysis involved, is specifically shown in Fig. 1. Considering integrity and commonality, 848 HNSCC patients were included as the working data when patients containing missing information were excluded. Among them, 481 TCGA-HNSCC patients were included to train the classifier while 270 HNSCC samples in GSE65858 and 97 HNSCC samples in GSE41613 were enrolled, respectively, as two validation datasets. The basic characteristics including the origin, form, and some clinical characteristics of data per dataset is displayed in Table 1.
Single-cell analysis recognized marker genes
Through diminishing the batch effort, four raw scRNA-seq samples presented a uniform and random distribution under the UMAP reduction (Figure S1A). 25 cell clusters were gathered, of which a smaller number responded to more cells (Figure S1B). Some genes, CXCL8, AIF1, C1QC, C1QA, CD68, C1QB, CD83, CD86, CD14, and LYZ, that have been confirmed to be specifically expressed in MPS are used as manually annotated marker genes to view their expression in 25 cell clusters. Cluster 1, 13, and 16 were the cell populations with high expression of these genes (Fig. 2A). Besides, other cells were identified for gene expression using corresponding marker genes. Based on their expression level, each cell cluster was ultimately annotated as a specific cell population, in which cell clusters with high or no expression of multiple marker genes were defined as “unknown”. Besides, R package “SingleR” recognized certain cells referring to “celldex”. The corresponding heatmap (Fig. 2B) depicted the expression level of various cells in 25 clusters. Finally, we presented the annotation results under the UMAP reduction, showing their specific clusters (Fig. 2C). Almost cell clusters had the same definition except for “unknown”, and “Fibroblasts”, which heatmap resulted by “SingleR” also had a similar expression level compared with manual annotation. For instance, the heatmap indicated Fibroblasts reference expressed highly in cluster 2, 20, and 11, defined as “tissue stem cells” by SingleR, which is consistent with our manual annotation. Notably, clusters 1, 13, and 16 were manually annotated as mononuclear phagocytes including monocytes, macrophages, and DCs, that is, manual annotation and SingleR had the same determination regarding the targeted cells. 863 marker genes of mononuclear phagocytes were found by R function “FindAllMarkers” (Table S3) and were corroborated enriching in macrophages, monocytes, and DCs by Enrichr (Table S1). KEGG analysis presented that the pathway enriching the most genes was “Phagosome”, indicating our marker genes indeed characterized phagocytosis-related biological functions (Fig. 2D).
Clustering analysis recognized HNSC subtypes
Given the integrated results by the CPI and Gap-statistics, the imputed optimal cluster was 2 (Fig. 3A). The consensus heatmap depicted robust pairwise similarities for two subtypes and the details of how it got a stable clustering result by applying hierarchical clustering (Fig. 3B). The genome-wide heatmap was utilized to reveal information about how the samples cluster together and provide insights into potential sample biases or other artifacts (Fig. 3C). Additionally, it offered the difference in some phenotypes such as age and clinical stage between the two subtypes. According to subtype-specific biomarkers (Table S2), we established an HNSC classifier using nearest template prediction (NTP) to predict the possible subtypes of each sample. The Kappa values, evaluating the performance of the HNSC subtypes classifier, represented a good consistency in predicting the subtype for HNSC samples by the comparison of the actual subtype and the predicted type in the training dataset (Fig. 3D). The consensus clustering resulted in the HNSC subtypes with distinct survival differences in the training dataset, in which samples in cluster 2 were more likely to have a better prognosis (Fig. 3E). GSE41613 and GSE65858 were considered the validation dataset to confirm the effectiveness of the HNSC classifier. The KM curve indicated that cluster 2 identified by the classifier had a longer overall survival time (Fig. 3F, G). Notably, for the convenience of future research, we have packaged the classifier using the NTP algorithm into an R package called “getMPsub” and uploaded it to GitHub for easy accessibility.
Clustering analysis recognized HNSC subtypes. A The number of multi-omics clusters. B A stable clustering result by applying hierarchical clustering. C The heatmap of overall clustering process. D The KAPPA value of the classifier. E The KM curve of TCGA cohort. F The KM curve of GSE41613. G The KM curve of GSE65858
Drug sensitivity
The t-test showed significant differences in sensitivity to four small molecule compounds between two subtypes, with cluster 2 exhibiting lower IC50 values, indicating greater sensitivity to these drugs. Both in the TCGA training set and in the two validation sets, the drug sensitivity of the two subtypes represented similar differences, which also verified the robustness of our classifying strategy (Fig. 4A–C). According to the NCCN (National Comprehensive Cancer Network, https://www.nccn.org/) guidelines for head and neck cancer treatment, these drugs were used in combination with radiotherapy in first-line treatment and in previous studies. Therefore, we do corresponding drug sensitivity tests on patients from the TCGA cohort who had records of radiation therapy. Further testing revealed that CS1 exhibited similar drug sensitivity regardless of its history of radiation therapy while CS2 patients, after undergoing radiation therapy, were more sensitive to 5-FU and Pazopanib compared to those without radiation therapy. (Figure S13).
The comparison of two subtypes’ drug sensitivity. A The estimated IC50 of paclitaxel, 5-fluorouracil, erlotinib, and pazopanib between two subtypes in TCGA cohort. B The estimated IC50 of paclitaxel, 5-fluorouracil, erlotinib, and pazopanib between two subtypes in GSE41613. C The estimated IC50 of paclitaxel, 5-fluorouracil, erlotinib, and pazopanib between two subtypes in GSE65858
Immune cell infiltration analysis
Tumor immune cell infiltration in recent researches was widely considered a crucial feature of the TME. The boxplot illustrated the different abundance of immune cells between the TME of two HNSC subtypes via three algorithms, CIBERSORT, EPIC and TIMER. In the CIBERSORT results, cluster 2 in three datasets tended to represent a higher degree of immune infiltration in almost all 22 immune cells, except in M1 macrophages, M2 macrophages, resting DCs, Neutrophils, activated Mast cells, resting memory CD4 T cells, and memory B cells (Fig. 5A, Figs. S7, S8). Similar results were acquired by the EPIC and TIMER algorithm, showing that the cluster 2 had a higher level of immune infiltration in main immune cell, CD8 T cell, and a lower level of some cells related to MPS (Fig. 5B, C, Figs. S7B, C, S8B, C). The difference in the degree of specific cell infiltration might be caused by the number of immune cells and different statistical methods used. Above the results, cluster 2 performed better. As a supplement, cluster 2 had a lower level of TIDE score, Exclusion score, but a higher score of MSI, indicating a better performance of the potential clinical efficacy of immunotherapy (Fig. 5D–G, Figs. S7D–G, S8D–G).
The comparison of two subtypes’ immune cell infiltration. A Differences in 22 immune cells between two subtypes by algorithm CIBERSORT. B Differences in eight immune cells between two subtypes by algorithm EPIC. C Differences in six immune cells between two subtypes by algorithm TIMER. D The TIDE score of two subtypes. E The Dysfunction score of two subtypes. F The Exclusion score of two subtypes. G The MSI score of two subtypes (ns/empty space, no significance, *p < 0.05, **p < 0.01, ***p < 0.001, and ****p < 0.0001)
MPS-related analysis
We examined the cell proportions of MPS in HNSC subtypes and found that cluster 2 in most gene signature, at a statistical level, had lower scores for macrophages, monocytes, and DCs, indicating that patients in cluster 2 had a lower proportion of MPS cells (Fig. 6A–C, Figs. S9A–C, S10A–C). This study then detected the immune checkpoint targets related to MPS. In the traditional HNSC targets, cluster 2 had a lower expression level of EFGR and a higher expression level of CXCR4 (Fig. 6F, G), with similar trend or similar statistical meaning in two validations (Figs. S9F, G, S10F, G). Considering that MPS included TAMs, we also tested the relevant targets for TAMs. Cluster 2 had a higher expression level of CSF1R and a lower expression level of TLR8 (Fig. 6D, E). Recent studies had also found that MPS more or less participated in efferocytosis, so this study also tested whether there were expression differences in signaling molecules and targets for efferocytosis in HNSC patient subtypes. Finally, the result depicted that CD47, MFGE8, and ABCA1 all had lower expression levels in cluster 2, while CX3CL1 had a higher expression level in cluster 2 (Fig. 6H–K). MFGE8, ABCA1, and CX3CL1 all presented the same results in at least one validation set (Figs. S9H–K, S10H–K).
MPS-related analysis. A The specific proportion of macrophages between two subtypes. B The specific proportion of DCs between two subtypes. C The specific proportion of monocytes between two subtypes. D-K The expression level of TLR8, CSF1R, EGFR, CXCR4, ABCA1, MFGE8, CD47, and CX3CL1, between two subtypes. (ns/empty space, no significance, *p < 0.05, **p < 0.01, ***p < 0.001, and ****p < 0.0001)
Functional analysis
Enrichment analysis showed that there were many pathways related to immunotherapy in both clusters, such as the JAK-STAT signaling pathway in cluster 1 and the KRAS signaling pathway in cluster 2. However, pathways enriched in two or more reference sets showed differences in different subtypes. Cluster 1 enriched pathways were related to coagulation function, extracellular matrix (ECM), and JAK-STAT signaling pathway, while cluster 2 enriched pathways were related to xenobiotic metabolism and DNA methylation (Fig. 7A, Figs. S3, S4). Biological functional GSVA analysis using the gene sets built in “MOVICS” represented significant differences between the two subtypes in interleukins, cytokines, pathogen defense, and senescence (Fig. 7B, Figs. S11A, S12A). The KRAS expression levels between the two subtypes in TCGA, GSE65858 were depicted using box plots, indicating significant statistical differences (Fig. 7C, Fig. S12B).
Metabolism analysis
This study also investigated metabolic differences between subtypes, including overall metabolism, fatty acid metabolism, and cholesterol metabolism. In most cases, cluster 2 had higher metabolic scores, corresponding to better metabolic capabilities. Notably, cluster 2 had a higher level of drug metabolism by cytochrome than cluster 1 while cluster 2 had a relatively lower level of drug metabolism by other enzyme, suggesting the talent difference of drug using (Fig. 8A, Figs. S5A, S6A). The expression level of fatty acid metabolism revealed that cluster was more likely to have a better ability of fatty acid metabolism (Fig. 8B, Figs. S5B, S6B). The level of cholesterol metabolism also had the similar tendency (Fig. 8C, Figs. S5C, S6C).
Metabolism analysis of two subtypes. A The overall metabolism analysis between two subtypes. B The fatty acid metabolism analysis between two subtypes. C The cholesterol metabolism analysis between two subtypes. (ns/empty space, no significance, *p < 0.05, **p < 0.01, ***p < 0.001, and ****p < 0.0001)
The integrated analysis of copy number variation
Comparing the mutational frequency among different clusters, this study noticed SYNE1, NSD1, CASP8, RP1, HRAS, CTNND2 had a statistical difference (Fig. 9A). Cluster 2 had a higher level of TMB, which is a favorable prognostic factor for cancer (Fig. 9B). Besides, the boxplot (Fig. 9C) demonstrated that cluster 2 had a more copy number-altered genome, whether lost or obtained. The alluvial diagram illustrated the differences in existing phenotypes between different subtypes, which can be seen that Cluster 2 has a higher proportion of survival status in a smaller number of people compared to Cluster 1 (Fig. 9D, Figs. S11D, S12D).
The integrated analysis of copy number variation. A The comparison of the mutational frequency between two subtypes. B The level of TMB between two subtypes. C The comparison of FGA, FGL, and FGG between two subtypes. D Consistency between subtypes and other clinical phenotypes. (0 means alive and 1 means death)
Discussion
An increasing number of studies had discovered the significant role of MPS in TME, and the regulation of HNSCC progression by MPS had stirred a heated discussion. Nearly all studies aimed at the overall influence of immune system on HNSCC pathobiology, and there seemed to be a lack of the exploration and application of MPS biomarkers for classifying HNSCC patients and verifying, from the statistical perspective based on the clinical data, whether MPS had an impact on the growth process of HNSCC. Therefore, the work of this study is both reasonable and necessary.
This study first detected the MPS marker genes at the single-cell level. By validating the results of manual and automatic annotation, we found that MPS in the single cell RNA seq samples included monocytes, macrophages, and DCs. Comparing the differences between cells, we identified genes that were specifically expressed in MPS. Based on the MPS marker genes, we utilized comprehensive clustering analysis mixing the results of “SNF” and “MoCluster” to divide 481 HNSC patients in TCGA into two subtypes, CS1 and CS2. KM curve and other functional analyses showed that CS2 had a better prognosis, and the two subtypes showed significant differences in multiple aspects such as drug sensitivity, tumor mutations, and enriched pathways, providing a certain differentiation strategy for clinical treatment of HNSCC. In addition, according to the specific expression genes of CS1 and CS2, we constructed a classifier that can perform corresponding stratification with only patient gene expression profile data. The KAPPA value of the classifier for the classification results was 0.688, representing a good performance. Two external independent validation sets were employed to test the efficacy of the classifier, and the final results corroborated that CS2 had better prognostic effects and similar differences in other aspects. To make our study reproducible and truly applicable to clinical research, we uploaded our packaged classifier to GitHub. Researchers can accurately classify patients with only RNA-seq data and clinical information, which might provide a reference for future clinical practice.
From the perspective of patient survival analysis alone, CS2 patients had longer overall survival time, with similar results in both the training and validation sets. However, predicting patient prognostic effects was only one function of the classifier. This study also explored other aspects such as the differences between the two subtypes in drug response, immune microenvironment, and biology. Even if CS1 had a higher probability of poor prognosis, we still hoped to present the various differences between the two subtypes at the genetic level to find suitable treatment strategies for each and apply them to clinical practice.
As many studies have shown that MPS played a crucial role in drug delivery, a higher proportion of MPS can lead to the dissolution of drug carriers, thereby enhancing patient resistance to drugs. Therefore, this study focused on the performance of the two subtypes in the GDSC database. CS2 presented a more sensitive response to Paclitaxel, 5-fluorouracil, Erlotinib, and Pazopanib, indicating that they may have better efficacy in clinical trials for patients in CS2. Paclitaxel therapy produced promising results in HNSCC patients and was successfully combined with cisplatin, carboplatin and/or etoposide in patients with advanced HNSCC (Spencer and Faulds 1994; Hitt et al. 2012). Given paclitaxel included Abraxane, a nanoformulation, in the GDSC, differences in patient drug response can indirectly reflect the impact of MPS on nanocarriers. 5-Fluorouracil was a major antimetabolite drug that was widely used in patients with advanced and recurrent HNSCC, which often appeared in clinical treatment (Yasumatsu et al. 2009; Lima et al. 2021; Psyrri et al. 2017). The EGFR signaling pathway contributed to the development and progression of HNSCC, and the EGFR-targeted drug erlotinib had shown good efficacy in the treatment of HNSCC (Gross et al. 2014; Stanam et al. 2015; Allen et al. 2015; Xu et al. 2017). For patients with recurrent or metastatic HNSCC, 800 mg/day of the CSF1R-targeted drug Pazopanib oral suspension in combination with standard weekly cetuximab had observed encouraging preliminary anti-tumor activity (Dincer et al. 2019; Adkins et al. 2018). From the analysis of the above drugs, which are clinically commonly used, targeting EFGR, CSF1R, and containing nanocarriers, we concluded that CS2 was more sensitive to these small molecular compounds. In other words, these drugs may be preferred when patients were identified as CS2 type. Furthermore, our study found CS2 patients who treated radiation therapy were more sensitive to 5-fluorouracil and Pazopanib compared to those without radiation therapy, indicating a combination strategy can be an effective way to improve the status of patients. Since the subtypes in this study were clustered based on multi-omics data related to MPS, it was reasonable to believe that MPS had a certain impact on the efficacy of these drugs.
CS2 had higher scores in most B cells and T cells in all three immune infiltration analyses, indicating a higher proportion of cells and immune effects. Overall, the immune system of CS2 can better inhibit HNSCC tumors, resulting in a better prognosis and drug response for patients. The biological functions analysis of CS2 illustrated that most pathways were enriched in metabolic pathways, which might be related to MPS. Further metabolic analysis also indicated that CS2 had better performance in overall metabolic, fatty acid metabolism, and cholesterol metabolism, reflecting CS2’s strong biological feature of xenobiotic metabolism. This phenomenon can be seen in some clues from immune infiltration analysis, which CS2 had lower proportions of MPS such as macrophages and DCs. Previous literature had shown that metabolism was a crucial underpinning of MPS (Davies et al. 2019), and its functional heterogeneity required diverse metabolism. MPS such as macrophages must induce or inhibit metabolic pathways to find, ingest, and digest apoptotic cells (Trzeciak et al. 2021). Metabolic interactions occurred when these cells came into contact with other cells, such as the stress-related metabolism produced by the interaction between NKT cells and MPS (Cortesi et al. 2018). In addition, MPS was also involved in the process of endogenous lipid oxidation-induced metabolism (Gioia and Zanoni 2021). In other aspects, we also compared the mutation differences between the two subtypes, in which CS2 had higher TMB and FGA, representing patients in CS2 were more likely to benefit from immunotherapy.
Although CS1 was considered to have a worse prognosis, the analysis of immune checkpoint inhibitors (ICIs) depicted higher expression of CS1 in many targets such as CD47, ABCA1, and MFGE8, which might be potential therapeutic targets. These targets had recently been shown to be associated with efferocytosis. In cancer treatment, CD47, as the classic “don’t eat me” signal of efferocytosis, was often overexpressed in apoptotic cell clearance defects and pathological blockade of CD47-SIRP1 interaction (Mehrotra and Ravichandran 2022). ABCA1 expression defects can lead to decreased efferocytosis in disease, and it seemed to have a beneficial effect on efferocytosis and anti-inflammatory signal transduction in vivo (Tang and Oram 2009; Wang and Oram 2005; Wang and Oram 2007; Joseph et al. 2002). Inflammatory-induced TLR signaling can further inhibit the expression of ligands such as MFGE8, thereby inhibiting increased efferocytosis and furthering disease pathology (Komura et al. 2009). Further enrichment analysis demonstrated that CS1 exhibited specificity in coagulation function, ECM, and JAK-STAT signaling pathway. In HNSCC patients, high platelet count and low tumor stroma ratio were closely associated with increased metastasis and poor prognosis. STAT3 and STAT5 were expressed and activated in HNSCC, contributing to cell survival and proliferation. In HNSCC, STAT can be activated by various signaling pathways, including EGFR, α7 nicotinic receptor, interleukin receptor, and erythropoietin receptor pathways (Lai and Johnson 2010). Due to the characteristic gene enrichment of CS1 in this target, future research can apply JAK inhibitors to clinical treatment when patients were identified as CS1 (Hwang et al. 2021).
Conclusion
In this study, we identified two HNSCC subtypes and explored their biological difference. Based on the specific genes from the two subtypes, we established the relevant classifier, contributing to implementing precise treatment for HNSCC patients. Combined with other clinical characteristics, we hope our study can guide therapeutic strategies and offer some clues between MPS and HNSCC.
Limitations
The study also has some limitations. In drug response analysis, the reference database only has response data of single drug action so we are unable to conduct sensitivity testing on the combined small molecule drug regimen. We look forward to more researchers delving deeper into drug response for cell lines to provide more data for future research.
Availability of data and materials
The datasets generated and analyzed during the current study are available in the Cancer Genome Atlas (TCGA) repository (https://portal.gdc.cancer.gov/) and GEO database (https://www.ncbi.nlm.nih.gov/geo).
References
Adkins D, Mehan P, Ley J, Siegel MJ, Siegel BA, Dehdashti F et al (2018) Pazopanib plus cetuximab in recurrent or metastatic head and neck squamous cell carcinoma: an open-label, phase 1b and expansion study. Lancet Oncol 19(8):1082–1093
Ager A, May MJ (2015) Understanding high endothelial venules: lessons for cancer immunology. Oncoimmunology 4(6):e1008791
Auffray C, Sieweke MH, Geissmann F (2009) Blood monocytes: development, heterogeneity, and relationship with dendritic cells. Annu Rev Immunol 27:669–692
Azizi E, Carr AJ, Plitas G, Cornish AE, Konopacki C, Prabhakaran S et al (2018) Single-cell map of diverse immune phenotypes in the breast tumor microenvironment. Cell 174(5):1293-1308.e36
Balm FJ, Drexhage HA, von Blomberg ME, Snow GB (1982) Mononuclear phagocyte function in head and neck cancer: NBT-dye reduction, maturation and migration of peripheral blood monocytes. Laryngoscope 92(7 Pt 1):810–814
Balm FJ, von Blomberg-van de Flier BM, Drexhage HA, de Haan-Meulman M, Snow GB (1984a) Mononuclear phagocyte function in head and neck cancer: depression of murine macrophage accumulation by low molecular weight factors derived from head and neck carcinomas. Laryngoscope 94(2 Pt 1):223–227
Balm FA, Drexhage HA, von Blomberg M, Weltevreden EF, Veldhuizen RW, Mullink R et al (1984b) Mononuclear phagocyte function in head and neck cancer. Chemotactic responsiveness of blood monocytes in correlation between histologic grade of the tumor and infiltration of these cells into the tumor area. Cancer. 54(6):1010–1015
Chalise P, Fridley BL (2017) Integrative clustering of multi-level ‘omic data based on non-negative matrix factorization algorithm. PLoS One 12(5):e0176278
Charoentong P, Finotello F, Angelova M, Mayer C, Efremova M, Rieder D et al (2017) Pan-cancer immunogenomic analyses reveal genotype-immunophenotype relationships and predictors of response to checkpoint blockade. Cell Rep 18(1):248–262
Chen DS, Mellman I (2013) Oncology meets immunology: the cancer-immunity cycle. Immunity 39(1):1–10
Cortesi F, Delfanti G, Casorati G, Dellabona P (2018) The pathophysiological relevance of the iNKT cell/mononuclear phagocyte crosstalk in tissues. Front Immunol 9:2375
Davies LC, Rice CM, McVicar DW, Weiss JM (2019) Diversity and environmental adaptation of phagocytic cell metabolism. J Leukoc Biol 105(1):37–48
de Lima JM, Castellano LRC, Bonan PRF, de Medeiros ES, Hier M, Bijian K et al (2021) Chitosan/PCL nanoparticles can improve anti-neoplastic activity of 5-fluorouracil in head and neck cancer through autophagy activation. Int J Biochem Cell Biol 134:105964
Di Gioia M, Zanoni I (2021) Dooming phagocyte responses: inflammatory effects of endogenous oxidized phospholipids. Front Endocrinol (lausanne) 12:626842
Dincer C, Kaya T, Keskin O, Gursoy A, Tuncbag N (2019) 3D spatial organization and network-guided comparison of mutation profiles in glioblastoma reveals similarities across patients. PLoS Comput Biol 15(9):e1006789
Dong H, Strome SE, Salomao DR, Tamura H, Hirano F, Flies DB et al (2002) Tumor-associated B7–H1 promotes T-cell apoptosis: a potential mechanism of immune evasion. Nat Med 8(8):793–800
Freeman GJ, Long AJ, Iwai Y, Bourque K, Chernova T, Nishimura H et al (2000) Engagement of the PD-1 immunoinhibitory receptor by a novel B7 family member leads to negative regulation of lymphocyte activation. J Exp Med 192(7):1027–1034
Gavrielatou N, Doumas S, Economopoulou P, Foukas PG, Psyrri A (2020) Biomarkers for immunotherapy response in head and neck cancer. Cancer Treat Rev 84:101977
Geissmann F, Manz MG, Jung S, Sieweke MH, Merad M, Ley K (2010) Development of monocytes, macrophages, and dendritic cells. Science 327(5966):656–661
Gross ND, Bauman JE, Gooding WE, Denq W, Thomas SM, Wang L et al (2014) Erlotinib, erlotinib-sulindac versus placebo: a randomized, double-blind, placebo-controlled window trial in operable head and neck cancer. Clin Cancer Res 20(12):3289–3298
Hitt R, Irigoyen A, Cortes-Funes H, Grau JJ, García-Sáenz JA, Cruz-Hernandez JJ et al (2012) Phase II study of the combination of cetuximab and weekly paclitaxel in the first-line treatment of patients with recurrent and/or metastatic squamous cell carcinoma of head and neck. Ann Oncol 23(4):1016–1022
Hwang BO, Park SY, Cho ES, Zhang X, Lee SK, Ahn HJ et al (2021) Platelet CLEC2-Podoplanin Axis as a Promising Target for Oral Cancer Treatment. Front Immunol 12:807600
Joseph SB, McKilligin E, Pei L, Watson MA, Collins AR, Laffitte BA et al (2002) Synthetic LXR ligand inhibits the development of atherosclerosis in mice. Proc Natl Acad Sci U S A 99(11):7604–7609
Komura H, Miksa M, Wu R, Goyert SM, Wang P (2009) Milk fat globule epidermal growth factor-factor VIII is down-regulated in sepsis via the lipopolysaccharide-CD14 pathway. J Immunol 182(1):581–587
Korsunsky I, Millard N, Fan J, Slowikowski K, Zhang F, Wei K et al (2019) Fast, sensitive and accurate integration of single-cell data with Harmony. Nat Methods 16(12):1289–1296
Koucký V, Bouček J, Fialová A (2019) Immunology of plasmacytoid dendritic cells in solid tumors: a brief review. Cancers 11(4):470
Lai SY, Johnson FM (2010) Defining the role of the JAK-STAT pathway in head and neck and thoracic malignancies: implications for future therapeutic approaches. Drug Resist Updat 13(3):67–78
Long KB, Beatty GL (2013) Harnessing the antitumor potential of macrophages for cancer immunotherapy. Oncoimmunology 2(12):e26860
Lu X, Meng J, Zhou Y, Jiang L, Yan F (2021) MOVICS: an R package for multi-omics integration and visualization in cancer subtyping. Bioinformatics 36(22–23):5539–5541
Lu J, Gao X, Wang S, He Y, Ma X, Zhang T et al (2023) Advanced strategies to evade the mononuclear phagocyte system clearance of nanomaterials. Exploration (Beijing) 3(1):20220045
Mehrotra P, Ravichandran KS (2022) Drugging the efferocytosis process: concepts and opportunities. Nat Rev Drug Discov 21(8):601–620
Meng C, Helm D, Frejno M, Kuster B (2016) moCluster: identifying joint patterns across multiple omics data sets. J Proteome Res 15(3):755–765
Mitchell D, Chintala S, Dey M (2018) Plasmacytoid dendritic cell in immunity and cancer. J Neuroimmunol 15(322):63–73
Murdoch D (2007) Standard, and novel cytotoxic and molecular-targeted, therapies for HNSCC: an evidence-based review. Curr Opin Oncol 19(3):216–221
Psyrri A, Fortpied C, Koutsodontis G, Avgeris M, Kroupis C, Goutas N et al (2017) Evaluation of the impact of tumor HPV status on outcome in patients with locally advanced unresectable head and neck squamous cell carcinoma (HNSCC) receiving cisplatin, 5-fluorouracil with or without docetaxel: a subset analysis of EORTC 24971 study. Ann Oncol 28(9):2213–2218
Ren X, Zhang L, Zhang Y, Li Z, Siemers N, Zhang Z (2021) Insights gained from single-cell analysis of immune cells in the tumor microenvironment. Annu Rev Immunol 39:583–609
Seiwert TY, Burtness B, Mehra R, Weiss J, Berger R, Eder JP et al (2016) Safety and clinical activity of pembrolizumab for treatment of recurrent or metastatic squamous cell carcinoma of the head and neck (KEYNOTE-012): an open-label, multicentre, phase 1b trial. Lancet Oncol 17(7):956–965
Spencer CM, Faulds D (1994) Paclitaxel. A review of its pharmacodynamic and pharmacokinetic properties and therapeutic potential in the treatment of cancer. Drugs 48(5):794–847
Stanam A, Love-Homan L, Joseph TS, Espinosa-Cotton M, Simons AL (2015) Upregulated interleukin-6 expression contributes to erlotinib resistance in head and neck squamous cell carcinoma. Mol Oncol 9(7):1371–1383
Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A et al (2021) Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 71(3):209–249
Tang C, Oram JF (2009) The cell cholesterol exporter ABCA1 as a protector from cardiovascular disease and diabetes. Biochim Biophys Acta 1791(7):563–572
Topalian SL, Drake CG, Pardoll DM (2012) Targeting the PD-1/B7-H1 (PD-L1) pathway to activate anti-tumor immunity. Curr Opin Immunol 24(2):207–212
Trzeciak A, Wang YT, Perry JSA (2021) First we eat, then we do everything else: The dynamic metabolic regulation of efferocytosis. Cell Metab 33(11):2126–2141
Van Allen EM, Lui VWY, Egloff AM, Goetz EM, Li H, Johnson JT et al (2015) Genomic correlate of exceptional erlotinib response in head and neck squamous cell carcinoma. JAMA Oncol 1(2):238–244
Wallis SP, Stafford ND, Greenman J (2015) Clinical relevance of immune parameters in the tumor microenvironment of head and neck cancers. Head Neck 37(3):449–459
Wang Y, Oram JF (2005) Unsaturated fatty acids phosphorylate and destabilize ABCA1 through a phospholipase D2 pathway. J Biol Chem 280(43):35896–35903
Wang Y, Oram JF (2007) Unsaturated fatty acids phosphorylate and destabilize ABCA1 through a protein kinase C delta pathway. J Lipid Res 48(5):1062–1068
Wang B, Mezlini AM, Demir F, Fiume M, Tu Z, Brudno M et al (2014) Similarity network fusion for aggregating data types on a genomic scale. Nat Methods 11(3):333–337
Woo SR, Corrales L, Gajewski TF (2015) Innate immune recognition of cancer. Annu Rev Immunol 33:445–474
Xu MJ, Johnson DE, Grandis JR (2017) EGFR-targeted therapies in the post-genomic era. Cancer Metastasis Rev 36(3):463–473
Yasumatsu R, Nakashima T, Uryu H, Masuda M, Hirakawa N, Shiratsuchi H et al (2009) The role of dihydropyrimidine dehydrogenase expression in resistance to 5-fluorouracil in head and neck squamous cell carcinoma cells. Oral Oncol 45(2):141–147
Zappia L, Oshlack A (2018) Clustering trees: a visualization for evaluating clusterings at multiple resolutions. GigaScience [Internet]. 2018 Jul 1 [cited 2023 Jun 13];7 (7). Available from https://doi.org/10.1093/gigascience/giy083/5052205
Zeng D, Ye Z, Shen R, Yu G, Wu J, Xiong Y et al (2021) IOBR: multi-omics immuno-oncology biological research to decode tumor microenvironment and signatures. Front Immunol 12:687975
Zhang Y, Zhang Z (2020) The history and advances in cancer immunotherapy: understanding the characteristics of tumor-infiltrating immune cells and their therapeutic implications. Cell Mol Immunol 17(8):807–821
Acknowledgements
We are very grateful to the TCGA and GEO database and other public resources for providing us with a research foundation. In addition, we greatly appreciate the inspiration and insightful assistance provided by the R package “MOVICS” for our classifier “getMPsub”.
Funding
This research was funded by (National Youth Science Foundation Project), grant number [82204159]) and by (Postdoctoral Fund project of Chongqing), grant number (cstc2021jcyj-bshX0220).
Author information
Authors and Affiliations
Contributions
Conceptualization, CZ; methodology, CZ and JD; software, CZ and KL; validation, CZ and JD; formal analysis, CZ and HL; resources, CZ, YZ and GL; data curation, GL; writing—original draft preparation, CZ; writing—review and editing, XZ and BX; visualization, CZ; supervision, BX; project administration, XZ; funding acquisition, BX All authors have read and agreed to the published version of the manuscript.
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare that they have no competing interests.
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Zhang, C., Deng, J., Li, K. et al. Mononuclear phagocyte system-related multi-omics features yield head and neck squamous cell carcinoma subtypes with distinct overall survival, drug, and immunotherapy responses. J Cancer Res Clin Oncol 150, 37 (2024). https://doi.org/10.1007/s00432-023-05512-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00432-023-05512-5