Immune-related gene data-based molecular subtyping related to the prognosis of breast cancer patients

Mu, Guoyu; Ji, Hong; He, Hui; Wang, Hongjiang

doi:10.1007/s12282-020-01191-z

Immune-related gene data-based molecular subtyping related to the prognosis of breast cancer patients

Original Article
Open access
Published: 27 November 2020

Volume 28, pages 513–526, (2021)
Cite this article

Download PDF

You have full access to this open access article

Breast Cancer Aims and scope Submit manuscript

Immune-related gene data-based molecular subtyping related to the prognosis of breast cancer patients

Download PDF

Guoyu Mu¹^na1,
Hong Ji²^na1,
Hui He³^na1 &
…
Hongjiang Wang¹

2317 Accesses
2 Citations
Explore all metrics

Abstract

Background

Breast cancer (BC), which is the most common malignant tumor in females, is associated with increasing morbidity and mortality. Effective treatments include surgery, chemotherapy, radiotherapy, endocrinotherapy and molecular-targeted therapy. With the development of molecular biology, immunology and pharmacogenomics, an increasing amount of evidence has shown that the infiltration of immune cells into the tumor microenvironment, coupled with the immune phenotype of tumor cells, will significantly affect tumor development and malignancy. Consequently, immunotherapy has become a promising treatment for BC prevention and as a modality that can influence patient prognosis.

Methods

In this study, samples collected from The Cancer Genome Atlas (TCGA) and ImmPort databases were analyzed to investigate specific immune-related genes that affect the prognosis of BC patients. In all, 64 immune-related genes related to prognosis were screened, and the 17 most representative genes were finally selected to establish the prognostic prediction model of BC (the RiskScore model) using the Lasso and StepAIC methods. By establishing a training set and a test set, the efficiency, accuracy and stability of the model in predicting and classifying the prognosis of patients were evaluated. Finally, the 17 immune-related genes were functionally annotated, and GO and KEGG signal pathway enrichment analyses were performed.

Results

We found that these 17 genes were enriched in numerous BC- and immune microenvironment-related pathways. The relationship between the RiskScore and the clinical characteristics of the sample and signaling pathways was also analyzed.

Conclusions

Our findings indicate that the prognostic prediction model based on the expression profiles of 17 immune-related genes has demonstrated high predictive accuracy and stability in identifying immune features, which can guide clinicians in the diagnosis and prognostic prediction of BC patients with different immunophenotypes.

A novel immune-related prognostic index for predicting breast cancer overall survival

Article 04 November 2020

Identification and development of an independent immune-related genes prognostic model for breast cancer

Article Open access 30 March 2021

Evaluating the tumor immune profile based on a three-gene prognostic risk model in HER2 positive breast cancer

Article Open access 03 June 2022

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Background

Breast cancer (BC), which is the most common malignant tumor and the leading cause of cancer-related deaths in women in underdeveloped countries, affected 882,900 individuals and resulted in 324,300 deaths in 2012 alone; that year, BC accounted for 25% and 15% of all cancer cases and cancer deaths among females, respectively [1]. Generally, BC is associated with reproductive and endocrine risk factors, including oral contraceptive use, nulliparity and long menstrual periods [2]. On the contrary, some potentially modifiable risk factors include alcohol consumption, obesity, physical inactivity, and menopausal hormone therapy [3].

Some large-scale clinical data indicate that systemic adjuvant chemotherapy should generally not be recommended for most patients with early BC following surgery or radiotherapy, since chemotherapy would result in far greater toxicity relative to the survival benefit of the patients [4,5,6]. However, patients with a low likelihood of survival who do not undergo chemotherapy will quickly relapse, which results in the invasion of adjacent tissues and distant metastasis [7]. Consequently, it is particularly important to determine the relevant survival risk of patients through subgroup classification and early diagnosis and to provide additional systemic adjuvant chemotherapy to high-risk patients.

According to recent studies, BC can be classified into the following four subtypes: Luminal A (ER + /PR + /HER2 −, grade 1 or grade 2), Luminal B (ER + /PR + /HER2 + , or ER + /PR + /HER2 − grade 3), HER2-overexpressing (ER − /PR − /HER2 +), and triple-negative breast cancer (TNBC, ER − /PR − /HER2 −) [8]. Among them, the Luminal A subtype is associated with a favorable prognosis and sensitivity to endocrine therapy, which means that only endocrine therapy is the general treatment approach [9]. On the contrary, the Luminal B subtype is associated with a high tumor proliferation rate. The HER2-negative Luminal B subtype can usually be treated with endocrine therapy + chemotherapy, while the HER2-positive Luminal B subtype is generally treated with chemotherapy + anti-HER2 treatment + endocrine therapy [10]. Moreover, the HER2 overexpressing subtype is characterized by a poor prognosis and rapid progression and is mainly treated with chemotherapy + anti-HER2 therapy [11]. Specifically, the negative expression of ER, PR and HER2 in TNBC is related to its unique biological characteristics and potent heterogeneity, and the only standard treatment recommended for this subtype is chemotherapy [12]. Recently, progress has been made in the early diagnosis and treatment of BC, which makes BC a treatable disease; however, multidrug resistance (MDR) remains a major challenge in the treatment of metastatic BC, as the typical survival time of patients with metastatic BC is only 2–3 years [13]. Unfortunately, this general classification method cannot accurately reflect individual differences [14]. It is worth noting that the existing large-scale databases that contain gene expression data, including the TCGA and ImmPort, enable us to search for potentially reliable BC biomarkers to predict and classify patient prognosis [15].

Increasing evidence has supported the idea that immunocytes in the tumor microenvironment can remarkably promote or inhibit tumor growth, and thus, they can serve as indicators of BC prognosis. In addition, immune escape has been verified as a novel cancer marker [16]. In recent years, through immunotherapies, such as the BC vaccine, monoclonal antibodies (MAb), antibody–drug conjugates (ADCs), checkpoint inhibitors and stimulating molecule agonist antibodies, great progress has been achieved in the treatment of BC patients [17,18,19,20]. Moreover, tumor-infiltrating lymphocytes (TILs) and tumor-related macrophages in BC tissues have also been found to have crucial functions in the immune escape mechanism of tumor cells, and thus, they are remarkably related to patient prognosis [21, 22]. Nonetheless, the molecular events of tumor cell–immunocyte interaction in the BC microenvironment should be further examined and summarized, as the contribution of these events and their potential roles in predicting the prognosis of BC patients should be determined [23].

In this study, a prognostic prediction model for BC was developed and verified based on immune-related genes retrieved according to the clinical features of patients whose data were collected from the TCGA and ImmPort databases. Our findings are promising in that they may help clinicians evaluate the prognosis and therapeutic options for BC patients as well as therapeutic effects.

Materials and methods

Preprocessing of preliminary sample data and initial screening of BC immune-related genes

The most recent clinical follow-up data were downloaded on December 14, 2018 through the TCGA GDC API. In all, 1222 RNA-Seq data samples were included, as shown in Table S1. Overall, 1109 of these 1222 data samples were tumor tissues, while the remaining 113 were normal tissues. In addition, an immune-related gene set, which covered 1811 genes, was also downloaded from the ImmPort database on October 8th, 2018, as shown in Table S2.

First, the retrieved 1109 RNA-seq data samples were preprocessed according to the steps described below: (1) 39 samples with no clinical data and 21 with 0 OS (overall survival) were removed, (2) the normal tissue sample data was removed, (3) genes of FPKM (Fragments Per Kilobase Million) < 1 were also removed from all samples, and (4) only the expression profiles of immune-related genes were preserved. Altogether, 1376 genes were used for the subsequent analysis of the model. The preprocessed data are shown in Table S3, while the sample statistics of the clinical information are displayed in Table 1.

Table 1 Sample statistics of training set and test set

Full size table

Second, 1068 samples were classified into the training set and test set, and random grouping with replacement was performed for all samples for 500 times in advance to eliminate the impact of random allocation bias on model stability. Grouping was performed based on the training set: test set ratio of 0.7:0.3 since the BC sample size was over 1000. Specifically, the most suitable training and test sets were selected based on the following criteria: (1) similar age distributions, clinical stages, follow-up times and death proportions between the two groups; and (2) close binary sample sizes in the two randomly divided datasets after clustering the gene expression profiles. The final training set data (n = 533) are displayed in Table S4, and the test set data are shown in Table S5 (n = 535). Moreover, the clinical information statistics of both the test set and training set samples are presented in Table 1. The final information of both the training set and test set samples is shown in Table 1. No significant difference was observed between the training set and test set data, as verified by the P value, which indicated reasonable sample grouping.

Single-factor survival analysis of immune-related genes in the training set

All immune-related genes were analyzed using the univariate Cox proportional hazards regression model; at the same time, survival data were evaluated by the survival coxph function of R software [24], and p < 0.05 served as the significance threshold.

Screening of specific immune-related genes for BC prognosis and construction of the prognostic prediction model

First, the least absolute shrinkage and selection operator (Lasso, Tibshirani, 1996) algorithm was used to further narrow the range of prognosis-specific immune-related genes under the condition of maintaining high accuracy. Moreover, the glmnet package of R software was used for the lasso Cox regression analysis. Next, to further compress the number of immune-related genes, the R package MASS was employed for stepwise regression analysis using the Akaike information criterion (AIC), which considered the degree of fit of the statistical model as well as the number of parameters used in fitting. The StepAIC method in the MASS package originated from the most complex model, in which one variable was deleted sequentially to reduce the AIC; a smaller value was indicative of a superior model, which demonstrated a sufficient degree of fit and fewer parameters of the model. The risk model of 17 genes (Table S7) was finally obtained using this algorithm. The results of the stepwise regression are presented in Table S8. The formula was as follows:

RiskScore = PIK3CA*0.025861691 + CCR7*0.014541227 + SEMA7A*0.158263093 + ACVR2A* − 0.437173332 + CBL*0.231921725 + PLXNB2*0.014940811 + PLXND1*0.033074364 + APOBEC3F* − 0.314321194 + NFATC2* − 0.257156537 + NFKBIZ* − 0.046977178 + TNFSF4*0.16976996 + DAXX* − 0.034395422 + TLR2*0.023037905 + SEMA3B* − 0.044973358 + HSPA2* − 0.023131493 + TPT1* − 0.001623522 + CCL22* − 0.077745415.

Afterwards, the expression profiles of related genes were collected from both the training set and test set; subsequently, they were incorporated into the model to calculate the RiskScore of all the samples. Then, the median RiskScore served as the threshold by which the samples were classified into either the high-risk group (Risk-H) or the low-risk group (Risk-L); afterwards, a receiver-operating characteristic (ROC) curve analysis, Kaplan–Meier (KM) analysis and gene-clustering analysis were performed to comprehensively assess the efficiency, accuracy and stability of the model in predicting and classifying the prognosis of BC patients.

Functional annotations and signaling pathway enrichment of immune-related genes specific for prognosis

The gene families of the 17 screened genes were annotated according to the human gene classification in the HGNC (Human Gene Nomenclature) database [25]. Specifically, the clusterProfiler package of R software was used for the KEGG (Kyoto Encyclopedia of Genes and Genomes) and GO (Gene Ontology) enrichment analyses for the abovementioned 17 immune-related genes specific for prognosis. Specifically, the gene sets that intersected with the 17 genes were compared in each GO term and KEGG pathway. The GO term or KEGG pathway was considered annotated by the genes if there was an intersection, and finally, all the GO terms and KEGG pathways that annotated to the 17 genes were obtained.

Correlation between the RiskScore and signaling pathways as well as the clinical features of the samples

First, the KEGG functional enrichment scores of all samples were analyzed using the single-sample gene set enrichment analysis (ssGSEA) function of the R software package GSVA [26]. In addition, the correlation with the RiskScore was calculated, and a clustering analysis was performed according to the enrichment score of each sample in each pathway.

Subsequently, the correlations of related factors (including T, N, M, Stage, Age and HER2 expression) with the RiskScore were evaluated. Then, the nomogram model and forest plot were established using the clinical features (such as T, N, M, Stage, Age and HER2 expression) as well as the RiskScore, and the correlations of the RiskScore and the various clinical features with patient survival were assessed. The analysis process is shown in the Figure workflow.

Results

Retrieval of immune-related genes based on the survival and prognosis of BC patients

First, related data were downloaded from the TCGA and ImmPort databases and were then preprocessed (see “Materials and methods”). Subsequently, all the immune-related genes and survival data were analyzed by a univariate Cox proportional hazards regression model using the survival coxph function of R, with the significance level set at p < 0.05, as shown in Table S6. Eventually, 62 significantly different immune-related genes that were also associated with prognosis were discovered. The relationships of the p values of these 62 genes with the HR and expression levels are shown in Fig. 1.

Screening of prognosis-specific immune-related genes and construction of the prognostic prediction model for BC

Sixty-two immune-related genes were recognized, but many of these genes are not suitable for clinical detection. Consequently, the scope of immune-related genes was further narrowed to guarantee high accuracy. Thus, the R software package glmnet was used for the lasso Cox regression to refine the prognostic genes identified above, which led to a reduction in gene numbers from 62 to 29. Moreover, the R package MASS was employed for stepwise regression analysis using the AIC, which considered the degree of fit of the statistical model and the number of parameters used for fitting. On the contrary, the StepAIC method in the MASS package originated from the most complex model, in which one variable was deleted sequentially to reduce the AIC; a smaller value suggested a superior model, which indicates a sufficient degree of fit and fewer parameters of the model. Finally, the risk model of 17 genes was obtained using this algorithm (Table S7). The formula is provided in the “Materials and methods”.

Subsequently, training set samples were incorporated into the formula to calculate the RiskScore for all the samples, and the median RiskScore served as the threshold by which the samples were divided into either the high risk (Risk-H) or low risk (Risk-L) group. Furthermore, ROC curve analysis of the prognostic classification for the RiskScore was performed using the survivalROC package of R software. The OS distribution of the samples was approximately > 2 years (Fig. S1); as a result, the model predicting effect for the 3-, 5- and 10-year survival was evaluated in this study, with an average AUC of approximately 0.789, as presented in Fig. 2a. In addition, the sample distribution in the Risk-H and Risk-L groups under different OS periods is presented in Fig. 2b. As could be observed, no obvious difference in sample size was detected between the 0- and 1-year OS of the Risk-H and Risk-L groups; moreover, the sample size in the Risk-H group after the 5th year was dramatically smaller than that in Risk-L group, which had become markedly significant as the OS extended (Fig. 2c). The clustering results of the training set samples are presented in Fig. 2d. Obviously, the abovementioned 17 genes could be markedly clustered into high and low expression groups, while samples in the training set could also be assigned to two groups; the RiskScore values of the two subclasses were also compared (Fig. 2e).

In addition, to further confirm the stability and reliability of the prognostic prediction model, the expression profiles of these 17 genes were obtained from the test set and then integrated into the model for model verification; at the same time, the RiskScore of the samples was also calculated. Afterwards, data in the test set were used to evaluate the ability of the model to predict the 3-, 5- and 10-year survival rates. As shown in Fig. 3a, the average 3–10-year AUC is 0.726. The sample distribution in both the Risk-H and Risk-L groups at different OS periods is also displayed in Fig. 3b. No significant difference was observed in OS between the Risk-H group and Risk-L group at 0 and 1 year; in addition, the sample size in the Risk-H group after the 3rd year was notably reduced compared with that in the Risk-L group, which became more obvious as the OS increased (Fig. 3c). The clustering results for the samples in the test set and the difference in RiskScore values between the two groups are shown in Fig. 3d and e, respectively.

To further validate the stability as well as the reliability of the prognostic prediction model, the expression profile data of the abovementioned 17 genes were extracted from a total of 1068 samples, followed by substitution into the model. This was performed to calculate the RiskScore values for model validation, as previously described. The series of results are shown in Fig. 4. Taken together, the verification results based on the test set data suggested that the prognostic model established on the basis of the expression profiles of these 17 immune-related genes displayed excellent prediction accuracy and stability in identifying immune-related features.

Finally, the KM survival curves of the risk model, which were constructed based on the 17 genes in predicting the Risk-H and Risk-L groups for the training set, test set and all samples, are shown in Fig. 5. Figure 5a shows the KM survival curve of the training set (p < 0.0001), Fig. 5b shows the KM survival curve of the test set (p < 0.01), and Fig. 5c shows the KM survival curve of all the samples (p < 0.0001).

Functional annotations of immune-related genes and signaling pathway enrichment specific to prognosis

First, the gene families of the 17 obtained genes were annotated in accordance with the human gene classification in the HGNC database. As presented in Table 2, two genes were enriched into the Plexins family, and two genes were also significantly enriched in the Semaphorins family (p < 0.01). Moreover, the clusterProfiler package of R software was also used for the enrichment analyses of the 17 abovementioned immune-related genes specific to prognosis. The results of the GO enrichment are displayed in Fig. 6a, the results of the KEGG pathway enrichment analysis are presented in Fig. 6b, and data related to the GO and KEGG analyses are shown in Table S9 and Table S10, respectively. These results demonstrate that most of the abovementioned genes could be enriched in multiple immunity- and cancer-related biological processes and signaling pathways.

Table 2 17-gene function annotation results

Full size table

Correlation of the RiskScore with the signaling pathways and clinical features of the samples

First, the KEGG functional enrichment scores of samples in the training set and test set and then those of all samples were analyzed using the ssGSEA function of the R software package GSVA. Moreover, the correlations with the RiskScore were also calculated according to the enrichment scores of all pathways in all samples. In all, 45 related KEGG pathways were obtained and are shown in Tables S11-S13. Among them, the top 50% of pathways were selected for the clustering analysis according to their enrichment scores, as shown in Fig. 7. The JAK/STAT signaling pathway, Insulin signaling pathway and Pathways in cancer had the best correlation with a correlation coefficient of approximately 0.36.

Thereafter, the correlations of various factors (including T, N, M, Stage, Age and HER2 expression) with the RiskScore were also analyzed, as shown in Fig. 8. Clearly, obvious associations were found between other features and the RiskScore (p < 0.05), which reveals that the RiskScore model was dependent on these clinical features.

On the contrary, the nomogram model was constructed using the RiskScore along with the clinical features. A nomogram is a method that can be used to intuitively and effectively demonstrate the results of a risk model, which can conveniently predict outcomes. In the nomogram, the straight-line length was used to examine the impacts of different variables (and their values) on the outcome. In this study, the nomogram model was established using the clinical features (including T, N, M, Stage, Age and HER2 expression) together with the RiskScore, as shown in Fig. 9. According to the model results, the RiskScore features remarkably affected the prediction of the survival rate, which indicates that the risk model based on the 17 genes could efficiently predict prognosis.

Finally, the forest plot was established using both the RiskScore and the clinical features. Notably, the forest plot allows us to simply and intuitively illustrate the pooled statistical results of different research factors, which generally treats an ineffective line vertical to the X-axis (generally at the coordinate of X = 1 or 0) as the center, while several segments parallel to the X-axis represent the effect size and 95% confidence interval (CI) of each study. In this study, the forest plot was generated using the clinical features, such as T, N, Stage, Grade, Age, Alcohol consumption and Smoking status; the RiskScore was also calculated by the risk model, as shown in Fig. 10. The HR of the RiskScore was evidently increased compared with the HRs of other clinical features (p < 0.05). The multivariate Cox regression analyses of the various clinical features and the RiskScore are presented in Table S14.

Conclusions

BC is a highly complex and heterogeneous malignancy that is associated with heterogeneous molecular profiles, clinical responses to therapeutics and prognoses [27]. Tumor heterogeneity is responsible for the various BC subtypes, which each have different prognoses and sensitivities to chemotherapy [28]. In addition, no consistent therapeutic benefits can be achieved among different patients from clinical medication, which can be ascribed to their potential toxicities and side effects. As a result, postoperative systemic adjuvant chemotherapy remains a source of controversy in clinical practice. Therefore, it is crucial to discover potential BC biomarkers that can predict patient prognosis and recurrence, as well as to administer early adjuvant chemotherapy to high-risk patients who may benefit [29].

BC has been recognized to be immunogenic, as it involves multiple putative tumor-associated antigens (TAAs), such as HER2 and Mucin 1 (MUC1) [30, 31]. Notably, over the last decade, these TAAs have been treated as targets for the development of new cancer vaccines and bispecific antibodies (bsAbs), among which, some have been translated into tumor-specific immune responses and have been verified to be clinically beneficial [32]. Immunocytes in BC tissue primarily consist of T-lymphocytes (70–80%), while the remaining components are derived from B lymphocytes, macrophages, natural killer cells and antigen-presenting cells (APCs) [33, 34]. Of these, T cells can be activated through recognition of the tumor antigens presented by APCs; typically, the intensity and quality of T cell activation signals are related to a variety of interactions between the receptor and ligand [35].

Substantial evidence has supported the concept that immunocytes in the tumor microenvironment can effectively enhance or suppress tumor growth, which can thereby serve as a prognostic indicator in BC patients. The interactions between the immune system and incipient cancer cells, which is also referred to as immunoediting, can be divided into 3 phases, namely, elimination, equilibrium, and escape [36]. Of these phases, the elimination process suggests that the innate and adaptive arms of the immune system will recognize the new antigens (derived from mutations or translocations) on the surface of incipient cancer cells, which is associated with MHC-I; alternatively, the distress signals can be expressed by the transformed cells with chromosomal changes (such as aneuploidy or hyperploidy). Finally, the immune system will eliminate these abnormal cells [37]. The equilibrium status will be reached when the immune system fails to eliminate the transformed cells but can stop them from further progression, and such a process has been deemed to be the dormancy phase during the development of primary cancer. This phase is mediated by the equilibrium between cells and cytokines (such as IL-12, IFN-γ, TNF-α, CD4 TH1, CD8 + T cells, NK cells and γδT cells) that promote elimination as well as those that promote the persistence of nascent tumors (including IL-23, IL-6, IL-10, TGF-β, NKT cells, CD4 Th2, Foxp3 + regulatory T [Treg] cells, and MDSCs) [38]. On the contrary, monocytes play a crucial role in this process, during which they may differentiate into proinflammatory M1 or anti-inflammatory M2 types as a result of the effects of the tumor microenvironment [39]. Immune escape of cancer cells may occur through various mechanisms. In HR-positive BC, the absence of strong tumor antigens and low MHC-I expression allow for tumor progression that is unnoticed by the immune system [40]. Estrogen exerts an immunosuppressive effect on the tumor microenvironment, which can boost tolerance to weak immunogenic cancers; moreover, estrogen receptor (ER) can be expressed on most immunocytes, including macrophages, T and B lymphocytes, and NK cells [41]. The immune response can be polarized to the Th2- rather than the Th1-effector immune response in the presence of estrogen [42, 43]. In HER2-positive cancer cells, MHC-I presentation is negatively correlated with HER2 expression [44]. Typically, triple-negative breast cancer (TNBC) exhibits a spectrum of MHC-I presentation and high antigen expression in the tumor, but immune escape in TNBC has been found to be predominantly related to the development of the immunosuppressive tumor microenvironment (including Tregs, MDSCs and PD-1/PD-L1) [45]. As a result, in the era of immunotherapy, it is particularly important to be familiar with the molecular events in the tumor-immune microenvironment to search for biomarkers related to survival prediction in patients with BC of any subtype.

In this study, 17 prognosis-specific immune-related genes were discovered through mining, statistics and sorting of big data such as that found in the TCGA and ImmPort databases; moreover, a prognostic prediction model was also constructed, and the RiskScore of the patients was calculated. Finally, prediction ability and verification were determined. Our findings suggest that the prognostic prediction model that was constructed based on the expression profiles of specific immune-related genes can further classify patients with a definite clinical stage into different subgroups based on the predicted survival results. Furthermore, the RiskScore is calculated according to the expression profiles of specific immune-related genes and should be used in combination with the clinical features of patients, which should more precisely predict BC patient survival. Taken together, this model may contribute to the identification of new BC markers in the clinic and can provide multiple targets for the precise medical treatment of BC. The model can also be used for the accurate classification of patients at the molecular subtype level. Finally, this model is promising in that it can guide clinicians in determining the prognosis, clinical diagnosis and appropriate therapy for BC patients with different immunophenotypes.

Data availability

The supplementary data used and generated during the current study are available from the corresponding authors on reasonable request.

References

Waks AG, Winer EP. Breast cancer treatment. JAMA. 2019;321(3):316.
PubMed Google Scholar
Bernstein L. Epidemiology of endocrine-related risk factors for breast cancer. J Mammary Gland Biol Neoplas. 2002;7(1):3–15.
Google Scholar
Seiler A, Chen MA, Brown RL, Fagundes CP. Obesity, dietary factors, nutrition, and breast cancer risk. Curr Breast Cancer Rep. 2018;10(1):14–27.
PubMed PubMed Central Google Scholar
Laas E, Hamy AS, Michel AS, Panchbhaya N, Faron M, Lam T, Carrez S, Pierga JY, Rouzier R, Lerebours F, et al. Impact of time to local recurrence on the occurrence of metastasis in breast cancer patients treated with neoadjuvant chemotherapy: a random forest survival approach. PLoS ONE. 2019;14(1):e0208807.
CAS PubMed PubMed Central Google Scholar
Chaudhary LN, Wilkinson KH, Kong A. Triple-negative breast cancer: who should receive neoadjuvant chemotherapy? Surg Oncol Clin N Am. 2018;27(1):141–53.
PubMed Google Scholar
Charalampoudis P, Karakatsanis A. Neoadjuvant chemotherapy for early breast cancer. Lancet Oncol. 2018;19(3):e128.
PubMed Google Scholar
Cheng Y, Wu Y, Wu L. Gene expression-guided adjuvant chemotherapy in breast cancer. New Engl J Med. 2018;379(17):1680–1.
PubMed Google Scholar
Xiao W, Zheng S, Yang A, Zhang X, Zou Y, Tang H, Xie X. Breast cancer subtypes and the risk of distant metastasis at initial diagnosis: a population-based study. Cancer Manag Res. 2018;10:5329–38.
CAS PubMed PubMed Central Google Scholar
Park S, Lee SK, Paik HJ, Ryu JM, Kim I, Bae SY, Yu J, Kim SW, Lee JE, Nam SJ. Adjuvant endocrine therapy alone in patients with node-positive, luminal A type breast cancer. Medicine. 2017;96(22):e6777.
CAS PubMed PubMed Central Google Scholar
Alfarsi L, Johnston S, Liu DX, Rakha E, Green AR. Current issues with luminal subtype classification in terms of prediction of benefit from endocrine therapy in early breast cancer. Histopathology. 2018;73(4):545–58.
PubMed Google Scholar
Veitch Z, Khan OF, Tilley D, Ribnikar D, Kostaras X, King K, Tang P, Lupichuk S. Real-world outcomes of adjuvant chemotherapy for node-negative and node-positive HER2-positive breast cancer. J Nat Compr Cancer Netw JNCCN. 2019;17(1):47–56.
CAS Google Scholar
De Laurentiis M, Cianniello D, Caputo R, Stanzione B, Arpino G, Cinieri S, Lorusso V, De Placido S. Treatment of triple negative breast cancer (TNBC): current options and future perspectives. Cancer Treat Rev. 2010;36(Suppl 3):S80-86.
PubMed Google Scholar
Li Y, Gao X, Yu Z, Liu B, Pan W, Li N, Tang B. Reversing multidrug resistance by multiplexed gene silencing for enhanced breast cancer chemotherapy. ACS Appl Mater Interfaces. 2018;10(18):15461–6.
CAS PubMed Google Scholar
Lee G, Bang L, Kim SY, Kim D, Sohn KA. Identifying subtype-specific associations between gene expression and DNA methylation profiles in breast cancer. BMC Med Genomics. 2017;10(Suppl 1):28.
PubMed PubMed Central Google Scholar
Bhattacharya S, Dunn P, Thomas CG, Smith B, Schaefer H, Chen J, Hu Z, Zalocusky KA, Shankar RD, Shen-Orr SS, et al. ImmPort, toward repurposing of open access immunological assay data for translational and clinical research. Scie Data. 2018;5:180015.
CAS Google Scholar
Steven A, Seliger B. The role of immune escape and immune cell infiltration in breast cancer. Breast Care. 2018;13(1):16–21.
PubMed PubMed Central Google Scholar
Allahverdiyev A, Tari G, Bagirova M, Abamor ES. Current approaches in development of immunotherapeutic vaccines for breast cancer. J Breast Cancer. 2018;21(4):343–53.
PubMed PubMed Central Google Scholar
Cortes J, Curigliano G, Dieras V. Expert perspectives on biosimilar monoclonal antibodies in breast cancer. Breast Cancer Res Treat. 2014;144(2):233–9.
CAS PubMed PubMed Central Google Scholar
Bardia A. Antibody-drug conjugates in breast cancer. Clin Adv Hematol Oncol H and O. 2017;15(4):251–4.
PubMed Google Scholar
Bischoff J. Checkpoint inhibitors in breast cancer: current status and future directions. Breast care. 2018;13(1):27–31.
PubMed PubMed Central Google Scholar
Zabotina TN, Korotkova OV, Chertkova AI, Zakharova EN, Tabakov DV, Dzhgamadze NT, Savostikova MV, Artamonova EV, Khailenko VA, Kovalenko EI, et al. Tumor-infiltrating lymphocytes in breast cancer. Association with clinical and pathological parameters. Bull Exp Biol Med. 2018;166(2):241–4.
CAS PubMed Google Scholar
Wang J, Chen H, Chen X, Lin H. Expression of tumor-related macrophages and cytokines after surgery of triple-negative breast cancer patients and its implications. Med Sci Monit Int Med J Exp Clin Res. 2016;22:115–20.
CAS Google Scholar
Eltoukhy HS, Sinha G, Moore CA, Sandiford OA, Rameshwar P. Immune modulation by a cellular network of mesenchymal stem cells and breast cancer cell subsets: Implication for cancer therapy. Cell Immunol. 2018;326:33–41.
CAS PubMed Google Scholar
Zhang Y, Li H, Zhang W, Che Y, Bai W, Huang G. LASSObased CoxPH model identifies an 11lncRNA signature for prognosis prediction in gastric cancer. Mol Med Rep. 2018;18(6):5579–93.
CAS PubMed PubMed Central Google Scholar
Braschi B, Denny P, Gray K, Jones T, Seal R, Tweedie S, Yates B, Bruford E. Genenames.org: the HGNC and VGNC resources in 2019. Nucleic Acids Res. 2019;47(D1):D786–92.
CAS PubMed Google Scholar
Hanzelmann S, Castelo R, Guinney J. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinformatics. 2013;14:7.
PubMed PubMed Central Google Scholar
Cancer Genome Atlas N. Comprehensive molecular portraits of human breast tumours. Nature. 2012;490(7418):61–70.
Google Scholar
Tazaki E, Shishido-Hara Y, Mizutani N, Nomura S, Isaka H, Ito H, Imi K, Imoto S, Kamma H. Histopathologcial and clonal study of combined lobular and ductal carcinoma of the breast. Pathol Int. 2013;63(6):297–304.
CAS PubMed PubMed Central Google Scholar
Shuai Y, Ma L. Prognostic value of pathologic complete response and the alteration of breast cancer immunohistochemical biomarkers after neoadjuvant chemotherapy. Pathol Res Pract. 2019;215(1):29–33.
CAS PubMed Google Scholar
Fremd C, Stefanovic S, Beckhove P, Pritsch M, Lim H, Wallwiener M, Heil J, Golatta M, Rom J, Sohn C, et al. Mucin 1-specific B cell immune responses and their impact on overall survival in breast cancer patients. Oncoimmunology. 2016;5(1):e1057387.
PubMed Google Scholar
Conley SJ, Bosco EE, Tice DA, Hollingsworth RE, Herbst R, Xiao Z. HER2 drives Mucin-like 1 to control proliferation in breast cancer cells. Oncogene. 2016;35(32):4225–34.
CAS PubMed PubMed Central Google Scholar
Ye H, Sun C, Ren P, Dai L, Peng B, Wang K, Qian W, Zhang J. Mini-array of multiple tumor-associated antigens (TAAs) in the immunodiagnosis of breast cancer. Oncol Lett. 2013;5(2):663–8.
CAS PubMed Google Scholar
Coventry BJ, Weightman MJ, Bradley J, Skinner JM. Immune profiling in human breast cancer using high-sensitivity detection and analysis techniques. JRSM Open. 2015;6(9):2054270415603909.
PubMed PubMed Central Google Scholar
Pusztai L, Karn T, Safonov A, Abu-Khalaf MM, Bianchini G. New strategies in breast cancer: immunotherapy. Clin Cancer Res Off J Am Assoc Cancer Res. 2016;22(9):2105–10.
CAS Google Scholar
Pardoll DM. The blockade of immune checkpoints in cancer immunotherapy. Nat Rev Cancer. 2012;12(4):252–64.
CAS PubMed PubMed Central Google Scholar
Mittal D, Gubin MM, Schreiber RD, Smyth MJ. New insights into cancer immunoediting and its three component phases–elimination, equilibrium and escape. Curr Opin Immunol. 2014;27:16–25.
CAS PubMed PubMed Central Google Scholar
Croxford JL, Tang ML, Pan MF, Huang CW, Kamran N, Phua CM, Chng WJ, Ng SB, Raulet DH, Gasser S. ATM-dependent spontaneous regression of early Emu-myc-induced murine B-cell leukemia depends on natural killer and T cells. Blood. 2013;121(13):2512–21.
CAS PubMed PubMed Central Google Scholar
Wu X, Peng M, Huang B, Zhang H, Wang H, Huang B, Xue Z, Zhang L, Da Y, Yang D, et al. Immune microenvironment profiles of tumor immune equilibrium and immune escape states of mouse sarcoma. Cancer Lett. 2013;340(1):124–33.
CAS PubMed Google Scholar
Jinushi M, Komohara Y. Tumor-associated macrophages as an emerging target against tumors: creating a new path from bench to bedside. Biochem Biophys Acta. 2015;1855(2):123–30.
CAS PubMed Google Scholar
Lee HJ, Song IH, Park IA, Heo SH, Kim YA, Ahn JH, Gong G. Differential expression of major histocompatibility complex class I in subtypes of breast cancer is associated with estrogen receptor and interferon signaling. Oncotarget. 2016;7(21):30119–32.
PubMed PubMed Central Google Scholar
Pierdominici M, Maselli A, Colasanti T, Giammarioli AM, Delunardo F, Vacirca D, Sanchez M, Giovannetti A, Malorni W, Ortona E. Estrogen receptor profiles in human peripheral blood lymphocytes. Immunol Lett. 2010;132(1–2):79–85.
CAS PubMed Google Scholar
Hu ZY, Xiao H, Xiao M, Tang Y, Sun J, Xie ZM, Ouyang Q. Inducing or preventing subsequent malignancies for breast cancer survivors? Double-edged sword of estrogen receptor and progesterone receptor. Clin Breast Cancer. 2018;18(5):e1149–63.
CAS PubMed Google Scholar
Salem ML. Estrogen, a double-edged sword: modulation of TH1- and TH2-mediated inflammations by differential regulation of TH1/TH2 cytokine production. Curr Drug Targets Inflamm Allergy. 2004;3(1):97–104.
CAS PubMed Google Scholar
Inoue M, Mimura K, Izawa S, Shiraishi K, Inoue A, Shiba S, Watanabe M, Maruyama T, Kawaguchi Y, Inoue S, et al. Expression of MHC class I on breast cancer cells correlates inversely with HER2 expression. Oncoimmunology. 2012;1(7):1104–10.
PubMed PubMed Central Google Scholar
Engel JB, Honig A, Kapp M, Hahne JC, Meyer SR, Dietl J, Segerer SE. Mechanisms of tumor immune escape in triple-negative breast cancers (TNBC) with and without mutated BRCA 1. Arch Gynecol Obstet. 2014;289(1):141–7.
CAS PubMed Google Scholar

Download references

Acknowledgements

Not applicable.

Funding

Not applicable.

Author information

Guoyu Mu, Hong Ji and Hui He equally contributed to the work.

Authors and Affiliations

Breast Surgery Department, The First Affiliated Hospital of Dalian Medical University, Dalian, 116000, Liaoning, China
Guoyu Mu & Hongjiang Wang
Gynecology and Obstetrics Department, The Second Affiliated Hospital of Dalian Medical University, Zhongshan Road 467, Dalian, 116023, Liaoning, China
Hong Ji
Department of Laparoscopic Surgery, The First Affiliated Hospital of Dalian Medical University, Dalian, 116000, Liaoning, China
Hui He

Authors

Guoyu Mu
View author publications
You can also search for this author in PubMed Google Scholar
Hong Ji
View author publications
You can also search for this author in PubMed Google Scholar
Hui He
View author publications
You can also search for this author in PubMed Google Scholar
Hongjiang Wang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

GYM and HJW designed the study. GYM and HH collected and analyzed the data. Guoyu Mu and HH wrote the manuscript. All the authors discussed the results and contributed to the final draft of the manuscript. All the authors read and approved the manuscript and agree to be accountable for all aspects of the research in ensuring that the accuracy and integrity of the work are appropriately investigated and resolved.

Corresponding author

Correspondence to Hongjiang Wang.

Ethics declarations

Conflict of interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary file1 (TIF 162 KB)

Supplementary file2 (TIF 607 KB)

Supplementary file3 (TIF 1026 KB)

Supplementary file4 (TIF 530 KB)

Supplementary file5 (TIF 1113 KB)

Supplementary file6 (TIF 1012 KB)

Supplementary file7 (TXT 3420 KB)

Supplementary file8 (TXT 236 KB)

Supplementary file9 (TXT 19887 KB)

Supplementary file10 (TXT 9933 KB)

Supplementary file11 (TXT 9959 KB)

Supplementary file12 (XLSX 203 KB)

Rights and permissions

This article is published under an open access license. Please check the 'Copyright Information' section either on this page or in the PDF for details of this license and what re-use is permitted. If your intended use exceeds what is permitted by the license or if you are unable to locate the licence and re-use information, please contact the Rights and Permissions team.

About this article

Cite this article

Mu, G., Ji, H., He, H. et al. Immune-related gene data-based molecular subtyping related to the prognosis of breast cancer patients. Breast Cancer 28, 513–526 (2021). https://doi.org/10.1007/s12282-020-01191-z

Download citation

Received: 21 October 2020
Accepted: 14 November 2020
Published: 27 November 2020
Issue Date: March 2021
DOI: https://doi.org/10.1007/s12282-020-01191-z

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Immune-related gene data-based molecular subtyping related to the prognosis of breast cancer patients

Abstract

Background

Methods

Results

Conclusions

Similar content being viewed by others

Background

Materials and methods

Preprocessing of preliminary sample data and initial screening of BC immune-related genes

Single-factor survival analysis of immune-related genes in the training set

Screening of specific immune-related genes for BC prognosis and construction of the prognostic prediction model

Functional annotations and signaling pathway enrichment of immune-related genes specific for prognosis

Correlation between the RiskScore and signaling pathways as well as the clinical features of the samples

Results

Retrieval of immune-related genes based on the survival and prognosis of BC patients

Screening of prognosis-specific immune-related genes and construction of the prognostic prediction model for BC

Functional annotations of immune-related genes and signaling pathway enrichment specific to prognosis

Correlation of the RiskScore with the signaling pathways and clinical features of the samples

Conclusions

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interests

Additional information

Publisher's Note

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation