Introduction

Survival for patients with invasive breast cancer has increased in the last decade due to new and improved therapeutic options as well as new insights in molecular biology. Methods to select patients based on the tumor phenotype are important to reduce over- and undertreatment, for example, gene expression profiles that identify subtypes [1, 2] associated with higher risk of metastasis. Although these techniques result in prognostic and predictive valuable information for specific patient groups, optimization of risk assessment might benefit from further improvement.

Despite an important update on the role of the microenvironment on cancer development by Hanahan et al. [3, 4], the classification system for predicting metastasis and disease-specific survival is still based on traditional tumor staging criteria (AJCC/UICC-TNM Classification) [5,6,7] which focus largely on the tumor cell autonomous processes and not on the microenvironment.

Complex interactions occur between cancer cells and cells in the tumor microenvironment, such as immune and stromal cells. A high stromal content has been associated with worse prognosis in different solid cancer types including breast cancer and especially in triple negative breast cancer [8,9,10,11,12,13,14]. Together with the development of malignant tumor stroma, the connective tissue framework of the tumor becomes active. The collagen bundles degrade, the number of inflammatory cells increases, fibroblasts differentiate into myofibroblasts and proliferate and angiogenesis increases [15]. Also, the cellular immune response has a fundamental role in cancer development. An example of the prognostic value of the activity of the immune system is represented by the Immunoscore which analyzes the distribution of CD3+ lymphocytes and CD8+ cytotoxic T cells [16]. In breast cancer, especially in triple negative tumors, the increased presence of tumor-infiltrating lymphocytes has been associated with good prognosis [17, 18]. De Kruijf et al. showed that the immune status of tumors based on six cellular immune markers has a statistically significant effect on prognosis preferable for tumors with a high immune status [19]. These six cellular immune markers (HLA-E, HLA-G, classical HLA class I (HLA-A, HLA-B and HLA-C), natural killer (NK) cells, cytotoxic T-lymphocytes (CTLs) and regulatory T (Treg) cells) were selected based on biological rationale and the balance between their various interactions.

Suggestions have been made about the influence of tumor stroma on suppression of the immune response [9, 20,21,22,23]. In this present study, the prognostic value of the interplay between tumor–stroma ratio (TSR) and the immune status of tumors in breast cancer patients was evaluated. We hypothesize that stroma-high tumors in combination with a low immune status behave more aggressively resulting in a high risk of disease progression.

Materials and methods

Study population

The study population was assessed retrospectively and consists of primary non-metastasized breast cancer patients. The patients were primarily treated with surgery between 1985 and 1994 in Leiden University Medical Center (N = 584). Exclusion criteria were bilateral breast tumors and a history of cancer (other than basal cell carcinoma or cervical carcinoma in situ). The resected breast tumors were graded by experienced breast cancer pathologists using current pathological standards. All samples were handled in a coded fashion, according to national ethical guidelines (“Code for Proper Secondary Use of Human Tissue”, Dutch Federation of Medical Scientific Societies). Approval of the study was obtained from the LUMC Medical Ethics Committee. The recommendations for reporting on tumor markers (the REMARK criteria) in prognostic studies were respected [24].

Tumor–stroma ratio

The TSR was visually estimated on routine Hematoxylin and Eosin (H&E) stained slides from formalin-fixed paraffin-embedded (FFPE) blocks of the primary tumor (N = 584) as previously described by our group [25]. Thirty-two percent of the tissues were scored in a blinded fashion by a second observer, with a concordance of classification of 94% (Cohen’s kappa = .85). Ten tissues were not eligible for TSR scoring due to poor quality. Evaluation of TSR started with microscopical orientation using a 5 × objective. Subsequently, a 10 × objective was used in the most stroma-abundant area. The field of highest stromal percentage was selected and scored per tenfold increments. Tumor cells must be present on all sides (north, east, south and west). Stroma percentage ≤ 50% was categorized as stroma-low and stroma percentage > 50% as stroma-high (Supplementary Fig. 1) [8, 12].

Immunohistochemistry

Tissue sections from intra-operatively derived FFPE tissue micro-array (TMA) material and immunohistochemistry analysis were used as previously described [19, 26, 27]. Whole FFPE sections were immunohistochemically stained with mouse antibodies against CD8+ and PEN5 recognizing CTLs and NK cells, respectively. TMA tissue sections were used for immunohistochemical stainings for the expression of classical HLA class I (anti-HLA-A and anti-HLAB/C), non-classical HLA-E, HLA-G and Treg cell infiltration as previously described in the literature [26, 27].

Quantification of CD8+ cells and PEN5 cells was performed in a blinded setup by two independent observers. Tumor infiltration of CD8+ was divided into low CTL infiltration (0–100 CD8+ tumor infiltrating cells/mm2) and high CTL infiltration (100–3.000 CD8+ tumor infiltrating cells/mm2). Tumor infiltration of NK cells was divided into the presence or absence of NK cells. Classical HLA class I was categorized into loss versus expression and HLA-E divided into no expression versus expression. HLA-G and Treg infiltration were categorized in absent versus present (Supplementary Fig. 2).

These six immune markers were classified into three immune status profile groups (Fig. 1) as previously described by de Kruijf et al. for this cohort [19].

Fig. 1
figure 1

Evaluation of immune status and classification. HLA human leukocyte antigen, CTL cytotoxic T-lymphocytes, Treg regulatory T cells, NK natural killer

Statistical analysis

Statistical analyses were performed using IBM SPSS statistics (version 23.0 for Windows). The inter-observer agreement in TSR, CTL and PEN5 evaluation is represented by Cohen’s Kappa value. A value above 0.6 was valid. Pearson χ 2 test was used for the evaluation of statistically significant differences between included and excluded patients, distribution of the separate immune markers between stroma-high and stroma-low cases and three immune status categories. A P value < .05 was considered statistically significant. The Kaplan–Meier method was performed to analyze the overall survival (OS) and recurrence free period (RFP). The log-rank test was applied for comparison between these curves. A P value < .05 was considered statistically significant. The time from date of surgery until any recurrence of breast cancer was defined as RFP. OS was defined as the time from date of surgery until death from any cause. Univariate and multivariate analyses for RFP and OS were calculated by Cox proportional hazard analysis. Variables with P value < .10 in univariate analysis were entered in multivariate analysis. Effect modification was evaluated by adding interaction in Cox regression analysis. Stepwise regression analysis (backward and forward) of the different immune cells was evaluated. Missing values were not included.

Results

Patients

Of all patients (N = 584), FFPE blocks were available. TSR could be evaluated in 98% of the cases (N = 574). In 43% of the cases, no classification of the immune status could be made due to the low quality of tissues or TMAs. The loss or damage of TMA cores is a known problem associated with preparation, staining and mounting of TMA slides. Moreover, the cores we used were rather small. Since several markers were combined in the profiles, the patient was excluded from further analyses when data of one or more markers were missing. Figure 2 provides a flowchart of subjects included. By comparison of prognostic parameters, no differences were found between included (N = 344) and excluded cases (N = 230), except for the treatment with hormonal therapy (P < .001). This can be explained by the fact that this therapy was only given sporadically between 1985 and 1988. No statistically significant differences were found for age, grade, tumor stage, tumor type, nodal stage, histological type, estrogen receptor, progesterone receptor, HER2 expression, TSR, chemotherapy and radiotherapy in these two groups.

Fig. 2
figure 2

Flowchart of subject inclusion. * For categorizing in one of the three immune status categories not all six groups need to be known. FFPE formalin-fixed paraffin-embedded, NK natural killer, CTL cytotoxic T-lymphocyte, Treg regulatory T, TSR tumor–stroma ratio

The median follow-up of the 344 included patients was 10.2 years (0.2–22.4 years). The mean age at presentation was 58.0 years (27.5–90.2 years). There is no statistically significant difference in the distribution of the separate markers between stroma-high and stroma-low cases, nor in the three immune status categories (P = .30). Table 1 provides a detailed overview of the immune markers stratified by TSR and Table 2 shows the clinicopathological and treatment characteristics.

Table 1 Distribution of the separate elements of the three immune status profiles
Table 2 Patient characteristics

Prognostic value of the TSR

Tumors with low and high stromal contents were observed in 51.5 and 48.5% of the cases (N = 574), respectively. Patients with stroma-high tumors had a worse RFP (HR 1.75; 95% CI 1.37–2.25; P < .001) and OS (HR 1.28; 95% CI 1.04–1.58; P = .02) compared to patients with stroma-low tumors (not shown). After 10 years, 32% of the patients with a stroma-low tumor had developed a recurrence of disease compared to 50% of patients with a stroma-high tumor. These results for RFP in favor for stroma-low tumors were also seen in the group of patients (N = 344) in which the immune status could be assessed (HR 1.76; 95% CI 1.28–2.42; P < .001) (Fig. 3a) with a 10-year RFP of 67% of patients in the stroma-low group compared to 49% in the stroma-high group. OS showed no significant difference between both stroma groups (HR 1.3; 95% CI .095–1.64; P = .114). Analysis for breast cancer subgroups showed that patients with a triple negative tumor have a high hazard ratio of 2.4 (95% CI 1.32–4.40; P = .003) for RFP in both the total group (known TSR) and in the selected group (known TSR and immune status). Furthermore, within the luminal A subgroup the TSR showed a significant difference in RFP (HR 1.5; 95% CI 1.13–2.19; P = .008), but not for OS. For the other subgroups (Luminal B and HER2-like tumors), no prognostic value of the TSR was found (Supplementary Table 1a, b).

Fig. 3
figure 3

Kaplan–Meier analysis for RFP of TSR, immune status profiles and classical HLA class I. a RFP for stroma low and high tumors, b RFP for three IS profiles, c RFP for TSR combined with IS profiles, d RFP for TSR combined with classical HLA class I. IS immune status, RFP recurrence free period, TSR tumor–stroma ratio

Prognostic value of the immune status of tumors

The immune status of tumors was classified as high in 18.9%, intermediate in 63.1% and low in 18.0% of the breast cancer cases. The RFP (Fig. 3b) and OS curves (not shown) of the three immune status categories were statistically significant (P < .001) in which patients with a high immune status profile had a better outcome compared to patients with a low immune status profile. After 10 years of follow-up, 79% of the patients in the high immune status category did not develop recurrence of disease compared to 58% in intermediate immune status category and 36% in low immune status category. Analysis for breast cancer subgroups showed that patients with a luminal A or triple negative tumor have a worse prognosis for both RFP and OS (Supplementary Table 2).

Prognostic value of TSR and immune status of tumors combined

The RFP data of TSR and immune status subtypes were combined and plotted in Fig. 3c. The overall P value between the subgroups was statistically significant (P < .001) (Table 3). A trend was observed for stroma-high tumors compared to stroma-low tumors calculated for the high immune status profile (P = .15) and intermediate immune status profile (P = .08). However, only for the low immune status profile the difference between stroma-high and stroma-low tumors showed significance (P = .002). Ten years RFP for stroma-low and high immune status showed a recurrence rate of 87 versus 17% of patients with stroma-high and low immune status tumors.

Table 3 Univariate and multivariate analysis for RFP and OS calculated by Cox proportional hazard analysis

Table 3 shows the results of univariate and multivariate Cox regression analyses. TSR remained statistically significant for RFP (P < .001) in multivariate Cox regression analysis and the immune status for RFP (P < .001) and OS (P = .001). Effect modification of stroma and immune status was not statistically significant. As expected, the TSR combined with immune status showed additional prognostic value in the analyzed patient cohort.

Prognostic value of TSR combined with classical HLA class I

To evaluate whether one or more of the six cellular immune cells were decisive in the immune status categories, a stepwise regression analysis was performed. In this analysis, classical HLA class I showed to be statistically significant in the immune status categories for RFP (P = .007), but not for OS (P = .06), whereas the other immune cells were not. These results indicate that classical HLA class I is the most determinant factor in the three immune status profiles. In 523 of the 574 cases (91%), classical HLA class I could be assessed. Tumors expressing classical HLA class I had significantly less recurrences (P = .001), with 10 years RFP of 66 versus 55%. In the same group, TSR showed RFP of 67 versus 49% in benefit for stroma-low tumors (P < .001).

Figure 3d shows a statistically significant difference (P < .001) for RFP for the combination of TSR and classical HLA class I. This indicates that patients with a stroma-low tumor and expression of classical HLA class I have a better prognosis compared to patients with a stroma-high tumor and loss of expression or downregulation of classical HLA class I with 10-year RFP 72% versus 46%, respectively.

In triple negative tumors, classical HLA class I (N = 92) was also of prognostic value (HR 0.28; 95% CI 0.15–0.55; P < .001). Patients with loss of expression or downregulation of classical HLA class I showed a 10-year RFP of 35% compared to 73% of the patients in which HLA class I is expressed. TSR and classical HLA class I combined showed significant difference in RFP (P = .001). Patients with stroma-low tumors and expression of classical HLA class I showed fewer recurrences compared to patients with stroma-high tumors and loss of expression or downregulation of classical HLA class with 10-year RFP of 75 versus 26%, respectively.

Discussion

There is a growing body of evidence that TSR and immune cell response in cancer development might be important factors in patient stratification for treatment decision making. The relation of the stromal involvement and immune response for the determination of patients for adjuvant treatment has merely been investigated. Gujam et al. described the relationship between TSR and clinicopathological parameters as tumor inflammatory infiltrate, CD68+ macrophage infiltrate and CD4+ and CD8+ T-lymphocyte infiltrate in ductal breast cancer. They concluded that a high TSR was consistently associated with low tumor inflammatory infiltrate [9]. Hynes et al. also published on the combination of TSR with peritumoral diffuse lymphoid inflammation and Crohn’s disease-like reaction in stage II/III colon cancer. A combination of these three parameters showed a significant association with survival outcomes [23].

Our study showed that TSR and the combination of six cellular immune cells, categorized into three immune status subgroups, are both independent prognostic factors. A combination of both parameters even strengthens each other’s’ effect.

The six cellular immune cells were selected based on biological rationale and the balance between their various interactions. Classical HLA class I presents tumor-associated antigens on the cell surface. CTLs are capable of recognizing the presentation of these antigens by HLA-A, HLA-B or HLA-C [28]. Tumor cells can escape recognition by CTLs by losing classical HLA class I expression. This makes the tumor cells more prone for recognition by NK cells [29]. On the other hand, HLA-E and HLA-G, also known as non-classical HLA class I, play a crucial role in the immune surveillance by NK cells. Expression of non-classical HLA I has an inhibitory effect on the function of NK cells [29,30,31]. Other cells which are important in tumor development are Tregs. Tumor cells can escape immune surveillance by attraction and induction of Tregs [32].

In this study, the prognostic value of TSR in addition with classical HLA class I was also shown. The effect was smaller than the combination with three immune status subtypes, but better applicable in daily routine pathology practice. Patients with stroma-low tumors also expressing classical HLA class I have a better prognosis than patients with stroma-high tumors with loss of expression or downregulation of classical HLA class I.

The estimation of TSR is simple, inexpensive and takes only a few minutes. It can be done on regular H&E slides during routine pathology investigation of the resected tissue. Since the introduction of pre-operative chemotherapy, which leads to the formation of non-desmoplastic stroma and, therefore, the resection material unsuitable for TSR scoring, it might be of interest to score the TSR on tumor biopsies. In esophageal adenocarcinoma biopsies, the reproducibility of TSR scoring on biopsies was good [33], and it is plausible that this is even better in breast cancer due to lack of the muscular area [34]. Promising is the current interest in automation of the TSR parameter [13]. Assessment of the six cellular immune markers is relatively time consuming. The assessment of only classical HLA class I takes less effort and may help optimize risk stratification in combination with TSR.

Patients with early stage breast cancer are often treated with adjuvant systemic therapy (endocrine therapy, chemotherapy or agents against HER2) based on tumor characteristics such as HER2 status, tumor size and lymph node status. A substantial number of women with breast cancer is overtreated. These patients do not benefit from adjuvant therapy but are exposed to the risk of toxic effects. The TSR, immune status or a combination of these prognostic markers might be used to select patients who could be spared adjuvant therapy or to select patients more confident to treatment and which can be monitored for recurrences more frequently. Especially patients with stroma-high tumors and low immune status could possibly benefit from more aggressive treatment whereas for patients with stroma-low tumors and high immune status less aggressive treatment could be discussed. The method described in this paper could give valuable additional pathology-based information for patients with invasive breast cancer.

Conclusion

Simple H&E stained sections contain more information than previously fathomed. The TSR is a simple, fast and cheap method for the identification of patients with more aggressive disease. Tumor immune status profiling is promising for further prognostication and the achievement of tailored treatment for breast cancer patients. The combination of TSR and immune status of tumors is a strong prognosticator, applicable for daily routine use.