Background

The choice of systemic therapy for breast cancer (BC) is based on the pathological diagnosis, which includes histological classification, grade, estrogen receptor (ER) status, progesterone receptor (PgR) status, overexpression of human epithelial growth factor receptor 2 (HER2), and Ki67 [1]. Although all these factors provide information on the cancer itself, the microenvironment around a tumor has also been found to be critical for the survival, invasive growth, and metastasis of the cancer.

The host immune system responds to fight cancer. Abnormal antigens on cancer cells can be detected by dendritic cells or macrophages, which activate anti-cancer responses such as cytotoxic T-lymphocytes to eliminate the cancer cells. Tumor-infiltrating lymphocytes (TILs) have shown prognostic value in various kinds of cancers [2], and immune-checkpoint inhibitors have highlighted their potential in treating malignant melanoma and other cancers [3]. This new type of targeted therapy enhances the body’s own anti-tumor immunity by releasing the brake caused by the cancer cells [4].

There is increasing evidence regarding the prognostic and predictive values of TILs, mainly in triple-negative (TN) and HER2-positive (HER2+) BCs [57]. However, different studies have used different criteria to evaluate TILs; some applied lymphocytic-predominant breast cancer (LPBC: e.g., stromal TILs >50 %) [6, 8], while others used different thresholds [7, 9]. A standard method is therefore needed to evaluate TILs to allow an integrated discussion and further our understanding of the BC microenvironment. In addition, the association between TILs and trastuzumab-containing chemotherapy in HER2+ BC is also controversial [10, 11].

An international TIL working group recommended a candidate scoring method [12], but this is too detailed for routine practice. We therefore modified this method and investigated the utility of an alternative simple scoring system for TILs involving hematoxylin–eosin (HE) staining, and explored the clinical values of this scoring in TN and HER2+ BCs.

Methods

Patients and samples

This retrospective, single-institution study was conducted at Sagara Hospital, with ethics committee approval (#14-06). We searched the clinical database for patients with primary BC treated between 2007 and 2014. Patients were included if they had TN-subtype invasive ductal carcinomas (ER−, PgR−, and HER2−) or HER2+ subtype (any ER, any PgR, and HER2+). In the initial study in an adjuvant setting, we included patients who underwent curative surgery and standard postoperative therapy. Patients with small tumors (pT1a) or locally advanced tumors (pT4) were excluded. For the second study in a neo-adjuvant setting, the inclusion criterion was patients with stage I–III cancer treated with standard preoperative chemotherapy. We regarded treatment with an anthracycline or taxane, or an anthracycline followed by a taxane, and with a standard chemotherapy dose as standard systemic therapies for TNBC, and trastuzumab-containing regimens for HER2+ BC.

The following clinicopathological information was obtained from the database: age; tumor size; histological classification; grade; lymphovascular invasion; nodal status; response to neoadjuvant chemotherapy (NAC); and biomarker profile. The thresholds for ER- and PgR-positivity were set at 1 % using immunohistochemical staining. Stored HE samples were retrieved for TIL assessment. Bilateral tumors, cases with massive necrosis, or tiny residual cancers after NAC were excluded. The REMARK (REporting of tumor MARKer Studies) guidelines were followed [13].

Assessment of TILs using a three-grade scale

All HE samples were reviewed by a breast pathologist (A.H.) who was blinded to the patient profiles. In resected specimens, a representative slide containing relatively high amounts of both invasive cancer and lymphocytic infiltration was selected for each case. The whole slide was screened using a low-power field, and a “hot spot” with many lymphocytes was identified. TILs were then evaluated in a medium-power field (100×) using a three-grade scale (Fig. 1a–c). The region of interest was restricted to within the tumor borders (Fig. 1d–f), as described by Salgado et al. [12]. Lymphocytes and plasma cells were counted, but neutrophils, eosinophils, and macrophages were not. We defined TIL score as the proportion of the area infiltrated by lymphocytes within the tumor itself plus the adjacent stroma, and classified the scores as low (<10 %), intermediate (10–50 %), and high (>50 %). Two cut-off points (10 and 50 %) were chosen from four candidate values (1, 10, 30, and 50 %), because this combination had potential prognostic power and was also fast and easy to implement, as shown in our previous preliminary report [14].

Fig. 1
figure 1

Representative photomicrographs of HE-stained tissue sections illustrating the simple scoring method for tumor-infiltrating lymphocytes (TILs). The area proportion of TILs was defined as low (a <10 %), intermediate (b 10–50 %), or high (c >50 %) in a medium-power field (×100 ac). This score was evaluated in a hot spot (d, red arrow) infiltrated by many lymphocytes within the border of an invasive tumor. An inflammatory reaction occurring distantly from the tumor (e, blue arrow) or around the intraductal component (f, blue arrow) was not taken into account. A necrotic area was also excluded from a subject

Outcome measures of pathologic complete response and prognosis

Pathological complete response (pCR) was defined as no residual invasive carcinoma at the primary site, such as ypT0 or ypTis, regardless of lymph node status. Prognostic information was obtained retrospectively from the clinical database and electronic charts. Relapse-free survival (RFS) was defined as the period from surgery to the latest follow-up, relapse of breast cancer, or death from any cause.

Statistical analysis

Survival curves were drawn using the Kaplan–Meier method and analyzed by log-rank tests. The proportions of patients with pCR were compared using χ 2 tests. The relationship between TILs and RFS in the adjuvant setting was examined using a Cox proportional hazards model. The limited numbers of recurrent and death events in the neo-adjuvant setting did not allow us to adjust for other prognostic factors using multivariate analysis. Data were analyzed using GraphPad Prism 5 (GraphPad Software, San Diego, CA, USA) and JMP 12 (SAS Institute Inc., Cary, NC, USA).

Results

Patient characteristics

A total of 3547 invasive carcinomas were identified from the database, most of which (2598 cases, 73.2 %) were ER+/HER2−. ER+/HER2+, ER−/HER2+, and TN subtype accounted for 301 (8.5 %), 267 (7.5 %), and 381 (10.7 %) cases, respectively. Based on the criteria described in Patients and samples above, we selected 154 consecutive cases of TNBC and 183 HER2+ BCs in the adjuvant setting (Table 1), and 50 consecutive TNBCs and 72 HER2+ BCs in the neo-adjuvant setting (Table 2). These patients were all women aged 22–82 years. The detailed distributions of age, tumor size, histological grade, ER status, nodal status, regimens of systemic therapy, and pCR ratio are summarized in Tables 1 and 2.

Table 1 Patient characteristics in the adjuvant setting
Table 2 Patient characteristics in the neo-adjuvant setting

Prognostic value of TILs in the adjuvant setting

Twenty-five recurrences and 19 deaths were observed after curative surgery and standard postoperative therapy, during a median follow-up period of 45 months [interquartile range (IQR): 24–66.8] for TNBC, and 12 recurrences and five deaths during 58 months (IQR: 44.5–74) for HER2+ BC. TILs (low, intermediate, and high) proved to have significant prognostic value (p = 0.015) regarding RFS in TNBC (Fig. 2a), but this was not verified (p = 0.097) among HER2+ BCs (Fig. 2b). The prognosis was also significantly poorer in TNBC patients in the low-TIL group compared with the intermediate/high-TIL groups [hazard ratio (HR): 2.68; 95 % confidence interval (CI): 1.13–5.95]. This remained significant in multivariate analysis (HR: 2.49; 95 % CI: 1.05–5.55), as did nodal status (Table 3). Other factors including age, tumor size, histological grade, and lymphovascular invasion had no significant influence on prognosis.

Fig. 2
figure 2

Prognostic value of tumor-infiltrating lymphocytes (TILs) in the adjuvant setting. Relapse-free survival was analyzed in the adjuvant setting for 154 TNBCs (a) and 183 HER2+ BCs (b). Stored samples obtained from primary surgery were categorized into three groups of low, intermediate, and high TILs. All patients received standard chemotherapy after curative surgery. TN triple-negative, BC breast cancer, HER2 human epithelial growth factor receptor 2

Table 3 Association of tumor-infiltrating lymphocytes with prognosis of relapse-free survival in patients with triple-negative breast cancer

Association between TILs in biopsy samples before NAC and pCR ratio

Biopsy samples taken before NAC were available for 48 TNBCs and 58 HER2+ BCs. Four cases only had intraductal carcinomas on core needle biopsies, and 12 other biopsies were unavailable because they had been diagnosed at other institutions. TILs were evaluated with the same method using the three-grade scale, and the score was compared with the pCR ratio. TILs were significantly related to pCR ratio in TNBC (p = 0.024) (Fig. 3a), but not in HER2+ BC (p = 0.30) (Fig. 3b).

Fig. 3
figure 3

Association between tumor-infiltrating lymphocytes (TILs) and pathological complete response (pCR). Core needle biopsy samples taken before primary systemic therapy were evaluated for TILs using the simple three-grade scale, and its association with the proportion of cases achieving pCR was compared among 48 TNBCs (a) and 58 HER2+ BCs (b). TN triple-negative, BC breast cancer, HER2 human epithelial growth factor receptor 2

Prognostic value of TILs in resected specimens after NAC

Among resected specimens after NAC, 28 TNBC and 41 HER2+ BC non-pCR cases underwent TIL assessment. One TNBC case was excluded because the residual cancer was only detected in the lymphatic vessels. For TNBC, 11 recurrences and seven deaths were observed during a 23-month median follow-up period (IQR: 12.5–37), and 10 recurrences and four deaths were observed during 34 months (IQR: 23.8–48.3) for HER2+ BC. The low- and intermediate-TIL groups showed relatively poorer prognoses than the high-TIL group in TNBC (Fig. 4a), though the differences were not statistically significant (p = 0.24). The three-grade scale for TILs revealed significant prognostic differences in HER2+ BC (RFS: p = 0.007) (Fig. 4b).

Fig. 4
figure 4

Survival analysis after neo-adjuvant systemic therapy. Relapse-free survival was analyzed among 28 TNBCs (a) and 41 HER2+ BCs (b). About 42 % of each subtype achieved pathological complete response (pCR) after standard neo-adjuvant chemotherapy (NAC). For non-pCR cases, tumor-infiltrating lymphocytes (TILs) in the residual carcinoma were evaluated in the three groups of low, intermediate, and high TILs. TN triple-negative, BC breast cancer, HER2 human epithelial growth factor receptor 2

In addition, 26 and 32 pair-matched TN and HER2+ BC samples were available, respectively. The TIL scores of pre-treatment biopsy and post-treatment surgical samples were compared (Online Resource). Changes in TIL scores were classified as decrease, increase, and no change. The proportions of recurrent cases in each subgroup are summarized in Fig. 5. The distributions of non-pCR cases were similar in each subgroup. Patients with TNBC and decreased TIL tended to have a higher rate of relapse, but the difference was not significant in this small setting.

Fig. 5
figure 5

Influence of change in tumor-infiltrating lymphocyte (TIL) score before and after neo-adjuvant chemotherapy (NAC) on the proportion of recurrent cases. We applied the three-grade scale for TILs to both pre-treatment biopsy samples and post-treatment surgical specimens. Matched pairs of 26 TNBCs (a) and 32 HER2+ BCs (b) were analyzed. TN triple-negative, BC breast cancer, HER2 human epithelial growth factor receptor 2

Discussion

The results of this retrospective study verified the prognostic and predictive values of TILs in TNBC, in accordance with other studies [5, 6, 9]. Consistent with a recent report, we found that TILs had no prognostic value concerning HER2+ BC in the adjuvant setting with trastuzumab-containing chemotherapy [11]. The N9831 trial also demonstrated that LPBC was a significant favorable prognostic factor in the adjuvant setting regarding HER2+ BC treated with chemotherapy alone. The microenvironment of non-LPBC tumors appears to be dramatically improved by adding trastuzumab, such that the prognostic impact of TILs disappears in the trastuzumab-containing setting. This improvement may be explained by trastuzumab’s strong antibody-dependent cellular cytotoxicity [15], or by adaptive immune reaction following cancer-cell death induced by trastuzumab-containing chemotherapy [16]. These findings fit well with our results; HER2+ cases with low numbers of TILs might have high levels of lymphocytes in reserve, and trastuzumab-containing systemic therapy could activate this potential. In contrast, some studies have reported that HER2+ BC has a different immune profile [17].

TILs in residual carcinoma after NAC had a significant prognostic impact among HER2+ BCs. These TILs were evaluated after trastuzumab-containing chemotherapy and had therefore already been influenced by systemic treatment. To the best of our knowledge, this is the first report indicating that TIL status after NAC is a favorable prognostic factor in HER2+ BC. Similarly, Dieci et al. [18] reported the prognostic value of TILs in residual disease after NAC in TNBC. Their survival curves for the High-TIL (stromal and/or intratumoral TIL >60 %) and Low-TIL groups are similar to those for patients with TNBC in the present study. However, the difference between the high-TIL and low/intermediate-TIL groups was not significant (HR: 3.49; 95 % CI: 0.75–16.3). The lack of statistical power (n = 28) and absence of events in the high-TIL group may account for the discordant result. The prognostic value of TILs in residual disease therefore needs to be confirmed in a larger cohort.

Our study had several limitations. In terms of analytical validation, we did not prove the robustness and reproducibility of the three-grade scale for TILs. However, intra-observer concordance was good (weighted κ = 0.62) in our recent investigation (n = 90, estimated by A.H., 3-month intervals; detailed data not shown), and inter-observer concordance was also good (weighted κ = 0.78) as shown at supplementary Table S1 in our previous report [14]. However, these data are inadequate, and inter-laboratory concordances should be examined before standardization. Given the limitations on reproducibility associated with human analytical procedures, automated technology using immunohistochemical staining [19] or multi-gene assays [20] may represent possible future directions. It is also unclear if we should evaluate average areas or hot spots. We recognized the heterogeneity of TILs (different intensities of lymphocytes in different areas), and hot spots tended to be located at the invasive edge. Further evidence is therefore needed regarding the role, mechanisms involved, and importance of hot spots. Another limitation was the small number of patients evaluated and the relatively short follow-up period. Nevertheless, we applied a single uniform method throughout the study, which clarified differences in the prognostic value of TILs between TN and HER2+ BCs, and also between adjuvant and neo-adjuvant settings.

New strategies should be considered to improve the outcome of patients with TNBC, such as immune-checkpoint inhibitors in low-TIL patients. TILs have the potential to act as an additional surrogate marker for survival, and TILs in residual carcinoma after NAC may therefore help in assessing risk and stratifying treatment in patients with either TN or HER2+ BC. This rapid and practical method may contribute to further translational research and the discovery of innovative drugs.

Conclusions

In summary, we propose a simple method to evaluate TILs. Based on a three-grade scale, the clinical value of TILs differs between TN and HER2+ BCs in patients receiving standard therapies. TILs after NAC seem to have a favorable prognostic impact on HER2+ BC and possibly on TNBC, suggesting that they may represent a surrogate marker for survival and drug efficacy in these situations.