Background

Triple-negative breast cancer (TNBC) is the term used to describe breast cancer cases that lack expression of estrogen receptor (ER), human epidermal growth factor receptor-2 (HER2), and progesterone receptor (PR) [1]. TNBC is characterized by a poor prognosis, and accordingly, the 5-year survival rate is only around 60% [2]. As the malignancy of breast cancer depends not only on its genetic abnormalities and biological characteristics but also on interactions between the cancer cells and their microenvironment, it is vital to understand the tumor microenvironment [3].

The microenvironment of breast cancer contains a variety of cell types, including tumor-infiltrating lymphocytes (TILs). Accumulating evidence indicates that TILs play essential roles in carcinogenesis and cancer progression [4]. Furthermore, interleukin (IL)-6 and IL-8 secreted by some subtypes of lymphocytes can generate a positive feedback loop between the immune microenvironment and tumor cells [5]. According to the results of a meta-analysis in 2014, the level of TILs was positively associated with a the prognosis of TNBC [6]. However, various subtypes of TILs have both inhibitory and stimulatory effects on the prognosis and progression of breast cancer. The CD4+ T cells and CD8+ T cells (primary effector TIL subtypes) have been linked to a better response to systemic treatment in breast cancer [7, 8]. On the contrary, FOXP3+ T-cell infiltration was found to predict a worse prognosis via the mediation of tumor immune escape [9, 10]. Because TNBC has unique clinicopathological and immunohistochemical features, determining the clinical associations of the total TIL count or the levels of specific subtypes of TILs in TNBC can improve our ability to predict the prognostic pattern and treatment response for TNBC.

The objective of the present systematic review and meta-analysis was to determine the prognostic roles of the total TILs or the levels of subtypes of TILs (CD4+, CD8+, and FOXP3+) in TNBC.

Methods

The present systematic review and meta-analysis were conducted following the requirements of the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement [11].

Search strategy and study selection

A systematic literature search was conducted using the MEDLINE, EMBASE, and Web of Science databases to identify eligible articles published before August 2019. The keywords used for the literature search included triple-negative breast cancer (TNBC), tumor-infiltrating lymphocytes (TILs), prognosis, and survival. Review and meta-analysis articles were scanned for additional relevant studies. The literature search strategies are outlined in Additional file 1.

Outcome definitions

Pathological complete response (pCR) was defined as the absence of all invasive disease cells and lymph node metastasis [12]. Overall survival (OS) was defined as the period from the date of TNBC diagnosis to the time of death with any cause [13]. Disease-free survival (DFS) was defined as the period from the start of treatment to the first recurrence, or to death without any type of relapse [13].

Inclusion and exclusion criteria

The inclusion criteria were the following: (1) paper written in English, (2) study population or study sub-group consisted of patients with TNBC, (3) the relationships between TIL levels and short-term prognosis (i.e., pCR) and long-term prognosis (i.e., OS and DFS) were investigated, (4) original studies without restriction in study design, (5) studies containing enough data to estimate the effects (i.e., hazard ratios [HRs] and corresponding 95% confidence intervals [CIs] for OS or DFS, and odds ratios [ORs] and corresponding 95% CIs for pCR). The exclusion criteria were the following: (1) reviews, commentaries, editorials, protocols, case reports, qualitative research, or letters; (2) duplicate publications; and (3) full text not published in English, and (4) studies without usable data.

Study selection and quality assessment

Title–abstract screening was performed first to determine eligibility by two independent reviewers. Full-text articles that passed the first stage screening were downloaded for further review according to the inclusion and exclusion criteria. Disagreements were resolved by consultation with a third author or by joint discussion.

As no randomized controlled trial was found, we assessed the risk of bias using an approach based on the ROBINS I tool [14] and the Quality In Prognosis Studies (QUIPS) tool [15]. The risk of bias assessment was conducted by two reviewers independently.

Data extraction

We extracted data from the included studies using a pilot-tested data extraction form. We extracted the following data for this review: (1) first author and publication year, (2) country in which study was conducted, (3) study design, (4) participant details, (5) duration of follow-up, (6) choice of cut-off scores for defining positive TILs, (7) TIL category, (8) TIL measurement details (category or continuous). The definition of high/low TIL level were attributed to the original papers. (9) adjusted HRs with 95% CIs for OS and/or DFS (univariable HRs were recorded only if adjusted HRs were not available), and (10) adjusted ORs with 95% CIs (or accurate event numbers) for pCR (univariable ORs were recorded only if adjusted ORs were not available).

Statistical analysis

We performed meta-analyses to obtain a pooled estimate of the prognostic role of TILs using RevMan 5.3. Category software, and continuous TILs were estimated separately to decrease the heterogeneity. The results were expressed as HR (95%CI) for OS and DFS and by OR (95% CI) as calculated by Review Manager 5.3 [16]. A P-value less than 0.05 was set as indicative of statistical significance. Between-study heterogeneity was measured using the Higgins I2 statistic and Cochrane’s Q test (P < 0.10 or I2 > 50% was considered indicative of statistically significant heterogeneity) [17]. A random effects model (Der Simonian and Laird method) was applied if heterogeneity was present. However, the fixed-effect model was used in the absence of between-study heterogeneity (P > 0.10 or I2 < 50%). We performed subgroup analyses according to different subtypes of TILs as a sensitivity analysis to confirm the robustness of our results. Funnel plots were drafted for each meta-analysis to assess the potential publication bias.

Results

Search results and study characteristics

A total of 3194 articles were selected through searching the chosen electronic databases, and an additional 5 records were identified by cross-checking the bibliographies of retrieved meta-analysis or relevant reviews. After exclusion of duplicates, we screened the titles and abstracts and identified 46 articles for full-text review. We eliminated 9 papers according to the inclusion/exclusion criteria. Ultimately, 37 papers were included in the final analysis (Fig. 1) [7, 18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53].

Fig. 1
figure 1

PRISMA flow diagram detailing the search strategy and results [11]

The basic characteristics and target outcomes extracted from the included studies are listed in Table 1. All included articles (n = 37) were full-reported retrospective cohort studies. The studies were conducted in the United States (18.9%, 7/37), Japan (16.2%, 6/37), South Korea (16.2%, 6/37), China (8.1%, 3/37), France (8.1%, 3/37), Italy (3.4%, 2/37), Singapore (3.4%, 2/37), Germany (5.4%, 2/37), Australia (2.7%, 1/37), Peru (2.7%, 1/37), Spain (2.7%, 1/37), Canada (2.7%, 1/37), Ireland (2.7%, 1/37), and Switzerland (2.7%, 1/37). The population targeted was patients with TNBC. Eleven studies (29.7%, 11/37) provided evidence of the prognostic value of TILs for short-term outcomes (pCR), and five (75.7%, 28/37) provided evidence of the prognostic values of TILs for long-term outcomes (OS and/or DFS). The details of data extraction are presented in Additional file 2.

Table 1 Clinical details of the included studies

TILs and pCR

From the 11 studies demonstrating the prognostic value of TILs for pCR among TNBC patients, the results showed that upregulation of TILs predicted a higher pCR rate. The pooled ORs were 2.14 (95% CI, 1.43–3.19) for TIL level (high vs. low) and 1.09 (95% CI, 1.02–1.16) for continuous TILs (10% increase in TIL level). When stratified by the TIL phenotypes of CD4+, CD8+, and FOXP3+, no statistical differences in pCR were found in the subgroup analysis. The details pooled results are presented in Fig. 2.

Fig. 2
figure 2

Forest plots of the random-effects meta-analysis for the efficacy of tumor-infiltrating lymphocytes (TILs) for predicting pathological complete response (pCR). a Low TILs vs. high TILs stratified by TIL phenotype. b Continuous TILs (10% increase) for pCR

TILs and OS

A total of 24 studies supported the prognostic value of TILs for OS in TNBC patients. The results showed upregulation of TILs predicted a better OS. The pooled HRs were 0.58 (95% CI, 0.48–0.71) for total TIL level (high vs. low) and 0.90 (95% CI, 0.87–0.93) for continuous TILs (Fig. 3).

Fig. 3
figure 3

Forest plots of the random-effects meta-analysis for the efficacy of tumor-infiltrating lymphocytes (TILs) for overall survival (OS). a Low TILs vs. high TILs stratified by TIL phenotypes. b TILs stratified by continuous TILs, 5% increase in TILs, 10% increase in TILs, and phenotypes

From subgroup analyses according to TIL phenotype (high vs. low), the HRs were 0.49 (95% CI, 0.32–0.76), 0.70 (95% CI, 0.46–1.06), and 1.28 (95% CI, 0.24–6.88) for CD4+ TILs, CD8+ TILs, and FOXP3+ TILs, respectively (Fig. 3a). Subgroup analyses according to the change in TIL level (continuous) returned HRs of 0.50 (95% CI, 0.28–0.89) and 1.80 (95% CI, 0.50–6.48) for CD8+ TILs and FOXP3+ TILs, respectively (Fig. 3b).

TILs and DFS

A total of 20 studies supported the prognostic value of TILs for DFS in TNBC patients. The results showed upregulation of TILs predicted better DFS, with pooled HRs of 0.66 (95% CI, 0.57–0.76) for TIL level (high vs. low) and 0.92 (95% CI, 0.90–0.95) for continuous TILs (Fig. 4).

Fig. 4
figure 4

Forest plots of the random-effects meta-analysis for the efficacy of tumor-infiltrating lymphocytes (TILs) for disease-free survival (DFS). a Low TILs vs. high TILs stratified by TIL phenotype. b TILs stratified by continuous TILs, 5% increase in TILs, and 10% increase in TILs

From subgroup analyses according to TIL phenotype (high vs. low), the HRs were 0.54 (95% CI, 0.36–0.80), 0.55 (95% CI, 0.38–0.81), and 0.50 (95% CI, 0.33–0.75) for CD4+ TILs, CD8+ TILs, and FOXP3+ TILs, respectively (Fig. 4a).

Subgroup analyses according to the change in TIL level (continuous) returned HRs of 0.93 (95% CI, 0.90–0.96), 0.70 (95% CI, 0.39–1.27), and 0.41 (95% CI, 0.21–0.80) for a 10% increase in TILs, continuous TILs, and a 5% increase in TILs of each subgroup, respectively (Fig. 4b).

Risk of bias in included studies

We evaluated the risk of bias for all included studies (n = 37). We found the main sources of bias were related to missing data, TIL measurement and confounding controls. Most of the missing data due to that not all the available patients were included in the final analysis as the information was not complete (participants were excluded due to missing data). Figure 5a shows the risk of bias assessments for each cohort. Evaluations for each domain across full reported studies are shown in Fig. 5b.

Fig. 5
figure 5

Risk of bias assessment at the study level. a Risk of bias graph: review authors’ judgements about each risk of bias item presented as percentages across all included full reported studies (n = 37). b Risk of bias summary: review authors’ judgements about each risk of bias item for each included study

Publication bias

Funnel plot analysis did not indicate apparent publication bias affecting the HRs for DFS and OS or the ORs for pCR in the included studies (Fig. 6).

Fig. 6
figure 6

Funnel plot analysis of potential publication bias. a High tumor-infiltrating lymphocytes (TILs) vs. low TILs for pathological complete response (pCR). b Continuous TILs (10% increase) for pCR. c High TILs vs. low TILs for overall survival (OS). d Continuous TILs for OS. e High TILs vs. low TILs for disease-free survival (DFS). f Continuous TILs for DFS

Discussion

As TNBC is a poor prognostic subtype of breast cancer, it is important to identify biomarkers that can rigorously predict its prognosis. The present review and meta-analysis synthesized 37 studies to evaluate the association between TIL levels, both total and specific subtypes, and prognosis in TNBC patients. Our findings indicate that a high TIL level in TNBC significantly increases the likelihood of pCR and improves DFS and OS.

In the present study, we used pCR as the indicator of short-term prognosis for patients with TNBC. Previous studies reported that higher TIL levels predict a better response to chemotherapy in patients with breast cancer [54,55,56]. According to our pooled results, compared to TNBC patients with low TIL levels, TNBC patients with high TIL levels had a higher rate of pCR to treatment (OR 2.14, 95% CI 1.43–3.19). Moreover, with each 10% increase in TIL level, patients with TNBC had an increased pCR rate (OR 1.09, 95% CI 1.02–1.16). A potential explanation for these findings is the influence of TILs to tumor immunosurveillance and tumor immunosuppression [57]. In addition, the treatment used in the included articles was inconsistent. However, no significant pCR improvement was observed for high levels of the CD4+, CD8+, and FOXP3+ TIL subgroups. This may due to the limited amount of data available for these subgroups.

The indicators of long-term prognosis in this study were OS and DFS. According to our pooled results, compared to TNBC patients with low TIL levels, patients with high TIL levels showed better OS (HR 0.58, 95% CI 0.48–0.71) and DFS (HR 0.66, 95% CI 0.57–0.76). Additionally, with a continuously increasing TIL levels, patients with TNBC had improved OS (HR 0.90, 95% CI 0.87–0.93) and DFS (HR 0.92, 95% CI 0.90–0.95). This finding is consistent with previous conclusions [3, 9, 25, 58, 59]. Our results indicate that a high level of TILs is a positive predictor for the prognosis of patients with TNBC.

The CD4+ TIL subgroup (high vs. low) showed a better OS (HR 0.49, 95%CI 0.32–0.76) and DFS (HR 0.54, 95%CI 0.36–0.80), and the CD8+ TIL subgroup (high vs. low) showed a better DFS only (HR 0.55, 95% CI 0.38–0.81). Nevertheless, the pooled results indicated CD4+ TILs and CD8+ TILs were positive predictors for long-term prognosis in TNBC. This is consistent with previous meta-analysis results [6]. The FOXP3+ TIL subgroup (high vs. low) also showed only better DFS (HR 0.50, 95% CI 0.33–0.75), with no statistical association with OS (HR 1.28, 95% CI 0.24–6.88). This finding for FOXP3+ TILs is opposite to that of previous meta-analyses [3, 6], and the reason for this inconsistency is unclear. More studies of the association of FOXP3+ TILs with the prognosis of TNBC are needed.

To our best knowledge, this was the first meta-analysis to pool the prognostic results for categorical TIL level and continuous TILs separately. Therefore, from the results, we can definitively conclude that a higher density of TILs corresponds to a better prognosis for TNBC. Our study does have some limitations. First, all included studies were retrospective cohort studies, with risks of bias related to missing data, TIL measurement, and confounding controls. Next, the variation in the definition of high/low TIL level, and the timeline(s) used for PFS and OS among the included studies can affect the accuracy of the results.

Conclusions

TNBC with higher levels of TILs showed better short-term and long-term prognoses. High levels of specific phenotypes of TILs (CD4+, CD8+, and FOXP3+) could positively predict the long-term prognosis for TNBC.