Introduction

Gastric cancer (GC) is a common malignancy in the world, ranking fifth in morbidity and third in mortality among all cancers [1]. Radical resection is still the main curative method for GC. However, surgery alone is insufficient to remove GC and postoperative chemotherapy (CT) or radiochemotherapy (RCT) with fluoropyrimidine (FU)-based regimens further improves the curative rate and survival of patients with resected GC [2]. Currently, postoperative CT and RCT, which improve overall survival (OS) equally [2], are considered as the standard of care for patients with ≥ stage IB GC [3]. However, approximately half of the patients relapse within 5 years, which underscores the need for novel prognostic and predictive markers other than TNM staging alone.

Tumor mutation burden (TMB), which is emerging as a predictor for response to immune checkpoint inhibitors [4], also exhibits great potential as a biomarker in early-stage tumors due to its association with anti-cancer immunity. It has been known that some somatic non-synonymous variants (SNVs) can lead to the translation of novel peptide epitopes or neoantigens, which induces immune infiltration by enhancing the tumor immunogenicity [5]. Meanwhile, immune infiltration is a strong prognostic or even a predictive factor in terms of both disease-free survival (DFS) and OS at different stages of tumors [6]. However, the characterization of TMB-associated immune infiltration in GC remains unclear.

Recently, TMB has been investigated in resected non-small-cell lung cancer (NSCLC), while different studies attained different conclusions. A retrospective study showed that high non-synonymous TMB (> 8 mutations/Mb), evaluated by next generation sequencing (NGS) of a panel including 1538 genes, was prognostic for favorable OS and DFS in 908 patients from the Lung Adjuvant Cisplatin Evaluation (LACE)-Bio-II trial [7]. On the contrary, Owada-Ozaki et al. [8] found in 90 patients with resected NSCLC that high TMB (the number of non-synonymous coding variants ≥ 62), calculated by whole exome sequencing (WES) analysis, indicated a poor prognosis. In another study, the prognostic impact from TMB was not observed (> 31 mutations/Mb as high TMB), after sequencing analysis using a capture panel, in 199 patients with early-stage squamous cell lung cancer [9].

Focusing on gastrointestinal cancer, a study analyzed TMB from targeted NGS data of 40 major genes in 516 patients with resected colorectal cancer and observed that TMB-high (≥ 8 non-synonymous somatic mutations) is associated with better relapse-free survival [10]. In GC, a study including 100 patients used an unknown NGS panel to find that prolonged OS was associated with high TMB (3.72 mutations/Mb) in the discovery set but not in the validation set [11]. Apparently, current results are inconsistent for the prognostic role of TMB in resected tumors, which might be due to different NGS platforms, gene panels, high TMB definitions, tumor types, and others.

The impact of TMB on the efficacy of postoperative CT/RCT remains elusive, despite CT/RCT is currently the standard of care in many tumors after surgery. In this study, we investigated the prognostic and predictive roles of TMB in patients with resected GC who received surgery plus postoperative CT/RCT or surgery alone from three independent cohorts, with a special focus on the potential mechanisms underlying these roles.

Patients and methods

Patients

The discovery set included consecutive patients who underwent GC operations between 2015 and 2016 at the Affiliated Hospital of Jiangsu University (AHJU), Zhenjiang, China. Fresh and/or formalin fixed paraffin embedded (FFPE) pairs of GC and normal tissues were dissected from surgical specimens. Diagnosis and tumor purity were confirmed by the pathologists. GC cohorts in The Cancer Genome Atlas (TCGA) and the Asian Cancer Research Group (ACRG) were used as validation sets, and data were acquired and preprocessed as described elsewhere [12].

The following inclusion criteria were used for all patients: (1) available sequencing data for TMB evaluation; (2) a prior history of gastrectomy; (3) pathological diagnosis of stage IB-III gastric adenocarcinoma; and (4) no prior history of radiation therapy, chemotherapy, or biological target therapy (including neoadjuvant therapy). A history of D2 radical gastrectomy with R0 resection were additionally asked in the AHJU cohort. The American Joint Committee on Cancer (AJCC) criteria were used for clinical and clinicopathologic classification and staging.

Sequencing analysis

Genomic DNA from fresh tumor tissues were extracted using the DNeasy Blood & Tissue kit (Qiagen). FFPE samples were de-paraffinized with xylene followed by genomic DNA extraction using QIAamp DNA FFPE Tissue Kit (Qiagen). We used KAPA Hyper Prep Kit (KAPA Biosystems) to prepare whole genome library and the Illumina Rapid Capture Extended Exome Kit (Illumina Inc.) to perform exome capture. Enriched libraries were sequenced using the Illumina HiSeq4000 NGS platform (Illumina, USA) as paired 150-bp reads. Data processing was carried out following the previously described methods [13].

Microsatellite instability (MSI) analysis

Single fluorescent multiplex PCR based on genomic DNA extracted from GC and normal tissues was performed to assess MSI status using five well-known mononucleotide repeats [14]. Samples with allelic size variations in more than one microsatellite were evaluated as MSI, or as MSS.

Immunohistochemistry (IHC)

Thymidylate synthase (TS, TYMS) and ERO1A were stained using ab108995 and ab177156 antibodies (Abcam, UK), respectively, based on a two-step IHC protocol [15]. Positive TS expression was defined as the presence of nuclear and/or cytoplasm staining of tumor cells irrespective of their proportion or intensity. Positive ERO1A expression was determined according to IHC score [15].

Multiplexed IHC (mIHC) and multispectral imaging

mIHC and multispectral imaging were applied to detect immune cell subsets. Following the manufacturer’s instructions, multiplex immunofluorescence staining was conducted based on PANO 7-plex IHC kit (Panovue, Beijing, China), with varied antibodies including anti-panCK (CST4545, Cell Signaling Technology, USA), anti-CD56 (CST3576), anti-CD8 (CST70306), anti-HLA-DR (ab92511, Abcam, UK), anti-S100 (ab52642) and anti-CD68 (BX50031, Biolynx, China). The invasive margin and tumor parenchyma were distinguished by S100 staining [16].

The stained slides were scanned and a single stack image were built based on the Mantra System (PerkinElmer, Waltham, Massachusetts, US). After removing autofluorescence, images of sections were reconstructed according to a spectral library for multispectral unmixing and cells were quantified by the inForm image analysis software (PerkinElmer, Waltham, Massachusetts, US).

Bioinformatics analysis

Parametric Gene Set Enrichment Analysis (PGSEA) [17] was used to perform tumor hypoxia scoring based on mRNA data and well-established signatures [18,19,20,21,22]. xCell, an in silico tool, was used to conduct cell type enrichment analysis from mRNA data [23]. These data were shown in Supplementary Data.

Statistical analysis

TMB was defined as the total SNV counts in coding regions [7] and dichotomized by the median value [9]. All TMB data were shown in Supplementary Data. OS and DFS had been previously defined [24]. At first, missing data in ACRG, TCGA and the pooled cohort were imputed by multiple imputations, respectively. The patterns of missing values were analyzed and corresponding impute methods were selected. The number of imputed complete datasets was similar to the percentage of cases that are incomplete in each cohort [25]. All datasets could be found in Supplementary Data.

Comparisons between groups were made with the χ2 test, Fisher’s exact probability test, Student’s t test and Mann–Whitney U test as needed. Correlations between groups were evaluated using Spearman correlation coefficient. Survival analysis was performed based on the Kaplan–Meier method and the log-rank test. Prognostic factors were analyzed using univariate and multivariate Cox proportional hazard models and we calculated hazard ratios (HRs) along with their 95% confidence intervals (CIs). We first estimated HR in each imputed dataset, following which the estimated HRs from all imputed datasets were combined according to the Rubin’s rules [26]. The imputed dataset that had the closest HR to the combined HR was selected to plot survival curves. We considered statistical significance with a two-sided p < 0.05. SPSS (version 19.0, Chicago, IL), R (version 3.6.1) and R Bioconductor packages were used for all the above analyses.

Results

Patient characteristics

The numbers of eligible patients in the AHJU, TCGA, and ACRG cohorts were 73, 352 and 222, respectively. The median TMBs in the AHJU and TCGA cohorts, were 88 and 85.5, respectively. The ACRG cohort included 43 eligible patients with WES data and 179 eligible patients with panel sequencing data. The median TMBs for these patients were 68 and 11, respectively, which were used as their corresponding cutoffs for high and low TMBs. Most MSI GC and MSS GC of about 40% were TMB-high (Table 1).

Table 1 Patient characteristics according to tumor mutation burden

Association between TMB and prognosis

In all cohorts, patients with higher TMBs had better OS than those with lower TMBs (Fig. 1A). Similar results were found for DFS (Fig. 1B). Furthermore, consistent results for both OS and DFS were observed in the pooled cohort that included all patients (Fig. 1C). After univariate selection for the prognostic significance of variables (Table S1), multivariate analysis in the pooled cohort showed that TMB was an independent predictor for both OS (HR = 0.67, 95% CI 0.51–0.88, P = 0.003) and DFS (HR = 0.71, 95% CI 0.55–0.91, P = 0.007; Table S2).

Fig. 1
figure 1

Association between tumor mutation burden (TMB) and prognosis in resected gastric cancer. A and B: Overall survival (OS; A) and disease-free survival (DFS; B) by TMBs in individual cohort; C: OS and DFS by TMBs in the pooled cohort. AHJU: Affiliated Hospital of Jiangsu University; ACRG: Asian Cancer Research Group; TCGA: The Cancer Genome Atlas; HR: hazard ratio; CI: confidence interval

These results indicated that high TMB correlated with favorable prognosis in resected GC.

Association between TMB and postoperative CT/RCT efficacy

In patients with lower TMBs, postoperative CT/RCT significantly improved both OS (Fig. 2A) and DFS (Fig. 2B) in all cohorts, compared with surgery alone. In patients with higher TMBs, postoperative CT/RCT significantly improved OS only in the TCGA cohort but not in the other two cohorts (Fig. 2a), and postoperative CT/RCT did not significantly improve DFS in any cohort (Fig. 2b).

Fig. 2
figure 2

Association between tumor mutation burden (TMB) and the efficacy of postoperative chemotherapy (CT) or radiochemotherapy (RCT) in resected gastric cancer. A and B: Overall survival (OS; A) and disease-free survival (DFS; B) by both TMBs and postoperative CT/RCT in individual cohort; C: OS and DFS by both TMBs and postoperative CT/RCT in the pooled cohort. AHJU: Affiliated Hospital of Jiangsu University; ACRG: Asian Cancer Research Group; TCGA: The Cancer Genome Atlas; HR: hazard ratio; CI: confidence interval

In the pooled cohort, patients with lower TMBs significantly benefited from postoperative CT/RCT for both OS and DFS (Fig. 2c). Multivariate analysis after univariate selection (Table S3) showed that postoperative CT/RCT was an independent predictor for both OS (HR = 0.47, 95% CI 0.33–0.67, P < 0.001) and DFS (HR = 0.43, 95% CI 0.30–0.60, P < 0.001; Table S4) in patients with lower TMBs. For patients with higher TMBs, postoperative CT/RCT significantly improved OS but not DFS (Fig. 2c).

Above results suggest that the benefit of postoperative CT/RCT in resected GC is more pronounced in patients with lower TMBs than with higher TMBs. However, postoperative CT/RCT may be also effective in some patients with high TMBs.

Association of TMB with postoperative CT/RCT efficacy by stages

Currently, patient selection for postoperative CT/RCT is mainly determined by pathological staging. To determine the subgroup of TMB-high GC that may benefit from postoperative CT/RCT, stratification by stage was performed. Considering a requirement of large sample size, this analysis was conducted in the pooled cohort. Patient characteristics according to both TMB and stage were shown in Table S5. The TMB-high stage Ib/II GC had more elderly, female, histology grade I/II, MSI subtype, and was less treated by postoperative CT/RCT, than other subgroups (P < 0.05). Subsequently, we found that patients with higher TMBs significantly benefited from postoperative CT/RCT for both OS and DFS in stage III but not in stage Ib/II (Fig. 3a). In contrast, regardless of the stages, the patients with lower TMBs significantly benefited from postoperative CT/RCT for both OS and DFS (Fig. 3b).

Fig. 3
figure 3

Association between tumor mutation burden (TMB) and the efficacy of postoperative chemotherapy (CT) or radiochemotherapy (RCT) by stages in resected gastric cancer. A and B: Overall survival (OS) and disease-free survival (DFS) by both stages and postoperative CT/RCT in TMB-high (A) and TMB-low (B) subgroups, respectively, in the pooled cohort. HR: hazard ratio; CI: confidence interval

Association of TMB with immune infiltration

The prognostic role of TMB is thought to be mediated by its effect in shaping tumor-host immune interactions through the production of neo-epitopes [5, 27]. However, the association of TMB with immune infiltration in GC remains unclear. Among the immune infiltration cells, the Th1 cells are the key to promote antitumor immunity while the fibroblasts can suppress antitumor immunity [28, 29]. We used the mRNA-based cell type enrichment analysis to calculate the abundances of Th1 cells and fibroblasts inside GC. Our results showed that TMB positively correlated with the abundance of Th1 cells but negatively correlated with the abundance of fibroblasts in both the TCGA and ACRG cohorts (Fig. 4a), suggesting that TMB may reflect dynamism of antitumor immunity.

Fig. 4
figure 4

Association between tumor mutation burden (TMB) and immune infiltration in resected gastric cancer. A: Correlation between TMB and the abundance of immune infiltration cells by transcriptome-based analysis; B: Multiplexed immunohistochemistry (mIHC) staining of surface biomarkers of immune cells in tumor tissues. 1: CD8; 2: CD56; 3: CD68 (green) and HLA-DR (red); 4: the reconstructed image for all surface biomarkers. The enlarged view shows that the invasive margin and tumour parenchyma were distinguished by S100 staining. The yellow line demarcates S100-positive tumor (above yellow line) from S100-negative stroma (under yellow line). C: Migration rates of immune cells from the invasive margin into the tumor parenchyma by TMBs. ACRG: Asian Cancer Research Group; TCGA: The Cancer Genome Atlas

Interestingly, TMB positively correlated with the abundance of NK cells but negatively correlated with the abundance of CD8 + T cells (Fig. 4a). These findings were further investigated by mIHC in eight samples that had a sufficient amount of tissue from the AHJU cohort. The cells in the invasive margin and the tumor parenchyma, which were distinguished by S100 staining, were quantified separately (Fig. 4b). Because the migration of immune cells from the invasive margin into the tumor parenchyma was critical to antitumor immunity [16], the migration rate was introduced to evaluate the dynamism of antitumor immunity. We defined the migration rate as the number of immune cells in the tumor parenchyma divided by the total number of immune cells in the invasive margin and tumor parenchyma. We found that TMB-high tumors had a significantly lower migration rate for CD8 + T cells than TMB-low tumors (Fig. 4c). Moreover, NK cells were subdivided into two major subsets: CD56dim and CD56bright, both of which are critical in antitumor immunity [30]. We showed that the migration rates of both CD56bright and the total NK cells were significantly higher in TMB-high than in TMB-low tumors and the migration rate of CD56dim NK cells also tended to be significantly higher in TMB-high than in TMB-low tumors (Fig. 4c).

These results indicate that high TMB may signify active antitumor immunity, which was characterized by NK cell rather than CD8 + T cell infiltration.

Association of TMB with hypoxia

Hypoxia renders tumors resistant to CT and RCT [31]. Thus, we quantified tumor hypoxia based on mRNA data and independent hypoxia signatures. We found that TMB positively correlated with hypoxic scores from all hypoxia signatures in both TCGA and ACRG (Fig. 5a). By stratification of TMB and stages, we further showed that hypoxic scores in TMB-high stage Ib/II GC were higher than any other subgroups in the combined TCGA and ACRG cohorts (Fig. 5b). Moreover, mutation loads in TP53, MYC and PTEN had been reported to correlate with hypoxia [32], we also found that the TMB-high stage Ib/II GC had a significantly higher mutation rate of TP53/MYC/PTEN than others (57.6% vs 43.8%; P = 0.002; Fig. S1).

Fig. 5
figure 5

Association between tumor mutation burden (TMB) and hypoxia in resected gastric cancer. A: Correlation between TMB and hypoxia scores (signature authors are shown); B: Hypoxia scores by both TMBs and stages; C: Correlation between TMB and ERO1A mRNA expression; D: Typical micrographs of ERO1A-positive (1) and ERO1A-negative (2) tumors, at 200 × magnification; E: Protein expressions of ERO1A and thymidylate synthase (TS) by both TMBs and stages; F: Correlation between Buffa hypoxia score and TS mRNA expression; G: Correlation between TMB and TS mRNA expression; H: Typical micrographs of TS-positive (1) and TS-negative (2) tumors, at 200 × magnification. TCGA: The Cancer Genome Atlas; ACRG: Asian Cancer Research Group

In the meantime, we used a novel endogenous hypoxia marker-ERO1A [33], to validate our findings. We observed that TMB positively correlated with the mRNA expression of ERO1A in both TCGA and ACRG (Fig. 5c). By IHC analysis of samples from the AHJU cohort (Fig. 5d), it was revealed that the positive rate of ERO1A protein expression was also significantly higher in TMB-high than in TMB-low tumors (26/37 vs 17/36, P = 0.045). Specifically, the TMB-high stage Ib/II GC was the most hypoxic in all subgroups in terms of ERO1A protein expression (Fig. 5e).

An important mechanism of tumor hypoxia to induce CT/RCT resistance is to increase gene expressions that could impair CT/RCT efficacy [31]. Thus, we investigated the association between TMB and thymidylate synthase (TS) whose high expression was deleterious for the outcome of adjuvant CT of GC [34]. In both TCGA and ACRG, we first verified the strong positive correlation between the Buffa hypoxic score and TS mRNA expression (Fig. 5f), and further demonstrated a similar correlation between TMB and TS mRNA expression (Fig. 5g). By IHC analysis of samples from the AHJU cohort (Fig. 5h), we observed that the positive rate of TS protein expression was higher in TMB-high than in TMB-low tumors (28/37 vs 15/36, P = 0.003). Moreover, TMB-high stage Ib/II GC had the highest positive rate of TS protein in all subgroups (Fig. 5e).

Altogether, TMB-high GC was more hypoxic than TMB-low GC. Especially, TMB-high stage Ib/II GC was more hypoxic, accompanied by more TS expression, than any other subgroups.

Discussion

In this study, we proved the prognostic and predictive roles of TMB in resected GC. Together with previous findings about similar roles of MSI status and PD-L1 expression in resected GC [35,36,37], biomarkers associated with antitumor immunity are emerging as a group of novel indicators for clinical outcomes. However, the frequency of MSI GC is usually reported to be less than 10%, and most of them are TMB-high as previously reported [35] and also shown in our study. Besides, TMB-high GC defined in our study included MSS GC of approximately 40% (Table 1), suggesting that TMB may be more effective than MSI to identify patients who may failed in postoperative CT/RCT. On the other hand, PD-L1 and TMB had been found to be independent [38, 39], and therefore a combined use of PD-L1 and TMB may largely improve the patient selection in postoperative therapies. In brief, TMB may be a useful biomarker in resected GC.

Our results indicated that TMB correlated with immune infiltration in resected GC. Specifically, NK cells rather than CD8 + T cells were enriched inside TMB-high GC. In consistent with our findings, a recent study in NSCLC also reported that CD8 + T cell infiltration was not associated with TMB [40]. In contrary, a significant correlation was found between CD8 + T cell density and PD-L1 expression in both NSCLC and GC [40, 41], indicating that CD8 + T cells inside tumors may usually be inhibited by PD-L1 signaling. Altogether, TMB-high and PD-L1 expression seemed to represent immune infiltration of different cell types, which may partly explain the independent role of TMB from PD-L1 in predicting immunotherapy efficacy [38, 39].

The key finding of our study is that TMB-high GC, with more active immune infiltration, benefited less from postoperative CT/RCT than TMB-low GC. Similarly, by identifying two major immune phenotypes in muscle-invasive bladder cancer, Fu et al. [42] found that stromal immunotype A, with more inflamed tumor, were not sensitive to adjuvant CT, while immunotype B tumors characterized by low levels of immune infiltration derive benefit from adjuvant CT. These findings support that the prevalence of immune infiltrates is the critical element in determining response to therapy despite the mechanisms remain elusive.

Hypoxia is considered as one of the major factors which could induce resistance to CT/RCT. Mutations in specific genes such as TP53, MYC and PTEN were enriched in hypoxic tumors, which was observed across cancer types based on public NGS data [32]. However, the association between hypoxia and TMB has not been verified by further experiments especially in GC validation cohorts. In this study, we confirmed the positive correlation between hypoxia and TMB that were consistently observed between the ACRG and TCGA GC cohorts, by IHC analysis of the novel endogenous hypoxia marker-ERO1A [33], in the AHJU GC cohort. Specifically, in correspondence with the resistance to postoperative CT/RCT, TMB-high stage Ib/II GC was the most hypoxic. We also showed that TMB-high stage Ib/II GC had more MSI subtype than others, indicating that hypoxia may be the primary cause for the poor response to postoperative CT/RCT indicated by immune-associated biomarkers. In addition, the TS expression, which may determine treatment outcomes and probably be up-regulated by hypoxia [17, 34], was also the strongest in TMB-high stage Ib/II GC, further stressing the potential role of hypoxia. However, the impact of hypoxia on TS expression needs further validations and mechanism researches.

Interestingly, tumor hypoxia was associated with the number of clonal but not subclonal mutations [32], indicating that hypoxia promotes early tumor evolution, prior to subclonal diversification. Together with the evidences that neoantigens based on clonal but not subclonal alternations promote anti-cancer immunity and the efficacy of immunotherapy [43, 44], these findings suggest that the selective pressure of hypoxia on tumors during their early period may determine immune infiltration later. In our study, TMB positively correlate with tumor hypoxia and immune infiltration at the same time, indicating a positive correlation between tumor hypoxia and immune infiltration, which may partly validate this hypothesis.

It is also interesting to observed that hypoxia seemed to be more severe in stage Ib/II tumors than in stage III tumors. Previously, a peak of tumor hypoxia at early stage of tumor development are also indicated in studies [45, 46]. A possible explanation is the fast growth of tumor cells while the development of tumor blood vessels is inadequate at early stage. Besides, the tumor microenvironment is ever changing during tumor development, and metabolic reprogramming can drive glucose metabolism of cancer cells to shift from “the Warburg effect” to “the reverse Warburg effect” [47]. The latter will consume lactate accumulated in stroma and may ameliorate tumor hypoxia.

The main limitations of this study include the small sample size, the retrospective analyses in patients who were selected without randomization from heterogeneous cohorts, and other confounding factors such as unknown surgical style and margin status in many patients from ACRG and TCGA. The strengths of this study are that we obtained consistent findings from independent cohorts, although they differ greatly in population, gene assays and sample sizes.

In conclusion, the present study revealed that TMB was a prognostic factor in resected GC and also an efficacy predictor of postoperative CT/RCT in stage Ib/II subgroup. TMB-associated immune infiltration may determine the favorable prognosis of TMB-high GC while TMB-associated hypoxia may reduce the efficacy of postoperative CT/RCT in TMB-high GC especially in the stage Ib/II subgroup. The addition of TMB to biomarkers of resected GC may be advantageous to decrease unnecessary postoperative therapy, although prospective validations are necessary.