Introduction

Perioperative chemotherapy (PC) with radical surgery represents the gold standard of treatment for resectable locally advanced gastric cancer (GC). GC causes a large number of health and economic burdens worldwide and it is the second most common diagnosed cancer and leading cause of cancer-related death in China [1]. About 60% of the patients in China are diagnosed as locally advanced or metastatic diseases. Radical surgery is still the most promising therapy for resectable GC. To increase the rate of radical resection and lower the odds of distant metastasis or local recurrence, preoperative chemotherapy, including neoadjuvant chemotherapy and conversion chemotherapy, is widely used. Some studies have proved that preoperative therapy could improve long-term survival [2,3,4]. Both the Magic Trial and FNCLCC/FFCD study showed a survival benefit of perioperative chemotherapy, compared with surgery alone [2, 5]. More recently, these results were supported by the multicentric RESOLVE trial, which found that perioperative chemotherapy had a significantly better disease-free survival than adjuvant chemotherapy [6]. Even for stage IV patients, some of them may receive radical surgery after conversion chemotherapy. So, it is important to evaluate the effectiveness of preoperative chemotherapy. The most widely used criteria for evaluating tumor response to certain therapy in clinical practice is the Response Evaluation Criteria in Solid Tumors (RECIST) [7, 8]. Besides, to evaluate local tumor response histologically after chemotherapy, another approach called tumor regression grade (TRG) is often used. Though there are several TRG systems and no international standard, several studies have indicated the correlation between TRG and prognosis [9,10,11,12]. However, in clinical practice, we found that the results of TRG and RECIST were not always consistent. We wonder which method performs better in predicting the prognosis. Therefore, we designed the present study to verify the consistency of TRG and RECIST criteria and compare the prognostic value of TRG and RECIST 1.1 to preoperative therapy in GC patients.

Methods

Patient selection

In this study, the data of gastric cancer patients who received preoperative therapy between December 2013 and October 2021 at Sun Yat-sen University Cancer Center were retrospectively collected. The inclusion criteria were: 1. had received chemotherapy or chemotherapy combined with immunotherapy before surgery; 2. underwent gastrectomy surgery. The exclusion criteria were as follows: 1. underwent preoperative radiotherapy; 2. History of another malignancy within the last 5 years except cured basal cell carcinoma of skin and cured carcinoma in situ of the uterine cervix. Our study was performed in accordance with the Declaration of Helsinki protocols and was approved by the local Ethics Committee of SYSUCC and waived patient-specific consent.

Evaluation system for tumor regression grade and RECIST

The overall survival (OS) was calculated from the date of diagnosis to death or the final follow-up date. The disease-free survival (DFS) was calculated from gastrectomy surgery to death or the final follow-up date or the date of recurrence/metastasis. The failure events of the survival analysis were death for OS and tumor recurrence or death for DFS. The censoring data was defined as not having observed a failure event and not knowing the exact survival time.

This study applied the National Comprehensive Cancer Network (NCCN) classification system to assess the tumor regression grade (TRG). TRG was defined by the percentage of viable cancer cells in the resected primary tumor: TRG 0 = complete response, no viable tumor cells including lymph nodes; TRG 1 = near complete response, single cancer cells or rare small groups of cells; TRG 2 = partial response, residual tumor cells with obvious tumor regression but the amount of cancer cells is more than that in TRG 1; TRG 3 = poor or no response, extensive residual cancer with no evident tumor regression [13, 14]. According to previous studies, pathological response was defined as TRG = 0 or 1 [15].

RECIST version 1.1 were applied in the present study: CR (complete response) means disappearance of all target lesions, PR (partial response) means the sum of the diameters of all targets lesions deceases ≥30%, PD (progressive disease) means the sum of the diameters of all targets lesions increases ≥20%, SD (stable disease) means insufficient shrinkage to qualify for PR or insufficient increase to qualify for PD) [7]. Patients who got CR or PR were categorized as clinical response (effective group) [16].

Statistical analysis

SPSS ver. 22.0 software (IBM Corp., Armonk, NY) was used to perform statistical analysis in this study. Consistency of TRG and RECIST were analyzed using kappa statistics (kappa ≦ 0.40, poor agreement; 0.40 < kappa ≦ 0.60, moderate agreement; 0.60 < kappa ≦ 0.80, good agreement; and kappa > 0.80, excellent agreement) [17]. OS and DFS was estimated using the Kaplan-Meier method. The OS and DFS curves for different TRG groups and different RECIST groups were compared using the log-rank test. Chi-Square test or Fisher’s exact test were used as appropriate to compare the different neoadjuvant therapies as well as clinicopathological characteristics associated with different TRG groups and different RECIST groups. P values less than 0.05 were considered significant statistically.

Results

Clinicopathological characteristics

We finally collected a total of 157 patients with preoperative treatment and gastrectomy in the present study. 125 (79.6%) patients were administered preoperative chemotherapy, and 32 (20.4%) patients received preoperative chemotherapy combined with immunotherapy. Most patients were male (65.6%), and their median age was 64 years old (ranging from 24 to 78). Most primary tumors (72.0%) were gastric cancers, and 28.0% were in the esophageal-gastric junction. Five patients (3.2%) were mismatch repair deficient (dMMR), and 10 (6.4%) were human epidermal growth factor receptor-2 (HER-2) positive. Fifty patients (31.8%) were classified as diffuse type and 60 (38.2%) were classified as intestinal type.

The TRG scores were as follows: TRG 0 (n = 24, 15.3%); TRG 1 (n = 29, 18.65%); TRG 2 (n = 60 38.2%); TRG 3 (n = 44, 28.0%). Only 56 patients had measurable lesions, including PD (n = 1, 1.8%), SD (n = 25, 44.6%) and PR (n = 30, 53.6%). The patient characteristics are listed in Supplementary Table 1.

In all 157 patients, 44 (28.0%) showed progression, including local recurrence and distant metastasis. According to RECIST 1.1, 7 of them achieved PR after preoperative therapy and 13 got SD. The rest could not be evaluated since they had no targeted lesion. Of the 44 patients with progression after surgery, no one got pCR after preoperative therapy, 7 got TRG 1, 19 got TRG 2, and 18 got TRG 3.

Patients who got SD or PD after preoperative chemotherapy tended to have peritoneal metastasis (P < 0.05, Table 1). TRG 2 or 3 was associated with younger age, advanced clinical stage at diagnosis, and liver or peritoneal metastasis (All P < 0.05, Table 2).

Table 1 Clinicopathologic variables associated with RECIST 1.1
Table 2 Clinicopathologic variables associated with TRG

Consistency analysis of TRG and RECIST

We considered TRG 0 or 1 as effective and TRG 2 or 3 as ineffective. Likewise, we regarded RECIST CR or PR as effective and SD or PD as ineffective. Thirty one patients got consistent results in TRG and RECIST 1.1 (11 were both effective and 20 were both ineffective, respectively). Ninteen patients got effective in RECIST 1.1 and ineffective in TRG, and another 6 patients got effective in TRG and ineffective in RECIST 1.1. The kappa value was 0.132, which showed poor consistency. Considering the patients were with different TNM stages, we classified them as neoadjuvant therapy group and conversion therapy group. There were 123 patients (78.3%) in neoadjuvant therapy group and 34 patients (21.7%) in conversion therapy group and we analyzed the agreement between TRG and RECIST separately. The kappa values were 0.089 and 0.048 in the preoperative chemotherapy group and conversion therapy group respectively, which showed poor consistency.

Considering that TRG scores focus on regression of primary lesions, which could not be considered as target lesions in the RECIST system, we retrospectively calculated the change of thickness of primary lesions after preoperative therapy and compared the changes with their TRG scores in 80 patients. When using 30% shrinkage of thickness as the cut-off point, the change of thickness on CT can predict the TRG score better (kappa value = 0.263, P = 0.02).

Survival analysis of TRG and RECIST

Till Jan 28, 2022, 9 (17.0%) of 53 patients in the TRG effective group had disease progression, compared with 31 (29.8%) of 104 patients in the TRG ineffective group (P = 0.06). The median DFS was not reached vs. 16.13 months in TRG effective and ineffective group, respectively (P = 0.03). The median OS of both TRG 0–1 group and TRG 2–3 group were not reached (P = 0.02). The DFS and OS were significantly better in the TRG effective group (Fig. 1).

figure 1

Fig. 1 Kaplan-Meier curves for different TRG groups. (A) Disease-free survival of TRG effective group and TRG  ineffective group, P=0.03. (B) Overall survival of TRG effective group and TRG ineffective group, P=0.02

Seven (23.3%) of 30 patients in the RECIST 1.1 effective group had disease progression, compared to eight (30.8%) of 26 patients in the ineffective group, P = 0.53. The median DFS for patients in the RECIST effective/ineffective group was 24.93 months and 11.60 months, respectively. The responses according to the RECIST assessment were not significantly correlated with either OS or DFS (P = 0.86 and 0.23, Fig. 2).

figure 2

Fig. 2 Kaplan-Meier curves for different RECIST groups. (A) Disease-free survival of RECIST effective group and RECIST ineffective group, P=0.23; (B) Overall survival of RECIST effective group and RECIST ineffective group, P=0.86

Furthermore, we divided patients with target lesions into 4 groups according to their responses in TRG and RECIST 1.1: both effective, only effective in TRG, only effective in RECIST 1.1, and both ineffective. Their OS and DFS were shown in Supplement Fig. 1. Patients who were only effective in TRG tended to have a better prognosis than those who were only effective in RECIST, though the difference was not significant (not reached vs. 24.93 months for DFS, respectively, P = 0.52;not reached vs. not reached for OS, P = 0.75) (Supplement Fig. 2).

Considering the preoperative treatment was sorted into neoadjuvant therapy and conversion therapy, we analyzed the survival according to TRG and RECIST respectively in each subgroup. It showed that the response of TRG tended to correlate with survival in both subgroups, though the P values were not significantly different (Supplement Fig. 3).

Comparative analysis between different neoadjuvant therapies

Patients in the chemoimmunotherapy group had a higher pathological complete response (pCR) rate than those in the chemotherapy group (34.4% vs 8.0%, respectively, p = 0.001). The median DFS was 19.53 months and not reached for patients who received chemotherapy or chemoimmunotherapy, respectively. But the difference of DFS and OS between these two groups were not statistically significant (P = 0.91 for both OS and DFS) (Supplement Fig. 4). Patients who received chemoimmunotherapy also had a higher response rate (TRG 0 or 1) than those who only received chemotherapy (50.0% vs 29.6%, respectively, P = 0.03).

Patients with pathologic complete response

A total of 21 patients (12.8%) got pCR (The follow-up time ranging from 3.13 months to 34.17 months). Among them, 5 (23.8%) got PR according to RECIST 1.1, and the rest had no target lesions. Five patients did not receive any treatment after surgery, and none of them suffered from recurrence or metastasis with. Seven patients (33.3%) had received chemotherapy combined with immunotherapy after surgery, and one of them had peritoneum metastasis 4 months after surgery. One patient (4.8%) had received immunotherapy alone. Eight patients (38.1%) had received postoperative chemotherapy (Supplement Table 2).

Discussion

In the present study, we found that TRG and RECIST 1.1 got poor consistency. The possible reasons for this inconsistency may include: firstly, the evaluation targets of these two systems are different. TRG mainly evaluates primary lesions, while RECIST 1.1 focus on measurable target lesions (for example, lymph nodes, liver metastasis), which excluded primary gastric lesion. Hence, we tried to compare TRG with the degree of shrinkage of primary lesions on computed tomography (CT), which showed better consistency with TRG. Secondly, unlike other malignancies, gastric cancer is a tumor with high heterogeneity, which may lead to different responses to therapy between the primary site and lymph nodes or distant metastatic lesions. Thirdly, due to fibrosis, necrosis, or edema after preoperative therapy, the size of lesions may not change on imaging [18]. Finally, different imaging devices and measurement errors might also contribute to the inconsistency.

Furthermore, we found that TRG was a better prognostic factor than RECIST 1.1 for GC patients who received preoperative therapy. The TRG score for 84.1% of patients who developed disease progression was 2 or 3. Some previous studies demonstrated that TRG was a reliable prognostic factor for OS and DFS [19, 20]. Although the TRG systems they used were different from ours, their findings persuasively supported the conclusions of our present study. For patients who had inconsistent results between TRG and RECIST 1.1, we found that patients who were only effective in TRG tended to have better OS than those who only showed effectiveness in RECIST 1.1. Due to the small sample size, the difference was statistically insignificant. A larger sample size study was needed to confirm our findings.

Nowadays, perioperative chemotherapy has become the standard treatment for locally advanced GC patients, however, we found that only 35.7% (56/157) of the patients from our study had measurable lesions. About two-thirds of the patients could not be effectively evaluated by RECIST 1.1. Given the low consistency between RECIST 1.1 and TRG, the poor prognostic value and the limited application of RECIST1.1, novel parameters for imaging evaluation were warranted to better predict the response of preoperative therapy. A previous study found that Tumor Volume Reduction Rate (TVRR) was significantly associated with TRG in locally advanced rectal cancer patients [21]. CT perfusion parameter value was also found to improve the accuracy of response evaluation in esophageal cancer patients treated with neoadjuvant chemoradiotherapy [18]. Were there any feasible solutions to reduce those differences between radiology and pathology? Some attempts were made in the present study. We tried to take primary tumor into account and regarded patients whose gastric tumor had shrunken over 30% together as effective. The degree of consistency with TRG had risen, though not very strong. It was possible that the edema or fibrosis of the gastric wall after the preoperative treatment affected the measurement of thickness change. A similar comparison of the esophageal wall thickness after the preoperative treatment was reported in other available studies [18, 22, 23]. To sum up, after putting primary tumors into consideration, the sensitivity and accuracy of efficacy prediction could be higher than only using RECIST evaluation.

The rate of pCR was 12.8% in our study, but none of them was considered as CR by RECIST 1.1 before operation. A study demonstrated that FDG-PET/CT may be more accurate for the prediction of pCR [24]. pCR rate was significantly higher in patients who received chemoimmunotherapy than those with only chemotherapy. Chemoimmunotherapy can increase the ORR by 10–20% compared with chemotherapy alone in the first line setting for metastatic GC patients [25,26,27]. The pCR rate in neoadjuvant immunotherapy was higher than that in neoadjuvant chemotherapy in gastric cancer [28,29,30]. Another phase-2-study in the similar field called VESTIGE concluded that the combination of proper doses of immunotherapy can bring promising response rates in GC patients [31]. The value of chemoimmunotherapy in preoperative therapy still needs to be verified in prospective, randomized, controlled clinical trials.

Till now, there is no consensus on postoperative treatment for pCR patients. A recent cohort study revealed that gastric patients who were sensitive to preoperative chemotherapy tended to have a better prognosis if they would receive postoperative chemotherapy [32]. In our present study, we found a majority of pCR patients (76.2%) received postoperative chemotherapy or chemoimmunotherapy. Only one patient who received postoperative chemoimmunotherapy had distant metastasis 4 months after the operation. Further studies are needed to verify the role of postoperative therapy for pCR patients.

There were some inherent limitations in our study. Firstly, the sample size was small and only 56 patients had measurable lesions. Secondly, the regimens for preoperative treatment were different, including chemotherapy and chemoimmunotherapy. Finally, selection bias existed due to the specialty of the retrospective study. Despite the limitations above, we innovatively analyzed the postoperative treatment of pCR patients. Last but not least, we revealed the relationship between TRG and RECIST 1.1 directly, which was rare in previous studies.

In conclusion, our results suggested that pathological assessment (TRG) was in poor agreement with imaging assessment (RECIST 1.1), and TRG is more useful in predicting the effect of preoperative therapy and prognosis. Patients who achieved TRG 0 or 1 after surgery tended to have better OS and DFS compared to those who got TRG 2 or 3. We demonstrated that the addition of immunotherapy may increase the pCR rate than using chemotherapy alone. In addition, patients with pCR should also highly value postoperative therapy. Further studies with a larger sample size are needed to confirm our findings.