Introduction

The standard therapy for locally advanced rectal cancer (LARC) is neoadjuvant chemoradiotherapy (NCRT) followed by total mesorectal excision (TME) [1, 2]. The response to NCRT varies among patients, which could affect further treatment decisions and prognosis of patients [3]. There are many ways to classify tumor response to NCRT, and pathological stage has always been considered as the simplest and most robust prognostic indicator of LARC [4]. Besides, carcinoembryonic antigen (CEA) [5], tumor differentiation [6], pathologic type, and perineural invasion status (PNI) [7,8,9] are also been confirmed to be main prognostic factors and were strongly correlated with patients outcomes. Recently, the neoadjuvant rectal (NAR) score has been recommended as a primary endpoint to assess preoperative treatment efficacy in clinical trials of rectal cancer [10]. The National Comprehensive Cancer Network (NCCN) guideline recommends tumor regression grade (TRG) should be a potential role to stratify tumor response to NCRT. TRG is defined as the ratio of fibro-inflammatory tissue to residual tumor cells, ranging from TRG 0 (pathologic complete response (pCR); no viable cancer cells) to TRG 3 (poor response; minimal or no regression, extensive residual cancer) [11]. It was reported that patients with TRG 0 have excellent local and distant disease control rates [12, 13]. However, less than one-third patients achieve TRG 0 after NCRT, and the majority of patients exhibit an intermediate response with varying residual tumor infiltration [14]. We wonder whether patients with different TRGs also have different prognosis.

Consequently, the purpose of this study is to assess the prognostic significance of clinicopathological features in patients with LARC treated with NCRT and TME. Furthermore, this study aims to compare the prognostic factors in patients with different TRGs.

Methods

Patients

Patients were included in the study if the following criteria were met: rectal adenocarcinoma proved with pathology, pretreatment clinical stage II or III (T3–T4, or N+) confirmed by MRI, received NCRT and TME in our hospital between 2010 and 2022. Patients with concurrent malignancy at another site were excluded. This study was approved by the institutional review board and adhered Strengthening the Reporting of Observational studies in Epidemiology (STROBE).

Neoadjuvant chemoradiotherapy

Neoadjuvant chemoradiotherapy included standardized concurrent chemoradiotherapy and total neoadjuvant therapy (TNT). The neoadjuvant pelvic IMRT (intensity-modulated radiation therapy) or VMAT (volumetric modulated arc therapy) with dose of 50–50.4 Gy/25–28 fractionations were delivered. Standardized concurrent chemotherapy was administered using capecitabine. TNT included at least 4 cycles of CAPOX (capecitabine plus oxaliplatin) before TME. The intervals between the completion of radiotherapy and TME are 5–8 weeks.

Clinicopathological factors

The following parameters were evaluated as potential prognostic factors in this study: age at the time of diagnosis (≥ 60 or < 60 years), gender, pathologic TNM stage, T downstaging, baseline CEA, NAR score, TRG, and the pathologic status of PNI, lymphovascular space invasion (LVI), and CRM. The NAR score was calculated using the formula [5 pN-3(cT-pT) + 12]2 / 9.61, and classified as low (˂8), middle (8–16), and high (>16) [15]. TRG of the surgical specimens was performed according to the guidelines of the NCCN. Additionally, T downstaging was defined as ypT stage lower than cT stage. The clinical T stage and N stage were evaluated by a radiologist with 5 years of experience on the pelvic MRI, and reviewed by a radiologist with 10 years of experience. In the event of any inconclusive result, a senior radiologist with 20 years of experience was asked to confirm it. All pathologic information was determined by a pathologist with 10 years of experience. If the results were uncertain, a second pathologist with 20 years of experience will be consulted to provide a decision. Any discrepancies will be resolved according to the consultation. In order to minimize the potential for information bias, all eligible patients were included within the specified time period. And in the absence of the primary effectiveness evaluation index due to the failure to observe the case data throughout the follow-up process, the worst observation carried forward (WOCF) can be employed to carry forward the data. Furthermore, the absence of secondary effectiveness evaluation indicators is not carried forward.

We first identified predictive factors for overall survival (OS) and disease-free survival (DFS) in all eligible patients, and then evaluated prognostic factors in patients with different TRGs.

Statistical analysis

Data were summarized as frequencies and percentages for categorical variables and as means and ranges for continuous variables. Categorical variables were evaluated using the chi-square or Fisher’s exact test, as appropriate. Continuous variables were assessed via the analysis of variance (ANOVA) test. The correlation between variables was evaluated for statistical significance using regression test, as necessary. The primary endpoints of the analysis were DFS (time from the date of diagnosis to the date of local recurrence, distant metastasis, or death) and OS (time from the date of diagnosis to the date of death). The survival analysis was based on the comparison of the Kaplan-Meier model with the log rank test. Univariate and multivariate analyses were performed using the Cox proportional hazards regression model [backward elimination (conditional)]. Subgroup analyses were conducted according to TRGs. IBM SPSS Statistics Version26 was used for all analysis. The significance was settled at a P-value ˂ 0.05 as usual.

Results

Patient characteristics

A total of 393 patients underwent NCRT followed by TME in our hospital during this decade, of which 219 patients (55.7%) received TNT and anther 174 patients (44.3%) received standard concurrent chemoradiotherapy. The number and proportion of patients at all levels of TRG were as follows: TRG 0 (83, 21.1%), TRG 1 (104, 26.5%), TRG 2 (179, 45.5%), TRG 3 (27, 6.9%). The average NAR score was 11.60, ranging from 0.00 to 50.36, and for NAR score classification, 139 patients (35.4%) had low score, 166 (42.2%) had middle score, and 88 (22.4%) had high score. Most patients (291, 74.0%) achieved T downstaging after NCRT. The clinicopathological parameters of all patients are shown in Table 1.

Table 1 Clinicopathologic characteristics of all patient (n=393)

The characteristics of patients with different TRGs are summarized in Table 2. As TRGs increased, the pathologic stage and NAR score became worse, and the probability of PNI and LVI was also higher. The average NAR score for TRG 0, TRG 1, TRG 2, and TRG 3 patients was 0.94, 11.21, 15.56, and 19.61, respectively (P ˂ 0.001). Additionally, only five of the TRG 1 patients exhibited positive lymph nodes, while over half of the TRG 3 patients were found to have positive lymph nodes following surgery. About 20% and 30% of cases presented with pathologic positive lymph nodes in patients with TRG1 and TRG2, respectively. Moreover, among TRG 1 patients, 50% were dragonized with ypT1-2 and ypT3-4, respectively. In contrast, a significantly greater proportion of TRG 2 patients were diagnosed with ypT3-4 than ypT1-2 (70.9% vs. 29.1%). None of the TRG 0 patients showed PNI or LVI, but in patients with TRG 3, more than 40% of them developed PNI, and nearly 26% of them experienced LVI. Furthermore, almost all TRG 0 patients exhibited low NAR scores, whereas over half of TRG 3 patients demonstrated high NAR scores.

Table 2 Clinicopathologic characteristics of patients with different TRGs

Oncological outcomes of all patients

Over a median follow-up period of 39 months (range, 3–128 months), the 5-year OS rate and 5-year DFS rate for all patients were 89.2% and 71.5%, respectively. Univariate analysis revealed that ypT stage, ypN stage, TRG, and NAR score were significant predictors for both OS and DFS, as shown in Table 3 and Fig. 1. Besides, positive CRM was associated with poor OS, and the presence of LVI and abnormal CEA had negative impacts on DFS. All significant variables from the univariate analysis were entered into a multivariate Cox regression model (Table 4). The results showed that the NAR score was an independent predictor of both DFS (HR = 1.971, 95% CI = 1.478–2.628, P ˂ 0.001) and OS (HR = 4.040, 95% CI = 1.792–9.111, P = 0.001). Patients with lower NAR scores showed superior long-term survival outcomes (low score vs. middle score vs. high score: 5-year OS rate: 98.2%, 88.8%, 72.1%; 5-year DFS rate: 87.1%, 70.1%, 48.4%).

Table 3 Univariate analyses for DFS and OS of all patients
Fig. 1
figure 1

Survival curves of 393 patients with LARC after NCRT followed by TME surgery. A Correlation of TRG with 5-year OS rate. B Correlation of TRG with 5-year DFS rate. C Correlation of NAR score with 5-year OS rate. D Correlation of NAR score with 5-year DFS rate. TRG, tumor regression grade; NAR, neoadjuvant rectal; OS, overall survival; DFS, disease-free survival

Table 4 Multivariate analyses for DFS and OS of all patients

Risk factors for DFS and OS in patients with different TRGs

The 5-year OS rates for TRG 0–3 were 96.9%, 91.1%, 85.2%, and 68.8%, respectively (P = 0.001). The 5-year DFS rates also showed the same trend (80.8%, 72.4%, 67.0%, 55.8% for TRG 0–3, respectively, P = 0.031). There was no significant difference between patients with TRG 0 and TRG 1 in OS (P = 0.393) or DFS (P = 0.286). However, patients who showed highly sensitive to NCRT (TRG 0–1) had higher 5-year DFS rates than patients with low sensitivity to NCRT (TRG 2–3): 76.1% vs 65.7% (P = 0.021).

In the subgroup analyses of different TRGs, the results showed interesting (Fig. 2). In univariate analysis, NAR score, ypT stage, and ypN stage were all relevant prognostic predictors of DFS for patients with TRG 1 and TRG 2. While for TRG 0 and TRG 3 patients, none of these clinicopathologic showed correlation with clinical outcomes. In multivariate analysis, ypT stage was the independent factor of DFS for TRG1 patients (HR = 4.940, 95% CI = 1.802–13.544, P = 0.002), and ypN status manifested as the independent factor of DFS for TRG 2 patients (HR = 2.793, 95% CI = 1.533–5.086, P = 0.001).

Fig. 2
figure 2

Survival curves of patients with different TRGs. A Correlation of ypT stage with 5-year OS rate in TRG 1 patients. B Correlation of ypT stage with 5-year DFS rate in TRG 1 patients. C Correlation of ypN stage with 5-year OS rate in TRG 2 patients. D Correlation of ypN stage with 5-year DFS rate in TRG 2 patients. TRG, tumor regression grade; OS, overall survival; DFS, disease-free survival

Discussion

Tumor response to NCRT can vary widely from complete absence of viable cancer cells in the primary site to total absence of tumor regression. In the current study, we investigated the prognostic impact of the clinicopathological features in LARC patients who underwent TME after NCRT, and the results showed that NAR score was the independent impact indicator for OS and DFS. However, subgroup analysis based on TRGs revealed different results. The independent factor for patients with TRG 1 was ypT stage, while for patients with TRG 2, ypN status was the dominant prognostic indicator. Moreover, for patients with TRG 0 and TRG 3, none of the clinical or pathologic characteristics was found to be associated with tumor outcomes. It is speculated that the reason for this may be due to the characteristics of patients with TRGs vary considerably. Patients with TRG 0 generally show favorable OS and DFS, while patients with TRG 3 are more likely to experience recurrence or metastasis. Another possible reason is that the number of patients with TRG 0 and TRG 3 is relatively small. In addition, we suspected that the pathological features of TRG 1 and TRG 2 patients may result in different prognostic factors. As most patients with TRG 1 did not present with positive lymph node, the prognostic impact of ypT stage was particularly significant. Conversely, most patients with TRG 2 were ypT3-4 stage, so the presence of positive lymph nodes had a crucial impact on outcomes. As far as we know, this is the first study to conduct a prognostic analysis of patients with varying TRGs.

Numerous published papers have mentioned the relation between TRG and patient’s outcome [16,17,18]. A cohort study showed significant differences in patient outcomes between any two TRG categories [19]. Previous reports indicate that 12–20% of patients achieved pCR after NCRT and had a favorable prognosis [20], which was similar to our study. However, excellent outcomes have also been found in patients with near pCR, and even comparable to those with pCR. Huh et al. [18] analyzed survival differences among various TRG grades, and they found that the 5-year OS rates of patients with TRG 0 and TRG 1 were similar (98% vs. 91%), and were significantly higher than that of patients with TRG 2 and TRG 3 (79%, P ˂ 0.001). Similar to their results, no significant difference was found in survival between patients with TRG 0 and TRG1 in our study (5-year OS rate: 96.9% vs. 90.9%, P = 0.381; 5-year DFS rate: 80.8% vs. 5.8%, P = 0.531), whereas high responders (patients with TRG 0–1) still had higher DFS than low responders (patients with TRG 2–3) (5-year DFS rate: 78.1% vs. 65.3%, P = 0.006). Likewise, Huebner et al. [21] also considered that partial pathologic response was a more superior predictor than pCR. However, inter-observer variation and different quantifying systems leaded to inconsistent results on the prognostic value of TRG [12]. Margherita and Hendrik et al. [22] suggested near pCR could not translate into good clinical prognosis.

Since the 1970s, the American Joint Committee on Cancer (AJCC) TNM staging system had been used worldwide for treatment selection in rectal cancer [23]. As some investigators have reported, pathologic TNM staging after surgery can provide valuable prognostic information on disease relapse and survival, and identify high-risk patients for additional postoperative adjuvant therapy [24,25,26]. Exactly, in our study, the ypT and ypN stage indeed played important role in the prognosis of TRG 1 and TRG 2 patients. Delitto et al. [27] indicated that the pathologic stage could dictate survival after NCRT for LARC patients. They found that stage III patients downstaged to ypT1N0 disease showed equivalent outcomes to patients with early-stage cT1N0 disease who underwent surgery directly. Cho et al. [28] identified the ypT stage and the presence of LVI as the independent prognostic indicators of pool DFS rates. Besides, the status of pathologic lymph nodes was considered to correlate with patient outcomes as well, in particular, those who developed with lateral lymph node (LNN) metastases generally had worse prognosis [29, 30]. In a study by Kim et al. [31], clinicopathologic factors were compared in 420 patients, and the results showed that ypN stage was the most important indicator for predicting DFS rather than TRG. In addition, Yokoyama et al. [32] demonstrated that the histology in pathologic positive lymph nodes may also be associated with survival, which could provide novel insights in LARC research.

One of the most noticeable changes before and after NCRT is T downstaging, which can be used to evaluate the tumor’s response to NCRT [33,34,35]. While some studies have suggested that T downstaging may not be associated with therapeutic efficacy or prognosis, it is important to consider the potential impact of this factor on patient outcomes [36, 37]. Mills et al. [38] found that T downstaging was not prognostically significant, while patients with TRG 3 were associated with inferior DFS; thus, they considered response categorization discrepancies may be partly explained by alternative patterns of residual disease, including tumor fragmentation. Similar to their conclusions, in the survival analysis of any group of patients, we did not find any correlation between T downstaging and OS, or DFS. Additionally, we found that ypT stage was the independent factor of DFS in patients with TRG 1, and it also demonstrated that the tumor response to neoadjuvant chemoradiotherapy may impact the prognosis of patients. It can be observed that, in the same situation of only a small proportion of residual tumor, patients with a concentric regression pattern (e.g., ypT1–2) have a more favorable prognosis than patients who with fragmentation pattern (e.g., ypT3–4).

The NAR score is another indicator reflecting the variation in tumor stage before and after NCRT, which is calculated based on the Valentini’s nomograms for OS [15]. The formula incorporated a weighted combination of the cT stage, ypT, and ypN stage. In the NSABP R-04 clinical trial, continuous NAR score has been validated to be significantly associated with OS. And NAR score was further classified as low (NAR < 8), middle (NAR = 8–16), and high (NAR > 16), and the results showed lower NAR score was associated with better OS. Moreover, NAR score has also been demonstrated to be a prognostic role and an ideal surrogate endpoint for DFS in the CAO/ARO/AIO-04 randomized phase 3 trial [10]. In addition, in a retrospective study of 1172 LARC patients, the NAR score was proven to outperform pCR in predicting OS. Meanwhile, many studies indicated that higher NAR scores tended to associated with fewer pCRs, lower TRGs, and more advanced pathologic stage, which from the side that [26, 39]. Our study found that higher NAR scores were associated with worse patient survival and were the most dominant factor. More interestingly, Sert et al. [40] not only evaluated the role of NAR scores in predicting outcomes, but also compared it with TRG. According to their research results, NAR score adds the value of tumor changes before and after NCRT in addition to ypT and ypN status. Therefore, it is suggested to use NAR score to predict patient survival.

There were some limitations of the present study that deserve consideration. Firstly, this was a single center retrospective study, and a prospective approach would be more valuable. Secondly, inter- and intra-observer variability of TRG classification was not taken into account, but this deficiency existed in most studies. Thirdly, the sample size is relatively small in subgroup analyses, which may result in a lack of statistical results.

In conclusion, this study evaluated the prognostic effects of different factors on long-term survival for LARC patients who received NCRT and TME. To date, there was no consensus on the effective predictors of prognosis for LARC patients received NCRT and TME. However, it is undeniable that the NAR score, TRG, and pathological stage are vital factors in prognostic evaluation, and patients with different TRGs may need to consider diverse prognostic predictors.