1 Introduction

Nowadays neoadjuvant chemoradiation (nCRT) is universally considered to be a valid treatment to achieve downstaging, to improve local disease control and to obtain better resectability in locally advanced rectal cancer (LARC) [1]. Currently, about 15–30% of patients undergoing neoadjuvant treatment achieve a pathologic complete response (pCR) with improved oncological outcomes [2]. A major challenge for the surgeon is to determine the course of treatment for those patients with pCR after nCRT [3,4,5]. In fact, the difficult choice between surgery (radical rectal resection or trans-anal local excision of the residual scar) and the ‘wait-and-see’ strategy does not provide a definitive or widely-applicable answer. The pCR is defined through endoscopy with biopsy and through radiological studies. However, a more reliable histopathologic response is obtained only after the resected specimen analysis [6, 7]. For this reason, it is important to develop methods to identify an early prediction of histopathological response in patients with LARC after nCRT. The early determination of pathological response to nCRT is important not only for prognostic value but also to change or adapt the standard nCRT strategy for those patients with a suboptimal pathologic response [8]. At this time, 18F-FDG PET-CT is one of the most powerful tools in cancer diagnosis and staging. It combines a positron emission tomography scanner (PET) and an X-ray computed tomography (CT) scanner, so that images acquired from both devices can be taken sequentially. The spatial distribution of metabolic or biochemical activity in the body can be more precisely aligned or correlated with anatomic imaging obtained by CT scanning. It is well known that nCRT induces changes in tumour metabolism that influence the 18F-FDG PET-CT tumour uptake [9].

2 Materials and methods

2.1 Patient population

We performed a retrospective analysis of patients with LARC diagnosis who underwent curative resection. The study was carried out at the General Surgery Department of the Sanchinarro University Hospital, Madrid, recruiting patients between October 2011 and February 2020 with LARC. The LARC patients are defined using radiological staging T3-4 e/o N + rectal tumour with a distance from the anal verge ≤ 10 cm. Patients who are younger than 18 years old, with concomitant metastases, perforated tumours, peritoneal carcinomatosis or comorbidities precluding surgery and neoadjuvant therapy with rectal stent, are excluded from the analysis. All included patients received nCRT, and surgical treatment was carried out 8–12 weeks after completing the neoadjuvant treatment. The preoperative study included a colonoscopy with biopsy to confirm rectal adenocarcinoma. Trans-anal endorectal ultrasound, pelvic MRI (magnetic resonance imaging), total body CT scan, and PET-CT scan were used as diagnostic and staging procedures. For the tumoral stage, the TNM staging system was used (American Joint Committee on Cancer) [10].

Pathologic tumour staging of the specimen after nCRT (ypTNM) and tumour regression grade (TRG) scores of the surgical specimens were established using the Seventh Guidelines of the American Joint Committee on Cancer [11]. TRG0 indicates a pathological complete response, TRG1 (moderate response) consists of single cells or small groups of cancer cells, TRG2 (minimal response) indicates residual cancer outgrown by fibrosis, and TRG3 (poor response) shows extensive residual cancer. Circumferential resection margin (CRM) is considered negative if the distance between the tumour and CRM is more than 1 mm [6]. Patients were divided into two groups according to the TRG: the responders group (TRG0-TRG1) and the no-responders group (TRG2-TRG3). Furthermore, we evaluated the prognostic value of 18F-FDG PET-CT in terms of disease-free survival (DFS) and overall survival (OS) in patients with LARC. Following nCRT, patients were reassessed to evaluate the tumour’s response 6 weeks after completing the treatment. The re-evaluation study included a full physical examination, MRI, CT-scan, and PET-CT scan. All resected specimens were examined by an experienced team of gastrointestinal pathologists. We excluded patients with stage IV and patients who were lost during follow up.

2.2 Preoperative chemoradiation (nCRT)

All patients were treated with preoperative intensity-modulated radiotherapy (IMRT) and an integrated-boost chemoradiation scheme. The planning target volume (PTV) included the presacral node, the tumour, the complete mesorectal fascia, and common and internal iliac lymph nodes. Radiation therapy was completed within 4–5 weeks, with a total of 23 fractions. The dose for the first PTV was 46 Gy in 23 fractions, and the concomitant boost (PTV2) was 57.5 Gy in 23 fractions (BED = 71.8 Gy[α/β = 10] and Eq. 2 Gy/f = 70 Gy). All patients received concurrent standard capecitabine-based chemotherapy (825 mg/m2, in bid) and a blood count was performed every 14 days. None of the patients included in the study received a short-course of CRT. All patients underwent a surgical procedure, including a low anterior resection or abdominoperineal excision 6–8 weeks after the completion of treatment. All patients were operated on by the same team of surgeons and with mechanical bowel preparation. Our study adhered to the STROBE recommendation for observational studies.

2.3 18F-FDG PET-CT analysis

All patients underwent a baseline 18F-FDG PET-CT scan within the week prior to the initiation of the nCRT (PET-CT SUV1) and a second scan (PET-C T SUV2) within 6 weeks of the completion of nCRT. The 18F-FDG PET-CT was performed with the advance whole-body PET scanner in 3D mode, with axial spatial resolution of 4.7 mm. CT-based attenuation and decay correction were done. Patients were fasting six hours before they underwent PET-CT, although water intake was encouraged. Before entering the scanner, patients were invited to drink water (1 l) and then void their bladder. Fasting serum glucose levels were checked 15 min before the FDG injection according to protocol. All patients received an intravenous injection of 18F-FDG. The exact time of injection of 10–15 mCi of 18F-FDG was recorded and imaging commenced no earlier than 45 min after the injection. Total body, caudo-cranial 18F-FDG PET-CT images were acquired 70 min after the injection of18F-FDG. PET-CT images were reconstructed from the acquired data, using the ordered subset expectation maximum iterative reconstruction algorithm. The maximum uptake value of the primary tumour was registered in all studies (baseline and after n weeks). SUVmax and SUVmean were calculated using the maximum and mean activity values with the highest radioactivity concentration in accordance with the injected dose and patient’s body weight. Changes in SUVmax values were analysed as the percentage difference from 18F-FDG PET-CT 1 (before nCRT) and 18F-FDG PET-CT 2 (after nCRT) (Δ% SUV = (\(\frac{SUVpre-SUVpost}{\mathrm{SUVpre}}100\%\))). It was evaluated in relation to the pathologic response, defined as the TRG [6].

2.4 Statistical analysis

Sensitivity, specificity, accuracy, positive predictive value (p-PV), and negative predictive value (n-PV) of post nCRT 18F-FDG PET-CT were assessed. To compare the correlation between the quantitative (numerical) variables, when these followed a normal distribution, a variance analysis and a t-Student were used. Categorical variables were assessed using a Chi-square test. The Mann–Whitney U-Test and the Kruskal–Wallis test were performed for numerical variables. A receiver-operating characteristic (ROC) curve and logistic regression techniques were used to obtain the predictive model and the inflection point. DFS and OS were calculated with the Kaplan–Meier method and log-rank test. A multivariate survival analysis for disease-free survival or overall survival was performed using the Cox proportional-hazard regression model. All variables related to the risk of DFS, or OS with a P value of < 0.2 in univariate analysis, were included in the multivariate analysis. For the statistical analysis, SPSS software (version 10, IBM SPSS, Chicago, IL, USA) was used and all tests were considered statistically significant if the value of p ≤ 0.05.

2.4.1 Ethics

The study was approved by the institutional ethical committee of Sanchinarro University Hospital, Madrid, and all patients included were informed about the treatment and provided written informed consent. The study was conducted in agreement with the Declaration of Helsinki for studies in humans.

2.4.2 Study aims

The aim of this study is to correlate the change in tumour 18F-FDG PET-CT standardized uptake value (SUV) before and after nCRT in patients with LARC, in order to differentiate between those who responded to treatment (the responders) and those who did not respond (the no-responders).

3 Results

A total of 137 patients with LARC were included and four were lost during follow-up as they underwent surgery in another centre and data could not be gathered. Therefore, 133 cases have been analysed (STROBE flowchart Fig. 1). The demographic data of the patients are summarized in Table 1. A total of 29 (22%) patients underwent abdominoperineal excision of the rectum (APER), 90 (68%) underwent a low anterior resection (LAR) with protective stoma and 14 (10%) underwent a LAR without protective stoma, as shown in Table 2. Patients were divided in two groups according to the TRG: 107 (80%) for the responders group (TRG0-TRG1) and 26 (25%) the no-responders group (TRG2-TRG3) (STROBE flowchart Fig. 1). Evaluation of the tumour response to nCRT according to TRG score is shown in Table 3. We found a primary tumour downstaging in T classification after nCRT in 85 patients (65%). Therefore, nodal downstaging after nCRT was achieved in 55 patients (42%). In our series the pathological complete response (cpR) rate was about 19%. A logistic regression analysis was performed in order to obtain a prediction model of the tumour’s pathological response (pR). Clinical prognostic factors (age, sex, TNM variables, tumoral markers, and FDG-PET values) were separately tested. From our statistical analysis, we obtained a significant difference in Δ%SUV between the two groups, the responders (Δ% SUV = 58%) versus the no-responders (Δ% SUV = 19%) (p < 0.012), as shown in Fig. 2. ROC curve preliminary cut-off value of 70% of the was individuate, as depicted in Fig. 3. This showed that Δ%SUV media was a stronger discriminator between the two groups with a high accuracy of 81% (34/42), a sensitivity of 84.4%, a specificity of 80%, a positive predictive value of 81.4% (p-PV), and a negative predictive value of 84.2% (n-PV).

Fig. 1
figure 1

Flowchart

Table 1 Patients and tumor characteristics
Table 2 Operative data
Table 3 Evaluation of the tumor response (TRG score)
Fig. 2
figure 2

Difference in Δ%SUV between the responders and no responders group

Fig. 3
figure 3

ROC (receiver operating characteristic) curve

The median follow-up period was 60.54 months (a range of 9–103 months). Patients with overall recurrence totalled 25 (18.8%) (three locoregional and 22 metastatic): 17 (12.8%) patients with delta Δ% SUV < 70% and 8 (6%) patients with Δ% SUV > 70% with a significant correlation between recurrence and Δ SUV (p = 0.037).

The median 5-years DFS was 55.3 months (a range of 9–92 months). In the group of patients with delta SUV < 70% DFS was 56.3% mean, while in patients with delta SUV > 70% DFS was 85.7%, showing a statistical significance (p < 0.05) (Fig. 4).

Fig. 4
figure 4

DFS (disease free survival) curve for the Δ SUV evaluation

In the multivariate analysis, ΔSUV results an independent risk factors, associated with local recurrence-free survival, as well as pTN stage and preoperative ASA (Table 4).

Table 4 Univariate and multivariate analysis DFS (disease free survival)

The 5-years OS (overall survival) rate result was 60% in patients Δ% SUV < 70% and 86.4% in patients Δ% SUV > 70%, showing a statistical significance (p < 0.05) (see Fig. 5). The unadjusted Cox-proportional hazards regression revealed that delta SUV and pTN were associated with worse overall survival (Table 5).

Fig. 5
figure 5

OS (overall survival) curve for the ΔSUV evaluation

Table 5 Univariate and multivariate analysis OS (overall survival)

The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.

4 Discussion

Radical surgery with TME (total mesorectal excision) remains the main curative treatment for patients affected by LARC [12]. The TME associated with nCRT improve outcomes by increasing the 5-year survival rate [9]. Several previous studies have demonstrated that, compared with adjuvant treatment, preoperative nCRT significantly improves loco-regional tumour recurrence and reduces toxicity compared with postoperative strategies [10]. The early identification of pR after nCRT in rectal cancer remains an important challenge, and could avoid surgical overtreatment without compromising local control and long-term survival [12]. It could considerably reduce the number of surgical procedures required in the future, allowing less invasive procedures, such as TAMIS (trans-anal minimally invasive surgery), to be initially performed for LARC [3]. This latter approach could also reduce the rates of mortality, morbidity and other unsatisfactory functional outcomes that may occur after rectal resection. Furthermore, an early prediction in the no-responders group could provide the clinician the opportunity to evaluate the possibility of the reorientation of the standard treatment by also increasing the number of chemotherapy cycles and radiotherapy. The current conventional radiology, such as endorectal ultrasound (EU), CT scan and MRI for monitoring the tumour response, shows several limitations to assess the pR after nCRT. This is mainly due to the difficulties in discerning between disease persistence and radiation induced inflammation and fibrosis after nCRT [13]. 18F-FDG PET-CT has a recognized validity for monitoring nCRT effects [14,15,16,17,18].

There are several studies that have investigated the predictive value of 18F-FDG PET-CT in the LARC and in recurrent rectal cancer, and additional studies are likely to be conducted using collaborative, international research platforms [18,19,20,21,22,23,24,25,26,27]. However, these studies have some limits, mainly due to the methodological heterogeneity secondary to their multicentric nature in terms of preoperative studies, chemoradiotherapy, patients’ characteristics and the PET study method (timing, technique and analysis of images). Despite these limits, it is important to note that almost all of these previous studies identify a significant correlation between tumour 18F-FDG uptake and pR, and a correlation between OS and DFS [13].

A recent study by Niccoli-Asabella et al. was able to evaluate the prognostic value of 18F-FDG PET-CT in terms of survival in patients with LARC who have undergone surgery after nCRT. This work showed a high percentage of patients with a TRG complete response (39.7%) with longer OS and DFS in the Responders group but without statistically significant differences [13].

The strength of our study relies on being one of the largest studies evaluating 18F-FDG PET-CT related to histopathological response (TRG score) at two time-points: before nCRT (early PET-CT) and after finishing CRT (late PET/CT). We found the optimal cut-off to distinguish the responders (TRG3-TRG4) from the no-responders (TRG0-TRG2), at 70% of the Δ%SUV. Furthermore, our analysis showed that Δ%SUV was a stronger discriminator between the two groups with a high accuracy of 81% (34/42), with a sensitivity of 84.4%, a specificity of 80%, a positive predictive value of 81.4%, and a negative predictive value of 84.2%. We were able to find a correlation between Δ%SUV and OS and DFS, also showing a statistical significance (p < 0.05).

Another similar study by Leccisotti et al. found similar results They learned that the optimal cut-off to distinguish the no-responders from the responders on the early PET-CT scan was a reduction in tumour SUVmax of 61.2% (85.4% sensitivity, 65.2% specificity) [18].

An important issue remains unclear; that is, the PET-CT study method (timing, technique, and qualitative analysis of images). Most studies report inaccurate results due to heterogeneous methods for 18F-FDG PET-CT quantification, the correct time to perform the study and the metabolic criteria [19, 20]. It is important to standardize the criteria for the correct use of 18F-FDG PET-CT in order to achieve a correct interpretation of the results. In the current literature, there is no standardized data that indicates the proper timing by which to perform the 18F-FDG PET-CT. It is well known that chemotherapy produces an inflammatory reaction that lasts for one week after the beginning of treatment, while radiotherapy inflammatory reaction may last up to six months. Therefore, it is important choose or indicate the right time to perform 18F-FDG PET-CT after nCRT in order to standardize the correct interval as being a potential source of false-positive findings on the late 18F-FDG PET-CT. The World Health Organization recommends 18F-FDG PET-CT 7 weeks after nCRT and surgery one week later [28]. This is mainly based after the trial performed by R O Perez et al., showing a proper restaging with 18F-FDG PET-CT at 6 and 12 weeks after the completion of therapy [15]. The present study was concomitant with this recommendation, as all patients underwent FDG PET-CT 6–7 weeks after the end of nCRT, and surgery was performed 8 weeks from the end of neoadjuvant treatment.

The main limitations are the relatively small number of patients in our cohort due to a single centre enrolment. However, the unicentric nature of our study also represents a guarantee for more homogenous data.

We believe that more technically advanced tools are important to accurately measure tumour change. Nowadays the use of quantitative analysis of PET/MRI to assess pCR following nRCT in LARC could improve outcome predictions and may welcome the era of adaptive therapy for cancer patients [29, 30].

In conclusion, the results of this analysis are promising and show that 18F-FDG PET-CT may be an indicator to evaluate the pR to nCRT in patients with LARC. The decrease in 18F-FDG PET-CT uptake in the primary tumour may offer primary information for the early identification of patients who more likely to obtain a pCR to nCRT and to predict patients who are unlikely to significantly regress. Rigorous follow up and larger future prospective studies are necessary to confirm these results.