Introduction

For patients with locally advanced rectal cancer (LARC), neoadjuvant chemoradiotherapy (NCRT) followed by surgical resection has become the standard therapy [1]. NCRT can help to downstage tumors, increase the rate of curative resection, and decrease the rate of local recurrence [2]. According to data from the Surveillance, Epidemiology, and End Results (SEER) database, about 35.7% of patients with rectal cancer in stage II/III received NCRT [3]. The reason could be that the survival benefit of NCRT has so far been controversial [4, 5], and radiotherapy can make surgical dissection more difficult and increase the risk of postoperative complications, such as anastomotic leakage, wound infection, and pelvic abscess [6]. Patients with stage II/III rectal cancer who were treated with NCRT followed by surgical resection form a distinct cohort with specific prognosis characteristics.

Nomograms have been routinely utilized to predict cancer outcomes In recent years [7]. A nomogram integrating multiple prognostic factors can be used to estimate the probability of a particular result. Useful in individualized medicine, it has been applied to a wide range of malignancies [8, 9], For patients with rectal cancer, a number of nomograms have been developed to date. However, nomograms for stage II/III rectal cancer patients treated with NCRT followed by surgical resection have rarely been reported. The aim of this study was to utilize the SEER database to develop a nomogram to predict these patients' prognosis.

The log odds of positive nodes (LODDS) was defined as the log of the ratio between the number of positive lymph nodes (PLN) and negative lymph nodes (NLN) [10]. In recent years, it has been proposed and confirmed as a reliable indicator of prognosis in a variety of cancers, such as pancreatic cancer [11], colorectal cancer [12], and gastric cancer [13], and has proven to be a superior predictive factor in patients with colorectal cancer when compared to the American Joint Committee on Cancer (AJCC) N stage [14, 15]. As a result, instead of AJCC N stage, I incorporated LODDS into this nomogram and expected this nomogram can assist in evaluating the prognosis of stage II/III rectal cancer patients treated with NCRT followed by surgical resection.

Methods

Patient selection

This population-based study was conducted using the SEER database. The SEER database collects statistical, oncological, diagnostic, treatment, and survival information from specific geographical regions representing roughly 28% of the US population. Database was obtained from the SEER*Stat software (version 8.3.6). Patients who were diagnosed after 2015 were not included in this study to ensure enough time for follow-up. In this study, only patients with stage II/III rectal cancer diagnosed between 2010 and 2015 were included. The following were the criteria for inclusion: (1) rectal cancer as the only primary tumor; (2) histologically confirmed as rectal adenocarcinoma; (3) stage II/III rectal cancer patients treated with NCRT followed by surgical resection; and (4) patients with complete survival information, demographic data, and clinicopathologic features. The data selection procedure is shown in Fig. 1. Finally, a total of 3829 patients were identified. At a 7:3 ratio, study patients were randomly assigned to the training cohort and validation cohort.

Fig. 1
figure 1

Flow diagram of the study selection process. NCRT neoadjuvant chemoradiotherapy, CEA carcinoembryonic antigen, AJCC American Joint Committee on Cancer

Statistical analysis

SPSS 26.0 was used for statistical analysis. The proportions between the training cohort and validation cohort were compared using Pearson’s Chi-square test. The best tumor size cutoff was calculated using X-tile software [16], and tumor size was grouped as < 4.6, 4.6–7.0, and ≥ 7.0 cm. LODDS was calculated using the following formula: LODDS = log [(0.5 + PLN)/(0.5 + NLN)] [17]. A P value of less than 0.05 was considered statistically significant. The univariate and multivariate Cox regression analyses were performed to determine the independent prognostic factors. The hazard ratio (HR, 95% confidence interval (CI)) was used to express the risk factors. A nomogram integrating independent prognostic factors was developed to predict the 24-, 36-, and 60-month overall survival of patients and conducted with R software (version 4.1.0). Receiver operating characteristic (ROC) curves were constructed and the area under curve values (AUCs) were determined to validate the discrimination of nomograms. To assess the consistency between actual and nomogram-predicted survival rates, calibration curves were plotted. Furthermore, X-tile software were used to stratify patients into three risk groups (high-, middle-, and low-risk) according to the total points obtained from the nomogram. And Kaplan–Meier curves were plotted to compare OS of the three groups.

Results

Clinicopathologic and demographic characteristics

3829 rectal cancer patients were included and randomly assigned in a 7:3 ratio to a training cohort (n = 2678) and validation cohort (n = 1151) (see Additional file 1 and Additional file 2). Table 1 showed the patients’ clinicopathologic and demographic characteristics and no statistically significant differences were found between the two cohorts. Overall, patients of white ethnicity (80.9%) made up the majority of the entire population, and men (62.9%) accounted for more than half of the patients. Tumor grade II was the most common (80.6%). The majority of patients were classified as T3 by the AJCC (7th edition) tumor-node-metastasis (TNM) system (83.3%). LODDS between − 1.5 and − 0.4 were found in roughly 62.5 percent of patients. More than half (56.9%) of the patients received postoperative chemotherapy and only a small percentage of patients (3.4%) underwent postoperative radiotherapy.

Table 1 Demographic and clinicopathologic characteristics of the included patients

Prognostic factors associated with OS

To determine the prognostic factors, the univariate and multivariate analyses were conducted included the patients from the training cohort. As shown in Table 2 in the univariate analysis, Race, sex, age, marital status, T stage, tumor grade, tumor size, LODDS, CEA level, and postoperative chemotherapy were statistically associated with OS (P < 0.05). The effects of postoperative radiation were not statistically significant (P > 0.05). Finally, race, sex, age, marital status, T stage, tumor grade, tumor size, LODDS, CEA level, and postoperative chemotherapy were all revealed to be independent predictive variables in the multivariate analysis. Advanced age, male sex, unmarried status, higher tumor grade, elevated CEA levels, and tumor size greater than 70 mm was related to a higher risk of mortality. T4b and LODDS greater than − 0.4 were obviously detrimental to patient survival. Furthermore, as measured by a hazard ratio of 0.81 (95% CI: 0.693–0.947), patients who underwent postoperative chemotherapy showed a reduced mortality rate.

Table 2 Univariate and multivariate Cox analyses in II/III rectal cancer patients treated with NCRT followed by surgical resection

Development and validation of prognostic nomogram

A prognostic nomogram integrating independent prognositc factors for OS was developed (Fig. 2). The predicted probability of 24-, 36-, and 60-month OS could be calculated by summing the points on the nomogram for each patient's matching factors. Figure 2 displayed the survival probability of a given patients was calculated using the nomogram.

Fig. 2
figure 2

Nomogram for predicting the OS at 24, 36 and 60 months. According to Nomogram’s estimate the cumulative risk of OS in patients no.30 at 24, 36 and 60 months is 0.0389, 0.07 and 0.131, respectively. *P < 0.05; **P < 0.01; ***P < 0.001. OS overall survival, CEA carcinoembryonic antigen, LODDS log odds of positive lymph nodes

In the validation of the nomogram, the AUCs of the nomogram for the 24-, 36-, and 60-month OS in the training cohort were 0.736, 0.720, and 0.688, respectively; and 0.691, 0.696, and 0.694 in the validation cohort, respectively. The AUCs demonstrated the satisfactory discriminative power of the model (Fig. 3). In both the validation and training cohorts, the calibration curves showed a high degree of consistency between actual and nomogram-predicted survival rates (Fig. 4).

Fig. 3
figure 3

ROC curves and AUC values of the nomogram for the predicted 24-, 36-, and 60-month OS rates for the training cohort (AC) and the validation cohort (DF). AUC area under the curve, ROC receiver operating characteristic, OS overall survival

Fig. 4
figure 4

The calibration curves predicting 24-, 36-, 60-month OS in the training cohort (AC) and the validation cohort (DF). The dashed line represents a perfect match between the nomogram predicted probability (x-axis) and the actual probability calculated. OS overall survival

Stratification of risk groups

All patients were stratified into three risk groups based on the total points obtained from the nomogram using X-tile software: high-risk (total points ≥ 368), middle-risk (339 ≤ total points < 368), and low-risk (total points < 339). As shown in Fig. 5, the Kaplan–Meier curves indicated that the three risk groups had significant differences in overall survival (P < 0.001). The 5-year OS in the training cohort for the high-, middle-, and low-risk were 49.0% (95% CI: 46.2–51.8%), 73.0% (95% CI: 71.6–74.4%), 86.4% (95% CI: 85.3–87.5%), respectively; and 48.5% (95% CI: 44.3–52.7%), 73.2% (95% CI: 71.0–75.4%), and 85.4% (95% CI: 83.6–87.2%) in the validation cohort, respectively. The mortality rates of patients in the high-risk group were significantly higher than those in the low-risk group.

Fig. 5
figure 5

Kaplan–Meier overall survival curves of patients at three risk groups in the training cohort (A) and validation cohort (B) according to the nomogram

Discussion

In this study, the data from 3829 patients in the SEER database were used to develop a prognostic nomogram. Currently, few prognostic nomograms for stage II/III rectal cancer patients treated with NCRT followed by surgical resection have been developed, especially based on a large population. The study by Zhifang et al. covering 785 cases [18], had certain limitations as it was a single-center study and their results lacked externally validation. The analysis conducted by Valentini et al. using data from five European randomized clinical trials was the only nomogram for those patients based on a large population [19], but it did not include other variables that may affect prognosis, such as race, marital status, tumor size, CEA level, and tumor grade. In addition to age, sex, T stage and postoperative chemotherapy, this investigation also identified race, marital status, tumor size, CEA level, tumor grade, and LODDS as independent prognostic factors through Cox regression analysis. All these risk factors were incorporated into the nomogram to improve its preciseness. Besides, to the best of my knowledge, this is the first nomogram that incorporates LODDS to predict the OS of stage II/III rectal cancer patients treated with NCRT followed by surgical resection. Through multivariable Cox regression analysis, black patients may have worse overall survival rates. The findings were consistent with earlier research showing that compared to white patients, black patients are diagnosed at a later stage, receive less benefit from adjuvant chemotherapy, and are more likely to suffer 5-FU toxicity than Whites, probably due to differences in dihydropyridine dehydrogenase status [20, 21]. This study found that patients who were married had better outcomes. According to previous studies, spouses may encourage patients to check their health state, and married cancer patients were diagnosed at an earlier stage than unmarried cancer patients [22]. Furthermore, married patients had better adherence to treatment than unmarried patients, such as receiving NCRT [23]. My results confirmed, for the first time, that tumor size greater than 70 mm was an independent prognostic factor for stage II/III rectal cancer patients treated with NCRT followed by surgical resection. However, recent research shows that tumor volume is superior to tumor size as a prognostic classifier in rectal cancer [24]. In addition, Yeo et al. discovered that the rate of tumor volume reduction in LARC patients receiving NCRT was a significant prognostic factor [25]. However, due to the lack of tumor volume data in the SEER database, this study cannot further confirm these observations.

As we all know, lymph node status plays a significant role in rectal cancer patients’ prognosis. In recent years, in addition to AJCC N stage, many scholars have proposed and analyzed other parameters to validate the status of lymph nodes, such as LODDS and the lymph node ratio (LNR). Although AJCC N stage is currently the most widely used method for lymph node classification, many researchers have pointed out its limitation since it only considers the number of positive lymph nodes and ignores the total number of lymph nodes (TLN) retrieved during surgery. In rectal cancer, TLN has been proven to be an independent prognosis-related factor [26,27,28]. For accurate staging of colorectal cancer, removal of at least twelve lymph nodes was recommended by AJCC. LNR involves both TLN and PLN and has been claimed to be more accurate than AJCC N stage for rectal cancer [29, 30]. However, other scholars have noticed that when TLN are either all non-metastatic or all metastatic, patients with the same LNR values might be extremely varied. LODDS, as a novel lymph node classification, can overcome the drawbacks of LNR and increase the accuracy for prognostic validation, and studies have shown its superiority to LNR in rectal cancer [17, 31, 32]. For LARC, LODDS was confirmed to perform better than both AJCC N stage and LNR in terms of prognostic predictive value [14, 15, 33]. Therefore, the nomogram will be more convincing and effective if LODDS is incorporated into it. The nomogram clearly demonstrated that LODDS was the most influential factor in a patient’s poor prognosis.

In this study, 56.9% of patients who received postoperative chemotherapy, seemed to have a better outcome. On the other hand, several large randomized controlled trials concluded that postoperative chemotherapy could not improve the OS of rectal patients receiving surgery after NCRT [34,35,36]. These trials, however, had significant limitations, such as poor adherence in the European Organization for Research and Treatment of Cancer trial 22921. Therefore, it remains uncertain if these patients will benefit from postoperative chemotherapy. According to current National Comprehensive Cancer Network (NCCN) recommendations, rectal patients with tumors staged as T3 or N1 and higher should consider postoperative chemotherapy after neoadjuvant chemotherapy [37]. However, experts convened by the European conference on rectal cancer concluded that there is currently insufficient evidence to demonstrate the benefits of receiving adjuvant chemotherapy after preoperative chemoradiotherapy; hence, postoperative chemotherapy cannot be recommended [38]. In this way, more comprehensive and conclusive researches will be conducted in the future.

In this study, a nomogram was developed to predict the prognosis of II/III rectal cancer patients with treated with NCRT followed by surgical resection. The probability of 24-, 36-, and 60-month OS can be simply calculated by summing the points on the nomogram for each patient's matching factors. The AUCs of the nomogram for the 24-, 36-, and 60-month OS In the training cohort were 0.736, 0.720, and 0.688, respectively. It revealed the discriminative capability of nomogram to predict OS of patients. Through calibration curves, the nomogram showed excellent consistency for predicting 24-, 36-, and 60-year OS. In additional, by employing risk group stratification, patients who are at high risk can be easily tracked down and should be taught to pay closer attention to their health status and have regular follow-up consultations with their doctors. To summarize, this nomogram provides a reliable and discriminative prognostic evaluation for patients with stage II/III rectal cancer treated with NCRT followed by surgical resection.

There are some limitations in this study, first, studies have demonstrated that tumor response to NCRT is one of the most important prognostic factors for LARC patients [39, 40] and that Mandard good responders have a significantly higher OS rate than Mandard bad responders [41]. In addition, Ann et al. concluded that all tumor-response gradings were related to survival and downstaging had an independent effect on OS [42]. However, the lack of a SEER database on tumor response to NCRT, response assessment, and waiting time between radiotherapy and surgery was the study's major limitation. Second, as this was a retrospective analysis based on the SEER database, there was bound to be some degree of selection bias. Third, the SEER database did not offer certain information that could have affected patient outcomes, such as surgical margins, lymph/vascular or perineural invasion, chemotherapy/radiation regimen, exact type of surgical resection, indications to receive postoperative chemotherapy, and complications of NCRT. A more advanced prognostic model is required to assist in evaluating the prognosis of stage II/III rectal cancer patients treated with NCRT followed by surgical resection.

Conclusion

A large population-based nomogram incorporating LODDS was developed to assist in evaluating the prognosis of stage II/III rectal cancer patients treated with NCRT followed by surgical resection. The nomogram showed a satisfactorily discriminative and stable ability to predict OS for those patients. A more advanced prognostic model is required to assist in evaluating the prognosis of those patients.