Introduction

As the population ages, the proportion of octogenarians (patients over 80 years old) is rapidly increasing [1,2,3]. Colorectal cancer (CRC) is the third most frequent cancer diagnosed among men and women in the United States, posing an age-specific risk [4]. According to the latest statistics, the incidence rate escalates rapidly with age, with 121.4 per 100,000 population in individuals aged 60 to 64 years compared with 237.9 per 100,000 aged 80 to 84 [5]. Thus, octogenarians are becoming an essential cohort of CRC patients.

Octogenarians have been historically recognized as a population at higher risk for undergoing major surgical procedures due to the severity of their comorbidities and decreased functionality. This perspective has been modified recently, and age is no longer considered a contraindication to abdominal surgery [6,7,8]. However, among researchers, controversy persists regarding the prognosis of patients in this cohort. Several studies have shown poorer survival after surgery in older CRC patients than in younger patients [9,10,11,12], although some studies have concluded that the survival of older patients is quite good [6, 13, 14]. Thus, a pragmatic and customized approach is required to evaluate the prognostic factors and prognosis of octogenarians treated with CRC resection.

Nomograms are currently the most precise method for predicting the prognosis of cancer patients [15]. Although Zhang et al. [16]. constructed postoperative nomograms to predict cancer-specific survival (CSS) after CRC resection, Wang et al. [17]. also developed a predictive model of cause-specific death (CSD) in elderly patients (age ≥ 65 years) using a competing-risk approach. However, a cut-off of 65 years is not a fair representation of the older population based on demographic ageing [18, 19]. None of these previous studies have targeted the emerging clinical population of octogenarians [7]. Meanwhile, the present method of estimating the prognosis of patients in this particular population is based on the American Joint Committee on Cancer (AJCC) TNM stage. At the same time, the TNM stage does not include independent prognostic factors such as tumour deposits (TDs) [20], perineural invasion (PNI) [20], or carcinoembryonic antigen (CEA) level [21], which tends to result in reduced accuracy.

Therefore, we compiled information on octogenarian patients after radical resection for CRC from the National Cancer Institute’s Surveillance, Epidemiology, and End Results (SEER) database and from the Colorectal Surgery Department of the First Affiliated Hospital of Wenzhou Medical University. We developed a nomogram to analyse and investigate the prognostic factors associated with CSS in this particular population and predicted their CSS to guide individualized treatment planning.

Method

Patient population and selection

The SEER program represents the principal source of a national database and cancer statistics in America and is presently maintained by the National Cancer Institute (NCI) [22]. The patient data of newly diagnosed CRC patients aged 80 years or older from 2010 to 2015 were extracted from the SEER database using the SEER*Stat program (v 8.4.0.1) [22]. Demographic information, clinicopathological information, treatment information, and follow-up results were collected. Deaths from cancer were characterized by SEER CSD classification variables. The initial inclusion criteria were as follows: (1) age ≥ 80 years; (2) pathological diagnosis of CRC and histologic type of adenocarcinoma; (3) year of diagnosis from 2010 to 2015; and (4) complete survival dates. The exclusion criteria were as follows: (1) histologic type other than adenocarcinoma; (2) survival ≤ 1 month; and (3) incomplete clinical information (data for most patients older than 100 years are incomplete). A total of 19,344 patients were collected and screened for inclusion in the study (n = 4794) (Fig. 1a). Then, the included individuals were randomly divided into a training cohort (70%) and a validation cohort (30%). The training cohort was used to construct the nomogram, while the smaller validation cohort was used to validate the model.

Fig. 1
figure 1

a Flow chart for the study of octogenarians treated with radical resection for colorectal cancer according to the inclusion and exclusion criteria, with internal validation in an American cohort and external validation in a Chinese cohort. b Nomogram prediction of 1-, 2- and 3-year CSS in octogenarians in the training cohort after radical resection for CRC. The nomogram is applied using points scored according to each involved variable, as indicated on the point scale. The total point score represents the probability of CSS at various points in time, with a higher score signifying a higher potential for an undesirable outcome

External validation data were finally derived from 368 octogenarian patients who underwent radical resection for CRC in the Department of Colorectal and Anal Surgery of the First Affiliated Hospital of Wenzhou Medical University from January 2016 to January 2020. The time of the last follow-up was January 2023. The inclusion and exclusion criteria were as described above. The research was approved by the ethics committees of the hospital and conformed to the seventh version of the Declaration of Helsinki.

Variable collection

Several variables were included in the current study: baseline demographics (i.e., age at diagnosis, sex); tumour features (i.e., T stage, N stage, tumour size, histologic type, tumour location, TDs, PNI, number of metastatic lymph nodes (LNs), CEA level, number of solid organ (liver, lung, brain, bone)) metastases; treatment information (i.e., chemotherapy, radiation therapy); and survival variables (i.e., survival duration in months, vital status). We restaged all octogenarians according to the eighth edition of the AJCC pathological staging system [23]. The tumour size was stratified by a 5 cm cut-off. In our study, CSS was determined as the duration from diagnosis to death or last contact and was used as the primary outcome.

Statistical method and nomogram construction

Categorical variables in the analysis are described as percentages, and continuous variables are described as the median and interquartile range (IQR). Comparisons of categorical covariates were performed via the chi-square test or Kruskal–Wallis test, and comparisons of continuous covariates were performed by the Mann‒Whitney U test. Then, univariate Cox regression was applied to filter the variables that were significantly associated with CSS in the training group. Predictors with p values less than 0.05 were entered into a multivariate Cox regression model. Variables were further eliminated using backwards stepwise selection based on the Bayesian information criterion (BIC) [24]. The obtained multivariate Cox regression models were used to evaluate the hazard ratios (HRs) and construct the ultimate prognostic nomogram model for predicting CSS at 1, 2, and 3 years in octogenarians after radical resection for CRC.

Nomogram validation

The functionality of the nomogram was determined by authentication and calibration utilizing both the internal and external validation cohorts. The concordance index (C-index) was adopted to evaluate the accuracy and discriminatory ability of the nomogram. The method “Bootstrap” described by Harrel is employed to ascertain the statistically significant value of the C-index between different models [25]. The calibration curve and area under the receiver operating characteristic (ROC) curve (AUC) were used to assess the calibration. Meanwhile, the Delong’s test is employed to compare the AUC values of different models. Nomogram validation was performed utilizing an internal validation set and an independent external validation set satisfying the previously indicated inclusion criteria.

Clinical utility

Decision curve analysis (DCA) is an approach to estimate the utility of a model by determining the net benefit under different thresholds [26, 27]. We assessed the clinical applicability of the predictive nomogram model by DCA and compared it to the AJCC TNM staging system. Meanwhile, the patients were divided into high-risk and low-risk groups based on the ROC cut-off values, and the variation in the survival of patients in each risk group was detected using the log-rank test and Kaplan‒Meier (K‒M) curve. All statistical analyses were conducted using SPSS 22.0 (SPSS, Chicago, IL, USA) and R software 4.2.1 (http://www.r-project.org). A two-sided test was statistically validated at P < 0.05.

Results

Patient characteristics

A total of 5,162 patients were recruited for this research; among them, 3349 were in the training cohort, 1445 were in the validation cohort, and 368 were in the external validation cohort (Fig. 1a). The population characteristics and clinicopathological features of the training and validation cohorts (internal and external) are shown in Table 1. There were no statistically significant differences in baseline data between the training and internal validation data sets (P > 0.05).

Table 1 Clinicopathological characteristics of octogenarians after curative resection

Nomogram construction

The univariate analysis showed that age, sex, tumour size, number of solid organ metastases, TDs, PNI, number of metastatic LNs, chemotherapy, and CEA level were statistically significantly related to CSS (Table 2). Following stepwise selection to eliminate further potential redundancies, the resulting nomogram model included age, T stage, N stage, number of solid organ metastases (liver, lung, brain, bone), TDs, PNI, number of metastatic LNs, chemotherapy and CEA level (Table 2). The risk scores for the individual items were calculated by summing the scores using the nomogram (Fig. 1b).

Table 2 Cox regression of univariate and multivariate analyses associated with CSS in octogenarians after curative resection

Nomogram both internal and external validation

This research examined the model’s reliability through cross-validation with internal cohorts and independent validation with external cohorts of Chinese patients. First, we validated the accuracy of the nomogram with the C-index. In the training, validation and external validation cohorts, the C-index values of the nomogram (training cohort, 0.758 (95% CI: 0.742–0.774); internal validation cohort, 0.762 (95% CI: 0.738–0.787)) were higher than those based on the AJCC stage (training cohort, 0.713 (95% CI: 0.697–0.729, p = 0.002); internal validation cohort, 0.701 (95% CI: 0.677–0.725, p = 0.003)), demonstrating the good discriminatory ability of the model.

This model was simultaneously validated in an external validation group. The C-index values of the model (external validation cohort, 0.727 (95% CI: 0.677–0.777)) were higher than AJCC stage (external validation cohort, 0.644 (95% CI: 0.586–0.702, p = 0.005)), which further validated the model’s reliability. The calibration curves for the training and validation cohorts revealed that the forecasts of the predictive model corresponded to the actual observed values (Fig. 2a-i).

Fig. 2
figure 2

Calibration of the nomogram in the training and validation cohorts and the AUC in the training and validation cohorts. a The 1-, 2-, and 3-year CSS in the training cohort. b The 1-, 2-, and 3-year CSS in the internal validation cohort. c The 1-, 2-, and 3-year CSS in the Chinese external cohort. e-g The AUC in the training, internal and external cohorts

The AUC in these three cohorts also indicated comparable efficiency of the model in distinguishing the effect. In the training cohort, the AUC of 1-, 2- and 3-year were 0.81 (95% CI: 0.78–0.83)), 0.8 (95% CI: 0.78–0.82)) and 0.8 (95% CI: 0.78–0.82)), respectively (Fig. 2e). In the internal validation cohort, the AUC of 1-, 2- and 3-year were respectively 0.78 (95% CI: 0.74–0.83)), 0.8 (95% CI: 0.77–0.83)) and 0.81 (95% CI: 0.78–0.84)) (Fig. 2f), while in the external validation cohort, the AUC of 1-,2- and 3-year were 0.77 (95% CI: 0.69–0.86)), 0.82 (95% CI: 0.76–0.88)) and 0.75 (95% CI: 0.67–0.82). Compared to the model of AJCC TNM, the nomogram also exhibits the greater predictive ability (Fig.S1 a-c).

Clinical application of the nomogram

DCA illustrated higher clinical value for the nomogram than the TNM staging system in the training, validation and external validation cohorts (Fig. 3a-c). The patients were then separated into a high-risk group (total score ≥ 159.5) and a low-risk group (total score < 159.5) based on the cut-off value of the ROC curve. The 1-year, 2-year, and 3-year survival rate in the high-risk group was 65.8%, 45.1%, and 34.3%, respectively. In the low-risk group, the 1-year, 2-year, and 3-year survival rate was 94.9%, 89.5%, and 85.2%, respectively. Based on K–M curves and log-rank tests, the survival rates in the high-and low-risk groups in these three cohorts were remarkably distinct, with significantly lower survival rates in high-risk patients than in low-risk patients (Fig. 3d-f).

Fig. 3
figure 3

DCA and KM results in the training and validation cohorts. The X-axis represents the probability of death, while the Y-axis indicates the net benefit of the therapy for patients. The dark green colour represents the net benefit with the risk threshold when all patients survive. The horizontal light green line indicates the net benefit of therapy when all patients die. The red line represents the nomogram, and the yellow line represents the TNM stage. The clinical value of the nomogram is better than that of the TNM stage when the risk threshold is between 0 and 100%. a-c DCA results in the training, internal, and external validation cohorts. d-f The KM results of training cohort (d), internal validation cohort (e) and external validation cohort(f)

Discussion

The necessity of exploring the characteristics of older populations, such as octogenarians, is increasing with increasing human life expectancy [18, 19]. However, research in this particular population is considerably restricted for many reasons, especially research on CRC. First, the high incidence of frailty and preoperative comorbidities or postoperative complications in octogenarians predisposes them to make fewer visits. Thus, controversy persists regarding the prognosis and improvement in the quality of life of octogenarians after surgery because of the limited data for this population [7, 28, 29]. Thus, the demographic information of elderly patients with CRC and their corresponding pathological features have commonly been investigated with a cut-off of 65 years of age [16, 17, 30]. Studies on octogenarians have generally focused on the impact of postoperative complications on their survival [12, 28]. However, recent studies have indicated that perioperative complications have no impact on the short-term survival of octogenarians [9, 13]. This leads to the exclusive characteristics of this population being easily neglected.

In this study, we developed a prognostic nomogram model using the American SEER database and validated its accuracy by internal cross-validation. In addition, our study considered the distinctive features of the Chinese and American patient populations. We externally validated the model with a Chinese single-centre database (N = 368) to exclude time and geographical influences, demonstrating the model’s precision. Compared by these two cohorts, we found that China has a higher rate of solid organ metastasis than the United States due to inadequate CRC surveillance, which resulted in an imbalance among the three groups at baseline [31]. In addition, chemotherapy and radiotherapy were not widely used in these three groups. One reason for this may be that the benefits of chemotherapy and radiotherapy have not been demonstrated in older adults over 70 years of age; another reason may be that older adults have poorer physical function and often poorly adapt to chemotherapy and radiotherapy [32].

In this nomogram, age, T stage, N stage, PNI, chemotherapy, TDs, CEA level, number of metastatic LNs, and number of solid organ metastases were chosen after univariate Cox regression screening and backwards stepwise selection. From the results, we constructed a model in the training set with potential benefits in predicting CSS in octogenarians after radical resection for CRC based on demographic information and clinicopathological traits.

Age at diagnosis was recognized as an independent prognostic factor in our nomogram. Although the exact mechanism has not been elucidated, similar conclusions have been proposed by numerous studies [13, 17, 33]. Similarly, the patterns of LNM in CRC are continually clarified, and the recognition of LNM is associated with decreased survival in CRC patients [4, 5, 34]. Considering the complicated physical characteristics of octogenarians, we performed modelling using continuous variables in pursuit of the most accurate predictions. It is traditionally perceived that chemotherapy in octogenarians represents a tremendous challenge to physical function and may lead to a worse outcome. However, our study found a survival advantage in octogenarians receiving chemotherapy, which mirrors the conclusions of some studies [35,36,37]. In addition, we innovatively incorporated TDs and PNI into our model, both of which have been proven to be substantially associated with the prognosis of CRC patients [20]. There is increasing evidence that TDs are associated with LNM, extramural vascular invasion (EMVI) and PNI, which are markers of a poorer prognosis, with potential additive effects of the combination of these features [38,39,40]. Restricted by the patient information in the SEER database, we predicted the prognosis of CRC patients using all three.

The AJCC TNM staging system presents many limitations regarding octogenarians. According to the TNM staging system, octogenarians with disease of similar stages may present with similar prognoses, which does not meet the practical requirements. Compared to the TNM staging system, our nomogram provided a more precise analysis of this sensitive group, with a C-index of 0.758, good calibration, and distinctive DCA results. Thus, the nomogram established in this study could be used to guide clinicians in individual therapeutic plan formulation and patient follow-up. For example, in the high-risk group (total score ≥ 159.5), necessary follow-ups are performed at certain times to confirm the postoperative status and enable treatment plan adjustment. In the low-risk group (total score < 159.5), a more aggressive therapeutic intervention might be chosen, depending on the patient’s wishes and the experimental conditions, to prolong the life expectancy and improve the quality of life of octogenarians. Additionally, our nomogram was less predictive of CSS in China. The primary reason might be that China faced a wave of Omicron infections at the end of 2022. Octogenarians infected with Omicron faced a greater challenge to their physical function, resulting in poorer CSS. Another reason might be that the tumour metastasis rate in octogenarians is higher in China than in the United States, resulting in a poorer prognosis. This may also reflect the inadequate system for early CRC screening in China [41, 42] and highlight the need to predict CRC in this population more accurately.

The prognostic nomogram developed in this study has significant clinical and practical implications for the management of colorectal cancer (CRC) in octogenarians. Octogenarians, distinguished by a shorter life expectancy and a higher prevalence of morbidity, frequently encounter challenges in reconciling therapeutic approaches with their desired quality of life. Meanwhile, the ongoing updating of theoretical framework and the relentless advancement of technology have made it challenging for geriatricians to balance the potential benefits and inherent dangers of aggressive treatments for octogenarians. The nomogram represents a valuable tool for personalized risk assessment, as it enables the identification of high-risk patients who may benefit from closer monitoring and tailored treatment strategies. Conversely, low-risk patients may be managed with a standard follow-up regimen, thus helping to set realistic expectations and make informed decisions about patient care. Concurrently, the nomogram facilitates geriatricians in conveying complex prognostic information to patients and their families in a straightforward and comprehensible pictorial format, thereby enhancing patient comprehension, satisfaction, and adherence to treatment plans. Furthermore, in complex clinical settings, the nomogram assists geriatricians in identifying patients who necessitate more intensive resources and those who can be managed with routine care, thereby optimizing the allocation of healthcare resources.

There are also some limitations to our study. First, the SEER database is a high-quality population-based cancer registry lacking data on body mass index, smoking, underlying diseases, comorbidities, and other factors that may influence CSS in octogenarians. Second, selection bias may also be inevitable due to the retrospective study design. For instance, partially censored data associated with a short follow-up time can also statistically affect the performance of the predictive model. Third, our nomogram was validated with external data from a single centre in China. Thus, further validation of the nomogram using a large amount of data from multiple centres is needed.

Conclusion

We constructed a nomogram for predicting the CSS of individual octogenarian patients with CRC who underwent radical resection. Internal validation in an American cohort and external validation in a Chinese cohort demonstrated the precision of the model and the better prognostic performance of the model than the conventional AJCC staging system in this particular population. Thus, this nomogram could be used to guide clinicians in clinical follow-up and therapeutic plan formulation.