Background

Pancreatic cancer has a high mortality rate, and the average 5-year survival rate in the United States is approximately 10% [1]. Because the early symptoms are not obvious, pancreatic cancer is difficult to diagnose and has poor treatment outcomes, which also increase the mortality rate [1, 2]. Currently, radical surgery and adjuvant therapy including radiotherapy and chemotherapy are the main treatment strategies for pancreatic cancer, and radical surgery is the only curative treatment [3, 4]. However, few patients are suitable for surgical treatment because of distant metastases or local invasion at the time of diagnosis [5, 6].

The location of the primary tumour can also affect prognosis and overall survival (OS), and pancreatic head adenocarcinoma (PHAC) is reported to have a significantly shorter OS relative to pancreatic bodytail [7]. Assessment of tumour prognosis primarily relies on the American Joint Committee on Cancer (AJCC) 8th edition staging system [8], which is based on the size and extent of the tumour, the number of metastatic lymph nodes, and the number of distant metastases. The AJCC staging system can guide clinical practice and prognostic analysis to some extent [9]. However, some important prognostic factors are not taken into account in the AJCC staging system, such as age at diagnosis and tumour grade, which reduces its predictive accuracy for some patients. Several prognostic models have been developed to predict the prognosis of pancreatic body/tail tumours [10, 11], although models of non-metastatic pancreatic head tumours still deserve to be explored further. Therefore, a nomogram is needed to predict survival of patients after resection of non-metastatic PHAC and assist with treatment decision-making.

In this study, we screened significant independent factors from the Surveillance, Epidemiology, and End Results (SEER) database to develop a nomogram for modifying the AJCC 8th staging system using Cox regression analysis. The nomogram was constructed for predicting survival in a training cohort and verified in a validation cohort. This nomogram would help assist clinicians to conduct personalized medicine.

Methods

Study population and selection criteria

Patient information was collected from the SEER database of the National Cancer Institute (http://seer.cancer.gov/). After consulting the CS Schema v0204+, we extracted data from 115,227 patients with pancreatic head tumours using SEER*Stat software (version 8.3.8). The inclusion criteria were as follows: (a) International Classification of Diseases for Oncology, Third Edition (ICD-O-3) histology codes 8140, 8255, 8480, 8481, 8500, 8503, 8521, 8523, and 8527; (b) ICD site code C25.0; (c) M0 stage; (d) survival time > 30 days; (e) age ≥ 20 years; (f) underwent surgery; and (g) complete information on race, sex, grade, AJCC staging, radiotherapy, chemotherapy, and the number and status of lymph nodes. Patient lacking any of this information were excluded from the study cohort. Because the original SEER cohort used the 7th edition AJCC staging, we transformed the 7th edition staging to the 8th edition staging for subsequent analysis based on the information about the size and extent of the tumour and the number of positive lymph nodes. The included patients were divided randomly into a training cohort (n = 4495) and a validation cohort (n = 1924) at a ratio of 7:3 using the ‘caret’ package in R. The training cohort was used to select the prognostic factors and construct the model, and the model performance was verified in the validation cohort. A detailed data-screening process is shown in Fig. 1.

Fig. 1
figure 1

Flowchart showing patient selection and analysis

Statistical analysis

Continuous variables are reported as median (interquartile range) and were analysed by Student’s t test, and categorical variables are reported as counts and proportions and were analysed by chi-squared test for comparisons among groups. Identification of the independent prognostic factors and development of the nomogram model predicting OS was implemented using univariate and multivariate Cox regression analysis. The optimal cut-off values of the lymph node ratio (LNR), tumour size, and total risk score were calculated via X-tile (version 3.6.1). The LNR was defined as the ratio of the number of positive lymph nodes to the number of total examined lymph nodes. We converted continuous variables to categorical variables for further processing and divided patients into ‘low’, ‘medium’, and ‘high’ groups in the training cohort based on the optimal threshold of the total risk score. OS was the primary endpoint and was defined as the time from diagnosis to death or the last follow-up. Kaplan-Meier (KM) survival analysis was performed to depict the capability of individual variables to discriminate OS. With the aim of predicting the OS of patients, a nomogram was finally constructed. The concordance index (C-index) was calculated to evaluate the discrimination of the model, and calibration curves and time-dependent receiver operating characteristic curves within 60 months were created to estimate the predictive capability of the model. The net reclassification improvement (NRI) and integrated discrimination improvement (IDI) were calculated to evaluate the improvement of the model prediction compared to the 8th AJCC staging using the ‘nricens’ and ‘rms’ packages. Decision curve analysis (DCA) is widely applied to evaluate a nomogram with the advantage of integrating patients’ or decision-makers’ preferences into the analysis and is increasingly used in clinical studies. We performed DCA to analyse the clinical benefits of the nomogram compared with those of the 8th AJCC staging system. R (version 4.0.2) and SPSS (version 26.0) were used to complete all statistical analyses in our study, and the main utilized R packages were ‘nricens’, ‘Cschange’, ‘rms’, and ‘timeROC’. A two-sided p < 0.05 was considered statistically significant.

Results

Demographics and characteristics of the included patients

A total of 6419 patients who underwent resection for non-metastatic PHAC met the criteria and were included in the analysis. Based on the 7:3 ratio of distribution, 4495 and 1924 patients were sorted into the training and validation cohorts, respectively. The median OS in these two cohorts were 16 (8–28) months and 16 (9–27) months, respectively. In the training cohort, most of the patients were 60–80 years old (2984 [66.4%]), the male (2269 [50.5%]) to female (2226 [49.5%]) ratio was 1.02:1, and the predominant race was white (3712 [82.6%]). In terms of pathological diagnosis, most of the patients had T2 stage disease (2798 [62.2%]), N1 stage disease (1823 [40.6%]), and moderately differentiated tumours (2355 [52.4%]). Stage II was the most common clinical stage (1931 [43.0%]). Most patients had tumours > 2.5 cm (2948 [65.6%]). Further, most patients received chemotherapy (3362 [74.8%]) and did not receive radiotherapy (3054 [67.9%]). The median number of retrieved lymph nodes was 17 (12–24), and the median number of positive lymph nodes was 2 (0–4).

To better guide clinical practice, two continuous variables in the training cohort, the LNR and tumour size, were stratified via X-tile (Supplementary Fig. S1 and Table S1). The LNR ranged from 0 to 1, and the tumour size ranged from 0.1 cm to 46 cm. The LNR was grouped into LNR1 (LNR ≤0.20) and LNR2 (LNR > 0.20), and tumour size was classified into group 1 (diameter ≤ 2.5 cm) and group 2 (diameter > 2.5 cm). The baseline clinicopathological characteristics, demographic characteristics, and comparable p values are shown in Table 1.

Table 1 Demographics and characteristics of patients in Training and Validation cohorts

Identification and selection of prognostic factors

We performed Cox regression analysis to identify significant prognostic factors correlated with OS. As shown in Table 2, univariate regression analysis revealed that clinicopathological characteristics, including age, grade, T stage, N stage, the LNR, and tumour size, were prognostic risk factors (p < 0.001), whereas interventions, including radiotherapy and chemotherapy, were prognostic protective factors (p < 0.001). There was no difference in prognosis based on race or gender. The corresponding KM survival curves exhibited the capability of these variables to discriminate the OS in the training cohort (Supplementary Fig. S2).

Table 2 Univariate Cox analysis of prognostic factors in the training cohort

To improve the application of the model, we further reduced the model variables. T stage is based on tumour size and extension, and the LNR and N stage both describe the metastatic lymph node status. Therefore, we compared the C-indexes of T stage and tumour size and those of the LNR and N stage in the training cohort. The C-index of T stage was 0.556, which was superior to that of tumour size (0.545, p = 0.015), and the C-index of N stage was 0.578, which was superior to that of the LNR (0.569, p = 0.026). Therefore, we finally developed a better model composed of T stage, N stage, grade, radiotherapy, and chemotherapy. The detailed multivariate Cox regression results are shown in Table 3.

Table 3 Multivariate Cox analysis of prognostic factors in the training cohort

Development and validation of the nomogram predicting overall survival

Based on this multivariate model, a visualized nomogram was constructed to predict the 1-, 2-, 3-, and 5-year survival rates (Fig. 2). The C-index of the nomogram was 0.676 (95% CI 0.665–0.687), and it had better predictive capability than the AJCC 8th staging system (C-index: 0.581, 95% CI 0.570–0.591, p < 0.001) in the training cohort. Similarly, in the validation cohort, the C-index was 0.667 (95% CI 0.649–0.684), and it also had better predictive capability than the AJCC 8th staging system in this cohort (C-index: 0.566, 95% CI 0.548–0.584, p < 0.001). The NRI and IDI were also calculated to demonstrate the predictive accuracy of the nomogram. In the training cohort, the NRIs based on bootstrapping for OS at 1, 2, 3, and 5 years were 0.305 (95% CI 0.248–0.357), 0.227 (95% CI 0.172–0.284), 0.057 (95% CI 0.003–0.108), and 0.132 (95% CI 0.085–0.197), respectively. The IDIs for OS at 1, 2, 3, and 5 years were 0.050, 0.063, 0.056, and 0.045, respectively (p < 0.001) (Table 4). The results were also verified in the validation cohort (Table 4). Therefore, the nomogram had better predictive accuracy for OS at 1, 2, 3, and 5 years than the AJCC 8th staging system.

Fig. 2
figure 2

The nomogram predicting overall survival of patients with non-metastatic PHAC. PHAC: pancreatic head adenocarcinoma

Table 4 The NRIs, IDIs, and C-indexes of the nomogram and AJCC staging system in OS prediction

The time-dependent AUCs of the nomogram for predicting OS within 60 months in the training and validation cohorts are shown in Fig. 3A-B. The AUCs were significantly higher in both cohorts than those of the AJCC 8th staging system, indicating that the model had favourable discrimination. The calibration curves for survival probability showed that the nomogram prediction had good consistency with the actual observation in the training cohort at 12, 24, 36, and 60 months (Fig. 3C-F); the validation cohort showed similar results (Supplementary Fig. S3).

Fig. 3
figure 3

a-b Comparison of the time-dependent AUCs between the nomogram and the AJCC staging system in the training and validation cohorts. c-f Calibration curves showing the probability of 12-, 24-, 36-, and 60-month overall survival of the model prediction and the observed survival in the training cohort. AUC: area under the curve; AJCC: American Joint Committee on Cancer

Based on the optimal cut-off value of the total nomogram score obtained using X-tile, we divided all patients into three risk groups: ‘low risk’ (≤138), ‘medium risk’ (139–195), and ‘high risk’ (> 195). The detailed risk score is shown in Supplementary Table S2. The KM survival curves showed that the nomogram risk grouping had better discrimination for OS than the AJCC 8th staging system (Fig. 4, Supplementary Table S3).

Fig. 4
figure 4

a-d Kaplan-Meier curves of overall survival for patients based on the nomogram risk grouping and the AJCC staging system in the two cohorts. AJCC: American Joint Committee on Cancer

Clinical utility analysis of the nomogram

To explore the clinical utility of the nomogram, we performed DCA. The results showed that the nomogram had better net benefits than the AJCC 8th staging system in both the training and validation cohorts (Fig. 5A-B), suggesting that the clinical utility of the nomogram is better than that of the AJCC 8th staging system.

Fig. 5
figure 5

a-b Decision curve analysis showed clinical benefits of the model predicting overall survival in the training and validation cohorts

Discussion

Few patients with PHAC are able to undergo surgical resection, and even patients who do undergo resection have a low 5-year survival rate. Therefore, there is an urgent need for a nomogram to predict survival for these patients. Since patients with metastatic PHAC are less likely to undergo surgery, we are more interested in patients who undergo resection of non-metastatic PHAC and desired to develop a predictive nomogram for this cohort.

Here, we performed a population-based study involving data from 6419 patients who underwent resection of non-metastatic PHAC from the SEER database. After assessing multiple variables, including age, gender, and AJCC staging, we finally developed a nomogram including five independent prognostic factors (T stage, N stage, grade, radiotherapy, and chemotherapy) that predicted OS at 1, 2, 3, and 5 years after resection with high accuracy. Importantly, these prognostic factors are easy to obtain in clinical practice. The results of our analyses, including the AUC, NRI, and IDI, showed that our model had better accuracy for predicting survival than the AJCC 8th staging system, and DCA showed the model had higher clinical benefits than the AJCC staging system. The calibration curves also showed good discrimination and calibration capabilities.

Previous studies have proposed various significant prognostic factors for patients with pancreatic cancer, such as age, lymph node metastasis, and biochemical indicators [10, 12,13,14]. Similarly, our study also demonstrated that age, T stage, N stage, grade, the LNR, and tumour size were significant risk factors; radiotherapy and chemotherapy were strong protective factors. He et al. revealed that high-grade classification indicates poor prognosis in adenocarcinoma of the pancreatic body and tail [11]. In our study, the high-grade classification was a strong prognostic risk factor of PHAC, so we included tumour grade into this model. The LNR was recently implemented as a more accurate assessment of lymph node metastasis to modify N stage and has been used to predict prognosis and guide clinical practice for patients with pancreatic cancer [15,16,17,18]. In our study, the LNR was an independent prognostic factor in Cox regression analysis. The cut-off value of 0.2 was also appropriate and further applied to the controversial problem of threshold. We then further compared the predictive abilities of the LNR and N stage by comparing their C-indexes. Interestingly, we found no significant difference. However, N stage is a more accessible factor than the LNR. Therefore, the LNR may not be superior to N stage for predicting survival in patients with non-metastatic PHAC after surgery.

Radical surgery and chemoradiotherapy are currently the main treatment strategies for patients with pancreatic cancer [1]. A phase III clinical trial revealed that adjuvant chemoradiotherapy can significantly increase the R0 resection rate and prolong survival [19]. Recent studies have also reported that chemoradiotherapy is a key protective factor in advanced pancreatic cancer [20,21,22]. Administration of adjuvant chemoradiotherapy can significantly improve the survival of PHAC patients. In our study, we demonstrated that radiotherapy and chemotherapy were strong protective factors in this large study population. Therefore, our nomogram included these interventions. The nomogram had better predictive capability than the AJCC 8th staging system. Therefore, physicians can use postoperative indicators and these treatment measures to assess patient prognosis and guide further treatment.

Although our model had good performance for predicting survival, some limitations must also be noted. First, the data were collected from the SEER database, and this was a retrospective study with inherent defects. Further, some significant prognostic factors, such as the resection margin, the level of carbohydrate antigen 19–9, and some physical indices, were not available in the SEER database. Second, the cut-off value of partial variables in our study were calculated by X-tile, and the wide application of the threshold requires further confirmation. Further large prospective studies to confirm the effectiveness of our prognostic model are also needed.

Conclusions

We developed and validated a nomogram that has high accuracy in predicting OS in patients after resection of non-metastatic PHAC. All of the included prognostic factors are easy to obtain. This nomogram could promote the development of a novel staging system and assist clinicians in conducting personalized clinical treatment.