Background

Colorectal cancer (CRC) is estimated to be the third leading cancer type among new cancer cases and deaths in the United States [1]. In 2018, among the two sexes combined, an estimated 97,220 new cases of colon carcinoma (5.6% of all cancer cases) [2] and an estimated 50,630 (8.3%) deaths from that cause occurred [1]. The poor prognosis and frequent recurrence of colon carcinoma might be related to lymph node metastasis (LNM) and distant metastasis [3]. According to the 7th American Joint Committee on Cancer (AJCC) cancer staging system [4], advanced colon carcinoma (stage III or IV) is diagnosed when LNM or distant metastasis occurs, regardless of the pathologic T (pT) classification. Studies have indicated that 27.3% of patients diagnosed with colon carcinoma develop liver metastasis during the course of their disease, and the proportions of patients with synchronous and metachronous liver metastasis (LIM) were 14.5% and 12.8% [5], respectively. In addition, we found that some advanced colon carcinoma patients remained at pT1 or pT2 due to the migration and invasion capabilities of early colon carcinoma.

When colon carcinoma is detected in a localized stage, the 5-year relative survival is 91.1%. However, the 5-year relative survival of colon carcinoma patients with regional metastasis or distant metastasis were 71.7% and 13.3%, respectively [6]. Therefore, early detection of colon carcinoma metastasis is important for modifying therapeutic strategies and improving patient prognosis.

Most studies of colon cancer metastasis have used lymph nodes to predict the prognosis and recurrence of colon carcinoma [7,8,9,10,11]; research on LIM is much less common. Additionally, there have been few reports or methods to predict LNM and LIM of colon carcinoma. Because the clinicopathological risk factors of LNM and LIM in patients with early colon carcinoma are poorly understood, we attempted to predict the risk factors based on a statistical predictive model.

Nomograms are reliable graphical calculating models that are used to accurately calculate and predict individual risk events by combining all risk factors for tumor development [12, 13]. An increasing number of nomograms are being widely established to provide assistance in formulating individual treatment and follow-up management strategies in several cancers, such as oropharyngeal cancer [14], gastrointestinal stromal tumors [15], adenoid cystic carcinoma [16], bladder cancer [17], and prostate cancer [18]. To the best of our knowledge, no nomograms have been carried out to predict LNM and LIM using data gathered from patients with early colon carcinoma in the Surveillance, Epidemiology, and End Results (SEER) database. Here, we performed nomograms to predict LNM and LIM of early colon carcinoma by combining all relevant risk factors. In addition, decision curve analysis (DCA) and an assessment of clinical impact were conducted to illustrate the clinical utility of the model.

This study aims to evaluate patients with early colon carcinoma using nomograms, discover patients with high risk scores and help to modify therapeutic strategies in clinical application.

Materials and methods

Patients and study design

The records of patients who underwent surgery for pT1 or pT2 colon carcinoma from 2004 to 2015 were retrieved from the SEER 18 registry database using SEER*Stat 8.3.5 software. The flow chart used for data selection is shown in Fig. 1. “The International Classification of Diseases for Oncology (ICD-O-3) Hist/behav, malignant” was used to screen colon carcinoma cases. “Year of diagnosis” ranged from 2004 to 2015. “Derived AJCC Stage Group 7th (2010+)”, “RX Summ-Surg Prim Site (1998+)”, and “Grading and differentiation codes in ICD-O-2” were used in the present study. The codes in Collaborative Stage (CS) (2004+), including tumor size, extension, lymph nodes and metastases, were also collected. The inclusion criteria were as follows: diagnostic confirmation was achieved based on microscopic analysis, and patient background characteristics (age, gender, race and marital status), tumor-related factors [i.e., tumor size and invasion, tumor numbers, histological grade, carcinoembryonic antigen (CEA), LIM, lung metastasis] and survival information were known and available. The exclusion criteria were as follows: death certificate or autopsy only and age < 18 years old. A total of 32,819 cases in the SEER cohort were included and analyzed. We further randomly divided the patients in a 2-to-1 ratio, forming a training set (n = 21,880) for nomogram construction and a validation set (n = 10,939) for internal verification. The data obtained in this study were rooted mainly in the public SEER database, which is available as open-access data. The ethics committee board of Sun Yat-sen Memorial Hospital, Sun Yat-sen University, approved the use of patients with early colon carcinoma for this study.

Fig. 1
figure 1

Study flowchart

Construction and validation of nomograms

Univariable and multivariable analysis were used to identify independent risk factors predictive of LNM and LIM in early colon carcinoma in the SEER discovery set. All variables were screened using the forward stepwise selection method in a multivariate binary logistic regression model [19, 20]. The SEER internal testing set was used to evaluate the predictive reliability and accuracy of the nomograms developed to predict LNM and LIM. For internal validation of the nomogram, we applied a bootstrapping method with 1000 resamples. The predictive performance of the nomograms was measured by a receiver operating characteristic (ROC) curve. Calibration curves were plotted to validate the accuracy and reliability of the nomograms by the Hosmer–Lemeshow test [21].

Clinical utility

DCA was performed to determine the clinical application value of the nomogram models by calculating the net benefits at each risk threshold probability [22, 23]. The net benefit (NB) was determined by subtracting the proportion of all false-positive patients from the proportion of true positives and weighted by the relative harm caused by forgoing treatment compared with the negative consequences of unnecessary treatment, the NB to the population of using the risk model together with highrisk threshold R is: NB = TPR*P−*(1−R)*FPR*(1−P) (TPR: true-positive rate; FPR: false-positive rate; P: prevalence of the outcome; R: proportion of cases with risk above risk threshold) [24]. Additionally, on the basis of the DCAs, we plotted curves to evaluate the clinical impact of the nomogram to help us more intuitively understand its significant value. These curves display the number of high-risk patients, along with the number of high-risk patients with outcomes of metastasis, at different threshold probabilities in a given population [25].

Statistical analysis

All statistical analyses were performed using the software IBM SPSS Statistics (version 24, SPSS Inc., Chicago, IL, USA) and the programming language R (version 3.3.4, http://www.R-project.org) for Windows. Patient clinical characteristics are summarized as the mean (s.d.) for continuous measures. The Chi squared test and Student’s t-test were used to compare categorical variables and continuous variables. The ROC curve, nomogram, calibration plots, DCA and clinical impact curves were calculated in R 3.3.4 with relevant packages, such as the survival ROC, rms, calibrate and decision curve packages. The cutoff values of the risk scores from the predictive nomograms of LNM and LIM were determined based on the maximum Youden index of the ROC curve in the training set, and the patients were divided into low- and high-risk groups. All statistical tests were two-sided, and a P value < 0.05 was considered statistically significant.

Results

Clinical characteristics of patients

The demographic and clinical characteristics of colon carcinoma patients in both cohorts are summarized in Table 1, and there were no significant differences between the two sets (P>0.05, Table 1). LNM was present in 3111 of 21,880 patients (14.2%) and 30 of 10,939 patients (14.5%) in the training and testing sets, respectively. LIM occurred in 1.5% of patients in the training set and 1.2% of patients in the testing set. There was no statistically significant difference in LNM rate (P = 0.277) or LIM rate (P = 0.06) between the two sets. In the correlation analysis, five variables, namely, histological grade, T classification, tumor size, serum CEA level and overall survival, were significantly correlated (P < 0.001) with LNM (Table 2) and LIM (Table 3) in both the training and testing sets.

Table 1 Demographic and clinical characteristics of colon carcinoma patients
Table 2 Correlations between clinicopathological characteristics of patients and lymph node metastasis in the training and validation sets
Table 3 Correlations between clinicopathological characteristics of patients and liver metastasis in the training and validation sets

Independent significant factors in the training set

To further identify candidate predictors of LNM and LIM, we evaluated all clinicopathological features by binary logistic regression analysis. Risk factors for LNM and LIM were initially identified by univariate logistic regression analysis in the training set (Table 4). Marital status, histological grade, histological type, T classification, tumor size and CEA were associated with LNM. Additionally, there were eight clinicopathological variables related to LIM, namely, age, race, histological grade, histological type, T classification, tumor size, CEA and N classification. A multivariate regression analysis was performed on all factors to verify the risk factors of LNM and LIM (Table 5). Eight variables were actually associated with LNM: age (45–65: odds ratio (OR) 0.83, 95% CI 0.692 to 0.996, P = 0.045; ≥ 65: 0.525, 0.438 to 0.63, P < 0.001), marital status (Single: 0.898, 0.826 to 0.976, P = 0.012; Unknown: 0.806, 0.675 to 0.962, P = 0.017), race (White: 0.732, 0.637 to 0.842, P < 0.001), histological grade (Moderately differentiated: 1.644, 1.442 to 1.875, P < 0.001; Poorly differentiated: 3.641, 3.088 to 4.292, P < 0.001; Undifferentiated: 3.462, 2.609 to 4.593, P < 0.001), histological type (Carcinoid tumor: 1.752, 1.328 to 2.311, P < 0.001; Neuroendocrine carcinoma: 3.74, 2.613 to 5.534, P < 0.001), T classification (T2: 2.221, 2.03 to 2.431, P < 0.001), tumor size (≥ 5 cm: 1.125, 1.003 to 1.262, P = 0.045; Unknown: 0.84, 0.731 to 0.967, P = 0.015) and CEA (Positive: 1.385, 1.228 to 1.561, P < 0.001; Unknown: 0.74, 0.678 to 0.808, P < 0.001). Similarly, LIM was related to five variables: age (≥ 65: 0.532, 0.332 to 0.851, P = 0.008), histologic grade (Moderately differentiated: 1.501, 1.032 to 2.184, P = 0.034; Poorly differentiated: 1.670, 1.028 to 2.714, P = 0.038), tumor size (≥ 5 cm: 2.886, 2.203 to 3.783, P < 0.001; Unknown: 2.463, 1.8 to 3.37, P < 0.001), CEA (positive: 10.436, 7.595 to 14.335, P < 0.001) and N classification (N1: 3.909, 2.999 to 5.095, P < 0.001; N2: 12.131, 8.670 to 16.975, P < 0.001).

Table 4 Risk factors for lymph node metastasis and liver metastasis identified by univariate logistic regression analysis
Table 5 Risk factors for lymph node metastasis and liver metastasis identified by multivariate logistic regression analysis

Development of nomograms for LNM and LIM prediction

Based on the independent risk factors identified in the multivariate regression analysis, two nomograms were developed to predict the possibility of LNM (Fig. 2a) and LIM (Fig. 2b) in patients with early colon carcinoma. Furthermore, point assignments and predictive scores for each variable in the nomogram models were calculated in Table 6. According to the LNM nomogram, histological grade made the largest contribution, followed by T stage, age, marital status, serum CEA level and histological type. N classification made the largest contribution in the LIM nomogram, followed by histological grade, tumor size, serum CEA level and age. The calibration curves for predicting LNM and LIM in the training set (Fig. 2c, e) showed good agreement between predictions and observations.

Fig. 2
figure 2

Nomogram and calibration curves for predicting lymph node metastasis and liver metastasis in patients with early colon carcinoma. There are seven factors in the lymph node metastasis prediction nomogram (a) and five factors in the liver metastasis prediction nomogram (b). Calibration curves for predicting lymph node metastasis and liver metastasis in the training set (c, e) and in the testing set (d, f) are shown. All the points assigned on the top point scale for each factor are summed together to generate a total point score. The total point score is projected on the bottom scales to determine the probability of cancer metastasis in an individual. The nomogram-predicted frequency of metastasis is plotted on the x-axis, and the actual observed frequency of metastasis is plotted on the y-axis

Table 6 Point assignments and predictive scores for each variable in the nomogram models

Performance and validation of nomograms for LNM and LIM prediction

The calibration curves for predicting LNM and LIM demonstrated that the nomograms were generally well calibrated in the testing set (Fig. 2d, f). To compare the predictive values for LNM and LIM of the nomogram models and clinicopathological risk factors, we applied ROC analysis. In the ROC curves of LNM in the training set (Fig. 3a) and the testing set (Fig. 3b), the area-under-the-curve (AUC) values of the nomograms were 0.667 (95% CI 0.661–0.673) and 0.658 (95% CI 0.649–0.667), respectively; these values were significantly larger than the AUCs of grade, tumor size and histological type in both sets (P < 0.0001). Similarly, the AUCs of nomograms of LIM in the training set (Fig. 3c) and the testing set (Fig. 3d), with values of 0.766 (95% CI, 0.760–0.771) and 0.825 (95% CI, 0.818–0.832), respectively, were higher than those for histological grade, histological type, tumor size and N classification. Moreover, we generated bar charts to evaluate the discriminatory power of the nomograms in LNM and LIM after calculating the risk scores from the nomograms. Using the maximum Youden index in the training set, we obtained cutoff values of 79 and 33 for the LNM and LIM nomograms, respectively. All patients were divided into low- and high-risk groups. Patients with predicted high-risk LNM actually had a higher proportion of N1 and N2 classification than the low-risk group in the training set (Fig. 4a). The proportion of N1 and N2 classification in the testing set was near the proportions in the training set (Fig. 4b). Similarly, the high-risk group had a greater possibility of LIM than the low-risk group in both the training and testing sets (Fig. 4c, d).

Fig. 3
figure 3

Receiver operating characteristic (ROC) curve analysis for lymph node metastasis and liver metastasis. Comparisons of the predictive values of the nomogram models and clinicopathological risk factors for lymph node metastasis and liver metastasis according to ROC analysis. ROC curves of lymph node metastasis in the training set (a) and the testing set (b); ROC curves of liver metastasis in the training set (c) and the testing set (d). The AUC was calculated, and its 95% CI was estimated by bootstrapping. The P values were two-sided. Abbreviations: LN, lymph nodes; ROC, receiver operating characteristic; 95% CI, 95% confidence interval

Fig. 4
figure 4

Discriminatory power of the nomograms for lymph node metastasis and liver metastasis, illustrated with bar charts. Risk classification for the predictive nomograms was conducted by the maximum Youden index of the ROC curve, and their performance in distinguishing lymph node metastasis and liver metastasis in the training set (a, c) and the testing set (b, d) were plotted

Clinical utility

Kaplan–Meier survival curves of overall survival for patients according to LNM (Fig. 5a) and LIM (Fig. 5b) in the entire SEER cohort verified that patients who were predicted to have LNM or LIM had a significant disadvantage in overall survival (P < 0.0001). DCAs were performed on the nomograms for predicting LNM (Fig. 5c) and LIM (Fig. 5d) in the training set. Threshold probabilities of 0–0.3 for LNM or 0–0.2 for LIM were the most beneficial for predicting LNM and LIM with our nomograms. Based on these DCAs of LNM, we further plotted curves to evaluate the clinical impact of the nomograms to help us more intuitively understand their substantial value. Clinical impact curves of the LNM nomogram in the training set (Fig. 5e) and testing set (Fig. 5f) showed that the model had remarkable predictive power: the predicted number of high-risk patients was always greater than the number of high-risk patients with outcomes of metastasis when the risk threshold was in the range of 0–0.3, and the cost–benefit ratios would be acceptable in the same range.

Fig. 5
figure 5

Kaplan–Meier survival curves, decision curve analyses, and clinical impact curves of overall survival for patients. Kaplan–Meier survival curves representing the overall survival of patients with lymph node metastasis (a) and liver metastasis (b) in the entire SEER cohort. The decision curves of the nomograms for predicting lymph node metastasis (c) and liver metastasis (d) in the training set were plotted. Clinical impact curves of the nomogram to predict lymph node metastasis in the training set (e) and the testing set (f) are shown. The y-axis represents the net benefit. The x-axis shows the threshold probability. The horizontal solid black line represents the hypothesis that no patients experienced lymph node metastasis or liver metastasis, and the solid gray line represents the hypothesis that all patients met the endpoint (c, d). At different threshold probabilities within a given population, the number of high-risk patients and the number of high-risk patients with the outcome were plotted (e, f)

Discussion

Colon carcinoma ranks fourth in terms of incidence but fifth in terms of mortality worldwide in 2018. In 2018, among both genders combined, the incidence of colon carcinoma is approximately 1,096,601 new cases, and the mortality is approximately 551,269 [26]. Death from colon carcinoma typically occurs due to distant metastasis, while lymph node metastases are thought to occur before distant metastasis [3]. A study has reported that an increased number of lymph nodes evaluated is associated with increased survival. Therefore, lymph node evaluation is important for the prognosis and treatment of patients with colon cancer and may be a measure of quality care [9]. For distant metastasis, a population-based cancer registry in Burgundy reported that 27.3% of patients diagnosed with colon carcinoma develop LIM during the course of their disease, and the 5-year cumulative metachronous LIM rate was 14.5% in general, 3.7% for TNM stage I tumors, and 13.3% for stage II [5]. Metachronous LIM also contributed greatly to the poor prognosis and recurrence of colon carcinoma.

When metastasis occurs, surgical treatments such as en bloc resections of the affected segments of the bowel and the associated draining lymph nodes [27], as well as adjuvant therapies, should be applied [28]. Partial or total colectomy is performed in the majority of patients with stage I and II colon cancer (84%), while 67% and 40% of patients with stage III and stage IV, respectively, receive chemotherapy in addition to colectomy to lower their risk of recurrence [29]. Several studies have examined the number [9], distribution and size of affected lymph nodes [8] or the ratio of metastatic to examined lymph nodes [7] to evaluate colon cancer survival. Some researchers have focused on mRNA expression of genes related to lymph nodes, such as guanylyl cyclase C (GCC) [11] and metastasis associated in colon cancer 1 (MACC1) [10], to evaluate colon cancer prognosis. It is unknown whether LIM is derived from cancer cells that first colonize intestinal lymph nodes or whether such metastases can form without prior lymph node involvement in colorectal cancer. Enquist et al. found direct hematogenous spread as a dissemination route contributing to CRC liver metastasis in CRC mouse models [30]. Therefore, the correlations between LNM, LIM and tumor recurrence should not be ignored, and in order to modify therapeutic strategies and improve patient prognosis, it is essential to estimate the risks of LNM and LIM in early colon carcinoma. c-MET, a proto-oncogene that initiates a range of signals to regulate various cellular functions, has been suggested to be associated with CRC progression [31] . Hiroya Takeuchi and coworkers reported that c-MET copy numbers in primary CRC of N1/N2-stage patients were significantly higher than the copy numbers in N0 cases (P < 0.03) and that overexpression of c-MET mRNA in primary CRC may be a predictor of tumor invasion and lymph node metastases [32]. Zuo et al. found that serum soluble lectin, which was increased in colon cancer patients with LIM compared to those without metastases, might be a promising new target for intervention in metastasis formation [33]. However, fundamental studies are not a direct way to predict metastasis in daily clinical practice and would be costly even if they could be employed in the clinic. As a result, we focused on clinical studies based on clinicopathological risk factors.

Some researchers have estimated the risk of metastasis using clinicopathological variables and nomograms. A study of 160 patients with early colorectal cancer assessed CT and MRI data to establish imaging criteria for LNM and concluded that a short-diameter size criterion of ≥ 4.1 mm for metastatic lymph nodes showed sensitivity of 78.6% and specificity of 75% [34]. In addition, Yan-qi Huang et al. developed and validated a radiomics-based nomogram incorporating the radiomics signature, CT-imaged lymph node status, and clinical risk factors to facilitate the preoperative individualized prediction of LNM in patients with colorectal cancer [24]. Martin R. Weiser and colleagues developed a colon cancer recurrence nomogram to predict relapse based on the number of positive and negative lymph nodes, lymphovascular invasion and other risk factors [35]. Because nomograms are commonly used tools for prognosis in oncology and medicine [22] and straight scales are useful for relatively simple calculations, we decided to build a nomogram for LNM and LIM prediction in early colon carcinoma. The scarcity of studies examining liver metastasis in colon carcinoma supported our decision to develop a nomogram for predicting LIM in early colon carcinoma.

Two nomograms were constructed and validated for predicting LNM and LIM in patients with early colon carcinoma. The nomogram for LNM incorporates seven factors, namely, age, marital status, CEA, histological type, T classification, histological grade and tumor size, while the nomogram for LIM includes five factors: age, CEA, tumor size, histological grade and N classification.

Both of the nomograms demonstrated good agreement between predictions and observations in the training and testing sets. Furthermore, better diagnostic efficiencies were shown by ROC curves in comparison with histologic grade, histologic type, tumor size and N classification. In particular, the AUCs of the LIM nomograms were calculated with values of 0.766 (0.760–0.771) and 0.825 (0.818–0.832), respectively, in the training set and the testing set.

However, the nomograms might not be useful with greater AUCs and good agreement between predictions and observations [13]. Therefore, decision curve analyses were performed in the present study. DCA is a novel method for evaluating diagnostic tests, prediction models and molecular markers. This method can also be easily extended to many of the applications common to performance measures for prediction models [22]. Here, good clinical utility was indicated in the proper range. Moreover, the clinical impact of the LNM nomogram on the basis of DCA, Kaplan–Meier survival curves and bar charts with Chi squared tests was used to improve the discriminatory power of the nomograms. The nomograms for predicting LNM and LIM actually possess good prediction efficiencies as judged by the methods above.

In our study, a large number of cases in the SEER dataset were chosen and randomly divided into a training set and an internal testing set. Our purpose was to evaluate the prediction of LNM and LIM in early colon carcinoma from large quantities of patient data, which are convincing and readily available in clinical decision making. For clinical application, it is important to make the assessment of risk factors as convenient as possible. We considered the variables needed in our nomogram to be prevalent in clinical practice and convenient to acquire. The limitations of our study are the lack of external validation for the nomogram and the absence of genetic markers. Because the testing set in this study was derived from the same SEER dataset as the training study, potentially leading to overfitting of the model, external validation at our hospital or another institution should be performed. Multicenter validation with a large sample size is preferable because it yields high-level evidence for clinical application. In addition, our research did not incorporate genetic markers because clinical risk factors are easier to collect. However, a combination of clinical variables and genetic markers may improve the prediction of LNM and LIM in patients with early colon carcinoma.

Conclusions

In conclusion, based on the clinical risk factors identified in a large population-based cohort, we established the first practical nomograms that can objectively and accurately predict individualized risk of LNM and LIM. Moreover, the internal cohort validation results demonstrate that the two nomograms perform well and have high accuracy and reliability. Our nomograms were demonstrated to be clinically useful in DCAs, and they should therefore help clinicians to improve individual treatment, make clinical decisions and guide follow-up management strategies for patients with early colon carcinoma.