Introduction

Colorectal cancer (CRC) is the third most common malignant tumor and the second leading cause of death worldwide [1]. To date, the risk factors affecting the prognosis of CRC patients have not yet been fully elucidated. Recurrence and metastasis are dismissed as one of the major prognostic factors for CRC patients [2]. After extended radical surgery, patients presenting with recurrence or metastases have 5-year survival rates of less than 33% [3,4,5]. However, there were limited prediction models were developed to foresee the outcomes of recurrence and metastases. Therefore, it is essential to develop a simple and effective predictive model to provide more research evidence for the prognosis of CRC.

Accumulating studies have shown that the host immune system may have a dual role in tumor promotion and suppression [6,7,8,9,10]. When the body undergoes the immune response and inflammation, the values and ratios of various indicators change in the blood with different degrees, which may be potentially helpful in outcome prediction [11, 12]. Previous studies have revealed that several inflammation-related indicators were independent prognostic factors in the blood for CRC patients, including the neutrophil-to-lymphocyte ratio (NLR), the platelet-to-lymphocyte ratio (PLR), the lymphocyte-to-monocyte ratio (LMR), and the albumin-to-globulin ratio (AGR) [13,14,15,16]. However, most of these studies only involve one or several inflammation-related factors and did not systematically explore the combined effects of inflammatory biomarkers in the prognosis of CRC. Moreover, there have been relatively few studies that focus on the outcomes of recurrence and metastasis.

Therefore, the purpose of this study is to comprehensively explore the roles of various blood indicators in the disease-free survival (DFS) of CRC patients and to develop a novel inflammation-related comprehensive biomarker using the LASSO algorithm, with a goal to provide research evidence for individual prediction and decision-making.

Material and methods

Study population

From August 2017 to December 2019, a total of 646 CRC patients who underwent surgical resection were consecutively enrolled at Fujian Medical University Union Hospital (Fujian, China). Patients were eligible if they met the following criteria: (1) newly pathologically diagnosed with primary CRC, (2) no receipt of any treatment that affected blood indicators, and (3) without neoadjuvant radiotherapy or radiochemotherapy prior to surgery. The exclusion criteria were as follows: (1) patients with a history of blood disease or infection and autoimmune disease and (2) patients who suffered from other cancers or emergency surgery before being included in the study. All patients provided written informed consent and fully understood the study. The research protocol has been approved by the institutional review board of Fujian Medical University (No.2020KY007) and followed the ethical standards of the Helsinki Declaration.

Data collection

The patient’s demographic and clinical characteristics were abstracted from the hospital medical record system. The relevant preoperative laboratory parameters for blood indicators were measured on the day of admission, including the tests of blood routine (including neutrophil, lymphocyte, monocyte, high-density lipoprotein counts), blood biochemical (including albumin, globulin counts), and tumor markers including alpha-fetoprotein (AFP), carcinoembryonic antigen (CEA), and carbohydrate antigen 19-9 (CA19-9). Then, eight blood indicators were calculated accordingly: neutrophil-to-lymphocyte ratio (NLR), lymphocyte-to-monocyte ratio (LMR), neutrophil×monocyte-to-lymphocyte ratio (SIRI, Systemic Inflammatory Response Index), albumin-to-globulin ratio (AGR), platelet-to-lymphocyte ratio (PLR), serum albumin + 5×total lymphocyte (OPNI/PNI, Onodera’s Prognostic Nutritional Index), monocyte-to-high-density lipoprotein ratio (MHR), and monocyte-to-lymphocyte ratio (MLR). In addition, we also compared the prognosis of colorectal cancer with a newly developed nomogram in previous literatures [17], including tumor size + NLR + PLR + PNI.

Follow-up

All patients were followed up by trained study interviewers through telephone inter at 3-month intervals for the first 2 years postoperatively and every 6 months for the next years. DFS was defined as the time from initial surgery to recurrence, metastasis, death (for any reason), or the last follow-up (January 9, 2021). The overall loss of follow-up rate was 4.3%. During follow-up, 28 cases were dead, 82 patients occurred recurrence and metastasis (19 subjects were identified before death), and 551 subjects were still alive at the last follow-up.

Statistical analysis

The cutoff points for NLR, LMR, SIRI, AGR, PLR, OPNI, MHR, and MLR were determined by the X-tile program (Yale University, Newhaven, Connecticut). Reverse Kaplan-Meier method was used to calculate median follow-up time. A heat map was used to reveal the degree of correlation among the eight blood indicators. Then, the LASSO algorithm was utilized to develop a comprehensive blood indicator. Hazard ratios (HRs) and 95% confidence intervals (CIs) were estimated based on the univariate and multivariate Cox regression model for exploring independent prognostic factors of CRC. Three prediction models were established using a generalized linear model (GLM), receiver operating characteristic (ROC) methods, and the predictive ability that were compared through the area under the curve (AUC). Decision curve analysis (DCA) was plotted to further assess the clinical usefulness of the models. All statistical analysis was performed using R software (version 4.0.5). P-value less than 0.05 was statistical significance.

Results

Patients’ characteristics

Of the 646 CRC cases included in the final analysis (Table 1), 403 (62.3%) were male patients, and 481 (74.46%) had stage II–III diseases. The majority of patients were diagnosed with adenocarcinoma (87.46%) and moderate tumor differentiation (85.91%). Approximately 10.53–21.12% of patients tended to have elevated or lowered inflammation-related indicators (Table S1). During a median follow-up time of 23 months, a total of 95 patients had outcome events with a 1-year survival rate of 89.47%.

Establishment of a comprehensive blood index

The heat map provided the correlation between eight blood indicators, reflecting in the area of the circle and the shade of the color (Fig. 1A). Several strong correlations were observed between these indicators, among which SIRI and MLR displayed the strongest correlation strength. Given the correlation between the variables and the limitation of the traditional Cox model, the LASSO Cox regression model was used to evaluate the eight selected indicators (Fig. 1B and Fig. S1). The formula for the comprehensive blood index was obtained as follows: NSAP = (0.01567×NLR) + (0.01733×SIRI) + (0.23396×AGR) + (0.00299×PLR). According to this formula, 646 patients were then divided into two groups using the optimal cut-point determined by the X-Tile software: the high-NSAP group (>1.05) and the low-NSAP group (≤1.05).

Fig. 1
figure 1

A Correlation heat map of inflammation indicators. B Partial likelihood deviance of the LASSO analysis

Table 1 Baseline clinicopathological characteristics of patients with colorectal cancer

The prognostic value of NSAP compared with their higher counterparts, patients with a lower NSAP tended to have a significantly better DFS (P < 0.001, Fig. 2). The univariate and multivariate Cox models were both showed that the high-NSAP was associated with an increased risk of DFS: The HRs were 2.56 (95% CI: 1.64-4.00, P<0.001), and 1.89 (95% CI: 1.12-3.13, P=0.015), respectively. Additionally, tumor stage (IV vs I stage, HR: 4.92, 95% CI 1.62–14.97), AFP (<20 vs ≥20, HR: 0.53, 95% CI 0.33–0.85), and CEA (<5 vs ≥5, HR: 0.53, 95% CI 0.33–0.88) were found to be independent prognostic factors for DFS in the multivariate analysis (Table 2 and Fig. S3).

Fig. 2
figure 2

Survival curve of inflammation comprehensive index

Table 2 Univariate and multivariate analysis for disease-free survival

Next, a prognostic nomogram based on the NSAP and other three factors selected by multivariate analysis was developed to quantitatively predict 1-year, 2-year, and 3-year DFS (Fig. 3). The predictive ability of the established nomogram (Model 3) was superior to that of Model 1 (TNM staging system), Model 2 (TNM staging system + AFP + CEA), and Model 4 (tumor size + NLR + PLR + OPNI). The AUC were 0.752 (95% CI 0.717–0.785), 0.711 (95% CI 0.674–0.746), 0.735 (95% CI 0.699–0.768), and 0.592 (95% CI 0.558–0.635), respectively (Fig. 4A). For internal validation, calibration curves for predicting 1-year and 2-year DFS indicated a good match between the predicted probabilities and the actual observations (Fig. S4). In addition, DCA revealed similar results that the established nomogram had more positive net benefit than previously established prognostic nomograms, traditional TNM staging system, or/and tumor biomarkers (Fig. 4B).

Fig. 3
figure 3

Nomogram composed of tumor stage, AFP, CEA, and NSAP

Fig. 4
figure 4

A Comparison of AUC between four models. B Comparison of DCA between four models

Discussion

To the best of our knowledge, the present study is one of few studies to establish a comprehensive blood index (NSAP) based on a combination of four inflammation-related parameters using the LASSO method. Our study demonstrated that NSAP was an independent prognostic factor for CRC, and the high-NSAP was associated with poor DFS. The nomogram based on NSAP and the TNM staging system and tumor markers displayed a high predictive performance and clinical decision value.

A large number of studies have shown that systemic inflammation has become an essential part of tumorigenesis, proliferation, survival, and migration and is likely to become a new direction for cancer treatment and monitoring [18,19,20]. To date, several hematological inflammation markers (NLR, OPNI, SIRI, PLR, LMR, AGR) were reported to have potential prognostic values in multiple tumor types [21,22,23,24,25,26]. In this study, we also found AFP and CEA were valuable independent factors for CRC prognosis, which is supported by results from several previous studies [27, 28]. Of note, except for the above six hands, we also added two additional inflammation indexes (MHR and MLR), which have not yet fully proven the predictive value in the prognosis of CRC. In addition, common prognostic indicators such as lymph node metastasis did not show statistical significance in the multivariate model, which was consistent with a previous study [29]. This may be due to the inclusion of the TNM stage in the multivariate model, resulting in the effect of lymph node metastasis on prognosis may be covered. Additional studies are required to confirm our findings.

Although a single inflammatory index was proved to have predictive ability, the comprehensiveness and accuracy are not satisfactory. In the current study, varying degrees of correlation among the eight indicators were also observed. Therefore, we employed the LASSO method to solve these problems instead of the traditional Cox model. The most prominent advantage of the LASSO algorithm is that the relatively unimportant coefficients of independent variables become 0 and are excluded from modeling through penalized regression on all variable coefficients [30]. It is particularly suitable for linear models with a reduced number of parameters, selection of parameters, and estimation of sparse parameters. Relevant studies have verified that the LASSO model is beneficial to improving the accuracy of the prediction model [30, 31]. In this study, using the LASSO method, we developed a novel comprehensive inflammation index (NSAP), which may be a useful and valuable index for predicting the survival prognosis of patients after CRC operation. In addition, compared with the model established by quoting the article, the model established by us has better predictive performance.

The exact underlying mechanism of the close correlation between the high NSAP and the poor DFS is not yet clear. We found that three of the four indicators included in NSAP are related to lymphocyte count, except for AGR. Previous studies showed that the local control of metastatic invasion by the immune system might be critical to survival. The presence of lymphocytes in the tumor may be a favorable prognostic sign [32]. For example, memory T cells in colorectal cancer can change tumor matrix or tumor cells in the adaptive immune response to reduce the metastatic potential of tumor cells. It may be that the transport characteristics, density, and long-term anti-tumor ability of T cells play a central role in controlling tumor recurrence [33]. Similarly, in solid tumors, tumor-infiltrating lymphocytes show oligoclonal expansion, recognition of tumor antigens, and tumor-specific cytolytic activity in vitro, which are conducive to improving clinical results, including delayed recurrence or delayed death [34, 35]. Additionally , the globulin in AGR, the main protein produced by immune organs, could reflect the body’s inflammation and immune status [36]. Therefore, these indicators have the potential to predict the survival of patients with CRC after surgery.

Nevertheless, the current research has some limitations that should be considered. First of all, although we have collected complete follow-up data of 646 patients, the findings of this study were still limited by the lack of external verification and a relatively small sample size. Second, as a single-center study, it will inevitably bring about potential selection bias. Future multi-center studies with larger sample sizes are warranted to confirm our findings further. Third, the 3-year follow-up time has affected the tracking of more outcome events and limited the observation of the survival rate of CRC for a longer time. It is necessary to continue the follow-up visit according to the established follow-up plan in the future.

Conclusion

This study developed a novel comprehensive inflammatory biomarker (NSAP) and suggested that NSAP is strongly associated with the DFS in CRC patients. Compared with the previously established model, the traditional TNM staging system, and tumor markers, NASP can effectively improve the predictive ability of the prognostic model. More prospective multi-center data sets are still needed to confirm our findings and the practicality of clinical decision-making in the future.