Introduction

The treatments of colorectal cancer (CRC) patients after curative surgery are based on clinical staging. CRC patients with high risk stages (i.e., stages II and III) are recommended for adjuvant therapy1,2,3,4,5. However, the heterogeneous characteristics of CRC results in different prognoses among CRC patients within the same clinical stage. As a result, both the American Joint Committee on Cancer/Union for International Cancer Control staging system (Tumour-Node-Metastasis [TNM] staging system) and the Japanese Society for Cancer of the Colon and Rectum (JSCCR) staging system have developed several editions to improve the accuracy of predicting prognosis6,7,8. These staging systems are composed of tumour depth, nodal status, and metastatic status, although there are differences among the versions even in the same staging system. Therefore, further subtyping in each clinical stage is indispensable for better treatment strategy.

Recently, classifications of CRC using genetic background have been proposed as alternative or additional tools for staging. Microsatellite instability (MSI) status indicated good prognosis in patients with stage II/III right-sided colon cancer (CC)9. Consensus molecular subtypes consisting of four groups (MSI immune, canonical, metabolic, and mesenchymal) have been presented as powerful tools for CRC biology and treatment10,11,12,13. These classifications based on genetic analysis have been expected to replace TNM staging in the future, although benefit for treatment decision has not been validated.

However, high risk patients for recurrence have been selected using clinicopathological features. Many articles have demonstrated that elevations in tumour markers (TMs, e.g., carcinoembryonic antigen [CEA] or cancer antigen 19-9 [CA19-9]) have been associated with poor prognosis in CRC14,15,16,17,18. Lymphovascular invasion (LVI) was also indicated to be a prognostic factor for CRC19,20,21,22. However, these factors have not been included in the TNM staging systems and have not been assessed in combination. We considered that improvement of classification by combination of these features was necessary before applying novel classifications that require more labour and cost.

Thus, we statistically selected the most suitable subtyping combined the influence of TM elevation and LVI on relapse-free survival (RFS) using the pooled data collected by the Japanese Study Group for Postoperative Follow-up of CRC (JFUP-CRC), which is one of the largest data collections in Japan23,24. We evaluated this classification (so called TM-LVI) as a prognostic factor for RFS and disease specific survival (DSS) in each TNM staging. We also assessed the association between application of adjuvant therapy and TM-LVI status.

Results

Clinicopathological characteristics of the patients

Clinicopathological characteristics were compared between CC and rectal cancer (RC) patients. There were significant differences between CC and RC patients in age (P < 0.0001), sex (P < 0.0001), histological type (P < 0.0001), the ratio of CEA elevation (P = 0.008), the degree of LVI (P < 0.0001), dissected lymph node number (12 ≤ or not), TNM stage (P < 0.0001), and the application of adjuvant therapy (P < 0.0001) but not in the ratio of CA19-9 elevation (Supplementary Table 1).

Adjuvant therapy consisted of chemotherapy except for two cases of radiotherapy and seven cases of chemoradiotherapy in RC. Most patients (94.6%) received 5-fluorouracil based chemotherapy. A total of 1.6% and 1.1% of the patients received oxaliplatin-based and irinotecan-based chemotherapy, respectively (Supplementary Table 1).

Thus, further analysis was performed by CC and RC.

Selection of the most suitable subtyping

Among six candidate subtypes (Table 1), ABC1 was the most statistically suitable subtype with the lowest Akaike’s information criterion (AIC, Supplementary Table 2) and the highest Harrell’s concordance index (c-index, Supplementary Table 3) according to the models including TNM staging. Then, we called ABC1 as TM-LVI. Both TNM and TM-LVI were significant in the Cox model for RFS in CC and RC patients. The interaction term between TNM and TM-LVI was significant in RC but not CC. Thus, in ranking the incidence rate of RFS, the main effect model with TNM and TM-LVI was applied to CC, and the interaction model was employed for RC. When RFS was ranked from 1st to 21st by TNM staging and TM-LVI, RFS was not ordered by TNM staging. Stage IIIa was a low recurrence risk group compared to most of stage II (Supplementary Table 4). Category C by TM-LVI belonged to the highest recurrence risk group in each TNM stage.

Validation of TM-LVI for RFS and DSS

Log-rank test demonstrated that RFS was significantly different by TM-LVI status in CC (stage IIa, IIIb, and IIIc) and RC (stage I, IIa, IIb, IIIa, IIIb) (Fig. 1). In particular, the 5-year RFS differed more than 20% by TM-LVI status between A and C (83.7% and 62.1% in stage IIIb of CC, 76.6% and 47.9% in stage IIb of RC, 91.6% and 63.3% in stage IIIa of RC, and 76.5% and 56.4% in stage IIIb of RC). Log-rank test also demonstrated that DSS was significantly different by TM-LVI in CC (stage IIa, IIb, IIIb, and IIIc) and RC (except for IIc) (Fig. 2). The 5-year DSS differed more than 20% by TM-LVI status between A and C (82.9% and 54.3% in stage IIIc of CC and 91.9% and 69.3% in stage IIb of RC).

Figure 1
figure 1

Relapse-free survival (RFS) of colon cancer (af) and rectal cancer patients (g-l) by TM-LVI status is shown for each TNM stage. The 5-year RFS rate is described on the right of the TM-LVI status. Bold type, P < 0.05; Red circle, difference in the RFS rate among TM-LVI statuses > 20%.

Figure 2
figure 2

Disease free survival (DSS) of colon cancer (af) and rectal cancer (gl) patients by TM-LVI status is shown for each TNM stage. The 5-year DSS rate is described on the right of the TM-LVI status. Bold type, P < 0.05; Red circle, difference of DSS rate among TM-LVI statuses > 20%.

We assessed the factors associated with RFS and DSS by univariate and multivariate analysis. TM-LVI was an independent prognostic factor for RFS of CC (stage IIa, IIIb, and IIIc, Tables 1,2) and RC (stage I, IIa, IIb, IIIa, and IIIb, Table 3) and also for DSS of CC (stage IIa, IIb, IIIb, and IIIc, Table 4) and RC (all stage except for IIc, Table 5).

Table 1 Candidate subtypes were determined depending on the tumour marker elevation status and lymphovascular invasion status.
Table 2 Univariate and multivariate analysis for relapse-free survival of colon cancer was performed in each clinical stage.
Table 3 Univariate and multivariate analysis for relapse-free survival of rectal cancer was performed in each clinical stage.
Table 4 Univariate and multivariate analysis for disease specific survival of colon cancer was performed in each clinical stage.
Table 5 Univariate and multivariate analysis for disease specific survival of rectal cancer was performed in each clinical stage.

Association between the adjuvant therapy and TM-LVI status

The application of adjuvant therapy significantly differed by TM-LVI in stage I, IIa, IIIa, and IIIc CC and stage IIIa RC (Table 6). However, the application of adjuvant therapy was not irrelevant with the recurrence risk evaluated by TM-LVI except for stage IIa CC. The application of adjuvant therapy did not differ by TM-LVI status (stage IIIb CC and in stage I, IIa, IIb, and IIIb RC) or adversely decreased in spite of the increased recurrence (stage IIIc CC and stage IIIa RC), although TM-LVI status was an independent prognostic factor for both RFS and DSS in these stages. These results suggested that TM-LVI, which represents tumour marker elevation and lymphovascular invasion, was not used for determining the use of adjuvant treatment.

Table 6 Association between the adjuvant therapy and TM-LVI status was assessed in each clinical stage.

Discussion

Due to the heterogeneity of the disease, further classification beyond TNM-based clinical staging has been considered indispensable for determining the treatment strategy of CRC. Despite continued effort, novel modalities are still under development10,11,12,13. We combined TM elevation and LVI for subtyping of the TNM staging system because of their potential as prognostic factors and ready-to-use availability. Among candidate classifications, we selected the most statistically suitable classification and named TM-LVI.

Our data demonstrated that TM-LVI was useful for subtyping and prognosis for not only RFS but also DSS, although we picked up TM-LVI depending on RFS. This may be consistent with the fact that TM elevation and LVI have been considered prognostic factors for RFS and overall survival, respectively16,17,18,21,22.

Our data indicated that adjuvant treatment was not considered in accordance with the recurrence risk determined by TM-LVI status. Thus, TM-LVI may be useful for considering adjuvant therapy after curative surgery when TM-LVI is an independent prognostic factor for both RFS and DSS (stage IIa, IIIb, and IIIc CC and stage I, IIa, IIb, IIIa, and IIIb RC).

We evaluated LVI by scoring both lymphatic invasion and venous invasion, although LVI is usually discussed as positive or negative. This may be because pathological assessments differ among pathologists regarding LVI status. In our massive dataset, the influence of pathologists may be reduced compared to data from single institute.

Our study has several limitations. First, the pathological results of LVI were not discussed among the pathologists to standardize the evaluation of LVI. Second, in this retrospective study, the treatment of the patients may vary depending on the clinicians and the hospitals. Third, genetic information was not collected. MSI status, which is associated with the prognosis of CRC patients, is not routinely assessed in most Japanese hospitals25.

In conclusion, we present a cost and labour efficient subtyping method (TM-LVI) for CRC patients using clinicopathological features routinely assessed in the clinic all over the world. The usefulness of TM-LVI should be validated in the future by randomized clinical trials regarding adjuvant treatment after curative surgery for patients with poor prognosis as estimated by TM-LVI.

Methods

Patients and data collection

The JFUP-CRC contains data from twenty-three institutes in Japan (Sapporo Medical University Hospital, Hirosaki University Hospital, Niigata University Hospital, Niigata Cancer Center Hospital, National Defence Medical College Hospital, Tochigi Cancer Center Hospital, Tokyo University Hospital, Tokyo Metropolitan Cancer and Infectious Diseases Center Komagome Hospital, National Cancer Center Hospital, Tokyo Women’s Medical University Hospital, National Center for Global Health and Medicine Hospital, Tokyo Medical and Dental University Hospital, Keio University Hospital, Teikyo University Hospital, Kyorin University Hospital, Kitasato University Hospital, Fujita Health University Hospital, Aichi Cancer Center Hospital, Kyoto University Hospital, Osaka International Cancer Institute Hospital, Osaka Rosai Hospital, Hyogo College of Medicine Hospital, and Kurume University Hospital). Each hospital retrospectively collected the clinical data of patients with CRC who underwent curative surgery. This study was approved by the institutional review board or ethics committee at all 23 hospitals above and was conducted in accordance with the Declaration of Helsinki and Ethical Guidelines for Clinical Research. The patients provided written informed consent, and patients had the option to opt-out if there was any disagreement with this study. The JFUP-CRC office pooled and organized the data for this study. Among the patients whose data are contained in the database, we assessed 11771 patients, consisting of 7151 CC patients and 4620 RC patients, who received curative surgery between 2004 and 2008. We classified these patients by the 8th edition TNM staging system6,7. A higher level of CEA or CA19-9 than the upper limit in each hospital was determined to indicate TM elevation. TM elevation was classified into three categories: both CEA and CA19-9 elevation, either CEA or CA19-9 elevation, or no elevation. Lymphatic or venous invasion was evaluated as 0 (no invasion), 1 (minimal invasion), 2 (moderate invasion), or 3 (severe invasion) by pathologists in each hospital according to the classification by the JSCCR8. We summed the evaluation in both lymphatic invasion and venous invasion as LVI, which was categorized as none (0), slight (1-2), mild (3-4), or severe (5-6).

Selection of the most suitable subtyping

To select the most suitable subtyping using both TM elevation and LVI, we assessed six candidate classifications (ABC1-ABC5, AB), which simplified 12 categories determined by TM elevation (both, either, or none) and LVI (none, slight, mild, or severe) into three (A, B, and C; ABC1-ABC5) or two (A, B; AB) subtypes (Table 1). Then, Akaike’s information criterion (AIC) and Harrell’s concordance index (c-index) were derived from the Cox proportional hazard model with TNM and each subtype to explore the most suitable (lower AIC and/or higher c-index) subtyping for RFS. If the interaction term between TNM and the candidate classification was significant, the term was included in the Cox model for ranking the incidence rates of RFS within each subtype. We did not exclude the patients who received adjuvant therapy, because we explored the subtyping available in all patients who received curative surgery.

Validation of TM-LVI for RFS and DSS

Univariate analysis was performed using Cox proportional hazard model, along with age (75 ≤ or not), sex, histological type (differentiated type or not), number of dissected lymph nodes (12 ≤ or not), and adjuvant therapy. Multivariate analysis was also performed using Cox proportional hazard model with the factors that showed significant differences (p < 0.05) in the univariate analysis. When TM-LVI was the only significant factor in the univariate analysis, multivariate analysis was performed using Cox proportional hazard model using the factors with p < 0.2. The influence of TM-LVI on RFS and DSS in each TNM stage was also assessed by Kaplan-Meier curve and evaluated by the Log-rank test.

Data analysis

The comparisons of the clinicopathological characteristics between CC and RC patients were assessed by the chi-squared test or t-test. The influence of the clinicopathological features on RFS and DSS was evaluated by a Cox proportional hazard model. RFS and DSS was calculated by the Kaplan-Meier method and compared by the log-rank test. Differences in the application of adjuvant therapy by combined subtyping were evaluated by the chi-squared test. Multivariate analysis for RFS and DSS was performed using a Cox proportional hazard model. A P value of <0.05 was considered significant for all analyses. All statistical analyses were performed using SAS software (SAS Institute, Cary, NC, USA).