Introduction

Although node-negative (stage II) colon cancer is considered to have an overall good prognosis, the 5-year cancer-specific survival is reported at 81–83% in patients who did not have subsequent adjuvant chemotherapy (1), suggesting the presence of risk for recurrence in a subgroup of patients. Currently, stage II high-risk patients are identified through suggested “high-risk” features in consensus guidelines from the American Society for Clinical Oncology, the European Society for Medical Oncology and the National Comprehensive Cancer Network (NCCN) for which pT4 tumors, poor histology differentiation, presentation of perforation (or obstruction) and inadequate lymph node sample (e.g., <12 nodes) for evaluation are shared risk factors in the guidelines (24). Notably, the NCCN guidelines exclude patients with poor histology differentiation if they have microsatellite instability (MSI) from the risk group. Also, these guidelines add a “close or indeterminate resection margin” as a criterion for adjuvant chemotherapy. Evident from the guidelines is, on the one side, the lack of uniform criteria, and thus the risk of introducing treatment variation dependent on which guidelines doctors adhere to; on the other side is the emergence of genetic information (e.g., MSI status) to further refine risk groups for adjuvant chemotherapy. Still, considerable under- and overtreatment is expected through the lack of precise prognostic markers.

Although still being firmly debated (5), the lymph node status is still the strongest predictor and prognosticator in the tumor-node-metastasis (TNM) staging system (6). However, although node status has guided decisions on whether to give adjuvant chemotherapy, it fails (alone) to accurately predict disease recurrence in a considerate number of patients and has led to undertreatment in a subgroup of stage II patients with an unfavorable prognosis. Following this line of emerging evidence, the search for new and better prognosticators in stage II colon cancer is ongoing, with several proposed strategies and approaches to find better biomarkers for prognostication and prediction (79). For the clinician drowning in the sea of emerging biomarkers, it may be hard to swim when the debris to grasp for does not show whether you will sink or float in the search for more precise decision-making.

Notably, there is a link between the node status and the genetic makeup of a tumor (10,11) that has been linked to prognosis, but with causality yet to be established. In a recent study, in this journal (12), the role of MSI, KRAS, BRAF and PIK3CA in stage II colon cancers was investigated. In the study by Vogelaar et al. (12), MSI was found in 23% of patients with a trend toward reduced disease-free and overall survival for patients with MSI, which is in contrast to the prognostic role of MSI found by others in early-stage colon cancers (13). This contradictory finding obviously warrants explanation based on the background of existing literature from the past couple of decades. Vogelaar et al. also confirmed a trend toward poorer prognosis in patients with mutant KRAS or BRAF (12). The negative impact of these mutations has been noted by others in both early- and late-stage colorectal cancer (CRC) (11,1416). Strikingly, the high mutation rate of BRAF in the study by Vogelaar et al. (19%) should be noted, as this rate is usually reported as a single-digit percentage, typically at 5–8% (17). However, an unusually high rate of BRAF mutations (15%) has been found by others as well (18). Finally, the results pointed toward trends but lacked statistical significance and included the chance for underpowered samples in subgroups and type II errors in interpretation.

Why is it important to define subgroups of stage II patients? For one, the overall risk for recurrence is not huge but is present in about 20% of stage II patients (19). A small but significant risk reduction is noted by giving adjuvant chemotherapy to stage II patients, but risk for overtreatment is still high, and the currently proposed high-risk criteria are inaccurate (20). Much energy has been put into gene panels that would yield better prognostic and predictive tools for risk assessment (2123), but the results are thus far disappointing, with no panels having reached widespread incorporation in clinical practice. The same goes for most other proposed prognostic systems (9,19,2426).

Several explanations may be brought forward as to why available gene panels have not yet reached clinical utility (27). The clinical samples investigated may not have been representative, derived from cohorts of previous diagnostic and therapeutic eras or from randomized trials, where patients’ (supra)-selection entry criteria do not represent the general patient cohorts; or, investigated cohorts have been subject to case mix of both treatment-naive and adjuvant chemotherapy-treated patients. Further, assay development and technology may not have been robust and valid, thus producing variation in findings across studies and hampering validation. Also, beyond analytical and clinical validity is the current lack of demonstration of clinical utility for many proposed markers or panels (28). Indeed, one really has to demonstrate that the prognostic tools work in real life and make a difference for patients in terms of selecting the accurate therapy that works for the particular patient.

Developing new and pragmatic approaches to biomarker research and development to ensure validity and utility relies on standardization of procedures. This is easier said than done: standardizing a complex and long process such as biomarker validation could prove a hefty swim in the sea of variables that arise when working with biological material (29). If not properly accounted for, anything from patients’ inherent physiology and genetic composition to the actual disease state, the handling techniques of the laboratory or the chosen data analysis methods can potentially affect outcomes. As a starting point in standardization, appropriate cohort sub-grouping is of fundamental importance: if done inaccurately, it can dilute the power of studies at best, or at worst, completely invalidate findings.

To recognize emerging subtype consensus classifications and probe their clinical validity as well as prognostic and predictive ability, the TNM system is thus necessary. The immunoscore (30), mentioned by Vogelaar et al. in the discussion of their work, is a tumor classification system based on the analysis of tumor-infiltrating lymphocyte subpopulations. This system has proven promising at predicting metastatic behavior (and thus survival) in a variety of cohorts and, even more interestingly, may not only be prognostic but also highly suggestive of patients amenable to immune-based cancer therapies (31); however, it lacks validation in prospective studies. A task force group has been launched, but validation of data across several cohorts is still awaited (32).

Subdividing patients according to a gene expression-based system is perhaps yet another valuable option. Gene expression data pooled from various studies recently produced a consensus molecular classification of CRC, subdividing it into four classes (33). These clustered around molecular features, such as MSI, hypermutation and immune infiltration (CMS1), chromosomal instability (CMS2-canonical), KRAS mutations and metabolic deregulation (CMS3-metabolic) and a mesenchymal geno- and phenotype (CMS4). Disturbingly, a subcohort of about 7% was not classifiable (labeled as “mixed or indeterminate”) in the proposed set, and a further 7% was classified as mixed groups between the four suggested “pure” groups—thus, further refinement is needed of this proposed classification. In truth, much is still to be learned about the complex genetic and epigenetic makeup of colorectal cancers (34).

Conclusion

Standardized prognostic biomarkers that can reach widespread clinical utility are now needed. Molecular traits are already at the foundations of companion biomarkers research and are increasingly employed to predict which group of patients will respond to any given treatment. Further refinement is needed to avoid under- and overtreatment, prevent patients from harm and ensure effective health economics with increasingly available high-cost drugs. Fast and reliable molecular techniques are now available with increased analytical validity. Clinical validity should be ensured through investigation of correctly powered and well-sized cohorts of homogeneous subgroups in a prospective fashion. Only then will results eventually allow prognostic molecular biomarkers to find a place and be integrated as tumor classification systems beyond the TNM system. We have harvested a huge number of bio-markers for which the prognostic yield has not reach its potential—it is now time to separate the wheat from the chaff for clinical utilization.

Disclosure

The authors declare that they have no competing interests as defined by Molecular Medicine, or other interests that might be perceived to influence the results and discussion reported in this paper.