Molecular pathological classification of colorectal cancer—an update

Dunne, Philip D.; Arends, Mark J.

doi:10.1007/s00428-024-03746-3

Molecular pathological classification of colorectal cancer—an update

REVIEW AND PERSPECTIVES
Open access
Published: 06 February 2024

Volume 484, pages 273–285, (2024)
Cite this article

Download PDF

You have full access to this open access article

Virchows Archiv Aims and scope Submit manuscript

Molecular pathological classification of colorectal cancer—an update

Download PDF

4725 Accesses
2 Citations
22 Altmetric
Explore all metrics

A Correction to this article was published on 28 February 2024

This article has been updated

Abstract

Colorectal cancer (CRC) has a broad range of molecular alterations with two major mechanisms of genomic instability (chromosomal instability and microsatellite instability) and has been subclassified into 4 consensus molecular subtypes (CMS) based on bulk RNA sequence data. Here, we update the molecular pathological classification of CRC with an overview of more recent bulk and single-cell RNA data analysis for development of transcriptional classifiers and risk stratification methods, taking into account the marked inter-tumoural and intra-tumoural heterogeneity of CRC. The importance of the stromal and immune components or tumour microenvironment (TME) to prognosis has emerged from these analyses. Attempts to remove the contribution of the tumour microenvironment and reveal neoplastic-specific transcriptional traits involved identification of the CRC intrinsic subtypes (CRIS). The use of immunohistochemistry and digital pathology to implement classification systems are evolving fields. Conventional adenoma versus serrated polyp pathway transcriptomic analysis and characterisation of canonical LGR5+ crypt base columnar stem cell versus ANXA1+ regenerative stem cell phenotypes emerged as key properties for improved understanding of transcriptional signals involved in molecular subclassification of colorectal cancers. Recently, classification by three pathway-derived subtypes (PDS1-3) has been developed, revealing a continuum of intrinsic biology associated with biological, stem cell, histopathological, and clinical attributes.

Molecular pathological classification of colorectal cancer

Article Open access 20 June 2016

The consensus molecular subtypes of colorectal cancer

Article 12 October 2015

Back to the Colorectal Cancer Consensus Molecular Subtype Future

Article 30 January 2019

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Colorectal cancer (CRC) is the fourth most common cancer in men and women [1]. Most CRC, around 70–80%, are sporadic, while around 20–30% of CRC have a hereditary component, due to either uncommon or rare, high-risk, genetic tumour syndromes, such as Lynch Syndrome (LS) (3–4%) and familial adenomatous polyposis (FAP) (∼ 1%) amongst others [2, 3], or more common but low-risk alleles identified by genome-wide association studies (GWAS) [4, 5]. Only 1–2% of CRC cases arise from inflammatory bowel diseases [6].

Molecular pathways and classification

In 2016, we provided an overview and integration of the molecular classification of CRC [7], emphasising that it is not a homogenous disease, but can be classified into different subtypes, characterised by specific molecular features discovered over the preceding three decades. At the genomic level, despite a very wide range of individual gene alterations, CRC shows two major mechanisms of genomic instability: chromosomal instability (CIN) and microsatellite instability (MSI). Those CRC with chromosomal instability are most common (around ∼ 84% of sporadic CRC) and are characterised by gross changes in chromosome number and structure including deletions, gains, amplifications, translocations, and other often complex chromosomal rearrangements. These are often detectable as a high frequency of DNA somatic copy number alterations (SCNA), which are a hallmark of most tumours that arise by the adenoma-carcinoma sequence [8]. Other studies have associated CIN with inactivating mutations or losses in the adenomatous polyposis coli (APC) tumour suppressor gene, which occur as an early event in the development of bowel neoplasia in this sequence, and/or inactivation of TP53, the guardian of the genome, or other pathway members [2, 9, 10]. The second group (around ∼ 13–16% of sporadic CRC) is hypermutated and shows MSI due to defective DNA mismatch repair (MMR), often associated with wild-type TP53 and a near-diploid pattern of chromosomal stability [11,12,13]. Furthermore, MSI CRC often shows CpG island methylation phenotype (CIMP), which is a feature that induces epigenetic instability by promotor hypermethylation and silencing of a range of tumour suppressor genes, including MLH1, one of the MMR genes [14]. The integrated molecular analysis by The Cancer Genome Atlas project in 2012 [15] confirmed this largely DNA-based classification of CRC into two major groups of MSI CRC (∼ 13–16%) and (2) CIN CRC (∼ 84%) (Fig. 1). Our previous review [7] also briefly covered CRC classification at the transcriptomic level by the Consensus Molecular Subtypes (CMS) Consortium (2015) [16], which analysed CRC bulk RNA expression profiling data from multiple studies to describe four major CMS groups, with a residual mixed group (Fig. 1).

While the two-class DNA-based model (CIN and MSI) identified by genomic instability identifies tumour groups that are clearly distinct from each other in terms of their mutational and copy number alterations, there is an increasing recognition that there remains significant heterogeneity within tumours that are robustly classified as CIN or MSI. The presence and extent of this inter-tumour heterogeneity has been the focus of numerous studies since our previous review, each of which has provided new classification models that aim to capture transcriptional signalling and clinical phenotypes of interest. Furthermore, there have been a number of studies that aim to identify and characterise the factors underpinning these variations, using modern methodologies that have enabled further insight into the molecular and histological features that underpin the intra-tumoural heterogeneity that exists within individual tumours.

History of transcriptional classifications

The clinical value of using transcriptional data as the basis for molecular subtyping in cancer was demonstrated more than two decades ago, by a series of seminal studies in breast cancer [17, 18]. Breast cancers were aggregated into biologically similar subtypes that aligned with prognosis and some previously defined clinical attributes. This pioneering work led to the development of tools like MammaPrint, PAM50, and OncotypeDx to provide information of a patient’s risk of relapse and potential response to chemotherapy.

In the years that followed, the use of transcriptional subtyping in colorectal cancer was largely confined to the development of risk stratifiers that could be used to identify patients with highest risk of disease relapse following surgery in stage II CRC, culminating in the development of a number of FDA-approved diagnostic tests and prognostic assays, such as ColDx and OncotypeDx [19, 20]. However, more recent studies have shown that the biological traits associated with relapse in one subtype can be quite different in another, a point that weakens the value of these general risk stratification tools and supports molecular stratification as a primary assessment of the correlations between transcriptional data and clinical outcomes [21].

Molecular subtyping of CRC 2010–2016

One of the primary goals of molecular subtyping is the generation of molecular biomarkers in cancer that can be used to stratify tumours according to clinical risk groups or biological subtypes, which in turn provide improved understanding of signalling cascades that underpin tumour development and treatment response. As described in our previous review [7], the first landmark classification model in CRC that used multi-omic molecular information was published in 2012 as part of The Cancer Genome Atlas (TCGA) network project [15]. This study utilised mutational, epigenetic, mRNA, and miRNA information to identify molecular subtypes that strongly aligned with the CIN and MSI dogma, with additional substratification of the MSI group based on the level of mutational burden. Highly mutated CRC (∼ 16%) was split into two major groups: (1) hypermutated cancers (∼ 13%) with microsatellite instability (MSI) due to defective mismatch repair (dMMR) or (2) ultramutated cancers (∼ 3%) with DNA polymerase epsilon (POLE) exonuclease domain mutations that inactivate the proofreading function. In contrast, CRC with lower mutation rates (∼ 84%) that were non-hypermutated, microsatellite stable (MSS) cancers, with a high frequency of DNA somatic copy number alterations due to CIN, commonly showed mutations or deletions in APC, TP53, KRAS, PIK3CA, and SMAD4 (Fig. 1) [8, 15, 22].

In the same time period as the TCGA study was published, numerous other studies proposed molecular subtypes of CRC, primarily defined using transcriptional information (at least 5 of them). Remarkably, despite each of them using similar and, in some cases, the same data, there appeared to be little agreement between the genes and biomarkers that underpinned each approach. The reasons for this may be attributed to both technical variations that can undermine cross-platform comparisons of individual gene-level biomarkers and also the different bioinformatic approaches employed by each study for identification, characterisation, and final classifier deployment in these datasets.

To provide clarity to the field and to enable the development of a unified approach to molecular subtyping, an international consortium was assembled that included many of the groups that were behind the development of these individual approaches. This consortium, spanning more than 15 institutes and utilising data from more than 5000 tumour samples, sets out to identify concordance across the previously reported subtypes, leading to the establishment of a new paradigm in the field, the consensus molecular subtypes (CMS), comprising four major groups CMS1-4 (described briefly in our previous review) [16] (Fig. 1). The emergence of this consensus approach in CRC classification enabled the field to have a stable reference point across pre-clinical and clinical research. Numerous studies quickly deployed the CMS approach on clinical samples, to enable retrospective alignment with outcome and response to treatment modalities and to pre-clinical models in the hope of developing new understanding and potential therapeutics for these newly defined transcriptional subtypes.

Role and contributions of the stroma to transcriptome and prognosis

Previously, De Sousa E Melo et al. [23] demonstrated that although different prognostic signatures generally consist of non-overlapping sets of genes, they almost always identify the same group of poor-prognostic cases, suggesting that while individual genes are redundant, it is the most distinctive overall biological features that are associated with prognosis. In line with this data, multiple molecular subtyping studies in CRC have identified levels of fibroblast infiltration and genes specifically originating from cancer-associated fibroblasts (CAFs) as key factors in disease relapse [24,25,26]. The influence of the tumour microenvironment (TME) on the CMS classification and wider transcriptional signalling system was identified by numerous studies [24, 25], and the potential implication for intratumoural heterogeneity in diagnostic samples due to multi-regional sampling was demonstrated [27].

In an attempt to remove the contribution of the tumour microenvironment and reveal neoplastic-specific transcriptional traits, the CRC intrinsic subtypes (CRIS) were developed using profiling of tumour xenografts that filtered out signals from stromal and immune components [28], which in turn resulted in increased stability of classification. The CRIS approach identified five new subtypes, CRIS-A to CRIS-E (Fig. 1). CRIS-A was associated with MSI or KRAS mutations and mucinous features. CRIS-B tumours showed TGF-B pathway activity with EMT and a poor prognosis. CRIS-C cancers had elevated EGFR signalling with sensitivity to EGFR inhibitors. CRIS-D tumours showed WNT pathway activation with IGF-2 gene overexpression or amplification. CRIS-E had a Paneth cell-like phenotype with TP53 mutations. This CRIS subtyping successfully categorised independent primary and metastatic CRC datasets.

However, while these molecular studies identified the transcriptional consequences of these variations, the fundamental role played by the stromal and immune components in CRC when classifying tumours into clinically valuable categories was clearly defined by Jass et al. in the late 1980s [29], who presented a histological system that outperformed Dukes’ staging for predicting clinical outcomes of rectal cancers. This system utilised information equivalent to T and N status, alongside information about the presence and extent of lymphocytic infiltration and epithelial infiltration at the tumour margins, features reminiscent of the traits that are most prominent in CMS1 and CMS4 tumours.

While the promise of the precision medicine era in CRC was heralded by the development of CMS and other tools, the absence of a large clinical impact may be seen as a failure over the last decade. There are numerous underlying reasons for this; however, in the next sections, we focus on the incompatible nature of molecular stratification within the turnaround times required for diagnostic decision-making.

Rapid turn-around CMS classification and emergence of morphology, immunohistochemistry (IHC), and image-based surrogates

Currently, molecular analysis of CRCs for a timely pathology report often involves determination of proficient or deficient mismatch repair status, mostly by MMR immunohistochemistry, using either the 2-antibody [30] or conventional 4-antibody approach. Some laboratories perform MSI testing on tumour DNA, as an alternative to MMR immunohistochemistry, or in combination with it to resolve staining discrepancies [3]. For metastatic CRC, mutational analysis of all RAS genes may be performed when oncologists are considering anti-epidermal growth factor receptor (EGFR) therapy. CRCs with the BRAF V600E substitution may show aggressive behaviour and could be treated with combined BRAF and EGFR inhibition. Some CRCs have overexpression of the HER2 oncogene that may be analysed by immunohistochemistry and/or in situ hybridisation as they may respond to appropriate targeted therapy [22, 31].

Stratification of cancer patients into specific treatment groups, based on the molecular pathological changes of their tumours, has the potential to improve patient outcomes by delivering the right drug to the right patient. Development of predictive biomarkers for clinical use has relied largely on evaluation using low-throughput methods on single-gene status, for example, with KRAS mutational status in CRC as a predictive marker of resistance to EGFR inhibition. The implementation of prospective molecular stratification in randomised controlled trials (RCTs) such as FOxTROT [32] has demonstrated the feasibility of a rapid turnaround (within a week was the aim) for DNA extraction and RAS mutation analysis by pyrosequencing. In line with this requirement, there have been highly accurate CE-marked in vitro diagnostic device tools developed to deliver rapid-turnaround and easy to interpret results [33,34,35]. However, in contrast to single-gene biomarkers, more complex multi-gene and multi-omics classifiers can lead to a significant time lag between tissue processing, molecular profiling, data analysis, and result availability to the clinician. The ‘standard-of-care’ pathway for early stage (stage I–III) localised colonic or rectal cancer is shown in Fig. 2, in which radiological scans and tissue samples taken during endoscopy can be assessed histopathologically to provide the diagnosis and staging of the cancer within the clinically acceptable timeline. As neo-adjuvant response rates improve, the tissue obtained at the initial diagnosis is, in some cases, the only pathological material retrieved from the patient and hence the only material on which to carry out molecular profiling. With advances in molecular profiling technologies, the ability to successfully extract meaningful molecular information from even small, degraded samples increase; however, the time lag for feeding this information back to clinicians for discussion at a multidisciplinary team (MDT) meeting remains a critical issue. Therefore, if the potential patient benefit of this molecular stratification is ever to be realised, this process needs to be moved into rapid-turnaround prospective stratification to fit with the clinical timeline. Many have attempted to link CRC morphological patterns with molecular features with varying degrees of success. However, Budinska et al. (2023) have suggested that the main molecular signals align with characteristic morphological patterns seen in CRC and they examined the extent to which morphotype heterogeneity impedes prognostic and predictive expression-based classifiers [36].

IHC approach to CMS classification

To circumvent the need for molecular profiling and to construct a CMS classification approach that can be deployed in current diagnostic pathology laboratories, Trinh et al. [37] developed a five-marker IHC panel (FRMD6, ZEB1, HTR2B, CDX2, and cytokeratin) that works alongside standard MSI/dMMR testing to deliver a practical classification tool with 87% concordance with the ‘gold-standard’ transcriptomic CMS classification. This system utilised dMMR/MSI status to define CMS1, with remaining MSS cancers being classified into two classes using four of the IHC markers, either an epithelial (CMS2/3 combined) or mesenchymal (CMS4) subtype, with epithelial content being normalised using pan-cytokeratin IHC. The authors acknowledged the lack of separation between the epithelial classes, CMS2 and CMS3, as a limitation of this initial approach, which is driven by distinct biological signalling in the original CMS study. However, this IHC approach to CMS classification has not achieved widespread usage. In addition, as noted in other transcriptomics-based studies, the presence and extent of intratumoural heterogeneity and lack of standard biopsy sampling protocols can potentially undermine classification.

Digital pathology and image-based H&E approaches

The emergence and recent rapid acceleration of the field of digital pathology have been facilitated by the ongoing development of tools like QuPath and Halo [38] that support the generation of methodologies reliant on deep learning and AI, so too will opportunities to rapidly classify diagnostic samples in parallel with pathologist assessment. Development of digital pathology tools has enabled histology-based classification systems to be developed and applied to routine diagnostic H&E samples. Given the strong influence that the tumour microenvironment plays in CMS classification, the emergence of robust image-based classification tools represents a rapid and cost-effective way for upfront decision-making in clinical trials. Using ‘ground-truth’ transcriptional CMS calls from > 1200 tumours with sample-matched H&Es and transcriptional data from both tumour resections and pre-treatment biopsies assembled within the S:CORT consortium, Sirinukunwattana et al. [39] developed an image-based CMS (imCMS) deep learning classifier that could accurately call the four CMS classes when deployed on independent samples (AUC = 0.84 in TCGA samples, and AUC = 0.85 in rectal biopsies). While the headline figures for concordance with transcriptional CMS calls appear similar to the IHC approach, the value of the imCMS method was that it could call each of the four discrete CMS classes, as opposed to combining CMS2 and CMS3 and segregating these from CMS4.

In addition, this imCMS approach did not require parallel MSI/dMMR testing and IHC staining, as it was designed to be performed on diagnostic H&E images. More importantly, alongside the overall sample-level classification, the image-based approach provides an insight into tile-level classification that make up this overall call, enabling a more accurate spatial assessment of the presence and extent of intratumoural heterogeneity in individual samples, an issue that had previously been reported through multi-regional transcriptional assessments.

Single-cell intrinsic CMS (iCMS) classifier

As single-cell sequencing has become more routine in tumour profiling studies, the emergence of molecular classification from these data types has the potential to add more granularity to those using bulk tumour data. By using data derived from ~ 50,000 epithelial cells, Joanito et al. [40] developed the single-cell intrinsic CMS (iCMS) classification model, which identified two epithelial classes with distinct gene expression, transcriptional factor activity, and genomic profiles. In the single-cell data, the authors reported that the iCMS2 class was associated with SCNA/copy number variation (CNV) across many chromosomal regions, whereas iCMS3 displayed limited uniformity in CNVs (Fig. 1). In contrast, almost all MSI tumours were classified as iCMS3, and given this association, these tumours were also associated with mutational burden, CIMP, right-sidedness, and mucinous tumours. The authors demonstrated that this new two-class iCMS system could be combined with the bulk CMS four-class approach, to separate CMS4 tumours into new prognostic groups according to iCMS2 (better outcome) or iCMS3 (worse outcome). Remarkably, although the new iCMS system was based on intrinsic epithelial traits, when applied to bulk tumour data, the iCMS classifier was unable to find any distinct underlying biology within the epithelial-rich CMS2 (that accounts for ~ 40% of CRCs) and CMS3 subtypes, which were almost exclusively assigned to iCMS2 and iCMS3, respectively. CMS1 tumours were almost exclusively assigned as iCMS3. Overall, iCMS3 tumours were more likely to be associated with BRAF, KRAS, and PIK3CA mutations, whereas iCMS2 tumours were associated with mutations in APC and TP53. The authors propose a final model, termed the intrinsic-MSI-fibrosis (IMF) system, as the most informative as it considers the iCMS classification, microsatellite instability status, and levels of CAF-related fibrosis.

Single-cell polyp progression

While the iCMS proposed a set of epithelial classes that are evident in cancer, a number of recent studies have used similar single-cell technologies from 62 patients, across discovery and validation cohorts within the COLON MAP study, to provide more insight into the cell states within conventional adenomas and serrated polyps, alongside the cancers that arise from these developmental pathways [41]. This work confirmed many of the previously defined molecular associations associated with these classes of precancers, including associations of APC abnormalities with the conventional adenoma pathway and BRAF alterations with serrated polyps. This work also confirmed the elevation of LGR5+ cell populations (described later in more detail) in conventional adenomas compared with normal; however, in serrated polyp lesions, no such elevation was noted, with the authors proposing a process associated with metaplasia and loss of expression of the homeobox transcriptional regulator CDX2. The authors proposed that these changes are driven, in part, as a wound-healing response with a regenerative stem-like phenotype (Fig. 3, 4).

A key finding in this study was that in serrated lesions, the presence of cytotoxic cells (CD8+ T cells, NKs, and gdT cells), alongside the activation of an antigen processing and presentation gene signature, was significantly elevated compared with conventional adenomas, similar traits that were elevated in MSI tumours compared to MSS. Importantly, however, these traits were all observed prior to the onset of increased mutational burden (as a result of dMMR/MSI), providing evidence of triggers that drive activation of adaptive immunity in precancerous lesions that appear to be independent of dMMR-driven hypermutation. The authors utilised a set of BRAF-driven (Lrig1 CreERT2/+ ; Braf LSL-V600E/+) and KRAS-driven (Lrig1 CreERT2/+ ; Kras LSL-G12D/+) mouse models of serrated lesions to identify that it is the non-stem differentiated epithelial lineages that give rise to the immune activated environment.

Becker et al. [42] recently defined a continuum of biological signalling, using single-cell RNA sequencing and chromatin profiling, that aligned with the changes in cellular states during normal-precancer-cancer progression in CRC using a cohort of 48 polyps, 27 normal tissues, and 6 cancers collected from 15 patients. Importantly, these cases were disproportionately derived from patients with familial adenomatous polyposis (FAP), with 8 FAP and 7 non-FAP patients. The authors identified a clear elevation in the proportions of components of a cancer-associated TME during normal to cancer progression, namely, increased Tregs, exhausted T cells, pre-CAFs, and mature CAFs.

These TME changes track in parallel with an increase in stem-like epithelial cells from normal to precancer; however, this significant trend did not follow through in the cancer samples, which split evenly into groups with either extremely high stem-like signalling or a group with stem-like epithelial cells equivalent to unaffected colonic tissue. This latter group was suggestive of an alternative progression pathway from precancer to cancer in this subset of cases; however, as indicated above, this may primarily apply to FAP-related cases.

Stem cell classifications and plasticity

Given that epithelial cells are continuously lost due to apoptosis and shedding, the existence of a colonic stem cell population that gives rise to, and replenishes, all epithelial cells lining the intestinal mucosa has been long established. While these stem cells were thought to be located towards the base of the crypts of Lieberkühn in the small intestine, the subsequent discovery and characterisation of colonic crypt base columnar cells (CBCs), and the rapidly proliferating self-renewal properties they displayed, provided a critical explanation for the source and maintenance of many of the key phenotypes observed in colorectal cancer. As studies on CBCs increased, the identification of key selective biomarkers, like LGR5+ [43] and the ability of these stem cells to serve as the ‘cell-of-origin’ for tumourigenesis following inactivation of APC [44], further reinforced this hypothesis.

Although LGR5-positivity provides a marker of CBC stem cells, there have been numerous reports of how LGR5-negative cells can also give rise to neoplastic lesions, particularly within inflammatory, regenerative, or desmoplastic stromal environments [45, 46]. The association between an LGR5-negative cell-of-origin and stromal/inflammatory lesions aligns well with the findings from the single-cell polyp study mentioned earlier [41] that described the dominance of a differentiated regenerative-like metaplasia stem population in serrated polyps. In parallel, recent studies have highlighted that these non-canonical LGR5-negative stem populations may also be the drivers of CRC dissemination, tumour budding, and relapse [47], and while they may account for a small population in primary tumours, they are strongly enriched in metastatic lesions [48].

LGR5 negativity has been associated with inflamed tumours, and elegant recent work has demonstrated how stem cells and, indeed lesions overall, can shift between these LGR5-positive CBC and LGR5-negative ANXA1+ regenerative stem cell (RSC) states, defined as plasticity, as they adapt and respond to changes in microenvironmental conditions (Fig. 4) [46]. At the same time, using heterotypic organoid co-culture models, another recent study revealed that the steps involved in the regulation of this stem cell plasticity can be attributed to both cell-intrinsic and microenvironmental signalling [49], focusing on colonic stem cells (CSC), their regenerative populations described as revival colonic stem cells (revCSC), alongside identification of a distinct set of hyperproliferative colonic stem cells (proCSC).

These studies, using both bulk and single-cell technologies, have revealed the presence and extent of heterogeneity in stem cell populations within CRC, providing a more detailed assessment of the dynamics and consequences of the inter- and intra-cellular signalling networks that are ongoing within the heterogeneous milieu of lineages within a tumour mass. Importantly, however, these studies have used different terminology to describe each of the possibly overlapping stem populations, meaning that there remains a need for detailed assessment of each of these biomarkers in an agreed way in order to produce more consistent nomenclature.

Limitations of gene-based classifiers versus pathway-based classifiers

The molecular subtyping approaches described thus far, using both bulk and single-cell data, rely on methodologies defined by the early breast cancer subtyping work of the late 1990s and early 2000s, where individual gene-level expression data from microarray or RNAseq served as the template for aggregating tumours into similar subgroups. In contrast, it is well understood that pathway-level data, where genes are arranged into experimentally validated pathway signatures to represent important biological signalling pathways, provides a quick and reproducible way to test associations between groups of samples according to a broad range of molecular mechanisms and phenotypes. Given the biological value that pathway-level data provides, almost all molecular subtyping studies may use collections of such Gene Ontology signatures, like the Molecular Signature Database (MSigDB), to identify significant associations between these pathways and their identified subtypes [50]. Significantly elevated signalling can then be used as the hallmark features in each subtype compared to the others; as exemplified by cell cycle activation and signalling in both WNT and MYC targets in CMS2, metabolic signalling pathways in CMS3, and TGF-β activation and EMT signalling in CMS4 [16]. Based on these successes, gene-level discovery followed by pathway-level characterisation represents a more widely applicable approach.

Given that the end goal of many subtyping development studies is an eventual alignment and characterisation with important biological phenotypes in each subtype, we proposed a new method that changes the sequence of this stepwise approach, with the aim of providing a closer link with molecular mechanisms and clinical phenotypes. This pathway-level approach should replace the initial gene-level clustering by directly using these Gene Ontology and biological pathway signatures as the basis for grouping samples.

Pathway-derived subtypes

The first step in this alternative class discovery approach was to convert all our existing gene-level data cohorts into pathway-level scores prior to subtype discovery, across ~ 2000 signatures associated with biological processes contained within these databases to generate a matrix of 640,000 + combinations of biological phenotypes. When clustering is performed, using the methods in the same way as other gene-level studies, three pathway-derived subtypes (PDS) in CRC were identified, where PDS1 (26%), PDS2 (31%), PDS3 (30%), and a smaller more heterogeneous residual ‘mixed’ group that accounted for ~ 13% of tumours across the CRC cohorts (Fig. 1) [51].

Comparing the PDS and CMS classifications of the same data revealed granularity within the largest tumour subtype defined as epithelial-rich with uniform signalling attributes in the original CMS study, CMS2 group, which was now split almost equally into two highly distinct transcriptional subtypes, PDS1 and PDS3. At the same time, as identifying granularity in the epithelial-rich subtypes, the PDS approach found that the inflammatory/stromal CMS1/CMS4 subtypes were combined within a single subtype, PDS2 (Fig. 4) [51].

Remarkably, despite the clearly distinct transcriptional landscapes observed according to PDS classification, outside of enrichment for BRAF mutations and fewer APC mutations, in the PDS2 group (these are expected within the CMS1 and CMS4 groups, respectively), mutational and copy number profiles across all key genes assessed within the WNT, MAPK, PIK3CA, cell cycle, or TGF-β pathways were identical in PDS1 and PDS3, again, the two groups that contained equal proportions of CMS2 tumours. Downstream characterisation of PDS groups revealed that despite the absence of any genomic distinctions, these transcriptionally distinct subtypes were dominated by highly significant differences in many of the key cancer-associated hallmarks used in subtyping studies. As expected, PDS2 tumours were enriched for many traits associated with inflammatory/immune signalling pathways, such as stroma-related epithelial-to-mesenchymal transition (EMT), TGF-β pathway activation, and interferon responses. However, while PDS1 tumours displayed uniform and highly significant elevation of cell cycle-related pathways and MYC/WNT target activation in every single sample classified, there was almost universal transcriptional repression in PDS3 for many previously defined cancer-associated hallmarks [51].

Furthermore, while PDS1 was associated with fast-cycling canonical stem cells (LGR5 staining and CBC signatures), PDS2 was associated with regenerative stem cells (ANXA1 staining and RSC signatures), similar to the observed repression for cancer-relevant hallmarks; PDS3 was depleted/devoid of both of these stem populations and displayed low Ki67 staining. Although the majority of the previously described studies have focussed on changes in the stem populations, the absence of these cell populations was coupled with signalling in PDS3 tumours that appeared to indicate elevated numbers of differentiated colonic epithelial lineages, particularly transit-amplifying cells, enteroendocrine cells, and enterocytes (Fig. 4).

When H&Es were assessed either manually or using AI models (similar to imCMS), PDS3 tumours were indistinguishable from PDS1 and PDS2, and no pathological features or differentiation/grading differences were observed. While the presence of a slow-cycling, stem-depleted, and transcriptionally repressed group that is indistinguishable by histology and contains the same genomic profile as other tumours is interesting in itself, when tested in a series of clinical cohorts including the PETACC-3 clinical trial, PDS3 tumours represented the worst stage II/III prognostic group in terms of relapse-free survival following surgery, regardless of treatment.

To complement the PDS classifier, and the numerous stem cell classifiers that exist, we proposed a ‘Stem Maturation Index’ (SMI) classifier tool that provides a macro-view of overall cellular states when used in bulk data, or, when used in single-cell data, a comparative measurement of stem-ness versus differentiation for individual epithelial lineages. Importantly, this approach also offers a smoother transition between bulk, single-cell, and spatial transcriptomics, as it can reduce technical biases that undermine individual gene/probe assessments across platforms, enabling a more robust assessment of subtle signalling pathways underpinning tumour cell identity. While the previous stem cell classifiers may suggest that tumour cells display one or more of these stem states, our PDS and SMI data indicates that ~ 25–30% of CRC are more aligned to features of normal-like epithelial homeostasis in terms of stem-to-differentiated ratios even when they display all the same proportions of cancer driver mutations.

Looking forward—re-discovery of findings from bulk and single-cell research in the era of spatial profiling

The use of bulk molecular data, which has been used here to describe any method that does not specifically sort different lineages prior to processing, typically involves the use of macro-dissected tissue from annotated slides, tissue curls or fresh/frozen tissue pieces. While estimates can be made about lineage abundance from annotated H&E or IHC-stained sequential sections if available, the precise composition of the tissue sample used to generate the bulk molecular data remains unknown. Furthermore, although estimates can be made as to these abundances, a reliable estimate of the identity of the precise lineage(s) that each RNA/DNA signal arises from cannot be determined, meaning that bulk profiles in each of these studies only offer, in the case of RNAseq, an average expression value for each gene across the full milieu of cells that were processed in each sample.

When discussed in this context, the advantages of single-cell technologies and the lineage-specific resolution they bring have offered the field an intriguing insight into the presence and extent of both genomic and transcriptomic heterogeneity within tumours. It can be argued that in the era of single-cell technologies, bulk profiling is too dated to be useful; however, as exemplified by the PDS study, bulk transcriptomics can still successfully be used as the basis for novel biological discovery and risk-stratification that can in-turn be interrogated/validated with newer methods. Furthermore, although the lack of lineage-specific information attributed to bulk profiling discussed above is a limitation, the fact that serial sections can be used to identify the precise localisation of expression in the same tumour has enabled bulk discoveries to provide some insights into subtyping/biomarker research and translational/diagnostic pathology.

The studies described here highlight how, as technologies advance, so too does our understanding of the intricate mechanisms underpinning cancer development and progression, revealing a unique insight into the sometime subtle signalling pathways that are likely to be key to the inter-compartmental crosstalk that drives tumour-wide responses. It could also be argued that many of the key findings from the subtyping studies over the last decade have relied heavily on molecular events and histological features that were previously discovered and characterised using routine histopathology and immunohistochemistry, as exemplified by the alignment between the Jass and CMS classifications, or the placement of well-established Vogelstein-described molecular events within previously defined polyp morphologies.

In line with this latter argument, in the same way that bulk profiling preceded single-cell technologies in biomarker development and molecular studies, the advent and widespread adoption of spatial-based technologies holds enormous potential for driving our understand on further. Future studies will likely begin to use parallel deep phenotyping methodologies in bulk and single-cell sequencing data from the same sample, complemented with advanced in situ tissue profiling using spatial transcriptomics, multiplex immunofluorescence/proteomics, alongside AI-based digital pathology. In this scenario, the signalling pathways and subtypes that were collapsed into an average score in bulk profiling could be revealed in individual cells at high-resolution across the entire field of cancer cells and stroma that pathologists use to generate diagnostic reports.

It may be unsurprising that a review such as this would end by promoting the value of pathology in guiding the next wave of molecular subtyping discoveries; however, the CRC field is on the cusp of producing some of the largest and most detailed tissue-based datasets that have ever existed. In this new era of spatially informed molecular research, pathology-led studies are once again required to ensure that cancer discoveries are developed based on the discipline that bridges science and medicine.

Change history

28 February 2024
A Correction to this paper has been published: https://doi.org/10.1007/s00428-024-03773-0

References

Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, Bray F (2021) Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 71(3):209–249
Article PubMed Google Scholar
Fearon ER, Vogelstein B (1990) A genetic model for colorectal tumorigenesis. Cell 61(5):759–767
Article CAS PubMed Google Scholar
Poulogiannis G, Frayling IM, Arends MJ (2010) DNA mismatch repair deficiency in sporadic colorectal cancer and Lynch syndrome. Histopathology 56(2):167–179
Article PubMed Google Scholar
Dunlop MG, Dobbins SE, Farrington SM, Jones AM, Palles C, Whiffin N, Tenesa A, Spain S, Broderick P, Ooi L-Y (2012) Common variation near CDKN1A, POLD3 and SHROOM2 influences colorectal cancer risk. Nat Genet 44(7):770–776
Article CAS PubMed PubMed Central Google Scholar
Whiffin N, Hosking FJ, Farrington SM, Palles C, Dobbins SE, Zgaga L, Lloyd A, Kinnersley B, Gorman M, Tenesa A (2014) Identification of susceptibility loci for colorectal cancer in a genome-wide meta-analysis. Hum Mol Genet 23(17):4729–4737
Article CAS PubMed PubMed Central Google Scholar
Munkholm P (2003) The incidence and prevalence of colorectal cancer in inflammatory bowel disease. Aliment Pharmacol Ther 18:1–5. https://doi.org/10.1046/j.1365-2036.18.s2.2.x
Article PubMed Google Scholar
Müller MF, Ibrahim AEK, Arends MJ (2016) Molecular pathological classification of colorectal cancer. Virchows Arch 469(2):125–134. https://doi.org/10.1007/s00428-016-1956-3
Article CAS PubMed PubMed Central Google Scholar
Poulogiannis G, Ichimura K, Hamoudi RA, Luo F, Leung SY, Yuen ST, Harrison DJ, Wyllie AH, Arends MJ (2010) Prognostic relevance of DNA copy number changes in colorectal cancer. J Pathol 220(3):338–347
Article CAS PubMed Google Scholar
Frayling I, Arends M (2013) Adenomatous polyposis coli. In: Maloy S, Hughes K (eds) Brenner’s Encyclopedia of Genetics, vol 1, 2nd edn. Academic Press, San Diego, pp 27–29
Morin PJ, Sparks AB, Korinek V, Barker N, Clevers H, Vogelstein B, Kinzler KW (1997) Activation of β-catenin-Tcf signaling in colon cancer by mutations in β-catenin or APC. Science 275(5307):1787–1790. https://doi.org/10.1126/science.275.5307.1787
Article CAS PubMed Google Scholar
Cerretelli G, Ager A, Arends MJ, Frayling IM (2020) Molecular pathology of Lynch syndrome. J Pathol 250(5):518–531. https://doi.org/10.1002/path.5422
Umar A, Boland CR, Terdiman JP, Syngal S, de la Chapelle A, Rüschoff J, Fishel R, Lindor NM, Burgart LJ, Hamelin R (2004) Revised Bethesda Guidelines for hereditary nonpolyposis colorectal cancer (Lynch syndrome) and microsatellite instability. J Natl Cancer Inst 96(4):261–26813
Article CAS PubMed Google Scholar
Ibrahim AE, Arends MJ, Silva A-L, Wyllie AH, Greger L, Ito Y, Vowler SL, Huang TH, Tavaré S, Murrell A (2010) Sequential DNA methylation changes are associated with DNMT3B overexpression in colorectal neoplastic progression. Gut 60(4):499–508. https://doi.org/10.1136/gut.2010.223602
Article CAS PubMed Google Scholar
Gay LJ, Arends MJ, Mitrou PN, Bowman R, Ibrahim AE, Happerfield L, Luben R, McTaggart A, Ball RY, Rodwell SA (2011) MLH1 promoter methylation, diet, and lifestyle factors in mismatch repair deficient colorectal cancer patients from EPIC-Norfolk. Nutr Cancer: Int J 63(7):1000–1010. https://doi.org/10.1080/01635581.2011.596987
Article CAS Google Scholar
The Cancer Genome Atlas Network (326 collaborators) (2012) Comprehensive molecular characterization of human colon and rectal cancer. Nature 487(7407):330–337 (http://www.nature.com/nature/journal/v487/n7407/abs/nature11252.html#supplementary-information)
Article Google Scholar
Guinney J, Dienstmann R, Wang X, de Reynies A, Schlicker A, Soneson C, Marisa L, Roepman P, Nyamundanda G, Angelino P, Bot BM, Morris JS, Simon IM, Gerster S, Fessler E, De Sousa E, Melo F, Missiaglia E, Ramay H, Barras D, Homicsko K, Maru D, Manyam GC, Broom B, Boige V, Perez-Villamil B, Laderas T, Salazar R, Gray JW, Hanahan D, Tabernero J, Bernards R, Friend SH, Laurent-Puig P, Medema JP, Sadanandam A, Wessels L, Delorenzi M, Kopetz S, Vermeulen L, Tejpar S (2015) The consensus molecular subtypes of colorectal cancer. Nat Med 21(11):1350–1356
Article CAS PubMed PubMed Central Google Scholar
Perou CM, Sørlie T, Eisen MB, van de Rijn M, Jeffrey SS, Rees CA, Pollack JR, Ross DT, Johnsen H, Akslen LA, Fluge O, Pergamenschikov A, Williams C, Zhu SX, Lønning PE, Børresen-Dale AL, Brown PO, Botstein D (2000) Molecular portraits of human breast tumours. Nature 406(6797):747–752. https://doi.org/10.1038/35021093
Article CAS PubMed Google Scholar
Sorlie T, Perou CM, Tibshirani R, Aas T, Geisler S, Johnsen H, Hastie T, Eisen MB, van de Rijn M, Jeffrey SS, Thorsen T, Quist H, Matese JC, Brown PO, Botstein D, Lønning PE, Børresen-Dale AL (2001) Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications. Proc Natl Acad Sci USA 98(19):10869–10874. https://doi.org/10.1073/pnas.191367098
Article CAS PubMed PubMed Central Google Scholar
Webber EM, Lin JS, Evelyn PW (2010) Oncotype DX tumor gene expression profiling in stage II colon cancer. Application: prognostic, risk prediction. PLoS Curr 2:RRN1177
Article PubMed PubMed Central Google Scholar
Salazar R, Roepman P, Capella G, Moreno V, Simon I, Dreezen C et al (2011) Gene expression signature to improve prognosis prediction of stage II and III colorectal cancer. J Clin Oncol 29(1):17–24
Article PubMed Google Scholar
Bramsen JB, Rasmussen MH, Ongen H, Mattesen TB, Ørntoft M-BW, Árnadóttir SS, Sandoval J, Laguna T, Vang S, Øster B, Lamy P, Madsen MR, Laurberg S, Esteller M, Dermitzakis ET, Ørntoft TF, Andersen CL (2017) Molecular-subtype-specific biomarkers improve prediction of prognosis in colorectal cancer. Cell Rep 19(6):1268–1280. https://doi.org/10.1016/j.celrep.2017.04.045
Article CAS PubMed Google Scholar
Arends MJ (2013) Pathways of colorectal carcinogenesis. Appl Immunohistochem Mol Morphol 21(2):97–102. https://doi.org/10.1097/PAI.0b013e3182849808
Article CAS PubMed Google Scholar
De Sousa E, Melo F, Wang X, Jansen M, Fessler E, Trinh A, de Rooij LPMH et al (2013) Poor-prognosis colon cancer is defined by a molecularly distinct subtype and develops from serrated precursor lesions. Nat Med 19(5):614–618
Article Google Scholar
Isella C, Terrasi A, Bellomo SE, Petti C, Galatola G, Muratore A, Mellano A, Senetta R, Cassenti A, Sonetto C, Inghirami G, Trusolino L, Fekete Z, De Ridder M, Cassoni P, Storme G, Bertotti A, Medico E (2015) Stromal contribution to the colorectal cancer transcriptome. Nat Genet 47(4):312–319. https://doi.org/10.1038/ng.3224
Article CAS PubMed Google Scholar
Calon A, Lonardo E, Berenguer-Llergo A, Espinet E, Hernando-Momblona X, Iglesias M, Sevillano M, Palomo-Ponce S, Tauriello DVF, Byrom D, Cortina C, Morral C, Barceló C, Tosi S, Riera A, Attolini CS-O, Rossell D, Sancho E, Batlle E (2015) Stromal gene expression defines poor-prognosis subtypes in colorectal cancer. Nat Genet 47(4):320–329. https://doi.org/10.1038/ng.3225
Article CAS PubMed Google Scholar
Corry SM, McCorry AM, Lannagan TR, Leonard NA, Fisher NC, Byrne RM, Tsantoulis P, Cortes-Lavaud X, Amirkhah R, Redmond KL, McCooey AJ, Malla SB, Rogan E, Sakhnevych S, Gillespie MA, White M, Richman SD, Jackstadt R-F, Campbell AD, Maguire S, S:CORT and ACRCelerate consortia, McDade SS, Longley DB, Loughrey MB, Coleman HG, Kerr EM, Tejpar S, Maughan T, Tejpar S, Leedham SJ, Small DM, Ryan AE, Sansom OJ, Lawler M, Dunne PD (2022) Activation of innate-adaptive immune machinery by poly(I:C) exposes a therapeutic vulnerability to prevent relapse in stroma-rich colon cancer. Gut 71(12):2502–2517. https://doi.org/10.1136/gutjnl-2021-326183
Article CAS PubMed Google Scholar
Dunne PD, McArt DG, Bradley CA, O’Reilly PG, Barrett HL, Cummins R, O’Grady T, Arthur K, Loughrey MB, Allen WL, McDade SS, Waugh DJ, Hamilton PW, Longley DB, Kay EW, Johnston PG, Lawler M, Salto-Tellez M, Van Schaeybroeck S (2016) Challenging the cancer molecular stratification dogma: intratumoral heterogeneity undermines consensus molecular subtypes and potential diagnostic value in colorectal cancer. Clin Cancer Res 22(16):4095–4104. https://doi.org/10.1158/1078-0432.CCR-16-0032
Article CAS PubMed Google Scholar
Isella C, Brundu F, Bellomo SE, Galimi F, Zanella E, Porporato R, Petti C, Fiori A, Orzan F, Senetta R, Boccaccio C, Ficarra E, Marchionni L, Trusolino L, Medico E, Bertotti A (2017) Selective analysis of cancer-cell intrinsic transcriptional traits defines novel clinically relevant subtypes of colorectal cancer. Nature Comms 8:15107
Article CAS Google Scholar
Jass JR, Love SB, Northover JMA (1987) A new prognostic classification of rectal cancer. Lancet 329(8545):1303–1306
Article Google Scholar
Aiyer KTS, Doeleman T, Ryan NA, Nielsen M, Crosbie EJ, Smit VTHBM, Morreau H, Goeman JJ, Bosse T (2022) Validity of a two-antibody testing algorithm for mismatch repair deficiency testing in cancer; a systematic literature review and meta-analysis. Mod Pathol 35(12):1775–1783. https://doi.org/10.1038/s41379-022-01149-w
Article CAS PubMed PubMed Central Google Scholar
Imyanitov E, Kuligina E (2021) Molecular testing for colorectal cancer: clinical applications. World J Gastrointest Oncol 13(10):1288–1301. https://doi.org/10.4251/wjgo.v13.i10.1288
Article PubMed PubMed Central Google Scholar
Morton D, Seymour M, Magill L, Handley K, Glasbey J, Glimelius B, Palmer A, Seligmann J, Laurberg S, Murakami K, West N, Quirke P, Gray R, FOxTROT Collaborative Group (2023) Preoperative chemotherapy for operable colon cancer: mature results of an international randomized controlled trial. J Clin Oncol 41(8):1541–1552. https://doi.org/10.1200/JCO.22.00046
Article CAS PubMed PubMed Central Google Scholar
Colling R, Wang LM, Soilleux E (2017) Validating a fully automated real-time PCR-based system for use in the molecular diagnostic analysis of colorectal carcinoma: a comparison with NGS and IHC. J Clin Pathol 70(7):610–614. https://doi.org/10.1136/jclinpath-2017-204356
Article CAS PubMed Google Scholar
Weyn C, Van Raemdonck S, Dendooven R, Maes V, Zwaenepoel K, Lambin S, Pauwels P (2017) Clinical performance evaluation of a sensitive, rapid low-throughput test for KRAS mutation analysis using formalin-fixed, paraffin-embedded tissue samples. BMC Cancer 17(1):139. https://doi.org/10.1186/s12885-017-3112-0
Article CAS PubMed PubMed Central Google Scholar
Van Haele M, Vander Borght S, Ceulemans A, Wieërs M, Metsu S, Sagaert X, Weynand B (2020) Rapid clinical mutational testing of KRAS, BRAF and EGFR: a prospective comparative analysis of the Idylla technique with high-throughput next-generation sequencing. J Clin Pathol 73(1):35–41. https://doi.org/10.1136/jclinpath-2019-205970
Article CAS PubMed Google Scholar
Budinská E, Hrivňáková M, Ivkovic TC, Madrzyk M, Nenutil R, Bencsiková B, Tukmachi DA, Ručková M, Dubská LZ, Slabý O, Feit J, Dragomir M-P, Linhartova PB, Tejpar S, Popovici V (2023) Molecular portraits of colorectal cancer morphological regions. eLife 12:RP86655
Article PubMed PubMed Central Google Scholar
Trinh A, Trumpi K, De Sousa E, Melo F, Wang X, de Jong JH, Fessler E, Kuppen PJ, Reimers MS, Swets M, Koopman M, Nagtegaal ID, Jansen M, Hooijer GK, Offerhaus GJ, Kranenburg O, Punt CJ, Medema JP, Markowetz F, Vermeulen L (2017) Practical and robust identification of molecular subtypes in colorectal cancer by immunohistochemistry. Clin Cancer Res 23(2):387–398. https://doi.org/10.1158/1078-0432.CCR-16-0680
Article CAS PubMed Google Scholar
Bankhead P, Loughrey MB, Fernández JA, Dombrowski Y, McArt DG, Dunne PD, McQuaid S, Gray RT, Murray LJ, Coleman HG, James JA, Salto-Tellez M, Hamilton PW (2017) QuPath: open source software for digital pathology image analysis. Sci Rep 7(1):16878. https://doi.org/10.1038/s41598-017-17204-5
Article CAS PubMed PubMed Central Google Scholar
Sirinukunwattana K, Domingo E, Richman SD, Redmond KL, Blake A, Verrill C, Leedham SJ, Chatzipli A, Hardy C, Whalley CM, Wu CH, Beggs AD, McDermott U, Dunne PD, Meade A, Walker SM, Murray GI, Samuel L, Seymour M, Tomlinson I, Quirke P, Maughan T, Rittscher J, Koelzer VH, S:CORT consortium (2019) Image-based consensus molecular subtype (imCMS) classification of colorectal cancer using deep learning. Gut 70(3):544–554. https://doi.org/10.1136/gutjnl-2019-319866
Article CAS Google Scholar
Joanito I, Wirapati P, Zhao N, Nawaz Z, Yeo G, Lee F, Eng CLP, Macalinao DC, Kahraman M, Srinivasan H, Lakshmanan V, Verbandt S, Tsantoulis P, Gunn N, Venkatesh PN, Poh ZW, Nahar R, Oh HLJ, Loo JM, Chia S, Cheow LF, Cheruba E, Wong MT, Kua L, Chua C, Nguyen A, Golovan J, Gan A, Lim WJ, Guo YA, Yap CK, Tay B, Hong Y, Chong DQ, Chok AY, Park WY, Han S, Chang MH, Seow-En I, Fu C, Mathew R, Toh EL, Hong LZ, Skanderup AJ, DasGupta R, Ong CJ, Lim KH, Tan EKW, Koo SL, Leow WQ, Tejpar S, Prabhakar S, Tan IB (2022) Single-cell and bulk transcriptome sequencing identifies two epithelial tumor cell states and refines the consensus molecular classification of colorectal cancer. Nat Genet 54:963–975
Article CAS PubMed PubMed Central Google Scholar
Chen B, Scurrah CR, McKinley ET, Simmons AJ, Ramirez–Solano MA, Zhu X, Markham NO, Heiser CN, Vega PN, Rolong A, Kim H, Sheng Q, Drewes JL, Zhou Y, Southard–Smith AN, Xu Y, Ro J, Jones AL, Revetta F, Berry LD, Niitsu H, Islam M, Pelka K, Hofree M, Chen JH, Sarkizova S, Ng K, Giannakis M, Boland GM, Aguirre AJ, Anderson AC, Rozenblatt–Rosen O, Regev A, Hacohen N, Kawasaki K, Sato T, Goettel JA, Grady WM, Zheng W, Washington MK, Cai Q, Sears CL, Goldenring JR, Franklin JL, Su T, Huh WJ, Vandekar S, Roland JT, Liu Q, Coffey RJ, Shrubsole MJ, Lau KS (2021) Differential pre-malignant programs and microenvironment chart distinct paths to malignancy in human colorectal polyps. Cell 184(26):6262-6280.e26. https://doi.org/10.1016/j.cell.2021.11.031
Article CAS PubMed PubMed Central Google Scholar
Becker WR, Nevins SA, Chen DC, Chiu R, Horning AM, Guha TK, Laquindanum R, Mills M, Chaib H, Ladabaum U, Longacre T, Shen J, Esplin ED, Kundaje A, Ford JM, Curtis C, Snyder MP, Greenleaf WJ (2022) Single-cell analyses define a continuum of cell state and composition changes in the malignant transformation of polyps to colorectal cancer. Nat Genet 54:985–995
Article CAS PubMed PubMed Central Google Scholar
Barker N, van Es JH, Kuipers J, Kujala P, van den Born M, Cozijnsen M, Haegebarth A, Korving J, Begthel H, Peters PJ, Clevers H (2007) Identification of stem cells in small intestine and colon by marker gene Lgr5. Nature 449(7165):1003–1007. https://doi.org/10.1038/nature06196
Article CAS PubMed Google Scholar
Barker N, Ridgway RA, van Es JH, van de Wetering M, Begthel H, van den Born M, Danenberg E, Clarke AR, Sansom OJ, Clevers H (2009) Crypt stem cells as the cells-of-origin of intestinal cancer. Nature 457(7229):608–611. https://doi.org/10.1038/nature07602
Article CAS PubMed Google Scholar
Schwitalla S, Fingerle AA, Cammareri P, Nebelsiek T, Göktuna SI, Ziegler PK, Canli O, Heijmans J, Huels DJ, Moreaux G, Rupec RA, Gerhard M, Schmid R, Barker N, Clevers H, Lang R, Neumann J, Kirchner T, Taketo MM, van den Brink GR, Sansom OJ, Arkan MC, Greten FR (2013) Intestinal tumorigenesis initiated by dedifferentiation and acquisition of stem-cell-like properties. Cell 152(1–2):25–38. https://doi.org/10.1016/j.cell.2012.12.012
Article CAS PubMed Google Scholar
Vasquez EG, Nasreddin N, Valbuena GN, Mulholland EJ, Belnoue-Davis HL, Eggington HR, Schenck RO, Wouters VM, Wirapati P, Gilroy K, Lannagan TRM, Flanagan DJ, Najumudeen AK, Omwenga S, McCorry AMB, Easton A, Koelzer VH, East JE, Morton D, Trusolino L, Maughan T, Campbell AD, Loughrey MB, Dunne PD, Tsantoulis P, Huels DJ, Tejpar S, Sansom OJ, Leedham SJ (2022) Dynamic and adaptive cancer stem cell population admixture in colorectal neoplasia. Cell Stem Cell 29(8):1213-1228.e8. https://doi.org/10.1016/j.stem.2022.07.008. (Erratum.In:CellStemCell29(11):1612)
Article CAS PubMed Google Scholar
Fumagalli A, Oost KC, Kester L, Morgner J, Bornes L, Bruens L, Spaargaren L, Azkanaz M, Schelfhorst T, Beerling E, Heinz MC, Postrach D, Seinstra D, Sieuwerts AM, Martens JWM, van der Elst S, van Baalen M, Bhowmick D, Vrisekoop N, Ellenbroek SIJ, Suijkerbuijk SJE, Snippert HJ, van Rheenen J (2020) Plasticity of Lgr5-negative cancer cells drives metastasis in colorectal cancer. Cell Stem Cell 26(4):569-578.e7. https://doi.org/10.1016/j.stem.2020.02.008
Article CAS PubMed PubMed Central Google Scholar
Cañellas-Socias A, Cortina C, Hernando-Momblona X, Palomo-Ponce S, Mulholland EJ, Turon G, Mateo L, Conti S, Roman O, Sevillano M, Slebe F, Stork D, Caballé-Mestres A, Berenguer-Llergo A, Álvarez-Varela A, Fenderico N, Novellasdemunt L, Jiménez-Gracia L, Sipka T, Bardia L, Lorden P, Colombelli J, Heyn H, Trepat X, Tejpar S, Sancho E, Tauriello DVF, Leedham S, Attolini CS, Batlle E (2022) Metastatic recurrence in colorectal cancer arises from residual EMP1+ cells. Nature 611(7936):603–613. https://doi.org/10.1038/s41586-022-05402-9
Article CAS PubMed Google Scholar
Qin X, Rodriguez FC, Sufi J, Vlckova P, Claus J, Tape CJ (2023) An oncogenic phenoscape of colonic stem cell polarization. Cell 186(25):5554–5568. https://doi.org/10.1016/j.cell.2023.11.004
Article CAS PubMed Google Scholar
Liberzon A, Birger C, Thorvaldsdóttir H, Ghandi M, Mesirov JP, Tamayo P (2015) The Molecular Signatures Database (MSigDB) hallmark gene set collection. Cell Syst 1(6):417–425
Article CAS PubMed PubMed Central Google Scholar
Malla SB, Byrne RM, Lafarge MW, Corry SM, Fisher NC, Tsantoulis PK, Campbell A, Lannagan TRM, Najumudeen AK, Gilroy KL, Amirkhah R, Maguire SL, Mulholland EJ, Belnoue-Davis HL, Grassi E, Viviani M, Rogan E, Redmond KL, Sakhnevych S, McCooey AJ, Bull C, Hoey E, Sinevici N, Hall H, Ahmaderaghi B, Domingo E, Blake A, Richman SD, Isella C, Miller C, Bertotti A, Trusolino L, Loughrey MB, Kerr EM, Tejpar S, Maughan TS, Lawler M, Leedham SJ, Koelzer VH, Sansom OJ, Dunne PD (2024) Pathway level subtyping identifies a slow-cycling and transcriptionally lethargic biological phenotype associated with poor clinical outcomes in colon cancer independent of genetics. Research Square. https://doi.org/10.21203/rs.3.rs-3891488/v1

Download references

Author information

Authors and Affiliations

Patrick G. Johnston Centre for Cancer Research, Queens University Belfast, Belfast, Northern Ireland, BT8 7AE, UK
Philip D. Dunne
Cancer Research UK Scotland Institute, Garscube Estate, Glasgow, G61 1QH, UK
Philip D. Dunne
Edinburgh Pathology & Cancer Research UK Scotland Centre, Institute of Genetics & Cancer, University of Edinburgh, Crewe Road, Edinburgh, EH4 2XR, UK
Mark J. Arends

Authors

Philip D. Dunne
View author publications
You can also search for this author in PubMed Google Scholar
Mark J. Arends
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Both authors, Philip Dunne and Mark Arends, contributed approximately equally to the writing and editing of this review manuscript, and both authors contributed to the preparation of the figures, figure legends, and reference list. Where the authors refer to their own previously published manuscripts, this is indicated in the text of the manuscript.

Corresponding author

Correspondence to Mark J. Arends.

Ethics declarations

This review manuscript does not provide any original research findings, but only reviews previously published manuscripts and discusses their findings, placing them into perspective. Therefore, there is compliance with ethical standards through previous ethically acceptable publications.

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Dunne, P.D., Arends, M.J. Molecular pathological classification of colorectal cancer—an update. Virchows Arch 484, 273–285 (2024). https://doi.org/10.1007/s00428-024-03746-3

Download citation

Received: 06 December 2023
Revised: 16 January 2024
Accepted: 19 January 2024
Published: 06 February 2024
Issue Date: February 2024
DOI: https://doi.org/10.1007/s00428-024-03746-3

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Molecular pathological classification of colorectal cancer—an update

Abstract

Similar content being viewed by others

Molecular pathological classification of colorectal cancer

The consensus molecular subtypes of colorectal cancer

Back to the Colorectal Cancer Consensus Molecular Subtype Future

Introduction

Molecular pathways and classification

History of transcriptional classifications

Molecular subtyping of CRC 2010–2016

Role and contributions of the stroma to transcriptome and prognosis

Rapid turn-around CMS classification and emergence of morphology, immunohistochemistry (IHC), and image-based surrogates

IHC approach to CMS classification

Digital pathology and image-based H&E approaches

Single-cell intrinsic CMS (iCMS) classifier

Single-cell polyp progression

Stem cell classifications and plasticity

Limitations of gene-based classifiers versus pathway-based classifiers

Pathway-derived subtypes

Looking forward—re-discovery of findings from bulk and single-cell research in the era of spatial profiling

Change history

28 February 2024

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Molecular pathological classification of colorectal cancer—an update

Abstract

Similar content being viewed by others

Molecular pathological classification of colorectal cancer

The consensus molecular subtypes of colorectal cancer

Back to the Colorectal Cancer Consensus Molecular Subtype Future

Introduction

Molecular pathways and classification

History of transcriptional classifications

Molecular subtyping of CRC 2010–2016

Role and contributions of the stroma to transcriptome and prognosis

Rapid turn-around CMS classification and emergence of morphology, immunohistochemistry (IHC), and image-based surrogates

IHC approach to CMS classification

Digital pathology and image-based H&E approaches

Single-cell intrinsic CMS (iCMS) classifier

Single-cell polyp progression

Stem cell classifications and plasticity

Limitations of gene-based classifiers versus pathway-based classifiers

Pathway-derived subtypes

Looking forward—re-discovery of findings from bulk and single-cell research in the era of spatial profiling

Change history

28 February 2024

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation