Introduction

Colorectal cancer (CRC) is the third commonest cancer worldwide and caused 15,903 deaths in 2014 in the United Kingdom alone [1]. The stage of disease at diagnosis is the most important factor dictating survival. If the cancer is detected early, the reported 5-year survival rate is 90%, which can decrease to 14% if the disease is advanced on diagnosis [2]. The natural history of CRC is to develop from a benign adenoma, and the estimated time interval for development from normal mucosa to adenoma to invasive adenocarcinoma is 5–10 years [3, 4]. Detecting the disease early, therefore, is key to reducing mortality.

As most patients with CRC are asymptomatic or have non-specific symptoms in the early stages, it is vital to find a safe, acceptable, sensitive, specific, and cost-effective test that detects the early stage of the disease [5].

Currently, colonoscopy is the gold-standard diagnostic test to identify colonic pathology [2, 6]. A meta-analysis in 2015 by Brenner et al., showed that colonoscopy is estimated to reduce colorectal cancer incidence by 69% and mortality by 68% [7]. However, this is invasive, has low adherence, and is associated with potential risks to the patient [2]. Its alternative, virtual colonoscopy still requires bowel preparation and can cause discomfort to the patient. The risk of unnecessary radiation especially in the young is also an important disadvantage [8]. Other screening investigations include flexible sigmoidoscopy, fecal occult blood test (FOBT), and fecal immunohistochemistry test (FIT), which is a DNA-based fecal test. Although the FOBT has less than 50% sensitivity for CRC, the FIT has a reported 78% sensitivity and 96% specificity [9]. Aversion to handling stool is an important reason for the low uptake of the test. Only 58% of patients who are sent the FOBT return a sample. In the UK, the introduction of the DNA-based fecal test, which is easier to use, is expected to increase uptake to 75% by 2020, with the challenge of obtaining a sample for testing from stool still remaining [10].

Blood-based markers in current use, such as carcinoembryonic antigen (CEA) and cancer antigen (CA) 19-9, are for surveillance and for monitoring response to treatment but have a low sensitivity and specificity ranging from 40 to 70% and 73 to 90%, respectively, making them unsuitable as screening or diagnostic markers [5, 10] A more recently proposed marker, which is commercially available, is methylated septin 9. This is a molecular-based blood test whose reported accuracy in the literature has been variable. A recent meta-analysis by Yan et al. has shown sensitivity of 76% and specificity 87% making it comparable to the DNA-based fecal test [11].

The purpose of this study is a systematic review of the literature on diagnostic biomarkers in blood or tissue over the last 5 years in colorectal cancer.

Materials and methods

Inclusion criteria

  • Primary human studies and meta-analyses in the last 5 years, which assessed serum or tissue markers’ diagnostic ability in CRC.

Exclusion criteria

  • Studies looking at familial or inherited CRC

  • Studies with less than 100 patients overall

  • Animal or in vitro studies

  • Studies that did not specify sensitivity or specificity of the markers.

Search strategy

An electronic search of PubMed, EMBASE, Cochrane, and ISI Web of Science was performed for the relevant studies between January 2013 and December of 2017 using the following terms: (marker OR biomarker) AND (serum OR blood OR tissue) AND (diagnosis OR screening) AND (colorectal OR colon OR bowel or rectal) AND (cancer OR carcinoma OR neoplasia). There were no language restrictions and duplicates were removed. After reviewing the title and abstract of the studies, as per the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) flowchart, the relevant manuscripts were selected for full-text review (Fig. 1). One additional relevant article was added. A 5-year timeframe was chosen to provide a summary of the recent advances in biomarkers and of the ones that have potential for future use.

Fig. 1
figure 1

PRISMA diagram of systematic review

Data extraction

All original studies and meta-analyses which assessed diagnostic ability of markers were included. Data extracted from each full-text manuscript were as follows: authors, publication year, whether it was serum or plasma or whole blood or tissue, area under the curve, sensitivity, and specificity of detecting primary CRC and if compared to another standard test and cutoff values. If the AUC or sensitivity or specificity was not specifically determined the study was excluded from the analysis.

Study outcomes

The primary outcome was to compare the diagnostic ability of the markers studied in blood or tissue. The secondary outcome was to see whether the marker was compared to other markers. The tertiary outcome was to assess the diagnostic ability of the marker in early versus late CRC.

Results

The literature search over the last 5 years yielded 2844 papers from which 51 studies were eligible for inclusion in the review (Fig. 1).

The markers were divided in broadly four groups: nucleic acids (RNA/DNA/messenger RNA/microRNAs), cytokines, antibodies, and proteins. The nucleic acid category was further subdivided into single microRNA, panel of microRNAs, and a separate group with RNA, DNA, and messenger RNA. They are summarized in Tables 1, 2, 3, 4, 5 and 6. Overall, 29 prospective studies, 14 retrospective studies, and 8 meta-analyses were included.

Table 1 Single miRNA markers
Table 2 miRNA panel markers
Table 3 RNA/DNA/messenger RNA markers
Table 4 Cytokine markers
Table 5 Antibody markers
Table 6 Protein markers

Nucleic acids (Tables 1, 2, 3)

MicroRNAs (Tables 1, 2)

These are small non-coding RNA particles, which regulate gene expression by binding to messenger RNA (mRNA) and affecting protein translation or gene expression. They are thought to act as either tumor suppressor genes or oncogenes. In the literature, they have been investigated singly or in panels to assess their diagnostic and prognostic capabilities. A study done by Imaoka et al. in 2016, reported that mi-1290 showed promise as a diagnostic marker. Mi-1290, is thought to promote epithelial mesenchymal transition (EMT), proliferation and has metastatic potential. The study investigated 324 patients and showed a sensitivity of 70.1% and specificity of 91.2% in detecting CRC. Its sensitivity in detecting adenomas compared to controls was 46.4% and specificity 91.2%, therefore inadequate to be used as a screening test on its own [11].

Table 1 shows studies that investigated a single miRNA in colorectal cancer and Table 2 shows the studies that investigated panels of these markers.

A meta-analysis performed by Zhang et al. for mi-21 showed a sensitivity and specificity of 81% [12] whilst Xu F et al.’s meta-analysis for the same marker in 2015, showed a pooled sensitivity of 75% with specificity of 84%. Among the original studies, the most promising was for mi-1290, which showed an AUC of 0.830, sensitivity of 70.1% and specificity of 91.2% [11].

Assessing different combinations of miRNAs had variable results. Fang et al. investigated a panel of three markers (mi-24, mi-320a, and mi-423-5p) which showed an overall sensitivity of 92.79% and specificity of 70.77% in detecting CRC and also showed high sensitivity and specificity in detecting early cancer [13].

RNA/DNA/messenger RNA (Table 3)

Septin 9, which has been well studied and is now commercially available [Epi proColon 2.0 (Epigenomics), mS9 (Abbott Molecular), ColoVantage (Quest Diagnostics)], has had disputed results. A meta-analysis by Nian et al. published in 2017, included 25 studies, of which only 2 showed a low risk of bias. Twenty-one studies excluded “difficult-to-diagnose” patients and seven studies did not specify thresholds used. The pooled sensitivity was 72% and specificity of 92% which if combined with FIT can increase up to 94% sensitivity, with a decreased specificity of 68% [14]. It also highlighted that the sensitivity for stage I disease was 45% and for polyps was 15% which makes it rather poor for a screening test.

A prospective analysis by Wu et al. investigated long non-coding RNA nuclear-enriched abundant transcript variants 1 and 2 (NEAT_v1 and NEAT_v2). Non-protein coding RNAs are greater than 200 nucleotides and constitute more than 70% of the genome. Non-protein coding RNA nuclear-enriched abundant gene 1 has 2 transcripts: NEAT1_v1 and NEAT1_v2. NEAT1_v2 showed a 70% overall sensitivity and 96% specificity in detecting CRC from controls, although the mean value in early versus late CRC was not significantly different [15]. Further study with larger cohorts and in different types of cancer is required to further validate this marker.

Hao et al., investigated ALU sequences in circulating free DNA, which are the most active sequences in the human genome [16]. ALU115, ALU247/115, and CEA, had a sensitivity of 85.57% and specificity of 97.27% in detecting CRC [16].

SDC2 is an integral membrane protein and is known to participate in cell migration and proliferation of cells. The SDC2 gene is expressed in mesenchymal but not epithelial colonic cells. It is also expressed in pancreatic epithelial cells. SDC2 methylation of DNA in a prospective study by Oh T et al. in 2013 showed sensitivity of detecting early CRC of 92% although this needs further validation in a larger study [17].

Messenger RNA conveys information from DNA to protein products of the genes expressed. The studies looking into mRNA have many flaws in the study design and description of control groups and many do not investigate the sensitivity and specificity of the marker in detecting early CRC. Cyclin E, p27kipl, and ki-67, investigated by Li et al. in a retrospective study, showed sensitivities and specificities around 80% [18]. However, this was only measured on tissue and not correlated to blood markers.

SALL4, a zinc-finger transcription factor, was evaluated in a prospective study by Khales et al. in 2015 and showed sensitivities of 96.1% and specificity of 95% [19]. This transcription factor is also found in other cancers and needs further validation in a larger cohort of patients which include polyps [19].

Cytokines (Table 4)

These are small secreted proteins, which can have autocrine or paracrine effects. Types of cytokines include chemokines (e.g., interleukin-8), lymphokines (e.g., interleukin-6), and interferons. They are released by a variety of cells including macrophages, T cells, B cells, and mast cells, and have been implicated in inflammatory and neoplastic diseases. The most promising was interleukin-8, which showed a sensitivity of 70% and specificity of 91% in detecting CRC in a meta-analysis conducted by Xia WJ et al. in 2015 [20]. Interleukin-8 is a chemokine thought to be involved in cancer progression and promotes angiogenesis, proliferation and migration of the cancer cells [20]. The study included 5 diagnostic studies with 725 participants. They were all high quality studies and if 1 study, by Burger et al., was excluded, the heterogeneity was significantly reduced. Limitations to this study were that it includes a relatively small selection of studies, the cutoffs of the different studies varied and a subgroup analysis could not be performed [20].

Antibodies (Table 5)

There were only three studies in the last 5 years that investigated antibodies, which fulfilled the inclusion criteria. None of the studies showed promising enough results for their use in diagnosis of CRC.

Proteins (Table 6)

In this category, the more promising proteins were trefoil factor (TFF)3 [21], CA11-19 [22], a combination of insulin-like growth factor binding protein 2 (IGFBP2), Dickkopf-3(DKK3), and pyruvate kinase M2 (PKM2) [23] and DC-SIGN/DC-SIGNR [24].

TFF3 belongs to a TFF family, which consists of three stable secretory proteins: TFF1-3. TFF3 is secreted by goblet cells of the intestine and to a lesser extent in the salivary glands, breast, and respiratory tissue. It is thought to promote invasion of cells by acting directly on the cells and indirectly on the vasculature. Results from the study by Li et al. showed a sensitivity of 74.2% and specificity of 94.8%; however, the level of this marker in polyps is not significantly different from that in the CRC cohort, making this less likely to be a useful diagnostic test [21].

CA11-19 is a 701 amino acid glycoprotein which showed very promising results in detecting CRC in a study by Overholt et al [22]. It showed a sensitivity of only 40% in detecting adenomatous polyps, but again a larger study is needed to include more CRC and more patients with polyps. Other limitations for this study were that only one center was included and the authors indicated a larger multi-center study was being planned [22].

Fung KY et al. investigated a combination of IGFBP2DKK3 and PKM2 [23] which have been implicated in proliferation, migration, and angiogenesis of cancer cells. The study showed a sensitivity of 73% and specificity of 95% in detecting CRC [23]. The sensitivity of detecting early stage cancer was moderate with 59% in stage I and 84% in stage II. This again needs further validation in a larger study, which includes patients with polyps.

DC-SIGN and DC-SIGNR are membrane-bound C type lectins. DC-SIGN is found on the surface of dendritic cells in the colon but also in the placenta, cervical mucosa and uterus. DC-SIGNR is found on the endothelial cells in the placenta, liver, and lymph nodes. Jian YM et al. investigated the serum level of DC-SIGN and DC-SIGNR, showing very high sensitivity and specificity in detecting CRC from healthy controls in a 290-patient cohort. The markers (DC-SIGN and DC-SIGNR) were separately analyzed for their sensitivity of detecting early CRC (stage I–III) and this was higher compared to CEA/CA19-9, yet polyps were not investigated [24].

Selection of markers for outcome study

From our review the following markers were found to have a sensitivity ≥ 70% and specificity ≥ 90%: interleukin 8 [25], NEAT_v2, SDC2 methylation of DNA [17], SEPT9 [14, 26], CA11-19 [22] and DC-SIGN/DC-SIGNR [24]. This cutoff sensitivity and specificity were used because they will discriminate markers which show promise in detecting CRC, and are comparable to the currently available biomarkers. The primary, secondary, and tertiary outcomes for these markers are as follows.

Diagnostic ability summary

Interleukin 8 (sensitivity 70%, specificity 91%, AUC 0.92), NEAT_v2 (sensitivity 70%, specificity 96%, AUC 0.871), SDC2 methylation of DNA (sensitivity 87%, specificity 95.2%), SEPT9 (sensitivity 71%, specificity 92%), CA11-19 (sensitivity 98%, specificity 84%), and DC-SIGN/DC-SIGNR (sensitivity 94.8%, specificity 98.7%, AUC 0.9885).

Comparison studies summary

The DC-SIGN/DC-SIGNR was compared to CEA (sensitivity 29.22%) and CA 19-9 (sensitivity 14.67%) and had better sensitivity (98.7%) and specificity (94.8%). SEPT9 in combination with fecal immunochemical test (FIT) improved sensitivity to 94% from 71% but decreased specificity to 68% from 92% in a recent meta-analysis by Nian et al [14]. Compared to FIT alone, Jin and colleagues showed that septin 9 showed a sensitivity and specificity of detecting CRC of 74.8 and 87.4%, respectively, whereas FIT alone had 58% sensitivity and 82.4% specificity [26]. CA 11-19, SDC2 methylation of DNA, NEAT1_v1 and Il-8 were not compared to any other marker.

Early stage detection summary

The tertiary outcome was assessed in SDC-SIGN/SDC-SIGNR, SEPT9, and SDC2 methylation of DNA and NEAT1_v1. NEAT1_v1 showed no difference in the mean value of the marker in early compared to late cancers. For SEPT9, sensitivity of detecting stage I CRC was 45% and 70% for stage II [14]. SDC2 methylation of DNA showed a sensitivity of 92% for detecting stage I CRC [17]. Finally, SDC-SIGN showed sensitivity and specificity of detecting early CRC of 81.33% and 55.56%, respectively [24].

Discussion

Investigation of biomarkers for the diagnosis of CRC can have a significant effect on its prognosis. Ransohoff described the search for a non-invasive biomarker as the “Holy Grail of cancer biomarker research” [27].

The currently used screening tests are either too uncomfortable, costly and potentially hazardous or have a low compliance rate due to patients’ aversion to sampling stool.

This review has highlighted the large numbers of markers being investigated and yet there is a lack of well-designed studies to investigate their use in diagnosis of CRC. There is also a lack of follow-up studies for many markers which have shown promise.

One of the first blood tests brought to clinical use for screening is called ColonSentry™. This measures mRNA in a 7-gene panel (ANXA3, CLEC4D, LMNB1, PRRG4, TNFAIP6, VNN1, and IL2RB) [27]. The sensitivity and specificity for ColonSentry for detecting CRC is 72 and 70%, respectively [27]. Retrospective studies on methylated SEPT9, have reported sensitivities, which range from 52 to 72% and specificities which range from 90 to 95% in detecting CRC [28, 29]. However, methylated SEPT9 has also been studied in a screening population as part of the PRESEPT trial in the USA and Germany, showing overall sensitivity of 48.2% and specificity of 91.5% for detecting CRC [30]. The difference between the PRESEPT and the other studies was that the first investigated the screening capacity of asymptomatic patients rather than symptomatic ones.

The markers we identified as promising included the following: interleukin 8 [25], NEAT_v2, SDC2 methylation of DNA [17], SEPT9 [14, 26], CA11-19 [22] and DC-SIGN/DC-SIGNR [24]. However, none were ideal and many had limitations that need addressing. Meta-analysis had been performed for only two markers (IL-8 and SEPT9) and the first included a small number of studies with variable cutoff values and a subgroup analysis could not be performed. As for the meta-analysis for SEPT9 published by Nian et al. in 2017, only two studies showed a low risk of bias. A total of 25 studies were included and most of them used the 2/3 positive result (known as the 2/3 algorithm) of the Epipro Colon assay. The pooled sensitivity was 71% and specificity 92% for generation 1 Epipro Colon assay and 76% and 94%, respectively, for generation 2 assay. The diagnostic value was highest for stage IV disease with a sensitivity of 79% specificity of 93%. However, 12 of the studies showed that SEPT9 has a sensitivity of 15 and 5% in detecting adenomas and polyps, respectively, and pooled sensitivity for larger size (> 1 cm) polyps or adenomas was 23%, making it a less than ideal screening marker. The studies which combined FIT with SEPT9 showed a higher sensitivity (94%) but lower specificity (68%). A recent retrospective study by Fu and colleagues also found that the 1/3 positive results (known as the 1/3 algorithm) of Epipro Colon Assay 2.0, is more sensitive in detecting early CRC (sensitivity of 69.6% for stage I) although still poor at detecting polyps and adenomas (sensitivity 16.8% from 7.9% using the 2/3 algorithm) and therefore, may be useful as an early cancer screening test [31]. Although SEPT9 shows promise, there is a large heterogeneity in the study results, which may be attributed to many factors including gender, race, age, assay method, and other environmental factors so larger prospective studies are required to further verify its diagnostic potential. SDC-SIGN, SDC-SIGNR, SEPT 9, and SDC2 methylation of DNA also showed a better detection rate in early versus late CRC compared to CEA. However, the sensitivities and specificities were still too low to have any true value in diagnosing CRC.

The rest of the studies did not clearly define their control groups, many including polyps in the control group and others not including them at all. Patient cohorts were mixed between symptomatic and asymptomatic individuals, which affected the results. Furthermore, not all studies assessed their markers’ ability to detect early CRC, an important factor in a diagnostic/screening marker. Variability of cutoff values, method of analysis of the markers, and timing of sample taking can also have a significant impact on the heterogeneity of the results. Inclusion of positive controls is also important as many of these markers, test positive in other cancers or diseases.

The cost-effectiveness of the test is another important consideration. In 2009, Lansdorp-Vogelaar and colleagues concluded that investigations such as endoscopy, FOBT, and FIT were all cost-effective given the high cost of treating late-stage CRC [32]. Ladabaum and colleagues who investigated the cost-effectiveness of SEPT9 as a screening test in Germany found that although it is more cost-effective than no screening at all, it is less cost-effective than FIT [33]. It is important to note that this model was based on a prospective study on asymptomatic patients, which showed lower sensitivity as we have discussed above. These authors have not found any studies investigating the cost-effectiveness of all the other markers identified in this study as promising in diagnosis of CRC. It is important to consider the combination of biomarkers or even combination of blood test and stool-based tests, to increase the accuracy of the test.

Among the limitations of this review is the exclusion of studies that looked at diagnostic markers for patients with adenomas or polyps but no CRC. Moreover, not all studies included all relevant demographic details on patients, whether the tumors were colonic (right versus left-sided) or rectal which can underestimate the diagnostic potential of the markers. Even with studies assessing the same marker, the assay method and cutoff values were not always homogeneous, thus introducing more variability in the results.

Conclusions

The race is still on to discover a sensitive, specific blood-based test for the diagnosis of CRC.