Systematic review of machine learning-based radiomics approach for predicting microsatellite instability status in colorectal cancer

Wang, Qiang; Xu, Jianhua; Wang, Anrong; Chen, Yi; Wang, Tian; Chen, Danyu; Zhang, Jiaxing; Brismar, Torkel B.

doi:10.1007/s11547-023-01593-x

Systematic review of machine learning-based radiomics approach for predicting microsatellite instability status in colorectal cancer

Abdominal Radiology
Open access
Published: 17 January 2023

Volume 128, pages 136–148, (2023)
Cite this article

Download PDF

You have full access to this open access article

La radiologia medica Aims and scope Submit manuscript

Systematic review of machine learning-based radiomics approach for predicting microsatellite instability status in colorectal cancer

Download PDF

3846 Accesses
11 Citations
1 Altmetric
Explore all metrics

Abstract

This study aimed to systematically summarize the performance of the machine learning-based radiomics models in the prediction of microsatellite instability (MSI) in patients with colorectal cancer (CRC). It was conducted according to the preferred reporting items for a systematic review and meta-analysis of diagnostic test accuracy studies (PRISMA-DTA) guideline and was registered at the PROSPERO website with an identifier CRD42022295787. Systematic literature searching was conducted in databases of PubMed, Embase, Web of Science, and Cochrane Library up to November 10, 2022. Research which applied radiomics analysis on preoperative CT/MRI/PET-CT images for predicting the MSI status in CRC patients with no history of anti-tumor therapies was eligible. The radiomics quality score (RQS) and Quality Assessment of Diagnostic Accuracy Studies 2 (QUADAS-2) were applied to evaluate the research quality (full score 100%). Twelve studies with 4,320 patients were included. All studies were retrospective, and only four had an external validation cohort. The median incidence of MSI was 19% (range 8–34%). The area under the receiver operator curve of the models ranged from 0.78 to 0.96 (median 0.83) in the external validation cohort. The median sensitivity was 0.76 (range 0.32–1.00), and the median specificity was 0.87 (range 0.69–1.00). The median RQS score was 38% (range 14–50%), and half of the studies showed high risk in patient selection as evaluated by QUADAS-2. In conclusion, while radiomics based on pretreatment imaging modalities had a high performance in the prediction of MSI status in CRC, so far it does not appear to be ready for clinical use due to insufficient methodological quality.

Development and validation of a radiomics-based nomogram for the preoperative prediction of microsatellite instability in colorectal cancer

Article Open access 09 May 2022

Non-invasive CT radiomic biomarkers predict microsatellite stability status in colorectal cancer: a multicenter validation study

Article Open access 26 August 2024

Radiomics-based prediction of microsatellite instability in colorectal cancer at initial computed tomography evaluation

Article 27 June 2019

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Colorectal cancer (CRC) ranks as the third most common malignant tumor and the second leading cause of cancer-related death globally [1]. Microsatellite instability (MSI) is a well-established cancer hallmark that is defined as the generalized instability of the short, non-sense, repeat DNA sequences (i.e., microsatellites) due to a deficient repair system of the DNA mismatches at replication. About 13–15% of CRC patients have tumors with MSI [2, 3]. It occurs more often in older patients, in right-sided locations, and has a lower pathological stage, representing a distinct CRC subtype [4].

Clinical decision-making can benefit from the information on pre-treatment MSI status for patients with CRC. Patients with MSI often have better outcomes and are less likely to have lymph node spread and metastasis [2, 5]. Besides, patients with CRC MSI generally do not benefit from preoperative 5-fluorouracil-based adjuvant therapy [6,7,8]. Under this context, MSI testing has been recommended for all patients with stage II rectal patients by the National Comprehensive Cancer Network practice guidelines since 2016 [9]. Furthermore, MSI status can also serve as a predictor for the response to immunotherapy [10, 11]. Previous studies have shown that MSI CRC patients are sensitive to immune checkpoint inhibitors due to the high expression level of mutant neoantigens [12, 13]. Therefore, the European Society for Medical Oncology recommends MSI evaluation before immunotherapy [14] and the US Food and Drug Administration has approved MSI as an indication for cancer immunotherapy [15].

At present, MSI status is mainly evaluated through immunohistochemistry or polymerase chain reaction on specimens obtained by colonoscopy biopsy or surgical resection [2]. However, information about mismatch repair protein express level obtained postoperatively exerts little influence on the pretreatment planning, and the limited samples obtained via biopsy may not thoroughly reflect the intra-tumoral heterogeneity [16]. In some cases, a false negative result may occur (2.1–5.9%) [17]. In addition, biopsy and surgery are also invasive procedures, leaving the patients at risk of procedure-related complications and are not practical for repeated monitoring [18]. A non-invasive, reliable, and cost-effective approach to identifying the MSI status would be of great value.

Imaging modalities, such as computed tomography (CT), magnetic resonance imaging (MRI), and positron emission tomography/CT (PET/CT), are commonly used for the detection, characterization, and staging of CRC. The subtle information underlying these images may reflect the genetic/molecular alterations of CRC, such as MSI [19]. By using modern computing techniques, the imaging information can be mined and converted to quantitative high-dimension data, and the latter can be further exploited for the construction of prediction models via machine learning algorithms—this technique has been coined as “radiomics” [19,20,21,22].In recent years, plenty of studies using the radiomics approach for CRC MSI status prediction have emerged [22]. However, the reported prediction accuracy and efficacy of these radiomics models vary and the overall performance remains unknown. To date, there is not any research summarizing current evidence about radiomics methods for MSI status prediction in CRC patients. Such summaries are of clinical importance for evidence-based patient management. This systematic review was therefore aimed to summarize the current evidence and to provide a summary of the predictive performance of the radiomics models in the diagnosis of MSI in CRC. In addition, the research and reporting quality of these studies were also evaluated.

Materials and methods

This study was conducted according to the Preferred Reporting Items for a Systematic Review and Meta-analysis of Diagnostic Test Accuracy Studies (PRISMA-DTA) guideline [23], and the checklist can be found in Supplementary file 1. The research protocol has been registered at the PROSPERO website (https://www.crd.york.ac.uk/prospero/) under registration No. CRD42022295787.

Literature search

A systematic literature search was performed to detect any potentially relevant publications at four public databases: PubMed, Embase, Web of Science, and Cochrane Library with key terms of “colorectal cancer (CRC)/colon cancer/rectal cancer/colorectal liver metastases (CRLM)”, “microsatellite instability (MSI)/mismatch repair deficient (dMMR)” and “radiomics/texture analysis/radiogenomics/imaging biomarker”, their synonyms, and their Medical Subject Headings terms (detailed search queries are provided in Supplementary file 2). The literature search was first conducted on April 15 2022 and last updated on November 10 2022.

Study selection

Studies meeting the following inclusion and exclusion criteria were regarded as eligible and included in this research. Inclusion criteria: 1) retrospective or prospective design; 2) patients with CRC confirmed by postoperative histopathological examination and no history of anti-tumor therapies (i.e., neoadjuvant chemotherapy or radiation therapy) before imaging examinations; 3) radiomics features extracted from the entire volume of the lesion at CT, MRI or PET/CT examinations and used as a single predictor or one of the variables in a prediction model; 4) MSI status was evaluated on the surgical specimens; 5) publications in English. Exclusion criteria: 1) publications in the form of review, conference abstract, corrigendum, book chapter, or study protocol; 2) research outcomes not involving MSI; 3) deep learning research; 4) sample size of less than 50 patients.

Two researchers ('Q.W' and 'J.X', with 7 and 2 years of experience in preparing and updating systematic reviews, respectively) conducted study selection independently, first by screening the title and abstract and then by reading the full text of the potentially eligible studies. The disagreement was solved by discussion or consultancy with a senior researcher ('T.B.B'). In addition, review and cited references in the included articles were manually identified to detect any eligible research.

Data extraction

A predefined table was applied to extract the study information, which included: 1) basic study characteristics (for example the first author, publish year, country, and study design); 2) patient characteristics; 3) characteristics in radiomics workflow (such as tumor segmentation method, software used for radiomics feature extraction; a typical radiomics research workflow is shown in Fig. 1); 4) diagnostic performance metrics (true positives, false positives, false negatives, and true negatives) to construct a 2 × 2 table. When a study involved training and test cohorts, the diagnostic performance in the test cohort was selected for the model’s prediction power. If several prediction models were developed in one study, the model with the best performance was chosen. If the study did not have a test cohort, the predictive metrics in the validation cohort were extracted. When the provided data on diagnostic performance were insufficient to create a 2 × 2 table, an email was sent to the corresponding author for the missing information. The metrics were visualized as a forest plot to intuitively evaluate the predictive performance of the radiomics prediction models, which was achieved by using the software Review Manager (RevMan, version 5.3. Copenhagen: The Nordic Cochrane Centre, The Cochrane Collaboration, 2014).

The terms “validation” and “test” were unified in this study to avoid any confusion: “validation cohort” was defined as the part of the training cohort which was randomly divided for fine-tuning of super-parameters during modeling, while “test cohort” was defined as a hold-out dataset that was externally separate from the training cohort, not involved in the modeling [24]. The test cohort could be temporally or geographically independent from the training cohort [25]. “External cohort” and “test cohort” will be used interchangeably in this study.

Assessment of radiomics quality score and the risk of bias

The tool used for methodological quality evaluation of the radiomics studies was the radiomics quality score (RQS) scale, which was proposed by Lambin and colleagues in 2017 [20]. The RQS scale consists of 16 items evaluating the research and reporting quality in the workflow of the radiomics model development. Different points are assigned to each item according to the degree the research achieves. The total points for this scoring system are 36, corresponding to 100% in percentage [20].

Research quality was also evaluated by using the Quality Assessment of Diagnostic Accuracy Studies 2 (QUADAS-2) criterion [26]. This tool assesses the risk of bias in a study in four dimensions: patient selection, index test, reference standard, and flow and timing, with results marked as low, high, and unclear risk indicating different levels of risk in each domain [26].

Data extraction and study quality evaluation were performed and cross-validated by the same two researchers ('Q.W' and 'J.X',). In case of a discrepancy occurring, the senior researcher ('T.B.B',) was consulted to reach an agreement.

Results

The initial search yielded 97 records from the four public databases. After the removal of 48 duplicates, 37 ineligible studies, 12 studies were finally included in this systematic review [27,28,29,30,31,32,33,34,35,36,37,38]. Among them, 10 studies with available data were able to construct a 2 × 2 contingency table [27,28,29,30,31, 33,34,35, 37, 38]. Figure 2 describes a PRISMA flowchart of the study selection.

General characteristics and the incidence of MSI

The included studies were published between December 2019 and August 2022, and all studies were retrospectively designed (one study claimed to be prospective, but was judged as retrospective after discussion [32]). A total of 4,320 patients were included, with a sample size ranging from 90 to 837 (median 238) and a male/female ratio of 1.5 (2,592/1,728). Four studies were performed as multicenter research, with a sample size in the external cohorts ranging from 61 to 441 (median 82) [30, 35, 37, 38]. Five studies exclusively focused on rectal cancer, while the others on CRC [29, 32, 36,37,38].

Based on the surgically resected specimens, eleven studies evaluated the MSI status using the immunohistochemistry approach and one using the polymerase chain reaction method [31]. The incidence of MSI ranged from 8 to 34% (median: 19%). Among nine studies with available data, a majority of studies (8/9) reported an interval between imaging examination and surgery of less than 2 weeks [27, 29, 30, 33, 34, 36,37,38]. Table 1 provides detailed information about the basic characteristics of the included studies.

Table 1 Study and patient characteristics

Full size table

RQS and QUADAS-2 assessment

The median RQS score of the included studies was 13.5 points (range 5–18), corresponding to 38% (range 14–50%) of the full RQS score. The highest score of 50% was obtained in only one study [30]. The lowest score of 5 points (14%) was observed in an early study on this topic, and the main points were lost due to a lack of validation cohort [27]. Regarding performance in each item of the RQS, three items were fulfilled by all studies (100%): “feature reduction or adjustment,” “biological correlates,” and “comparison to gold standard.” On the other hand, four items (“phantom study,” “prospective study,” “cost-effectiveness analysis” and “open science and data”) were assigned 0 as none of the included studies involved them. A summary of the RQS score is presented in Fig. 3 A and B, and detailed information on the RQS score for each study is provided in Supplementary file 3.

A majority of the studies showed a low or unclear risk of bias and applicability concerns as evaluated by QUADAS-2 (Fig. 3 C). The main source of the high risk of bias and application concern was the domain of “patient selection” due to the retrospective nature of the studies, and patient selection bias seemed inevitable. Detailed evaluation of the included studies in each domain is provided in Supplementary file 4.

Study characteristics

The study characteristics are described according to the five phases of a radiomics research workflow (Table 2):

Table 2 Characteristics of the radiomics study workflow

Full size table

(1)
Imaging acquisition and tumor segmentation

Among the included studies, seven used CT imaging, four MRI [29, 32, 37, 38], and one PET/CT [33]. Six studies applied images from one phase/sequence [27,28,29, 34,35,36]; the most frequently used phase was the portal venous phase of CT imaging (7/12) [27, 28, 30, 31, 34,35,36]. The tumor was segmented manually in 11 studies and semi-automatically in one [27].
(2)
Imaging preprocessing and feature extraction

Seven studies stated imaging preprocessing before feature extraction [30, 31, 33, 35,36,37,38], but only five of them described their preprocessing techniques (resampling or gray-level discretization) [30, 35,36,37,38]. Pyradiomics was the most frequently used package for feature extraction (5/12) [29, 30, 33, 35, 38], and the number of the extracted radiomics features ranged from 254 to 6,420 (median: 1037).
(3)
Feature selection/dimension reduction

All studies performed dimension reduction to select the most informative features and avoid potential model overfitting. The least absolute shrinkage and selection operator (LASSO) was the researchers’ favorite machine learning tool to reduce redundant features (7/12) [27, 29,30,31, 34, 36, 37], followed by correlation analysis (3/12) [28, 36, 38]. After feature selection, the number of radiomics features was reduced to 11 (range 2-51) to be included in the radiomics model.

In six studies, inter-/intra-observer correlation coefficient analysis was not only used for the assessment of feature reproducibility and stability but also feature selection [29, 30, 34, 36,37,38].
(4)
Model development

Due to the relatively low incidence of MSI, resampling techniques were applied to balance the negative/positive classifications in six studies [27, 30, 32, 33, 35, 37], among which the Synthetic Minority Oversampling Technique was the most frequently used algorithm (4/6) [27, 30, 32, 37]. Logistic regression was the most commonly used classifier for modeling (6/12) [30,31,32, 34, 36, 37]. Cross-validation with 5 or tenfold was applied in six studies (6/12) to avoid model overfitting and to determine the superparameter [27, 29, 30, 34, 37, 38]. Six studies evaluated the predictive value of clinicopathological variables [29,30,31, 34,35,36], in which tumor location and age (both 4/6) were the most frequent, significant indicators for the prediction of MSI status, followed by carcinoembryonic antigen (3/6). All those six studies then combined the studied variables with the calculated radiomics risk score into a compound clinical radiomics model to predict MSI status.
(5)
Model performance

Four studies visualized their models as a nomogram [30, 31, 34, 36], two studies provided the formula [32, 37], and one study used radiomics-based artificial neural network [35]. The area under the receiver operator curve (AUC) of the prediction models ranged from 0.75 to 0.99 (median 0.84) in the training cohort, from 0.74 to 0.93 (median 0.83) in the validation cohort, and from 0.78 to 0.96 (median 0.83) in the test cohort [30, 35, 37, 38]. Among the 10 studies with available metrics data, the median sensitivity was 0.76 (range 0.32–1.00) and the median specificity was 0.87 (range 0.69–1.00) (Fig. 4). In specific, in the radiomics model based on CT or PET/CT, the median sensitivity was 0.79 (range 0.32–1.00) and the median specificity was 0.84 (range 0.69–1.00) [27, 28, 30, 31, 33,34,35,36]. Five studies evaluated the agreement between the model-predicted outcome and the observed outcome by plotting a calibration curve [30,31,32, 34, 36]. Decision curve analysis was performed among five studies to evaluate the clinical usefulness of their models [30,31,32, 34, 37].
Fig. 4
Performance metrics and forest plot of the sensitivity and specificity of the radiomics models in the prediction of microsatellite instability in patients with colorectal cancer. CI, confidence interval; CT, computed tomography; FN, false negative; FP, false positive; MRI, magnetic resonance imaging; PET/CT, positron emission tomography/CT; TN, true negative; TP, true positive. # data from the test cohort (i.e., the independent external cohort); † data from the validation cohort; ‡ data from the training cohort. Note that meta-analysis was not performed to synthesize the performance metrics due to the study heterogeneity
Full size image

Discussion

This systematic review showed that radiomics models using the machine learning approach on pretreatment imaging modalities had a high predictive efficacy, with a median AUC of 0.83, a median sensitivity of 0.76, and a specificity of 0.87. Despite these promising results, the radiomics model is still far away from clinical utility due to the insufficient methodological quality as reflected by the low RQS score.

The translation of these prediction models into clinical routine settings is mainly determined by the study’s validity. Ideally, a reliable radiomics signature can be developed from a prospective, large sample cohort with a study population consecutively enrolled. Although none of the included studies was prospectively designed, the largest sample size was as high as 837 and almost half of the studies (5/12) had a sample size of over 490. The median incidence of MSI in the included studies was 19%, which was a little higher than the reported incidence (13–15%) [3, 39,40,41,42]. Two studies that did not state whether the subjects were consecutively included or not had an MSI incidence as high as 33% and 34% [32, 38]. That might be due to their case–control study design (1:2). In diagnostic test studies, this type of study design is prone to overestimate the performance of the prediction model and should be avoided as it cannot reflect the real-world situation [26]. One may argue that when performing machine learning algorithms, the positive and negative classifications of a cohort should be balanced to avoid potential overfitting. In fact, several techniques have been proposed to deal with this situation, such as the Synthetic Minority Oversampling Technique [43, 44]. Half of the reviewed studies adopted techniques to cope with the imbalanced classifications [28, 32, 35, 36, 38].

Before translating the radiomics models into clinical implementation, it is also vital to verify the model in an external cohort [25]. Given that the model developed in the training cohort tends to be overfitting, the external cohort can be used to evaluate the generalization of a prediction model and provide a real performance of the model in real-world practice [45]. One-third of the studies (4/12) tested their models in an external cohort, yielding a median AUC of 0.83 [32, 34, 38]. On the other hand, internal validation using cross-validation or bootstrapping techniques within the training cohort plays an equivalent role to avoid potential overfitting and to optimize the prediction model [45,46,47]. Six studies adopted five-/tenfold cross-validation when developing their models.

Researchers should also make their prediction model reproducible and validated by other investigators. The first step could be to deposit the radiomics codes/data at a public platform (such as https://github.com) or to provide more details on software usage. However, none of the included research published their code or data, resulting in a zero score for the “open science and data” item in the RQS scale. Besides, the models should also be presented in a proper and easy-to-use form for clinical usage, for example, present as a nomogram. Six studies provided the formula and/or nomogram, which forwarded one step for their models validated by other centers. Furthermore, the determination of the optimal cutoff value of the prediction model is often a trade-off between sensitivity and specificity. Its important role has been emphasized by a specific item in the RQS score. The knowledge of the specific cutoff value of a model makes it possible for other researchers to validate the model. However, only two studies stated the cutoff values of their models [32, 35]. When implementing the prediction model, researchers should also be aware of the target patient population or subpopulation. The patients in the included studies had different indications, where some studies merely focused on rectal cancer or CRC stage II/III, while others were on the general CRC population.

RQS is a commonly used tool for the appraisal of radiomics research quality [20]. As it evaluates the key steps in the radiomics research workflow, RQS has the potential to become not only a guide when performing the radiomics study, but also a useful checklist when submitting their manuscript to a journal. The included studies fulfilled well in three domains of the RQS scale, accounting for 17% of the full score (6 points). Besides, more than half of the studies (7/12) reported both a discriminative performance and a resampling technique in the item of “discrimination statistics”, earning an average of 1.6 points for this item. However, the included studies in this review only yielded a median score of 13.5 points (corresponding to 38% of the full score of 36) and the highest score of 18 points (50% of the full score). The main reason was that four domains in the RQS scale were not in response by any of the included studies, for example, to make their code/data public. These four domains account for 39% of the full scale (14 points). However, the RQS scale may assign a too-high weight to the item “prospective study” (7 points), which is approximately equal to 20% of the full score. This is a relatively high score given that most other items in the RQS tool often have a maximum of 1–2 point(s). However, no prospective studies were included in this systematic review, which further contributed to a lower RQS score in the included studies.

On the other hand, other appraisal tools, such as QUADAS-2, which was designed for the appraisal of the general diagnostic test studies, should also be adopted to complement the RQS tool in the assessment of radiomics research quality. For instance, the RQS scale does not involve patient selection, but this issue is of clinical importance when evaluating a diagnostic test study. In the QUADAS-2, patient selection is one of the four main constituent dimensions. Besides, other commonly used guidelines, such as the “checklist for artificial intelligence in medical imaging” (CLAIM) [48] and the “transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD)” statement [25], may also be beneficial to conduct a rigorous and reproducible radiomics study and to improve the research and reporting quality.

There are some limitations in this study. First, the number of included studies was relatively limited, no study was prospectively designed, and only four studies validated their models in external cohorts. These limitations may undermine the conclusion drawn from our study. On the other hand, the limited number of studies, as shown by the initial records retrieved from the four databases, also reflects that this topic (using radiomics approach for predicting gene expression levels in CRC) is relatively novel and the research is still at its early stage. Second, the included studies were heterogeneous not only in the imaging modalities and phase/sequence used but also in the imaging features and modeling strategies. In this context, a meta-analysis to synthesize the diagnostic metrics was not performed and a pooled AUC for the radiomics model in the prediction of MSI status was therefore absent. Third, deep learning studies were not included due to the poor interpretability of deep learning-derived imaging features. This is also a burgeoning field where the deep learning model is often assumed to have higher accuracy than the radiomics models [49]. Lastly, although RQS is a useful tool in the assessment of radiomics research quality, it has limitations. Further revision of RQS might make it more comprehensive in the quality appraisal of the radiomics studies.

Conclusions

In conclusion, despite radiomics models derived from pretreatment imaging modalities having a high performance in the prediction of MSI status in CRC patients, radiomics does not seem to be ready to serve as an imaging biomarker utilized in clinical practice due to the insufficient methodological quality of the research.

Availability of data and materials

The datasets supporting the conclusions of this article are included within the article and its supplementary files.

Abbreviations

AUC:: Area under the receiver operator curve
CI:: Confidence interval
CRC:: Colorectal cancer
CT:: Computed tomography
MRI:: Magnetic resonance imaging
MSI:: Microsatellite instability
PET/CT:: Positron emission tomography/CT
QUADAS-2:: Quality assessment of diagnostic accuracy studies 2
RQS:: Radiomics quality score

References

Siegel RL, Miller KD (2021) Cancer statistics, 2021. CA Cancer J Clin 71:7–33. https://doi.org/10.3322/caac.21654
Article PubMed Google Scholar
Vilar E, Gruber SB (2010) Microsatellite instability in colorectal cancer-the stable evidence. Nat Rev Clin Oncol 7:153–162. https://doi.org/10.1038/nrclinonc.2009.237
Article CAS PubMed PubMed Central Google Scholar
Lorenzi M, Amonkar M, Zhang J, Mehta S, Liaw K-L (2020) Epidemiology of microsatellite instability high (MSI-H) and deficient mismatch repair (dMMR) in solid tumors: a structured literature review. J Oncol 2020:1–17. https://doi.org/10.1155/2020/1807929
Article CAS Google Scholar
Park SY, Lee HS, Choe G, Chung JH, Kim WH (2006) Clinicopathological characteristics, microsatellite instability, and expression of mucin core proteins and p53 in colorectal mucinous adenocarcinomas in relation to location. Virchows Arch 449:40–47. https://doi.org/10.1007/s00428-006-0212-7
Article CAS PubMed Google Scholar
Merok MA, Ahlquist T, Røyrvik EC, Tufteland KF, Hektoen M, Sjo OH et al (2013) Microsatellite instability has a positive prognostic impact on stage II colorectal cancer after complete resection: results from a large, consecutive Norwegian series. Ann Oncol 24:1274–1282. https://doi.org/10.1093/annonc/mds614
Article CAS PubMed Google Scholar
Fischer F, Baerenfaller K, Jiricny J (2007) 5-fluorouracil is efficiently removed from DNA by the base excision and mismatch repair systems. Gastroenterology 133:1858–1868. https://doi.org/10.1053/j.gastro.2007.09.003
Article CAS PubMed Google Scholar
Popat S, Hubner R, Houlston RS (2005) Systematic review of microsatellite instability and colorectal cancer prognosis. J Clin Oncol 23:609–618. https://doi.org/10.1200/JCO.2005.01.086
Article CAS PubMed Google Scholar
Copija A, Waniczek D, Witkoś A, Walkiewicz K, Nowakowska-Zajdel E (2017) Clinical significance and prognostic relevance of microsatellite instability in sporadic colorectal cancer patients. Int J Mol Sci. https://doi.org/10.3390/ijms18010107
Article PubMed PubMed Central Google Scholar
Eriksson J, Amonkar M, Al-Jassar G, Lambert J, Malmenäs M, Chase M et al (2019) Mismatch repair/microsatellite instability testing practices among US physicians treating patients with advanced/metastatic colorectal cancer. J Clin Med 8:558. https://doi.org/10.3390/jcm8040558
Article PubMed PubMed Central Google Scholar
Cohen R, Rousseau B, Vidal J, Colle R, Diaz LA Jr, André T (2020) Immune checkpoint inhibition in colorectal cancer: microsatellite instability and beyond. Target Oncol 15:11–24. https://doi.org/10.1007/s11523-019-00690-0
Article PubMed Google Scholar
Wang F, Wang ZX, Chen G, Luo HY, Zhang DS, Qiu MZ et al (2020) Expert opinions on immunotherapy for patients with colorectal cancer. Cancer Commun (Lond) 40:467–472. https://doi.org/10.1002/cac2.12095
Article PubMed Google Scholar
Chalabi M, Fanchi LF, Dijkstra KK, Van den Berg JG, Aalbers AG, Sikorska K et al (2020) Neoadjuvant immunotherapy leads to pathological responses in MMR-proficient and MMR-deficient early-stage colon cancers. Nat Med 26:566–576. https://doi.org/10.1038/s41591-020-0805-8
Article CAS PubMed Google Scholar
Mandal R, Samstein RM, Lee KW, Havel JJ, Wang H, Krishna C et al (2019) Genetic diversity of tumors with mismatch repair deficiency influences anti-PD-1 immunotherapy response. Science 364:485–491. https://doi.org/10.1126/science.aau0447
Article CAS PubMed PubMed Central Google Scholar
Luchini C, Bibeau F, Ligtenberg MJL, Singh N, Nottegar A, Bosse T et al (2019) ESMO recommendations on microsatellite instability testing for immunotherapy in cancer, and its relationship with PD-1/PD-L1 expression and tumour mutational burden: a systematic review-based approach. Ann Oncol 30:1232–1243. https://doi.org/10.1093/annonc/mdz116
Article CAS PubMed Google Scholar
Diao Z, Han Y, Chen Y, Zhang R, Li J (2021) The clinical utility of microsatellite instability in colorectal cancer. Crit Rev Oncol Hematol 157:103171. https://doi.org/10.1016/j.critrevonc.2020.103171
Article PubMed Google Scholar
Kwon M, An M, Klempner SJ (2021) Determinants of response and intrinsic resistance to PD-1 blockade in microsatellite instability-high gastric cancer. Cancer Discov 11:2168–2185. https://doi.org/10.1158/2159-8290.cd-21-0219
Article CAS PubMed Google Scholar
Than M, Witherspoon J, Shami J, Patil P, Saklani A (2015) Diagnostic miss rate for colorectal cancer: an audit. Ann Gastroenterol 28:94–98
PubMed PubMed Central Google Scholar
Saraste D, Martling A, Nilsson PJ, Blom J, Törnberg S, Hultcrantz R et al (2016) Complications after colonoscopy and surgery in a population-based colorectal cancer screening programme. J Med Screen 23:135–140. https://doi.org/10.1177/0969141315625701
Article PubMed Google Scholar
Tomaszewski MR, Gillies RJ (2021) The biological meaning of radiomic features. Radiology 298:505–516. https://doi.org/10.1148/radiol.2021202553
Article PubMed Google Scholar
Lambin P, Leijenaar RTH, Deist TM, Peerlings J, de Jong EEC, van Timmeren J et al (2017) Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol 14:749–762. https://doi.org/10.1038/nrclinonc.2017.141
Article PubMed Google Scholar
Gillies RJ, Kinahan PE, Hricak H (2016) Radiomics: images are more than pictures. Data Radiol 278:563–577. https://doi.org/10.1148/radiol.2015151169
Article Google Scholar
Badic B, Tixier F, Cheze Le Rest C, Hatt M, Visvikis D (2021) Radiogenomics in colorectal cancer. Cancers (Basel). https://doi.org/10.3390/cancers13050973
Article PubMed Google Scholar
McInnes MDF, Moher D, Thombs BD, McGrath TA, Bossuyt PM, Clifford T et al (2018) Preferred reporting items for a systematic review and meta-analysis of diagnostic test accuracy studies: the PRISMA-DTA statement. JAMA 319:388–396. https://doi.org/10.1001/jama.2017.19163
Article PubMed Google Scholar
Bluemke DA, Moy L (2020) Assessing radiology research on artificial intelligence: a brief guide for authors, reviewers, and readers-from the radiology editorial board. Radiology 294:487–489. https://doi.org/10.1148/radiol.2019192515
Article PubMed Google Scholar
Collins GS, Reitsma JB, Altman DG, Moons KG (2015) Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMC Med 13:1. https://doi.org/10.1186/s12916-014-0241-z
Article PubMed PubMed Central Google Scholar
Whiting PF, Rutjes AW, Westwood ME, Mallett S, Deeks JJ, Reitsma JB et al (2011) QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med 155:529–536. https://doi.org/10.7326/0003-4819-155-8-201110180-00009
Article PubMed Google Scholar
Fan S, Li X, Cui X, Zheng L, Ren X, Ma W et al (2019) Computed tomography-based radiomic features could potentially predict microsatellite instability status in stage II colorectal cancer: a preliminary study. Acad Radiol 26:1633–1640. https://doi.org/10.1007/s00261-019-02117-w10.1016/j.acra.2019.02.009
Article PubMed Google Scholar
Golia Pernicka JS, Gagniere J, Chakraborty J, Yamashita R, Nardo L, Creasy JM et al (2019) Radiomics-based prediction of microsatellite instability in colorectal cancer at initial computed tomography evaluation. Abdom Radiol (NY) 44:3755–3763. https://doi.org/10.3389/fonc.2019.0125010.1007/s00261-019-02117-w
Article PubMed Google Scholar
Zhang W, Huang Z, Zhao J, He D, Li M, Yin H et al (2021) Development and validation of magnetic resonance imaging-based radiomics models for preoperative prediction of microsatellite instability in rectal cancer. Ann Transl Med 9:134. https://doi.org/10.3389/fonc.2021.64493310.21037/atm-20-7673
Article CAS PubMed PubMed Central Google Scholar
Cao Y, Zhang G, Zhang J, Yang Y, Ren J, Yan X et al (2021) Predicting microsatellite instability status in colorectal cancer based on triphasic enhanced computed tomography radiomics signatures: a multicenter study. Front Oncol. https://doi.org/10.3389/fonc.2021.687771
Article PubMed PubMed Central Google Scholar
Pei Q, Yi X (2022) Pre-treatment CT-based radiomics nomogram for predicting microsatellite instability status in colorectal cancer. Eur Radiol 32:714–724. https://doi.org/10.3389/fonc.2021.66678610.1007/s00330-021-08167-3
Article CAS PubMed Google Scholar
Li Z, Dai H, Liu Y, Pan F, Yang Y, Zhang M (2021) Radiomics analysis of multi-sequence MR images for predicting microsatellite instability status preoperatively in rectal cancer. Front Oncol 11:697497. https://doi.org/10.3389/fonc.2021.70205510.3389/fonc.2021.697497
Article PubMed PubMed Central Google Scholar
Li J, Yang Z, Xin B, Hao Y, Wang L, Song S et al (2021) Quantitative prediction of microsatellite instability in colorectal cancer with preoperative PET/CT-based radiomics. Front Oncol 11:702055. https://doi.org/10.3389/fonc.2021.78163610.3389/fonc.2021.702055
Article PubMed PubMed Central Google Scholar
Ying M, Pan J, Lu G, Zhou S, Fu J, Wang Q et al (2022) Development and validation of a radiomics-based nomogram for the preoperative prediction of microsatellite instability in colorectal cancer. Eur Radiol 22:524. https://doi.org/10.1007/s00330-022-08954-610.1186/s12885-022-09584-3
Article Google Scholar
Chen X, He L, Li Q, Liu L, Li S, Zhang Y et al (2022) Non-invasive prediction of microsatellite instability in colorectal cancer by a genetic algorithm–enhanced artificial neural network–based CT radiomics signature. Eur Radiol. https://doi.org/10.1007/s00330-022-08954-6
Article PubMed PubMed Central Google Scholar
Yuan H, Peng Y, Xu X, Tu S, Wei Y, Ma Y (2022) A tumoral and peritumoral CT-based radiomics and machine learning approach to predict the microsatellite instability of rectal carcinoma. Cancer Manag Res 14:2409–2418. https://doi.org/10.2147/cmar.s377138
Article PubMed PubMed Central Google Scholar
Jing G, Chen Y, Ma X, Li Z, Lu H, Xia Y et al (2022) Predicting mismatch-repair status in rectal cancer using multiparametric MRI-based radiomics models: a preliminary study. Tomography 2022:6623574. https://doi.org/10.3390/tomography805018410.1155/2022/6623574
Article Google Scholar
Li Z, Zhang J, Zhong Q, Feng Z, Shi YS, Xu LG et al (2022) Development and external validation of a multiparametric MRI-based radiomics model for preoperative prediction of microsatellite instability status in rectal cancer: a retrospective multicenter study. Eur Radiol. https://doi.org/10.1007/s00330-022-09160-0
Article PubMed PubMed Central Google Scholar
Gelsomino F, Barbolini M, Spallanzani A, Pugliese G, Cascinu S (2016) The evolving role of microsatellite instability in colorectal cancer: a review. Cancer Treat Rev 51:19–26. https://doi.org/10.1016/j.ctrv.2016.10.005
Article CAS PubMed Google Scholar
Yamashita R, Long J, Longacre T, Peng L, Berry G, Martin B et al (2021) Deep learning model for the prediction of microsatellite instability in colorectal cancer: a diagnostic study. Lancet Oncol 22:132–141. https://doi.org/10.1016/s1470-2045(20)30535-0
Article PubMed Google Scholar
Boland CR, Goel A (2010) Microsatellite instability in colorectal cancer. Gastroenterology 138:2073–87.e3. https://doi.org/10.1053/j.gastro.2009.12.064
Article CAS PubMed Google Scholar
Kawakami H, Zaanan A, Sinicrope FA (2015) Microsatellite instability testing and its role in the management of colorectal cancer. Curr Treat Options Oncol 16:30. https://doi.org/10.1007/s11864-015-0348-2
Article PubMed PubMed Central Google Scholar
Blagus R, Lusa L (2013) SMOTE for high-dimensional class-imbalanced data. BMC Bioinformatics 14:106. https://doi.org/10.1186/1471-2105-14-106
Article PubMed PubMed Central Google Scholar
Koçak B, Durmaz E, Ateş E, Kılıçkesmez Ö (2019) Radiomics with artificial intelligence: a practical guide for beginners. Diagn Interv Radiol 25:485–495. https://doi.org/10.5152/dir.2019.19321
Article PubMed PubMed Central Google Scholar
Chen Q, Zhang L, Mo X, You J, Chen L, Fang J et al (2021) Current status and quality of radiomic studies for predicting immunotherapy response and outcome in patients with non-small cell lung cancer: a systematic review and meta-analysis. Eur J Nucl Med Mol Imaging 49:345–360. https://doi.org/10.1007/s00259-021-05509-7
Article PubMed Google Scholar
Ubaldi L, Valenti V, Borgese RF, Collura G, Fantacci ME, Ferrera G et al (2021) Strategies to develop radiomics and machine learning models for lung cancer stage and histology prediction using small data samples. Phys Med 90:13–22. https://doi.org/10.1016/j.ejmp.2021.08.015
Article CAS PubMed Google Scholar
Avanzo M, Wei L, Stancanello J, Vallieres M, Rao A, Morin O et al (2020) Machine and deep learning methods for radiomics. Med Phys 47:e185–e202. https://doi.org/10.1002/mp.13678
Article PubMed Google Scholar
Mongan J, Moy L, Kahn CE (2020) Checklist for artificial intelligence in medical imaging (CLAIM): a guide for authors and reviewers. Radiol Artif Intell 2:e200029. https://doi.org/10.1148/ryai.2020200029
Article PubMed PubMed Central Google Scholar
Esteva A, Robicquet A, Ramsundar B, Kuleshov V, DePristo M, Chou K et al (2019) A guide to deep learning in healthcare. Nat Med 25:24–29. https://doi.org/10.1038/s41591-018-0316-z
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The authors would like to thank Professor Jennifer S. Golia Pernicka and her team for supplying additional information to make a 2 × 2 contingency table complete.

Funding

Open access funding provided by Karolinska Institute. Qiang Wang receives a scholarship from the China Scholarship Council (CSC) (No. 201907930009).

Author information

Q.W and J.X These authors contributed equally to this work.

Authors and Affiliations

Division of Medical Imaging and Technology, Department of Clinical Science, Intervention and Technology (CLINTEC), Karolinska Institutet, Stockholm, Sweden
Qiang Wang & Torkel B. Brismar
Department of Radiology, Karolinska University Hospital Huddinge, Room 601, Novum PI 6, Hiss F, Hälsovägen 7, 141 86, Huddinge, Stockholm, Sweden
Qiang Wang & Torkel B. Brismar
Department of General Surgery, Songshan Hospital, Chongqing, China
Jianhua Xu
Department of Vascular Surgery, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China
Anrong Wang
Department of Interventional Therapy, People’s Hospital of Dianjiang County, Chongqing, China
Anrong Wang
Department of Oncology-Pathology, Karolinska Institutet, Stockholm, Sweden
Yi Chen
Department of Gastroenterology, Chongqing General Hospital, Chongqing, China
Tian Wang
Department of Gastroenterology and Hepatology, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou, China
Danyu Chen
Department of Pharmacy, Guizhou Provincial People’s Hospital, Guiyang, China
Jiaxing Zhang

Authors

Qiang Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jianhua Xu
View author publications
You can also search for this author in PubMed Google Scholar
Anrong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yi Chen
View author publications
You can also search for this author in PubMed Google Scholar
Tian Wang
View author publications
You can also search for this author in PubMed Google Scholar
Danyu Chen
View author publications
You can also search for this author in PubMed Google Scholar
Jiaxing Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Torkel B. Brismar
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization was performed by Q.W. and T.B.B; methodology by Q.W and J.Z.; software by Q.W.; validation by J.X., A.W. and J.Z.; formal analysis by Q.W., J.X., T.W. and D.C; investigation by Q.W. and A.W.; data curation by T.W. and D.C; writing—original draft preparation—by Q.W. and J.X; writing—review and editing—by T.B.B; visualization by Q.W. and Y.C.; supervision by T.B.B; project administration by Q.W.; funding acquisition by Q.W. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Qiang Wang.

Ethics declarations

Conflict of Interest

The authors declare no conflict of interest.

Informed Consent

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOC 66 KB)

Supplementary file2 (DOCX 20 KB)

Supplementary file3 (DOCX 16 KB)

Supplementary file4 (DOCX 15 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, Q., Xu, J., Wang, A. et al. Systematic review of machine learning-based radiomics approach for predicting microsatellite instability status in colorectal cancer. Radiol med 128, 136–148 (2023). https://doi.org/10.1007/s11547-023-01593-x

Download citation

Received: 18 November 2022
Accepted: 04 January 2023
Published: 17 January 2023
Issue Date: February 2023
DOI: https://doi.org/10.1007/s11547-023-01593-x

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Systematic review of machine learning-based radiomics approach for predicting microsatellite instability status in colorectal cancer

Abstract

Similar content being viewed by others

Development and validation of a radiomics-based nomogram for the preoperative prediction of microsatellite instability in colorectal cancer

Non-invasive CT radiomic biomarkers predict microsatellite stability status in colorectal cancer: a multicenter validation study

Radiomics-based prediction of microsatellite instability in colorectal cancer at initial computed tomography evaluation

Introduction

Materials and methods

Literature search

Study selection

Data extraction

Assessment of radiomics quality score and the risk of bias

Results

General characteristics and the incidence of MSI

RQS and QUADAS-2 assessment

Study characteristics

Discussion

Conclusions

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of Interest

Informed Consent

Additional information

Publisher's Note

Supplementary Information

Supplementary file1 (DOC 66 KB)

Supplementary file2 (DOCX 20 KB)

Supplementary file3 (DOCX 16 KB)

Supplementary file4 (DOCX 15 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation