Treatment planning of gastrointestinal stromal tumors (GISTs) includes distinguishing GISTs from other intra-abdominal tumors and GISTs’ molecular analysis. The aim of this study was to evaluate radiomics for distinguishing GISTs from other intra-abdominal tumors, and in GISTs, predict the c-KIT, PDGFRA, BRAF mutational status, and mitotic index (MI). Patients diagnosed at the Erasmus MC between 2004 and 2017, with GIST or non-GIST intra-abdominal tumors and a contrast-enhanced venous-phase CT, were retrospectively included. Tumors were segmented, from which 564 image features were extracted. Prediction models were constructed using a combination of machine learning approaches. The evaluation was performed in a 100 × random-split cross-validation. Model performance was compared to that of three radiologists. One hundred twenty-five GISTs and 122 non-GISTs were included. The GIST vs. non-GIST radiomics model had a mean area under the curve (AUC) of 0.77. Three radiologists had an AUC of 0.69, 0.76, and 0.84, respectively. The radiomics model had an AUC of 0.52 for c-KIT, 0.56 for c-KIT exon 11, and 0.52 for the MI. The numbers of PDGFRA, BRAF, and other c-KIT mutations were too low for analysis. Our radiomics model was able to distinguish GISTs from non-GISTs with a performance similar to three radiologists, but less observer dependent. Therefore, it may aid in the early diagnosis of GIST, facilitating rapid referral to specialized treatment centers. As the model was not able to predict any genetic or molecular features, it cannot aid in treatment planning yet.
Gastrointestinal stromal tumors (GISTs) are rare mesenchymal tumors of the gastrointestinal tract, with an estimated incidence between 10 and 15 cases per million persons per year [1, 2]. The most common tumor locations are the stomach (56%) and the small intestine (32%) . Differentiating GISTs from other intra-abdominal tumors (non-GISTs) is highly important for early diagnosis and treatment planning . Due to the rarity of GISTs, establishing the correct diagnosis can be challenging. Computed tomography (CT) is the imaging modality of choice in GIST diagnosis , but assessment through an invasive tissue biopsy is generally required . A non-invasive and quicker method may aid in the early assessment of GISTs, allowing rapid transfer of such patients to specialized treatment centers.
Treatment planning of GISTs is based on their molecular profile. The mitotic index (MI) reflects the proliferative rate of GISTs, correlates with survival and risk of metastatic spread , and determines the use of adjuvant systemic treatment. Treatment decisions are also based on the GISTs’ mutational status. PDGFRA exon 18 mutated (Asp842Val) GISTs are resistant to imatinib . GISTs with a c-KIT exon 11 mutation have shown a greater sensitivity for imatinib than those with a c-KIT exon 9 mutations . The MI and these genetic mutations are currently assessed through an invasive tissue biopsy. Prediction of such molecular characteristics based on imaging could guide treatment planning while awaiting the results of a final tissue biopsy.
Radiomics relates imaging features to molecular characteristics in order to contribute to diagnosis, prognosis, and treatment decisions. Radiomics has shown promising results in risk stratification of GISTs [8,9,10,11,12,13,14,15,16,17], but has not been used to distinguish GISTs from non-GISTs nor to predict the molecular profile.
The aim of this study was to evaluate whether radiomics based on CT is capable of (1) differentiating GISTs from other intra-abdominal tumors resembling GISTs prior to treatment, i.e., the differential diagnosis and (2) predicting the presence and type of mutation (BRAF, PDGFRA, and c-KIT) and the MI of GISTs, i.e., the molecular analysis, also called “radiogenomics”.
Materials and Methods
Approval by the Erasmus MC institutional review board was obtained (MEC-2017–1187). Patients from our institute between 2004 and 2017 with a histopathologically proven primary GIST or intra-abdominal tumors resembling GIST with at least a contrast-enhanced venous-phase CT prior to treatment [3, 18] were retrospectively included. The cohort of intra-abdominal tumors resembling GISTs was composed of consecutive intra-abdominal benign and malignant spindle cell and epithelioid non-GIST soft tissue tumors . Age at diagnosis, sex, and tumor location (based on radiology reports) were collected. The sample sizes of the non-GIST and the GIST cohort were matched. The non-GIST subtypes were balanced, i.e., a similar number of patients per subtype was randomly included.
GISTs with a known mutation status and/or MI prior to therapy were included in the molecular analysis. Both were obtained from pathology reports. The mutation was categorized as “absent” or “present” for each type (e.g., c-KIT) and subtype (e.g., c-KIT exon 11). The MI (expressed in high power fields (HPF), magnification 40 × , totaling 5mm2), determined on biopsy or excision material, was split into low (≤ 5/50 HPF) and high (> 5/50 HPF) . An adjusted MI was calculated per 50 HPF when the MI was not counted per 50 HPF. As not all of these characteristics may have all been analyzed in all patients, if a specific mutation (e.g., c-KIT) or the MI was not stated in the pathology reports, this was categorized as “missing” and the patient was not included in the related radiomics analysis.
Figure 1 depicts the radiomics workflow. Tumors were manually segmented once by one of two clinicians under the supervision of a musculoskeletal radiologist (5 years of experience) using in-house developed software . A subset of 30 GISTs was segmented by both clinicians, in which inter-observer variability was evaluated through the pairwise Dice similarity coefficient (DSC), with a DSC > 0.70 indicating good agreement . For each lesion, 564 features quantifying intensity, shape, and texture were extracted using the PREDICT  (version 3.1.13) and PyRadiomics  (version 3.0.1) toolboxes (see Supplemental Material 1). The WORC toolbox (version 3.4.0) was used to create a decision model from the features [24,25,26]. In WORC, radiomics is formulated as a modular workflow consisting of multiple components, e.g., feature selection, resampling, and machine learning. For each component, a variety of commonly used algorithms and their associated hyperparameters are included. Using automated machine learning, WORC automatically constructs and optimizes the radiomics workflow to determine which combination of algorithms and hyperparameters maximizes the prediction performance on the training set. The final model consists of an ensemble of the 100 workflows performing best on the training set (for details, see Supplemental Material 2). The code for the feature extraction and model creation has been published open source .
Evaluation of all models was done through a 100 × random-split cross-validation. In each iteration, the data was randomly split into 80% for training and 20% for testing in a stratified manner (see Supplemental Fig. S1). Within the training set, the WORC model optimization was performed using an internal cross-validation (5 ×). Hence, all optimization was done on the training set to eliminate any risk of overfitting on the test set.
Performance was evaluated using the area under the curve (AUC) of the receiver operating characteristic (ROC) curve, balanced classification accuracy (BCA) , sensitivity, and specificity. The positive classes were defined as GIST, the presence of the mutations, and a high MI. The mean performance measures over the 100 cross-validation iterations were computed, and their 95% confidence intervals (CIs) were constructed using the corrected resampled t-test . When plotting ROC curves, confidence bands were constructed using fixed-width bands .
First, to evaluate the predictive value of imaging, a radiomics model based on imaging only was evaluated. To assess the predictive value of volume alone, an additional volume-only model was trained. Second, as radiologists frequently use age, sex, and location in the differential diagnosis, an additional model for the differential diagnosis was created based on radiomics, age, sex, and location.
The differences in feature values and CT acquisition parameters between the GIST and non-GIST cohorts were assessed using a Mann–Whitney U univariate statistical test for continuous variables, and a chi-square test for categorical variables. P-values of the features were corrected for multiple testing using the Bonferroni correction, i.e., multiplying the p-values by the number of tests. Values of p < 0.05 were considered statistically significant. For the statistically significant acquisition parameters, the individual predictive value was assessed using the AUC.
To gain insight into the models, the patients were ranked from typical to atypical for both the GIST and non-GIST groups, based on the consistency of the model predictions. This was determined by the number of times (percentage) that a patient was classified correctly when included in the test set of a cross-validation iteration. Typical examples for each class consisted of the patients who were always classified correctly; atypical vice versa.
The robustness of the radiomics features and model to variations in the segmentations and the CT acquisition protocols was evaluated using the intra-class correlation coefficient (ICC) [31, 32] and ComBat [33, 34] (see Supplemental Material 3).
Performance of the Radiologists
To compare the models with clinical practice, three radiologists (5, 15, and 12 years of experience) independently scored the lesions on a ten-point scale to indicate their certainty of the tumor being a GIST (i.e., 1 = strongly disagree, 10 = strongly agree). The radiologists were blinded for the diagnosis but had access to the CT scan, patient age, and sex. The agreement between radiologists was evaluated using Cohen’s kappa. To enable direct statistical comparison, the radiomics model was evaluated in an additional leave-one-out cross-validation, after which the DeLong test was used to compare the AUCs .
The dataset included 247 patients (125 GISTs, 122 non-GISTs) (see Table 1) and has been publicly released . The dataset of 247 CT scans originated from 66 different scanners, resulting in variation in the acquisition protocols. The scans originated from four different manufacturers (Siemens, Berlin, Germany: 126; Philips, Eindhoven, the Netherlands: 63; General Electric, Boston, United States: 10; Toshiba, Tokyo, Japan: 48). Between the GIST and non-GIST scans, statistically significant differences were found in peak kilovoltage (KVP) (p = 0.025), slice thickness (p = 9.52 × 10−4). Their individual predictive power was however low (AUC of 0.56 for KVP, 0.60 for slice thickness), which is supported by the inter-quartile ranges being the same in GISTs and non-GISTs (KVP: (100.0, 120.0), slice thickness: (3.0, 5.0)). No statistically significant differences were found in manufacturer (p = 0.15), pixel spacing (p = 0.10), or tube current (p = 0.15). On the subset of 30 GISTs that was segmented by both observers, the mean DSC was 0.84 (standard deviation of 0.20), indicating good agreement.
Of the 125 GIST patients, two were not included in the molecular radiomics analysis as the molecular characteristics were obtained after receiving systemic treatment, resulting in 123 GIST patients included in the molecular analysis. The mutation analysis was performed on tissue obtained from the primary lesion, except for three patients where a metastatic hepatic lesion was used. c-KIT mutational analysis was performed in 98/123 (80%) GISTs. One patient had a c-KIT mutation which was not further specified. Twenty-six out of 98 patients (27%) had no c-KIT mutation. The majority of patients had a c-KIT exon 11 mutation (N = 59, 60%). Due to the low numbers of c-KIT exon 9 (N = 10), c-KIT exon 13 (N = 2), PDGFRA (N = 14), and BRAF (N = 0), these mutations were excluded from further analysis.
The MI was analyzed in 90/123 (73%) GISTs (55 low, 35 high). The MI of 33 (37%) GISTs was converted to the adjusted MI. The MI was determined on excision material in 54 (60%) patients, and on biopsy material in 36 (40%) patients, including one patient in which the MI was based on the hepatic GIST metastasis.
The performances of the models distinguishing GISTs from non-GISTs are shown in Table 2; the ROC curves are shown in Fig. 2. The radiomics model, i.e., based on imaging only, had a mean AUC of 0.77. An overview of the selected algorithms and hyperparameters for each cross-validation iteration in this model can be found online . Only using volume did not perform well (AUC of 0.56). Combining radiomics with age, sex, and location yielded an improvement (AUC of 0.84).
The performance of the radiologists is shown in Table 2; their ROC curves are shown in Fig. 2. The three radiologists respectively had a lower (0.69), similar (0.76), and higher (0.84) AUC than the radiomics model. Compared to the model with the same inputs, i.e., based on radiomics, age, sex, and tumor location, the AUCs of the first two radiologists were lower, while the AUC of the third radiologist was similar. Cohen’s kappa measures between the pairs of radiologists were 0.20, 0.31, and 0.33, all indicating poor inter-observer agreement. The DeLong test between the pairs of radiologists indicated a statistically significant difference in performance for radiologists 1 versus 3 (p = 6 × 10−5) and 2 versus 3 (p = 0.01). The radiomics model evaluated in a leave-one-out cross-validation (AUC of 0.82) performed statistically significantly better than the first radiologist (p = 0.0018); for comparison with the other radiologists, the differences were not statistically significant.
Evaluation of Models for the Molecular Analysis
For the c-KIT mutation stratification and MI predictions, the performance of the model based on radiomics, age, and sex is depicted in Table 3. All models had a mean AUC close to guessing (0.50) and focused on the majority class (c-KIT mutation and c-KIT exon 11 mutation: high sensitivity, low specificity; MI vice versa).
As the molecular analysis models did not perform well, the model insight analysis was only conducted for the differential diagnosis. The p-values of the feature importance analysis are shown in Supplemental Table S1. In total, 43 features had significant p-values after Bonferroni correction (1.1 × 10−17 to 4.6 × 10−2). These included the tumor location (1.1 × 10−17), two intensity features, three orientation features, four shape features of which three related to the tumor area, and 33 texture features. A list of these features and their p-values has been added to the mentioned published code . Volume was not found to be significant.
GISTs were ranked from typical to atypical as identified by the radiomics model. Of the 247 patients, 104 tumors (44 GISTs, 60 non-GISTs, 42%) were always classified correctly and were thus considered typical. Twenty-nine tumors (18 GISTs, 11 non-GISTs, 12%) were always classified incorrectly and thus atypical. In Fig. 3, four CT slices of such typical and atypical examples of GISTs are shown. Visual inspection of the tumors on imaging defined as typical or atypical by the radiomics model showed a relation with necrosis (more present in typical GIST, typically a necrotic core) and shape (more compact, circular, and non-lobulated for typical GIST). The tumors which were equally often classified as GIST and non-GIST in the cross-validation iterations were mostly small tumors. The typical imaging characteristics used by the model and the difficulty with small tumors correspond to findings in the literature on GIST risk stratification [14, 37]. Smaller tumors were also more often misclassified by the radiologists in our study.
Only using features with a good (ICC > 0.75, 327/564 features) or excellent (ICC > 0.9, 197/564 features) reliability across the segmentations lowered the performance (AUC of 0.67 for both). Using ComBat to harmonize the features for manufacturer or protocol differences yielded a similar performance as without (AUC of 0.80 and 0.77, respectively). Detailed results for these experiments are shown in Supplemental Table S2.
Radiomics can distinguish GISTs from other intra-abdominal tumors with a performance similar to three radiologists. Radiomics could not predict the presence and subtype of c-KIT mutations or the MI.
Diagnosing GISTs is currently done manually by radiologists and confirmed through a tissue biopsy [4, 38, 39]. The ability to distinguish rare GISTs from non-GISTs on routine CT scans through radiomics could be a quick method for the initial assessment of intra-abdominal tumors. Radiomics could aid quick referral of GIST patients from a peripheral hospital to a center of expertise, shortening time to diagnosis by refining patient selection prior to biopsies, and prevent GISTs from being missed (i.e., false negatives), unnecessary referral, or even treatment for non-GISTs (i.e., false positives). To our knowledge, this is the first study to evaluate the GIST differential diagnosis on many locations through an automated radiomics approach on a large, multi-scanner dataset, and compare the performance of the model with that of the radiologists.
There were significant performance differences between the radiologists, and their agreement was poor, indicating high observer dependence. The advantages of the radiomics model are that it is automatic and observer independent, assuming the segmentation is reproducible as indicated by the high DSC and that it will always give the same prediction on the same image, thereby improving consistency over manual scoring.
In clinical practice, tumor location is highly relevant for distinguishing GISTs from non-GISTs, as GISTs grow typically in the stomach or small intestines . In our study, tumor location was based on radiology reports, which is subjective and occasionally fails to report the true tumor primary origin . Moreover, the tumor location distribution in our dataset may not be a correct representation of the overall population, e.g., only non-GISTs were located in the uterus. Despite the subjectivity of potential bias in tumor location, we added location to the imaging model for a fair comparison with the radiologists. Although this led to a higher AUC, a model based on location, e.g., simply classifying all lesions in the uterus as non-GISTs, is unfeasible and cannot be applied in the general population. The radiomics model rather predicts the likelihood of a lesion being a GIST purely based on the imaging appearance. Further research on location-matched datasets is required to investigate the value of location in the GIST differential diagnosis model.
In the literature, radiomics for risk classification or outcomes such as malignant potential or aggressive behavior for GISTs [8,9,10,11,12,13,14,15,16,17] has mostly been based on criteria such as the Armed Forces Institute of Pathology criteria, modified National Institutes of Health consensus criteria of 2008, and the modified Fletcher classification system [3, 40,41,42,43,44]. These studies illustrate the clinical need for new methods to stratify GISTs and show the potential of radiomics for GISTs. Our first contribution with respect to the existing literature is the focus on the diagnostic trajectory of GISTs, to simplify the diagnostic process of this rare tumor type by predicting the differential diagnosis. Existing studies mainly focus on risk classification, which has a less apparent direct application in clinical practice, and generally first require the GIST differential diagnosis to be applicable [3, 40,41,42,43]. Second, our method determines the optimal radiomics pipeline from a large number of radiomics algorithms and parameters, automatically evaluating a large number of radiomics methods, whereas existing studies typically report the results of a “hand-crafted,” manually optimized radiomics pipeline [8,9,10,11,12,13,14,15,16,17]. Moreover, through an extensive cross-validation scheme, all model optimization was performed on the training dataset, eliminating the risk of overfitting the model on the test set. Lastly, we evaluated the model’s robustness to segmentation and scanner variations.
Our model was not able to distinguish different genetic mutations or the MI of GISTs, which may be attributed to various factors. First, the dataset for the mutation analysis was relatively small (e.g., 90 patients in the MI analysis), which may have been too small for radiomics to learn from. Second, the use of different gene panels for the GIST mutational analysis over the years may have resulted in inaccuracies in the golden standard. Additionally, this might have led to a potential underestimation of mutation prevalence in the current cohort, as newer sequencing techniques use larger gene panels and have a higher sensitivity. Third, other (more complex or deep learning based) radiomics methods may be required to discover more intricate features. Lastly, the negative results may simply suggest that molecular characteristics such as a c-KIT mutation are too subtle to detect solely based on portal venous phase CT imaging characteristics. Other CT phases or modalities (e.g., magnetic resonance imaging) could provide more useful information.
Our study has several limitations. First, there was heterogeneity in the acquisition protocols. There were two acquisition parameters (KVP and slice thickness) with statistically significant differences between GISTs and non-GISTs, but their individual predictive power was low. Hence, although a minor positive bias due to heterogeneity in acquisition protocols cannot be completely ruled out, the predictive performance cannot be attributed to this bias alone. Alternatively, this heterogeneity may have also negatively affected the performance. Nevertheless, the radiomics model achieved a promising performance, similar to three experienced radiologists, suggesting high generalizability. Second, complete histologic data was only available for a subset of the patients. No data regarding the clinical outcome such as survival or recurrence was available for the GISTs. Finally, the current radiomics approach requires manual segmentation. While accurate, this process is also time-consuming and potentially subject to observer variability, although the DSC indicated good agreement. Only using features with a good or excellent reliability across the segmentations lowered the performance. This may indicate that there are features that have a low reliability but a high predictive power, thus resulting in low performance when removing these. Alternatively, it may indicate overfitting of the model to observer-dependent characteristics of the segmentation and thus exploitation of a bias in the segmentations. Automatic segmentation methods may help to overcome this limitation.
Future work should focus on the extension of the dataset, leading to more statistical power, potentially improving the performance as the model has more cases to learn from, and paving the way for more data-driven approaches such as deep learning. Also, this may result in sufficient samples to study the prediction of less common GIST mutations. Next, external validation of our findings on an independent, external dataset is required. Eventually, this may be followed by a prospective clinical trial with harmonized acquisition protocols in which the performance, as well as the cost-effectiveness, is assessed.
Our radiomics model was able to distinguish GIST from non-GIST intra-abdominal tumors based on pre-treatment CT imaging with a performance similar to three experienced radiologists, but is less observer dependent. Our model may therefore aid clinicians early on in the diagnostic chain to ensure rapid transfer of GISTs to specialized centers. The model was not able to predict the c-KIT mutational status and the MI.
Programming code for the method presented in this study is available on Zenodo at https://doi.org/10.5281/zenodo.3839322.
Verschoor AJ, Bovee J, Overbeek LIH, group P, Hogendoorn PCW, Gelderblom H. The incidence, mutational status, risk classification and referral pattern of gastro-intestinal stromal tumours in the Netherlands: a nationwide pathology registry (PALGA) study. Virchows Arch. Feb 2018;472(2):221-229. https://doi.org/10.1007/s00428-017-2285-x
Soreide K, Sandvik OM, Soreide JA, Giljaca V, Jureckova A, Bulusu VR. Global epidemiology of gastrointestinal stromal tumours (GIST): A systematic review of population-based cohort studies. Cancer Epidemiol. Feb 2016;40:39-46. https://doi.org/10.1016/j.canep.2015.10.031
Miettinen M, Lasota J. Gastrointestinal stromal tumors: review on morphology, molecular pathology, prognosis, and differential diagnosis. Arch Pathol Lab Med. Oct 2006;130(10):1466-78. https://doi.org/10.1043/1543-2165(2006)130[1466:GSTROM]2.0.CO;2
Lau S, Tam KF, Kam CK, et al. Imaging of gastrointestinal stromal tumour (GIST). Clin Radiol. Jun 2004;59(6):487-98. https://doi.org/10.1016/j.crad.2003.10.018
Demetri GD, von Mehren M, Antonescu CR, et al. NCCN Task Force report: update on the management of patients with gastrointestinal stromal tumors. J Natl Compr Canc Netw. Apr 2010;8 Suppl 2:S1-41; S42-4. https://doi.org/10.6004/jnccn.2010.0116
Rudolph P, Gloeckner K, Parwaresch R, Harms D, Schmidt D. Immunophenotype, proliferation, DNA ploidy, and biological behavior of gastrointestinal stromal tumors: a multivariate clinicopathologic study. Hum Pathol. Aug 1998;29(8):791-800. https://doi.org/10.1016/S0046-8177(98)90447-6
Cassier PA, Fumagalli E, Rutkowski P, et al. Outcome of patients with platelet-derived growth factor receptor alpha-mutated gastrointestinal stromal tumors in the tyrosine kinase inhibitor era. Clin Cancer Res. Aug 15 2012;18(16):4458-64. https://doi.org/10.1158/1078-0432.CCR-11-3025
Chen T, Ning Z, Xu L, et al. Radiomics nomogram for predicting the malignant potential of gastrointestinal stromal tumours preoperatively. Eur Radiol. Mar 2019;29(3):1074-1082. https://doi.org/10.1007/s00330-018-5629-2
Zhuo T, Li X, Zhou H. Combining Radiomics and CNNs to Classify Benign and Malignant GIST. Advances in Intelligent Systems Research. 2018;147:281-287. https://doi.org/10.2991/ncce-18.2018.44
Feng C, Lu F, Shen Y, et al. Tumor heterogeneity in gastrointestinal stromal tumors of the small bowel: volumetric CT texture analysis as a potential biomarker for risk stratification. Cancer Imaging. Dec 5 2018;18(1):46. https://doi.org/10.1186/s40644-018-0182-4
Xu F, Ma X, Wang Y, et al. CT texture analysis can be a potential tool to differentiate gastrointestinal stromal tumors without KIT exon 11 mutation. Eur J Radiol. Oct 2018;107:90-97. https://doi.org/10.1016/j.ejrad.2018.07.025
Yang L, Dong D, Fang M, et al. Can CT-based radiomics signature predict KRAS/NRAS/BRAF mutations in colorectal cancer? Eur Radiol. May 2018;28(5):2058-2067. https://doi.org/10.1007/s00330-017-5146-8
Ning Z, Luo J, Li Y, et al. Pattern Classification for Gastrointestinal Stromal Tumors by Integration of Radiomics and Deep Convolutional Features. IEEE J Biomed Health Inform. May 2019;23(3):1181-1191. https://doi.org/10.1109/JBHI.2018.2841992
Zhou C, Duan X, Zhang X, Hu H, Wang D, Shen J. Predictive features of CT for risk stratifications in patients with primary gastrointestinal stromal tumour. Eur Radiol. 2016;26(9):3086-3093. https://doi.org/10.1007/s00330-015-4172-7
Ba-Ssalamah A, Muin D, Schernthaner R, et al. Texture-based classification of different gastric tumors at contrast-enhanced CT. Eur J Radiol. Oct 2013;82(10):e537-43. https://doi.org/10.1016/j.ejrad.2013.06.024
Liu S, Pan X, Liu R, et al. Texture analysis of CT images in predicting malignancy risk of gastrointestinal stromal tumours. Clin Radiol. Mar 2018;73(3):266-274. https://doi.org/10.1016/j.crad.2017.09.003
Kurata Y, Hayano K, Ohira G, Narushima K, Aoyagi T, Matsubara H. Fractal analysis of contrast-enhanced CT images for preoperative prediction of malignant potential of gastrointestinal stromal tumor. Abdom Radiol (NY). Oct 2018;43(10):2659-2664. https://doi.org/10.1007/s00261-018-1526-z
Kang HC, Menias CO, Gaballah AH, et al. Beyond the GIST: mesenchymal tumors of the stomach. Radiographics. Oct 2013;33(6):1673-90. https://doi.org/10.1148/rg.336135507
Miettinen M, Lasota J. Gastrointestinal stromal tumors: pathology and prognosis at different sites. Semin Diagn Pathol. May 2006;23(2):70-83. https://doi.org/10.1053/j.semdp.2006.09.001
Starmans MPA, Miclea RL, van der Voort SR, Niessen WJ, Thomeer MG, Klein S. Classification of malignant and benign liver tumors using a radiomics approach. in Medical Imaging 2018: Image Processing, E. D. Angelini and B. A. Landman, Eds., vol. 10574, SPIE-Intl Soc Optical Eng. March 2018;343-349. https://doi.org/10.1117/12.2293609
Zou KH, Warfield SK, Bharatha A, et al. Statistical validation of image segmentation quality based on a spatial overlap index1: scientific reports. Academic Radiology. 2004/02/01/ 2004;11(2):178-189. https://doi.org/10.1016/S1076-6332(03)00671-8
van der Voort SR, Starmans MPA. Predict: a Radiomics Extensive Digital Interchangable Classification Toolkit (PREDICT). Zenodo. Accessed 25-02-2021, https://github.com/Svdvoort/PREDICTFastr. https://doi.org/10.5281/zenodo.3854839
Van Griethuysen JJ, Fedorov A, Parmar C, et al. Computational radiomics system to decode the radiographic phenotype. Cancer research. 2017;77(21):e104-e107. https://doi.org/10.1158/0008-5472.CAN-17-0339
Vos M, Starmans MPA, Timbergen MJM, et al. Radiomics approach to distinguish between well differentiated liposarcomas and lipomas on MRI. British Journal of Surgery. Dec 2019;106(13):1800-1809. https://doi.org/10.1002/bjs.11410
Starmans MPA, van der Voort SR, Phil T, et al. Reproducible radiomics through automated machine learning validated on twelve clinical applications. arxiv preprint. 2021 https://arxiv.org/abs/2108.08618
Starmans MPA, Van der Voort SR, Phil T, Klein S. Workflow for Optimal Radiomics Classification (WORC). Zenodo. Accessed 22-12-2021, https://github.com/MStarmans91/WORC. https://doi.org/10.5281/zenodo.3840534
Tharwat A. Classification assessment methods. Applied Computing and Informatics. 2018/08/21/ 2018; https://doi.org/10.1016/j.aci.2018.08.003
Nadeau C, Bengio Y. Inference for the Generalization Error. Machine Learning. 2003/09/01 2003;52(3):239-281. https://doi.org/10.1023/A:1024068626366
Macskassy SA, Provost F, Rosset S. ROC confidence bands: An empirical evaluation. ACM; 2005:537-544. https://doi.org/10.1145/1102351.1102419
Koo TK, Li MY. A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research. Journal of Chiropractic Medicine. 2016/06/01/ 2016;15(2):155-163. https://doi.org/10.1016/j.jcm.2016.02.012
Traverso A, Wee L, Dekker A, Gillies R. Repeatability and Reproducibility of Radiomic Features: A Systematic Review. International Journal of Radiation Oncology • Biology • Physics. 2018/11/15 2018;102(4):1143-1158. https://doi.org/10.1016/j.ijrobp.2018.05.053
Fortin J-P, Parker D, Tunç B, et al. Harmonization of multi-site diffusion tensor imaging data. NeuroImage. 2017/11/01/ 2017;161:149-170. https://doi.org/10.1016/j.neuroimage.2017.08.047
Orlhac F, Boughdad S, Philippe C, et al. A Postreconstruction Harmonization Method for Multicenter Radiomic Studies in PET. Journal of Nuclear Medicine. 08/2018 2018;59(8):1321-1328. https://doi.org/10.2967/jnumed.117.199935
DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. Sep 1988;44(3):837-45.
Starmans MPA, Miclea RL, Vilgrain V, et al. Automated differentiation of malignant and benign primary solid liver lesions on MRI: An externally validated radiomics model. medRxiv preprint. 2021. https://doi.org/10.1101/2021.08.10.21261827
Maldonado FJ, Sheedy SP, Iyer VR, et al. Reproducible imaging features of biologically aggressive gastrointestinal stromal tumors of the small bowel. Abdominal Radiology. Nov 6 2017;43(7):1567-1574. https://doi.org/10.1007/s00261-017-1370-6
Akahoshi K, Oya M, Koga T, Shiratsuchi YJWjog. Current clinical management of gastrointestinal stromal tumor. 2018;24(26):2806. https://doi.org/10.3748/wjg.v24.i26.2806
Liu M, Liu L, Jin E. Gastric sub-epithelial tumors: identification of gastrointestinal stromal tumors using CT with a practical scoring method. Gastric Cancer. Jul 2019;22(4):769-777. https://doi.org/10.1007/s10120-018-00908-6
Joensuu H. Risk stratification of patients diagnosed with gastrointestinal stromal tumor. Hum Pathol. Oct 2008;39(10):1411-9. https://doi.org/10.1016/j.humpath.2008.06.025
Fletcher CD, Berman JJ, Corless C, et al. Diagnosis of gastrointestinal stromal tumors: A consensus approach. Hum Pathol. May 2002;33(5):459-65. https://doi.org/10.1177/106689690201000201
Jones RL. Practical aspects of risk assessment in gastrointestinal stromal tumors. J Gastrointest Cancer. Sep 2014;45(3):262-7. https://doi.org/10.1007/s12029-014-9615-x
Milliron B, Mittal PK, Camacho JC, Datir A, Moreno CC. Gastrointestinal Stromal Tumors: Imaging Features Before and After Treatment. Curr Probl Diagn Radiol. Jan - Feb 2017;46(1):17-25. https://doi.org/10.1067/j.cpradiol.2015.08.001
Li C, Fu W, Huang L, et al. A CT-based nomogram for predicting the malignant potential of primary gastric gastrointestinal stromal tumors preoperatively. Abdominal Radiology. 2021/03/13 2021; https://doi.org/10.1007/s00261-021-03026-7
Martijn P. A. Starmans acknowledges funding from the research program STRaTeGy (project number 14929-14930), which is (partly) financed by the Netherlands Organisation for Scientific Research (NWO), and EuCanShare and EuCanImage (European Union’s Horizon 2020 research and innovation programme under grant agreements Nr. 825903 and Nr. 952103, respectively). This work was partially carried out on the Dutch national e-infrastructure with the support of SURF Cooperative.
The study was conducted according to the guidelines of the Declaration of Helsinki and was approved by the local Institutional Review Board (or Ethics Committee) of the Erasmus MC (MEC-2017–1187).
Due to the use of retrospective, anonymized data, the need for written informed consent was waived by the Institutional Review Board.
Conflict of Interest
Wiro J. Niessen is the founder, scientific lead and stockholder of Quantib BV. The other authors do not declare any conflicts of interest.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Starmans, M.P.A., Timbergen, M.J.M., Vos, M. et al. Differential Diagnosis and Molecular Stratification of Gastrointestinal Stromal Tumors on CT Images Using a Radiomics Approach. J Digit Imaging 35, 127–136 (2022). https://doi.org/10.1007/s10278-022-00590-2