Abstract
We hypothesized that imaging-only-based machine learning algorithms can analyze non-enhanced CT scans of patients with acute intracerebral hemorrhage (ICH). This retrospective multicenter cohort study analyzed 520 non-enhanced CT scans and clinical data of patients with acute spontaneous ICH. Clinical outcome at hospital discharge was dichotomized into good outcome and poor outcome using different modified Rankin Scale (mRS) cut-off values. Predictive performance of a random forest machine learning approach based on filter- and texture-derived high-end image features was evaluated for differentiation of functional outcome at mRS 2, 3, and 4. Prediction of survival (mRS ≤ 5) was compared to results of the ICH Score. All models were tuned, validated, and tested in a nested 5-fold cross-validation approach. Receiver-operating-characteristic area under the curve (ROC AUC) of the machine learning classifier using image features only was 0.80 (95% CI [0.77; 0.82]) for predicting mRS ≤ 2, 0.80 (95% CI [0.78; 0.81]) for mRS ≤ 3, and 0.79 (95% CI [0.77; 0.80]) for mRS ≤ 4. Trained on survival prediction (mRS ≤ 5), the classifier reached an AUC of 0.80 (95% CI [0.78; 0.82]) which was equivalent to results of the ICH Score. If combined, the integrated model showed a significantly higher AUC of 0.84 (95% CI [0.83; 0.86], P value <0.05). Accordingly, sensitivities were significantly higher at Youden Index maximum cut-offs (77% vs. 74% sensitivity at 76% specificity, P value <0.05). Machine learning–based evaluation of quantitative high-end image features provided the same discriminatory power in predicting functional outcome as multidimensional clinical scoring systems. The integration of conventional scores and image features had synergistic effects with a statistically significant increase in AUC.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Intracerebral hemorrhage (ICH) is the most severe form of stroke with a 1-month morbidity and mortality approaching 50% and death or severe disability exceeding 75% [1,2,3]. In contrast to recent advances in interventional treatments of patients with ischemic stroke, beneficial effects of medical treatment and surgical intervention on the mortality and functional outcome of ICH patients were not observed in recent trials [4, 5]. Accurate stratification of ICH prognosis is highly desired regardless of the therapeutic options that are available and remains a clinical research priority [6]. Therefore, several prognostic tools have been proposed for the prediction of mortality and functional outcome in spontaneous ICH [7]. Though potentially useful for ascertaining prognosis and facilitating communication between clinicians, numerous methodological and reporting deficiencies are reported for a majority of these tools [7]. There is growing interest in augmented diagnostic and prognostic vision with machine learning (ML) in the medical field due to the wide range of applications of these algorithms and the increasing availability of computational power. ML is a type of artificial intelligence that learns patterns and rules from given information [8]. Recent studies applied ML to severity and outcome prediction models for neurological disorders such as ischemic stroke [8], aneurysmal subarachnoid hemorrhage [9], and traumatic brain injury [10]. However, ML approaches in the field of ICH were mainly focused on prompt diagnosis and automated volume quantification [11, 12] with lacking algorithms for the prediction of clinical outcome. As of late, Wang et al. have been among the first to develop an outcome prediction model based on ML by incorporating initial clinical presentations, laboratory data, and imaging findings [13]. Imaging findings were limited to ICH volume and location, presence of intraventricular hemorrhage, ventricle compression, and midline structure shift [13]. Further integration of quantitative imaging characteristics may hold additional prognostic value [9]. In the past, specific CT markers and histogram-based analyses of ICH heterogeneity have been linked to poor clinical outcome and reinforce this notion [14,15,16]. The goal of this study was twofold: First, we hypothesized that quantitative radiomic filter- and texture-derived high-end image features extracted from non-enhanced computed tomography (NECT) brain scans can be used to predict clinical outcome of ICH patients. To test and evaluate this hypothesis, we employed a radiomics-based ML approach on NECT brain scans of patients presenting with acute primary ICH [17]. Secondly, we hypothesized that the diagnostic power of the presented algorithm using high-end image features is equal to the ICH Score serving as the most widely utilized prognostic model for predicting mortality [18].
Materials and Methods
Study Population
We retrospectively analyzed the database of three university hospitals (University Medical Center Hamburg-Eppendorf, Charité University Medical Center Berlin, University Medical Center Münster) with a high-volume tertiary stroke center, for patients with ICH aged ≥18 years between January 2010 and April 2019. Inclusion criteria were defined as follows: Spontaneous ICH confirmed on NECT on admission. Patients were excluded if they had a secondary ICH from head trauma, hemorrhagic transformation of ischemic infarction, brain tumor, cerebral aneurysm, or vascular malformation. Baseline patient characteristics were retrieved from medical records, including Glasgow Coma Scale (GCS) at admission and modified Rankin Scale (mRS) at discharge. Additionally, we obtained vascular risk factors, blood pressure parameters, antiplatelet and oral anticoagulation (OAC) medication, and follow-up procedures, such as craniectomy or intraventricular drainage placement from patients’ clinical records and follow-up CT. A binary clinical outcome was defined based on modified Rankin Scale (mRS) on discharge with ≤3 as good outcome and mRS >3 as poor outcome [19]. According to the inclusion criteria, 520 patients were included, out of which 151 (29%) patients had a good outcome (mRS 0–3) and 369 (71%) patients had a poor outcome (mRS 4–6). Details are listed for further consideration in Table 1. This multicenter retrospective study was approved by the ethics committee (Ethik-Kommission der Ärztekammer Hamburg, Ethik-Komission der Charité Berlin) and written informed consent was waived by the institutional review boards. All study protocols and procedures were conducted in accordance with the Declaration of Helsinki. The deidentified data and analytic code are available from the corresponding author upon reasonable request.
Image Acquisitions
The NECT scans were performed using standard clinical parameters with axial < 5 mm section thickness. All datasets were inspected for quality and excluded in case of severe motion artifacts. In detail, the images were acquired on the following scanners: 256 slice scanner (Philips iCT 256) with 120 kV, 280–320 mA, < 5.0 mm slice reconstruction; 80 slice scanner (Toshiba Aquilion Prime) with 120 kV, 280 mA, < 5.0 mm slice reconstruction and < 0.5 mm in-plane resolution; and 2 × 128 slice scanner (SOMATOM Definition Flash) with 120 kV, 280 mA, < 5.0 mm slice reconstruction and < 0.5 mm in-plane resolution.
Post-procedure Evaluations
NECT scans were obtained and stored for further evaluation. Two experienced neuroradiologists (JN and SE) assessed and documented the following imaging features on NECT scans: [1] intraventricular hemorrhage; [2] ICH location; [3] craniectomy in the follow-up NECT scans. ICH locations were classified as basal ganglia, thalamus, lobe, brain stem, pons, and cerebellum. In the following ICH, volumes were segmented semi-automatically on the basis of the original NECT images [20]. Regions of interest (ROIs) were delineated using Analyze 11.0 Software (Biomedical Imaging Resource, Mayo Clinic, Rochester, MN). Consensus ROIs were derived based on overlapping segmentations of both readers. Both readers were blinded to all clinical information and bleeding location. Discrepancies were settled by joint discussion of the 2 readers and a third reader (UH). JN and SE: 3 years clinical experience in diagnostic neuroradiology in an academic full-service hospital; UH: 8 years clinical experience in diagnostic neuroradiology; JN, SE, and UH: research with focus on clinical applications of image processing and predictive modelling.
ICH Score
ICH Scores were obtained for every patient included according to the definition of Hemphill et al. based on five independent and multidimensional predictors (ICH volume, infratentorial location, GCS, age, and intraventricular extension) [18]. ICH volumes were obtained from ICH delineations. Oral anticoagulants (OAC) were not included as their addition does not increase the prognostic performance of the ICH Score [21]. As the ICH Score is a prognostic model for 30-day mortality in ICH patients (equivalent to mRS 6), a binary mortality outcome was defined based on mRS at discharge with mRS ≤ 5 (survival) and mRS = 6 (death).
Imaging-Based Outcome Prediction
Radiomic features were defined according to the PyRadiomics Python package v2.1.0. Features were extracted from consensus ROIs and resampled to 0.5 mm × 0.5 mm × 2 mm resolution using sitk BSpline interpolators. Resampling was performed to ensure comparability of texture analysis. Extracted features comprised 252 first-order features (thereof 18 based on unfiltered images, 144 based on wavelet decompositions, 90 based on log-sigma laplacian of Gaussian filters), 902 texture features (thereof 68 based on unfiltered images, 544 based on wavelet decompositions, 290 based on log-sigma laplacian of Gaussian filters), and 14 shape features. In total, 1218 quantitative image features were extracted from the ICH ROIs. To adjust for effects of therapeutic interventions that cannot be detected on admission NECTs, we included decompressive craniectomy as sole clinical parameter into the machine learning models.
ML-based classification was performed using random forest algorithms (Python scikit-learn environment v0.20.3 [22]). Random forest is a ML technique that utilizes multiple decision trees trained on random sub-selections of samples in order to improve stability and reduce overfitting of the algorithm [23]. Decision trees learn decision rules according to predictor values of the training data samples. With increasing depth of nodes, decision trees can represent more complex decision rules, resulting in a better fitting of the model [23, 24]. Hyperparameter tuning (total number of features, number of trees, maximum depth of the tree, minimum number of samples to split an internal node, number of features considered for splitting (mtry), minimum number of samples at leaf node, bootstrapping yes/no) was performed in a nested 5-fold cross-validation approach for each training set using grid search algorithms. Parameters at initiation were set to scikit-learn default values.
Selection of features with highest predictive value was conducted separately for each training dataset of the 5-fold cross-validation outer loop sample split according to Gini impurity measures [25]. Classifier models were trained and tested on each set’s unique training and testing samples (outer loop) utilizing optimized hyperparameters and feature importance of the respective training data (inner loop).
Integration of ICH Score and Imaging-Based Outcome Prediction
It was shown that combinations of classification models trained on heterogeneous predictors tend to have higher synergistic effects if knowledge flows are merged at a very late stage of the data evaluation process. Therefore, probabilities for survival of the ICH Score and of the imaging-based classifier were extracted. The arithmetic average of both probabilities was then used for outcome prediction.
Statistics
Model validation and testing of all classifiers was conducted in a nested 5-fold cross-validation with independent training and validation sets in a model-external approach [26]. Accordingly, model selection and hyperparameter tuning was performed with grid search algorithms on each training data set using a second cross-validation layer. Model stability was examined through comparative analysis of 10 randomly permuted cross-validation sets.
Receiver-operating characteristic (ROC) curves were generated from prediction results of all cross-validation sets. Confidence intervals (CI) for sensitivities and specificities were bootstrapped (2000 replicates, pROC v1.15 [27] R-package). Bonferroni adjustments were applied to control for alpha error inflation.
Furthermore, the classifiers were analyzed using ROC areas under the curve (AUC), sensitivity, specificity, accuracy, Youden Index, positive predictive value, negative predictive value (ThresholdROC v2.8 R-package), and Matthews correlation coefficient (MCC) [28] metrics (psychometric v.2.2. R-package). MCC evaluates all fields of the confusion matrix and is considered a favorable measure for unbiased comparisons of binary classifiers [29]. With TP: true positives, TN: true negatives, FP: false positives, and FN: false negatives, MCC is defined as:
A flow chart of the proposed ML-based prediction of the clinical outcome is depicted in Fig. 1.
Results
Our analysis included NECT images of 520 patients with acute ICH. One hundred fifty-one patients (29%) had a mRS of 0–3 and 369 (71%) had a mRS of 4–6. There were no statistically significant differences in clinical parameters age (P value = 0.85), sex (P value = 0.85), hypertension (P value = 0.25), diabetes mellitus (P value = 0.62), antiplatelet or anticoagulant medication (P value = 0.5 and P value = 0.78, respectively), and systolic blood pressure at admission (P value = 0.75). Both time from symptom onset to admission CT and time from CT to hospital discharge were not statistically different (P value 0.92 and P value = 0.13, respectively). However, patients with mRS 4-6 had a significantly lower GCS (GCS 9 versus GCS 14; P value <0.001), higher percentage of intraventricular hemorrhage (59% versus 33.1%; P value <0.001), higher ICH volumes (35.2 cm3 versus 8.4 cm3; P value <0.001), and a higher rate of supra-tentorial craniectomies (27.4% versus 10.6%; P value <0.02). There were no significant differences in ICH locations. ICH Score was significantly higher in patients with mRS 4-6 (median 3 versus 1; P < 0.001).
Imaging-Based Outcome Prediction
Machine learning–based ROC AUCs of the validation sets for predicting functional clinical outcome were 0.80 (95% CI [0.77; 0.82]) for mRS ≤ 2, 0.80 (95% CI [0.78; 0.81]) for mRS ≤ 3, and 0.79 (95% CI [0.77; 0.80]) for mRS ≤ 4. Trained on survival prediction (mRS ≤ 5), the classifier reached ROC AUCs of 0.80 (95% CI [0.78; 0.82]) which was equivalent to results of the ICH Score with ROC AUC of 0.80 (95% CI [0.79; 0.82]) (Fig. 2, Table 2). Exclusion of the parameter craniectomy yes/no had no effect on classification performance. Model selection and hyperparameter tuning within the nested cross-validation process resulted in the following median settings for mRS ≤ 2, ≤ 3, ≤ 4, and ≤ 5, respectively (medians over cross-validation sets): Number of features considered: 25, 100, 200, 100; number of trees: 750, 1000, 500, 1000; maximum depth of trees: 10 for all cut-off values; number of features considered for splitting (mtry), minimum number of samples to split an internal node, and minimum number of samples at leaf node: 1 for all cut-off values. Feature importance analyses of the mean top 100 predictors of all training data sets suggests that features with highest predictive power are mainly derived from wavelet (43%) and log-sigma (30%) filtered images. Unfiltered original images contributed 27% to total predictive power. Within feature classes, texture metrics dominated predictions (58%) (Fig. 3). Predictive power of the 15 most important features demonstrates dominance of texture and shape features compared to first-order metrics (basic statistical measures of the grey level distribution). To also assess the predictive value of the ICH volume only, an additional ROC analysis was performed (supplementary Figure 1). ROC AUC for ICH volume as sole predictor was 0.72 with a Youden Index of 0.30 at 60% specificity and 70% sensitivity.
Integration of ICH Score and Imaging-Based Outcome Prediction
ICH Score metrics reached a ROC AUC of 0.80 (95% CI [0.79; 0.82]), which was equivalent to the purely imaging-based classifier with ROC AUC of 0.80 (95% CI [0.78; 0.82]). If combined, the integrated model showed a significantly higher ROC AUC of 0.84 (95% CI [0.83; 0.86], P value <0.05). Sensitivities of the integrated model were significantly higher at Youden Index maximum cut-offs with 77% vs. 74% sensitivity at 76% specificity, P value <0.05 (Fig. 2, Table 2).
Discussion
In this study, we developed an imaging-based ML model for predicting the functional outcome of ICH patients. The proposed approach employing quantitative image features derived from NECT scans provided high discriminatory accuracy between good and poor functional outcome of ICH patients at different mRS cut-off values. This study is based on a large multicenter and heterogeneous imaging dataset of 520 patients that was acquired in clinical routine over almost a decade. The proposed classification is solely based on high-end image features without a priori information about the location of the hemorrhage and without controlling for factors such as patient conditions, image acquisition parameters, or scanner type. Observed classification performance and model stability across all nested cross-validation runs suggest sufficient generalizability of our results.
It is a well-known paradigm that the ICH volume profoundly impacts functional clinical outcome. Initially derived by Broderick et al. to predict 30-day mortality after ICH, the ICH volume has been later validated and included in the ICH Score [3, 18]. In line with these findings, we have shown that ML-based outcome assessment using ICH volume as sole predictor already achieves ROC AUCs of >0.70 (supplementary Figure 1). Similarly, surrogate parameters of ICH volume such as maximum 2D diameter or minor axis length had comparatively high predictive importances in the imaging-based ML model. However, total contribution to predictive power of shape-based metrics in the comprehensive model was only 19% at ROC AUCs of 0.80. It thus stands to reason that the ICH formation on NECT holds additional and relevant information which is not assessable by human eyes but can be evaluated by imaging-based ML algorithms. As so, analyses of the 100 most powerful features demonstrate the importance of second-order features (e.g., texture metrics) in comparison to first-order features. In contrast to first-order measures, second-order metrics also capture information regarding the spatial distribution of gray levels and are often difficult to evaluate by the human visual system. The predictive value of second-order features is particularly apparent in the high predictive power of the gray level non-uniformity (Fig. 3). This specific finding could be related to the heterogenous appearance of hematomas that are still actively bleeding with evidence of spot sign or in those of patients with anticoagulation that are at risk for further expansion. It is equally conceivable that the gray level non-uniformity may differentiate areas of hyperacute ICH as the blend sign—with blending of a hypoattenuating area and a hyperattenuating region relative to the surrounding brain parenchyma—suggesting hematoma expansion and in reversal poor clinical outcome.
Hence, the proposed approach can be used as supportive tool to augment conventional image analysis and to improve prognostic decision for both radiologists and clinicians. As aspects of precision medicine are an emerging concept [30], combining the ICH Score with high-end imaging features may be useful in this respect. In line with this, the ICH Score seems to be limited in extension to critical care patients. In a prospective multicenter cohort study with patients presenting with spontaneous ICH and admitted to the intensive care unit (ICU), the ICH Score had only acceptable discriminatory power [31]. Although at this stage speculative and part of future studies, the proposed ML classifier may provide promising complementary results. In anticoagulation-associated ICH, the ICH Score may not be as reliable [21, 32, 33] and clinical outcomes in these patients likewise substantially often worse in comparison to patients without oral anticoagulation (OAC) [34, 35]. Assuming that OAC therapy alters morphology and intensity of ICH, it is most likely that radiomic features are affected by OAC therapy. As we trained the ML model on acute CT images of both, patients receiving OAC and patients without OAC, the information on OAC therapy is incorporated in the model through these differences in ICH imaging characteristics.
Since our quantitative imaging feature analysis performs equally in comparison to multidimensional scoring systems (e.g., ICH Score), the application of the proposed ML approach may be of value for randomized clinical trials. Challenges and opportunities to optimize clinical research and randomized trials in ICH are ongoing [36]. The ML approach could simplify trial procedures by performing an imaging-based prediction of functional outcome or early mortality. Simultaneously the multicenter approach of this study takes local variations in practice into account which are necessary to reflect upon a successful trial planning. Furthermore, this approach may also be of value for telemedicine and remote prediction of ICH outcome in regions lacking neuroradiological specialists. Taken together, the proposed method integrates the merits from quantitative radiomic features and ML algorithms and relates the employed predictors to well-known imaging characteristics.
Despite the promising results, several limitations deserve comment. Our study had general limitations typically associated with quantitative radiomics-based image analysis and classification [17, 37,38,39]. These limitations include differences in image acquisition settings (e.g., size of the field of view, gantry tilt) and under- or overfitting of machine learning algorithms. Bias of these factors was minimized through (a) employment of NECT scans that offer standardized HU metrics and (b) the application of random forest algorithms that are comparably stable with regard to overfitting. The risk of overfitting was further reduced by evaluating multiple different models in a nested cross-validation approach. Furthermore, we observed study-specific limitations: First, we included a limited number of patients in a retrospective analysis. An expansion of sample size in a prospective study design would certainly contribute to further improving generalizability of our results. However, observed model stability suggests sufficient robustness for evaluating feasibility and limitations of the proposed algorithm. The utilized dataset includes imaging data from 520 patients acquired over a relatively long period of almost a decade in three different centers. In such heterogeneous datasets, results of nested cross-validation approaches serve as a valid indicator for confirming feasibility and performance of the proposed classifier in the underlying clinical setting. Due to standardized and calibrated quantitative imaging parameters and signal intensity processing of CT scanners, we assume neglectable bias on classifier performance in a generalized setting. Second, the manual definition of ROIs still implies a certain degree of observer dependence within the ML process. To minimize its influence, we employed consensus segmentations from two independent readers and applied a semi-automated delineation method that was shown to have a favorable inter- and intra-observer reliability and a high level of congruence with a fully automated delineation [20, 40]. Furthermore, it was found that radiomic features are relatively stable with regard to variations in segmentations [41, 42]. The lack of data on withdrawal and limitation of care are a further limitation [43]. Final limitation was the missing correlation with long-term data (e.g., mRS at 90 days and mortality) as it might offer additional information but was not available for this study [44].
Conclusion
Quantitative imaging features of acute NECT evaluated by ML algorithms provide a high discriminatory power in predicting functional outcome in patients with spontaneous ICH. Additional integration of the ICH Score increases predictive power of the ML classifier, hence providing promising complementary results. The findings support the potential of ML algorithms to augment conventional image analysis, improve prognostic decision, and simplify trial procedures. In the very near future, such ML techniques may play a pivotal role in determining optimized therapeutic regimes and predicting the prognosis for patients with ICH in an individualized manner.
References
Drury I, Whisnant JP, Garraway WM, Kissela B, Kleindorfer D, Moomaw CJ, et al. Primary intracerebral hemorrhage: impact of CT on incidence. Neurology. 1984;34:653–7.
Jakubovic R, Aviv RI. Intracerebral hemorrhage: toward physiological imaging of hemorrhage risk in acute and chronic bleeding. Front Neurol. 2012;3:86.
Broderick JP, Brott TG, Duldner JE, Tomsick T, Huster G. Volume of intracerebral hemorrhage. A powerful and easy-to-use predictor of 30-day mortality. Stroke. 1993;24:987–93.
Moullaali TJ, Wang X, Martin RH, Shipes VB, Robinson TG, Chalmers J, et al. Blood pressure control and clinical outcomes in acute intracerebral haemorrhage: a preplanned pooled analysis of individual participant data. Lancet Neurol. 2019;18:857–64.
Hemphill JC, Greenberg SM, Anderson CS, Becker K, Bendok BR, Cushman M, et al. Guidelines for the management of spontaneous intracerebral hemorrhage. Stroke. 2015;46:2032–60.
Selim M. Unmet needs and challenges in clinical research of intracerebral hemorrhage. Stroke. 2018;49:1299–307.
Gregório T, Pipa S, Cavaleiro P, Atanásio G, Albuquerque I, Chaves PC, et al. Prognostic models for intracerebral hemorrhage: systematic review and meta-analysis. BMC Med Res Methodol. 2018;18:145.
Heo J, Yoon JG, Park H, Kim YD, Nam HS, Heo JH. Machine learning-based model for prediction of outcomes in acute stroke. Stroke. 2019;50:1263–5.
Rubbert C, Patil KR, Beseoglu K, Mathys C, May R, Kaschner MG, et al. Prediction of outcome after aneurysmal subarachnoid haemorrhage using data from patient admission. Eur Radiol. 2018;28:4949–58.
Rau C-S, Kuo P-J, Chien P-C, Huang C-Y, Hsieh H-Y, Hsieh C-H. Mortality prediction in patients with isolated moderate and severe traumatic brain injury using machine learning models. PLoS One. 2018;13:e0207192.
Arbabshirani MR, Fornwalt BK, Mongelluzzo GJ, Suever JD, Geise BD, Patel AA, et al. Advanced machine learning in action: identification of intracranial hemorrhage on computed tomography scans of the head with clinical workflow integration. npj Digit Med. 2018;1:9.
Scherer M, Cordes J, Younsi A, Sahin Y-A, Götz M, Möhlenbruch M, et al. Development and validation of an automatic segmentation algorithm for quantification of intracerebral hemorrhage. Stroke. 2016;47:2776–82.
Wang H-L, Hsu W-Y, Lee M-H, Weng H-H, Chang S-W, Yang J-T, et al. Automatic machine-learning-based outcome prediction in patients with primary intracerebral hemorrhage. Front Neurol. 2019;10:910.
Morotti A, Boulouis G, Dowlatshahi D, Li Q, Barras CD, Delcourt C, et al. Standards for detecting, interpreting, and reporting noncontrast computed tomographic markers of intracerebral hemorrhage expansion. Ann Neurol. 2019;86:480–92.
Barras CD, Tress BM, Christensen S, Collins M, Desmond PM, Skolnick BE, et al. Quantitative CT densitometry for predicting intracerebral hemorrhage growth. Am J Neuroradiol. 2013;34:1139–44.
Soun JE, Montes D, Yu F, Morotti A, Qureshi AI, Barnaure I, et al. Spot Sign in Secondary Intraventricular hemorrhage predicts rarly neurological decline. Clin Neuroradiol. 2019;1–8 [Online ahead of print].
Kniep HC, Madesta F, Schneider T, Hanning U, Schönfeld MH, Schön G, et al. Radiomics of brain MRI: utility in prediction of metastatic tumor type. Radiology. 2019;290:479–87.
Hemphill JC, Bonovich DC, Besmertis L, Manley GT, Johnston SC. The ICH score: a simple, reliable grading scale for intracerebral hemorrhage. Stroke. 2001;32:891–7.
Volbers B, Staykov D, Wagner I, Dörfler A, Saake M, Schwab S, et al. Semi-automatic volumetric assessment of perihemorrhagic edema with computed tomography. Eur J Neurol. 2011;18:1323–8.
Urday S, Beslow LA, Goldstein DW, Vashkevich A, Ayres AM, Battey TWK, et al. Measurement of perihematomal edema in intracerebral hemorrhage. Stroke. 2015;46:1116–9.
Houben R, Schreuder FHBM, Bekelaar KJ, Claessens D, van Oostenbrugge RJ, Staals J. Predicting prognosis of intracerebral hemorrhage (ICH): performance of ICH score is not improved by adding oral anticoagulant use. Front Neurol. 2018;9:100.
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: machine learning in python. J Mach Learn Res. 2011;12:2825–30.
Breiman L. Mach Learn 2001;45:5–32.
Applications. Cha Zhang • Yunqian Ma Editors ensemble machine learning. [cited 2019 Aug 26];Available from: www.springer.com
Louppe G, Wehenkel L, Sutera A GP. Understanding variable importances in forests of randomized trees. Proc. 26th Int. Conf. Neural Inf. Process. Syst. 2013;1:431–439.
Limkin EJ, Sun R, Dercle L, Zacharaki EI, Robert C, Reuzé S, et al. Promises and challenges for the implementation of computational medical imaging (radiomics) in oncology. Ann Oncol. 2017;28:1191–206.
Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez J-C, et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics. 2011;12:77.
Matthews BW. Comparison of the predicted and observed secondary structure of T4 phage lysozyme. BBA - Protein Struct. 1975;405:442–51.
Powers DMW, Evaluation: from precision, recall and F-measure to ROC, informedness, markedness & correlation. 2011;2:37–63.
Hamburg MA, Collins FS. The path to personalized medicine. N Engl J Med. 2010;363:301–4.
Rodríguez-Fernández S, Castillo-Lorente E, Guerrero-Lopez F, Rodríguez-Rubio D, Aguilar-Alonso E, Lafuente-Baraza J, et al. Validation of the ICH score in patients with spontaneous intracerebral haemorrhage admitted to the intensive care unit in Southern Spain. BMJ Open. 2018;8:e021719.
Katsanos AH, Krogias C, Lioutas VA, Goyal N, Zand R, Sharma VK, et al. The prognostic utility of ICH-score in anticoagulant related intracerebral hemorrhage. J Neurol Sci. 2020;409:116628.
Fakiri MO, Uyttenboogaart M, Houben R, van Oostenbrugge RJ, Staals J, Luijckx GJ. Reliability of the intracerebral hemorrhage score for predicting outcome in patients with intracerebral hemorrhage using oral anticoagulants. Eur J Neurol. 2020;27:2006–13.
Morotti A, Goldstein JN. Anticoagulant-associated intracerebral hemorrhage. Brain Hemorrhages. 2020;1:89–94.
Boulouis G, Morotti A, Pasi M, Goldstein JN, Gurol ME, Charidimou A. Outcome of intracerebral haemorrhage related to non-Vitamin K antagonists oral anticoagulants versus Vitamin K antagonists: a comprehensive systematic review and meta-analysis. J Neurol Neurosurg Psychiatry. 2018;89:263–70.
Selim M, Hanley D, Steiner T, Christensen HK, Lafuente J, Rodriguez D, et al. Recommendations for clinical trials in ICH. Stroke. 2020;51:1333–8.
Gillies RJ, Kinahan PE, Hricak H. Radiomics: images are more than pictures, They Are Data. Radiology. 2016;278:563–77.
Aerts HJWL. The potential of radiomic-based phenotyping in precision medicine. JAMA Oncol. 2016;2:1636–42.
Lambin P, Leijenaar RTH, Deist TM, Peerlings J, de Jong EEC, van Timmeren J, et al. Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol. 2017;14:749–62.
Ironside N, Chen CJ, Mutasa S, Sim JL, Marfatia S, Roh D, et al. Fully automated segmentation algorithm for hematoma volumetric analysis in spontaneous intracerebral hemorrhage. Stroke. 2019;50:3416–23.
Parmar C, Rios Velazquez E, Leijenaar R, Jermoumi M, Carvalho S, Mak RH, et al. Robust radiomics feature quantification using semiautomatic volumetric segmentation. PLoS One. 2014;9:e102107.
Yip SSF, Aerts HJWL. Applications and limitations of radiomics. Phys Med Biol. 2016;61:R150–66.
Zahuranec DB, Brown DL, Lisabeth LD, Gonzales NR, Longwell PJ, Smith MA, et al. Early care limitations independently predict mortality after intracerebral hemorrhage. Neurology. 2007;68:1651–7.
Selim M, Hanley D, Steiner T, Christensen HK, Lafuente J, Rodriguez D, et al. Recommendations for clinical trials in ICH: the second hemorrhagic stroke academia industry roundtable. Stroke. 2020;51:1333–8.
Acknowledgements
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Contributions
- Substantial contributions to conception and design: JN, HK, JF, UH, GT, FQ
- Acquisition and analysis and interpretation of data: SE, CF, PS, LD, AM, GB, GS, TR, MB
- Drafting a significant portion of the manuscript or figures: JN, HK, UH
Corresponding author
Ethics declarations
Conflict of Interest
- J. Fiehler: Consultant for Acandis, Boehringer Ingelheim, Codman, Microvention, Sequent, Stryker. Speaker for Bayer Healthcare, Bracco, Covidien/ev3, Penumbra, Philips, Siemens. Grants from Bundesministeriums für Wirtschaft und Energie (BMWi), Bundesministerium für Bildung und Forschung (BMBF), Deutsche Forschungsgemeinschaft (DFG), European Union (EU), Covidien, Stryker (THRILL study), Microvention (ERASER study), Philips.
- G. Thomalla has received personal fees from Acandis, Bayer, Boehringer Ingelheim, Bristol- Myers Squibb/Pfizer, Daichi Sankyo, Stryker, grants and personal fees from Bayer, grants from the German Research Foundation, Corona Foundation, German Innovation Fund.
- All other authors declare that they have no conflict of interest with a company whose product is used in the study or may be affected by its outcome. Please refer to the ICMJE Form for Disclosure of Potential Conflicts of Interest for further details.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
ESM 1
(DOCX 25 kb)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Nawabi, J., Kniep, H., Elsayed, S. et al. Imaging-Based Outcome Prediction of Acute Intracerebral Hemorrhage. Transl. Stroke Res. 12, 958–967 (2021). https://doi.org/10.1007/s12975-021-00891-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12975-021-00891-8