Abstract
Background
Shunt-dependent hydrocephalus significantly complicates subarachnoid hemorrhage (SAH), and reliable prognosis methods have been sought in recent years to reduce morbidity and costs associated with delayed treatment or neglected onset. Machine learning (ML) defines modern data analysis techniques allowing accurate subject-based risk stratifications. We aimed at developing and testing different ML models to predict shunt-dependent hydrocephalus after aneurysmal SAH.
Methods
We consulted electronic records of patients with aneurysmal SAH treated at our institution between January 2013 and March 2019. We selected variables for the models according to the results of the previous works on this topic. We trained and tested four ML algorithms on three datasets: one containing binary variables, one considering variables associated with shunt-dependency after an explorative analysis, and one including all variables. For each model, we calculated AUROC, specificity, sensitivity, accuracy, PPV, and also, on the validation set, the NPV and the Matthews correlation coefficient (ϕ).
Results
Three hundred eighty-six patients were included. Fifty patients (12.9%) developed shunt-dependency after a mean follow-up of 19.7 (± 12.6) months. Complete information was retrieved for 32 variables, used to train the models. The best models were selected based on the performances on the validation set and were achieved with a distributed random forest model considering 21 variables, with a ϕ = 0.59, AUC = 0.88; sensitivity and specificity of 0.73 (C.I.: 0.39–0.94) and 0.92 (C.I.: 0.84–0.97), respectively; PPV = 0.59 (0.38–0.77); and NPV = 0.96 (0.90–0.98). Accuracy was 0.90 (0.82–0.95).
Conclusions
Machine learning prognostic models allow accurate predictions with a large number of variables and a more subject-oriented prognosis. We identified a single best distributed random forest model, with an excellent prognostic capacity (ϕ = 0.58), which could be especially helpful in identifying low-risk patients for shunt-dependency.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Shunt-dependent hydrocephalus is a significant complication of aneurysmal subarachnoid hemorrhage (SAH) affecting 11 to 39.6% of patients with aneurysmal SAH [1, 8, 13, 17, 27, 29, 35, 39, 49]. Several studies have tried to identify potential predictors of shunt-dependency to estimate prognosis, to offer the best treatment strategy by preventing complications associated with unnecessary treatment or delayed surgical intervention, and to reduce hospitalization and rehabilitation length and costs [1, 5, 8, 9, 11,12,13, 16, 17, 21,22,23, 25, 27, 29, 31, 33, 35, 36, 39, 43, 46,47,48, 50]. Among these variables, some have been proposed, like the patient’s age and gender [22, 23, 33, 35, 36, 47, 48], the neurological status at presentation (Hunt & Hess and WFNS scales) [12, 20,21,22, 25, 27, 47, 48], the amount of cisternal blood on the first CT scan (Fisher and BNI scales) [12, 16, 21,22,23, 25, 35, 47], the presence of acute hydrocephalus on the first CT scan and the need for external ventricular drain (EVD) [1, 12, 21,22,23, 25, 27, 33, 36, 43, 47, 48], the duration of EVD treatment [25, 48], aneurysms location and size [9, 22, 25, 33, 36, 47], the type of treatment for aneurysm exclusion (endovascular or surgical) [9, 25, 35, 36, 50], the onset and duration of posttreatment complications (i.e., fever and/or infections) [25, 27, 36, 47, 50], the duration of blood clearance detected on serial CT scans [29], and altered values of blood or cerebrospinal fluid (CSF) markers [27, 31, 46]. Some meta-analyses [11, 45, 47, 50] have synthesized this information, and scores for risk stratification have been proposed to be used in the clinical practice [5, 12, 13, 21, 27], whose validity, however, has not yet been confirmed on other cohorts.
Despite consistent associations between some variables and the development of shunt-dependency across studies, results differ on the role of other potentially important items, with the likely effect of scores over-emphasizing some variables while neglecting relevant ones. This could represent a limitation when risk scores built on these premises are used in clinical practice.
Modern standards of data analysis and prediction models rely on machine learning (ML), a branch of statistical analysis that is gaining more and more consideration in the medical field due to its excellent results and, more recently, also in neurosurgery [6, 10, 18, 34, 41, 42, 44]. ML consists of algorithm-based models with the ability to learn and perform tasks that are not explicitly programmed, to improve the performances with experience (i.e., when the model analyzes new data), and to work with a large amount of data and nonlinear associations, where classical statistical methods can show some limitations [6, 18, 38].
We aimed at testing the ability of machine learning models to predict the development of shunt-dependent hydrocephalus in aneurysmal SAH patients, intending to develop a prognostic model based on current data analysis standards, in order to reduce omission of potentially relevant variables and allow for better individual risk estimation.
Methods
Data collection and variables selection
Electronic files and radiological data of patients undergoing surgical or endovascular treatment for aneurysmal SAH at our institution between January 2013 and March 2019 were retrospectively consulted to collect information on variables potentially related to shunt-dependency, according to the results of previous works on this topic [1, 5, 7,8,9, 11,12,13, 16, 17, 21,22,23, 25, 27, 29, 33, 36, 39, 43, 46,47,48, 50]. Also, we expanded this information by including quantitative information on the neurological and general clinical status of patients at the time of the acute event (SAH), such as the Karnofsky performance status (KPS), the ASA physical status classification system, the modified rankin scale (mRS), and the National Institute of Health Stroke Scale (NIHSS). The variables considered are summarized in Table 1.
Patients were subdivided according to their GCS at presentation into three groups to train the models: 12–15, 8–11, and < 8. We subdivided treatment timing into very early (< 6 h), early (6–12 h), late (12–24 h), and delayed (> 24 h). Patients were further dichotomized according to the duration of EVD permanence and fever in ≤ 5 days and > 5 days. Fever onset was dichotomized in early and delayed onset (cutoff: 7 days after treatment). The bicaudate index was measured at the first CT after symptoms onset, immediately after treatment and at 14 days (or at the last CT performed within 14 days from the acute event). Additionally, information on follow-up duration and the onset of shunt-dependency were retrieved.
Treatment protocol
Patients with suspected aneurysmal SAH after skull CT and CT-angiography (CTA) referred to our institution were managed by a multidisciplinary team composed of neurosurgeons, endovascular neuroradiologists, and anesthesiologists who chose the most appropriate treatment on a case-by-case basis, taking into account the patient’s clinical status, age, and comorbidities, as well as the entity of SAH and the aneurysm location and morphology. In cases of imaging of insufficient quality of the CTA or unsatisfactory depiction, a digital subtraction angiography (DSA) of the intracranial vessels was performed.
After aneurysm occlusion (either endovascular or surgical), patients were transferred to the intensive care unit (ICU), the intermediate care (IMC) unit, or the neurosurgical ward according to the patients’ preoperative clinical status, age, comorbidities, entity of the subarachnoid hemorrhage, and performed treatment after the operating physician and the anesthesiologist had reached interdisciplinary consensus. In each case, patients underwent neurological and clinical monitoring for a minimum of 14 days after treatment, as well as routine transcranial Doppler studies for early detection of vasospasm. CT of the skull was performed immediately after treatment, whenever neurological deterioration occurred (pre- and postoperative), before sedation weaning (for intubated patients), before EVD removal, and before discharge.
EVD insertion, weaning, and indications for permanent shunts
Indication for EVD was posed in patients with an acute neurological deterioration associated with radiological findings of acute hydrocephalus. In patients who were referred intubated to our institution, EVD was placed when the immediate pre- or postoperative scan showed acute hydrocephalus. In both cases, the reservoir was placed at such a height to drain 10 ml of CSF/h. Once the clinical and neurological status was stable, weaning began by increasing by 2 cm of H2O every 24–48 h until the absence of CSF drainage. Then, the drain was kept closed for 24–48 h, and if neurological status remained stable, a CT scan was performed. If no ventricle dilatation was documented, the EVD was removed. If neurological deterioration occurred during EVD weaning, a CT scan was performed, and in case of evident or suspected ventricle dilatation, the reservoir was open again to drain 10 ml of CSF/h. A new attempt of weaning was made following the same protocol, and if neurological deterioration with ventricle dilatation occurred a second time, the patient was deemed shunt-dependent.
Patients with poor clinical conditions or low GCS, in which recognizing neurological deterioration would have been more challenging during EVD weaning, were treated following the same protocol, and a CT scan was performed prior to each change of the reservoir height.
After discharge, if clinical conditions remained stable, patients underwent a clinical and radiological follow-up with CTA at 3 months, MR-angiography at 6 months, and DSA at 12 months. Other investigations were performed when deemed necessary, in case of suspected incomplete exclusion of the aneurysm or when neurological changes occurred. If ventricle enlargement was detected in association with neurological deterioration during follow-up, a permanent ventricular shunt was indicated.
Statistical analysis, preprocessing, creation, and testing of models
Continuous variables are reported as mean with standard deviation, and categorical variables are expressed as percentages. Statistical analysis, data preprocessing, and graphics creation were performed with SPSS Statistics© 23 (IBM Corp. Armonk, NY, USA) and MATLAB R2020 (MathWorks Inc., Natick, MA, USA; https://www.mathworks. com). A Wilk-Shapiro test was used to assess normal distribution. We first conducted an exploratory statistical analysis, and a comparison of variables between shunt-dependent and non-shunt-dependent patients was performed with a t test for unpaired data for continuous variables and a χ2 test for categorical variables. A Bonferroni correction was used for multiple comparisons.
Before training machine learning models, missing variables for > 40% of patients were removed to avoid the significant influence of the imputation, as well as those patients with missing information on eight or more variables. Missing data were imputed with K-nearest neighbor imputation. Patients deceased before assessment of shunt-dependency were excluded, like those survived but missed on follow-up lacking information on the development of shunt-dependency.
ML models were trained using the open-source platform H2O (https://www.h2o.ai, Mountain View, CA, USA), which provides a package of scripts for ML algorithms whose parameter can be customized ad hoc. We used the web interface (H2O Flow) provided by the site running in Java™ (https://www.java.com, Oracle Corporation, Redwood, CA, USA). For our purposes, we tested four of the most frequently employed algorithms for supervised learning without knowing previously which one would be the most precise for our purposes: generalized linear modeling (GL), distributed random forest (DRF), gradient boosting machine (GBM), and deep learning (DL). The clean dataset was randomly split into training (75% of the patients) and validation set (25%). A 6-fold cross-validation was performed on the training set, before evaluating prediction performances on the validation set. Cross-validation is a resampling technique to obtain a more accurate and less biased estimate of how the model will score on previously unseen data. It consists of creating k samples (in our case, k = 6) of equal size from the training dataset, of which one is used as a validation set and the remaining as a training set. This process is repeated k times, using each of the subsamples once as a validation sample, and the results of all iterations are summarized by metrics mean and standard deviation. In our case, the area under the receiver operating characteristic curve (AUROC or AUC), accuracy, sensitivity, specificity, the positive predictive value (PPV), and the Matthews correlation coefficient were calculated. The Matthews correlation coefficient, or ϕ, is a measure of the quality of a binary classification used in machine learning, with scores ranging between + 1 identifying a perfect prediction and − 1 indicating total disagreement. A score equal to 0 means the model makes no better prediction than a random guess [4].
For each model, the algorithm parameters were customized and fine-tuned to obtain the optimized Matthews correlation coefficient. Also, the binarization threshold was chosen to maximize ϕ. Algorithms training was performed using logloss (logarithmic loss metric) as the stopping parameter: once the algorithm parameters are set by the operator, this procedure iterates the development of models of increasing complexity until the performance of the model decreases. The logloss evaluates how close the predicted values are to the actual ones. Values can be greater than or equal to 0, with 0 meaning that the model correctly predicts an event. For each model variable, importances were calculated, and recursive feature selection was performed by removing variables with lower coefficients stepwise until reaching optimal scores. Performances on the validation sets were synthesized in confusion matrices, and sensitivity, specificity, PPV and negative predictive value (NPV), and accuracy with 95% confidence intervals were calculated, along with the AUC and ϕ.
On both sets, calibration metrics were calculated as the Hosmer-Lemeshow goodness of fit test and as slope and intercept of the calibration curve.
Results
During the considered period, 479 patients underwent treatment of aneurysmal SAH at our institution (mean age: 59 ± 13 years, 320 females [66.8%], mean follow-up: 19.7 ± 12.6 months). Variables retrieved were available in the following proportions: GCS at admission in 393 patients (82% of cases), Fisher score in 377 (78.5%), Hunt-Hess in 395 (82.3%), BNI in 152 (31.6%), WFNS in 392 (81.6%), ICH in 385 (80.2%), IVH in 388 (80.8%), treatment timing in 376 (78.3%), SAH sum score in 150 (31.2%), IVH sum score in 357 (74.4%), BI preoperative in 141 (29.3%), postoperative in 358 (74.6%), at 14 days in 170 (35.4%), at the last CT scan in 249 (51.9%), mRS on admission in 397 (82.7%), ASA class in 390 (81.2%), KPS score in 392 (81.6%), NIHSS score in 392 (81.6%), presence/absence of acute hydrocephalus in 330 (68.7%), need for EVD placement in 392 (81.6%), EVD duration—if EVD present—in 150 (31.2%), rebleeding in 393 (81.9%), aneurysm location in 400 (83.3%), multiple aneurysm in 389 (81%), aneurysm max. diameter in 370 (77.1%), treatment modality in 404 (84.2%), need for posttreatment ICU in 394 (82.1%), DCI in 388 (80.8%), treatment complication in 390 (81.2%), postoperative fever in 390 (81.2%), fever timing in 375 (78.1%), days with fever in 319 (66.5%), meningitis in 382 (79.6%) or other infections in 383 (79.8%), and development of shunt-dependency in 390 (81.2%).
After removing patients deceased before evaluation of the development of shunt-dependency (n = 29 [6%]), those with more than eight missing variables or missing information on the development of shunt-dependency (n = 64, 13.4%), and after eliminating all variables missing for more than 40% of patients, the clean dataset comprised 386 patients and 32 variables (Table 2).
The exploratory statistical analysis yielded significant association with the onset of shunt-dependency for the following variables: Fisher > 2 (p < 0.01), Hunt-Hess = 1 (p = 0.001, negative correlation) or 3 (p = 0.001, positive correlation), WFNS > 2 (p < 0.01), presence of IVH on admission (p < 0.001), higher preoperative IVH sum score (p = 0.001), higher postoperative BI (p = 0.001), lower KPS on admission (p = 0.004), acute hydrocephalus on admission (p < 0.001), need for EVD treatment perioperatively (p < 0.001), the permanence of EVD > 5 days (p < 0.001), the need for posttreatment ICU (p < 0.001), presence of postoperative fever lasting > 5 days (p < 0.001), and the postoperative development of meningitis (p = 0.001) or other infections (p < 0.001). See also Table 2 for details.
Performances after 6-fold cross-validation are summarized in Table 3. The highest accuracy and ϕ on the resampled training set was obtained with a DL algorithm with 31 variables (0.84 [± 0.07]) and 0.54 [± 0.1], respectively). However, the best performances on the validation set were reached with a DRF model including 21 variables (see Tables 4 and 5), with a correlation coefficient (ϕ) of 0.59 an AUC = 0.88, sensitivity and specificity of 0.73 (C.I.: 0.39–0.94) and 0.92 (C.I.: 0.84–0.97), respectively, PPV = 0.59 (0.38–0.77) and 0.96 (0.90–0.98). Accuracy was = 0.90 (0.82–0.95) (see also Table 4).
Figure 1 and Table 6 depict the AUC and the confusion matrix obtained from this model on the validation set, while Fig. 2 is a calibration plot of the model on the training and validation set: for the test set, the calibration slope and intercept were 1.02 and 0.03, respectively, whereas the calibration slope and intercept for the validation set were 0.88 and 0.07. The Hosmer-Lemeshow goodness of fit test showed a good fit of the model on both the resampled training set (χ2 = 1.7, p = 0.99) and the validation set (χ2 = 1.02, p = 1) (see also Tables 6 and 7 and Supplementary Material).
Discussion
In this study, we trained different machine learning models to predict the occurrence of chronic shunt-dependent hydrocephalus after aneurysmal subarachnoid hemorrhage. We further identified a single model with the best performances on previously unseen data, analyzing all variables retrieved with a distributed random forest algorithm (see Supplementary Material for details on the model parameters and the model code).
The results are comparable to previously proposed predictive models (see Table 8), which, however, took into account a limited number of variables selected after a statistical analysis of association with shunt-dependency performed with traditional methods [5, 12, 13, 20, 21, 23].
In comparison with models and scores based on previous statistical concepts, however, ML models bear the advantage of allowing more precise and subject-based predictions by including a substantial amount of variables and analyzing complex nonlinear relationships, rather than fitting the subjects’ features into predetermined models with selected and weighted variables according to statistical significance. As our experience confirms, including items that did not show a significant association with shunt-dependency when creating ML models improved the overall performances. Moreover, in the final model, variables significantly associated with shunt-dependency did not improve the overall model accuracy when used as splitting nodes in the decision trees (see also Supplementary Material). This enables to perform a more flexible and updated prediction for each subject according to small as well as relevant changes in patients’ clinical and radiological conditions.
Additionally, ML models can improve and refine autonomously when new data are provided [37], providing a dynamic model that can increase accuracy with time.
The potentials of machine learning techniques in medicine and neurosurgery have been widely tested, and their employment in diagnostic and prognostic tasks is becoming more and more common given their abilities to outperform human capacity and traditional statistics [18, 38, 41, 42, 44]. Machine learning can be considered an evolution of traditional statistics, and there is no clear line dividing them [3]. The fundamental distinction between machine learning models and traditional statistical approaches is the ability of machine learning models to independently learn from examples rather than perform a pre-programmed task [37].
Classification or regression tasks can be accomplished by supervised learning algorithms. These algorithms work with known variables (input) and outcomes (output) to detect associations between them and, once trained, can generalize this information and predict the outcome when new inputs are provided. In contrast, unsupervised learning algorithms are used to detect unknown clusters or patterns among vast amounts of data [37].
Among four of the most diffused supervised algorithms, in our experience, the most accurate model was built on a distributed random forest algorithm and included 21 items (see Table 3, Table 4 and Supplementary Material). The concept behind the DRF algorithm is to build a set of decision trees, each taking into account a subgroup of randomly selected variables and then summarizing the results of all trees either be mean or by vote to obtain an overall prediction by majority [28]. For each tree, the algorithm identifies a set of decision rules that predict the outcome based on the given variables [24]. A detailed explanation of the other tested algorithm (GL, GBM, DL) can be found here [14, 15, 26, 30, 32].
We relied on the Matthews correlation coefficient (ϕ) to identify the single best model, a metric optimized for data imbalance that is commonly used in ML and bioinformatics [4]. When the sample size in the data classes are unevenly distributed (in our case, shunt-dependent vs. non-shunt-dependent), data imbalance occurs. This frequently happens in ML, resulting in classification models maximizing the accuracy by biasing toward the majority class and leading to poor generalization [40]. In this situation, the standard measures of performance, like accuracy, are no longer a proper measure of imbalanced data. A common way to address this issue is to over-/under-sample one of the two classes [41]. However, this strategy can alter the results when the number in the minority class is limited [4].
An additional issue with model building in ML is overfitting. This is a frequent problem occurring when a too sophisticated and accurate model learns from irrelevant information or randomness of the training dataset. As a result, the predictions on new datasets will be weak. To prevent this problem, we used cross-validation, early stopping, and features removal. Also, a so-called train-test split [19] can be a sign of overfitting: when a model performs with significantly better accuracy on the training set than on the validation set, overfitting is probably occurring. In our DRF model, no train-test split differences were observed under this respect (see Tables 3 and 4).
We have to stress some limitations: first, despite all information being recalled from clinical records (i.e., prospectively acquired), the data were collected retrospectively, and we cannot exclude related biases. Second, observations of radiological scans, for example, the bicaudate index, are highly operator-dependent, and having automatized measurements would make the data more reliable. Third, some potentially relevant variables were not included, like CSF markers, or specific surgical procedures like fenestration of the lamina terminalis: CSF markers are not routinely acquired at our institution, but it would be interesting to test them in future models. Finally, despite the good metrics shown by our final model, its ability to identify patients who will actually need a permanent shunt is less accurate than its capacity to correctly exclude subjects who will not develop chronic shunt-dependent hydrocephalus. The ability to predict the development of a disease is the actual goal of any prediction model. Still, correctly ruling out future negative patients can represent a significant support for clinical decision-making and follow-up planning as well as a tool to reduce hospitalization length and costs. Additionally, it is noteworthy that the positive and negative predictive values of a test are related to the prevalence of the condition to be predicted [2]. In our cohort, the prevalence of shunt-dependency was 12.9%, and we could reasonably expect the same model to show different positive predictive values in cohorts with a different proportion of positive subjects.
Conclusions
We trained and tested a distributed random forest model with 32 features, which reached an excellent sensitivity and specificity with ϕ = 0.59. Compared to previous models built on traditional statistical methods, it can analyze a larger amount of data and variables; work with complex nonlinear relationships; and offer a more flexible, subject-based, and accurate prognostic tool, which autonomously refines with the experience. Even though some limitations are present, prospectively testing this model performance could confirm its prognostic capacity.
Abbreviations
- SAH:
-
Subarachnoid hemorrhage
- ML:
-
Machine learning
- CT:
-
Computerized tomography
- CTA:
-
CT-angiography
- DSA:
-
Digital subtraction angiography
- AUROC/AUC:
-
Area under the receiver operating characteristic curve/area under the curve
- PPV/NPV:
-
Positive predictive value/negative predictive value
- CI:
-
Confidence interval
- EVD:
-
External ventricular drain
- CSF:
-
Cerebrospinal fluid
- WFNS:
-
World Federation of Neurological Surgeons
- BNI:
-
Barrow Neurological Institute
- BI:
-
Bicaudate index
- GCS:
-
Glasgow coma scale
- KPS:
-
Karnofsky performance status
- ASA:
-
American Society of Anesthesiology
- mRS:
-
Modified Rankin scale
- NIHSS:
-
National Institute of Health Stroke Scale
- ICU:
-
Intensive care unit
- IMC:
-
Intermediate care unit
- GL:
-
Generalized linear modeling
- DRF:
-
Distributed random forest
- GBM:
-
Gradient boosting machine
- DL:
-
Deep learning
References
Adams H, Ban VS, Leinonen V, Aoun SG, Huttunen J, Saavalainen T, Lindgren A, Frosen J, Fraunberg M, Koivisto T, Hernesniemi J, Welch BG, Jaaskelainen JE, Huttunen TJ (2016) Risk of shunting after aneurysmal subarachnoid hemorrhage: a collaborative study and initiation of a consortium. Stroke 47:2488–2496. https://doi.org/10.1161/STROKEAHA.116.013739
Altman DG, Bland JM (1994) Statistics notes: diagnostic tests 2: predictive values. BMJ 309:102. https://doi.org/10.1136/bmj.309.6947.102
Beam AL, Kohane IS (2018) Big data and machine learning in health care. Jama 319:1317–1318. https://doi.org/10.1001/jama.2017.18391
Boughorbel S, Jarray F, El-Anbari M (2017) Optimal classifier for imbalanced data using Matthews correlation coefficient metric. PLoS One 12:e0177678. https://doi.org/10.1371/journal.pone.0177678
Chan M, Alaraj A, Calderon M, Herrera SR, Gao W, Ruland S, Roitberg BZ (2009) Prediction of ventriculoperitoneal shunt dependency in patients with aneurysmal subarachnoid hemorrhage. J Neurosurg 110:44–49. https://doi.org/10.3171/2008.5.17560
Cleopas TJ, Zwinderman AH (2015) Machine learning in medicine. Springer, Dordrecht
Czorlich P, Ricklefs F, Reitz M, Vettorazzi E, Abboud T, Regelsberger J, Westphal M, Schmidt NO (2015) Impact of intraventricular hemorrhage measured by Graeb and LeRoux score on case fatality risk and chronic hydrocephalus in aneurysmal subarachnoid hemorrhage. Acta Neurochir 157:409–415. https://doi.org/10.1007/s00701-014-2334-z
de Oliveira JG, Beck J, Setzer M, Gerlach R, Vatter H, Seifert V, Raabe A (2007) Risk of shunt-dependent hydrocephalus after occlusion of ruptured intracranial aneurysms by surgical clipping or endovascular coiling: a single-institution series and meta-analysis. Neurosurgery 61:924–933; discussion 933-924. https://doi.org/10.1227/01.neu.0000303188.72425.24
Dehdashti AR, Rilliet B, Rufenacht DA, de Tribolet N (2004) Shunt-dependent hydrocephalus after rupture of intracranial aneurysms: a prospective study of the influence of treatment modality. J Neurosurg 101:402–407. https://doi.org/10.3171/jns.2004.101.3.0402
Deo Rahul C (2015) Machine learning in medicine. Circulation 132:1920–1930. https://doi.org/10.1161/CIRCULATIONAHA.115.001593
Di Russo P, Di Carlo DT, Lutenberg A, Morganti R, Evins AI, Perrini P (2019) Shunt-dependent hydrocephalus after aneurysmal subarachnoid hemorrhage. A systematic review and meta-analysis. J Neurosurg Sci. https://doi.org/10.23736/s0390-5616.19.04641-1
Diesing D, Wolf S, Sommerfeld J, Sarrafzadeh A, Vajkoczy P, Dengler NF (2018) A novel score to predict shunt dependency after aneurysmal subarachnoid hemorrhage. J Neurosurg 128:1273–1279. https://doi.org/10.3171/2016.12.JNS162400
Dorai Z, Hynan LS, Kopitnik TA, Samson D (2003) Factors related to hydrocephalus after aneurysmal subarachnoid hemorrhage. Neurosurgery 52:763–769; discussion 769-771. https://doi.org/10.1227/01.neu.0000053222.74852.2d
Friedman J, Hastie T, Tibshirani R (2010) Regularization paths for generalized linear models via coordinate descent. J Stat Softw 33:1–22
Friedman JH (2002) Stochastic gradient boosting. Comput Stat Data Anal 38:367–378. https://doi.org/10.1016/s0167-9473(01)00065-2
Garcia S, Torne R, Hoyos JA, Rodriguez-Hernandez A, Amaro S, Llull L, Lopez-Rueda A, Ensenat J (2018) Quantitative versus qualitative blood amount assessment as a predictor for shunt-dependent hydrocephalus following aneurysmal subarachnoid hemorrhage. J Neurosurg:1–8. https://doi.org/10.3171/2018.7.JNS18816
Gruber A, Reinprecht A, Bavinzski G, Czech T, Richling B (1999) Chronic shunt-dependent hydrocephalus after early surgical and early endovascular treatment of ruptured intracranial aneurysms. Neurosurgery 44:503–509; discussion 509-512. https://doi.org/10.1097/00006123-199903000-00039
Gulshan V, Peng L, Coram M, Stumpe MC, Wu D, Narayanaswamy A, Venugopalan S, Widner K, Madams T, Cuadros J, Kim R, Raman R, Nelson PC, Mega JL, Webster DR (2016) Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA 316:2402–2410. https://doi.org/10.1001/jama.2016.17216
Hawkins DM (2004) The problem of Overfitting. J Chem Inf Comput Sci 44:1–12. https://doi.org/10.1021/ci0342472
Hostettler IC, Muroi C, Richter JK, Schmid J, Neidert MC, Seule M, Boss O, Pangalu A, Germans MR, Keller E (2018) Decision tree analysis in subarachnoid hemorrhage: prediction of outcome parameters during the course of aneurysmal subarachnoid hemorrhage using decision tree analysis. J Neurosurg 129:1499–1510. https://doi.org/10.3171/2017.7.JNS17677
Jabbarli R, Bohrer AM, Pierscianek D, Muller D, Wrede KH, Dammann P, El Hindy N, Ozkan N, Sure U, Muller O (2016) The CHESS score: a simple tool for early prediction of shunt dependency after aneurysmal subarachnoid hemorrhage. Eur J Neurol 23:912–918. https://doi.org/10.1111/ene.12962
Jeong TS, Yoo CJ, Kim WK, Yee GT, Kim EY, Kim MJ (2018) Factors related to the development of shunt-dependent hydrocephalus following subarachnoid hemorrhage in the elderly. Turk Neurosurg 28:226–233. https://doi.org/10.5137/1019-5149.JTN.19752-16.1
Kim JH, Kim JH, Kang HI, Kim DR, Moon BG, Kim JS (2019) Risk factors and preoperative risk scoring system for shunt-dependent hydrocephalus following aneurysmal subarachnoid hemorrhage. J Korean Neurosurg Soc. https://doi.org/10.3340/jkns.2018.0152
Kuo P-J, Wu S-C, Chien P-C, Rau C-S, Chen Y-C, Hsieh H-Y, Hsieh C-H (2018) Derivation and validation of different machine-learning models in mortality prediction of trauma in motorcycle riders: a cross-sectional retrospective study in southern Taiwan. BMJ Open 8:e018252–e018252. https://doi.org/10.1136/bmjopen-2017-018252
Lai L, Morgan MK (2013) Predictors of in-hospital shunt-dependent hydrocephalus following rupture of cerebral aneurysms. J Clin Neurosci 20:1134–1138. https://doi.org/10.1016/j.jocn.2012.09.033
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436–444. https://doi.org/10.1038/nature14539
Lenski M, Biczok A, Huge V, Forbrig R, Briegel J, Tonn JC, Thon N (2019) Role of cerebrospinal fluid markers for predicting shunt-dependent hydrocephalus in patients with subarachnoid hemorrhage and external ventricular drain placement. World Neurosurg 121:e535–e542. https://doi.org/10.1016/j.wneu.2018.09.159
Li J, Tian Y, Zhu Y, Zhou T, Li J, Ding K, Li J (2020) A multicenter random forest model for effective prognosis prediction in collaborative clinical research network. Artif Intell Med 103:101814. https://doi.org/10.1016/j.artmed.2020.101814
Mijderwijk HJ, Fischer I, Zhivotovskaya A, Bostelmann R, Steiger HJ, Cornelius JF, Petridis AK (2019) Prognostic model for chronic shunt-dependent hydrocephalus after aneurysmal subarachnoid hemorrhage. World Neurosurg. https://doi.org/10.1016/j.wneu.2018.12.156
Miotto R, Wang F, Wang S, Jiang X, Dudley JT (2017) Deep learning for healthcare: review, opportunities and challenges. Brief Bioinform 19:1236–1246. https://doi.org/10.1093/bib/bbx044
Na MK, Won YD, Kim CH, Kim JM, Cheong JH, Ryu JI, Han MH (2017) Early variations of laboratory parameters predicting shunt-dependent hydrocephalus after subarachnoid hemorrhage. PLoS One 12:e0189499. https://doi.org/10.1371/journal.pone.0189499
Natekin A, Knoll A (2013) Gradient boosting machines, a tutorial. Front Neurorobot 7:21–21. https://doi.org/10.3389/fnbot.2013.00021
O'Kelly CJ, Kulkarni AV, Austin PC, Urbach D, Wallace MC (2009) Shunt-dependent hydrocephalus after aneurysmal subarachnoid hemorrhage: incidence, predictors, and revision rates. J Neurosurg 111:1029–1035. https://doi.org/10.3171/2008.9.Jns08881
Obermeyer Z, Emanuel EJ (2016) Predicting the future - big data, machine learning, and clinical medicine. N Engl J Med 375:1216–1219. https://doi.org/10.1056/NEJMp1606181
Paisan GM, Ding D, Starke RM, Crowley RW, Liu KC (2018) Shunt-dependent hydrocephalus after aneurysmal subarachnoid hemorrhage: predictors and long-term functional outcomes. Neurosurgery 83:393–402. https://doi.org/10.1093/neuros/nyx393
Park YK, Yi HJ, Choi KS, Lee YJ, Chun HJ, Kwon SM, Kim DW (2018) Predicting factors for shunt-dependent hydrocephalus in patients with aneurysmal subarachnoid hemorrhage. Acta Neurochir 160:1407–1413. https://doi.org/10.1007/s00701-018-3560-6
Rajkomar A, Dean J, Kohane I (2019) Machine learning in medicine. N Engl J Med 380:1347–1358. https://doi.org/10.1056/NEJMra1814259
Ramos LA, van der Steen WE, Sales Barros R, Majoie C, van den Berg R, Verbaan D, Vandertop WP, Zijlstra I, Zwinderman AH, Strijkers GJ, Olabarriaga SD, Marquering HA (2019) Machine learning improves prediction of delayed cerebral ischemia in patients with subarachnoid hemorrhage. J Neurointerv Surg 11:497–502. https://doi.org/10.1136/neurintsurg-2018-014258
Rincon F, Gordon E, Starke RM, Buitrago MM, Fernandez A, Schmidt JM, Claassen J, Wartenberg KE, Frontera J, Seder DB, Palestrant D, Connolly ES, Lee K, Mayer SA, Badjatia N (2010) Predictors of long-term shunt-dependent hydrocephalus after aneurysmal subarachnoid hemorrhage. Clinical article. J Neurosurg 113:774–780. https://doi.org/10.3171/2010.2.Jns09376
Staartjes VE, Schroder ML (2018) Letter to the editor. Class imbalance in machine learning for neurosurgical outcome prediction: are our models valid? J Neurosurg Spine 29:611–612. https://doi.org/10.3171/2018.5.Spine18543
Staartjes VE, Serra C, Muscas G, Maldaner N, Akeret K, van Niftrik CHB, Fierstra J, Holzmann D, Regli L (2018) Utility of deep neural networks in predicting gross-total resection after transsphenoidal surgery for pituitary adenoma: a pilot study. Neurosurg Focus 45:E12. https://doi.org/10.3171/2018.8.Focus18243
Staartjes VE, Zattra CM, Akeret K, Maldaner N, Muscas G, Bas van Niftrik CH, Fierstra J, Regli L, Serra C (2019) Neural network-based identification of patients at high risk for intraoperative cerebrospinal fluid leaks in endoscopic pituitary surgery. J Neurosurg:1–7. https://doi.org/10.3171/2019.4.Jns19477
Tso MK, Ibrahim GM, Macdonald RL (2016) Predictors of shunt-dependent hydrocephalus following aneurysmal subarachnoid hemorrhage. World Neurosurg 86:226–232. https://doi.org/10.1016/j.wneu.2015.09.056
van Niftrik CHB, van der Wouden F, Staartjes VE, Fierstra J, Stienen MN, Akeret K, Sebok M, Fedele T, Sarnthein J, Bozinov O, Krayenbuhl N, Regli L, Serra C (2019) Machine learning algorithm identifies patients at high risk for early complications after intracranial tumor surgery: registry-based cohort study. Neurosurgery. https://doi.org/10.1093/neuros/nyz145
Wilson CD, Safavi-Abbasi S, Sun H, Kalani MYS, Zhao YD, Levitt MR, Hanel RA, Sauvageau E, Mapstone TB, Albuquerque FC, McDougall CG, Nakaji P, Spetzler RF (2017) Meta-analysis and systematic review of risk factors for shunt dependency after aneurysmal subarachnoid hemorrhage. 126:586. https://doi.org/10.3171/2015.11.Jns152094
Wostrack M, Reeb T, Martin J, Kehl V, Shiban E, Preuss A, Ringel F, Meyer B, Ryang YM (2014) Shunt-dependent hydrocephalus after aneurysmal subarachnoid hemorrhage: the role of intrathecal interleukin-6. Neurocrit Care 21:78–84. https://doi.org/10.1007/s12028-014-9991-x
Xie Z, Hu X, Zan X, Lin S, Li H, You C (2017) Predictors of shunt-dependent hydrocephalus after aneurysmal subarachnoid hemorrhage? A Systematic Review and Meta-Analysis. World Neurosurg 106(844–860):e846. https://doi.org/10.1016/j.wneu.2017.06.119
Yang T-C, Chang CH, Liu Y-T, Chen Y-L, Tu P-H, Chen H-C (2013) Predictors of shunt-dependent chronic hydrocephalus after aneurysmal subarachnoid haemorrhage. Eur Neurol 69:296–303. https://doi.org/10.1159/000346119
Zaidi HA, Montoure A, Elhadi A, Nakaji P, McDougall CG, Albuquerque FC, Spetzler RF, Zabramski JM (2015) Long-term functional outcomes and predictors of shunt-dependent hydrocephalus after treatment of ruptured intracranial aneurysms in the BRAT trial: revisiting the clip vs coil debate. Neurosurgery 76:608–613; discussion 613-604; quiz 614. https://doi.org/10.1227/neu.0000000000000677
Zeng J, Qin L, Wang D, Gong J, Pan J, Zhu Y, Sun T, Xu K, Zhan R (2019) Comparing the risk of shunt-dependent hydrocephalus in patients with ruptured intracranial aneurysms treated by endovascular coiling or surgical clipping: an updated meta-analysis. World Neurosurg 121:e731–e738. https://doi.org/10.1016/j.wneu.2018.09.207
Funding
Open Access funding provided by Università degli Studi di Firenze within the CRUI-CARE Agreement.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they no conflict of interest.
Ethical approval
For this type of study, formal consent is not required.
Informed consent
Informed consent was obtained from all individual participants included in the study.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article is part of the Topical Collection on Vascular Neurosurgery - Other
Electronic supplementary material
ESM 1
(DOCX 422 kb)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Muscas, G., Matteuzzi, T., Becattini, E. et al. Development of machine learning models to prognosticate chronic shunt-dependent hydrocephalus after aneurysmal subarachnoid hemorrhage. Acta Neurochir 162, 3093–3105 (2020). https://doi.org/10.1007/s00701-020-04484-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00701-020-04484-6