Development and validation of a machine learning-based prognostic risk stratification model for acute ischemic stroke

Wang, Kai; Hong, Tao; Liu, Wencai; Xu, Chan; Yin, Chengliang; Liu, Haiyan; Wei, Xiu’e; Wu, Shi-Nan; Li, Wenle; Rong, Liangqun

doi:10.1038/s41598-023-40411-2

Development and validation of a machine learning-based prognostic risk stratification model for acute ischemic stroke

Article
Open access
Published: 23 August 2023

Volume 13, article number 13782, (2023)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Development and validation of a machine learning-based prognostic risk stratification model for acute ischemic stroke

Download PDF

Kai Wang^1,2^na1,
Tao Hong^3,4,5^na1,
Wencai Liu⁶^na1,
Chan Xu⁷,
Chengliang Yin⁸,
Haiyan Liu^1,2,
Xiu’e Wei^1,2,
Shi-Nan Wu⁹,
Wenle Li ORCID: orcid.org/0000-0002-2933-646X^2,7 &
…
Liangqun Rong^1,2

1761 Accesses
2 Citations
1 Altmetric
Explore all metrics

Abstract

Acute ischemic stroke (AIS) is a most prevalent cause of serious long-term disability worldwide. Accurate prediction of stroke prognosis is highly valuable for effective intervention and treatment. As such, the present retrospective study aims to provide a reliable machine learning-based model for prognosis prediction in AIS patients. Data from AIS patients were collected retrospectively from the Second Affiliated Hospital of Xuzhou Medical University between August 2017 and July 2019. Independent prognostic factors were identified by univariate and multivariate logistic analysis and used to develop machine learning (ML) models. The ML model performance was assessed by area under the receiver operating characteristic curve (AUC) and radar plot. Shapley Additive explanations (SHAP) values were used to interpret the importance of all features included in the predictive model. A total of 677 AIS patients were included in the present study. Poor prognosis was observed in 209 patients (30.9%). Six variables, including neuron specific enolase (NSE), homocysteine (HCY), S-100β, dysphagia, C-reactive protein (CRP), and anticoagulation were included to establish ML models. Six different ML algorithms were tested, and Random Forest model was selected as the final predictive model with the greatest AUC of 0.908. Moreover, according to SHAP results, NSE impacted the predictive model the most, followed by HCY, S-100β, dysphagia, CRP and anticoagulation. Based on the RF model, an online tool was constructed to predict the prognosis of AIS patients and assist clinicians in optimizing patient treatment. The present study revealed that NSE, HCY, CRP, S-100β, anticoagulation, and dysphagia were important factors for poor prognosis in AIS patients. ML algorithms were used to develop predictive models for predicting the prognosis of AIS patients, with the RF model presenting the optimal performance.

The Modified Fisher Scale Lacks Interrater Reliability

Article 16 November 2020

Inflammatory Responses After Ischemic Stroke

Article 29 June 2022

Predictors of the unfavorable outcomes in acute ischemic stroke patients treated with alteplase, a multi-center randomized trial

Article Open access 12 March 2024

Introduction

Acute ischemic stroke (AIS), is the fifth leading cause of death in the United States¹, and the leading cause of death in China². AIS imposes a very heavy burden on high-income countries with the rapid development of social economy. Furthermore, the corresponding burden is increasing rapidly in low-income and middle-income countries³. The identification of risk factors associated with poor prognosis in AIS patients could assist clinicians to provide close surveillance and timely intervention for high-risk stroke patients with poor prognosis and may guide the implementation of strategies in organized stroke care provision, thereby optimizing clinical outcomes. Machine learning (ML) algorithms can efficiently identify features highly correlated to outcomes from a large number of features, outperforming traditional statistical methods^4,5. Consequently, ML can be used to improve the prediction accuracy of prognosis.

ML algorithms using big data have improved and optimized the predictive performance of the prognosis prediction, as reported by prior literature^6,7,8. Therefore, the goal in this retrospective study was to identify factors associated with prognosis for patients with stroke and develop an ML-based predictive model. Moreover, the proposed model was integrated in an online tool to provide clinicians and patients with visual and practical predictive assessment.

Materials and methods

Data source and collection

Retrospective data were obtained from electronic health record of the Second Affiliated Hospital of Xuzhou Medical University. Patients diagnosed with AIS from August 2017 to July 2019 were enrolled in this study cohort. Inclusion criteria were: The diagnosis of AIS met the requirements of the World Health Organization with symptom onset less than 24 h⁹. Exclusion criteria: (1) Incomplete clinical data. (2) Patients with severe abnormal organ function. (3) Inadequate auxiliary examinations. (4) Follow-up less than one year. The diagnosis process of AIS was completed and confirmed independently by two physicians. In situations where there was diagnostic disagreement, the final diagnosis was reviewed with a senior physician to reach a consensus. This study was approved by the Ethics Committee of the Second Affiliated Hospital of Xuzhou Medical University, and all studies were conducted in accordance with relevant guidelines/regulations and the Declaration of Helsinki, and informed consent was obtained from all participants.

Data processing and variable selection

In the data cleaning process, the information of subjects with missing value was deleted, and normality tests were performed for the continuous numerical variables to assess the presence of outliers. Subsequently, the information of subjects with outliers was deleted. Ultimately, the current study included a total of 677 subjects in the database. The data were collected by trained and qualified members of the research group using a uniformly designed questionnaire, which included living area, occupation, education level, family economic status, eating habits (alcohol intake, drinking sugary drinks, smoking, high-fat diet, etc.), lifestyle and habits, disease medication, menstrual history (female) and psychosocial factors. First, the purpose and significance of the survey were explained to the patients. On the basis of patients fully understanding, informed consent forms were signed, and the patients were asked to read the guidance on the questionnaire carefully and fill it in. Patients with reading difficulties, such as illiteracy or poor eyesight, were aided by members of the research group and the contents of the questionnaire were read to them to help them fill it in. The type and severity of stroke were evaluated by clinical symptoms, head CT, MR, and angiography results. Blood lipid, blood glucose, homocysteine, and other data were obtained by laboratory tests. Blood pressure, weight, height, time of onset, and baseline National Institutes of Health Stroke Scale (NIHSS) score were also recorded. Drug use, stroke recurrence and clinical prognosis were recorded during follow-up. Dysphagia was assess by water swallowing test according to the criteria¹⁰. Clinical prognosis was assessed according to the modified Rankin Scale (mRS) according to the results of 1-year follow-up. The mRS ≥ 3 was classified as poor prognosis and mRS < 3 was classified as good prognosis. All detection indexes were completed by the Second Hospital of Xuzhou Medical University. The flowchart of data collection is shown in Fig. 1.

Binary categorical features were encoded with 0 and 1; for example, the gender of patients was encoded as 0 or 1 (0 = male, 1 = female). A total of 30 demographic and clinical variables were included as baseline variables for analysis. All parameter data were retrieved from inpatient electronic medical records. Univariate logistic analysis was performed to identify impactful clinical variables, and then statistically significant variables were included in multivariate logistic analysis.

Model development and performance evaluation

This dataset was randomly split into a training cohort (70%) and a validation cohort (30%). The training cohort was used to construct and perform cross-validated ML models to avoid overfitting, and the validation database was used to validate the predictive power of models.

Based on the results of multivariate logistic analysis, the identified statistically significant variables were used to construct ML models. Six ML algorithms were employed to develop predictive models based on the training cohort, the performance of all candidate models was assessed using tenfold cross-validation within the training cohort, the final model was identified according to the highest mean AUC and the performance was further validated on the validation cohort. During tenfold cross-validation, the training cohort was divided into ten sets, nine of them were used for model testing and one for model assessment. Meanwhile, radar plots were drawn and the accuracy, sensitivity and specificity were calculated to evaluate performance. Due to the “black-box” nature of ML algorithms, the Shapley Additive explanations (SHAP) metric was used to interpret the models and assist doctors understand the findings of the models. The contribution of each variable to the model prediction was evaluated by SHAP values^11,12.

Statistical analysis

Continuous and categorical variables were expressed as mean ± SD and frequency, respectively. P-Values < 0.05 were considered statistically significant with 95% confidence intervals (CIs) applied for all analyses. Statistical analyses of the demographic and clinical characteristics of all included patients were performed in R (version 4.0.5, HTTPS: //www.r-project.org/). Python (version 3.8) was used to develop ML predictive models and a web risk calculator.

Ethics approval and consent to participate

This study was approved by the Ethics Committee of the Second Affiliated Hospital of Xuzhou Medical University (ethics number: [2020] 081603), and all studies were conducted in accordance with relevant guidelines/regulations, informed consent was obtained from all participants, and all studies were conducted in accordance with the Declaration of Helsinki.

Results

Characteristics of study population

A total of 713 AIS patients were included in this study, and 36 patients with missing value were deleted. Finally, 677 patients diagnosed as AIS were enrolled in the present study, and 209 patients had poor prognosis (30.9%). The differences between AIS patients with good and poor prognosis are described in Table 1. No differences in stroke occurrence between age and gender groups were observed, and there were no significant differences in the location of stroke-associated arterial blood vessels.

Table 1 Baseline characteristics of AIS patients between good and poor prognosis.

Full size table

Correlation of variables with clinical outcome

A total of 474 patients were assigned to the training cohort (70% of the total population). In the training cohort, univariable regression was used to identify fifteen significant risk factors, including Systolic Blood Pressure (SBP), Diastolic Blood Pressure (DBP), Homocysteine (HCY), Myoglobin (MB), C-reactive protein (CRP), Neuron-Specific Enolase (NSE), S100β, treatment history (thrombolysis, thrombectomy, antiplatelet, anticoagulation, statin, PPI), and complicates (dysphagia and stroke-associated pneumonia) (Table 2). Subsequently, the previously identified variables were used to determine independent risk factors using multivariate regression. The results of the multivariable analysis in the training cohort are presented in Table 2. Independent risk factors associated with prognosis included NSE, HCY, CRP, S-100β, dysphagia, and anticoagulation.

Table 2 Univariate and multivariate logistic regression analysis of risk factors for poor prognosis in AIS patients.

Full size table

Development and validation of predictive models

Based on the six significant risk factors identified through multivariable Cox regression analysis on the training cohort, the following six machine learning algorithms were used to develop predictive models: Naive Bayesian classification (NBC), extreme Gradient Boosting (XGB), Random Forest (RF), Decision Tree (DT), Gradient Boosting Machine (GBM), and Logistic Regression (LR). For internal validation, tenfold cross-validation was employed to compare the performance of all models. ROC curves were plotted, and the AUCs of all established models are illustrated in Fig. 2. Among them, the RF model exhibited the best prediction performance with an AUC of 0.931. The AUCs of the XGBboost, GBM, DT, LR, and NBC models were 0.922, 0.923, 0.923, 0.829, and 0.871, respectively. Additionally, the selected models were comprehensively evaluated by calculating their accuracy, sensitivity, and specificity, as depicted in Fig. 3. Consequently, the RF model was hereby identified as the most effective predictive model for the prognosis of AIS, displaying high accuracy (0.789), sensitivity (0.755), and specificity (0.877). The F1 score is a commonly used metric for evaluating the overall performance of classification models, especially in situations with imbalanced class distributions. The F1 score ranges between zero and one, with values closer to one indicating better model performance. In the case of the final RF model, the F1 score was 0.699. Additionally, an external cohort was used to test the model, and the result also suggested that the RF model is the optimal model to predict the prognosis, with the highest ACU of 0.908. The AUCs for each model were: 0.884 for the XGB model, 0.883 for the GBM model, 0.879 for the DT model, 0.882 for the LR model, and 0.860 for the NBC model (Fig. 4).

Model interpretation

This study analyzed the independent validation set in the RF model through the SHAP package, as shown in Fig. 5. In the final RF model, feature importance rankings of six predictors are shown in Fig. 5A. Based on the SHAP summary plots for poor prognosis in AIS patients, the related features ranked from highest to lowest importance were NSE, HCY, S-100β, dysphagia, CRP and anticoagulation. Figure 5B showed the distribution of the contribution of each feature to the model output. The influence of feature values on the results is represented by colors, with each dot representing a case in each row, red dots representing larger feature values, and blue representing lower ones. The smaller the feature value, the corresponding SHAP value is less than zero7, indicating a negative impact. On the other hand, the larger the feature value, the corresponding SHAP value is greater than zero, indicating a positive impact. The region with the widest distribution is NSE, indicating that it has the greatest impact.

Online application for prognosis prediction

Based on the RF model, an easy-to-use online calculator was built to predict the prognosis of AIS patients, which can be obtained at https://mlmedicine-prog-prog-stroke-ysnamt.streamlitapp.com/. By entering the patient's clinical data, doctors and patients can obtain the estimated probability of poor prognosis immediately. Readers can also use these detailed parameter settings to reproduce the proposed model in Python. The model_parameter_settings.txt can be downloaded from the following publicly available GitHub repositories. (https://github.com/Wu-Shi-Nan/Acute-ischemic-stroke/blob/main/model_parameter_settings.txt).

Discussion

Stroke is the second most common cause of disability and the leading cause of death among adults¹³, imposing a substantial burden on families and society in terms of disease and medical impact. Furthermore, this burden has consistently increased over the past three decades^14,15. Consequently, there is an urgent need to research early and accurate prediction of adverse outcomes in patients with AIS, to facilitate precise clinical management and surveillance.

According to the present study, dysphagia was identified as a significant cause of poor prognosis in patients with stroke. Dysphagia is a common complication after a stroke¹⁶. Martino et al. reported that dysphagia occurs in 37% to 78% of patients with stroke and increases the risk for pneumonia 3–11 times in patients with confirmed aspiration¹⁷. Additionally, previous literature has proved that dysphagia is the most important cause of post-stroke pneumonia^18,19,20. Stroke inhibits immunological responses through the activation of the autonomic nervous system and stress axis, contributing to the development of stroke-associated pneumonia, further exacerbating the condition²¹.

The present findings showed that CRP has a negative effect on the prognosis in stroke patients. CRP is an inflammatory response indicator, which can be significantly elevated during the acute phase of an inflammatory response. Meanwhile, stroke and infection are strongly intertwined. Many studies have reported that impaired immune response occurs in stroke patients, resulting in increased susceptibility to infections²². It has been previously reported that the presence of infection during acute and subacute phases of stroke is a common predictor of a poorer prognosis²³. Higher CRP values as associated with worse the infection²⁴. Increased c-reactive protein is related to a poorer prognosis regarding the course of stroke²³.

Meanwhile, AIS patients are prone to develop inflammatory response, and the immune-inflammatory response is an essential process post-stroke²⁵. Xie et al. showed that the interaction between the activation of coagulation and the inflammatory response leads to the consumption of coagulation factors. Additionally, patients with a higher international normalized ratio had more serious strokes, and the increased international normalized ratio was an independent predictor of one-year all-cause mortality²⁶. Anticoagulation increases the international normalized ratio and bleeding risk, thus increasing the incidence of poor prognosis. Bautista et al. suggested that warfarin use at baseline was associated with mortality²⁷, which is consistent with the findings of this study.

S100β and NSE were positively associated with poor prognosis in the present findings. S-100β, a member of a family of Ca + binding proteins, is a small acidic calcium-binding protein and abundantly expressed in the nervous system, mostly in astrocytes and several neuronal populations²⁸. Clinically, it is considered to be a serum marker of cerebral damage and hypoxia for assessing neurological prognosis²⁹. NSE, a dimeric isoenzyme of the glycolytic enzyme enolase, is involved in the glycolytic pathway. It is abundantly present in neurons and cells of neuroendocrine origin and used as a serum marker for neuronal loss, and its highest activity is found in brain tissue cells^30,31. S100β and NSE are markers of glial cells³². Due to stroke, ischemia and hypoxia occur in the brain, leading to rupture of the membrane structure of the neurons, and the process of acute stroke causes a large release of S100β and NSE into the blood. The degree of neurological impairment can be assessed to some extent by S100β and NSE^23,33. Previous studies have reported that increased levels of NSE and S100B are positively correlated with poor outcomes of stroke patients^34,35,36, which is consistent with our findings. NSE and S100β were also included in the presented predictive model to evaluate the prognosis of patients with stroke. Meanwhile, NSE ranked first among all predictors based on SHAP analysis. In addition, HCY was related to a poor outcome in stroke patients, and the relationship between hyperhomocysteinemia and poor outcome in stroke patients has been previously reported³⁷. When stroke patients presented with inflammatory response, cellular injury and necrosis would occur. Furthermore, as a response to this reaction, adenosine triphosphate would be released into the extracellular space, and hydrolyzed into adenosine. Adenosine has anti-inflammatory and tissue-protective properties³⁸. However, high HCY could reduce activities and protein content of electron transport chain components, hence lowering mitochondrial energy metabolism and decreasing adenosine triphosphate production³⁹. Thereby, for patients with high HCY, the anti-inflammatory and tissue protection in AIS patients were worse, and the prognosis was poorer accordingly.

An RF model was developed in this study to predict the risk of poor prognosis in AIS patients, which showed excellent performance compared with other ML models. The factors NSE, HCY, CRP, S-100β, anticoagulation, and dysphagia were identified as independent factors for poor prognosis. This study has laid a foundation for timely and accurate prediction of poor prognosis risk in order to further organize stroke care. Finally, poor prognosis rates for AIS patients can be easily obtained by entering corresponding clinical features in the developed online calculator. As shown in Fig. 6, the probability is calculated online quickly (Probability of poor prognosis = 9.4%, low-risk group). The wide application of smart devices nowadays makes the developed tool greatly convenient to use.

Previous studies have attempted to use ML algorithms to evaluate the prognosis of stroke^{5,40,41,42,43}. Peng et al. leveraged a cohort of 423 patients for predicting 30-day mortality in spontaneous intracerebral hemorrhage patients⁴². Bacchi et al. designed models using LR, RF, DT, and artificial neural networks to predict in-hospital mortality⁴³. Chen et al. used five ML techniques, including CatBoost, XGB, Boosting Decision Tree, RF, and AdaBoost, to investigate the 90-day poor prognosis in patients with transient ischemic attack or minor stroke. The factors associated with poor prognosis were identified, and the CatBoost model (AUC = 0.839) had the best predictive performance⁴⁰. However, their study had a relatively short predictive window time. Heo et al. explored the applicability of ML algorithms to predict long-term prognosis in AIS patients. They reported that the deep neural network with an AUC of 0.888 can improve the prediction of long-term outcomes, but the predictive variables were not identified⁵.

Compared to other studies, the proposed predictive model has the following advantages. First, a comprehensive range of ML algorithms were tested, including NBC, XGB, RF, DT, GBM, and LR. This comprehensive approach allows exploration of different modeling techniques and selecting the most suitable algorithm for the specific prediction task. Second, several significant risk factors were identified through the analysis, such as NSE, HCY, CRP, S-100β, anticoagulation, and dysphagia, which are independently associated with unfavorable outcomes in AIS patients. The variables mentioned above are mostly readily available patient information, which makes this model potentially suitable for assisting clinical applications. Proper use of the model can provide valuable insights into the factors influencing prognosis and aids in clinical decision-making. Third, this study employed a relatively long prediction window, which extends the observation period and captures a broader range of patient outcomes. This reduces the impact of short-term fluctuations and random variability, enhancing the reliability and stability of the predictions. Fourth, the SHAP values were used to interpret the model, providing insights into the contribution and importance of each feature in the prediction process. This enhances the model transparency and interpretability, allowing for better understanding and trust in the results. Fifth, a web-based platform that visualizes the predictive model and provides a dynamic calculator for easy and practical usage was developed. This user-friendly interface enhances the accessibility and usability of the proposed model, making it more convenient for clinical applications. Overall, the predictive model stands out by utilizing clinical samples combined with various ML algorithms, identifying significant risk factors, employing a long prediction window, providing model interpretation, and offering a user-friendly platform. These advantages contribute to its potential clinical utility and effectiveness in predicting AIS outcomes.

There are several limitations present in this study. First, this is a retrospective study, which introduces the potential for selection bias and limits the establishment of causal relationships. Second, the sample size was not large enough, which may affect the generalization ability of the findings, and the data collection was limited to a single medical center, which may restrict the external validity of the results. Despite the good predictive power of our ML model, external validation in a completely different patient cohort is unavailable. Third, it is recommended to compare the predictive model with another prognostic system, such as the Acute Stroke Registry and Analysis of Lausanne score, to provide a more comprehensive evaluation and enhance the persuasiveness of the findings⁴⁴. Finally, it should be noted that the NIHSS score, which stands for National Institutes of Health Stroke Scale, is a widely predictor of outcomes in AIS predictive models⁴⁵. The NIHSS score upon admission was found to be a significant factor in the development of unfavorable outcomes. It is important to acknowledge that the absence of the NIHSS score data in the final predictive model might have limited its performance. A prospective multiple-center cohort could be designed to enhance the credibility of the results in the future.

Conclusion

In summary, this paper established machine learning models to predict the prognosis of acute ischemic stroke. This tool can assist clinicians and patients to predict prognoses and formulate management strategies.

Data availability

Corresponding authors can be contacted for data upon reasonable request.

References

Mozaffarian, D. et al. Heart disease and stroke statistics–2015 update: A report from the American Heart Association. Circulation 131(4), e29-322 (2015).
PubMed Google Scholar
Collaborators GBDLRoS et al. Global, regional, and country-specific lifetime risks of stroke, 1990 and 2016. N. Engl. J. Med. 379(25), 2429–2437 (2018).
Google Scholar
Kim, A. S., Cahill, E. & Cheng, N. T. Global stroke belt: Geographic variation in stroke burden worldwide. Stroke 46(12), 3564–3570 (2015).
PubMed Google Scholar
Han, H. & Liu, W. The coming era of artificial intelligence in biological data science. BMC Bioinform. 20(Suppl 22), 712 (2019).
Google Scholar
Heo, J. et al. Machine learning-based model for prediction of outcomes in acute stroke. Stroke 50(5), 1263–1265 (2019).
PubMed Google Scholar
Lee, K. C. et al. Prediction of prognosis in patients with trauma by using machine learning. Medicina (Kaunas) 58(10), 1379 (2022).
PubMed Google Scholar
Li, C. et al. Machine learning predicts the prognosis of breast cancer patients with initial bone metastases. Front. Public Health 10, 1003976 (2022).
PubMed PubMed Central Google Scholar
Chen, S. et al. Machine learning-based prognosis signature for survival prediction of patients with clear cell renal cell carcinoma. Heliyon 8(9), e10578 (2022).
CAS PubMed PubMed Central Google Scholar
Stroke--1989: Recommendations on stroke prevention, diagnosis, and therapy. Report of the WHO Task Force on Stroke and other Cerebrovascular Disorders. Stroke 1989, 20:1407–1431.
Güleç, A., Albayrak, I., Erdur, Ö., Öztürk, K. & Levendoglu, F. Effect of swallowing rehabilitation using traditional therapy, kinesiology taping and neuromuscular electrical stimulation on dysphagia in post-stroke patients: A randomized clinical trial. Clin. Neurol. Neurosurg. 211, 107020 (2021).
PubMed Google Scholar
Lundberg, S. M. et al. From local explanations to global understanding with explainable AI for trees. Nat. Mach. Intell. 2(1), 56–67 (2020).
PubMed PubMed Central Google Scholar
Nohara, Y. M. K., Soejima, H. & Nakashima, N. Explanation of machine learning models using shapley additive explanation and application for real data in hospital. Comput. Methods Progr. Biomed. 214, 106584 (2022).
Google Scholar
Roth, G. A. et al. Global, regional, and national age-sex-specific mortality for 282 causes of death in 195 countries and territories, 1980–2017: A systematic analysis for the Global Burden of Disease Study 2017. Lancet 392(10159), 1736–1788 (2018).
Google Scholar
Collaborators, G. B. D. S. Global, regional, and national burden of stroke and its risk factors, 1990–2019: A systematic analysis for the Global Burden of Disease Study 2019. Lancet Neurol. 20(10), 795–820 (2021).
Google Scholar
Winstein, C. J. et al. Guidelines for adult stroke rehabilitation and recovery: A guideline for healthcare professionals from the American Heart Association/American Stroke Association. Stroke 47(6), e98–e169 (2016).
PubMed Google Scholar
Dziewas, R. et al. Pharyngeal electrical stimulation for early decannulation in tracheotomised patients with neurogenic dysphagia after stroke (PHAST-TRAC): A prospective, single-blinded, randomised trial. Lancet Neurol. 17(10), 849–859 (2018).
PubMed Google Scholar
Martino, R. et al. Dysphagia after stroke: Incidence, diagnosis, and pulmonary complications. Stroke 36(12), 2756–2763 (2005).
PubMed Google Scholar
Smith, C. J. et al. Diagnosis of stroke-associated pneumonia: Recommendations from the pneumonia in stroke consensus group. Stroke 46(8), 2335–2340 (2015).
PubMed Google Scholar
Armstrong, J. R. & Mosher, B. D. Aspiration pneumonia after stroke: Intervention and prevention. Neurohospitalist 1(2), 85–93 (2011).
PubMed PubMed Central Google Scholar
Finlayson, O. et al. Stroke Outcome Research Canada Working G: Risk factors, inpatient care, and outcomes of pneumonia after ischemic stroke. Neurology 77(14), 1338–1345 (2011).
CAS PubMed Google Scholar
Hotter, B. et al. Inflammatory and stress markers predicting pneumonia, outcome, and etiology in patients with stroke: Biomarkers for predicting pneumonia, functional outcome, and death after stroke. Neurol. Neuroimmunol. Neuroinflamm. 7(3), e692 (2020).
PubMed PubMed Central Google Scholar
Shi, K., Wood, K., Shi, F. D., Wang, X. & Liu, Q. Stroke-induced immunosuppression and poststroke infection. Stroke Vasc. Neurol. 3(1), 34–41 (2018).
PubMed PubMed Central Google Scholar
Lasek-Bal, A. et al. The importance of selected markers of inflammation and blood-brain barrier damage for short-term ischemic stroke prognosis. J. Physiol. Pharmacol. 70(2), 209–217 (2019).
Google Scholar
Pathak, A. & Agrawal, A. Evolution of C-reactive protein. Front. Immunol. 10, 943 (2019).
CAS PubMed PubMed Central Google Scholar
Anrather, J. & Iadecola, C. Inflammation and stroke: An overview. Neurotherapeutics 13(4), 661–670 (2016).
CAS PubMed PubMed Central Google Scholar
Xie, X. et al. Prognostic value of international normalized ratio in ischemic stroke patients without atrial fibrillation or anticoagulation therapy. J. Atheroscler. Thromb. 26(4), 378–387 (2019).
CAS PubMed PubMed Central Google Scholar
Bautista, A. F. et al. Early prediction of prognosis in elderly acute stroke patients. Crit. Care Explor. 1(4), e0007 (2019).
PubMed PubMed Central Google Scholar
Donato, R. et al. S100B’s double life: Intracellular regulator and extracellular signal. Biochim. Biophys. Acta 1793(6), 1008–1022 (2009).
CAS PubMed Google Scholar
Stroick, M. F. M., Ragoschke-Schumm, A., Fassbender, K., Bertsch, T. & Hennerici, M. G. Protein S-100B—A prognostic marker for cerebral damage. Curr. Med. Chem. 13, 3053–3060 (2006).
CAS PubMed Google Scholar
Shen, Q. Q., Wang, W., Wu, H. & Tong, X. W. The effect of edaravone combined with DL-3-N-butylphthalide on the levels of tumor necrosis factor-alpha, interleukin-10, neuron-specific enolase and effect in patients with acute cerebral infarction. J. Physiol. Pharmacol. 73(3), 371–376 (2022).
Google Scholar
Jauch, E. C. et al. Group Nr-PSS: Association of serial biochemical markers with acute ischemic stroke: The National Institute of Neurological Disorders and Stroke recombinant tissue plasminogen activator Stroke Study. Stroke 37(10), 2508–2513 (2006).
CAS PubMed Google Scholar
Rashwan, H. M. et al. Bioactive phytochemicals from Salvia officinalis attenuate cadmium-induced oxidative damage and genotoxicity in rats. Environ. Sci. Pollut. Res. Int. 28(48), 68498–68512 (2021).
CAS PubMed Google Scholar
Kanavaki, A. et al. Serum levels of S100b and NSE proteins in patients with non-transfusion-dependent thalassemia as biomarkers of brain ischemia and cerebral vasculopathy. Int. J. Mol. Sci. 18(12), 2724 (2017).
PubMed PubMed Central Google Scholar
Bloomfield, S. M., McKinney, J., Smith, L. & Brisman, J. Reliability of S100B in predicting severity of central nervous system injury. Neurocrit. Care 6(2), 121–138 (2007).
CAS PubMed Google Scholar
Hu, Y. et al. Serum neuron specific enolase may be a marker to predict the severity and outcome of cerebral venous thrombosis. J. Neurol. 265(1), 46–51 (2018).
CAS PubMed Google Scholar
Kanazawa, M., Takahashi, T., Nishizawa, M. & Shimohata, T. Therapeutic strategies to attenuate hemorrhagic transformation after tissue plasminogen activator treatment for acute ischemic stroke. J. Atheroscler. Thromb. 24(3), 240–253 (2017).
CAS PubMed PubMed Central Google Scholar
Forti, P. et al. Homocysteinemia and early outcome of acute ischemic stroke in elderly patients. Brain Behav. 6(5), e00460 (2016).
PubMed PubMed Central Google Scholar
Le, T. T. et al. Purinergic signaling in pulmonary inflammation. Front. Immunol. 10, 1633 (2019).
CAS PubMed PubMed Central Google Scholar
Kaplan, P., Tatarkova, Z., Sivonova, M. K., Racay, P. & Lehotsky, J. Homocysteine and mitochondria in cardiovascular and cerebrovascular systems. Int. J. Mol. Sci. 21(20), 7698 (2020).
CAS PubMed PubMed Central Google Scholar
Chen, S. D. et al. Machine learning is an effective method to predict the 90-day prognosis of patients with transient ischemic attack and minor stroke. BMC Med. Res. Methodol. 22(1), 195 (2022).
PubMed PubMed Central Google Scholar
Zhu, Z. et al. Serum hepatocyte growth factor is probably associated with 3-month prognosis of acute ischemic stroke. Stroke 49(2), 377–383 (2018).
CAS PubMed PubMed Central Google Scholar
Peng, S. Y., Chuang, Y. C., Kang, T. W. & Tseng, K. H. Random forest can predict 30-day mortality of spontaneous intracerebral hemorrhage with remarkable discrimination. Eur. J. Neurol. 17(7), 945–950 (2010).
PubMed Google Scholar
Bacchi, S. et al. Stroke prognostication for discharge planning with machine learning: A derivation study. J. Clin. Neurosci. 79, 100–103 (2020).
PubMed Google Scholar
Ntaios, G. F. M., Ferrari, J., Lang, W., Vemmos, K. & Michel, P. An integer-based score to predict functional outcome in acute ischemic stroke: The ASTRAL score. Neurology 78, 1916–1922 (2012).
CAS PubMed Google Scholar
Chung, C. C., Su, E. C., Chen, J. H., Chen, Y. T. & Kuo, C. Y. XGBoost-based simple three-item model accurately predicts outcomes of acute ischemic stroke. Diagnostics (Basel) 13(5), 842 (2023).
CAS PubMed Google Scholar

Download references

Funding

This study was supported by: Scientific Research Project of Jiangsu Health Committee (No.H2019054), the Xuzhou Science and Technology Planning Project (No. KC21220) and Development Fund of Affiliated Hospital of Xuzhou Medical University (No.XYFM202250), Shaanxi Provincial Health and Health Research Fund Project (2022E006).

Author information

These authors contributed equally: Kai Wang, Tao Hong and Wencai Liu.

Authors and Affiliations

Department of Neurology, The Second Affiliated Hospital of Xuzhou Medical University, Xuzhou, Jiangsu, China
Kai Wang, Haiyan Liu, Xiu’e Wei & Liangqun Rong
Key Laboatory of Neurological Diseases, The Second Affiliated Hospital of Xuzhou Medical University, Xuzhou, Jiangsu, China
Kai Wang, Haiyan Liu, Xiu’e Wei, Wenle Li & Liangqun Rong
Pediatric Surgery Ward, Fuwai Hospital Chinese Academy of Medical Sciences, Shenzhen, China
Tao Hong
Department of Cardiovascular Surgery, General Hospital of Northern Theater Command, Shenyang, 110000, China
Tao Hong
Postgraduate College, Dalian Medical University, Dalian, 116000, China
Tao Hong
Department of Orthopaedics, Shanghai Jiao Tong University Affiliated Sixth People’s Hospital, 600 Yishan Road, Shanghai, 200233, China
Wencai Liu
The State Key Laboratory of Molecular Vaccinology and Molecular Diagnostics & Center for Molecular Imaging and Translational Medicine, School of Public Health, Xiamen University, Xiamen, China
Chan Xu & Wenle Li
Faculty of Medicine, Macau University of Science and Technology, Macau, China
Chengliang Yin
School of Medicine, Eye Institute of Xiamen University, Xiamen University, Xiamen, Fujian, China
Shi-Nan Wu

Authors

Kai Wang
View author publications
You can also search for this author in PubMed Google Scholar
Tao Hong
View author publications
You can also search for this author in PubMed Google Scholar
Wencai Liu
View author publications
You can also search for this author in PubMed Google Scholar
Chan Xu
View author publications
You can also search for this author in PubMed Google Scholar
Chengliang Yin
View author publications
You can also search for this author in PubMed Google Scholar
Haiyan Liu
View author publications
You can also search for this author in PubMed Google Scholar
Xiu’e Wei
View author publications
You can also search for this author in PubMed Google Scholar
Shi-Nan Wu
View author publications
You can also search for this author in PubMed Google Scholar
Wenle Li
View author publications
You can also search for this author in PubMed Google Scholar
Liangqun Rong
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

W.L.L., Q.L.R. and S.N.W. completed the study design. K.W., W.C.L. and W.L.L. performed the study, collected and analyzed the data. T.H. and W.L.L. drafted the manuscript. Q.L.R., X.E.W., K.W. and H.Y.L. provided the expert consultations and suggestions. C.X. and C.L.Y. conceived of the study, participated in its design and coordination, and helped to embellish language. All authors reviewed the final version of the manuscript.

Corresponding authors

Correspondence to Shi-Nan Wu, Wenle Li or Liangqun Rong.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, K., Hong, T., Liu, W. et al. Development and validation of a machine learning-based prognostic risk stratification model for acute ischemic stroke. Sci Rep 13, 13782 (2023). https://doi.org/10.1038/s41598-023-40411-2

Download citation

Received: 03 March 2023
Accepted: 09 August 2023
Published: 23 August 2023
DOI: https://doi.org/10.1038/s41598-023-40411-2
Springer Nature Limited

Development and validation of a machine learning-based prognostic risk stratification model for acute ischemic stroke

Abstract

Similar content being viewed by others

The Modified Fisher Scale Lacks Interrater Reliability

Inflammatory Responses After Ischemic Stroke

Predictors of the unfavorable outcomes in acute ischemic stroke patients treated with alteplase, a multi-center randomized trial

Introduction