Abstract
Background
Stroke-associated pneumonia (SAP) is a preventable determinant for poor outcome after stroke. Machine learning (ML) using large-scale clinical data warehouses may be able to predict SAP and identify patients for targeted interventions. The aim of this study was to develop a prediction model for identifying clinically apparent SAP using automated ML.
Methods
The ML model used clinical and laboratory parameters along with heart rate (HR), heart rate variability (HRV), and blood pressure (BP) values obtained during the first 48 h after stroke unit admission. A logistic regression classifier was developed and internally validated with a nested-cross-validation (nCV) approach. For every shuffle, the model was first trained and validated with a fixed threshold for 0.9 sensitivity, then finally tested on the out-of-sample data and benchmarked against a widely validated clinical score (A2DS2).
Results
We identified 2390 eligible patients admitted to two-stroke units at Charité between October 2020 and June 2023, of whom 1755 had all parameters available. SAP was diagnosed in 96/1755 (5.5%). Circadian profiles in HR, HRV, and BP metrics during the first 48 h after admission exhibited distinct differences between patients with SAP diagnosis vs. those without. CRP, mRS at admission, leukocyte count, high-frequency power in HRV, stroke severity at admission, sex, and diastolic BP were identified as the most informative ML features. We obtained an AUC of 0.91 (CI 0.88–0.95) for the ML model on the out-of-sample data in comparison to an AUC of 0.84 (CI 0.76–0.91) for the previously established A2DS2 score (p < 0.001). The ML model provided a sensitivity of 0.87 (CI 0.75–0.97) with a corresponding specificity of 0.82 (CI 0.78–0.85) which outperformed the A2DS2 score for multiple cutoffs.
Conclusions
Automated, data warehouse-based prediction of clinically apparent SAP in the stroke unit setting is feasible, benefits from the inclusion of vital signs, and could be useful for identifying high-risk patients or prophylactic pneumonia management in clinical routine.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Prognosis after stroke is often poor, with more than 40% of patients becoming disabled, institutionalized, or dying within 3 months of the index event [1]. Early medical intervention to treat major modifiable factors may limit mortality and morbidity in stroke [2, 3]. In particular stroke-associated pneumonia (SAP) is consistently associated with a high risk of early mortality in acute stroke [4,5,6]. The pathogenesis of SAP includes a stroke-induced immunodepression characterized by lymphopenia as well as lymphocytic and monocytic dysfunction impairing antibacterial defenses [14, 15, 30, 32]. The stroke-induced immunodepression may protect against excessive neuroinflammation but increases the risk of post-stroke infections, especially pneumonia [7, 8]. Several clinical parameters are associated with SAP including old age, stroke severity, autonomic dysfunction, impaired consciousness and, most importantly, dysphagia and immune dysfunction [8,9,10, 33,34,35]. However, identifying patients at high risk for SAP remains challenging and is currently not broadly implemented in clinical routine despite the availability of widely validated risk scores like the A2DS2 score [15]. The increasing availability of clinical data warehouses for large-scale data acquisition and analysis in many clinical centers may provide novel opportunities for automated, machine learning (ML)-based predictions of SAP and targeted timely interventions.
To gauge SAP risk caused by immune dysfunction, markers related to the intricate interaction between the autonomic nervous system (ANS) and the immune system may be particularly useful. Specifically, while a well-regulated ANS is crucial for maintaining immune homeostasis, stroke may lead to altered cardiovascular function and dysregulation of the ANS, affecting the balance between sympathetic and parasympathetic activity. These changes in the ANS are reflected in the electrocardiography and blood pressure monitoring data of stroke patients [36].
In this retrospective study, we developed, validated and out-of-sample tested a prognostic ML model for predicting the risk of pneumonia after stroke. Taking into consideration ANS and immune system interactions and relying on our previous work [36], in addition to common predictive features based on demographics, comorbidities, and clinical characteristics, we incorporate features derived from electrocardiography and blood pressure monitoring data into an ML model to investigate their impact on SAP prediction for the first time. The model is applicable for automated use in the stroke unit setting during the acute phase after admission.
Materials and methods
Dataset
All patients diagnosed with nontraumatic hemorrhagic (ICD-10: I61.-) or ischemic stroke (ICD-10: I63.-) in one of two separate stroke units at Charité—Universitätsmedizin Berlin, Germany, between October 2020 and June 2023 were initially selected. The two stroke units, consisting of a total of 20 monitoring beds, allowed data transfer and integration into the Data Warehouse Connect (DWC) system (Philips) for long-term storage of monitoring data. The Charité/BIH (Berlin Institute of Health) Health Data Lake (HDL), a Hadoop-based platform that allows the storage of a multitude of clinical, epidemiological, laboratory, and monitoring data, was used for further data integration, harmonization, and analysis. Usage and analysis of the data were approved by the Institutional Review Board of Charité—Universitätsmedizin Berlin. Electrocardiogram (ECG) data were recorded with Philips MP30 and MP50 monitors and stored for analysis in the data lake. We collected all beat-to-beat intervals from heartbeats marked by the Philips monitors for up to 48 h after admission. Besides ECG measures, the data lake also contained a comprehensive set of additional parameters from each patient, including blood pressure values, laboratory values, clinical scores, and diagnoses. SAP was diagnosed by the treating physician based on clinical symptoms and/or suggestive clinical examination and/or radiological findings and/or microbiological evidence of pulmonary infection in the stroke unit.
A2DS2—a clinical score to benchmark machine learning (ML)-based prediction.
The A2DS2 score was developed previously as a prognostic score for predicting the risk of pneumonia after ischemic stroke [10]. Table 1 summarizes how the score is composed on an ordinal scale from 0 to 10. We used this score to benchmark ML-based predictions of SAP.
Feature selection for machine learning
Our goal was to develop a prediction algorithm for SAP based on data from the first 48 h after stroke unit admission, which would be suitable for automated use in a data warehouse setting. Based on previous studies [10, 11], the following selection of risk factors was coded into features and included in this study for the ML-based prediction of SAP. First, A2DS2 score variables requiring little patient history were collected: age (numeric), sex (binary), and National Institute of Health Stroke Scale (NIHSS, numeric) at admission. Furthermore, we included a modified Rankin Scale (mRS, numeric) at admission and the presence of ischemic or intracranial hemorrhage (I63, binary) as additional features along with laboratory values, including CRP (binary, < 5 mg/l or not) and leukocyte count (binary, within 3.9–10.5 × 109/L or not), that were recorded within 48 h after admission. Finally, based on the available monitoring data, heart rate (HR, from ECG), heart rate variability (HRV, from ECG), and blood pressure (BP) metrics were calculated for the first 48 h after patient admission. Both non-invasive (NBP) and arterial (ABP) blood pressure measurements were included based on availability.
Calculation of heart rate, heart rate variability, and blood pressure metrics.
HRV measured within 24–72 h after stroke onset has been investigated as a potential prognostic indicator [36,37,38]. Accordingly, we evaluated HR, HRV, and BP during the initial 48 h after admission.
For the calculation of HR/HRV-associated metrics, we collected heartbeat data in the form of beat-to-beat intervals (RR) of consecutive heartbeats directly from the Philips monitors. Adhering to common standards we divided the data into 5-min segments [39]. Removing ectopic beats entirely from the data led to a high decrease in patient numbers. Therefore, instead of exclusively analyzing normal heartbeat data, we deployed an artifact detection and correction, as well as a detrending method, described by Lipponen and Travainen [41, 42].
Keeping in mind the circadian variations of the monitoring metrics over the day, we calculated a 24-h time course for each patient and metric. For HR and BP (systolic, diastolic, mean), we derived the median value for each hour of the day from the averaged 5-min segments of the hour, respectively. The median of the HRV metrics was equally obtained by considering the values of all available 5-min segments for the respective hour. Patients were only considered for analysis if all metrics could be derived for at least 10 out of 24 h (Fig. 1).
Compliant with current field standards, we calculated the following five HRV measures using the NeuroKit2 python toolbox [39, 40]: SDNN (standard deviation of beat-to-beat intervals), RMSSD (root mean square of successive RR intervals), LF (low-frequency power, 0.04–0.15 Hz), HF (high-frequency power, 0.15–0.4 Hz) and the ratio of LF/HF. Finally, ML features were generated by averaging each metric over the 24 values of individual hours, with an exception of the HR, where only values between 21:00 and 7:00 were averaged to better capture the circadian characteristics following previous work [36] (Fig. 2, gray shaded area).
Machine learning: training, validation, out-of-sample testing
We employed a supervised logistic regression model to predict the development of clinically apparent SAP as a binary classification. We assessed algorithm performance using a nested cross-validation (nCV) approach: The whole data set was split 100 times 4:1 into 80% training and validation sets and 20% out-of-sample testing sets. For reproducibility and comparability, we deployed StratifiedShuffleSplit of the scikit-learn python library with a fixed random state. For every shuffle, we applied a fivefold-CV grid search on the training and validation set, optimizing the area under the receiver-operating curve. Within the grid search, we trained a logistic regression classifier with a newton-cg solver and L2 penalty. Hyperparameters (C-values 0.001, 0.1, 1, 10, 100, 1000) were tuned for every fold and the best performing model was selected. Using the selected parameters of the best estimator, the logistic regression classifier was subsequently trained on the entire training and validation set of the respective shuffle. A threshold was then chosen to obtain a sensitivity of 0.9 or larger. This threshold was based on clinical considerations [10, 13]. Finally, these trained models along with their selected thresholds were tested and evaluated on the unseen, out-of-sample test sets and performance metrics were averaged over the 100 shuffles.
The ML pipeline was compiled to balance and normalize the data (using sklearn’s MinMaxScaler). We applied the Synthetic Minority Over-sampling Technique (SMOTE) to balance the two classes (SAP vs. no SAP) during training within the grid search. SMOTE is a form of data augmentation, where new samples are synthesized from existing examples. The new synthetic data points are generated by applying k-nearest-neighbors to a random sample of the minority class, then selecting a random member of the resulting k-neighbors and finally creating the synthetic sample at a randomly selected point between the initial point and its randomly chosen neighbor in the feature space. This way, synthetic samples are created until the dataset is balanced.
Performance assessment
We assessed average ML model performance on 100 ensembles of unseen, out-of-sample test sets by calculating the area under the receiver-operator-characteristics (AUC), sensitivity, and specificity for every shuffle. Additionally, we calculated the A2DS2 score [10, 13] and assessed its classifying capabilities on every iteration both on the training and validation set as well as on hold-out test set and benchmarked the results against the ML performance. We obtained 95% confidence intervals of all metrics from bootstrapping (n = 200).
Results
We identified 2390 eligible patients admitted to two stroke units between October 2020 and June 2023 matching our diagnosis criteria (Fig. 1A). From the initial selection, 635 patients were excluded from further analysis due to either (1) missing laboratory or clinical values, (2) because no SAP diagnosis was recorded in the quality management system, but pneumonia was indicated within 7 days after admission in a broader clinical dataset, (3) missing blood pressure data, or (4) insufficient ECG data within the first 48 h (see flowchart in Fig. 1). To fully evaluate the circadian profiles of HR and HRV, we required HR/HRV data to cover at least 10 h in each patient. For the remaining 1755 patients, SAP was diagnosed in 96/1,755 (5.5%). The baseline characteristics of the patients are summarized in Table 2.
Figure 2 shows HR, HRV, and BP values for patients with (red) and without (blue) SAP diagnosis. Differences between the groups included an overall lower HR and a pronounced HR dip during the night (gray shaded area), lower values for HRV (with exception for LF/HF), as well as generally higher diastolic BP in the control group. The distinct differences between the groups thus motivated the use of HR, HRV, and BP as additional features in ML. Average values of these metrics were consequently combined with clinical and laboratory features to obtain a feature vector for each patient consisting of age, sex, main diagnosis (I61.- or I63.-), NIHSS at admission, mRS at admission, CRP, leukocyte count, HR, SDNN, RMSSD, LF, HF, LF/HF, systolic BP, diastolic BP and mean BP. These features were selected to allow for maximized automation when implemented in a data warehouse and stroke unit setting, as they only require minimal patient history or clinical tests.
With these features, we obtained an AUC of 0.91 (95% CI 0.88–0.95) for the ML model on the out-of-sample test data. Similarly, we calculated an AUC of 0.84 (CI 0.76–0.91; Fig. 3A) for the A2DS2 score as a benchmark for our model. ML provided a significantly higher AUC than A2DS2 (p < 0.001, Wilcoxon signed-rank test). With the fixed sensitivity thresholds of 0.9 obtained during training and validation, the ML model provided a sensitivity of 0.87 (CI 0.75–0.97) and a corresponding specificity of 0.82 (CI 0.78–0.85) on the out-of-sample test data. The ML model demonstrated superior performance compared to the A2DS2 score, achieving higher specificity at A2DS2 cutoffs of 2 (specificity 0.42, CI 0.37–0.46), 3 (specificity 0.62, CI 0.58–0.66), and 4 (specificity 0.71, CI 0.67–0.75), all while maintaining a high level of sensitivity.
Shapley values identified the most informative features during training and validation as CRP, mRS at admission, leukocyte count, HF, NIHSS at admission, sex, and diastolic BP followed by systolic BP, age, and HR while LF, LF/HF, SDNN, RMSSD and type of stroke (I63/I61) only had a marginal impact on model classification (Fig. 4).
Discussion
Our study developed a prognostic ML model for predicting the risk of post-stroke pneumonia during the acute phase of the index event. It is applicable for automated use in the stroke unit. We used clinical and laboratory parameters known to be predictive for SAP, which are routinely collected and do not require extensive history taking or additional tests. We extended these features by including physiological parameters from HR, HRV, and BP obtained during the first 48 h after admission, which exhibited distinct profiles in SAP patients compared to controls.
Importantly, to our knowledge, our study represents the first clinical-scale and ML-based investigation to include heart rate variability (HRV) and associated variables and vitals related to ANS function for SAP prediction. We aimed to integrate these distinct circadian profiles in our model, including the nocturnal non-dipping of heart rate [36]. On out-of-sample data, the ML method provided good discrimination performance between patients developing SAP vs. those that did not, outperforming a previously developed scoring system [10].
Previous research has identified several predictive biomarkers of SAP [15, 26]. Blood-based biomarkers included immune, inflammatory, and stress-related proteins as well as ratios and indices such as the neutrophil-to-lymphocyte ratio (NLR), systemic immune-inflammation index (SII), platelet-to-lymphocyte ratio (PLR), and systemic inflammation response index (SIRI), of which the NLR was reported as the best predictor for SAP occurrence [24]. Heart rate variability [27] and in particular very low-frequency HRV [28], an index of integrative autonomic-humoral control, has been reported as an early marker of sub-acute post-stroke infections, including in experimental models [29]. However, these biomarkers only marginally improved the prediction of SAP over routine clinical parameters [23, 28]. Thus, careful evaluation of prognostic markers is needed [25]. It is reassuring that our data-driven approach identified CRP, leukocyte count, HR, and diastolic BP among the informative ML features which are classical parameters for infection diagnosis.
Using prognostic markers, several SAP prediction scores have been proposed, including the A2DS2 score, the 22-point ISAN score, the PNA score, and the ACDD score. The ICH-LR2S2 score has been developed specifically for SAP after acute intracranial hemorrhage [19]. Comparative internal or external validations of these scores have been performed [13, 20,21,22]. A large external validation study reported the A2DS2 score to have the highest sensitivity (87%) and the AIS-APS score to have the highest specificity (92.8%) [20]. Another comparative study concluded that the clinical prediction scores varied in their simplicity of use and, while comparable in performance, their utility for preventive intervention trials and in clinical practice required further investigation [21]. More recently, ML-based prediction of SAP, including methods based on natural language processing, has also been explored [17, 18]. These studies reported AUCs of 0.84 which is below the AUC of 0.91 reported here.
In this context, it is important to note that any score and ML model also requires a cutoff or threshold to be provided along with the respective score or ML model to be useful for everyday clinical application and decision support. While AUC is a convenient measure that takes into account many potential cutoffs or thresholds to quantify the general discriminative power of a score or ML model, a pre-determined threshold is required to be established for clinical use. Many studies have not provided such a pre-determined, fixed cutoff based on which to derive sensitivity and specificity when externally testing the performance. By choosing the best threshold post hoc, sensitivity, and specificity values may thus potentially appear over-optimistic. In contrast, we here determined a threshold from training data only and based on clinical considerations (i.e., sensitivity equal or above 0.9) to make the approach applicable for real-world use. We then applied this fixed threshold to the out-of-sample test data for evaluation of sensitivity and specificity.
The development of a prediction score that identifies stroke patients at risk for stroke-associated pneumonia has important clinical implications. By identifying high-risk patients, healthcare providers could take proactive steps to prevent the development of pneumonia, like to intensify methods of pneumonia prophylaxis such as implementing measures to reduce the risk of aspiration (such as optimizing the patient's position during feeding and adapting food consistency), targeted speech and language therapy, more in-depth clinical examinations as well as more frequent blood tests to check for signs of infection [12, 31].
While preventive antibiotic therapy did not improve functional outcomes after stroke, local immunomodulation could open up a new research opportunity to find preventive management for SAP [15, 26]. The benefits of robust SAP prediction regarding patient wellbeing, but also health care costs could be substantial: shorter hospital stays, less—or timelier and more targeted—use of expensive antibiotics with potential side effects (which could slow the development of antibiotic resistances), and better long-term outcome after stroke and much more. In this context, it is an interesting question whether the altered HRV biomarkers analyzed here could also serve as potential targets for preventive measures. Exploring the therapeutic implications and the possibility of mitigating the risk of aspiration pneumonia by modulating HRV should be a relevant focus of future research.
Limitations of this study are inherited in its retrospective setting. Prospective validation in an external patient cohort would enhance validity. In our study, the presence of SAP was defined according to the discretion of the treating physician. Although the diagnosis of SAP follows PISCES recommendations, standardized recording and evaluation of all diagnostic criteria in each case would improve interpretation of the results [16]. Compared with previous internal and external validation studies, we found a surprisingly high level of discrimination for the A2DS2 score. Previous analyses showed this capability to be highly dependent on the thoroughness of the SAP definition applied. However, the reported frequency of SAP in this cohort is well in line with previous studies. The A2DS2 score exhibited similar performance compared to previous studies [10]. From the ML perspective, the comparatively low frequency of SAP in both the training and the testing datasets is a challenge, as in unbalanced datasets ML algorithms tend to classify all instances as the majority class (in this case, no SAP), if not addressed properly. We have tried to solve this problem by using the widely used Synthetic Minority Over-sampling Technique (SMOTE) which helped to improve the performance of our algorithm. Finally, it will be interesting to replicate our results in other post-stroke infections, such as urinary tract infections (UTIs) or colitis. We here chose to focus on SAP as the target outcome as it is the most common post-stroke infection that occurs only a few days after stroke. Future work should determine the generalizability of our approach to other post-stroke infections.
In summary, our results show that automated, data warehouse-based predictions of clinically apparent SAP in the stroke unit setting are feasible and benefit from including parameters of autonomic nervous system function. Such predictions could be useful for identifying high-risk patients, tailor monitoring, and facilitate studies on prophylactic pneumonia management in clinical routine. Future prospective validation studies, however, are needed to fully assess its performance and generalizability prior to a potential implementation into the clinical routine.
Data availability
The datasets generated and analyzed during the current study are not publicly available due to privacy or ethical restrictions but may be available from the corresponding author on reasonable request. Access to the data will be granted to qualified researchers for purposes of reproducing the results or replicating the procedure.
References
GBD 2019 Stroke Collaborators (2021) Global, regional, and national burden of stroke and its risk factors, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet Neurol 20(10):795–820.
Balami JS, Chen RL, Grunwald IQ, Buchan AM (2011) Neurological complications of acute ischaemic stroke. Lancet Neurol 10:357–371
Kumar S, Selim MH, Caplan LR (2010) Medical complications after stroke. Lancet Neurol 9:105–118
Heuschmann PU, Kolominsky-Rabas PL, Misselwitz B, Hermanek P, Leffmann C, Janzen RWC et al (2004) Predictors of in-hospital mortality and attributable risks of death after ischemic stroke—the German stroke registers study group. Arch Intern Med 164:1761
Teh WH, Smith CJ, Barlas RS et al (2018) Impact of stroke-associated pneumonia on mortality, length of hospitalization, and functional outcome. Acta Neurol Scand 138(4):293–300
Koennecke HC, Belz W, Berfelde D, Endres M, Fitzek S, Hamilton F et al (2011) Factors influencing in-hospital mortality and morbidity in patients treated on a stroke unit. Neurology 77:965–972
Meisel C, Meisel A (2011) Suppressing immunosuppression after stroke. N Engl J Med 365(22):2134–2136
Elkind MSV, Boehme AK, Smith CJ, Meisel A, Buckwalter MS (2020) Infection as a Stroke Risk Factor and Determinant of Outcome After Stroke. Stroke. 51(10):3156–3168. https://doi.org/10.1161/STROKEAHA.120.030429. Epub 2020 Sep 8. PMID: 32897811; PMCID: PMC7530056.
Sellars C, Bowie L, Bagg J, Sweeney MP, Miller H, Tilston J et al (2007) Risk factors for chest infection in acute stroke: a prospective cohort study. Stroke 38:2284–2291
Hoffmann S, Malzahn U, Harms H, Koennecke HC, Berger K, Kalic M, Walter G, Meisel A, Heuschmann PU (2012) Development of a clinical score (A2DS2) to predict pneumonia in acute ischemic stroke. Stroke 43:2617–2623
Friedant AJ, Gouse BM, Boehme AK, Siegler JE, Albright KC, Monlezun DJ, George AJ, Beasley TM, Martin-Schild S (2015) A simple prediction score for developing a hospital-acquired infection after acute ischemic stroke. J Stroke Cerebrovasc Dis 24:680–686
Westendorp WF, Nederkoorn PJ, Vermeij JD, Dijkgraaf MG, van de Beek D (2011) Post-stroke infection: a systematic review and meta-analysis. BMC Neurol. 11:110. Published 2011 Sep 20.
Hotter B, Hoffmann S, Ulm L, Meisel C, Bustamante A, Montaner J, Katan M, Smith CJ, Meisel A (2021) External validation of five scores to predict stroke-associated pneumonia and the role of selected blood biomarkers. Stroke 52(1):325–330. https://doi.org/10.1161/STROKEAHA.120.031884. (Epub 2020 Dec 7 PMID: 33280547)
Hoffmann S, Harms H, Ulm L, Nabavi DG, Mackert BM, Schmehl I, Jungehulsing GJ, Montaner J, Bustamante A, Hermans M, Hamilton F, Göhler J, Malzahn U, Malsch C, Heuschmann PU, Meisel C, Meisel A; PREDICT Investigators (2017) Stroke-induced immunodepression and dysphagia independently predict stroke-associated pneumonia - The PREDICT study. J Cereb Blood Flow Metab. 37(12):3671–3682. https://doi.org/10.1177/0271678X16671964. Epub 2016 Oct 14. PMID: 27733675; PMCID: PMC5718319.
Westendorp WF, Dames C, Nederkoorn PJ, Meisel A (2022) Immunodepression, infections, and functional outcome in ischemic stroke. Stroke 53(5):1438–1448. https://doi.org/10.1161/STROKEAHA.122.038867. (Epub 2022 Mar 28 PMID: 35341322)
Smith CJ, Kishore AK, Vail A, Chamorro A, Garau J, Hopkins SJ, Di Napoli M, Kalra L, Langhorne P, Montaner J, Roffe C, Rudd AG, Tyrrell PJ, van de Beek D, Woodhead M, Meisel A (2015) Diagnosis of stroke-associated pneumonia: recommendations from the pneumonia in stroke consensus group. Stroke 46(8):2335–2340. https://doi.org/10.1161/STROKEAHA.115.009617. (Epub 2015 Jun 25 PMID: 26111886)
Tsai HC, Hsieh CY, Sung SF (2022) Application of machine learning and natural language processing for predicting stroke-associated pneumonia. Front Public Health 29(10):1009164. https://doi.org/10.3389/fpubh.2022.1009164.PMID:36249261;PMCID:PMC9556866
Zheng Y, Lin YX, He Q, Zhuo LY, Huang W, Gao ZY, Chen RL, Zhao MP, Xie ZF, Ma K, Fang WH, Wang DL, Chen JC, Kang DZ, Lin FX (2022) Novel machine learning models to predict pneumonia events in supratentorial intracerebral hemorrhage populations: An analysis of the Risa-MIS-ICH study. Front Neurol 25(13):955271. https://doi.org/10.3389/fneur.2022.955271.PMID:36090880;PMCID:PMC9452786
Yan J, Zhai W, Li Z, Ding L, You J, Zeng J, Yang X, Wang C, Meng X, Jiang Y, Huang X, Wang S, Wang Y, Li Z, Zhu S, Wang Y, Zhao X, Feng J (2022) ICH-LR2S2: a new risk score for predicting stroke-associated pneumonia from spontaneous intracerebral hemorrhage. J Transl Med 20(1):193. https://doi.org/10.1186/s12967-022-03389-5.PMID:35509104;PMCID:PMC9066782
Zapata-Arriaza E, Moniche F, Blanca PG, Bustamante A, Escudero-Martínez I, Uclés O, Ollero-Ortiz Á, Sánchez-García JA, Gamero MÁ, Quesada Á, Vidal De Francisco D, Romera M, De la Cruz C, Sanz G, Montaner J (2018) External Validation of the ISAN, A2DS2, and AIS-APS Scores for Predicting Stroke-Associated Pneumonia. J Stroke Cerebrovasc Dis 27(3):673–676. https://doi.org/10.1016/j.jstrokecerebrovasdis.2017.09.059. Epub 2017 Nov 2. PMID: 29103860.
Kishore AK, Vail A, Bray BD, Chamorro A, Napoli MD, Kalra L, Langhorne P, Montaner J, Roffe C, Rudd AG, Tyrrell PJ, van de Beek D, Woodhead M, Meisel A, Smith CJ (2016) Clinical risk scores for predicting stroke-associated pneumonia: A systematic review. Eur Stroke J. 1(2):76–84. https://doi.org/10.1177/2396987316651759. Epub 2016 Jun 1. PMID: 31008268; PMCID: PMC6301233.
Zhang X, Xiao L, Niu L, Tian Y, Chen K (2022) Comparison of six risk scores for stroke-associated pneumonia in patients with acute ischemic stroke: a systematic review and Bayesian network meta-analysis. Front Med (Lausanne) 12(9):964616. https://doi.org/10.3389/fmed.2022.964616.PMID:36314025;PMCID:PMC9596973
Hotter B, Hoffmann S, Ulm L, Montaner J, Bustamante A, Meisel C, Meisel A (2020) Inflammatory and stress markers predicting pneumonia, outcome, and etiology in patients with stroke: Biomarkers for predicting pneumonia, functional outcome, and death after stroke. Neurol Neuroimmunol Neuroinflamm 7(3):e692. https://doi.org/10.1212/NXI.0000000000000692.PMID:32098866;PMCID:PMC7051196
Wang RH, Wen WX, Jiang ZP, Du ZP, Ma ZH, Lu AL, Li HP, Yuan F, Wu SB, Guo JW, Cai YF, Huang Y, Wang LX, Lu HJ (2023) The clinical value of neutrophil-to-lymphocyte ratio (NLR), systemic immune-inflammation index (SII), platelet-to-lymphocyte ratio (PLR) and systemic inflammation response index (SIRI) for predicting the occurrence and severity of pneumonia in patients with intracerebral hemorrhage. Front Immunol 13(14):1115031. https://doi.org/10.3389/fimmu.2023.1115031.PMID:36860868;PMCID:PMC9969881
Meisel A, Meisel C, Harms H, Hartmann O, Ulm L (2012) Predicting post-stroke infections and outcome with blood-based immune and stress markers. Cerebrovasc Dis 33(6):580–588. https://doi.org/10.1159/000338080. (Epub 2012 Jun 15 PMID: 22710977)
Faura J, Bustamante A, Miró-Mur F, Montaner J (2021) Stroke-induced immunosuppression: implications for the prevention and prediction of post-stroke infections. J Neuroinflammation 18(1):127. https://doi.org/10.1186/s12974-021-02177-0.PMID:34092245;PMCID:PMC8183083
Günther A, Salzmann I, Nowack S, Schwab M, Surber R, Hoyer H, Witte OW, Hoyer D (2012) Heart rate variability—a potential early marker of sub-acute post-stroke infections. Acta Neurol Scand 126(3):189–196. https://doi.org/10.1111/j.1600-0404.2011.01626.x. (Epub 2011 Nov 25 PMID: 22118023)
Brämer D, Günther A, Rupprecht S, Nowack S, Adam J, Meyer F, Schwab M, Surber R, Witte OW, Hoyer H, Hoyer D (2019) Very low frequency heart rate variability predicts the development of post-stroke infections. Transl Stroke Res 10(6):607–619. https://doi.org/10.1007/s12975-018-0684-1. (Epub 2019 Jan 7 PMID: 30617993)
Engel O, Akyüz L, da Costa Goncalves AC, Winek K, Dames C, Thielke M, Herold S, Böttcher C, Priller J, Volk HD, Dirnagl U, Meisel C, Meisel A (2015) Cholinergic pathway suppresses pulmonary innate immunity facilitating pneumonia after stroke. Stroke 46(11):3232–3240. https://doi.org/10.1161/STROKEAHA.115.008989. (Epub 2015 Oct 8 PMID: 26451017)
Endres M, Moro MA, Nolte CH, Dames C, Buckwalter MS, Meisel A (2022) Immune pathways in etiology, acute phase, and chronic sequelae of ischemic stroke. Circ Res 130(8):1167–1186. https://doi.org/10.1161/CIRCRESAHA.121.319994. (Epub 2022 Apr 14 PMID: 35420915)
Tablan OC, Anderson LJ, Besser R, Bridges C, Hajjeh R; CDC; Healthcare Infection Control Practices Advisory Committee. Guidelines for preventing health-care—associated pneumonia, 2003: recommendations of CDC and the Healthcare Infection Control Practices Advisory Committee. MMWR Recomm Rep. 2004 Mar 26;53(RR-3):1–36. PMID: 15048056.
Haeusler KG, Schmidt WU, Föhring F, Meisel C, Helms T, Jungehulsing GJ, Nolte CH, Schmolke K, Wegner B, Meisel A, Dirnagl U, Villringer A, Volk HD (2008) Cellular immunodepression preceding infectious complications after acute ischemic stroke in humans. Cerebrovasc Dis 25(1–2):50–58. https://doi.org/10.1159/000111499. (Epub 2007 Nov 22 PMID: 18033958)
Sykora M, Siarnik P, Szabo J, Turcani P, Krebs S, Lang W, Jakubicek S, Czosnyka M, Smielewski P (2019) Baroreflex sensitivity is associated with post-stroke infections. An open, prospective study. J Neurol Sci 406:116450. https://doi.org/10.1016/j.jns.2019.116450. Epub 2019 Sep 3. PMID: 31610381.
Sykora M, Diedler J, Poli S, Rizos T, Turcani P, Veltkamp R, Steiner T (2011) Autonomic shift and increased susceptibility to infections after acute intracerebral hemorrhage. Stroke 42(5):1218–1223. https://doi.org/10.1161/STROKEAHA.110.604637. (Epub 2011 Mar 10 PMID: 21393593)
Swor DE, Thomas LF, Maas MB, Grimaldi D, Manno EM, Sorond FA, Batra A, Kim M, Prabhakaran S, Naidech AM, Liotta EM (2019) Admission heart rate variability is associated with fever development in patients with intracerebral hemorrhage. Neurocrit Care 30(2):244–250. https://doi.org/10.1007/s12028-019-00684-w.PMID:30756320;PMCID:PMC6474355
Nelde A, Klammer MG, Nolte CH, Stengl H, Krämer M, von Rennenberg R, Meisel A, Scheibe F, Endres M, Scheitz JF, Meisel C (2023) Data lake-driven analytics identify nocturnal non-dipping of heart rate as predictor of unfavorable stroke outcome at discharge. J Neurol. https://doi.org/10.1007/s00415-023-11718-x. Epub ahead of print. PMID: 37079032.
Graff B, Gąsecki D, Rojek A, Boutouyrie P, Nyka W, Laurent S, Narkiewicz K (2013) Heart rate variability and functional outcome in ischemic stroke: a multiparameter approach. J Hypertens 31(8):1629–1636. https://doi.org/10.1097/HJH.0b013e328361e48b. (PMID: 23751963)
Mäkikallio AM, Mäkikallio TH, Korpelainen JT, Sotaniemi KA, Huikuri HV, Myllylä VV (2004) Heart rate dynamics predict poststroke mortality. Neurology 62(10):1822–1826. https://doi.org/10.1212/01.wnl.0000125190.10967.d5. (PMID: 15159485)
Heart rate variability: standards of measurement, physiological interpretation and clinical use. Task Force of the European Society of Cardiology and the North American Society of Pacing and Electrophysiology. Circulation (1996) 93(5):1043–65. PMID: 8598068.
Makowski D, Pham T, Lau ZJ, Brammer JC, Lespinasse F, Pham H, Schölzel C, Chen SA (2021) NeuroKit2: A Python toolbox for neurophysiological signal processing. Behav Res Methods 53(4):1689–1696
Tarvainen MP, Ranta-aho PO, Karjalainen PA (2002) An advanced detrending method with application to HRV analysis. IEEE Trans Biomed Eng 49(2):172–175
Lipponen JA, Tarvainen MP (2019) A robust algorithm for heart rate variability time series artefact correction using novel beat classification. J Med Eng Technol 43(3):173–181. https://doi.org/10.1080/03091902.2019.1640306. (Epub 2019 Jul 17 PMID: 31314618)
Acknowledgements
The study was supported by the NeuroCure Clinical Research Center (NCRC), funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany´s Excellence Strategy EXC 2049-390688087. ME also reports funding from Collaborative Research Center ReTune TRR 295- 424778381, BMBF, DZNE, DZHK, EU, Corona Foundation, and Foundation Leducq. AM received funding from the German Research Foundation (SFB/TRR167; ME 1562/4-1) and Foundation Leducq (19CVD01).
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
CM is part of patent applications to detect and predict clinical outcomes and to manage, diagnose, and treat neurological conditions, all of which are outside the submitted work. CHN received research grants from the German Ministry of Research and Education, German Center for Neurodegenerative Diseases, German Center for Cardiovascular Research, and speaker and/or consultation fees from Abbott, Alexion, Astra Zeneca, Bayer Pharma, Bristol-Myers Squibb, Daiichi Sankyo, Novartis, Pfizer Pharma, Portola, and Takeda all outside the submitted work. ME reports grants from Bayer and fees paid to the Charité from Abbott, Amgen, AstraZeneca, Bayer, Boehringer Ingelheim, BMS, Daiichi Sankyo, GSK, Sanofi, Covidien, Novartis, Pfizer, all outside the submitted work. BH reports speaker fees from Pfizer and BMS with no relation to the submitted work. The remaining authors have no conflicts of interest or competing interests to declare. AM received speaker honoraria from Alexion, argenx, Axunio, Boehringer Ingelheim, Bristol-Myers Squibb/Pfizer, Grifols, and Hormosan; honoraria from Alexion, UCB, Janssen, and Merck for consulting services; and financial research support (paid to his institution) from Octapharma, argenx, and Alexion.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Nelde, A., Krumm, L., Arafat, S. et al. Machine learning using multimodal and autonomic nervous system parameters predicts clinically apparent stroke-associated pneumonia in a development and testing study. J Neurol 271, 899–908 (2024). https://doi.org/10.1007/s00415-023-12031-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00415-023-12031-3