Introduction

Stroke-associated pneumonia (SAP) is one of the most common medical complications in patients with acute ischemic stroke (AIS), with an estimated incidence ranging between 5 and 26% [1]. SAP reduces the quality of life and increases the treatment costs, hospital stay, and risk of mortality in AIS patients [2,3,4,5,6]. Therefore, there is a need to develop fast and reliable tools to identify high-risk patients to improve clinical outcomes.

Previous studies have established different scoring systems for early pneumonia prediction after AIS, such as the A2DS2 scale, the AIS-APS scale, and the ISAN scale [7,8,9]. However, these tools are based solely on clinical data, and their prediction efficiency is moderate [10]. Brain imaging is necessary to diagnose stroke and evaluate the extent of the disease. Diffusion-weighted imaging-magnetic resonance imaging (DWI-MRI) is the most sensitive and accurate imaging method for diagnosing AIS and has been widely used in studies related to stroke [11, 12]. Several studies identified a correlation between the brain infarct size on DWI-MRI and prognosis following an AIS and may also have a role in the development of complications [13,14,15]. Studies have also found an association between several MRI radiological features, including the location, infarct volume, the number of lobes involved, and the brain atrophy score, with the risk of developing SAP [16, 17].

Radiomics uses algorithms to objectively extract a large number of quantitative features from medical images. This data can be used to transform subjective visual evaluation into an objective evaluation data-driven evaluation of traditional radiologic characteristics [18,19,20,21,22]. This technique is increasingly being used to facilitate the diagnosis of stroke lesions [23, 24], predict early outcomes [25,26,27], and evaluate the long-term prognosis of stroke [28, 29]. However, to our knowledge, no studies have been conducted evaluating the role of radiomics in predicting SAP following an AIS.

Therefore, in this study, we aimed to explore the predictive value of radiomics in predicting SAP and construct a prediction model based on clinical features and DWI-MRI radiomics features to predict SAP following AIS. The model was developed into a nomogram, and decision curve analysis (DCA) was performed to evaluate the clinical utility of the model.

Materials and methods

Study population

AIS patients who underwent a DWI-MRI scan from January 2018 to December 2021 were selected from our institution. All patients aged 18 years or above who were immediately hospitalized within 24 h following the onset of AIS symptoms and had a confirmed diagnosis of AIS on DW-MRI as defined by the World Health Organization [30, 31] and a National Institute of Health stroke scale (NIHSS) score of 15 or less were included in the study. Patients that presented with diseases that had clinical symptoms similar to pneumonia, such as pulmonary edema, pulmonary embolism, pulmonary atelectasis, tuberculosis, pulmonary tumor, and non-infective interstitial lung disease (n = 3), and those who had pneumonia before admission (n = 17) were excluded from the study. In addition, patients who lacked the complete clinical data (n = 276) and those who lacked an MRI or had severe artifacts on MRI (n = 16) were also excluded. The patients' enrollment flow chart was illustrated in Fig. 1. Finally, 298 patients were included. None of the patients underwent intravenous thrombolysis. The patients were randomly divided into a training cohort (n = 208) and a validation cohort (n = 90) at a ratio of 7:3.

Fig. 1
figure 1

The flow chart of patients' enrollment

Diagnostic criteria for SAP

SAP was defined as a 'spectrum of lower respiratory tract infections occurring within the first 7 days after the onset of stroke'. Clinicians diagnosed SAP according to the data retrieved from chest images, clinical signs and symptoms, and laboratory parameters per guidelines issued by the Centers for Disease Control and Prevention criteria (CDC) [32]. The patients diagnosed with SAP were recorded and were divided into the SAP and non-SAP groups according to the criteria above.

Ethical considerations

The study was approved by the Medical Ethics Committee of our institution: (2018) Medical Research Review No.04. Written informed consent was obtained from all patients participating in this study.

Clinical data extraction

The clinical data, including demographics, past history, comorbidities, characteristics of condition on admission, laboratory results and image features, were extracted for the first time after admission and retrieved from the patient's medical records. The demographics included age, sex and body mass index (BMI). Past history included smoking and stroke. Comorbidities included hypertension, diabetes and dyslipidemia. Characteristics of condition on admission included dysphagia, NIHSS score, and the Modified Rankin Scale (mRS) score. The mRS score was used to measure the level of disability following AIS. Based on this score, the patients were divided into functionally independent if they had an mRS score of 2 or less and functionally dependent if they had an mRS score above 2 [33, 34]. Laboratory results included platelets (PLT), creatinine (Cr), aspartate transaminase to alanine transaminase ratio (AST/ALT), total cholesterol (Tch), low-density lipoprotein (LDL), high-density lipoprotein (HDL), triglycerides (TG), fasting plasma glucose (FPG), homocysteine (HCY), albumin (ALB), and the neutrophil to lymphocyte ratio (NLR). Image features included stenosis location (internal carotid artery (ICA), middle cerebral artery (MCA)), the extent of the stenosis (more or less than 50%), infarction side (left, right, or bilateral) and volume.

MRI acquisition

All patients underwent a DWI-MRI within 24 h of admission. The DWI-MRIs were acquired using a GE 3.0 T MRI scanner, using a repetition time (TR) of 4090 ms, an echo time (TE) of 98.0 ms, a field of view (FOV) = 230 mm x 230 mm, a matrix of 192 × 192, a slice thickness to gap ration of 5 mm/1.5 mm, a b value of 0, and 1000 s/mm2.

Lesion site annotating and feature extraction

The radiomics analysis process was divided into 4 phases; lesion site annotation, feature extraction, feature selection, and model construction (Fig. 2). Two radiologists annotated the ischemic lesions on the patient’s DWI using the 3D-Slicer software version 4.10.2. Then, the consistency of the annotated volumes was evaluated by calculating the intra-class correlation coefficients (ICC). An ICC greater than 0.75 indicates a good agreement [35].

Fig. 2
figure 2

Flowchart of radiomics analysis

Subsequently, radiomics features were extracted using the PyRadiomics version 3.0.1 software as indicated by the Image Biomarker Standardization Initiative [36]. These features included first-order, shape and texture. The first-order and texture features were extracted from the original image, the Gaussian Laplace filtered image, and the wavelet filtered image.

Feature selection and model construction

Univariate analysis was used to identify the factors that differed between all patients that developed SAP and those that did not. Then, the significant variables in the training cohort were inputted into the multivariate logistics regression (MLR) model to determine the independent clinical predictors of SAP (P < 0.05). These features were then used to construct the clinical prediction model. The risk ratios of the predicted factor were expressed as odds ratio (OR) (95% confidence interval).

Spearman's correlation coefficient was used to calculate the correlation and redundancy of features in the training cohort. The features were classified as redundant if they had a Spearman correlation coefficient higher than 0.8. For each pair of features that have been marked as redundant, choose to retain one of the features, the other redundant feature was removed [37]. Subsequently, the optimal predictive features were screened out utilizing the least absolute shrinkage and selection operator (LASSO) with ten-fold cross-validation [38]. Eventually, the radiomics score was calculated for each patient based on the linear combination of weighted selection parameters for the relevant LASSO coefficients of the optimal features, which was used to construct the radiomics prediction model. Finally, a combined prediction model was built based on both radiomics and clinical features, as well as a clinical prediction model containing only the clinical features and a radiomics prediction model containing only the radiomics features were built for comparison. Then, the best-performing model was developed into a nomogram. Subsequently, a subgroup analysis was performed based on age, sex, BMI, and stroke severity on admission to test the generalization ability of our model.

Statistical analyses

The Python software (version 3.0) and R software (version 4.2.1) were used for statistical analyses. The normally distributed variables were expressed as mean ± standard deviation (SD), while the non-normally distributed variables were expressed as median (first quartile, third quartile). The variables between the SAP and non-SAP groups were compared using the student's t-test for the normally distributed variables and the Mann–Whitney U test for the non-normally distributed variables. The categorical variables were expressed as numbers (percentage), and the chi-squared (χ2) test was used for comparison. Pearson correlation analysis was used to test the correlation. The predictive performance of all 3 models was compared by calculating the area under the curve (AUC) of a receiver operating characteristics curve (ROC), the accuracy, sensitivity, specificity, negative predictive values (NPV), and positive predictive values (PPV). The Delong test was used to compare the differences in ROC curves between the models. The nomogram's accuracy was evaluated using calibration curves [39]. A decision curve analysis (DCA) was used to evaluate the clinical value of the nomogram in the validation cohort [40]. For all statistical tests, a p-value below 0.05 was considered statistically significant.

Results

Clinicodemographic features

The clinicodemographic features of the SAP and non-SAP patients in the training and validation cohorts are summarized in Table 1. A total of 298 patients were enrolled in this study, of whom 208 were assigned to the training cohort, and 90 were assigned to the validation cohort randomly. Of the 298 patients enrolled in the study, 39 (13.09%) patients developed SAP within 7 days following admission. A significant difference was noted in the age, BMI, dysphagia, NIHSS score, mRS score (> 2), TG, NLR, stenosis location (ICA) and volume between the SAP and non-SAP patients of the training cohort (all P < 0.05).

Table 1 Risk factors in the training and validation cohorts

Identification of the independent clinical predictors

The MLR identified TG, NLR, NIHSS score, dysphagia, and stenosis location (ICA) as independent predictors for SAP (Table 2). Based on the result of the MLR, the clinical prediction model was defined by the formula: Y = -1.059TG + 0.373NLR + 0.188NIHSS score + 2.433Dysphagia + 1.542stenosis location (ICA).

Table 2 Univariate and multivariable regression findings

The association between stenosis location (ICA), infarct volume and SAP

The infarct volume was found to be significantly larger in patients with ICA stenosis than in patients without ICA stenosis (P = 0.006) (Fig. 3A).

Fig. 3
figure 3

The association between stenosis location (ICA), infarct volume and SAP. A The distribution of infarct volume in patients with ICA stenosis and patients without ICA stenosis. B The distribution of infarct volume in patients with SAP and patients without SAP

The infarct volume was found to be significantly larger in patients with SAP than in patients without SAP (P = 0.017) (Table 1, Fig. 3B). In addition, both in patients with SAP and in patients without SAP, infarct volume was larger in patients with ICA stenosis.

Infarct volume showed moderate performance in predicting whether AIS patients would develop SAP, with an AUC of 0.635 (Fig. 4).

Fig. 4
figure 4

Performance of infarct volume in predicting SAP

Feature extraction and radiomics scores

A total of 1041 features predictive of SAP were extracted. The 10 most relevant radiomics features for SAP in the training cohort were obtained by LASSO with ten-fold cross-validation (Fig. 5A-C). The distribution of the radiomics scores of patients with and without SAP for the training and validation cohorts is illustrated in Fig. 5D. The radiomics scores of patients with SAP and those without SAP were 2.30 (2.18, 2.65) and 2.13 (1.91, 2.40), respectively, for the training cohort and 2.29 (2.09, 2.52) and 2.09 (1.82, 2.29), respectively for the validation cohorts. The radiomics scores in patients with SAP were generally higher than in those without SAP. Wilcoxon's test showed a significant difference in the radiomics scores between the patients with SAP in both the training and validation cohorts (P < 0. 05). The mean ICC between the lesion volumes annotated by the 2 radiologists was 0.99 (95% CI 0.99–1, P < 0.05), indicating that the reproducibility of the feature extraction was good.

Fig. 5
figure 5

The optimal radiomics features for SAP. A The tuning parameter (k) in the tenfold cross-validation LASSO model. B The coefficients plotted against log(k). C The most relevant radiomics features predictive of SAP. D The distribution of radiomics scores in the training and validation cohorts

We investigated the association between radiomics scores and infarct volume as well as radiomics scores and independent risk factors (NIHSS, NLR, TG) (Fig. 6). The radiomics scores and NIHSS scores exhibited a positive linear relationship, with a Pearson correlation coefficient (r) of 0.171(P = 0.014) (Fig. 7A). The radiomics scores and volume exhibited a positive linear relationship, with a Pearson correlation coefficient (r) of 0.372(P < 0.001) (Fig. 7B).

Fig. 6
figure 6

Correlation analysis chart

Fig. 7
figure 7

The association between stenosis location (ICA), infarct volume and SAP. A Association between radiomics scores and NIHSS scores. B Association between radiomics scores and volume

Prediction performance of the models

The prediction performance of the 3 models is summarized in Table 3, while Fig. 8 illustrates the ROCs for the 3 models. The clinical prediction model achieved an AUC of 0.785 (95%CI 0.673–0.889) and 0.736 (95%CI 0.629–0.837) for the training and validation cohorts, respectively. The sensitivity and specificity of the model were 0.667 and 0.768, respectively, in the training cohort and 0.417 and 0.872, respectively, in the validation cohort.

Table 3 The performance of three prediction models in the training and validation cohorts
Fig. 8
figure 8

ROC curves of the 3 prediction SAP models for the training (A) and validation (B) cohorts

The radiomics prediction model achieved an AUC of 0.660 (95% CI 0.546–0.766) and 0.646 (95% CI 0.541–0.756) in the training and validation cohorts, respectively. The sensitivity and specificity of the model were 0.417 and 0.826, respectively, in the training cohort and 0.533 and 0.733, respectively, in the validation cohort.

The combined prediction model had an AUC of 0.859 (95% CI 0.759–0.936) and 0.830 (95% CI 0.758–0.896) in the training and validation cohorts, respectively. The sensitivity and specificity of the model were 0.750 and 0.826, respectively, in the training cohort and 0.800 and 0.787, respectively, in the validation cohort.

Compared with the other 2 models, the combined model had a significantly higher AUC in both the training and validation cohorts (Delong test P < 0.05) (Table 4) and was therefore used to develop the clinical nomogram.

Table 4 Comparison of ROC between the models using the Delong test

Development and validation of the nomogram

The nomogram of the combined model is illustrated in Fig. 9A. Clinicians could use the nomogram to predict the risk of developing SAP by summing the risk of the relevant clinical variables and the radiomics risk score. The calibration curves for the training and validation nomograms are illustrated in Fig. 9B. The calibration plot displayed a good level of consistency between the predicted and actual probabilities for both cohorts. The nomogram's DCA is displayed in Fig. 9C. The DCA confirmed the clinical utility of the model.

Fig. 9
figure 9

Nomogram of the combined prediction SAP model. A The nomogram. B The calibration curves for the training and validation nomograms. C The decision curve for the nomogram

As shown in Table 5 and Fig. 10, the subgroup analysis showed that the performance of the nomogram was not influenced by patient age, sex, BMI and stroke severity on admission (Delong test P > 0.05).

Table 5 Subgroup analysis of AUCs using the Delong test
Fig. 10
figure 10

Subgroup analysis of the combined model. After dividing by the age (A), sex (B), BMI (C), and stroke severity on admission (D)

Discussion

SAP is a potentially preventable complication of stroke. The early identification of SAP is essential to limit the adverse clinical outcome of pneumonia [41, 42]. In this study, we explored the predictive value of radiomics in predicting SAP and construct an artificial intelligent model based on clinical features and DWI-MRI radiomics features to predict SAP following AIS. The combined model performed better than clinical model and radiomics model. The DCA confirmed the clinical effectiveness of the proposed model.

It has been confirmed by several studies that brain MRI-based imaging features are closely related to SAP. Zhao et al. [43] showed that the DWI Alberta Stroke Program Early Computed Tomography Score (DWI-ASPECTS) used to predict the severity of AIS could also be used to predict the occurrence of SAP in patients with mild AIS [1, 44]. We investigated the association between radiomics scores, infarct volumes and NIHSS scores, found that there was a linear positive correlation between radiomics scores and NIHSS scores (Pearson:0.171, P = 0.014), as well as between radiomics scores and infarct volume (Pearson:0.372, P < 0.001). Yu et al. [17] found that neuroimaging features play a key role in predicting SAP. Brain atrophy and core infarct volume are closely related to the occurrence of SAP [45, 46]. We found that larger infarct volume and higher radiomics score were associated with a greater risk of developing pneumonia. We also investigated the predictive performance of infarct volume and obtained moderate performance, with an AUC of 0.635. The NIHSS score reflects the severity of the stroke [47]. Previous studies have shown that stroke severity increases the risk of developing SAP [1]. Therefore, the radiomics score reflects the severity of stroke to a certain extent and affects the occurrence of SAP.

Studies have shown a good association between several radiomics features and clinical stroke outcomes, such as prognosis and recurrence [18]. Currently, the radiomics features has been certified to improve the prediction ability of prognosis prediction. Tang et al. [48] found an association between specific morphology radiomics features and stroke recurrence in patients with symptomatic intracranial atherosclerotic stenosis. With the addition of radiomics features, the AUC of the prediction model was increased by 11.7% and 17% in the training and validation sets, respectively. Zhou et al. [49] demonstrated that the radiomics features performed well in predicting AIS outcomes. With the addition of radiomics features, the AUC of the prediction model was increased by 10.1% and 10.6% in the training and validation sets, respectively. However, relatively few studies have used radiomics features to predict the risk of complications following AIS. Immunological changes are associated with an increased tendency to respiratory infections [13]. And neuroanatomical correlates are associated with immunological changes after stroke and increased risk of infection, so it is easy to develop SAP [16]. An activation of the sympathetic nervous system is the main immunosuppressive mechanism leading to a high incidence of infections after stroke [50]. Studies have revealed that significant correlations with texture features and neural density in the side of the hippocampus contralateral to the ischemic area. These preliminary results suggest that texture features can reflect microscopic changes that occur post-stroke, even in an area spared by ischemia [29]. Therefore, we believe that the extracted radiomics features, belonging to the texture features, can reflect the microscopic changes that occur after a stroke and provide a good representation of neural alterations caused by low immunity. It may potentially be used to predict the risk of developing infections after AIS. In this study, we used the PyRadiomics software version 3.0.1 to extract radiomics features from the manually segmented brain lesions [18]. A total of 1041 radiomics features were obtained, of which 10 were identified as highly predictive of SAP. The majority of these features belonged to the wavelet feature cluster. The wavelet feature cluster measures asymmetry around the mean, which represents the tissue damage caused by the infarct. A study reported that this type of features might be associated with complications due to other pathological changes [49]. Among these features, LowGrayLevelZoneEmphasis is a feature used to describe the distribution of gray levels in an image, and in MRI images of stroke patients, lesion site usually shows different gray levels, which reflects the difference in density between the lesion site and the surrounding normal site. Higher LowGrayLevelZoneEmphasis values indicate that areas of lower gray levels are more prominent around the lesion site, which may suggest pathological changes in the contours or margins of the infarcts, reflecting heterogeneity of the lesion site, which may be associated with SAP. Cluster Prominence represents the cluster significance and is a measure of GLCM skewness and asymmetry, which was associated with stroke prognosis in previous studies [51], may be related to the occurrence of pneumonia. However, more research is required to understand the molecular mechanisms involved behind the development of this feature.

Consistent with previous studies, NLR, dysphagia, and NIHSS score were identified as independent predictors of SAP [1, 42, 52,53,54]. In addition, the stenosis location (ICA) and low TG were also identified as independent risk factors for predicting SAP. Chlamydia pneumoniae infection can also promote the development of atherosclerosis [55], Cao J et al. found that 84.0% of Chinese patients with carotid atherosclerotic plaques tested positive for chlamydia pneumonia-specific antigens [56]. As atherosclerotic plaques grow larger, leading to the narrowing of the arteries [57], the body's immunity system is weakened, leading to an increased risk of infections [16]. A previous study reported that when compared to MCA, the ICA causes larger infarcts, thus increasing the risk of SAP [58]. Our study found that patients with ICA stenosis were more likely to develop SAP, and this relationship persisted after adjustment for confounding factors. Patients with ICA stenosis had larger infarct volumes, and larger infarct volumes can affect multiple brain functions and impair immunity, ultimately leading to functional impairment and increased susceptibility to infections [16]. TGs are a potential source of arachidonic acids. The lipases and cyclooxygenase found in the lipid droplets of macrophages can catalyze the esterification of arachidonic acid to gradually transform it into eicosanoids compounds [59, 60]. These compounds are important lipid mediators of inflammation, and have an important role in balancing the extent of the inflammatory response [61]. Some studies have shown that when the concentration of TG is high, the TG-rich lipoprotein can combine with lipopolysaccharide to exert immune regulation on cells crucial for hosting the immune defense [62], thus reducing the risk of SAP. The same association was not reported in other studies [63, 64]. Therefore, further research is recommended to confirm the specific relationship between TG and SAP.

Our study has some limitations that have to be acknowledged. Since this study was based on data extracted retrospectively, the model's prediction accuracy needs to be validated prospectively. The use of single-agency data may lead to issues of geographic specificity and representativeness of the sample, a limitation that may restrict our generalizations about different groups. Therefore, in future studies, we will actively explore the possibility of external validation to enhance the reliability and generalizability of the study. In addition, the effect of treatment during hospitalization was not taken into account in this model. Finally, the combined model had a high NPV and a low PPV, possibly due to the low incidence of SAP in our cohort (SAP: 39/298, 13.1%).

Conclusion

The addition of the radiomics features to the clinical model improved the prediction of SAP in AIS patients, which verified its feasibility. The proposed nomogram could identify patients at risk of developing SAP and thus provide timely interventions.