Introduction

As one of the most common causes of death, acute myocardial infarction (AMI) is a serious disease [1, 2]. AMI leads to 6.4 million deaths per year in the USA and Europe, and it accounts for more than a third of deaths in other developed countries [3, 4]. According to the China Cardiovascular Health and Disease Report 2021, AMI rose from 23.2/100,000 in 2003 to 62.33/100,000 in 2008 for urban residents in China. AMI results bring heavy burdens to family and society due to its high cost in the treatment. Moreover, baseline diseases such as diabetes would increase the mortality of patients with AMI, which could be explained that diabetes might accelerate the process of atherosclerosis [5, 6].

Heart failure is a common complication of AMI. It is characterized by severe systolic and/or diastolic function impairment of the heart [7, 8]. The progression from AMI to heart failure involves myocardial shock, remodeling, and chronic neuroendocrine system activation [9]. AMI impairs the ventricle function through wall thinning of the infarct zone, ventricular dilatation, and ultimately compensatory hypertrophy and fibrosis [10]. The larger the infarct size leads to heavier burden on the remaining viable myocardium, consequently contributing to higher probability of heart failure [11].

Previous studies have reported that being complicated with heart failure increased the mortality rate of 10 times higher for patients with acute coronary syndrome [12]. It is thus particularly important to predict the risk of heart failure in patients with AMI. Recent studies have shown that carbohydrate antigen 125, brain derived neurotrophic factor, serum soluble growth stimulation expressed gene 2, and interleukin-33 are effective predictors of heart failure in patients with AMI [12, 13]. However, these indicators are rarely measured in clinical trials, which largely limits their clinical practice. In another way, the clinical indexes available from routine laboratory tests during admission would be convenient for evaluating the risk of heart failure, but previous models hardly test the linear relationship between involved variables and terminal event or any interaction among the involved variables. Furthermore, the small sample size or the recruitment of certain subpopulation limits the reliability of the established prediction models to general patients with AMI [14]. In this study, the risk factors related to heart failure in patients with AMI after admission were combined to develop a logistic prediction model and a nomogram was established to evaluate how much the involved indexes contribute to heart failure. This model will help clinicians to predict the possibility of heart failure in patients with AMI during hospitalization and make personalized measures in their therapy process.

Material and methods

Patient inclusion and exclusion

Patients with AMI admitted to Dongyang People’s Hospital from January 2010 to September 2022 were included. Patients with AMI were diagnosed upon getting two of three points: clinical manifestations including chest pain and chest distress; abnormal electrocardiogram indicating for ST segment elevation and complete left bundle branch block; troponin or creatine kinase isoenzyme levels higher than 2 times the upper limitation range [15, 16]. Patients who met one of the following exclusion criteria were removed: a previous history of heart failure; existence of heart failure within 24 h after admission, indicated from by clinical manifestation and laboratory examination; presence of cancer, mental illness, or other serious complications; missing data; no myocardial infarction confirmation by emergency percutaneous coronary intervention (PCI). The terminal event was heart failure during hospitalization, and its diagnosis was based on guidelines from the European Society of Cardiology [17]. Heart failure was diagnosed during hospitalization based on the presence of risk factors, corresponding symptoms or signs, abnormal electrocardiogram, BNP ≥ 35 pg/mL and abnormal findings of echocardiography [17]. This study was approved by the Ethics Committee of Dongyang People’s Hospital. Written informed consent for participation was not required for this study due to its retrospective design, and the study was undertaken in accordance with national legislation and institutional requirements.

Observation indicator collection from a clinical record information mining database

Patient information was extracted retrospectively from a clinical record information mining database (supported Le 9 Co., Ltd.) and included sex, age, history of non-ST segment elevation myocardial infarction (NSTEMI). Laboratory examination results within 24 h after admission were collected, including troponin T, creatine kinase isoenzyme (CKMB), D-dimer, B-type pro-brain natriuretic peptide, aspartate aminotransferase, high-sensitivity C-reactive protein (CRP), creatinine, white blood cell (WBC) count, platelets, hematocrit. Physical examination was performed to get systolic blood pressure (SBP), left ventricular ejection fraction (LVEF) and heart rate. The treatment intervenes for AMI such as emergency PCI (PCI treatment performed within 12 hours after the onset of the disease is defined as emergency PCI treatment) was also included in this study. Each index unit was converted to the current international unit.

Variable screening for inclusion in the established prediction model

RStudio 4.4.1 software and IBM SPSS 25.0 software were used to process the data. After collecting the patient data, the variables with missing data more than 25% were removed. The remaining missing variables were interpolated by multiple imputation function as previously described [18]. Count data have been expressed as numbers (percentages), and measurement data have been represented as medians (interquartile ranges). Following univariate analysis (“twogrps” function in CBCgrps package), the boxTidwell function was used to analyze whether there was a linear relationship between significant variables in the univariate analysis and logitp. A multicollinearity test was then performed to test for the presence of multicollinearity between the involved variables, and variance inflation factors (VIFs) of less than 10 indicated no collinearity [19]. Finally, logistic and bidirectional stepwise regression analyses were used to obtain the independent risk factors, and a nomogram was constructed. The area under the curve (AUC) of the receiver operating characteristic (ROC) curve was used to evaluate the discrimination ability of the established model. At the optimal cutoff value, the prediction accuracy, positive prediction value (PPV) and negative predictive value (NPV) were calculated. Calibration evaluation was used to determine whether the predicted probability was consistent with the observed probability. In addition, a decision curve analysis (DCA) was conducted to evaluate the clinical benefit of the nomogram. A P value less than 0.05 was considered significant. Also, the model based on stacking method was established in the validation group, which combined the advantages of SVM, C5.0 and Xgboost methods as described previously [20, 21]. The importance of involved variables in the model was evaluated by SHAP summary plots. Finally, the differences among the discrimination powers of established models were tested by using Delong test based on the AUCs.

Results

Baseline characteristics of subjects

A total of 1875 patients were enrolled in this study, including 1313 patients in the modeling group, and 562 patients in the validation group. The global heart failure rate was 5.1% (97/1875). The clinical characteristics were comparable between the modeling and validation groups (Table 1).

Table 1 Baseline characteristics comparison on patients in the modeling group and the validation groupa

Variable screening by univariate and multivariate analyses

Univariate analysis showed there were 12 indicators significantly related to heart failure in the modeling group (Table 2) (P <  0.05). There were linear relationships between 12 variables and logitp (Supplementary Table 1), but no multiple-collinearity was found between the variables (Supplementary Table 2). Logistic and stepwise regression analyses showed that six variables (troponin T, age, heart rate, SBP, LVEF and pro-brain natriuretic peptide) were independent risk factors of heart failure (Table 3) (P <  0.05). Consequently, a logistic prediction model was constructed.

Table 2 Univariate analysis of enrolled variables between heart failure and control groupa
Table 3 Multivariate logistic regression analysis and Stepwise regression analysis of involved variables in modeling group

Establishment of a nomogram from the modeling group

Based on the logistic regression model, a nomogram graph was established by evaluating how much the significant risk factors contributed to the heart failure risk (Fig. 1). Individual points corresponding to the indicated variables were obtained by mapping their detailed results to the upper scoring line; then, the summed score was displayed as the total points. Finally, the risk value of heart failure was obtained by matching the total points to the scoring line at the bottom of the graph.

Fig. 1
figure 1

Nomogram graph displaying the contributions of involved variables to the risk of heart failure among patients with AMI during hospitalization

Prediction ability of the established model in the modeling validation groups

The AUC value of the model was 0.829 in the modeling group (95% CI: 0.779–0.879), with an optimal cut-off point of 0.038, a sensibility of 70.0%, and a specificity of 81.8%, (Fig. 2A). The prediction accuracy was 0.706 (95% CI:69.6–71.2%), with a PPV of 0.126 (95% CI:9.5–15.8%) and a NPV of 0.986 (95% CI:97.9–99.4%). The P value of the calibration curve was 0.940, R2: 0.246, slope: 1.000, Brier: 0.041 (Fig. 2B). DCA curves showed that the model could predict the risk of heart failure with good net clinical benefit (Fig. 2C).

Fig. 2
figure 2

Evaluation of the prediction ability of the established model in the modeling group. A Discrimination power indicated by ROC curve; B Prediction accuracy indicated by calibration chart; C Net clinical benefit indicated by DCA

The AUC value for patients in the validation group was 0.846 (95% CI: 0.774–0.918), with an optimal cut-off point of 0.050, a sensibility of 70.8%, and a specificity of 87.1% (Fig. 3A). The prediction accuracy was 0.717 (95% CI:71.6–71.8%), with a PPV of 0.148 (95% CI:9.7–20.0%) and a NPV of 0.989 (95% CI:97.9–99.9%). The P value of the calibration chart was 0.947, R2: 0.257, slope: 1.000, Brier: 0.045 (Fig. 3B). Additionally, the DCA curve was far away from the two extreme curves (Fig. 3C).

Fig. 3
figure 3

Evaluation of the prediction ability of the established model in the validation group. A Discrimination power indicated by ROC curve; B Prediction accuracy indicated by calibration chart; C Net clinical benefit indicated by DCA

Another model was constructed by the stacking method using SVM, C5.0 and XGboost. The AUC value was 0.821 (Fig. 4A), which was comparable to the logistic model by Delong test (P = 0.662). Among the enrolled variables, the top three important variables were the level of pro-BNP higher than 15,000 pg/ml, heart rate and LVEF (Supplementary Fig. 1). However, the calibration of the final integrated model is poor (Fig. 4B).

Fig. 4
figure 4

The machine learning models by the ensemble method. A ROC curves; B Calibration curves

Discussion

AMI and its subsequent heart failure are the leading causes of death worldwide [22]. Therefore, early identification of AMI patients with high risks of heart failure during hospitalization could cause doctors’ more attention in the patients’ management. The independent risk factors related to heart failure could be corrected by active inventions, eventually leading to improvement in their survival rate [23, 24]. In this study, the risk factors associated with heart failure in AMI patients during hospitalization were age, heart rate, SBP, LVEF and troponin T and pro-brain natriuretic peptide levels. A logistic model and could efficiently predict the risk of heart failure and the nomogram could visually display how the model predicted the risk.

Previous studies have shown that age is closely related to the incidence of heart failure [12]. Due to slow blood circulation and altered myocardial metabolism in the elderly, the gradual myocardial cell atrophy and insufficient myocardial reserve capacity lead to heart failure after AMI [25]. A number of studies have shown that long-term hypertension eventually leads to heart failure [26, 27]. In fact, hypertension can cause excessive myocardial cell hypertrophy, myocardial cell degeneration and necrosis, eventually leading to heart failure. In addition, patients with heart failure who have lower systolic blood pressure have a worse prognosis than those with higher systolic blood pressure [28]. This phenomenon could be explained by impairment in pump function and cardiac output [28]. In this study, lower blood pressure also contributed significantly to the heart failure in AMI patients. Therefore, for AMI patients with low blood pressure, the blood pressure should be corrected as soon as possible to improve the coronary perfusion pressure and reduce the occurrence of heart failure.

BNP belongs to a family of natriuretic peptides that are mainly secreted by atrial myocytes in the normal heart [29], and its secretion is directly proportional to the severity of left ventricular dysfunction [30]. Brain natriuretic peptide and troponin T have long been used as indicators to predict heart failure [31,32,33,34,35,36]. Therefore, for patients with high BNP, limiting the blood volume could partly reduce the pressure on the heart by diuresis and other means, finally reducing the incidence of heart failure. This study also showed that heart rate was positively linked to heart failure risk in patients with AMI. This possibly occurs because tachycardia causes cardiac cycle shortening, mainly diastole, and reduces blood return to the heart, resulting in significant reductions in cardiac output per minute and more serious coronary ischemia [37]. Therefore, in the case with left ventricular dysfunction caused by acute myocardial infarction, β-blockers and calcium-channel blockers could be applied to reduce the heart rate, improving the clinical outcome [38]. LVEF is one of the important variables to evaluate the type of heart failure, and was negatively associated with the risk of new-onset heart failure among patients with acute myocardial infarction [17]. This is consistent with the findings of others [39].

In addition, machine learning models by Stacking methods showed no advantage in performance compared with the logistic model. Moreover, the decision support provided by machine learning models is often difficult to interpret [40]. In this study, the prediction model is presented as a nomogram, and has advantages in clinical interpretation and application.

In this study, the established model helped with the early identification of patients at high risk of heart failure. Considering that previous studies mainly focus on the effect of a single factor on the outcomes, this study synthesizes all of the independent factors by establishing a prediction model. This approach makes them intuitive and clearly present, which is helpful for clinician analysis and diagnosis. Compared with the prediction model of Tan et al., our prediction model owned a higher AUC value with less involved factors, indicating a higher discrimination ability [41]. Compared with the prediction model of Yan et al., our model is applicable to a larger population of patients than the elderly women alone [14]. Although studies have shown that women with coronary artery disease have a higher risk of heart failure than men, it is generally accepted that men are more susceptible to cardiovascular disease [39, 42]. Therefore, models that target larger patient populations are highly desirable. The different variables in our prediction model from previous study is due to the variable screening method and patient population heterogenicity. In detail, this study clearly confirmed the relationship between continuous variables and logitp, as well as the absence of collinearity relationship between variables, making it more scientific and accurate compared with previous work. Furthermore, the variables included in our study are more clinically available, which will enable clinicians to quickly identify patients at risk for heart failure [40].

Of course, this study has its limitations. First, case data from only one hospital was selected for this single-center retrospective analysis, and the external validation was not performed due to no data from another center. This limitation might decrease the credibility of the study. Second, data for other important factors, such as socio-economic status and detailed periods before the occurrence of heart failure, were not included due to missing information in the database.

Conclusion

Age, heart rate, systolic blood pressure, and troponin T, left ventricular ejection fraction and pro-brain natriuretic peptide levels were the independent risk factors for heart failure in patients with AMI. The logistic prediction model can effectively identify patients with high risks of heart failure with good discriminatory power and accuracy. Ultimately, this intuitive and convenient model will be useful for clinicians in AMI patients’ management.