Development and validation of an interpretable model for predicting sepsis mortality across care settings

Lee, Young Seok; Han, Seungbong; Lee, Ye Eun; Cho, Jaehwa; Choi, Young Kyun; Yoon, Sun-Young; Oh, Dong Kyu; Lee, Su Yeon; Park, Mi Hyeon; Lim, Chae-Man; Moon, Jae Young

doi:10.1038/s41598-024-64463-0

Development and validation of an interpretable model for predicting sepsis mortality across care settings

Article
Open access
Published: 13 June 2024

Volume 14, article number 13637, (2024)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Development and validation of an interpretable model for predicting sepsis mortality across care settings

Download PDF

Young Seok Lee¹^na1,
Seungbong Han²^na1,
Ye Eun Lee²,
Jaehwa Cho³,
Young Kyun Choi⁴,
Sun-Young Yoon⁵,
Dong Kyu Oh⁶,
Su Yeon Lee⁶,
Mi Hyeon Park⁶,
Chae-Man Lim⁶,
Jae Young Moon⁵ &
the Korean Sepsis Alliance (KSA) Investigators

462 Accesses
Explore all metrics

Abstract

There are numerous prognostic predictive models for evaluating mortality risk, but current scoring models might not fully cater to sepsis patients’ needs. This study developed and validated a new model for sepsis patients that is suitable for any care setting and accurately forecasts 28-day mortality. The derivation dataset, gathered from 20 hospitals between September 2019 and December 2021, contrasted with the validation dataset, collected from 15 hospitals from January 2022 to December 2022. In this study, 7436 patients were classified as members of the derivation dataset, and 2284 patients were classified as members of the validation dataset. The point system model emerged as the optimal model among the tested predictive models for foreseeing sepsis mortality. For community-acquired sepsis, the model’s performance was satisfactory (derivation dataset AUC: 0.779, 95% CI 0.765–0.792; validation dataset AUC: 0.787, 95% CI 0.765–0.810). Similarly, for hospital-acquired sepsis, it performed well (derivation dataset AUC: 0.768, 95% CI 0.748–0.788; validation dataset AUC: 0.729, 95% CI 0.687–0.770). The calculator, accessible at https://avonlea76.shinyapps.io/shiny_app_up/, is user-friendly and compatible. The new predictive model of sepsis mortality is user-friendly and satisfactorily forecasts 28-day mortality. Its versatility lies in its applicability to all patients, encompassing both community-acquired and hospital-acquired sepsis.

Development and validation of a score to predict mortality in ICU patients with sepsis: a multicenter retrospective study

Article Open access 29 July 2021

Machine learning for the prediction of sepsis: a systematic review and meta-analysis of diagnostic test accuracy

Article Open access 21 January 2020

The association between four scoring systems and 30-day mortality among intensive care patients with sepsis: a cohort study

Article Open access 27 May 2021

Introduction

Sepsis is a significant global health issue due to its steep mortality rates and economic impact. While our understanding and treatment of sepsis have evolved, sepsis-related deaths still account for an alarming 30–45% of global mortality, representing almost 20% of all deaths worldwide^1,2,3,4. Considering that the prognosis of sepsis is influenced by an individual’s clinical condition and the nature of the pathogen, timely individual risk assessment using a prognostic predictive model is crucial. This allows for proper allocation of medical resources and can potentially reduce mortality^5,6,7,8.

There are numerous prognostic predictive models for evaluating mortality risk^{9,10,11,12,13,14}. Notably, the Acute Physiological and Chronic Health Assessment (APACHE) score and the Simple Acute Physiology Score (SAPS) are frequently employed in intensive care units (ICUs) to gauge mortality risk^10,11. With evolving patient demographics in ICUs, such as increasing numbers of elderly, multimorbid, and immunocompromised patients, these scoring systems have been periodically updated to ensure effectiveness^10,15,16. While beneficial, these models have limitations. Designed primarily for critically ill patients in ICUs, their application is typically upon ICU admission rather than at the initial diagnosis. Given that sepsis can be diagnosed in diverse settings, from ICUs to general wards or emergency rooms, and the crucial nature of timely intervention, current scoring models might not fully cater to sepsis patients’ needs. There is a pressing need for a predictive model that facilitates rapid mortality prediction and individualized treatment planning for sepsis patients across all care settings.

This study developed and validated a new model for sepsis patients that accurately predicts the 28-day mortality, which is user-friendly and suitable for any care setting. Particularly, we have prioritized ensuring that the final model is interpretable, thereby enhancing its usability for clinicians.

Results

Clinical characteristics of patients in the derivation dataset and the validation dataset

In the sepsis registry database, 10,440 patients were assigned to the derivation dataset, whereas 3344 patients were assigned to the validation dataset. Of these, 26 patients with coronavirus disease 2019 (COVID-19) in the derivation dataset and 226 patients with COVID-19 in the validation dataset were excluded. In addition, 2978 patients in the derivation dataset and 834 patients in validation dataset were excluded because of unclear survival status at 28 days after sepsis diagnosis. Finally, 7436 patients were included in the derivation dataset, and 2284 patients were included in the validation dataset (Fig. 1).

The clinical characteristics of the study population for the derivation and validation datasets are described in Table 1. In both datasets, nonsurvivors were older and more likely to be male compared to survivors. In addition, nonsurvivors had higher clinical frailty scale (CFS), sequential organ failure assessment (SOFA), and Charlson’s comorbidity index scores compared to survivors. Comorbidities were similar between the two groups, except malignancies. Sepsis caused by a respiratory infection had a poorer prognosis than sepsis caused by other infections. The use of steroids, ventilators, and continuous renal replacement therapy (CRRT) was higher in nonsurvivors than in survivors. Nonsurvivors had a lower blood pressure and body temperature and higher heart and respiratory rates compared to survivors at the time of diagnosis of sepsis. In addition, nonsurvivors had a greater rate of organ dysfunction based on initial laboratory findings compared to survivors at the time of diagnosis of sepsis (Supplementary Table S1).

Table 1 Clinical characteristics of study population in derivation dataset and validation dataset.

Full size table

New models for predicting 28-day mortality in sepsis patients

In the derivation dataset, multivariable logistic regression identified the following as significant predictors for 28-day mortality: age, CFS, presence of malignancy, SOFA score, sepsis originating from respiratory infections, use of CRRT, body temperature, albumin levels, international normalized ratio (INR), C-reactive protein (CRP) levels, and lactic acid levels at the time of sepsis diagnosis (Supplementary Table S2). These 11 factors were used as predictive variables across the seven prediction models.

To identify the most precise models for predicting sepsis mortality, we assessed the point system (PS), ordinary logistic regression (OL), random forest (RF), regularized discriminant analysis (RDA), support vector machine (SVM), gradient-boosting machine (GBM), and ensemble method (ENS) predictive models using multiple metrics. Based on the analysis of the area under the receiver operating characteristic curve (AUC), calibration plots, the Hosmer–Lemeshow test statistic, and Brier score in both datasets, including cross-validation, the performance of the PS model demonstrated similarity to that of other models in predicting sepsis mortality (Fig. 2 and Supplementary Tables S3–5). Additionally, the PS model’s ease of interpretation for clinical application led to its selection as the final model.

The final predictive model for sepsis mortality was structured as follows: Age (scored from 0 for < 40 years to 7 for > 80 years in increments based on decade ranges), CFS (scored from 0 for 1–3, up to 7 for 6–9), presence of malignancy (7 points), SOFA score (scored from 0 for 2–7 up to 13 for 18–21), sepsis due to respiratory infection (6 points), CRRT usage (7 points), body temperature (scored from 0 for > 38℃ to 7 for < 36 ℃), albumin level (scored from 0 for ≥ 3.5 g/dL to 7 for < 2.5 g/dL), INR (> 1.3 earns 3 points), CRP level (scored from 0 for ≤ 10 mg/dL up to 2 for > 20 mg/dL), and lactic acid level (scored as 0 for < 4 mmol/L and 7 for ≥ 4 mmol/L). The maximum attainable score was 73 points (Table 2). The associated mortality risk based on the total score can be found in Supplementary Table S6.

Table 2 New predictive model of sepsis mortality.

Full size table

New predictive model of sepsis mortality in different clinical situations

To assess the efficacy of the new model across diverse clinical scenarios, we examined the ROC curve in both the derivation and validation datasets for community-acquired and hospital-acquired sepsis. For community-acquired sepsis, the model’s performance was satisfactory (derivation dataset AUC: 0.779, 95% CI 0.765–0.792; validation dataset AUC: 0.787, 95% CI 0.765–0.810; Fig. 3). Similarly, for hospital-acquired sepsis, it performed well (derivation dataset AUC: 0.768, 95% CI 0.748–0.788; validation dataset AUC: 0.729, 95% CI 0.687–0.770; Fig. 3).

To further assess its efficacy for predicting outcomes for critically ill patients, we compared its ability to predict 28-day mortality against the established SAPS 3 system using both datasets. Among the 3,612 critically ill sepsis patients in the derivation dataset, the performance of the new scoring model (AUC: 0.745) was comparable to that of the SAPS 3 model (AUC: 0.722) (difference in AUC; 95% CI 0.005–0.042; P = 0.012), indicating similar predictive accuracy. Similarly, in the validation dataset comprising 949 critically ill sepsis patients, the new model (AUC: 0.750) tended to show statistically insignificant non-inferior predictive accuracy compared to SAPS (difference in AUC; 95% CI − 0.001 to 0.071; P = 0.063) (Fig. 4).

Clinical utility of new predictive model of sepsis mortality

To enhance the clinical utility of the new model, we developed a calculator app using shinyapps.io. The calculator, accessible at https://avonlea76.shinyapps.io/shiny_app_up/, is user-friendly and compatible with both smartphones and computers (e.g. electronic medical record). Given the criticality of swift decision-making for patients with sepsis, this app promises to be an invaluable tool for predicting sepsis mortality in clinical settings.

Discussion

This study introduced a new model for predicting 28-day mortality among sepsis patients, incorporating 11 variables. The model exhibited commendable performance across various clinical scenarios, including both community- and hospital-acquired sepsis, positioning it as a valuable instrument for assessing mortality risk across all sepsis patients.

Despite advancements in sepsis strategies informed by numerous studies, the overall prognosis for sepsis remains suboptimal^1,2,3,4. For better outcomes, it is imperative for clinicians to differentiate between patients with likely favorable outcomes and those at higher risk, ensuring tailored treatment approaches. Mismanagement of medical resources can escalate mortality rates, which makes efficient resource allocation vital^17,18,19. If prognostic predictions can be made at the time of sepsis diagnosis, those with a poorer outlook could be prioritized for ICU beds and resources over those with a more optimistic prognosis.

While established scoring models such as APACHE and SAPS are renowned for their predictive accuracy across diverse cohorts of critically ill patients¹⁴, they primarily target this patient subset and not the broader population. Their adaptation for sepsis patients might be less than ideal. Following the revised sepsis-3 definition²⁰, new prognostic models (encompassing biomarker models, immune dysfunction scores, and machine-learning models) have been developed and validated^{13,21,22,23,24,25,26}. Yet, small sample sizes and restricted accessibility limit some, such as the biomarker model and immune dysfunction score^25,26. Machine-learning models, although superior in terms of predictive accuracy compared to older scoring models, are typically institution-specific and their performance may not be generalizable. Notably, any lack of laboratory data can postpone the presentation of results^13,21,22.

The sepsis mortality prediction model presented here has noteworthy advantages over its predecessors. First, this study established an optimal scoring point system for predicting the sepsis mortality among seven predictive models. The predictive models ranged from original logistic regression models to the latest machine learning models.

Based on the AUC, calibration plots, Hosmer–Lemeshow test statistics, and Brier score, the PS model did not exhibit inferior performance to other machine learning techniques in the fivefold cross validation and the validation dataset. In fact, it demonstrated superior performance in terms of calibration. In addition, the PS model offered interpretive advantages, leading to its selection as the final prediction model. We concluded that our model is the most suitable among several models for predicting sepsis mortality. Second, its applicability spans both community- and hospital-acquired sepsis, demonstrating robust performance in both patient categories (Fig. 3). Furthermore, it consistently demonstrated performance comparable to SAPS in terms of predicting outcomes, even among critically ill sepsis patients (Fig. 4). Third, the model’s design readily accommodates real-time clinical data, enabling prompt prognosis predictions concurrent with sepsis diagnosis. Fourth, to augment the practicality of our model, we developed a user-friendly calculator app, which provides rapid mortality prediction, facilitating informed decision-making regarding clinical management and potentially mitigating sepsis-associated mortality rates. Finally, the model encompasses foundational clinical parameters such as age, CFS, and malignancy presence, infection sites, inflammation metrics, and severity indicators (including organ dysfunction) making it clinically intuitive. Notably, the inclusion of the CFS, recently identified as a predictor of sepsis prognosis, strengthens the model²⁷.

However, this study had several limitations. First, potential selection bias arose in determining survival outcomes for patients discharged within 28 days after sepsis diagnosis. Second, only Korean patients participated in this study. However, the number of included patients was relatively large. Finally, the sepsis mortality rate in this study was relatively high because it was conducted in a university-affiliated hospital. During the study period, these hospitals admitted more severe patients compared to other periods, primarily because of the low medical resources during the COVID-19 pandemic. Despite these limitations, our model was developed using a relatively large sample size and diverse modeling techniques. In addition, the calculator based on our model is user-friendly and convenient for clinical settings.

In conclusion, the new predictive model of sepsis mortality is user-friendly and effectively forecasts 28-day mortality. Its versatility lies in its applicability to all patients, encompassing both community-acquired and hospital-acquired sepsis, at the time of diagnosis. To further validate this novel model, multinational studies with larger cohorts across diverse scenarios are needed.

Methods

Nationwide multicenter prospective sepsis cohort

The cohort was supported by a research program from the Korea Disease Control and Prevention Agency. Clinical data were amassed by the Korean Sepsis Alliance (KSA), encompassing tertiary referral and university-affiliated hospitals, to study the epidemiology, clinical practices, and outcomes of sepsis patients in the Republic of Korea. Between September 2019 and December 2021, 20 hospitals contributed to the sepsis cohort project (KSA 3 database). From January 2022 to December 2022, this participation narrowed to 15 hospitals (KSA 4 database). The database differentiated patients into community-acquired sepsis (diagnosed in emergency rooms) and hospital-acquired sepsis (diagnosed 48 h post-admission to a general ward). The study cohort comprised adults aged ≥ 19 years diagnosed with life-threatening organ dysfunction caused by a dysregulated host response to infection, characterized by a ≥ 2-point increase in the SOFA score²⁰. Data spanned from “time zero” (either when community-acquired sepsis was identified in emergency rooms or when hospital-acquired sepsis was diagnosed by medical professionals after 48 h of admission in the general ward) until hospital discharge or death^{27,28,29,30,31}.

Study design

This research was a multicenter prospective observational cohort study using data from the Korean Sepsis Alliance’s sepsis cohort. The derivation dataset (KSA 3) was gathered from 20 hospitals between September 2019 and December 2021; the validation dataset (KSA 4) was collected from 15 hospitals between January 2022 and December 2022. We designed and assessed various predictive models for sepsis mortality using both datasets, aiming to select the most accurate model. Subsequently, we developed an app to implement the final model, facilitating its clinical application. The primary outcome was the 28-day mortality post-sepsis diagnosis. Due to the lack of follow-up data in the KSA database, we established an arbitrary definition for 28-day mortality following the diagnosis of sepsis. For patients discharged more than 28 days after the diagnosis of sepsis, the 28-day mortality was recorded. However, patients discharged within 28 days were categorized as survivors if they received antibiotics for more than 7 days and did not require life-sustaining treatment. This decision was based on the observation that most infections are treated with antibiotics for more than 7 days after the diagnosis. Patients from other categories were excluded due to uncertainty regarding their 28-day mortality status. In addition, patients with coronavirus disease 2019 (COVID-19) were excluded due to the potential impact on clinical outcomes.

Definition of variables

The CFS is a robustly validated nine-point scale that classifies patients based on clinical insight. It was used at sepsis diagnosis, leveraging clinical data from up to 2 weeks prior to diagnosis^27,32,33. Comorbid patients had prior disease diagnoses at the sepsis detection time, and the infection site was identified as the infection’s origin. Comorbidities and infection sites were classified following the management of severe sepsis in Asia’s intensive care units (MOSAICS) study method^2,34. Malignancy, including hematological and solid tumors, indicated current malignancy presence at sepsis diagnosis. Vital signs and laboratory results were recorded at sepsis diagnosis; the usages of ventilators and CRRT were determined based on need at sepsis diagnosis.

Statistical analysis

We followed the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) guidelines in developing and validating a 28-day mortality predictive model for sepsis patients³⁵. All statistical analyses were conducted using R software (http://www.r-project.org). Descriptive statistics for both derivation and validation datasets are presented as means ± standard deviation (SDs) or frequencies with percentages for continuous and categorical variables, respectively. For predicting 28-day mortality, various models including OL, RDA, and three machine learning algorithms (RF, SVM, and GBM) were employed.

The RDA employs a classification rule rooted in regularized group covariance matrices, targeting enhancement against multicollinearity of covariates. The RF, an ensemble learning theory derivative, produces multiple decision trees during training³⁶. The final class prediction arises from the majority prediction across all trees. RF effectively captures both simple and intricate classification functions by recognizing predictor interactions. SVM identifies a hyperplane in a high-dimensional space that distinctly separates data points of varying classes. It showcases robustness in processing high-dimensional data and employs an influential regularization method to prevent overfitting³⁷. GBM, another ensemble learning algorithm, strengthens predictions by progressively optimizing the log-likelihood loss function, starting from a base model³⁸. Its strengths include discerning complex nonlinear relationships and adeptly managing diverse data types. In addition, we contemplated two other techniques for final model selection: the PS³⁹ and an ENS amalgamating OL, RF, RDA, SVM, and GBM, which averaged mortality probabilities for a final prediction. Among machine learning models, various options exist for hyperparameter selection, including grid search, random search, genetic algorithm, and Bayesian optimization. However, current research suggests that none of these methods are extensively employed. Typically, the default option is widely utilized in medical data modeling^40,41,42. Nonetheless, hyperparameter selection was conducted for RF. The hyperparameters were determined through five-fold cross-validation on the derivation dataset; the R package tuneRanger was employed for hyperparameter selection during RF modeling. The hyperparameters for other machine learning techniques were selected based on the default settings.

To pinpoint pivotal predictive variables, univariable logistic regression models were initially fitted, with variables with a p-value < 0.1 becoming primary candidates for predictive models. Multicollinearity was addressed by iterative removal of the least significant variable in the multivariable logistic regression model, scrutinized via the variation inflation factor index (VIF). Following intensive consultations with clinical and statistical experts, we chose 11 predictive variables for the concluding model. For continuous variables such as age, CRP level, and SOFA score, which were included in the logistic model, linearity was examined using multivariable fractional polynomial models based on the R package mfp (Multi-variable Fractional Polynomials)⁴³. Considering that no significant nonlinear relationships were observed, the logistic model was fitted in its original scale without any variable transformations. We also fitted a PS model based on a logistic model. Unlike other machine learning models, a risk score was assigned to each risk factor, which has interpretive advantages. For this purpose, continuous variables were categorized and scored according to the level of each category corresponding to a patient profile. Each patient’s risk characteristics were assigned a score, which was used to determine the overall risk. PS models provide interpretable results compared with black box models, thus facilitating decision-making for clinical researchers involved in patient care. However, PS models require an underlying base model, and thus, the logistic model previously fitted is considered the base model. The main principle comprises approximating the linear combination of covariates as an overall risk score with respect to the risk. For detailed construction of the PS model, please refer to Sullivan et al., Zhang et al., and Greving et al.^39,44,45. In the derivation dataset, the rate of missing data was not high. Variables with a relatively high rate of missing data were body mass index (4.78%), international normalized ratio (INR) (4.96%), and albumin (1.29%), all of which were < 5%. Indeed, 6730 individuals from the derivation dataset and 2033 individuals from the validation dataset, after excluding those with significant missing data, were included in the analyses for model fitting. We performed model fitting by categorizing variables with well-established categories. In general, ML methods do not require variable categorization. However, the creation of a PS model requires categorization. In this case, we categorized variables according to clinical criteria or socially recognized categories (e.g. age). The categories for continuous variables were decided based on the ease of model interpretability. For example, CFS and body temperature values were derived from prior studies^27,30. Values for albumin, INR, and lactate were established based on clinical significance (for instance, albumin levels: ≥ 3.5 g/dL as normal, 2.5 g/dL ≤ albumin < 3.5 g/dL as low, and < 2.5 g/dL indicating significant hypoalbuminemia; INR > 1.3 as abnormal; lactic acid ≥ 4 mmol/L as hyperlactatemia). Furthermore, no scaling or other variable transformations for continuous variables were performed prior to PS and logistic regression modeling. However, in other machine learning techniques, continuous variables were standardized before modeling.

Model performances of PS, OL, RF, RDA, SVM, GBM, and ENS were gauged through various metrics: the area under the AUC, calibration plots, the Hosmer–Lemeshow test statistic, and the Brier score. The AUC gauges patient risk discrimination, and the calibration plot juxtaposes predicted against observed probabilities. For calibration assessment, patients were categorized into 10 risk factions. Recognizing the known calibration measurement inefficiencies of the Hosmer–Lemeshow test, we abstained from p-value calculations, emphasizing test statistic model comparisons instead.

Internal validation was conducted to evaluate the apparent performance of the derivation set using the AUC, Hosmer–Lemeshow test statistics, and Brier score. Additionally, fivefold cross-validation was employed to assess predictive performance. Furthermore, external validation was conducted to evaluate model performance in the validation dataset. In addition, our final model was juxtaposed with the SAPS 3 scoring system using Delong’s AUC comparison method⁴⁶. RF, RDA, SVM, and GBM implementations employed R packages: randomForest, klaR, e1071, and mboost, respectively^47,48,49,50. ROC curves were used to gauge the discrimination capability of the predictive model across clinical scenarios such as community-acquired or hospital-acquired sepsis, evaluated via R packages pROC and ‘predictABEL’. All p-values were two-sided, with values under 0.05 deemed statistically significant.

Ethics statement

The study received approval from the institutional review board (IRB) of each participating hospital (Supplementary Tables S7, 8). In alignment with the principles outlined in the Declaration of Helsinki, we prioritized patient privacy and confidentiality. The requirement for written informed consent was waived by the IRBs of the participating hospitals.

Data availability

Data will be shared upon reasonable request to the corresponding author only after permission by the institutional review board of each participating hospital.

References

Rudd, K. E. et al. Global, regional, and national sepsis incidence and mortality, 1990–2017: Analysis for the Global Burden of Disease Study. Lancet (Lond., Engl.) 395, 200–211. https://doi.org/10.1016/s0140-6736(19)32989-7 (2020).
Article Google Scholar
Li, A. et al. Epidemiology, management, and outcomes of sepsis in ICUs among countries of differing national wealth across Asia. Am. J. Respir. Crit. Care Med. 206, 1107–1116. https://doi.org/10.1164/rccm.202112-2743OC (2022).
Article PubMed Google Scholar
Markwart, R. et al. Epidemiology and burden of sepsis acquired in hospitals and intensive care units: A systematic review and meta-analysis. Intensive Care Med. 46, 1536–1551. https://doi.org/10.1007/s00134-020-06106-2 (2020).
Article PubMed PubMed Central Google Scholar
Bauer, M. et al. Mortality in sepsis and septic shock in Europe, North America and Australia between 2009 and 2019- results from a systematic review and meta-analysis. Crit. Care (Lond., Engl.) 24, 239. https://doi.org/10.1186/s13054-020-02950-2 (2020).
Article Google Scholar
McGrath, S. P., MacKenzie, T., Perreard, I. & Blike, G. Characterizing rescue performance in a tertiary care medical center: A systems approach to provide management decision support. BMC Health Serv. Res. 21, 843. https://doi.org/10.1186/s12913-021-06855-w (2021).
Article PubMed PubMed Central Google Scholar
Zhang, Z., Ho, K. M., Gu, H., Hong, Y. & Yu, Y. Defining persistent critical illness based on growth trajectories in patients with sepsis. Crit. Care (Lond., Engl.) 24, 57. https://doi.org/10.1186/s13054-020-2768-z (2020).
Article Google Scholar
Gavelli, F., Castello, L. M. & Avanzi, G. C. Management of sepsis and septic shock in the emergency department. Intern. Emerg. Med. 16, 1649–1661. https://doi.org/10.1007/s11739-021-02735-7 (2021).
Article PubMed PubMed Central Google Scholar
Evans, L. et al. Surviving sepsis campaign: International guidelines for management of sepsis and septic shock 2021. Intensive Care Med. 47, 1181–1247. https://doi.org/10.1007/s00134-021-06506-y (2021).
Article PubMed PubMed Central Google Scholar
Quintairos, A., Pilcher, D. & Salluh, J. I. F. ICU scoring systems. Intensive Care Med. https://doi.org/10.1007/s00134-022-06914-8 (2022).
Article PubMed PubMed Central Google Scholar
Zimmerman, J. E., Kramer, A. A., McNair, D. S. & Malila, F. M. Acute Physiology and Chronic Health Evaluation (APACHE) IV: Hospital mortality assessment for today’s critically ill patients. Crit. Care Med. 34, 1297–1310. https://doi.org/10.1097/01.Ccm.0000215112.84523.F0 (2006).
Article PubMed Google Scholar
Le Gall, J. R., Lemeshow, S. & Saulnier, F. A new simplified acute physiology score (SAPS II) based on a European/North American multicenter study. JAMA 270, 2957–2963. https://doi.org/10.1001/jama.270.24.2957 (1993).
Article PubMed Google Scholar
Lemeshow, S. et al. Mortality probability models (MPM II) based on an international cohort of intensive care unit patients. JAMA 270, 2478–2486 (1993).
Article CAS PubMed Google Scholar
Goh, K. H. et al. Artificial intelligence in sepsis early prediction and diagnosis using unstructured data in healthcare. Nat. Commun. 12, 711. https://doi.org/10.1038/s41467-021-20910-4 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Haniffa, R., Isaam, I., De Silva, A. P., Dondorp, A. M. & De Keizer, N. F. Performance of critical care prognostic scoring systems in low and middle-income countries: a systematic review. Crit. Care (Lond., Engl.) 22, 18. https://doi.org/10.1186/s13054-017-1930-8 (2018).
Article Google Scholar
Flaatten, H. et al. The status of intensive care medicine research and a future agenda for very old patients in the ICU. Intensive Care Med. 43, 1319–1328. https://doi.org/10.1007/s00134-017-4718-z (2017).
Article CAS PubMed Google Scholar
Nassar, A. P., Malbouisson, L. M. & Moreno, R. Evaluation of simplified acute physiology score 3 performance: A systematic review of external validation studies. Crit. Care (Lond., Engl.) 18, R117. https://doi.org/10.1186/cc13911 (2014).
Article Google Scholar
Emanuel, E. J. et al. Fair allocation of scarce medical resources in the time of Covid-19. N. Engl. J. Med. 382, 2049–2055. https://doi.org/10.1056/NEJMsb2005114 (2020).
Article PubMed Google Scholar
Sabatello, M., Burke, T. B., McDonald, K. E. & Appelbaum, P. S. Disability, ethics, and health care in the COVID-19 pandemic. Am. J. Public Health 110, 1523–1527. https://doi.org/10.2105/ajph.2020.305837 (2020).
Article PubMed PubMed Central Google Scholar
Kirkpatrick, J. N., Hull, S. C., Fedson, S., Mullen, B. & Goodlin, S. J. Scarce-resource allocation and patient triage during the COVID-19 pandemic: JACC review topic of the week. J. Am. Coll. Cardiol. 76, 85–92. https://doi.org/10.1016/j.jacc.2020.05.006 (2020).
Article CAS PubMed PubMed Central Google Scholar
Singer, M. et al. The third international consensus definitions for sepsis and septic shock (sepsis-3). JAMA 315, 801–810. https://doi.org/10.1001/jama.2016.0287 (2016).
Article CAS PubMed PubMed Central Google Scholar
Kong, G., Lin, K. & Hu, Y. Using machine learning methods to predict in-hospital mortality of sepsis patients in the ICU. BMC Med. Inform. Decis. Mak. 20, 251. https://doi.org/10.1186/s12911-020-01271-2 (2020).
Article PubMed PubMed Central Google Scholar
Li, K., Shi, Q., Liu, S., Xie, Y. & Liu, J. Predicting in-hospital mortality in ICU patients with sepsis using gradient boosting decision tree. Medicine 100, e25813. https://doi.org/10.1097/md.0000000000025813 (2021).
Article PubMed PubMed Central Google Scholar
Khwannimit, B., Bhurayanontachai, R. & Vattanavanit, V. Validation of the sepsis severity score compared with updated severity scores in predicting hospital mortality in sepsis patients. Shock (Augusta, Ga) 47, 720–725. https://doi.org/10.1097/shk.0000000000000818 (2017).
Article PubMed Google Scholar
Hou, N. et al. Predicting 30-days mortality for MIMIC-III patients with sepsis-3: A machine learning approach using XGboost. J. Transl. Med. 18, 462. https://doi.org/10.1186/s12967-020-02620-5 (2020).
Article CAS PubMed PubMed Central Google Scholar
Fang, W. F. et al. Development and validation of immune dysfunction score to predict 28-day mortality of sepsis patients. PLoS One 12, e0187088. https://doi.org/10.1371/journal.pone.0187088 (2017).
Article CAS PubMed PubMed Central Google Scholar
Mikacenic, C. et al. A two-biomarker model predicts mortality in the critically ill with sepsis. Am. J. Respir. Crit. Care Med. 196, 1004–1011. https://doi.org/10.1164/rccm.201611-2307OC (2017).
Article CAS PubMed PubMed Central Google Scholar
Lee, H. Y. et al. Preexisting clinical frailty is associated with worse clinical outcomes in patients with sepsis. Crit. Care Med. 50, 780–790. https://doi.org/10.1097/ccm.0000000000005360 (2022).
Article PubMed Google Scholar
Hyun, D. G. et al. Mortality of patients with hospital-onset sepsis in hospitals with all-day and non-all-day rapid response teams: A prospective nationwide multicenter cohort study. Crit. Care (Lond., Engl.) 26, 280. https://doi.org/10.1186/s13054-022-04149-z (2022).
Article Google Scholar
Im, Y. et al. Time-to-antibiotics and clinical outcomes in patients with sepsis and septic shock: A prospective nationwide multicenter cohort study. Crit. Care (Lond., Engl.) 26, 19. https://doi.org/10.1186/s13054-021-03883-0 (2022).
Article Google Scholar
Park, S. et al. Normothermia in patients with sepsis who present to emergency departments is associated with low compliance with sepsis bundles and increased in-hospital mortality rate. Crit. Care Med. 48, 1462–1470. https://doi.org/10.1097/ccm.0000000000004493 (2020).
Article CAS PubMed Google Scholar
Jeon, K. et al. Characteristics, management and clinical outcomes of patients with sepsis: A multicenter cohort study in Korea. Acute Crit. Care 34, 179–191. https://doi.org/10.4266/acc.2019.00514 (2019).
Article PubMed PubMed Central Google Scholar
Rockwood, K. et al. A global clinical measure of fitness and frailty in elderly people. CMAJ Can. Med. Assoc. J. Assoc. Med. Can. 173, 489–495. https://doi.org/10.1503/cmaj.050051 (2005).
Article Google Scholar
Moorhouse, P. & Rockwood, K. Frailty and its quantitative clinical evaluation. J. R. Coll. Physicians Edinb. 42, 333–340. https://doi.org/10.4997/jrcpe.2012.412 (2012).
Article CAS PubMed Google Scholar
Phua, J. et al. Management of severe sepsis in patients admitted to Asian intensive care units: Prospective cohort study. BMJ (Clin. Res. Ed.) 342, d3245. https://doi.org/10.1136/bmj.d3245 (2011).
Article Google Scholar
Moons, K. G. et al. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): Explanation and elaboration. Ann. Intern. Med. 162, W1-73. https://doi.org/10.7326/m14-0698 (2015).
Article PubMed Google Scholar
Breiman, L. J. M. L. Random forests. Mach. Learn. 45, 5–32 (2001).
Article Google Scholar
Statnikov, A., Wang, L. & Aliferis, C. F. A comprehensive comparison of random forests and support vector machines for microarray-based cancer classification. BMC Bioinform. 9, 319. https://doi.org/10.1186/1471-2105-9-319 (2008).
Article CAS Google Scholar
Natekin, A. & Knoll, A. Gradient boosting machines, a tutorial. Front. Neurorobot. 7, 21. https://doi.org/10.3389/fnbot.2013.00021 (2013).
Article PubMed PubMed Central Google Scholar
Sullivan, L. M., Massaro, J. M. & D’Agostino, R. B. Sr. Presentation of multivariate data for clinical use: The Framingham Study risk score functions. Stat. Med. 23, 1631–1660. https://doi.org/10.1002/sim.1742 (2004).
Article PubMed Google Scholar
Yang, L. et al. Study of cardiovascular disease prediction model based on random forest in eastern China. Sci. Rep. 10, 5245. https://doi.org/10.1038/s41598-020-62133-5 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Su, X. et al. Prediction for cardiovascular diseases based on laboratory data: An analysis of random forest model. J. Clin. Lab. Anal. 34, e23421. https://doi.org/10.1002/jcla.23421 (2020).
Article CAS PubMed PubMed Central Google Scholar
Couronné, R., Probst, P. & Boulesteix, A. L. Random forest versus logistic regression: A large-scale benchmark experiment. BMC Bioinform. 19, 270. https://doi.org/10.1186/s12859-018-2264-5 (2018).
Article Google Scholar
Ambler, G. & Benner, A. mfp: Multivariable fractional polynomials (2023).
Zhang, X. et al. Symptomatic intracranial hemorrhage after mechanical thrombectomy in Chinese ischemic stroke patients: The ASIAN score. Stroke 51, 2690–2696. https://doi.org/10.1161/strokeaha.120.030173 (2020).
Article CAS PubMed Google Scholar
Greving, J. P. et al. Development of the PHASES score for prediction of risk of rupture of intracranial aneurysms: A pooled analysis of six prospective cohort studies. Lancet Neurol. 13, 59–66. https://doi.org/10.1016/s1474-4422(13)70263-1 (2014).
Article PubMed Google Scholar
DeLong, E. R., DeLong, D. M. & Clarke-Pearson, D. L. Comparing the areas under two or more correlated receiver operating characteristic curves: A nonparametric approach. Biometrics 44, 837–845 (1988).
Article CAS PubMed Google Scholar
Liaw, A. & Wiener, M. J. R. N. Classification and regression by randomForest. R News 2, 18–22 (2002).
Google Scholar
Weihs, C., Ligges, U., Luebke, K. & Raabe, N. klaR analyzing German business cycles. In Data Analysis and Decision Support (eds Baier, D. et al.) (Springer-Verlag, 2005).
Google Scholar
Meyer, D. et al. e1071: misc functions of the department of statistics, probability theory group (formerly: E1071), TU Wien. 1 (2019).
Hothorn, T., Buehlmann, P., Kneib, T., Schmid, M. & Hofner, B. mboost: Model-based boosting. R Package Version 2, 9–7 (2022).
Google Scholar

Download references

Acknowledgements

The authors would like to thank all participating intensivists in the Korean Sepsis Alliance.

Funding

This work was supported by the Research Program funded by the Korea Disease Control and Prevention Agency (Fund code 2019E280500, 2020E280700, 2021-10-026) and supported by Korean Sepsis Alliance (KSA) affiliated with Korean Society of Critical Care Medicine (KSCCM). This work was also supported in part by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2022R1F1A1063027).

Author information

These authors contributed equally: Young Seok Lee and Seungbong Han.
A list of authors and their affiliations appears at the end of the paper.

Authors and Affiliations

Division of Pulmonary, Allergy, and Critical Care Medicine, Department of Internal Medicine, Korea University Guro Hospital, Seoul, Republic of Korea
Young Seok Lee
Department of Biostatistics, Korea University College of Medicine, Seoul, Republic of Korea
Seungbong Han & Ye Eun Lee
Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, Gangnam Severance Hospital, Yonsei University College of Medicine, Seoul, Republic of Korea
Jaehwa Cho
Division of Infectious Disease and Critical Care Medicine, Department of Internal Medicine, Chungnam National University College of Medicine, Chungnam National University Sejong Hospital, Sejong, Republic of Korea
Young Kyun Choi
Division of Pulmonary, Allergy, and Critical Care Medicine, Department of Internal Medicine, Chungnam National University College of Medicine, Chungnam National University Sejong Hospital, Sejong, Republic of Korea
Sun-Young Yoon & Jae Young Moon
Department of Pulmonary and Critical Care Medicine, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Republic of Korea
Dong Kyu Oh, Su Yeon Lee, Mi Hyeon Park, Chae-Man Lim, Sang‑Bum Hong & Suk‑Kyung Hong
Department of Critical Care Medicine, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, Republic of Korea
Gee Young Suh, Kyeongman Jeon & Ryoung‑Eun Ko
Department of Pulmonary and Critical Care Medicine, Seoul National University Bundang Hospital, Seongnam, Republic of Korea
Young‑Jae Cho, Yeon Joo Lee & Sung Yoon Lim
Department of Pulmonary, Allergy and Critical Care Medicine, Hallym University Sacred Heart Hospital, Anyang, Republic of Korea
Sunghoon Park
Department of Internal Medicine, Kangwon National University Hospital, Chuncheon, Republic of Korea
Jeongwon Heo
Department of Acute Care Surgery, Korea University Anam Hospital, Seoul, Republic of Korea
Jae‑myeong Lee
Department of Internal Medicine, Daegu Catholic University Medical Center, Daegu Catholic University College of Medicine, Daegu, Republic of Korea
Kyung Chan Kim
Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, Inje University Sanggye Paik Hospital, Seoul, Republic of Korea
Youjin Chang
Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, Seoul National University College of Medicine, Seoul, Republic of Korea
Sang‑Min Lee
Division of Allergy, Pulmonary and Critical Care Medicine, Department of Internal Medicine, Pusan National University Yangsan Hospital, Yangsan, Republic of Korea
Woo Hyun Cho
Department of Anesthesiology, Chonnam National University Hospital, Gwangju, Republic of Korea
Sang Hyun Kwak
Department of Internal Medicine, Research Center for Pulmonary Disorders, Chonbuk National University Medical School, Jeonju, Republic of Korea
Heung Bum Lee
Department of Pulmonary and Critical Care Medicine, Ulsan University Hospital, University of Ulsan College of Medicine, Ulsan, Republic of Korea
Jong‑Joon Ahn
Department of Internal Medicine, Jeju National University Hospital, Jeju National University School of Medicine, Jeju, Republic of Korea
Gil Myeong Seong
Division of Allergy, Pulmonary, and Critical Care Medicine, Department of Internal Medicine, Chungnam National University College of Medicine, Chungnam National University Hospital, Daejeon, Republic of Korea
Song I. Lee
Department of Internal Medicine, Hanyang University College of Medicine, Seoul, Republic of Korea
Tai Sun Park
Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, Severance Hospital, Yonsei University College of Medicine, Seoul, Republic of Korea
Su Hwan Lee
Department of Pulmonary and Allergy, Department of Internal Medicine, Regional Respiratory Center, Yeungnam University Hospital, Daegu, Republic of Korea
Eun Young Choi
Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, Inje University Ilsan Paik Hospital, Inje University College of Medicine, Goyang, Republic of Korea
Hyung Koo Kang
Division of Acute Care Surgery, Department of Surgery, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Republic of Korea
Suk‑Kyung Hong

Authors

Young Seok Lee
View author publications
You can also search for this author in PubMed Google Scholar
Seungbong Han
View author publications
You can also search for this author in PubMed Google Scholar
Ye Eun Lee
View author publications
You can also search for this author in PubMed Google Scholar
Jaehwa Cho
View author publications
You can also search for this author in PubMed Google Scholar
Young Kyun Choi
View author publications
You can also search for this author in PubMed Google Scholar
Sun-Young Yoon
View author publications
You can also search for this author in PubMed Google Scholar
Dong Kyu Oh
View author publications
You can also search for this author in PubMed Google Scholar
Su Yeon Lee
View author publications
You can also search for this author in PubMed Google Scholar
Mi Hyeon Park
View author publications
You can also search for this author in PubMed Google Scholar
Chae-Man Lim
View author publications
You can also search for this author in PubMed Google Scholar
Jae Young Moon
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

the Korean Sepsis Alliance (KSA) Investigators

Sang‑Bum Hong
, Suk‑Kyung Hong
, Gee Young Suh
, Kyeongman Jeon
, Ryoung‑Eun Ko
, Young‑Jae Cho
, Yeon Joo Lee
, Sung Yoon Lim
, Sunghoon Park
, Jeongwon Heo
, Jae‑myeong Lee
, Kyung Chan Kim
, Youjin Chang
, Sang‑Min Lee
, Woo Hyun Cho
, Sang Hyun Kwak
, Heung Bum Lee
, Jong‑Joon Ahn
, Gil Myeong Seong
, Song I. Lee
, Tai Sun Park
, Su Hwan Lee
, Eun Young Choi
& Hyung Koo Kang

Contributions

Lee YS, Han S, and Moon JY had full access to all the data in the study and take responsibility for the integrity of the data as well as the accuracy of the data analysis. Lee YS, Han S, and Moon JY performed study concept and design. Lee YS, Han S, Lee YE, and Moon JY performed acquisition, analysis, and interpretation of data. Lee YS, Han S, and Moon JY performed drafting of the manuscript. All authors performed critical revision of the manuscript for important intellectual content. Lee YS, Han S, and Lee YE carried out statistical analysis. Lim CM and Han S obtained funding. Cho J, Choi YK, Yoon SY, Oh DK, Lee SY, Park MH, and Lim CM provided administrative, technical, or material support. Lee YS and Moon JY performed study supervision. All authors made manuscript approval. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Jae Young Moon.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Tables.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lee, Y.S., Han, S., Lee, Y.E. et al. Development and validation of an interpretable model for predicting sepsis mortality across care settings. Sci Rep 14, 13637 (2024). https://doi.org/10.1038/s41598-024-64463-0

Download citation

Received: 18 January 2024
Accepted: 10 June 2024
Published: 13 June 2024
DOI: https://doi.org/10.1038/s41598-024-64463-0
Springer Nature Limited

Development and validation of an interpretable model for predicting sepsis mortality across care settings

Abstract

Similar content being viewed by others

Development and validation of a score to predict mortality in ICU patients with sepsis: a multicenter retrospective study

Machine learning for the prediction of sepsis: a systematic review and meta-analysis of diagnostic test accuracy

The association between four scoring systems and 30-day mortality among intensive care patients with sepsis: a cohort study

Introduction

Results

Clinical characteristics of patients in the derivation dataset and the validation dataset

New models for predicting 28-day mortality in sepsis patients

New predictive model of sepsis mortality in different clinical situations

Clinical utility of new predictive model of sepsis mortality

Discussion

Methods

Nationwide multicenter prospective sepsis cohort

Study design

Definition of variables

Statistical analysis

Ethics statement

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Consortia

the Korean Sepsis Alliance (KSA) Investigators

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Tables.

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Development and validation of an interpretable model for predicting sepsis mortality across care settings

Abstract

Similar content being viewed by others

Development and validation of a score to predict mortality in ICU patients with sepsis: a multicenter retrospective study

Machine learning for the prediction of sepsis: a systematic review and meta-analysis of diagnostic test accuracy

The association between four scoring systems and 30-day mortality among intensive care patients with sepsis: a cohort study

Introduction

Results

Clinical characteristics of patients in the derivation dataset and the validation dataset

New models for predicting 28-day mortality in sepsis patients

New predictive model of sepsis mortality in different clinical situations

Clinical utility of new predictive model of sepsis mortality

Discussion

Methods

Nationwide multicenter prospective sepsis cohort

Study design

Definition of variables

Statistical analysis

Ethics statement

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Consortia

the Korean Sepsis Alliance (KSA) Investigators

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Tables.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation