Identifying and characterizing high-risk clusters in a heterogeneous ICU population with deep embedded clustering

Castela Forte, José; Yeshmagambetova, Galiya; van der Grinten, Maureen L.; Hiemstra, Bart; Kaufmann, Thomas; Eck, Ruben J.; Keus, Frederik; Epema, Anne H.; Wiering, Marco A.; van der Horst, Iwan C. C.

doi:10.1038/s41598-021-91297-x

Identifying and characterizing high-risk clusters in a heterogeneous ICU population with deep embedded clustering

Article
Open access
Published: 08 June 2021

Volume 11, article number 12109, (2021)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Identifying and characterizing high-risk clusters in a heterogeneous ICU population with deep embedded clustering

Download PDF

José Castela Forte ORCID: orcid.org/0000-0001-9273-0702^1,2,3,
Galiya Yeshmagambetova³,
Maureen L. van der Grinten³,
Bart Hiemstra²,
Thomas Kaufmann²,
Ruben J. Eck⁴,
Frederik Keus⁵,
Anne H. Epema²,
Marco A. Wiering³ &
…
Iwan C. C. van der Horst⁶

6595 Accesses
24 Citations
2 Altmetric
Explore all metrics

Abstract

Critically ill patients constitute a highly heterogeneous population, with seemingly distinct patients having similar outcomes, and patients with the same admission diagnosis having opposite clinical trajectories. We aimed to develop a machine learning methodology that identifies and provides better characterization of patient clusters at high risk of mortality and kidney injury. We analysed prospectively collected data including co-morbidities, clinical examination, and laboratory parameters from a minimally-selected population of 743 patients admitted to the ICU of a Dutch hospital between 2015 and 2017. We compared four clustering methodologies and trained a classifier to predict and validate cluster membership. The contribution of different variables to the predicted cluster membership was assessed using SHapley Additive exPlanations values. We found that deep embedded clustering yielded better results compared to the traditional clustering algorithms. The best cluster configuration was achieved for 6 clusters. All clusters were clinically recognizable, and differed in in-ICU, 30-day, and 90-day mortality, as well as incidence of acute kidney injury. We identified two high mortality risk clusters with at least 60%, 40%, and 30% increased. ICU, 30-day and 90-day mortality, and a low risk cluster with 25–56% lower mortality risk. This machine learning methodology combining deep embedded clustering and variable importance analysis, which we made publicly available, is a possible solution to challenges previously encountered by clustering analyses in heterogeneous patient populations and may help improve the characterization of risk groups in critical care.

Identification of distinct clinical phenotypes of cardiogenic shock using machine learning consensus clustering approach

Article Open access 29 August 2023

Stratifying Mortality Risk in Intensive Care: A Comprehensive Analysis Using Cluster Analysis and Classification and Regression Tree Algorithms

Article Open access 15 May 2024

Deploying unsupervised clustering analysis to derive clinical phenotypes and risk factors associated with mortality risk in 2022 critically ill patients with COVID-19 in Spain

Article Open access 15 February 2021

Introduction

Critically ill patients constitute a highly heterogeneous population, with high rates of acute and chronic multimorbidity, and different profiles of risk, response to interventions, and outcomes. Despite extensive research, however, the goal of unravelling patient heterogeneity remains largely unattained. Critical care clinicians rely on a combination of laboratory and clinical examination variables, and their own clinical experience (clinical gestalt) and gut feeling to characterize patients¹. While human beings excel at ascribing meaning to observed patterns once they have seen them, data-driven approaches enabling the combination of diverse data streams can enhance patient characterization within known “sub-phenotypes” or provide insight into new categorizations^2,3.

Clustering analysis has been used for several purposes in medical research, with different approaches showing promising results to help identify and characterize relevant clusters. One of the main appeals of clustering analysis is that its principles resemble a heuristic which clinicians are familiar with: it finds similarities and differences between patients and divides them into group¹. Within critical care research in particular, latent class analysis (LCA) has been the most broadly used algorithm for sub-phenotype identification in cohorts of patients with acute respiratory distress syndrome (ARDS) and acute kidney injury (AKI)^4,5. LCA is an established, model-based statistical technique, that defines the best fitting models for data assumed to contain several unobserved groups^4,5. Unlike LCA, where clusters are derived from the distribution of the data, unsupervised clustering algorithms such as k-means and hierarchical clustering find clusters by identifying similarities between cases¹. These two algorithms have recently been applied to identify clusters in a general ICU population, cardiovascular clusters in septic shock patients, and corticosteroid response in patients with severe asthma, with a variation of k-means called fuzzy c-means also being used to cluster severely injured blunt trauma patients^2,6,7,8.

Similarly, prediction models with increasingly higher accuracy and explainability for mortality and organ injury have been suggested, capable of processing high-frequency data^9,10,11. Both approaches have different advantages and shortcomings. Prediction models can deal with virtually any data format and provide individual probabilities for an outcome, but are bound by a priori hypotheses and can only compute the probability of one specific outcome. On the other hand, traditional clustering algorithms, are not designed to process the high-frequency, dynamic data collected in ICU¹.

In this study, we sought to develop and apply a novel approach to identifying and characterizing clusters of critically ill patients. Using co-morbidity, clinical examination, and laboratory data from a minimally selected ICU cohort, we compared the performance of different clustering methodologies and applied a combined deep embedded clustering and feature importance analysis algorithm to identify clusters of patients at high risk of AKI and mortality during ICU stay, and at 30 and 90 days. Then, we trained a classifier to predict and validate cluster membership, and identified the features driving these predictions. We hypothesized that this approach could identify clinically recognizable patient clusters with clinically significant differences in mortality and severe acute kidney injury.

Methods

Data sources

Data used for this study originated from the prospective, single-centre Simple Intensive Care Studies (SICS) I cohort study. All acutely admitted, critically ill patients included the study underwent clinical examination and critical care ultrasonography (CCUS) within the first 24 h of ICU admission. Informed consent was obtained for all included patients, and all analyses were performed in accordance with relevant guidelines and regulations. Further details on inclusion criteria, informed consent, and study protocol are available elsewhere^12,13. The study was approved by the local institutional review board (Medisch Ethische Toetsingscommissie (METc) of the UMCG, M15.168207).

Co-morbidity, clinical examination, and laboratory data

The dataset consisted of patient characteristics including co-morbidities, clinical examination variables including CCUS, vital signs, and urine output, and a time-series of 40 laboratory values measured at least once daily (Table 1). CCUS measurements were validated by experts, and vital signs were recorded from the bedside monitor^12,13. Patients with more than 10% missing data (i.e. variables for which no measurements were registered at any moment during ICU stay) were excluded from the analysis. Missing laboratory data were imputed using a rolling mean based on ICU-specific values¹⁴. For other variables, iterative imputation with 10 iterations (a method similar to multivariate imputation by chained equations) was used¹⁵.

Table 1 Complete list of input features. List including all patient characteristics, co-morbidities, time-series of laboratory parameters, and clinical examination parameters.

Full size table

Feature extraction (mean and variance concatenated over the whole time-series) was employed to represent the time-series data. Table 1 shows the average number of co-morbid conditions per patient per cluster, calculated based on the information on co-morbidities.

Outcome

To define and assess clinically relevant differences between clusters, mortality at three time points (in-ICU, as well as at 30 and 90 days) was taken as a primary outcome. Kaplan Meier curves were used to visualize the mortality per cluster during and after ICU stay. The secondary outcome was the development of severe AKI (stages 2 or 3). Additionally, differences in the development of any stage of AKI, need for vasopressors, ICU length of stay, development of shock, and need for renal replacement therapy are also reported.

Development and comparison of different clustering methodologies

Most clustering algorithms, such as k-means clustering and hierarchical clustering, are not designed to process high-frequency, dynamic data¹. Different strategies have been developed to facilitate this, including combining K-means and HC with a time-series processing methodology such as dynamic time warping (DTW), as well as using clustering algorithms which represent data in a different way, such as deep embedding clustering (DEC)^16,17. In both these approaches, described in more detail in the Supplementary information files, features such as mean and variance are extracted from the time-series data and subsequently fed into a clustering algorithm. In this study, we compared a DEC model to two “traditional” clustering algorithms, k-means and hierarchical clustering (HC), as well as a combination of HC and dynamic time warping (HC-DTW).

Deep embedded clustering algorithms utilize autoencoder neural networks to learn a certain representation of the data, and then use this representation to form clusters¹⁶. Despite its frequent use in for clustering analyses in other fields, there are but a few reports of analysis of medical data using DEC¹⁸. The DEC model developed in this study combined a multilayer perceptron (MLP) autoencoder, which is a type of neural network, and a custom clustering layer with the k-means clustering algorithm. The clustering layer reconstructs features created by the MLP autoencoder, and converts it to cluster label probabilities represented by Student’s t-distribution. The clustering layer weights represent the cluster centroids and are initialized using k-means algorithm. To improve cluster purity, a centroid-based target distribution is constructed by squaring the encoded vectors and normalizing them by frequency per cluster. Finally, the algorithm is trained to minimize Kullback–Leibler divergence loss for a maximum of 8000 iterations with 0.01 tolerance threshold.

Once clusters were computed for all four algorithms, validity assessments were conducted¹⁹. Internal validity assesses whether the structure of the clustering is intrinsically appropriate for the data. Patients clustered in the same cluster should have similar data, whereas patients from different clusters should be as distinct as possible from those in other clusters. Here, the Silhouette index was used to internally validate k-means, HC, and HC-DTW. For DEC, cluster-wise stability was computed by resampling the dataset 100 times and computing the Jaccard similarities to the originally defined clusters as well as entropy scores^20,21,22. External validity assesses whether clustering results match some a priori expected data structure. When the true cluster labels are known, this is done by comparing the clustering output to a given “correct” clustering^1,23. Since no “true” labels are available when attempting to identify new putative patient clusters, the clinical recognizability of these clusters was used as surrogate of external validity. Lastly, to compare the different methodologies, the potential clinical utility of the clustering was assessed by examining the distribution of patients across the different clusters and whether the different clustering configurations identified between-group differences in the input features.

Cluster membership prediction and feature importance analysis for cluster characterization

A gradient boosting algorithm (XGBoost) was trained to predict cluster membership over 10-folds for each of the 100 clustering configurations resulting from DEC²⁴. Then, SHapley Additive exPlanations (SHAP) values were computed on the run with the highest accuracy to represent the feature importance of each variable in the model. SHAP values are widely used in game theory to determine the contribution of particular features to the difference between the actual and the mean predictions²⁵. Positive and negative SHAP values signal that variables contribute positively or negatively to cluster membership, respectively. Finally, clusters were characterized based on the between-cluster differences in input variables and outcomes, and feature importance values. The admission and discharge diagnoses of all patients were analysed and relevant clinical information to aid in the characterization was extracted. A full schematic overview of the analysis is provided in Fig. 1.

Statistical analysis

Descriptive characteristics for the study population were reported as means with standard deviations and proportions for continuous and categorical variables, respectively. Differences in input variables and outcomes between clusters were determined using analysis of variance and chi-squared tests. Hazard ratios for mortality per cluster were computed, and the p-value for comparison against the full cohort was calculated using the log-rank test. A p-value below 0.05 was considered significant. All clustering and further statistical analyses were performed using Python with PyCharm as interface (version 2019.3.5).

Results

Study population and outcome

Of the 1075 patients included in SICS-I, 743 had less than 10% missing data and were included in the analysis. Both the numbers of variables and the numbers of measurements per variable varied per patient, with on average 21 measurements of each variable per patient. The average number of measurements per variable per patient, and the average time between measurements for each variable across all patients are presented in Supplementary Table S2. Patient characteristics, clinical examination variables, and outcomes are reported in Table 2. Complete data on laboratory variables and outcomes can be found in Supplementary Tables S3–S6, and Supplementary Figs. S9–S10. After 30 and 90 days, 166 and 205 patients (22.3%) had died, with no patients lost to follow-up.

Table 2 Clinical characteristics for the 743 patients, including patient characteristics, clinical examination data, co-morbidities and medical history, and outcomes.

Full size table

Performance of different clustering methodologies

Across k-means, HC, and HC-DTW, the silhouette score was highest when patients were divided into only 2 clusters (Supplementary Table S1). When 3–6 clusters were considered, silhouette scores were still similar, but lower, across algorithms. However, the high silhouette scores are explained by all three algorithms always grouping most patients into one of the clusters, even when the dimensionality (and hence the noise) of the inputs was reduced using principal component analysis. As shown in Supplementary Table S1, all three methods tended to cluster around 90% of patients in only one cluster, regardless of the number of clusters. In contrast, clusters generated by DEC showed balanced cluster membership distribution irrespective of the putative number of clusters (Supplementary Table S1) and identified significant between-cluster differences for the majority of the input features (Supplementary Tables S3–S5). Amongst the seven different possible clustering configurations generated by DEC, stability was highest for six clusters (Supplementary Figs. S1–S8). The tenfold cross-validation XGBoost model predicted cluster membership with 83% accuracy, with sensitivity ranging from 64 to 90% and specificity from 85 to 100% (Supplementary Table S7, Supplementary Fig. S11).

Feature importance analysis and cluster characterization

Sixty-eight patients with high prevalence of respiratory failure or infection (34%), as well as sepsis (21%), were assigned to cluster 1 (Fig. 3, Supplementary Table S3). These patients had a long ICU stay, and the highest rate of worsened respiratory condition after 24 h (Fig. 2, Supplementary Tables S3–S5 and Supplementary Fig. S9). Feature importance analysis identified increased alkaline phosphatase, gamma-GT, bilirubin and lactate as having the greatest impact on cluster membership predictions (Supplementary Fig. S12). Higher values for the former three variables drove predictions towards cluster membership, while a high lactate was associated with non-membership. Patients in this cluster were not at increased mortality risk (ICU, 30-day or 90-day; Table 3 and Fig. 4).

Table 3 Mortality rates, and hazard ratios for mortality and acute kidney injury per cluster.

Full size table

Cluster 2 (n = 100) included the highest percentage of surgical patients (48%, Fig. 2, Supplementary Table S3), including the largest post-transplant group, and cardiac and vascular procedures (Fig. 3, Supplementary Table S6). Almost 40% of patients presented with acute or chronic cardiac condition, with 63% having a low cardiac index (Fig. 3, Supplementary Table S6). Accordingly, troponin T, lactate dehydrogenase (LDH), creatine kinase (CK) and inflammatory variables were increased (Fig. 2, Supplementary Table S4, Supplementary Fig. S9). Patients in this cluster had higher mortality rates (ICU, 30-day, and 90-day; HR 1.55 [95% CI 1.26–1.91], 1.39 [1.12–1.71], and 1.30 [1.06–1.61], respectively) and were at increased risk of stage 2 or 3 acute kidney injury (HR 1.26 [95% CI 1.03–1.59]) (Fig. 3, Table 3, Supplementary Table S6). Higher values for arterial oxygen (pO₂), LDH, lactate, troponin and calcium were associated with cluster membership, while low pO₂, LDH, and ASAT values drove predictions towards non-membership (Supplementary Fig. S13).

Cluster 3 (n = 46) consisted of patients with diverse disease etiology, from liver disease and transplant (24%), to cardiac arrest (11%), sepsis (10%) and respiratory failure (10%). These patients were the youngest and had the highest severity scores at admission (Fig. 2, Supplementary Table S3). Like cluster 2, laboratory variables showed significant elevated troponin T, LDH, CK and inflammatory values (Fig. 2, Supplementary Table S4 and Supplementary Fig. S10). During clinical examination, this group recorded the lowest urine output values (0.58 ml/kg/h), 43% had delayed capillary refill time, and 11% had severe mottling (Fig. 2, Supplementary Table S3). Almost 35% of patients required renal replacement therapy (RRT), and 91% developed AKI, with a HR of stage 2 or 3 AKI of 1.67 [95% CI 1.24–2.25]. These patients also needed the highest vasopressor dose, and had the highest mortality (ICU HR 1.80 [1.33–2.42], 30-day HR 1.66 [1.23–2.23], and 90-day HR 1.58 [1.17–2.12]) (Table 3, Fig. 4). High liver enzymes, LDH, and troponin drove predictions towards cluster membership (Supplementary Fig. S14).

One-hundred forty-four patients with low co-morbidity were in cluster 4, having been admitted primarily with neurosurgical or neurological (40%) and trauma-related (17%) diagnoses (Fig. 3, Supplementary Table S6). These patients presented with relatively lower APACHE IV and SAPS II scores (62 and 41), and had the shortest ICU stay (2.8 days), lowest AKI incidence (31%), and significantly lower risk of mortality (ICU, 30-day, and 90-day (HR 0.57 [0.48–0.68], 0.75 [0.63–0.89], 0.66 [0.55–0.78], respectively). This group also had a markedly reduced risk of severe AKI (HR 0.44 [95% CI 0.37–0.52]). High albumin, hemoglobin, and pO₂ values predicted membership, while high fibrinogen and potassium predicted non-membership (Supplementary Fig. S15).

For cluster 5 (n = 290), the main admission diagnoses were respiratory failure (19%), cardiac arrest (12%), neurological causes (11%), or trauma (12%). Patients in this cluster had a medium co-morbidity profile, the longest mean ICU stay (7.7 days), and high rates of delayed CRT at admission (35%). Worsening respiratory condition was frequent (14%), and COPD was a common co-morbidity (17%). High thrombocytes and potassium predicted cluster membership, while high values for pO₂, creatinine, bilirubin, and phosphate drove predictions towards non-membership for some patients (Supplementary Fig. S16). The 95 patients assigned to cluster 6 primarily suffered from sepsis or respiratory infection, having been admitted to the ICU with respiratory (41%) or gastrointestinal diagnoses (20%). They also had high rates of distributive shock (38%), with physical examination showing a high cardiac index in 59% of patients, as well as high respiratory rates and heart rates. High values for fibrinogen, creatinine, urea, and CRP drove predictions towards cluster membership (Supplementary Fig. S17). Patients in these clusters were not at increased or reduced risk of mortality (Table 3).

Discussion

In this study, we set out to identify patient sub-phenotypes of clinical relevance using time-series laboratory, co-morbidity and clinical examination data. To do this, we compared four different clustering approaches. With a deep clustering algorithm, we identified six sub-phenotypes that capture differences in morbidity and in commonly measured clinical variables. In addition, these sub-phenotypes differed significantly in relevant clinical events rates (such as need for RRT and use of vasopressors) as well as mortality, with one group at low mortality risk, and two at higher mortality risk compared to average.

As the first study evaluating the combination of clustering such a broad range of ICU data including demographic, co-morbidity, clinical examination, and laboratory data with feature importance analysis to characterize patient sub-phenotypes in a minimally-selected ICU population, we draw several conclusions.

We found that traditional clustering algorithms such as k-means and HC were highly susceptible to the variation in data and outliers generated by the inclusion of a large number of laboratory variables. Previous studies using these algorithms with other, mostly low-dimensional datasets had not reported this issue, and do not show the large imbalance in cluster size we observed for k-means and HC in this study^2,6,7. Interestingly, we identified the same issue with HC-DTW despite the use of a computationally-expensive technique like DTW to make time-series sequences of different length more uniform, and therefore suitable for clustering. Deep embedded clustering, on the other hand, provided a balanced patient distribution across clusters using the extracted features alone. Despite the large amount of data and the moderate cohort size, we managed to achieve stable clusters, and use these labels to train a classifier to identify the features driving cluster membership predictions.

The findings from variable importance analysis provided interesting, adjuvant data for the interpretation of the clusters identified during this analysis. Previous clustering analyses have based interpretation of the phenotypes found on differences in means of the variables measured, or by listing variables related to each cluster based on relative importance^2,6,7. Here, we complemented the descriptive statistics of each cluster with SHAP values to establish the directionality of the association between high and low values of a variable and cluster membership. For example, membership of clusters with higher mortality risk was associated with increased ASAT and LDH, which are known biomarkers of myocardial ischemia and have also been shown to be positively correlated with ICU mortality^{14,26,27,28,29}. Likewise, patients were more likely to be in the cluster with the lowest mortality risk (cluster 4) when their albumin, hemoglobin, and pO₂ values were higher, as well as when their liver function was better. These associations are supported by literature, including the addition of albumin measurements to APACHE scores^30,31,32,33.

The results of our study suggest that this clustering methodology is superior to most widely used approaches for clustering of critically ill patients for several reasons. Firstly, it can process time-series data, unlike k-means, HC, or even HC with dynamic time warping. A recent study using clustering analysis to define cardiovascular phenotypes suggested that incorporating serial measures to study transitions from one phenotype to another during ICU stay would provide additional insight to their analysis, which was limited to static variables⁶. Similarly, clustering studies on treatment response in critically ill patients would benefit substantially from processing time-series data, as opposed to one-time treatment administration⁷.

Secondly, departing from a “minimally-selected” patient cohort, we identified six clusters which differed significantly in mortality and AKI risk, and were also clinically recognizable and describable. Caution has been advised when interpreting the results of clustering analyses, especially when identifying “novel” sub-phenotypes, and rightfully so³. Clustering algorithms will inevitably partition patients into clusters, and, as with most unsupervised machine learning techniques, it remains hard to establish what variables drove this partition. It was with this in mind that we set out to cluster patients using a methodology which could identify clusters with significantly different outcomes and simultaneously provided some validation of the results and additional insight into the variables associated with each cluster. This pipeline of clustering, training a classifier on the stable cluster labels, and extraction of SHAP values therefore helped “open the black box” and characterize clusters in more detail.

Lastly, the inclusion of time-series data into clustering analyses can bring a wide range of benefits to studies aiming to characterize patient sub-phenotypes. It can enable the use of continuous hemodynamic monitoring and laboratory data to detect variations in sensitivity to myocardial ischemia or acute kidney injury, or to identify groups with differential treatment responses over time. From a clinical perspective, the potential of accurate clustering of ICU patients to improving patient care is two-fold: it can contribute to better clinical trial design, and it has the potential to inform clinical monitoring and prognostication at bedside. The former idea is supported by recent research on ARDS that suggests sub-phenotyping based on biological markers (such as laboratory parameters) has the potential to identify mechanistic markers proximal to the clinical expression of critical disease and syndromes³⁴. By doing so, it can provide a more accurate alternative to clinical phenotyping, which is prone to misclassification and remains challenging even for well-known syndromes, as well as better predict treatment response³⁴. Given the challenges critical care trials often face to demonstrate meaningful clinical effects, improved randomization/allocation and design in clinical trials is especially important³⁵. Secondly, the identification of clinically meaningful clusters of ICU patients, separating patients by mechanism (cardiac, respiratory, infectious, or other) as well as prognosis (low vs high risk), can be leveraged from ICU admission and throughout ICU stay to guide and optimize staffing tasks. This would allow for not only more personalized care, for example, by minimizing staff contact with patients identified as belonging to clusters with “lower care needs”. In addition, accurate clustering could also be an effective way of summarizing the status of the patient, and could potentially be translated to a (triage) system possibly integrated in a clinical dashboard providing an overview of patients for rapid review by physicians. Naturally, it is essential to replicate and validate the findings of this study before any of these steps can be taken. To this end, we have made the code used in this study publicly available at https://github.com/J1C4F8/SICS_DEC and are currently setting up a multicenter study to assess the validity of the clusters and the feasibility of their clinical application.

Our study also included some limitations. First, while the goal of the SICS-I study was to collect data from a minimally-selected clinical population, inclusion criteria did apply which may account for some selection bias^12,13. For example, patients expected to stay in the ICU for less than 24 h, due to discharge or extremely dire prognosis, were not included. Second, the six sub-phenotypes identified do not represent an exhaustive classification of critically ill patient subtypes. Information user in previous studies like end-of-life desires, need for life-sustaining therapies, and post-discharge care needs would complement our analysis, which did not include any variables of the disease course after the ICU except for mortality². Third, the SHAP values reported are from an intermediate XGBoost which, despite its moderately high accuracy, does not guarantee the variables identified by SHAP are the exact same variables that the DEC model relied on when creating the clusters. Lastly, external validation of the six identified clusters in an independent cohort is necessary. Future studies with larger datasets should look to validate and replicate our findings, and address the possibility of patients belonging to multiple clusters or whether the addition of other features from the time-series data, such as trend or seasonality, would improve the results we obtained by extracting only the mean and variance of each variable.

In conclusion, our analysis of a cohort with 743 ICU patients, based on a combination of clustering and feature importance analysis of co-morbidity, clinical examination, and laboratory data identified six patient sub-phenotypes with varying mortality and risk of severe acute kidney injury. This machine learning methodology, which we made publicly available, is a possible solution to challenges previously encountered by clustering analyses in heterogeneous populations, and may help improve the characterization of risk groups in critical care.

Data availability

The datasets generated during and/or analysed during the current study are not publicly available due to containing sensitive patient information (in particular the detailed admission and discharge diagnoses information) but are available for researchers who meet the criteria for access to confidential data on reasonable request. The code used to create the models is open and available on GitHub, at https://github.com/J1C4F8/SICS_DEC.

References

Castela Forte, J., Perner, A. & van der Horst, I. C. C. The use of clustering algorithms in critical care research to unravel patient heterogeneity. Intensive Care Med. 45, 1025–1028. https://doi.org/10.1007/s00134-019-05631-z (2019).
Article PubMed Google Scholar
Vranas, K. C. et al. Identifying distinct subgroups of intensive care unit patients: A machine learning approach. Crit. Care Med. 45(10), 1607–1615. https://doi.org/10.1097/CCM.0000000000002548 (2017).
Article PubMed PubMed Central Google Scholar
van Smeden, M., Harrell, F. E. & Dahly, D. L. Novel diabetes subgroups. Lancet Diabetes Endocrinol. 6(6), 439–440. https://doi.org/10.1016/S2213-8587(18)30124-4 (2018).
Article PubMed Google Scholar
Sinha, P. et al. Latent class analysis of ARDS subphenotypes: A secondary analysis of the statins for acutely injured lungs from sepsis (SAILS) study. Intensive Care Med. 44(11), 1859–1869. https://doi.org/10.1007/s00134-018-5378-3 (2018).
Article CAS PubMed PubMed Central Google Scholar
Bhatraju, P. K. et al. Identification of acute kidney injury subphenotypes with differing molecular signatures and responses to vasopressin therapy. Am. J. Respir. Crit. Care Med. 199(7), 863–872. https://doi.org/10.1164/rccm.201807-1346OC (2019).
Article CAS PubMed PubMed Central Google Scholar
Geri, G. et al. Cardiovascular clusters in septic shock combining clinical and echocardiographic parameters: A post hoc analysis. Intensive Care Med. 45, 657–667. https://doi.org/10.1007/s00134-019-05596-z (2019).
Article PubMed Google Scholar
Wu, W. et al. Multiview cluster analysis identifies variable corticosteroid response phenotypes in severe asthma. Am. J. Respir. Crit. Care Med. 199(11), 1358–1367. https://doi.org/10.1164/rccm.201808-1543OC (2019).
Article CAS PubMed PubMed Central Google Scholar
Liu, D. et al. Unsupervised clustering analysis based on MODS severity identifies four distinct organ dysfunction patterns in severely injured blunt trauma patients. Front. Med. (Lausanne) 7(46), 2020. https://doi.org/10.3389/fmed.2020.00046.eCollection (2020).
Article Google Scholar
Nielsen, A. B. et al. Survival prediction in intensive-care units based on aggregation of long-term disease history and acute physiology: A retrospective study of the Danish National Patient Registry and electronic patient records. Lancet Digit. Health 1, e78–e89. https://doi.org/10.1016/S2589-7500(19)30024-X (2019).
Article PubMed Google Scholar
Thorsen-Meyer, H. C. et al. Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: A retrospective study of high-frequency data in electronic patient records. Lancet Digit. Health 2(4), e179–e191. https://doi.org/10.1016/S2589-7500(20)30018-2 (2020).
Article PubMed Google Scholar
Shickel, B. et al. DeepSOFA. A continuous acuity score for critically ill patients using clinically interpretable deep learning. Sci. Rep. 9, 1879. https://doi.org/10.1038/s41598-019-38491-0 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Hiemstra, B. et al. Clinical examination, critical care ultrasonography and outcomes in the critically ill: Cohort profile of the Simple Intensive Care Studies-I. BMJ Open 7, e017170. https://doi.org/10.1136/bmjopen-2017-017170 (2017).
Article PubMed PubMed Central Google Scholar
Hiemstra, B. et al. The diagnostic accuracy of clinical examination for estimating cardiac index in critically ill patients: The Simple Intensive Care Studies-I. Intensive Care Med. 45, 190–200. https://doi.org/10.1007/s00134-019-05527-y (2019).
Article PubMed Google Scholar
Alkozai, E. M. et al. Systematic comparison of routine laboratory measurements with in-hospital mortality: ICU-Labome, a large cohort study of critically ill patients. Clin. Chem. Lab. Med. 56(7), 1140–1151. https://doi.org/10.1515/cclm-2016-1028 (2018).
Article CAS PubMed Google Scholar
van Buuren, S. & Groothuis-Oudshoorn, K. mice: Multivariate imputation by chained equations in R. J. Stat. Softw. 45, 1–68 (2010).
Google Scholar
Xie, J., Girshick, R. & Farhadi, A. Unsupervised deep embedding for clustering analysis. Int. Conf. Mach. Learn. 48, 478–487 (2016).
Google Scholar
Petitjean, P., Ketterlin, A. & Gancarski, P. A global averaging method for dynamic time warping, with applications to clustering. Pattern Recogn. 44, 678–693 (2011).
Article ADS Google Scholar
Rohani, N. & Eslahchi, C. Classifying breast cancer molecular subtypes by using deep clustering approach. Front. Genet. 11(553587), 2020. https://doi.org/10.3389/fgene.2020.553587.eCollection (2020).
Article Google Scholar
Halkidi, M., Batistakis, I. & Vazirgiannis, M. On clustering validation techniques. J. Intell. Inf. Syst. 17(2/3), 107–114. https://doi.org/10.1023/A:1012801612483 (2001).
Article MATH Google Scholar
Jain, A. K. Data clustering: 50 years beyond K-means. Pattern Recognit. Lett. 31(8), 651–666. https://doi.org/10.1016/j.patrec.2009.09.011 (2010).
Article ADS Google Scholar
Handl, J., Knowles, J. & Kell, D. B. Computational cluster validation in post-genomic data analysis. Bioinformatics 21(15), 3201–3212. https://doi.org/10.1093/bioinformatics/bti517 (2005).
Article CAS PubMed Google Scholar
Hennig, C. Cluster-wise assessment of cluster stability. Comput. Stat. Data Anal. 52, 258–271. https://doi.org/10.1016/j.csda.2006.11.025 (2007).
Article MathSciNet MATH Google Scholar
Meila, M. Comparing clusterings—An information based distance. J. Multivar. Anal. 98, 873–895. https://doi.org/10.1016/j.jmva.2006.11.013 (2007).
Article MathSciNet MATH Google Scholar
Chen, T., Guestrin, G. XGBoost: A scalable tree boosting system. In KDD '16: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 785–794 https://doi.org/10.1145/2939672.2939785 (2016).
Lundberg, S. M. et al. Explainable machine-learning predictions for the prevention of hypoxaemia during surgery. Nat. Biomed. Eng. 2(10), 749–760. https://doi.org/10.1038/s41551-018-0304-0 (2018).
Article PubMed PubMed Central Google Scholar
Auer, J. What does the liver tell us about the failing heart?. Eur. Heart J. 34(10), 711–714. https://doi.org/10.1093/eurheartj/ehs440 (2013).
Article PubMed Google Scholar
Gao, M. et al. Association of serum transaminases with short- and long-term outcomes in patients with ST-elevation myocardial infarction undergoing primary percutaneous coronary intervention. BMC Cardiovasc. Disord. 17(1), 43. https://doi.org/10.1186/s12872-017-0485-6 (2017).
Article PubMed PubMed Central Google Scholar
Naschitz, J. E., Slobodin, G., Lewis, R. J., Zuckerma, E. & Yeshurun, D. Heart diseases affecting the liver and liver diseases affecting the heart. Am. Heart J. 140(1), 111–120. https://doi.org/10.1067/mhj.2000.107177 (2000).
Article CAS PubMed Google Scholar
Schmiechen, N. J., Han, C. & Milzman, D. P. ED use of rapid lactate to evaluate patients with acute chest pain. Ann. Emerg. Med. 30(5), 571–577. https://doi.org/10.1016/s0196-0644(97)70071-4 (1997).
Article CAS PubMed Google Scholar
Corti, M. C., Guralnik, J. M., Salive, M. E. & Sorkin, J. D. Serum-albumin level and physical-disability as predictors of mortality in older persons. JAMA 272, 1036–1042. https://doi.org/10.1001/jama.1994.03520130074036 (1994).
Article CAS PubMed Google Scholar
Goldwasser, P. & Feldman, J. Association of serum albumin and mortality risk. J. Clin. Epidemiol. 50, 693–703 (1997).
Article CAS Google Scholar
Vincent, J.-L., Dubois, M.-J., Navickis, R. J. & Wilkes, M. M. Hypoalbuminemia in acute illness: Is there a rationale for intervention? A meta-analysis of cohort studies and controlled trials. Ann. Surg. 237(3), 319–334. https://doi.org/10.1097/01.SLA.0000055547.93484.87 (2003).
Article PubMed PubMed Central Google Scholar
Vincent, J.-L. et al. Anemia and blood transfusion in critically ill patients. JAMA 288(12), 1499–1507. https://doi.org/10.1001/jama.288.12.1499 (2002).
Article PubMed Google Scholar
Wilson, J. G. & Calfee, C. S. ARDS subphenotypes: Understanding a heterogeneous syndrome. Crit. Care 24, 102. https://doi.org/10.1186/s13054-020-2778-x (2020).
Article PubMed PubMed Central Google Scholar
Perner, A. & Finfer, S. Do trials that report a neutral or negative treatment effect improve the care of critically ill patients? Yes. Intensive Care Med. 44, 1985–1988. https://doi.org/10.1007/s00134-018-5129-5 (2018).
Article PubMed Google Scholar

Download references

Author information

Authors and Affiliations

Department of Clinical Pharmacy and Pharmacology, University of Groningen, University Medical Center Groningen, Hanzeplein 1, P.O. Box 30.00, 9700 RB, Groningen, The Netherlands
José Castela Forte
Department of Anesthesiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
José Castela Forte, Bart Hiemstra, Thomas Kaufmann & Anne H. Epema
Bernoulli Institute for Mathematics, Computer Science and Artificial Intelligence, University of Groningen, Groningen, The Netherlands
José Castela Forte, Galiya Yeshmagambetova, Maureen L. van der Grinten & Marco A. Wiering
Department of Internal Medicine, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
Ruben J. Eck
Department of Critical Care, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
Frederik Keus
Department of Intensive Care, Maastricht University Medical Centre+, University Maastricht, Maastricht, The Netherlands
Iwan C. C. van der Horst

Authors

José Castela Forte
View author publications
You can also search for this author in PubMed Google Scholar
Galiya Yeshmagambetova
View author publications
You can also search for this author in PubMed Google Scholar
Maureen L. van der Grinten
View author publications
You can also search for this author in PubMed Google Scholar
Bart Hiemstra
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Kaufmann
View author publications
You can also search for this author in PubMed Google Scholar
Ruben J. Eck
View author publications
You can also search for this author in PubMed Google Scholar
Frederik Keus
View author publications
You can also search for this author in PubMed Google Scholar
Anne H. Epema
View author publications
You can also search for this author in PubMed Google Scholar
Marco A. Wiering
View author publications
You can also search for this author in PubMed Google Scholar
Iwan C. C. van der Horst
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.C.F., G.Y., M.W., and I.H. contributed to study concept and design, analysis and interpretation, and prepared the first draft of the manuscript. B.H., T.K., R.E., and F.K. contributed to the original study design and data collection, and reviewed the manuscript. J.C.F., G.Y., M.G. contributed to processing the data and implementing machine learning models. AE contributed to analysis and interpretation, and carefully revised the manuscript.

Corresponding author

Correspondence to José Castela Forte.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Castela Forte, J., Yeshmagambetova, G., van der Grinten, M.L. et al. Identifying and characterizing high-risk clusters in a heterogeneous ICU population with deep embedded clustering. Sci Rep 11, 12109 (2021). https://doi.org/10.1038/s41598-021-91297-x

Download citation

Received: 30 December 2020
Accepted: 25 May 2021
Published: 08 June 2021
DOI: https://doi.org/10.1038/s41598-021-91297-x
Springer Nature Limited

This article is cited by

A robust clustering strategy for stratification unveils unique patient subgroups in acutely decompensated cirrhosis
- Sara Palomino-Echeverria
- Estefania Huergo
- David Gomez-Cabrero
Journal of Translational Medicine (2024)
Deep embedded clustering generalisability and adaptation for integrating mixed datatypes: two critical care cohorts
- Jip W. T. M. de Kok
- Frank van Rosmalen
- Bas C. T. van Bussel
Scientific Reports (2024)
Patient and hospital characteristics predict prolonged emergency department length of stay and in-hospital mortality: a nationwide analysis in Korea
- Kyung-Shin Lee
- Hye Sook Min
- Ho Kyung Sung
BMC Emergency Medicine (2022)
Cluster analysis integrating age and body temperature for mortality in patients with sepsis: a multicenter retrospective study
- Moon Seong Baek
- Jong Ho Kim
- Young Suk Kwon
Scientific Reports (2022)
Clustering analysis of geriatric and acute characteristics in a cohort of very old patients on admission to ICU
- Oded Mousai
- Lola Tafoureau
- Sigal Sviri
Intensive Care Medicine (2022)

Identifying and characterizing high-risk clusters in a heterogeneous ICU population with deep embedded clustering

Abstract

Similar content being viewed by others

Introduction

Methods

Data sources

Co-morbidity, clinical examination, and laboratory data

Outcome

Development and comparison of different clustering methodologies

Cluster membership prediction and feature importance analysis for cluster characterization

Statistical analysis

Results

Study population and outcome

Performance of different clustering methodologies

Feature importance analysis and cluster characterization

Discussion

Data availability

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation