Cluster analysis driven by unsupervised latent feature learning of medications to identify novel pharmacophenotypes of critically ill patients

Sikora, Andrea; Jeong, Hayoung; Yu, Mengyun; Chen, Xianyan; Murray, Brian; Kamaleswaran, Rishikesan

doi:10.1038/s41598-023-42657-2

Cluster analysis driven by unsupervised latent feature learning of medications to identify novel pharmacophenotypes of critically ill patients

Article
Open access
Published: 20 September 2023

Volume 13, article number 15562, (2023)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Cluster analysis driven by unsupervised latent feature learning of medications to identify novel pharmacophenotypes of critically ill patients

Download PDF

Andrea Sikora¹,
Hayoung Jeong²,
Mengyun Yu³,
Xianyan Chen³,
Brian Murray⁴ &
…
Rishikesan Kamaleswaran^5,6

1578 Accesses
1 Citation
4 Altmetric
Explore all metrics

Abstract

Unsupervised clustering of intensive care unit (ICU) medications may identify unique medication clusters (i.e., pharmacophenotypes) in critically ill adults. We performed an unsupervised analysis with Restricted Boltzmann Machine of 991 medications profiles of patients managed in the ICU to explore pharmacophenotypes that correlated with ICU complications (e.g., mechanical ventilation) and patient-centered outcomes (e.g., length of stay, mortality). Six unique pharmacophenotypes were observed, with unique medication profiles and clinically relevant differences in ICU complications and patient-centered outcomes. While pharmacophenotypes 2 and 4 had no statistically significant difference in ICU length of stay, duration of mechanical ventilation, or duration of vasopressor use, their mortality differed significantly (9.0% vs. 21.9%, p < 0.0001). Pharmacophenotype 4 had a mortality rate of 21.9%, compared with the rest of the pharmacophenotypes ranging from 2.5 to 9%. Phenotyping approaches have shown promise in classifying the heterogenous syndromes of critical illness to predict treatment response and guide clinical decision support systems but have never included comprehensive medication information. This first-ever machine learning approach revealed differences among empirically-derived subgroups of ICU patients that are not typically revealed by traditional classifiers. Identification of pharmacophenotypes may enable enhanced decision making to optimize treatment decisions.

Pharmacophenotype identification of intensive care unit medications using unsupervised cluster analysis of the ICURx common data model

Article Open access 02 May 2023

Hypernatremia subgroups among hospitalized patients by machine learning consensus clustering with different patient survival

Article 08 October 2021

Learning Treatment Regimens from Electronic Medical Records

Introduction

Medication regimens of critically ill patients in the intensive care unit (ICU) are complex and heterogeneous^1,2. This heterogeneity of medication regimens has parallels to the common and lethal disease states of critical illness including sepsis and acute respiratory distress syndrome (ARDS)^3,4. Managing the heterogeneity of critical illness is a nearly universally cited challenge for ICU clinicians and researchers^5,6. Phenotyping has been proposed to identify patterns of diagnosis and treatment response among these complex heterogenous syndromes^7,8,9. In particular, phenotyping via artificial intelligence (AI) and machine learning (ML) has demonstrated potential to be a powerful methodology to handle Big Data generated by critically ill patients for identification of novel patient subgroups and prediction of patient outcomes including sepsis, acute kidney injury, mechanical ventilation, ARDS, and more^{10,11,12,13,14,15,16,17}. However, to date, this methodology has only been applied in a limited fashion to the highly complex and heterogenous nature of ICU medication regimens¹.

Critically ill patients are often prescribed greater than 20 medications, with many deemed high-risk for patient harm by the Institute of Safe Medication Practices^18,19,20,21. Further, it has been estimated that each day, a critically ill patient will suffer at least one medication related error. These medication related errors can lead to serious adverse drug events associated with a doubled risk of mortality^20,21. Medication therapy optimization has significant potential to improve patient outcomes and reduce healthcare costs^1,2. Thus, the development of novel prediction models with granular medication information to predict adverse events and direct resources is warranted¹. However, identifying patterns associating medication therapy with patient outcomes within the vast amounts of data generated by ICU patients has remained a challenge, and to date, no AI/ML models have incorporated comprehensive ICU medication regimens into their analyses²².

We hypothesized that a similar approach as has been explored with other disease states of critical illness could be applied to ICU medications. Here, we sought to identify novel pharmacophenotypes using unsupervised machine learning to cluster medications used in the ICU and explore their relationship to patient-centered outcomes.

Methods

Study sample

Patients were drawn from the University of North Carolina Health System, an integrated healthcare delivery system where clinical care is managed via a comprehensive electronic health record (EHR). Patients were included if they were ≥ 18 years old with an ICU admission greater than 24 h between October 2015 and October 2020. ICUs included medical, surgical, neurosciences, and burn specialties. The hospitals varied including community hospitals and academic medical centers. Only the index ICU admission per each patient was considered in this analysis. The institutional review board at The University of Georgia approved this study and included waiver of consent (PROJECT00002652), and all methods were performed in accordance with the relevant guidelines and regulations.

The EHR was queried for patient demographics, medication information, and patient outcomes. Patient demographics included age, sex, admission diagnosis, ICU type, and Acute Physiology and Chronic Health Evaluation II. Medication information including drug, dose, route, duration, and timing of administration were recorded. Patient outcomes included mortality, hospital length of stay, development of delirium (defined by a CAM-ICU positive score), duration of mechanical ventilation, duration of vasopressor use, and acute kidney injury (defined by the presence of renal replacement therapy or a serum creatinine greater than 1.5 × baseline).

Feature extraction

Patient demographics

There were 30,550 given medication entries in the dataset from a total of 991 patients. Of these 30,550 administered medications, there were 440 unique medications when the filter of generic drug names was used and when dose and route information were excluded (e.g., cefepime 1gm and 2gm were counted under the feature of cefepime). Medication records from the raw dataset included a variety of medication administration record (MAR) actions including “given”, “missed”, “hold,” etc. To ensure this analysis only included records of medication that were administered to the patient (not just ordered), only the entries where the medication action label corresponded to "Given", "New Bag", "Restarted," or "Rate Change" were used for the analysis. Some entries contained "free-text" for ICU personnel communication purposes and were discarded. Additionally, duplicate and incomplete entries were filtered out. After cleaning the dataset, the data were transformed into a binary (boolean) vectored form where the 440 unique medications were assigned as the rows, and 991 patients were assigned as the columns. For each patient, a binary value of 1 was assigned to indicate whether the patient received a particular drug. For patient outcomes, the labels for categorical features were relabeled as numeric values. In the cases of unknown or missing entities, these were replaced with “negative” or “no.” The entire mapping of original labels to new labels is provided in Appendix Table 1.

Unsupervised learning approach

Medication clustering

After performing principal component analysis (PCA) on the large, binary medication dataset, the Restricted Boltzmann Machine (RBM) was used to further enrich the latent feature space, which was use as input to the hierarchical clustering algorithm to support the novel discovery of unique pharmacotherapy profiles²³.

Principal component analysis

During PCA, each of the 440 unique medications was treated as an independent variable. PCA is a widely used dimensionality reduction technique to reduce the dimensionality of a dataset with p random variables to q, which is the desired number of variables²⁴. The optimal number of principal components was selected after plotting the explained variance against the number of principal components (see Appendix Fig. 1). The number of principal components was selected as 150 to maintain sufficient variance (approximately 75%) in the data while significantly reducing the dimensionality.

Restricted Boltzmann Machine

RBM was used to learn unsupervised feature abstractions or ‘latent factors’ of the PCA reduced data²⁵. RBM is a simple, two-layered neural network with one visible layer and one hidden layer. It is typically used for collaborative filtering as RBM is capable of learning internal representations of the input variables using unsupervised methods enabling complex relationships to be discovered in the process. For medication clustering purposes, we trained the RBM^25,26 to learn the high dimensional and non-linear nature among medication assignments based on the co-occurrence of medications for each patient. The default hyperparameters for implementation were used based on the works of Chen²⁶. From each patient’s binary assignment of medications, the RBM learned the weight coefficients to ultimately determine which nodes out of all nodes were activated or inactivated for each hidden unit. For clustering purposes, each medication is an independent node from the visible layer (440 units), and connections that are activated to a single hidden layer indicate cluster assignment (see Fig. 1). For example, if acetaminophen (from the visible layer) and Cluster 1 (from the hidden layer) connection was activated, acetaminophen would be assigned to Cluster 1. After assigning medications to each cluster from the created hidden layers, medications that were unassigned (never activated in the five hidden layers) were grouped as Cluster 6. Table 1 lists the medications assigned to Clusters 1–5, and Table 2 lists the unassigned medications in Cluster 6.

Table 1 Medication clusters assigned by restricted boltzman machine.

Full size table

Table 2 Cluster 6—medications unassigned through restricted boltzman machine.

Full size table

II. Patient clustering

For each patient, the frequency of each medication cluster was counted (see Fig. 1). To obtain a normalized medication cluster distribution for each patient, the frequency table was normalized by the total number of medications taken by each patient. This normalized medication cluster distribution was used as a derived feature for patient clustering.

Hierarchical agglomerate clustering

The normalized medication cluster distribution was used to cluster patients using Hierarchical Agglomerative Clustering, which builds a tree to represent data with successor nodes²⁷. The optimal number of clusters (n = 5) was identified through the use of the unsupervised pipeline, including visual inspection of the dendrogram (see Fig. 1) and silhouette scores analysis, which was used to identify cluster numbers that provided an equal width among clusters where all clusters are found to have an above average silhouette score (see Appendix Fig. 2). Table 3 describes relevant demographic and outcomes information for each cluster. For implementation, scikit-learn 1.0.2 python library was used to obtain a total of five cluster labels.

Table 3 Demographic characteristics by patient cluster.

Full size table

Validation of clusters

Upon selection of the optimal number of clusters, the validity of these clusters as clinically meaningful subgroups was assessed via surrogate validation conducted by comparing patient outcomes with medication data to see if clinically relevant characteristics were distinguishable.

Wilcoxon rank sum and signed rank tests were performed for continuous characteristics. Fisher's Exact tests were performed for categorical characteristics. Holm’s adjustment of p-values was applied to the comparisons within each outcome to control the familywise error rates. Permutation multivariate analysis of variance (MANOVA) was also used to confirm if the clusters were significantly different considering all clinical outcomes simultaneously²⁸. Significance was assessed at p-value < 0.05.

Results

From the original 1000 patients, a total of 991 patients were included in the analysis with nine excluded due to being repeat ICU admissions. Demographic features are summarized in Table 4 with additional information about the health system provided in Appendix Table 2. The average was 61.2 years old (SD 17.5) with 43% female sex. The patients were managed in the medical ICU 40.7% of the time followed by 9.8% in the surgical and 9.4% in the neurosciences ICU. The mean APACHE II score at 24 h was 14.2 (SD 6.3). The frequency of use for each medication in the analysis is provided in Appendix Table 3, with the top ten medications used including sodium chloride, acetaminophen, potassium chloride, heparin, fentanyl, magnesium sulfate, insulin, furosemide, pantoprazole, and vancomycin.

Table 4 Summary of patient population.

Full size table

Comparison of patient and medication clusters

Five patient clusters were identified through the use of the unsupervised pipeline. Additionally, the silhouette scores analysis plot further suggested a cluster number of 5 provides an equal width between clusters with all clusters having an above average silhouette score (see Appendix Fig. 2). Table 3 describes relevant demographic and outcomes information for each cluster, and Fig. 1 provides a visualization of the distribution of patient clusters by medication clusters and patient outcomes, with lower mean values indicating less severe outcomes. Patient Cluster 1 had a well-rounded distribution overall when compared to other patient clusters and did not have any distinctive distribution for a particular medication cluster. In contrast, Patient Cluster 4 had a high distribution in Medication Cluster 6. Figure 2 summarizes the mean medication cluster distribution for each patient cluster, with the mean medication cluster distribution for each patient cluster provided in Appendix Table 4.

Comparison of patient clusters by clinical outcomes

Patient Cluster 3 and 5 had the least serious outcomes while Patient Cluster 2 and 4 generally had worse patient outcomes. Box plots of outcomes by patient clusters are presented in Fig. 3. For medication clustering purposes, we trained the RBM^25,26 to learn the high dimensional and non-linear nature among medication assignments based on the co-occurrence of medications for each patient. The default hyperparameters for implementation were used based on the works of Chen²⁶. A notable finding was that Patient Clusters 2 and 4 had no statistically significant difference in ICU length of stay, duration of mechanical ventilation, or duration of vasopressor use, but their mortality differed significantly (9.0% vs. 21.9%, p < 0.0018). Patient Cluster 4 had a mortality rate of 21.9% compared with the rest of the clusters ranging between 2.5 and 9% (see Fig. 4). Patient Cluster 4 also had the highest number of outliers (see Appendix Fig. 3). The difference of ICU duration between Patient Clusters 1 and 5 and Patient Clusters 2 and 4 were statistically insignificant. Significance of the differences between patient clusters are summarized in Table 5. Permutation MANOVA further confirmed these differences (p < 0.001) (see Appendix Table 5).

Table 5 Pair-wise comparison of differences in patient outcomes by patient cluster.

Full size table

Discussion

In the first unsupervised machine learning analysis of critically ill patients and their medication regimens, five unique patient clusters were identified with significant differences in severity of illness and outcomes. Six pharmacophenotypes were identified, and each patient cluster displayed a unique distribution of these six pharmacophenotypes. This study is the first to apply AI to the complete medication list of ICU patients and demonstrates the ability to appropriately categorize patients with their outcomes, which lays the groundwork for future investigations.

Unsupervised machine learning methods have been previously explored for the derivation of distinct clinical phenotypes and biological endotypes^29,30,31. Prior approaches have frequently used methods such as Latent Class Analysis (LCA) to identify clusters that are separatable by the input data. Latent Class Analysis is a set of Finite Mixture Models, which utilize a probablistic model-based clustering approach, in which each cluster are characterized on a probabilistic distribution rather than their centroid-based distance (such as with k-Means). Thus, each cluster has a probability of association, rather than a clear membership assignment. Due to the probabilistic nature of the class assignment, it may be difficult to derive instance-level associations, thus a single instance may belong marginally to multiple classes^32,33. Alternatively, k-means allows for a characterization of clusters driven by centroid-based distances, allowing for a quantitative estimate of the membership³⁴. Due to the heterogeneity of the input data, our goal in this work was to distinguish between a finite set of classes and better understand their distance-based profile when medications are utilized in the derivation rather than a probabilistic model of their likelihood.

Critically ill patients are medically complex with requisitely complex medication regimens. The significant challenges to characterizing complex, heterogeneous ICU medications in a meaningful way to drive clinical decision making parallel the challenges of managing and researching complex ICU syndromes like ARDS and sepsis. Indeed, it was reported that 62 of 76 randomized-controlled trials evaluating mortality showed no significant difference and just three of those positive studies have been accepted into practice³⁵. Similar findings have been paralleled in ARDS³⁶. Thoughtful editorials on this statistically unlikely preponderance of negative results have been published, and although common reasons for negative ICU studies likely account for some of these negative trials (e.g., underpowered studies, need for the use of a more conservative p-value cut-off), these statistical explanations ignore the potentially biological ones, wherein the target of an intervention is absent due to limitations in specificity of diagnosis, animal models of disease, or understanding of underlying pathophysiology^37,38,39. Additionally, we would like to propose another relevant driver of patient outcomes that is generally unaccounted for in both RCTs and predictive modeling studies: the complete ICU medication regimen. Traditionally, ICU medications are often thought to be direct results of critical illness (e.g., a septic patient with a high lactate is prescribed broad-spectrum antibiotics and vasopressors). However, this simplified pathway does not incorporate that ICU medications are also independent risk factors for ICU complications that worsen patient outcomes (e.g., this septic patient develops acute kidney injury, which may be due to the shock state or the use of nephrotoxic medications or the combination of disease plus medication). Thus, when making medication-related decisions, medications must be thought of as both treatments and causes of outcomes (see Fig. 5). Aside from overt medication errors, ICU medications are also associated with significant ICU complications that increase risk of mortality and length of stay including ICU delirium, fluid overload, acute kidney injury, etc^40,41,42,43. Ultimately, the benefits to medications used to manage critical illness must be balanced by mitigating the harms of those same treatments. Because medications in the ICU are always used in combination with other medications and interventions, identifying which medication and which medication combinations confer less risk for ICU complications has the potential to guide safer medication use. However, the dynamic relationships among patients, medications, and outcomes have been difficult to delineate given the inherent complexities and largess of ICU patient care and the data generated in that process. Given that medications are independent risk factors, a future direction for this type of clustering analysis is to generate a dataset capable of matching patients by demographic and clinical presentation variables and then compare outcomes of those with similar vs. different pharmacophenotypes. Moreover, this type of analysis will be aided by a multi-center design that improves external generalizability given that medication regimens may reflect local practices or other healthcare team related origins, instead of purely patient pathophysiology.

Phenotyping, especially when conducted through artificial intelligence methods, has significant potential to overcome challenges related to heterogeneity and non-linear relationships present in critically ill populations. When Calfee et al. used biomarker-based phenotyping in a re-analysis of a large randomized-controlled trial evaluating simvastatin (a trial that notably had previously shown negative results), significantly different treatment response wherein one phenotype showed mortality benefit from simvastatin was observed⁴⁴. Moreover, these ARDS phenotypes also showed differential treatment response to fluid management strategy⁴⁵. Similarly, AI methods have demonstrated the presence of unique clusters in shock, sepsis, and fluid overload^46,47. Notably, Seymour et al. demonstrated that the results of three major randomized controlled trials were sensitive to the sepsis phenotypes they derived via unsupervised machine learning methods. Another series of shock sub-phenotypes was characterized by features associated with common ICU interventions (e.g., “well resuscitated” or “still hypovolemic”) that upon appropriate validation could yield highly relevant insights for bedside decision-making⁴⁷. Our cluster pipeline driven by unsupervised feature learning using RBM and hierarchical clustering categorized medications into five unique clusters, with the remaining medications creating a sixth category. Of the patient clusters, Clusters 2 and 4 had the highest acuity, as measured by APACHE II. This high acuity was accompanied by significantly worse outcomes, including length of stay, ICU length of stay, presence of delirium and fluid overload, and need for mechanical ventilation. Interestingly, despite being similar, Patient Cluster 4 had a mortality rate over twice as high as Cluster 2. When evaluating the distribution of pharmacophenotypes, Cluster 4 had the highest density of Medication Cluster 6 and limited representation among the other five clusters. This particular pharmacophenotype contains many of the medications classically associated with ICU care including vasopressors and broad-spectrum antibiotics. Conversely, Cluster 3 had the lowest severity of illness and best outcomes and also had the lowest density of all the pharmacophenotypes. This suggests a possibility of non-linear relationships between medication regimen complexity and outcomes seen in other analyses⁴⁸. Medication regimen complexity, as measured by the MRC-ICU, has been previously incorporated into ML prediction models along with other relevant patient characteristics and resulted in improved mortality prediction in a small cohort of patients⁴⁹. In this study, medication regimen complexity was highest in Patient Clusters 2 and 4, which is in line with previous investigations of MRC-ICU that used traditional inferential statistics to demonstrate a relationship between increasing medication regimen complexity and increased mortality, length of stay, and fluid overload as well as increased need for critical care pharmacist interventions to optimize the medication regimens^{50,51,52,53,54,55}. Taken together, the methodologies in this study appear to be able to appropriately group degree of critical illness (i.e., severity) with degree of intervention intensity (e.g., mechanical ventilation, medications) with patient outcomes (e.g., mortality). This congruence sets the foundation for future investigations to predict ICU complications based on unique medication combinations that deleteriously affect patient outcomes.

Overall, this evaluation was a proof-of-concept investigation to explore how unsupervised clustering methods may be applied to ICU medications, and while it has novel implications, future evaluations to address certain limitations are warranted that include comparative approaches, larger datasets, and more granular medication information. Comparative evaluations may include matrix factorization or other robust forms of RBM (e.g., Gaussian-Bernoulli RBM)²³. Only generic drug name was used to describe the medications with dose, route, and other formulation information excluded. Establishing uniform means of describing and comparing ICU medication dosing strategies (e.g., a common data model) and validating these approaches in external datasets remains an area of future work. We assumed homogeneity across medication regimens; however, in practice this may be a highly complex and noisy interaction: therefore, in future work, we seek to utilize Trust Discover platforms to generalize pharmacotherapy profiles that are normalized independent of clinician and institutional bias⁵⁶. Causal inference cannot be assessed by the current study, so it is unknown whether the high mortality observed in Patient Cluster 4 was partly caused by the unique distribution of pharmacophenotypes versus other factors (although notably, Cluster 4 shared similarities among groups). Even with these limitations, this analysis marks the first time the complete medication profile has been incorporated into outcomes analysis for ICU patients. Future analyses with more granular pharmacophenotype groupings or more programmed directives incorporating data from a myriad of ICUs and centers may improve the face validity form the viewpoint of the clinician for these pharmacophenotypes.

Conclusion

The medication regimens of critically ill patients have unique pharmacophenotypes. Given the significant role of medication therapy in patient outcomes, delineating the complex relationships among patients, medications, and outcomes using artificial intelligence warrants future investigation.

Data availability

Data will be made available upon request by the editor and/or reviewers from the corresponding author.

References

Newsome, A. S. et al. Optimization of critical care pharmacy clinical services: A gap analysis approach. Am. J. Health Syst. Pharm. 78, 2077–2085 (2021).
PubMed PubMed Central Google Scholar
Lat, I. et al. Position paper on critical care pharmacy services: 2020 Update. Crit. Care Med. 48, e813–e834 (2020).
PubMed Google Scholar
Matthay, M. A. et al. Acute respiratory distress syndrome. Nat. Rev. Dis. Primers 5, 18 (2019).
PubMed PubMed Central Google Scholar
Leligdowicz, A. & Matthay, M. A. Heterogeneity in sepsis: New biological evidence with clinical applications. Crit. Care 23, 80 (2019).
PubMed PubMed Central Google Scholar
Prescott, H. C., Calfee, C. S., Thompson, B. T., Angus, D. C. & Liu, V. X. Toward smarter lumping and smarter splitting: Rethinking strategies for sepsis and acute respiratory distress syndrome clinical trial design. Am. J. Respir. Crit. Care Med. 194, 147–155 (2016).
PubMed PubMed Central Google Scholar
Cohen, J. et al. Sepsis: A roadmap for future research. Lancet Infect. Dis. 15, 581–614 (2015).
PubMed Google Scholar
Su, L. et al. Five novel clinical phenotypes for critically ill patients with mechanical ventilation in intensive care units: A retrospective and multi database study. Respir. Res. 21, 325 (2020).
CAS PubMed PubMed Central Google Scholar
Alipanah, N. & Calfee, C. S. Phenotyping in acute respiratory distress syndrome: State of the art and clinical implications. Curr. Opin. Crit. Care 28, 1–8 (2022).
PubMed PubMed Central Google Scholar
Messmer, A. S. et al. Fluid overload phenotypes in critical illness: A machine learning approach. J. Clin. Med. 11, 336 (2022).
CAS PubMed PubMed Central Google Scholar
Yao, L. et al. A Survey on causal inference. ACM Trans. Knowledge Discovery from Data (TKDD) (2021).
Churpek, M. M. et al. Multicenter comparison of machine learning methods and conventional regression for predicting clinical deterioration on the wards. Crit. Care Med. 44, 368–374 (2016).
PubMed PubMed Central Google Scholar
Ginestra, J. C. et al. Clinician perception of a machine learning-based early warning system designed to predict severe sepsis and septic shock. Crit. Care Med. 47, 1477–1484 (2019).
PubMed PubMed Central Google Scholar
Koyner, J. L., Carey, K. A., Edelson, D. P. & Churpek, M. M. The development of a machine learning inpatient acute kidney injury prediction model. Crit. Care Med. 46, 1070–1077 (2018).
PubMed Google Scholar
Liu, R., Greenstein, J. L., Fackler, J. C., Bembea, M. M. & Winslow, R. L. Spectral clustering of risk score trajectories stratifies sepsis patients by clinical outcome and interventions received. Elife 9, 58142 (2020).
Google Scholar
Grunwell, J. R. et al. Cluster analysis and profiling of airway fluid metabolites in pediatric acute hypoxemic respiratory failure. Sci. Rep 11, 23019 (2021).
ADS CAS PubMed PubMed Central Google Scholar
Holder, A. L., Shashikumar, S. P., Wardi, G., Buchman, T. G. & Nemati, S. A locally optimized data-driven tool to predict sepsis-associated vasopressor use in the ICU. Crit. Care Med. 49, e1196–e1205 (2021).
PubMed PubMed Central Google Scholar
Singhal, L. et al. eARDS: A multi-center validation of an interpretable machine learning algorithm of early onset Acute Respiratory Distress Syndrome (ARDS) among critically ill adults with COVID-19. PLoS ONE 16, e0257056 (2021).
CAS PubMed PubMed Central Google Scholar
Practices, I.o.S.M. High Alert Medications (2018).
Maslove, D. M., Lamontagne, F., Marshall, J. C. & Heyland, D. K. A path to precision in the ICU. Crit. Care 21, 79 (2017).
PubMed PubMed Central Google Scholar
Halpern, N. A., Goldman, D. A., Tan, K. S. & Pastores, S. M. Trends in critical care beds and use among population groups and medicare and medicaid beneficiaries in the United States: 2000–2010. Crit. Care Med. 44, 1490–1499 (2016).
PubMed PubMed Central Google Scholar
Cullen, D. J. et al. Preventable adverse drug events in hospitalized patients: A comparative study of intensive care and general care units. Crit. Care Med. 25, 1289–1297 (1997).
CAS PubMed Google Scholar
Nguyen, D., Ngo, B. & van Sonnenberg, E. AI in the intensive care unit: Up-to-date review. J. Intensive Care Med. 36, 1115–1123 (2021).
PubMed Google Scholar
Upadhya, V. & Sastry, P. S. Learning Gaussian-Bernoulli RBMs using difference of convex functions optimization. IEEE Trans. Neural Netw. and Learn. Syst. https://doi.org/10.1109/TNNLS.2021.3071358 (2021).
Article Google Scholar
Jolliffe, I. Principal component analysis. In International Encyclopedia of Statistical Science (ed. Lovric, M.) (Springer, 2014).
Google Scholar
Hinton, G. E. A practical guide to training restricted Boltzmann machines. In Neural Networks: Tricks of the Trade 2nd edn 599–619 (Springer, 2012).
Google Scholar
Edwin, C. Restricted-boltzmann-machines. GitHub Repository. https://github.com/echen/restricted-boltzmann-machines.
Murtagh, F. & Legendre, P. Ward’s hierarchical agglomerative clustering method: Which algorithms implement ward’s criterion?. J. Classif. 31, 274–295. https://doi.org/10.1007/s00357-014-9161-z (2014).
Article MathSciNet MATH Google Scholar
Anderson, M. J. A new method for non-parametric multivariate analysis of variance. Austral. Ecol. 26, 32–46. https://doi.org/10.1111/j.1442-9993.2001.01070.pp.x (2001).
Article Google Scholar
Patel, R. B. et al. Association of longitudinal trajectory of albuminuria in young adulthood with myocardial structure and function in later life: Coronary Artery Risk Development in Young Adults (CARDIA) study. JAMA Cardiol. 5(2), 184–192 (2019).
PubMed Central Google Scholar
Seymour, C. W. et al. Derivation, validation, and potential treatment implications of novel clinical phenotypes for sepsis. JAMA 321(20), 2003–2017. https://doi.org/10.1001/jama.2019.5791 (2019).
Article CAS PubMed PubMed Central Google Scholar
Reddy, K., Calfee, C. S. & McAuley, D. F. Acute respiratory distress syndrome subphenotypes beyond the syndrome: A step toward treatable traits?. Am. J. Respir. Crit. Care Med. 203(12), 1449–1451 (2021).
PubMed PubMed Central Google Scholar
Lezhnina, O. & Kismihók, G. Latent class cluster analysis: Selecting the number of clusters. MethodsX 9, 101747 (2022).
PubMed PubMed Central Google Scholar
Mori, M., Krumholz, H. M. & Allore, H. G. Using latent class analysis to identify hidden clinical phenotypes. JAMA 324(7), 700–701. https://doi.org/10.1001/jama.2020.2278 (2020).
Article PubMed Google Scholar
Grant, R. W. et al. Use of latent class analysis and k-means clustering to identify complex patient profiles. JAMA Netw. Open 3(12), e2029068. https://doi.org/10.1001/jamanetworkopen.2020.29068 (2020).
Article PubMed PubMed Central Google Scholar
Ospina-Tascon, G. A., Buchele, G. L. & Vincent, J. L. Multicenter, randomized, controlled trials evaluating mortality in intensive care: Doomed to fail?. Crit. Care Med. 36, 1311–1322 (2008).
PubMed Google Scholar
Tonelli, A. R., Zein, J., Adams, J. & Ioannidis, J. P. Effects of interventions on survival in acute respiratory distress syndrome: An umbrella review of 159 published randomized trials and 29 meta-analyses. Intensive Care Med. 40, 769–787 (2014).
PubMed PubMed Central Google Scholar
Laffey, J. G. & Kavanagh, B. P. Negative trials in critical care: why most research is probably wrong. Lancet Respir. Med. 6, 659–660 (2018).
PubMed Google Scholar
Di Leo, G. & Sardanelli, F. Statistical significance: p value, 0.05 threshold, and applications to radiomics-reasons for a conservative approach. Eur. Radiol. Exp. 4, 18 (2020).
PubMed PubMed Central Google Scholar
Lewis, A. J., Seymour, C. W. & Rosengart, M. R. Current murine models of sepsis. Surg. Infect. (Larchmt) 17, 385–393 (2016).
PubMed Google Scholar
Hawkins, W. A. et al. Fluid stewardship during critical illness: A call to action. J. Pharm. Pract. 33, 863–873 (2020).
PubMed Google Scholar
Huang, J. Drug-induced nephrotoxicity and drug metabolism in renal failure. Curr. Drug Metab. 19, 558 (2018).
CAS PubMed Google Scholar
Devlin, J. W. et al. Clinical practice guidelines for the prevention and management of pain, agitation/sedation, delirium, immobility, and sleep disruption in adult patients in the ICU. Crit. Care Med. 46, e825–e873 (2018).
PubMed Google Scholar
Lee, H. et al. Impact on patient outcomes of pharmacist participation in multidisciplinary critical care teams: A systematic review and meta-analysis. Crit. Care Med. 47, 1243–1250 (2019).
CAS PubMed Google Scholar
Calfee, C. S. et al. Acute respiratory distress syndrome subphenotypes and differential response to simvastatin: Secondary analysis of a randomised controlled trial. Lancet Respir. Med. 6, 691–698 (2018).
CAS PubMed PubMed Central Google Scholar
Famous, K. R. et al. Acute respiratory distress syndrome subphenotypes respond differently to randomized fluid management strategy. Am. J. Respir. Crit. Care Med. 195, 331–338 (2017).
CAS PubMed PubMed Central Google Scholar
Seymour, C. W. et al. Derivation, validation, and potential treatment implications of novel clinical phenotypes for sepsis. JAMA 321, 2003–2017 (2019).
CAS PubMed PubMed Central Google Scholar
Geri, G. et al. Cardiovascular clusters in septic shock combining clinical and echocardiographic parameters: A post hoc analysis. Intensive Care Med. 45, 657–667 (2019).
PubMed Google Scholar
Sikora, A. et al. Evaluation of medication regimen complexity as a predictor for mortality. Sci. Rep. 13(1), 10784. https://doi.org/10.1038/s41598-023-37908-1 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Al-Mamun, M. A., Brothers, T. & Newsome, A. S. Development of machine learning models to validate a medication regimen complexity scoring tool for critically ill patients. Ann. Pharmacother. 55, 421–429 (2021).
PubMed Google Scholar
Gwynn, M. E., Poisson, M. O., Waller, J. L. & Newsome, A. S. Development and validation of a medication regimen complexity scoring tool for critically ill patients. Am. J. Health Syst. Pharm. 76, S34–S40 (2019).
PubMed Google Scholar
Newsome, A. S., Anderson, D., Gwynn, M. E. & Waller, J. L. Characterization of changes in medication complexity using a modified scoring tool. Am. J. Health Syst. Pharm. 76, S92–S95 (2019).
PubMed Google Scholar
Newsome, A. et al. Medication regimen complexity is associated with pharmacist interventions and drug-drug interactions: A use of the novel MRC-ICU scoring tool. J. Am. Coll. Clin. Pharm. 3, 47–56 (2020).
Google Scholar
Newsome, A. S., Smith, S. E., Olney, W. J. & Jones, T. W. Multicenter validation of a novel medication-regimen complexity scoring tool. Am. J. Health Syst. Pharm. 77, 474–478 (2020).
PubMed Google Scholar
Olney, W. J., Chase, A. M., Hannah, S. A., Smith, S. E. & Newsome, A. S. Medication regimen complexity score as an indicator of fluid balance in critically ill patients. J. Pharm. Pract. 35, 573–579 (2021).
PubMed PubMed Central Google Scholar
Smith, S. E., Shelley, R. & Newsome, A. S. Medication regimen complexity vs patient acuity for predicting critical care pharmacist interventions. Am. J. Health Syst. Pharm. 79, 651–655 (2021).
PubMed Central Google Scholar
Li, Y. et al. A survey on truth discovery. ACM SIGKDD Explor. Newsl 17(2), 1–16 (2016).
Google Scholar

Download references

Acknowledgements

Data acquisition were supported by NC TraCS, funded by Grant Number UL1TR002489 from the National Center for Advancing Translations Sciences at the National Institutes of Health, and Data Analytics at the University of North Carolina Medical Center Department of Pharmacy. American Society of Health-System Pharmacists Innovations in Technology Grant supported this work.

Funding

Funding through Agency of Healthcare Research and Quality for Sikora and Kamaleswaran was provided through 1R21HS028485.

Author information

Authors and Affiliations

Department of Clinical and Administrative Pharmacy, University of Georgia College of Pharmacy, Augusta, GA, USA
Andrea Sikora
Georgia Institute of Technology, Atlanta, GA, USA
Hayoung Jeong
Department of Statistics, University of Georgia Franklin College of Arts and Sciences, Athens, USA
Mengyun Yu & Xianyan Chen
Department of Pharmacy, University of North Carolina Medical Center, Chapel Hill, NC, USA
Brian Murray
Department of Biomedical Informatics, Emory University School of Medicine, Atlanta, GA, USA
Rishikesan Kamaleswaran
Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA, USA
Rishikesan Kamaleswaran

Authors

Andrea Sikora
View author publications
You can also search for this author in PubMed Google Scholar
Hayoung Jeong
View author publications
You can also search for this author in PubMed Google Scholar
Mengyun Yu
View author publications
You can also search for this author in PubMed Google Scholar
Xianyan Chen
View author publications
You can also search for this author in PubMed Google Scholar
Brian Murray
View author publications
You can also search for this author in PubMed Google Scholar
Rishikesan Kamaleswaran
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

AS, RK, BM, and XC participated in conception and design of the study. HY, MY, XC, and RK participated in analysis and interpretation. AS, HY, and MY participated in drafting, and all authors participated in substantive revisions. All authors edited and approved the final version of this manuscript.

Corresponding author

Correspondence to Andrea Sikora.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Sikora, A., Jeong, H., Yu, M. et al. Cluster analysis driven by unsupervised latent feature learning of medications to identify novel pharmacophenotypes of critically ill patients. Sci Rep 13, 15562 (2023). https://doi.org/10.1038/s41598-023-42657-2

Download citation

Received: 10 June 2022
Accepted: 13 September 2023
Published: 20 September 2023
DOI: https://doi.org/10.1038/s41598-023-42657-2
Springer Nature Limited

Cluster analysis driven by unsupervised latent feature learning of medications to identify novel pharmacophenotypes of critically ill patients

Abstract

Similar content being viewed by others

Pharmacophenotype identification of intensive care unit medications using unsupervised cluster analysis of the ICURx common data model

Hypernatremia subgroups among hospitalized patients by machine learning consensus clustering with different patient survival

Learning Treatment Regimens from Electronic Medical Records

Introduction

Methods

Study sample

Feature extraction

Patient demographics

Unsupervised learning approach

Medication clustering

Principal component analysis

Restricted Boltzmann Machine

II. Patient clustering

Hierarchical agglomerate clustering

Validation of clusters

Results

Comparison of patient and medication clusters

Comparison of patient clusters by clinical outcomes

Discussion

Conclusion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation