Introduction

Sepsis is a common and often lethal condition; even with improving mortality, approximately 20 % of patients still die of acute sepsis [1]. Severe sepsis and septic shock, the most severe forms of sepsis, are commonly associated with multiple organ dysfunction syndrome (MODS) [2]. Septic patients may differ from each other with regard to severity of disease, etiology of sepsis, and in the degree or type of MODS. Different etiologies of sepsis—e.g., pneumonia, urinary tract infection—are commonly associated with different clinical outcomes [3, 4]. Whether patterns of organ dysfunction define distinct clusters within severe sepsis and septic shock is not known. Identification of such clusters, if they represent distinct genetic or pathophysiological entities or patient groups with distinct natural histories, could support novel pathophysiological insights, allow improved power and efficiency for clinical studies, and increase personalization of treatments.

Historically, classification of sepsis has been limited to etiology and distinguishing severe sepsis from septic shock [3, 4], albeit with the recent addition of the predisposition, infection, response and organ dysfunction (PIRO) framework [5]. The complex physiological abnormalities associated with septic MODS have prevented sophisticated sepsis classification because there are so many relevant abnormalities [6]. Several investigations have probed the individual contributions of specific organ systems: liver [7], kidney [8], and cardiovascular [9]. These prior investigations evaluated associations between specific variables and clinical outcomes. These techniques poorly identify patterns in multiple covariates within complex data [10, 11]. Such patterns may lead to the discovery of clusters that could illuminate previously unrecognized pathophysiologic elements of disease, an important need [12].

Kohonen self-organizing maps (SOMs) are a kind of neural network used to simplify complex, multidimensional data [13]. SOMs group patients to facilitate identification of meaningful clusters. SOMs have identified clusters in asthma, COPD, and pain syndromes [1416]. Given the large amount of physiological data associated with septic MODS, we applied SOMs to patients with severe sepsis or septic shock admitted to the intensive care units (ICUs) of three hospitals. An abstract of preliminary study results has been published [17].

Methods

We retrospectively studied patients with severe sepsis or septic shock, admitted directly from the emergency department (ED) to an ICU between January 1, 2006 and May 15, 2013 at three Utah hospitals: Intermountain Medical Center, LDS Hospital, and McKay-Dee Hospital. The Intermountain Healthcare Institutional Review Board (IRB #1014318) approved this study with waiver of informed consent.

All subjects admitted to the ICU from the ED, either directly or after emergency surgery, were screened as part of an ongoing effort to improve the quality of sepsis care. We excluded subjects younger than 18 years and those in whom data were incomplete. Trained coordinators reviewed each subject’s medical record to identify standard criteria for severe sepsis or septic shock, which includes a suspected infection plus at least two systemic inflammatory response syndrome criteria: temperature >38 or <36 °C, heart rate >90 min−1, respiratory rate >20 min−1 or PaCO2 <32 mmHg, white blood cell count >12,000 or <4000 mm−3 or >10 % immature forms [18], and the further requirement of organ dysfunction or persistent hypotension. The current study cohort overlaps partially with a larger, 11-hospital, cohort identified as part of the adoption of a surviving sepsis care bundle, the results of which (for 2004–2010) have been published [19]. We restricted our study to the three hospitals that had electronic data required to calculate acute physiology and chronic health evaluation II (APACHE II) [20] and sequential organ failure assessment (SOFA) scores; [2] in addition, we extended the cohort period through 2013.

We measured organ dysfunction using the SOFA score and its subcomponents. We measured comorbidities using the Elixhauser formula [21]. Our primary outcome was 30-day all-cause mortality. Our secondary outcome was ICU-free days at 30 days after admission (patients who died within 30 days were considered to have 0 ICU-free days). Three reviewers performed manual chart review to ascertain the etiology of sepsis, with an interobserver kappa = 0.71 from 83 randomly selected charts.

Calculating SOFA scores

For the admission SOFA score, we included data obtained up to 6 h before and 24 h after ICU admission. We included data up to 6 h prior to ICU admission in order to include values obtained in the ED, as clinicians may not repeat analyses during the first ICU day [22, 23]. We calculated SOFA scores for each patient according to standard methods [2] except urine output in the renal sub-score; we only used urine output to reclassify patients with a renal sub-score of 3 to a renal sub-score of 4 when urine output was <200 ml/24 h, due to less reliable charting of hourly urine output (Table 1). The SOFA central nervous system sub-score is based on the Glasgow coma scale (GCS), which is routinely measured in the study ICUs [24]. As per standard methodology, all patients undergoing mechanical ventilation through an endotracheal tube were assigned a GCS verbal score of 1, irrespective of the clinician’s estimate of the patient’s underlying verbal capacity [23]. When arterial blood gases were not available, PaO2/FiO2 was estimated from SpO2/FiO2, using the Ellis inversion [25] of the Severinghaus equation [26] to convert SpO2 to PaO2 as in prior studies [27].

Table 1 SOFA Score

Statistical methods

We utilized SOMs, which are a nonparametric, neural network classification technique used to simplify datasets with large numbers of variables and/or observations [13]. While we describe technical details of the technique in the online Appendix, briefly SOMs group similar patients—based on multivariate distance—and display them on two-dimensional grids called “maps”. Patients close together on the maps are similar to each other in terms of their pattern of organ dysfunction, while patients far apart on the maps are different from each other.

We utilized SOFA sub-scores and age to generate the SOM grids. K-means clustering, a technique for grouping observations on the basis of similarity, was performed on the nodes within the SOM grids to assist with identification of distinct groupings, with manual adjustment to optimize contiguity, as is common with SOMs [28]. We validated the SOM clustering using a permutation technique (with 100 iterations) to compare quantitation (q)-errors for our SOM clustering versus randomly generated datasets [29], to assure that clusters were not identified merely by chance.

We then projected data (e.g., etiology of sepsis, comorbidities, ICU-free days) not utilized in the SOM construction process onto the two-dimensional SOM grids to explore the implications and substructure of the SOM-identified clusters. Because SOMs cannot calculate distance/similarity for incomplete cases, we excluded the few patients with missing data rather than risk the creation of spurious clusters based on imputation.

We used logistic and/or linear regression to explore associations among SOM cluster, SOFA scores, APACHE II scores, and clinical outcomes. Outcomes included 30-day mortality and ICU-free days at 30 days. (We did not incorporate ventilator-free and vasopressor-free days because they are highly collinear with ICU-free days.) In an initial analysis, we used simple multivariate regression to confirm that cluster membership was associated with mortality even after controlling for severity of illness and overall SOFA score. In further exploration of the relevance of cluster membership, we used two statistical methods. First, we calculated likelihood ratios for regression models with and without clusters included. Second, we employed multivariate adaptive regression splines (MARS) to detect statistical interaction between SOM cluster and other predictors. MARS performs an objective, automated feature selection strategy for regression that includes predictors which improve the squared error of a model. MARS thereby limits problems with simple forward or backward stepwise regression techniques [30]. We studied statistical interactions to evaluate the possibility that the relationship between overall severity and mortality differed among clusters, which would provide further evidence that the clusters were identifying meaningful differences among patients.

Summary statistics are reported as: mean/standard deviation, median/interquartile range, or count/percentage, as appropriate. We compared central tendencies with Chi square or Kruskall–Wallis statistic where appropriate. Two-tailed p < 0.05 was considered significant. We performed all analyses in the R Statistical Package, v.3.0.2 (www.r-project.org) using packages kohonen and earth.

Results

During the study period, 2687 patients were admitted to the ICUs from the ED with severe sepsis or septic shock. Of those, 2533 (94 %) patients had non-missing SOFA data and were included in the analysis, constituting the study cohort (Fig. 1). Four apparent clusters were identified by visual inspection of the SOM grids and k-means clustering of nodes. The resulting SOM grids are depicted in Figs. 2 and 3.

Fig. 1
figure 1

Study population, with subjects excluded due to missing data. Asterisk two patients were missing more than one value

Fig. 2
figure 2

Kohonen Self-Organizing Maps. The maps depict both overall clusters and individual nodes (circles) to show the internal patterns within clusters. Nodes represent smaller groupings of patients: each node contains 0–20 patients who are extremely similar to each other. The four clusters are divided by black lines and depicted in (a). bf Show those same clusters, but with depictions of different attributes of the nodes within each cluster. Within each node, the given value is represented by the darkness of the colour in the node. Each node is shaded from white to dark green, where darker colours represent higher average values (e.g., higher SOFA subscore) among the patients in the given node. The patterns visible in (bf) suggest that the four clusters represent: (1) shock with elevated creatinine, (2) minimal multiple organ dysfunction syndrome, (3) shock with hypoxemia and altered mental status, and (4) hepatic disease

Fig. 3
figure 3

Distribution of variables not employed in clustering, across the nodes within each cluster. The clusters depicted here are the same as those depicted in Fig. 2. Each node on the map represents a group of patients and the value displayed represents the mean value of those patients. Within each node, the given value is represented by the darkness of the colour in the node. Each node is shaded from white to dark green, where darker colours represent higher average values (e.g., proportion of patients in that node with urinary infection as the cause of sepsis) among the patients in the given node. A cluster of patients with renal comorbidities is identified (d), which spreads over three clusters. In the pneumonia grid (e), two broad types are apparent: pneumonia with septic shock (f mainly in cluster 3) and pneumonia with severe sepsis (b mainly in cluster 2)

The identified clusters represent four distinct clusters: (1) shock with elevated creatinine, (2) minimal MODS, (3) shock with hypoxemia and altered mental status, and (4) hepatic disease. Mortality (95 % confidence intervals) for these clusters was 11 (8–14), 12 (11–14), 28 (25–32), and 21 (16–26) %, respectively (p < 0.0001 for comparison by Kruskal–Wallis). Further characteristics of the four clusters are presented in Table 2; all predictors differed significantly across clusters (all p < 0.001). Age, chronic obesity and chronic lung disease did not differ across clusters. Patients with chronic renal disease were mainly grouped within the center of the SOM grid, spanning all four clusters. The permutation technique (n = 100) confirmed reasonable quantitation-errors (q = 4.7) that were lower than would be observed by chance in the SOM classification.

Table 2 Descriptive statistics of the clusters

Patients with septic shock were largely in clusters 1 (shock with elevated creatinine) and 3 (shock with hypoxemia and altered mental status), whereas patients with severe sepsis were largely in clusters 2 (minimal MODS) and 4 (hepatic disease). There are two groupings of patients with pneumonia as their etiology of sepsis as seen in Fig. 3e. A group with severe pneumonia is located in cluster 3 (shock with hypoxemia and altered mental status), while a group with milder pneumonia is located in cluster 2 (minimal MODS).

Figure 2e highlights the fact that coagulation dysfunction occurs mostly concomitant with liver dysfunction in cluster 4. There does not appear to be a distinct population of coagulopathy in cluster 3 (shock with hypoxemia and altered mental status).

In an exploratory analysis, we evaluated whether cluster 4 (hepatic disease) was dominated by patients with chronic hepatic failure as defined by Elixhauser comorbidity scoring. Cluster 4 included both chronic liver disease as well as patients with acute hepatic injury pattern of sepsis from various causes (Fig. 3h). As evidence of an acute hepatic injury pattern, 41 % (126 patients) of the patients in cluster 4 did not have chronic liver disease, and they were distributed throughout cluster 4. In an additional exploratory analysis, we did not find significant changes in mortality among clusters when we restricted our analysis to patients who were “full code” at the time of admission.

Results of simple multivariate logistic regression of 30-day mortality are displayed in eTable 1, with p values from the likelihood ratio test. Results of simple multivariate linear regression of ICU-free days are displayed in eTable 2. Our ultimate logistic regression model of 30-day mortality included age, APACHE II score, SOFA score, peak lactate, urinary tract infection, pneumonia, or soft tissue infection as etiology of sepsis, and cluster membership after feature selection using MARS. The MARS algorithm identified an interaction between APACHE II and cluster membership. We therefore developed logistic regression models stratified by cluster membership to allow better visualization of the meaning of the interaction between APACHE II and cluster membership, the results of which are displayed in eTable 3, where we display only those predictors that were significantly associated with mortality. In eTable 4, we display the results of stratified linear regression analysis of ICU-free days, again displaying only those predictors that were significantly associated with outcome.

Discussion

We identified four distinct clusters of MODS in a large cohort of patients with severe sepsis or septic shock. The clusters were largely defined as: shock with renal dysfunction, minimal MODS, shock with hypoxemia and altered mental status, and hepatic dysfunction. These clusters differed substantially from each other in terms of the distribution of outcomes and the associations between severity scores and clinical outcomes. Septic shock tended to dominate clusters 1 (shock with elevated creatinine) and 3 (shock with hypoxemia and altered mental status) and severe sepsis to dominate clusters 2 (minimal MODS) and 4 (hepatic disease), suggesting that clusters exist within the traditional classification of sepsis into severe sepsis and septic shock. The clusters we identified do not simply mirror the traditional classification: patients with septic shock in cluster 1 (shock with elevated creatinine) had mortality similar to patients with severe sepsis in cluster 2 (minimal MODS), while patients with severe sepsis in cluster 4 (hepatic disease) had much higher mortality.

While severe pneumonia patients in septic shock represented a substantial proportion of cluster 3 (shock with hypoxemia and altered mental status) patients, and end-stage liver disease represented a substantial proportion of cluster 4 (hepatic disease) patients, the clusters we identified via SOMs were not merely surrogates for etiology of sepsis or the presence of end-stage liver disease. The presence of cluster 1 corroborates recent evidence suggesting a close association between hypotension and kidney failure among septic patients [31].

Because SOFA relies on GCS for evaluation of central nervous system (CNS) dysfunction, and GCS does not distinguish between decreased verbal ability related to mechanical ventilation from that due to CNS dysfunction, the altered mental status in cluster 3 may reflect mechanical ventilation rather than CNS dysfunction. However, in parallel work, we found that imputed verbal scores (the GCS component most affected by mechanical ventilation) did not substantially affect the association between SOFA and mortality [23]. A technique for identifying CNS dysfunction independent of mechanical ventilation is desirable for future studies in this area.

Prior studies have focused on sepsis subgroups on the basis of general severity (i.e., severe sepsis vs. septic shock) and etiology of sepsis (e.g., pulmonary vs. urinary, etc.) [3], with some early work on the PIRO system [5, 32]. The clusters we identified were largely independent of the etiology of sepsis, with a wide distribution of origins of sepsis throughout the clusters. We observed certain organ dysfunctions as predominating in specific clusters. Specific organ dysfunctions, such as liver [7] or kidney failure [33], have also been singled out as contributing to mortality in severe sepsis/septic shock.

Exploratory analyses of morbid obesity and pre-existing disease of the kidney, lung, and liver demonstrated that the clusters we identified were largely independent of obesity and other comorbidities. We also noted a cluster of patients with renal disease (and elevated SOFA subscores) that extended over clusters 1 (shock with elevated creatinine), 2 (minimal MODS), and 3 (shock with hypoxemia and altered mental status). These findings suggest that the clusters we identified are not surrogates for pre-existing comorbidities, nor are they simply surrogates for the overall severity of sepsis.

Clusters were independent of age despite age being utilized in the SOM mapping process, suggesting that clusters are unrelated to age. The presence of significant statistical interaction suggests that the clusters identify important heterogeneity in MODS among septic patients.

Within cluster 2 (minimal MODS), certain etiologies of sepsis are associated with worse outcome. As compared to abdominal, biliary, or “other” etiologies of sepsis, pneumonia, urinary tract, and soft tissue infection are associated with lower mortality. This association between etiology of sepsis and outcome was not present within other clusters.

Statistical learning techniques such as SOMs have been applied in many areas of biomedicine. While SOMs are applicable to classifying other complex diseases, we believe that they may be especially useful in the critically ill patient. The modern ICU collects vast quantities of laboratory results, vital signs, comorbidities, and imaging into an electronic medical record. These data are complex and multidimensional [6]. Clustering analyses could allow that complexity be reduced to a degree that could be useful for tailored therapy, study enrollment, or improved diagnostics, as well as the potential for improved pathophysiological understanding of MODS. Consistent with other studies of SOMs [1416, 34], because SOMs have been shown to be superior to traditional clustering techniques [35], we did not compare SOM clustering with other cluster techniques.

We anticipate that the clustering performed in this current study will lay the groundwork for additional classifications that incorporate novel biomarkers (several of which have been evaluated recently [36]), including laboratory and physiological measurements and future efforts to understand pathophysiological differences among different clusters.

Our study has several limitations. While our cohort is large and from multiple centers, we do not include a separate validation cohort. We anticipate the need for prospective, external validation of our findings. This study is retrospective, and consequently may suffer from biased missingness and imprecision in measurement related to clinical rather than research measurements. However, because SOFA uses frequently obtained measures that are well established in clinical laboratories, we believe these risks are relatively minor. Additionally, the SOM technique cannot accommodate missing data. However, missing data were rare in this cohort, suggesting that this is not a substantial limitation. We also acknowledge that we used relatively simple measures of organ dysfunction, the sub-scores of the SOFA score, and we used only the admission SOFA score rather than sequential SOFA scores. We did so because those measures are commonly available in clinical practice, but we acknowledge that other physiological and laboratory measures might enhance the sophistication of these clusters. While other studies have not specified how they handled laboratory or physiological values obtained in the ER, we do not believe that our inclusion of data from the ER represents a substantial change in the usual calculation of the admission SOFA score. We also acknowledge that we did not collect data necessary to compare our findings with a PIRO-based approach, while noting that PIRO is not yet a reproducible system for classification [32].

In terms of generalizability, we acknowledge that our mortality is lower than described in some other recent cohorts [37], although it is very similar to the mortality observed in the recent multicenter ProCESS trial [38] and recent reports from Australasia [39]. Our choice to exclude patients who developed sepsis while in hospital (e.g., surgical site infections or hospital-associated pneumonia) does not allow us to describe such patients, who may have higher mortality than patients admitted directly from the ED.

Conclusions

We identified four distinct clusters of MODS within a cohort of critically ill septic patients admitted from the ED to the ICU using Kohonen self-organizing maps. The association between clinical predictors and mortality differed among cluster. These clusters may be useful in supporting tailored research and clinical therapies for septic patients.