Introduction

Sepsis is a leading cause of mortality in children worldwide [1]. In children age ≥1 year in the United States, sepsis is the second most common cause of death [2, 3]. The incidence of sepsis in children is increasing. Researchers in one study found an 81 % increase in hospitalizations for severe sepsis between 1995 and 2005 [46]. In addition, sepsis may lead to significant physical, neuropsychological, and neurocognitive morbidity in survivors [710]. For example, in a recent multicenter study, investigators found that 34 % of survivors of severe sepsis had a decline in their functional status at 28 days, and 18 % were determined to have a “poor” functional outcome (moderate, severe, or vegetative disability) [10].

Early, appropriate antibiotic therapy, fluid resuscitation, and vasoactive support are associated with improved outcomes after sepsis in a highly time-dependent manner [1113]. The first steps in this therapy are the early diagnosis of sepsis followed by the appropriate stratification of patients (e.g., admission to a hospital ward for observation or to a more intensive monitoring environment in a critical care unit). These first steps are left to the physician’s judgment, as there is currently no accepted, accurate biomarker available to help in making this decision [1315]. Physicians’ judgment in pediatric emergency departments (EDs) with experienced physicians and in tertiary pediatric intensive care units (PICUs) is likely quite different from that in smaller centers without extensive experience with sepsis in children [16]. The vast majority (>80 %) of children requiring emergency care present to an ED that does not have this specialized pediatric expertise [16, 17]. This problem has led to “great interest in developing diagnostic and stratification biomarkers for sepsis” [18]. A systems biology approach to biomarker phenotyping of the systemic response to sepsis has the potential to provide diagnostic and patient stratification profiles that inform clinical decision-making [19]. Unlike more proximal genomic and transcriptomic analysis, untargeted metabolomics and targeted proteomics reflect the downstream systems-level metabolic processes and pathways at play, defining specific phenotypes and shedding light on underlying pathophysiology [20, 21].

In previous work, our group has found that metabolomic modeling in adults accurately differentiated 39 patients with septic shock patients from 20 intensive care unit (ICU) control patients and that in children, it differentiated 58 PICU patients with septic shock from 39 PICU control patients [22, 23]. In adults, combining metabolomic and protein mediator data more accurately differentiated patients with septic shock from ICU controls [24]. In these studies, our group demonstrated promising metabolic biomarker profiles for diagnosing septic shock in ICU patients. In the present study, we asked different questions:

1. Is there a biomarker-defined phenotype that can differentiate children with sepsis who require intensive care from those who do not?

2. Does combining metabolic profiling with an analysis of protein mediators improve modeling?

3. Are there a limited number of biomarkers that may be used for targeted phenotyping in future studies?

As there is currently no objective gold standard for making this decision in the clinic, we used the judgment of experienced regional pediatric ED and PICU attending physicians as the standard for triage location of care. This standard is the same as that used for telemedicine and pediatric transport systems, where the expertise at the specialized center is relied upon [16].

Methods

Ethical approval

This study was approved by the Health Research Ethics Board of the University of Alberta (Pro00008797) and the Conjoint Health Research Ethics Board of the University of Calgary (Ethics ID 23426).

Patient cohorts

PICU sepsis cohort

The Alberta Sepsis Network (ASN) prospectively enrolled all eligible children up to age 17 years admitted to the only two PICUs in the Province of Alberta, Canada, with a diagnosis of sepsis between April 2010 and October 2013. Sepsis was defined as the systemic inflammatory response syndrome (SIRS) caused by a suspected or proven bacterial or fungal infection [25], with antibiotics ordered and an arterial and/or central venous line in place. Patients not expected to survive ≥24 h, refusing intubation or vasoactive infusions (i.e., palliative care), or already having had severe sepsis for ≥48 h (defined as sepsis with cardiovascular dysfunction, acute respiratory distress syndrome, or two other organ dysfunctions) [25] were excluded. Demographic, infection and severity-of-illness variables (including pediatric logistic organ dysfunction [PELOD] and Pediatric Risk of Mortality [PRISM III] scores) were recorded prospectively [26, 27]. Blood was drawn as soon as possible on the day the patient met the eligibility criteria, using deferred consent. If consent was subsequently refused, the blood was not used and the patient was not enrolled. Patients were divided into the predefined 2–5-year-old and 6–17-year-old age groups on the basis of suspected pathophysiology, as done by others [23, 25], with microbiologically confirmed sepsis (positive culture from a normally sterile site, including blood, cerebrospinal fluid, peritoneal fluid, or tissue) or pneumonia without microbiological confirmation (SIRS with chest infiltrate suggestive of pneumonia). We did not consider sputum or endotracheal aspirates as sterile site cultures or require them as confirmatory of pneumonia, and it is not current practice to perform bronchoalveolar lavage in patients with suspected pneumonia. The site of infection was defined as that diagnosed by the attending medical team.

ED sepsis cohort

The ASN prospectively enrolled all children age ≤17 years admitted to one pediatric ED in Alberta with a diagnosis of sepsis. Sepsis was defined as SIRS caused by a suspected or proven bacterial or fungal infection [25], with antibiotics and blood culture ordered. Patients who were admitted to the PICU from the ED were included in the PICU sepsis cohort and not in the ED sepsis cohort. Basic demographic information was recorded, and blood was drawn as soon as possible on the day the patient met the eligibility criteria after informed consent was obtained. The site of infection was defined as that diagnosed by the attending medical team.

ED control cohort

In the same pediatric ED, all previously healthy children age ≤17 years admitted for a procedure and without an infection (i.e., no history of fever within 2 weeks and no clinical evidence of infection) were prospectively enrolled. These children had traumatic lacerations, fractures and/or dislocations, or foreign body removal requiring intravenous sedation and/or analgesia. Blood was drawn after consent was obtained, and basic demographic information was recorded.

Sample collection and preparation

Samples were obtained via an existing arterial or central venous catheter (PICU sepsis cohort) or an intravenous tube insertion or blood culture draw (ED cohorts). Sample processing was performed as described in Additional file 1.

Proton nuclear magnetic resonance spectroscopy and metabolite concentration profiling

Proton nuclear magnetic resonance (1H-NMR) spectra were acquired on a Bruker AVANCE-II 600 MHz spectrometer (Bruker BioSpin, Milton, ON, Canada). All spectra were randomly ordered for untargeted profiling to avoid progressive bias. Untargeted profiling involves identification of different compounds by their characteristic spectral signature using information that is stored in an external metabolite reference database [2830]. Detailed methods are described in Additional file 1.

Protein mediator profiling

Quantification of targeted protein mediators (cytokines, chemokines, and acute-phase proteins involved in inflammation) was done using validated Luminex bead-based multiplexing assays according to manufacturer’s instructions (Luminex, Austin, TX, USA). Detailed methods are described in Additional file 1.

Statistical modeling

Multivariate statistical analysis was performed using the SIMCA-P+ software (v12.0.1; Umetrics, Umeå, Sweden) [19, 3135]. All metabolites or protein mediators with >50 % missing values were excluded from analysis. Data preprocessing (median fold change normalization, logarithmic transformation, centering, and unit variance scaling) was first conducted separately for the metabolic and protein mediator datasets and then for the combined dataset [33]. An unsupervised principal component analysis (PCA) was performed to obtain an overview of the multivariate dataset and to identify and exclude outliers that could seriously disturb the supervised models [31]. Outliers are defined as those samples that are situated outside the 95 % confidence interval of the Hotelling’s T 2 distribution (elliptical or spherical area in the score scatterplot) [31]. Then, supervised partial least squares discriminant analysis (PLS-DA) and orthogonal PLS-DA (OPLS-DA) models were developed to determine the best class discrimination (PICU sepsis, ED sepsis, and ED control) based on the preprocessed original data [3135]. The OPLS-DA method was applied to models including only two classes. The OPLS-DA models for the combined datasets were based on potentially relevant metabolites and protein mediator data selected using the variable importance to projection (VIP), and only those variables with VIP >1 were chosen [31]. In supervised analysis, R 2Y (the percentage of variation explained by the model) and Q 2 (the predictive ability of the model) metrics were calculated using a sevenfold cross-validation method [31, 36]. Additionally, the OPLS-DA models were validated by calculating coefficient of variation-analysis of variance p values and the receiver operating characteristic curve (ROC) (Metz ROC Software; University of Chicago, Chicago, IL USA) [37, 38]. The sensitivity, specificity, and accuracy were determined on the basis of sample class prediction during sevenfold cross-validation (Y-predcv) using SIMCA-P+ software. To describe specific biopatterns for 2–17-year-old children using combined NMR and protein mediator data, the OPLS-DA regression coefficients were calculated and metabolites and/or protein mediators with significant changes in concentration (p <0.05) were considered as the most important variables [31].

Results

Description of the cohorts

The demographics, sites of infection, and severity-of-illness measures for each age and category cohort are given in Table 1. Children meeting ASN eligibility were prospectively entered into the PICU-ASN database after deferred consent was obtained (refusal rate of 22 %). Of 205 patients in the PICU-ASN database, those under 24 months of age (n =63), with missing data (n =3), or without clear bacterial sepsis (unknown, viral or other causes of SIRS) were excluded from this PICU sepsis nested cohort. The PICU sepsis cohort patients had PELOD and PRISM III severity-of-illness scores comparable to other PICU sepsis trial patients [39, 40]. Most PICU patients were ventilated (for about 1 week) and received supportive vasoactive infusions on the first day of sepsis. A significant number required renal replacement therapy and extracorporeal life support during their PICU stay. The average PICU length of stay after sepsis was about 1 week. The ED sepsis cohorts showed comparable demographics, except for having fewer comorbidities and better values for platelet count and systolic blood pressure. The ED control cohorts had proportionately more male subjects. Only 3 (3 %) of 94 of the PICU sepsis cohort were admitted to the PICU from the ward; these 3 patients had blood for analysis drawn in the ED, and none were outliers in the models.

Table 1 Description of the three cohorts of patients

2–17-year-old children

Ninety-four PICU sepsis, 81 ED sepsis, and 63 ED control patients were included. A total of 58 metabolites and 48 protein mediators were recognized and quantified for each patient (Additional file 1: Table E1). Totals of 1.12 % and 2.01 % missing values were observed in the NMR and protein mediator datasets, and these were randomly distributed.

Profiling

Metabolomic profiling

Three principal components (PCs) were calculated via cross-validation to build the PCA model, with good data grouping, explaining the following percentages of variation: PC1 17.7 %, PC2 11.7 %, and PC3 7.4 % (Fig. 1a and Additional file 2: Figure E7a). There were nine outliers comprising 6 (6 %) in the PICU sepsis cohort and 3 (4 %) in the ED sepsis cohort. These outliers were excluded from subsequent analyses. A supervised PLS-DA showed that the three different cohorts were well clustered, with specific metabolic profiles for each. The model showed excellent goodness of fit (cumulative R 2Y =0.56) and goodness of prediction (cumulative Q 2 =0.47) (Fig. 2a). The OPLS-DA method was applied to compare metabolic variance in patient groups consisting of only two classes. The score scatterplots for each statistical analysis show clear separation of groups, with high values for the R 2Y and Q 2 parameters (Fig. 3a and Table 2). The predictive accuracy statistics for differentiating PICU sepsis from ED sepsis (Table 2) show accuracy of 0.89 and area under the ROC (AUROC) of 0.96 (standard deviation [SD] 0.01).

Fig. 1
figure 1

Principal component analysis (PCA) results for the 2–17-year-old cohorts based on the preprocessed original data (where each point represents one patient). a Metabolomic profiling data. b Inflammatory protein mediator profiling data. c Combined biomarker profiling data. The three-dimensional PCA score scatterplots show the distribution of observations (red dots, pediatric intensive care unit sepsis patients; blue dots, emergency department [ED] sepsis patients; green dots, ED controls) in the three-dimensional space formed by principal components PC1, PC2, and PC3. The PCs are the lines in the multivariable dimensional space (variables: metabolites, protein mediators) that best approximate the observations in the least squares sense. The sphere describes the 95 % confidence interval of the Hotelling’s T 2 distribution

Fig. 2
figure 2

Partial least squares discriminant analysis (PLS-DA) for the 2–17-year-old cohorts based on the preprocessed original data. a Metabolomic profiling data. b Inflammatory protein mediator profiling data. c Combined biomarker profiling data. The three-dimensional PLS-DA score scatterplots show the distribution of observations (red dots, pediatric intensive care unit [PICU] sepsis patients; blue dots, emergency department [ED] sepsis patients; green dots, ED controls) in the three-dimensional space formed by PLS components (PLS1, PLS2, and PLS3). During the model construction, a discriminant plane (PLS component) was found in which the projected observations were well separated according to the class (PICU sepsis, ED sepsis, ED controls). The sphere describes the 95 % confidence interval of the Hotelling’s T 2 distribution

Fig. 3
figure 3

Orthogonal partial least squares discriminant analysis for the 2–17-year-old cohorts, using metabolomic profiling (a), protein-mediator profiling (b), and combined biomarker profiling (c) data. Red dots, pediatric intensive care unit (PICU) sepsis cohort (primary sepsis); blue dots, emergency department (ED) sepsis cohort (secondary sepsis); green dots, ED control cohort

Table 2 Accuracy results of orthogonal partial least squares discriminant analysis models

Protein mediator profiling

PCA revealed three PCs explaining the following percentages of variation: PC1 19.4 %, PC2 14.6 %, and PC3 7.6 % (Fig. 1b and Additional file 2: Figure E7b). A total of 20 outliers comprising 13 (14 %) in the PICU sepsis cohort, 4 (5 %) in the ED sepsis cohort, and 3 (5 %) in the ED control cohort were excluded from further analyses. The PLS-DA model shows that the three different cohorts are reasonably well clustered, with an R 2Y cumulative score of 0.54 and a Q 2 cumulative score of 0.47 (Fig. 2b). The score scatterplots for each OPLS-DA statistical analysis show clear separation of groups, with high values for the R 2Y and Q 2 parameters (Fig. 3b and Table 2). The predictive accuracy statistics for differentiating PICU sepsis from ED sepsis (Table 2) showed accuracy of 0.78 and AUROC of 0.88 (SD 0.03), which were not as high as the values for NMR metabolomics.

Combined results

When PCA was performed, a three-PC model explained the following percentages of variation: PC1 21.5 %, PC2 8.7 %, and PC3 6.5 % (Fig. 1c and Additional file 2: Figure E7c). There were eight outliers comprising seven (7 %) in the PICU sepsis cohort and one (1 %) in the ED sepsis cohort. These outliers were excluded from subsequent analyses. A supervised PLS-DA shows that the three different cohorts are well clustered, with excellent model descriptive values: cumulative R 2Y 0.63 and cumulative Q 2 0.56 (Fig. 2c). The score scatterplots for each OPLS-DA statistical analysis show clear separation of the groups, with high values for the R 2Y and Q 2 parameters (Fig. 3c and Table 2). The predictive accuracy statistics for differentiating PICU sepsis from ED sepsis (Table 2) show accuracy of 0.87 and AUROC 0.95 (SD 0.01). This model had very similar accuracy statistics compared with NMR alone; however, the sensitivity was higher (0.90 vs 0.86), the specificity was lower (0.85 vs 0.91), and goodness of prediction was higher (Q 2 0.62 vs 0.60).

Models without outliers excluded

To confirm that the outliers detected in the PCA models (and subsequently excluded) did not bias the results of the supervised analyses, we recalculated the PLS-DA and OPLS-DA models including all outliers. This had only minor influence on the discriminative and predictive ability of the models (Additional file 1: Table E2). For example, for the combined dataset model differentiating PICU sepsis from ED sepsis, R 2Y =0.67 and Q 2 =0.63 compared with the model with outliers excluded where R 2Y =0.69 and Q 2 =0.62.

Age subgroups

Full details of the results for the separate metabolomic and protein mediator analyses and the combined analyses in the 2–5-year-old and the 6–17-year-old children are given in Additional files 1 and 2. A summary of the results is shown in Table 2.

Most meaningful metabolites and protein mediators

Using the combined data, a total of 14 metabolites and 3 protein mediators defined the most significant differences responsible for the separation between PICU sepsis and ED sepsis cohorts in the OPLS-DA model (Fig. 4).

Fig. 4
figure 4

The regression coefficient plot for the orthogonal partial least squares discriminant analysis model differentiating pediatric intensive care unit (PICU) sepsis from emergency department (ED) sepsis cohorts in children ages 2–17 years (Fig. 3c). Positive values of the coefficients indicate increased concentrations in the PICU sepsis cohort samples, and negative values indicate a decrease in concentration in the PICU sepsis cohort samples, compared with the ED sepsis cohort samples. Only statistically significant metabolites and protein mediators are shown (p <0.05). A2M α-macroglobulin, SAA serum amyloid A, TRAIL tumor necrosis factor-related apoptosis-inducing ligand

Discussion

The main findings of this nested cohort comparison of three groups of patients ages 2–17 years with or without sepsis include the following. First, our metabolomic analysis using 1H-NMR spectroscopy clearly distinguished patients with sepsis requiring care in a PICU (n =94) from those with sepsis in the ED (n =81) and those without sepsis in the ED (controls, n =63) never requiring care in a PICU, with few outliers excluded from the model (6 % of the PICU sepsis cohort). Second, protein mediator profiling also distinguished between these groups; however, there were more outliers that had to be excluded from these analyses, limiting the accuracy of this method alone. Third, combining metabolomic and protein mediator data produced relatively few outliers (7 % of the PICU sepsis cohort), AUROC of 0.95 (SD 0.01), and the strongest model (Q 2 =0.62), being more informative than either dataset alone. Fourth, the prespecified age subgroups resulted in only marginally more accurate models, but at the price of smaller patient numbers, and more outliers were excluded from model development. Fifth, a small group of 17 metabolites and protein mediators accounted for the separation of PICU sepsis and ED sepsis cohorts with 95 % confidence. Taken together, our data suggest that development of a laboratory test using these findings to help make diagnostic and triage decisions in children with sepsis is feasible.

These results are important for several reasons. First, they demonstrate proof of concept for using biomarker phenotyping in a clinical environment, generating information on patient biology that can guide clinicians’ diagnosis and triage decisions [19]. Second, the potential of this biomarker phenotyping has been demonstrated in a disease where decisions are time-sensitive [1113]. In sepsis, poor decisions can mean the difference between survival and death and between survival with or without significant functional and/or neurocognitive sequelae [713]. For example, in meningococcemia, the symptoms in the first 4–6 h are non-specific, and many children are initially misdiagnosed by their physicians [41]. Third, these results were obtained in children, and regionalization of clinical experience in pediatric EDs and PICUs leaves most hospitals where children present without the specialized expertise to make timely diagnostic and triage decisions [16, 17]. Biomarker phenotyping accurately tracked the clinical decisions made by the specialized pediatric physicians, differentiating a PICU cohort that had high need for ventilation and vasoactive infusions and a prolonged PICU length of stay from an ED cohort that did not require the specialized and costly care of a PICU. Fourth, by using this discovery and systems biology–based approach, we identified a limited number of metabolites and protein mediators of interest that may realistically lead to development of a point-of-care decision aid and inform future research into the mechanisms of severe sepsis in children.

Whether the 17 metabolites and mediators of interest cause or are the result of manifestations of sepsis cannot be determined on the basis of the observational design of our study. Nevertheless, overall, they suggest broad changes in metabolic and inflammatory processes induced by severe sepsis, identified together as a specific biopattern to inform patient triage and possible pathophysiological mechanisms (see Additional file 1). For example, the metabolite changes suggest PICU sepsis–associated enhanced fatty acid breakdown, ketoacidosis, and dysfunction in amino acid metabolism [30]; hepatic glycogen catabolism [42]; and disruption in glycerophospholipid and sulfur metabolism [30]. The protein mediator changes suggest altered leukocyte recruitment and function [4345]. A representative metabolic pathway network identified the most impaired biological pathways in PICU sepsis: taurine and hypotaurine metabolism; glycine, serine, and threonine metabolism; aminoacyl–transfer RNA biosynthesis; pyruvate metabolism; and, in 6–17-year-olds, arginine and proline metabolism [46]. This is similar to the most perturbed pathways in a previous PICU septic shock (vs PICU control) cohort [23].

This study has some limitations. It was performed at two centers, and the ED cohorts were from one center, possibly limiting the generalizability of the results. The number of patients included in each cohort was modest. The exact timing of the onset of sepsis in patients is unclear, as some present at different times in the course of the systemic response. In addition, the time from presentation until blood was drawn was likely different in the PICU sepsis and ED sepsis cohorts. We cannot be sure if different models would be obtained if stricter criteria for time course were applied. The PICU and ED sepsis cohorts were broadly separated by age groups a priori, but not strictly matched for age, sex, and clinical severity, and the definition of sepsis was more stringent in the PICU cohort. The ED control group was meant to reflect children with acute stress known not to be due to infection; thus, that group included children with predominantly fractures and lacerations. We did not determine whether the model could differentiate PICU sepsis and ED sepsis from other non-sepsis diagnoses that may require intensive care. Nevertheless, these patients were recruited from the only two PICUs serving the Province of Alberta and much of Northern Canada, with a catchment population >4 million. The biomarker phenotyping was applied in the real-world setting of patients presenting to the hospital with sepsis, regardless of the exact time of onset of their disease. Finally, the number of patients included is the largest sample for biomarker phenotyping in children of which we are aware [18, 19, 22, 23, 47].

We did not compare our biomarker phenotyping with existing Pediatric Early Warning Scores (PEWSs) used in the ED. We do not believe that this is a major limitation, for several reasons. First, most of these scores include, in addition to vital signs, subjective descriptions of the level of consciousness, capillary refilling, work of breathing, and worry about clinical status [48]. The goal of our model is to allow decisions that do not rely on this subjective expertise. Second, evaluations of the existing PEWSs have concluded that they are not accurate enough to replace clinical judgment, having inadequate discriminant ability for predicting PICU admission [4851]. A related limitation is that we did not determine whether PICU sepsis patients initially presented with obvious fluid-refractory or vasoactive-dependent sepsis making a biomarker unnecessary. We retrospectively determined timing of interventions in ASN patients admitted to one PICU and found that, in patients ventilated on day 1 of sepsis, the times from initial presentation to 20 ml/kg volume bolus or vasoactive infusions were, on average, >3 h and >8 h, respectively. This suggests that few patients were declared to have fluid-refractory or vasoactive-dependent septic shock in the first hours after presentation to the ED.

A prospective validation of our findings in an independent multicenter cohort using clear definitions of sepsis upon presentation to an ED is needed. Although there is need for some caution [52, 53], we believe that developing a point-of-care test targeted at detecting the metabolites and protein mediators of interest identified here holds great promise, as recently found for gene expression mosaics [54]. This may involve enzyme-linked immunosorbent assay or novel, rapid liquid chromatography-mass spectrometry techniques [55, 56].

Conclusions

In children ages 2–17 years, combining metabolomic and inflammatory protein mediator profiling early after presentation can differentiate children with sepsis requiring care in a PICU from children with or without sepsis who can be safely cared for outside a PICU. By using this discovery and systems biology–based approach, we identified a limited number of metabolites and protein mediators of interest that may realistically lead to development of a point-of-care decision aid. This may aid triage decisions, particularly in EDs without pediatric expertise. This finding requires validation in an independent cohort.

Key messages

• In children ages 2–17 years, combining metabolomic and inflammatory protein mediator profiling on serum and plasma early after presentation can differentiate children with sepsis requiring care in a PICU from children with or without sepsis who can be safely cared for outside a PICU.

• By using this discovery and systems biology–based approach, we identified a limited number of metabolites and protein mediators of interest that may realistically lead to development of a point-of-care decision aid.