Introduction

The early and accurate identification of infection in a critically ill patient, either as a cause of their critical illness or as a complication of it, is key to reducing infection-associated morbidity and mortality from delayed treatment [1].

Critical illness of almost any aetiology, however, can mimic infection. Classic signs of infection (e.g. pyrexia, tachycardia) are common and non-specific amongst patients in the intensive care unit (ICU) [2, 3]. It is therefore tempting to adopt a ‘just in case’ mindset to the treatment of infection in the ICU to avoid missing a curable infectious disease. This approach involves the widespread, relatively indiscriminate administration of broad-spectrum antimicrobials to critically ill patients.

However, unnecessary antimicrobial use lead to harms, including adverse drug events, risk of secondary opportunistic infections, and antimicrobial resistance [2, 4]. Such harms are likely to be particularly pronounced in the critically ill population, partly due to the underlying critical illness that increases susceptibility to organ injury.

A biological marker (biomarker) is defined as ‘a characteristic that is objectively measured and evaluated as an indicator of a normal biological process, pathogenic processes, or pharmacological responses to therapeutic interventions’ [5]. Biomarkers have been touted as a solution to the clinical dilemma of differentiating infection from non-infectious mimics [6]. Diagnostic biomarkers of infection can be broadly divided into two groups: (i) those that identify microbes and (ii) those that identify the host response to microbes. Point-of-care tests that aim to identify the presence of microbes are being developed, but these assays are susceptible to false positives due to the identification of microorganisms that are not responsible for patient deterioration (i.e. from contamination or colonisation) [2].

Herein, we discuss current approaches to the diagnosis of infection in the ICU, with an emphasis on commonly used biomarkers of the host response to infection. We also discuss emerging approaches to biomarker development and examine how we can improve the quality of sepsis biomarker research with the aim of improving outcomes for critically ill patients with infection.

Current Methods of Diagnosing Sepsis

Clinical Gestalt

In the absence of a reliable independent biomarker of infection, clinicians rely heavily on clinical gestalt, which is informed by adjunctive laboratory tests of the host response to infection including white cell count (WCC), C-reactive protein (CRP) and/or procalcitonin (PCT), and microbiological cultures. Clinical gestalt describes the concept that clinicians actively organise clinical perceptions into a coherent construct with the aim of answering a clinical question. It involves making a decision in the absence of complete information based on pattern recognition.

Clinicians rarely rely on a single clinical sign (e.g. fever) or a biomarker result to decide whether a patient has an infection. Rather, they make an ‘educated guess’ that a patient likely has an infection based on their history, examination, and available laboratory, microbiology, and/or radiology results without applying any specific evidence-based clinical decision rules. Such an approach affords flexibility and the inclusion of factors that may be difficult to quantify [7] but may be inaccurate depending on the experience and ability of the clinician [8].

The inaccuracy of clinical gestalt in the diagnosis of sepsis in ICU patients has been demonstrated in multiple studies [9,10,11]. In a recent multicentre clinical cohort examined by Lopansri and colleagues [10], agreement between initial attending physician diagnosis of sepsis in ICU patients compared to retrospective diagnosis by an expert panel was weak (free-marginal kappa agreement 0.58). The tendency was to overdiagnose sepsis early, with ~30% of those initially determined to be septic being subsequently reclassified retrospectively. Overtreatment was also observed, with ~60% of patients ultimately classified as having a non-infectious systemic inflammatory response syndrome (SIRS) being given systemic antimicrobials.

One reason that clinical gestalt may perform so poorly when it comes to the diagnosis of infection in critically ill patients is that it is susceptible to multiple cognitive biases, including base-rate neglect, outcome, and premature closure (Table 1). Given the potential for poor outcomes with delayed antimicrobial initiation [1], clinicians may focus on the perceived benefit of antimicrobial administration even if the presentation may not be consistent with an infectious diagnosis—an example of base-rate neglect [12]. Clinicians tend to therefore over-estimate the true prevalence of infection [10, 13•, 14].

Table 1 Examples of biases applied to the diagnosis of infection [15, 16]

Base-rate neglect is intertwined with outcome bias in that clinicians may inappropriately attribute a patient’s clinical improvement to antimicrobial administration as part of a presumed infection diagnosis, even in cases where infection is unlikely [12]. Clinicians may also rely on the perceived safety of antimicrobials in ascribing a patient’s presentation to infection, despite evidence demonstrating significant harms from unnecessary antimicrobial therapy [2]. Finally, healthcare institutional pressures may bias clinicians towards a diagnosis of infection. For example, a study by Kanwar and colleagues [17] identified an increase in community acquired pneumonia diagnosis rates when a 4-h antimicrobial administration rule was implemented in the emergency department (ED).

To avoid such clinical biases, a clinician may put increased weight on biomarkers of infection, which may be perceived as potentially more objective. Some of the more commonly used in current clinical practice are discussed in detail below.

White Cell Count

The WCC has long been considered an important and readily available clinical marker of infection. However, the utility of this commonly used haematological parameter in the diagnosis of infection in critically ill patients is questionable.

In a retrospective review of 2279 patients admitted to a tertiary hospital with suspected infection, an abnormal WCC (≥12×109/L or ≤4×109/L) was demonstrated to have a sensitivity, specificity, positive predictive value, and negative predictive value of 52.5%, 52.8%, 63.3%, and 41.7%, respectively [18]. The poor diagnostic utility of the total WCC in ICU patients is a consistent finding [19]. This is unsurprising given the wide variety of non-infectious mimics, drugs, and comorbidities that influence the total WCC [20,21,22,23,24]. Further, the total WCC may decrease acutely due to sequestration of granulocytes at the site of infection, with a delay in the release of neutrophils from the bone marrow [25]. Therefore, the clinician may see a different WCC picture depending on the time the sample was taken relative to symptom onset.

To improve the diagnostic yield of WCC in diagnosing infection, other methods of interrogating the WCC have been explored, including assessment of immature granulocytes and, more recently, the neutrophil-lymphocyte ratio (NLR).

Immature granulocytes (‘bands’) are released from bone marrow following cytokine stimulation. The presence of bands is described as a ‘left shift’ and has been associated with infection (OR 8.91, 95% CI 3.46–22.93) [19] and with a sensitivity and specificity of 84% and 71%, respectively, when an 8.5% cut-off is used [26]. In separate studies, increasing the cut-off to 19–25% improved the specificity to approximately 86% with a corresponding reduction in the sensitivity by up to 25% [27]. The presence of toxic neutrophil features, such as toxic granules and cytoplasmic vacuoles, was the most sensitive blood film feature with a sensitivity of 88% [27, 28].

Another potentially discerning feature of the WCC and differential is the NLR. During periods of physiological stress and in response to bacterial infections, the neutrophil count typically increases with a reduction in the lymphocyte population, potentially due to cortisol release [29, 30]. In one study of 1572 patients in the ED with suspected sepsis, the NLR modestly predicted bacteraemia (AUROC 0.71, 95% CI 0.67–0.75), which was not statistically different from PCT [31]. Moreover, the NLR modestly predicted microbiologically confirmed infection (AUROC 0.63, 95% CI 0.61–0.66) [31]. Similar results have been observed in other cohorts [32, 33]. The utility of the NLR decreases with viral infection [34]. Overall, the NLR has improved sensitivity but reduced specificity when compared with other commonly used biomarkers (CRP and PCT) for diagnosing bacterial sepsis [31, 32].

Taken together, the total value of the WCC is a poorly performing diagnostic test of infection. Other measures, such as the NLR and band count may be an improvement on the total WCC but still lack both the sensitivity and specificity to discern inflammation from infection in critically ill patients.

C-Reactive Protein

CRP is an acute phase reactant synthesised and released from the liver in response to infection and inflammation [35]. Numerous studies have been performed detailing the diagnostic accuracy of CRP for infection.

A meta-analysis of 12 studies investigating the diagnosis of bacterial infections in immunocompetent patients primarily with sepsis identified a CRP sensitivity and specificity of 0.75 (95% CI 0.62–0.84) and 0.67 (95% CI 0.56–0.77), respectively, a finding which has been replicated in ICU cohorts [36, 37]. Conversely, CRP was not able to differentiate patients with and without microbiologically proven infection meeting Sepsis-3 criteria with a CRP AUROC of only 0.57 (95% CI 0.54–0.60) [31, 38•]. Results between studies may vary due to different cut-off values used. Overall, the optimal cut-offs identified generally range between 20 and 100 mg/L, with lower and higher CRP concentrations improving sensitivity and specificity, respectively [31, 36, 37, 39, 40]. This should also be considered when utilising the CRP to differentiate between bacterial and viral infections. CRP has a sensitivity range of 0.61 to 1 and specificity of 0.26 to 1 depending on the cut-off CRP value used in the study to differentiate bacterial from viral infections [36, 41]. Optimal sensitivity and specificity cut-off values are approximately 40 mg/L and >80 mg/L, respectively, in distinguishing bacterial from viral infections [36, 41,42,43]. No difference in CRP has been shown between patients with and without SARS-CoV-2 (COVID-19); however, the CRP 95% CI ranges from 24.7 to 31.4 mg/L suggesting that high CRP concentrations may imply an alternative diagnosis [44, 45].

A commonly touted reason for the variability in the diagnostic utility of CRP between studies is patient heterogeneity. However, no appreciable difference exists in the CRP AUROC, sensitivity, or specificity for diagnosing infection in patients with febrile neutropenia compared with the previously described immunocompetent patients [40]. This is similar to other immunocompromised populations (chronic liver disease patients, transplant patients, human immunodeficiency virus [HIV] patients, and haematopoietic stem cell transplant recipients) and patients with hepatic dysfunction, with the potential exception of fulminant hepatic failure [39, 46•, 47, 48]. Therefore, CRP may assist clinicians where the WCC and differential may be less useful, such as those with febrile neutropenia.

One method to improve the diagnostic utility of CRP may be to repeat the test at different time points. Patients presenting early in the illness course may have an increased risk of initial false negative results resulting in delayed treatment initiation that may be associated with a mortality of 19.4% [35, 49]. Daily CRP monitoring in ICU patients with an increase of >41 mg/L from the previous day predicted bacterial infection with a sensitivity and specificity of 0.92 and 0.71, respectively, with similar results improving the diagnosis of bacterial versus viral infections [50, 51].

Taken together, CRP has modest diagnostic potential in differentiating infectious from non-infectious inflammation; however, a ‘normal’ initial CRP does not exclude the potential for severe bacterial infection and should not be used to withhold appropriate treatment.

Procalcitonin

PCT is one of the most widely studied sepsis diagnostic biomarkers in ICU [52, 53]. PCT, the peptide precursor to calcitonin, is released from thyroid C glands at undetectable concentrations healthy individuals, but is synthesised in extra-thyroid tissues in response to infection and inflammation via interleukins and tumour necrosis factor-α [52, 53].

Several systematic reviews and meta-analyses have been performed that describe the utility of PCT in distinguishing bacterial infections or sepsis from SIRS [36, 54,55,56,57,58,59,60,61,62,63,64,65,66, 67•, 68]. The overall sensitivity and specificity range from 0.72 to 0.93 and 0.64 to 0.84, respectively [36, 54,55,56,57,58,59,60,61,62,63,64,65,66, 67•, 68]. The associated positive and negative likelihood ratio ranged 3 to 5.9 and 0.11 to 0.44, respectively [36, 54,55,56,57,58,59,60,61,62,63,64,65,66, 67•, 68]. Taken together, results suggest that PCT has modest discriminatory ability of distinguishing bacterial infection or sepsis from non-infectious inflammation; however, a negative result cannot rule out infection.

Potential differences between study results may be related to specific patient populations and infections. A lower diagnostic AUROC (0.71) and sensitivity (0.66, 95% CI 0.54–0.76) was identified in immunocompromised patients with bacteraemia when compared with immunocompetent patients (sensitivity 0.76, 95% CI 0.75–0.83) [57]. Similarly, in patients with renal impairment (estimated glomerular filtration rate <30 mL/min/1.73m2), PCT poorly differentiated infectious from non-infectious inflammation, which may be related to potentially higher baseline PCT concentrations in this patient cohort, although results are mixed [69, 70]. PCT had reduced sensitivity but increased specificity in differentiating bacterial infection from disease flare in patients with autoimmune conditions [71]. Conversely, there was no appreciable difference in the sensitivity, specificity, or AUROC in differentiating bacterial infection from non-infectious inflammation between medical and surgical/trauma patients, paediatric and adult patients, patients with hepatic dysfunction, and burns patients [47, 54, 56, 58, 61, 63].

Heterogeneity may also result from different infectious aetiologies. PCT has mixed results in the ability to differentiate bacterial from viral infections with sensitivity and specificity ranges of between 0.64 to 0.95 and 0.42 to 0.91, respectively [36, 65, 72•]. More recently, no difference in PCT concentrations were identified between patients with and without COVID-19; however, the PCT concentration was low with the 95% CI ranging from only 0.07 to 0.12 ng/mL [44, 45]. Similar diagnostic results have been shown in distinguishing bacterial from fungal infections [66, 73]. PCT can differentiate extracellular bacterial pathogens from intracellular bacteria, such as those commonly causing atypical community acquired pneumonia (Legionella spp., Chlamydophila pneumoniae, Mycoplasma pneumoniae), where the latter atypical bacteria usually do not appreciably increase the PCT concentration [74•]. Similar findings have been observed in intracellular bacteraemia [75]. Overall, PCT cut-offs <2 ng/mL improve diagnostic sensitivity and may improve early initiation of treatment [76]. However, even with this low cut-off, clinicians should exercise caution in patients presenting acutely who may yet to have a detectable increase in PCT given the delay to peak concentrations of up to 24-h, although no benefit of repeated measurements over a shorter time frame of 12-h has been shown in ICU patients [77, 78]. Conversely, PCT concentrations >2 ng/mL improve diagnostic specificity and may assist in confirming treatment requirement for extracellular bacterial infection.

PCT is a potentially useful biomarker in assisting clinicians in the diagnosis of bacterial infections but may be limited by variability in some patient subgroups and infectious aetiologies. The clinician should consider the probability of bacterial infection as a continuum of the PCT with an increased likelihood at higher PCT concentrations.

Biomarkers for Antimicrobial Treatment Modification

Antimicrobial de-escalation (reducing antimicrobial spectrum) and truncated treatment durations have been proposed as methods to reduce the ecological collateral impact of antimicrobial use without compromising (and potentially improving) patient outcomes [79], a therapeutic decision which may be guided by biomarkers.

Data are available for CRP and PCT demonstrating a role in guiding antimicrobial cessation. Two recent studies did not show any difference in clinical cure rates or clinically significant difference in antimicrobial treatment durations for critically ill patients receiving CRP-guided antimicrobial therapy, compared with either a fixed 7-day course or clinician-directed treatment [80, 81]. On the other hand, a PCT concentration <1 ng/mL and/or a reduction from the peak concentration by 65–90% has been used to safely cease antimicrobial therapy. This method has been shown to reduce the duration of antimicrobials by ~2 days, without influencing length of stay or patient mortality [82,83,84]. However, the mean length of antimicrobial therapy in the PCT-guided, CRP-guided, and standard care treatment arms for most studies remains >7 days [82,83,84]. Emerging evidence suggests that antimicrobial durations as short as 5–7 days may be appropriate in many clinical circumstances, such as community-acquired pneumonia or intra-abdominal infections with appropriate source control [85,86,87,88,89]. It is currently unclear whether CRP or PCT may individualise the patient’s treatment duration by reducing the duration below that recommended for ‘short-course’ therapy or whether there is a role for existing biomarkers in other forms of antimicrobial de-escalation.

Emerging Research into Diagnostic Biomarkers of Sepsis

Hundreds of diagnostic sepsis biomarkers been reported in the literature [90]. In addition to the ones described above, other commonly purported diagnostic protein biomarkers include sTREM-1 [91, 92], suPAR [93], CD64 [94], presepsin [95, 96], pentraxin-3 [97], calprotectin [98], and IL-6 [99]. However, none of these biomarkers have yet to have proven obviously superior when compared with each other and currently used biomarkers such as PCT.

Given the biological complexity of sepsis, a strategy based on a panel of multiple biomarkers is likely to have more potential to meet the needs of an ideal diagnostic test than any single biomarker [2]. Thus, using combinations of previously identified biomarkers has been recently investigated to improve diagnosis accuracy [100,101,102,103]. For example, a recent meta-analysis concluded that the combined application of CRP and CD64 improved the accuracy of neonatal sepsis diagnosis [102]. Combining a panel of biomarkers together with clinical information may prove even more accurate [90, 104].

Others are investigating the host gene response to infection in order to develop a genetic ‘sepsis signature’ from differentially expressed genes. There are currently two commercial tests based on host gene expression under development, SeptiCyte LAB (Immunexpress Inc., Seattle WA, USA) [105] and HostDX Sepsis (Inflammatix, Burlingame CA, USA) [106]. The diagnostic accuracies of these tests have been recently independently validated using gene expression data prospectively collected in an ICU-based randomised control trial, revealing a AUROC of 0.8 using the HostDX Sepsis ‘Sepsis MetaScore’ and an AUROC of 0.68 for the SeptiCyte Lab SeptiScore [107]. Performance of these assays improves in less heterogeneous subpopulations [107, 108].

The use of site-specific biomarkers is an emerging research area that may assist clinicians to differentiate infection from either colonisation or aseptic inflammation. For example, pentraxin-3 in bronchoalveolar lavage samples has been shown to predict pneumonia in intubated patients [109]. Similarly, the use of cerebrospinal fluid analysis may also assist with central nervous system infections. Cerebrospinal fluid interleukin-1β was able to differentiate identify culture-positive results in patients with suspected ventriculitis and ventriculoperitoneal shunt infections [110]. Cerebrospinal fluid kynurenine [111] and faecal CXCL-5 messenger-RNA [112] also represent new potential site-specific biomarkers.

Alternative avenues of biomarker research include exploring the biomechanical properties of immune cells. For example, Crawford and colleagues have conceptualised an assay based on changes in granulocyte deformability in patients with sepsis compared controls [113]. Others have proposed measuring leucocyte adhesion [114]. However, these biomechanical assays remain in their infancy.

Finally, there has been increasing interest in machine learning techniques to improve diagnostic accuracy. Machine learning enables the incorporation and quantification of the effect of multiple non-linear variables, allowing for the identification of unfamiliar trends and the development of prediction models. A recent systematic review demonstrated that machine learning models can accurately predict sepsis onset with good discrimination in retrospective cohorts [115]. Combining machine learning algorithms using electronic medical record data together with previously identified biomarkers may further increase diagnostic accuracy [108, 116]. However, prospective studies are required to ensure the clinical relevance of machine learning techniques. Notably, machine learning techniques remain susceptible to many of the same limitations of other research approaches, especially in the absence of a gold standard diagnostic test and prospective validation [117•].

Sepsis biomarker discovery remains a very much active field with several new investigative approaches. However, in order to ensure effective translation to the bedside, several prerequisites need to be fulfilled. Some of the problems of, and solutions to, biomarker research are discussed below.

Addressing Problems with Sepsis Biomarker Research

Unfortunately, sepsis biomarker research is riddled with problems that have precluded the translation of many newly discovered biomarkers into routine clinical practice. Some of these problems have been recently described [90, 118].

The biomarker literature is littered with small studies (<100 patients) with a high risk of bias and an absence of large-scale, high-quality prospective randomised trials [90, 118].

Further, many studies use control groups that are extremely unlikely to have sepsis, which not only fails to reflect the ‘real world’ utility of sepsis biomarkers but also result in an over-estimation of biomarker performance. In the worst examples, sepsis biomarkers are evaluated against healthy control non-hospitalised subjects. Like any laboratory-based biomarker (e.g. troponin, d-dimer), it is proposed that sepsis biomarkers are best used in cases where true clinical equipoise exists [119], that is, when the bedside clinician is unsure as to whether to prescribe an antimicrobial.

Hindering this approach has been the absence of a true gold standard for diagnosing sepsis, even retrospectively. Microbiological cultures are a commonly used gold standard but can be problematic due to contamination, colonisation, and the high prevalence of culture negative infection [2]. In ICU patients, 28–49% of patients with a syndrome consistent with likely sepsis have negative cultures [1, 120,121,122,123]. One proposed solution to this problem is to use a panel of independent clinicians to retrospectively diagnose sepsis using a combination of: clinical findings; laboratory, radiological, and microbiological culture results; and response to therapy. Ideally, the group of clinicians would include at least one clinical microbiologist.

Another commonly stated problem impeding sepsis biomarker research is the heterogeneity of the sepsis syndrome. The host immune response is likely to vary depending on the precise microbiological organ insult and host [124•]. A possible solution to this is testing sepsis biomarkers well-defined cohorts based on a specific disease (e.g. burns, community acquired pneumonia) or identified sepsis subphenotypes [124•].

A summary of common problems encountered in biomarker research and possible solutions are described in Table 2.

Table 2 Problems encountered in sepsis diagnostic biomarker research

Conclusions

Given the above problems troubling sepsis biomarker research, it is not surprising that the vast majority of discovered host diagnostic biomarkers of sepsis have not found their way into routine ICU clinical practice. The biomarkers that currently influence antimicrobial decision-making for many clinicians, including WCC, CRP, and PCT, do not meet the prerequisite accuracy and validity to be utilised as an independent diagnostic test to guide the initiation of sepsis treatment.

Whilst we await the emergence of a valid, accurate point-of-care diagnostic test for sepsis, we would endorse an approach to current sepsis diagnostic biomarkers as described recently by Huang and Ramirez [119]. Namely, the clinician at the bedside should consider the pre-test probability for the diagnosis and consider whether confounding factors exist that could interfere with the interpretation of the result.

Novel approaches to sepsis diagnostic biomarker research, including the use of combination biomarker panels and/or incorporation of biomarkers into machine learning models, hold promise. We await further large-scale prospective studies assessing these techniques, with the ultimate aim of demonstrating improvements in patient-oriented outcomes in critically ill patients with sepsis.