INTRODUCTION

Fluid resuscitation is integral to the management of the hypovolaemic, acutely unwell patient,1 but assessing whether patients require intravenous fluid is complex as multiple factors can affect fluid balance.2 The process currently relies on recognising clinical features of hypovolaemia and hypervolaemia but these features are non-specific.3 While competence in fluid assessment is a training requirement for all grades of clinicians,4 preventable morbidity due to intravenous fluid mismanagement is commonly identified, with one in five patients suffering complications.5 An effective assessment tool must strike a balance that limits such harms without impacting upon or delaying the life-saving benefits of fluid resuscitation.

The National Institute of Health and Care Excellence (NICE) intravenous fluid guidelines5 for the UK advocate fluid assessment using a combination of clinical features and bedside tests. Although there is no formal definition of the term “hypovolaemia”, the guidelines list an approach to its identification. However, all but one of the suggested measures are non-specific and two, respiratory rate and the National Early Warning Score,6 do not reflect fluid status but simply identify acutely unwell patients. Of note, the NICE guideline development group stated that their recommendations were based on consensus opinion, extrapolated from systematic reviews within older guidelines7 that focused on acute illness rather than hypovolaemia. There are no equivalent guidelines from American or European healthcare bodies.

Fluid assessment is particularly challenging in sepsis as haemodynamic compromise is primarily caused by three different factors: vasodilatation, increased vascular permeability, and cardiac dysfunction. Ensuring the dose of IV fluid is optimised is important as excess intravenous fluid has been shown to cause harm in patients with sepsis,8,9,10 with further trials ongoing.11, 12 The Surviving Sepsis guidelines advise a dose of fluid over the first 3 h equating to > 2 L in a 70-kg patient.13 Following this initial dose, they recommend additional fluid use is guided by frequent reassessment of hemodynamic status. However, they are unable to specify how this should be performed. Furthermore, they acknowledge that these recommendations are supported by a low quality of evidence.13

Over the last two decades, studies and systematic reviews14,15,16,17 have focused on the ability of fluid assessment tools to predict fluid responsiveness: a haemodynamic improvement following fluid bolus. It is a dynamic test, i.e. describes change over two measurements, in contrast to static tests that use a single measurement. The presence of fluid responsiveness can be estimated through indirect means such as performing a passive leg raise18 which increases cardiac venous return, or by observing changes in measurements during respiration.19

The vast majority of published studies of fluid responsiveness are based in the intensive care unit (ICU), using patients who are sedated and ventilated. Because differences in accuracy between mechanically ventilated and non-mechanically ventilated patients have been observed,20 these findings may not translate to acute medical patients.

This systematic review assessed studies of adults with sepsis who would be appropriate for admission to the acute medical unit (AMU) and compared index tests designed to determine the need for fluid resuscitation with reference standards that identify the presence of fluid responsiveness.

METHODOLOGY

This systematic review was reported using PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-analyses) guidelines.21 Prior to commencing the review, the protocol was published (PROSPERO:CRD42017048651).22

Search Strategy

An electronic database search was undertaken up to 14 June 2018, following consultation with an information specialist, using keywords and subject headings that encompassed three domains: sepsis, intravenous fluid, and patient location. Search strategies were piloted prior to use. Appendix 1 (online) contains a sample search strategy for MEDLINE. The following bibliographic databases were searched: MEDLINE (Ovid) from 1946; Embase (Ovid) from 1947; CINAHL (Ebsco) from 1937; and the Cochrane Library (Wiley) from 1996. No restrictions on publication language or date were applied.

References of included articles and relevant systematic reviews were hand-searched, forward citation searching of included articles was performed via Web of Science, and simplified search strategies were completed in the Zetoc database (The British Library) and the Conference Proceedings Citation Index (Web of Science) to identify grey literature. A search of research registers (ClinicalTrials.gov, UK Clinical Research Network Study Portfolio Database, and World Health Organization International Clinical Trials Registry Platform) was undertaken to identify relevant, ongoing studies.

Results from these searches were entered into reference management software (Endnote version 7.3.1 [Clarivate Analytics, Philadelphia, PA, USA]). Duplicate records were removed using an automated algorithm and subsequent manual searching.

Study Selection

Studies were considered for inclusion based on pre-defined eligibility criteria (Appendix Table 5, online). In brief, these were diagnostic test accuracy studies incorporating an index text used to assess hypovolaemia or the need for fluid resuscitation, including adults with sepsis who were not sedated or anaesthetised. If a study included multiple presenting diagnoses, the authors were contacted to obtain data for patients with sepsis. If > 50% of participants had sepsis or confirmed infection, the study was included regardless. As per the protocol, studies set in ICU were included because of a paucity of evidence. During the search process, it was noted that the majority of studies did not report the proportion of patients who were sedated or anaesthetised. Therefore, the eligibility criteria were changed to exclude patients who were mechanically ventilated.

An initial screening process was completed independently by two reviewers using titles and abstracts. Selected studies were reviewed in full according to the above criteria. Disagreements were resolved through discussion, supported by a third reviewer. Results were recorded using Microsoft Excel 2010 (Microsoft Corporation, Redmond, WA, USA).

Data Extraction and Evidence Synthesis

Data were extracted using a piloted, standardised form, following translation of non-English language articles if required. The following data were extracted: study characteristics (including setting and sample size); patient characteristics (including age, gender, acuity score, blood pressure, heart rate, preceding intravenous fluid volume, concurrent use of vasopressors/inotropes, admission diagnoses); and details of the index test(s), reference standard, and target condition. Study authors were contacted directly to request missing data. Study quality and risk of bias were assessed using a modified version of QUADAS-2.23 The heterogeneity of study design and outcomes precluded a meta-analysis of quantitative results, so a narrative overview of the studies is presented.

RESULTS

Searches returned 26,841 records from the four databases. Two hundred sixteen records were identified from conference proceedings databases and trial registries. Following screening by title and abstract, 463 full-text articles were reviewed. Of the 124 that were diagnostic test accuracy studies, 83 were excluded because the study population was ventilated. Fourteen articles were identified for inclusion.24,25,26,27,28,29,30,31,32,33,34,35,36,37 Figure 1 details the selection process and reasons for exclusion.

Figure 1
figure 1

Flowchart of study selection. Legend: flowchart summarising study selection and inclusion processes in this systematic review, including the reasons for exclusion of all full-text articles that were reviewed.

Study Characteristics

The 14 included studies had a median sample size of 33 (range 14–116) and were primarily single-centre. Three were published as a conference abstract and two were set outside the ICU, both in emergency departments (ED).

Nine studies based inclusion on the presence of a composite definition of shock, most commonly referred to as acute circulatory failure, defined by the presence of one or more haemodynamic markers, clinical signs, and blood tests. Blood pressure was the key driver for study inclusion. In 13 studies, patients met the inclusion criteria if they were hypotensive alone (Table 1). The most common reason for patient exclusion was an arrhythmia (including irregular heart rhythm), used in seven studies. Another excluded all patients with any form of cardiac disease.

Table 1 Main Characteristics of Included Studies

Patient Population

Eight studies exclusively included patients with sepsis. Two authors from the remaining six studies provided individual patient data for septic patients. Of note, patients with an admission diagnosis of heart failure were present in two studies. The recording of preceding intravenous fluid use was limited, mentioned in five studies. In two of these, patients received a median of 4 L prior to participation, double the minimum initial fluid dose for sepsis according to the Surviving Sepsis guidelines13 (Table 2).

Table 2 Main Patient Characteristics

Target Condition

The target condition for all studies was fluid responsiveness. However, its relationship to hypovolaemia was poorly articulated. Nine studies failed to use the term “hypovolaemia” at any point in the rationale, methods, or interpretation of findings (Table 1).

Reference Standard

Table 3 shows the significant heterogeneity in the reference standard, the test used to determine the presence or absence of fluid responsiveness. Only two studies used the same parameter.32, 34 There was marked variation in the proportion of patients meeting the reference standard: median 50% (range 17.4–65.4%).

Table 3 Summary of Reference Standards

Two studies used static tests to identify fluid responsiveness, both from the Surviving Sepsis guidelines.13 The remaining 12 used a dynamic test. One study used absolute (> 10 mmHg) rise in non-invasive systolic blood pressure. Eleven studies used a relative rise in cardiac function; the most common measurements were cardiac index and stroke volume (SV), used by four studies each. The most common measurement tool was echocardiography, used in eight studies.

Of the 12 studies that used a dynamic test, all used a fluid bolus. Nine gave a fixed 500 mL bolus. The other three gave a weight-based bolus that ranged between 490 and 2100 mL for a 70-kg patient. The rate of the bolus ranged between 15 and 30 min, although one study gave 500 mL using a pressure bag and noted a variable infusion rate. Five studies used crystalloid, six used colloid, and one did not specify the type of fluid.

Index Tests

Five categories of index test (i.e. the diagnostic test under evaluation) were identified in the studies and are summarised in Table 4. They included inferior vena cava (IVC) measurements, change following passive leg raise (PLR), change with respiration, change following intravenous fluid, and static measurements.

Table 4 Summary of Studied Index Tests

Six studies assessed inferior vena cava collapsibility index (IVCCI), calculated using the difference in IVC diameter during respiration divided by end-expiratory IVC diameter. Four studies measured IVCCI before a fluid bolus was given and used a measure of left ventricular function to identify fluid responsiveness. The area under the curve of the receiver operating characteristic (AUROC) for IVCCI was similar for each of these studies (0.77, 0.82, 0.82, 0.83). A fifth used systolic blood pressure to identify fluid responsiveness, giving an AUROC of 0.68. The final study looked at IVCCI after a fluid bolus was given and reported an AUROC of 0.91.

Three studies, two published as conference abstracts only, explored haemodynamic change after a PLR. Three separate studies reported haemodynamic change during respiration. A number of different haemodynamic measurements were explored in both categories with SV and pulse pressure the most common. Two studies explored a static tool as part of their primary aim, although others included results for additional index tests (Table 1). Finally, one study assessed haemodynamic change after a fluid bolus, the reference standard in 12 of the other included studies.

Risk of Bias

Several factors contributed to a high overall risk of bias (see Fig. 2). Two studies used static measurements as reference standards (central venous pressure35 and blood pressure/lactate levels36) whilst acknowledging they would not effectively identify fluid responsiveness. This would bias estimates of sensitivity and specificity. The remaining 12 used fluid responsiveness; however, none provided satisfactory evidence to support their choice of reference standard.

Figure 2
figure 2

Risk of bias assessment. Legend: table summarising a risk of bias assessment performed using a modified version of QUADAS-2. 23 “+”: low risk of bias; “?”: unclear risk of bias; “-”: high risk of bias.

Twelve studies failed to report acceptable observer variation data: nine provided no data for the index test, one reported intra-observer variation only, one reported observer variation in a healthy cohort, and one reported inter-observer variation greater than the index test’s threshold values. Twelve studies calculated the index test’s optimal threshold using post hoc analysis with no a priori definition of a diagnostic cut-off. As a result, in the six studies assessing IVCCI, thresholds ranged from > 15 to >50% (median > 37%). Finally, only five studies reported the proportion of patients with uninterpretable results: three for echocardiography (11.5%, 12.8%, and 16.1%) and two for IVCCI (12.5%, 13.5%).

DISCUSSION

This is the first systematic review which explores diagnostic tests that determine the need for fluid resuscitation in adults with sepsis who are not mechanically ventilated. Despite this, all but two studies were based solely in ICU as opposed to a ward or AMU, where the majority of medical patients are managed.

The characteristics of included patients varied between studies and were often poorly described. Six studies combined septic shock with shock arising from very different pathophysiologies, and seven excluded patients with arrhythmia without justification. This exclusion criterion is especially relevant given our ageing population and the increased prevalence of arrhythmia in this cohort (up to 17% of adults > 80 years have atrial fibrillation38).

The most common inclusion criterion was hypotension, which was enough to warrant inclusion by itself in 13 studies. All studies identified fluid responsiveness as the target condition, which was universally seen as equivalent to a benefit from fluid resuscitation.

Fluid responsiveness, the target condition in all studies, was dynamically assessed in 12 studies. However, it was defined in 11 different ways, including variations in the delivery of the fluid bolus, the threshold value, and the measurement tool. A direct measure of cardiac function was used in 11 studies and was assessed by echocardiography in eight studies.

Five categories of index test were described. However, the small size, risk of bias, and clinical heterogeneity of included studies prevented meaningful comparisons. Index test thresholds were chosen post hoc in all but two studies; 12 studies failed to report adequate observer variability data; and none of the included studies specified how their index test should be used in practice, i.e. replace, be added to, or triage for the current diagnostic process. Comparisons were further limited by the inadequate reporting of patient characteristics which impact upon the accuracy of the included index tests. Increases in respiratory rate and tidal volume, frequently seen in critically ill patients, have been shown to affect the IVCCI and respiration variation measurements, for example.39

In summary, there was a small number of heterogeneous studies available to guide the provision of intravenous fluid in adults with sepsis who are not mechanically ventilated. This heterogeneity, and a lack of consensus regarding associated definitions, is found throughout the literature.

Given that fluid responsiveness is widely assumed to be the most effective way of guiding fluid resuscitation, a single measurement strategy with a pre-defined threshold would be of great benefit to the medical community. Currently, however, the heterogeneity surrounding the definition of fluid responsiveness limits our ability to learn from previous research and plan for future studies. Others have noted such heterogeneity40 and have recognised the need for a consensus definition.41 Cardiac output is widely regarded as the best means of determining fluid responsiveness42 and is supported by Starling’s principles.43 However, its use makes the assumption that an increase in cardiac output is always beneficial. In addition, the ability of cardiac output rises to predict benefit from fluid resuscitation has never been tested against more commonly used haemodynamic markers (e.g. blood pressure) and there are concerns about the practicalities of monitoring cardiac output in a medical ward.

Despite excluding mechanically ventilated patients, the eligibility criteria of this systematic review (Appendix Table 5, online) allowed inclusion of studies that are more limited in their generalisability to an acute medical population. Most studies were set in ICU and many participants had been treated for days before inclusion. The absence of patients who are not critically unwell would have artificially increased the sensitivity of all index tests.44 This risk was acknowledged in the protocol22 after scoping identified a lack of evidence outside of ICU.

The wide clinical heterogeneity meant statistical tests to exclude publication bias were not feasible. However, the identification of three unpublished conference proceedings reflects a robust search strategy and suggests that the low number of included studies is due to limited evidence rather than methodological shortcomings.

It should also be noted that the diagnostic test accuracy methodology itself limits the analysis of the index tests. This methodology assumes a dichotomous status for an individual, e.g. hypovolaemic or not hypovolaemic. In reality, a spectrum exists between euvolaemia and hypovolaemia. All described index tests provide the clinician with continuous data and therefore would support a nuanced analysis. Simplifying measurement tools to provide a binary answer limits a clinician’s ability to optimise fluid provision.

This systematic review has highlighted multiple research opportunities that should be pursued. Given the variety of assumptions surrounding fluid responsiveness and its frequency in the literature, it should be explored as a priority. After observer variability data has been gathered, the proportion of well and acutely unwell adults who are fluid responsive should be assessed using several haemodynamic measurements. Consideration should be given to the impact of factors such as age, disease severity, and comorbidities on these proportions. To ensure feasibility outside of ICU, non-invasive measurement tools should be used.

Appropriately powered observational studies should then examine potential associations between fluid responsiveness and commonly used outcome measures. Fluid responsiveness has been noted in health and may simply be a marker of cardiac function, and so should also be explored as a prognostic indicator. Each approach should be tested in different aetiologies of shock.

To support the integration of these diagnostic tests into clinical practice, an understanding of the current decision-making process is needed, which will clarify the exact purpose of the chosen diagnostic test. Beyond this point, a randomised controlled trial will be able to measure its impact in a clinical setting.

CONCLUSION

Intravenous fluid remains a central recommendation in guidelines for the management of sepsis (including the Surviving Sepsis guidelines13 and NICE CG1745) but evidence to support these recommendations is lacking. There is no consensus definition of hypovolaemia. While recent studies have used fluid responsiveness to identify when fluid resuscitation is required, there is no agreement on which reference standard test should define fluid responsiveness. Finally, if fluid responsiveness is identified, there is no evidence to clarify how it should be treated (i.e. which fluid, volume, and rate), or whether treatment is indicated at all.

While this systematic review highlights a lack of evidence, it identifies research recommendations that, if met, would build an evidence base for the provision of fluids in acutely unwell adults outside of ICU. Intravenous fluids are a common treatment across all medical specialities. Prompt administration can be life-saving, but excessive use is associated with patient harm. Research in this field should, therefore, be a priority for the research community.