Introduction

The computation of stroke volume from an arterial pressure waveform is not a new concept. Proposed for the first time in 1904 [1], stroke volume calculation has been improved and refined by many investigators and companies over the last century. Deriving stroke volume from a peripheral arterial pressure curve is very challenging. Indeed, the arterial pressure waveform depends not only on stroke volume but also on arterial compliance, vascular tone, and reflection waves [2, 3]. Most methods currently available on the market require regular manual calibration to capture differences in arterial compliance and vascular tone from one patient to another and in a given patient from one time to another [2, 3]. Therefore, the accuracy of these techniques is highly dependent on the delay between two manual calibrations and on the hemodynamic stability of the patient [4].

A self-calibrated method has been available on the market since 2005 (FloTrac, Edwards LifeSciences, Irvine, CA). This method was described in detail elsewhere [5]. Briefly, cardiac output (CO) is computed from the equation:

$$ {\text{CO }} = {\text{ pulse rate }} \times {\text{ APsd }} \times K, $$

where APsd is the standard deviation of arterial pressure and Κ an autocalibration factor derived from a proprietary multivariate equation. CO is updated every 20 s. This equation includes biometric variables (e.g., age and sex, which, according to the work of Langewouters et al. [6, 7], are known to affect arterial compliance) and “shape variables” describing in mathematical or statistical terms the shape of the arterial pressure curve. The equation was developed from and validated in a human database of arterial pressure tracings and thermodilution CO reference values.

The first-generation software was developed from a limited human database, and the multivariate equation updated only every 10 min. Validation studies were somewhat disappointing [8, 9]. The second-generation software is based on a larger human database, and the multivariate equation (i.e., Κ) is updated every minute (allowing the rapid capture of acute changes in vascular tone). The second-generation software has been shown to be reliable in the measurement of CO, and in the tracking of acute changes in CO but these studies were conducted mostly in cardiac surgery patients [1013]. Some concerns have been raised regarding the second-generation software in patients with hyperdynamic and vasoplegic states [1417]. A relationship (logarithmic) has been established between FloTrac accuracy and systemic vascular resistance (SVR), with FloTrac underestimating CO when SVR is low [15, 16]. To address this limitation, a third-generation software was recently developed from an even larger human database, which contains a greater proportion of hyperdynamic and vasoplegic patients.

The aim of the present multicenter study was to investigate, in a separate cohort of patients than that used to develop the algorithm, whether the third-generation software is able to accurately measure CO in patients with sepsis, particularly when SVR is low.

Materials and methods

The study was conducted in four university-affiliated intensive care units (ICUs) after approval by the ethical committee of each institution. Written informed consent was given by all patients or their legal guardian. Patients with sepsis [18], with a pulmonary artery catheter in place for hemodynamic monitoring, and with a peripheral arterial line in place for continuous blood pressure monitoring were considered for the study. Patients younger than 18 years old or less than 40 kg in weight (as this may limit the use of continuous CO measurement with a pulmonary artery catheter), with significant aortic or tricuspid valve regurgitation, or being treated with an intra-aortic balloon pump were excluded.

Measurements

All patients had a pulmonary artery catheter connected to a specific monitor (Vigilance, Edwards LifeSciences, Irvine, CA, USA) for CO measurement by bolus thermodilution (iCO) and by semi-continuous thermodilution (CCO). Bolus measurements were obtained by using 3–5 cold (<10°C) 10 mL saline boluses (CO-Set, Edwards) randomly injected throughout the respiratory cycle. The temperature of the injectate was measured at the site of injection. The consistency of the thermodilution curve was judged visually on the monitor and all sets of iCO measurements with a reproducibility less than 15% were considered for analysis [19]. CCO was obtained with automated and intermittent heating of filament wire. Five CCO values were averaged over a 5-min period (2 CCO values before and 3 CCO values after iCO measurements). All patients also had a radial (n = 32) or femoral (n = 26) arterial catheter for continuous arterial pressure monitoring. Quality of damping was ensured by visualization of oscillation decay after flushing the lines. The arterial pressure curve was recorded on a computer via a high-fidelity pressure transducer (FloTrac, Edwards Lifesciences) and CO was computed off-line from the arterial pressure curve using the second-generation (COG2, version 1.14) and the third-generation (COG3) FloTrac software. Investigators measuring and collecting iCO values were, therefore, blind to the FloTrac CO measurements. Fifteen COG2 and COG3 values were averaged over a 5-min period (7 values before and 8 values after iCO measurements).

Mean arterial pressure (MAP) was also recorded and total SVR (TSVR) was calculated [central venous pressure (CVP) was not collected] by using the equation:

$$ {\text{TSVR }} = {\text{ MAP }} \times { 8}0/{\text{iCO}} $$

where iCO was the reference CO for the study.

Hemodynamic measurements were performed at the discretion of the attending physician, suggested to be at least every 4 h during daytime whereas more measurements could be obtained in case of hemodynamic instability or to evaluate the effects of therapeutic interventions. Patients were followed for 48 h.

The study was sponsored by Edwards Lifesciences. Computation of CO from arterial traces was performed by Edwards Lifesciences’ technicians. The investigators had full control of the database which was locked before analysis. Data analysis was performed by the investigators (DDB). The manuscript was drafted by the first author, and all authors reviewed the manuscript.

Statistical analysis

We used SPSS software (version 13). Results are expressed as mean ± SD. A p < 0.05 was considered significant. Bias was calculated as the mean difference of COG3, COG2, or CCO minus iCO. Limits of agreement (LOA) were calculated as ±2SD of these differences. Bias and LOA are presented as a percentage (absolute values are also provided in brackets) as suggested by Critchley and Critchley [20], because a difference between measurements (bias) of 1 L/min, for example, is more clinically significant at lower than at higher COs. The percentage error was calculated as two times the SD of the bias over the mean iCO, as previously recommended [16, 20]. A percentage error less than 30% was considered as satisfactory [20]. The precision of the reference technique (iCO) was calculated as twice its coefficient of variability of individual bolus injections at each iCO measurement [21]. Differences in bias were evaluated by a Student t test with Bonferroni correction for multiple comparisons. Differences in LOA were evaluated with a Kolmogorov–Smirnov test [22]. The relationships between TSVR and the differences of COG3, COG2, or CCO minus iCO were tested for each method using a logarithmic regression analysis.

As multiple measurements were obtained, two different analyses were performed to limit the influence of multiple measurements per patient. First we used a correction for Bland and Altman analysis for multiple measurements [23, 24]. Second, as all analyses cannot be corrected by this technique, we also present (in the ESM) an analysis conducted using only the first measurement for each patient.

Finally, we evaluated the ability of the different techniques to track changes in CO using pairs of successive CO measurements, with each CO measurement used only once. For this purpose, the direction and the amplitude of changes were evaluated [21]. Receiver operating characteristic (ROC) curves were constructed to evaluate the ability of each technique to detect concordant directional changes in CO of at least 15%, with iCO used as reference.

Results

A total of 58 patients admitted to four ICUs (Honolulu, Hawaii, USA n = 20; Jena, Germany n = 15; Brussels, Belgium n = 14; and Washington DC, USA n = 9) were enrolled in the study. Patient characteristics are summarized in Table 1. Hemodynamic profiles are provided in Table 1 (aggregate of all measurements during the study period) and in Table 1 in the ESM (baseline measurements). A total of 401 (6.9 ± 5.0 per patient) simultaneous determinations of iCO, COG2, COG3, and CCO were available for comparison. The median time between measurements was 238 (percentiles 25 and 75: 158 and 443 min, respectively) min.

Table 1 Patient characteristics

Accuracy and precision of CO measurements

Overall, iCO ranged from 2.7 to 14.6 L/min, COG2 from 2.5 to 14.4 L/min, COG3 from 2.5 to 17 L/min, and CCO from 2.8 to 16 L/min. The precision of iCO was 12.4%. The mean bias and LOA were −10 and 29% (2.2 L/min) between COG2 and iCO, 0 and 30% (2.2 L/min) between COG3 and iCO, and 9 and 28% (2.1 L/min) between CCO and iCO (Fig. 1). The bias between COG3 and iCO did not differ from 0 (0.2%, 95%CI −3.7 to 4.2%, p = 0.90) and was significantly less negative than the bias between COG2 and iCO (p < 0.0001). Kolmogorov–Smirnov testing showed significant differences in the LOA between COG3 and COG2 but not between COG3 and CCO (p < 0.001 and 0.59, respectively). LOA were similar when COG3 was compared with iCO and CCO (27.2 ± 6.2% for COG3, other data not shown). Similar results were observed when a correction for multiple measurements per patient was not applied (Table 2).

Fig. 1
figure 1

Bland & Altman representations of COG2, COG3, and CCO versus iCO. Panel a shows relation between COG2 and iCO, panel b COG3 and iCO, and panel c CCO and iCO. The bias and limits of agreements, computed with correction for multiple measurements [23, 24], are provided in Table 2

Table 2 Mean bias and limits of agreements (95% confidence interval)

When only the first measurement of CO obtained at inclusion of the patient was considered, similar bias and LOA were observed (Table 2, and Fig. 1 in the ESM; hemodynamic data are presented in Table 1 in the ESM). The bias between COG3 and iCO did not differ from 0 (−0.2%, 95%CI −4 to +4, p = 0.90) and was significantly less negative than the bias between COG2 and iCO (p < 0.0001). Kolmogorov–Smirnov testing showed significant differences in LOA between COG3 and COG2 and between COG3 and CCO (p < 0.001 and <0.01, respectively).

Influence of TSVR on the accuracy of CO measurements

The difference between iCO and COG2 was significantly correlated with TSVR (r 2 = 0.37, p < 0.0001) (Fig. 2). A very weak (r 2 = 0.05) but still statistically significant (p < 0.0001) relationship was also observed for the difference between iCO and COG3 (Fig. 2). The difference between iCO and CCO was not correlated with TSVR (Fig. 2). When only one point per patient was considered, there was no relationship between TSVR and the differences between iCO and COG3 or iCO and CCO, whereas a significant relationship was observed between TSVR and the difference between iCO and COG2 (Fig. 2 in the ESM).

Fig. 2
figure 2

Logarithmic relationships between total systemic vascular resistance (TSVR) and the differences between COG2 and iCO, COG3 and iCO, and CCO and iCO using all patient data. Panel a shows relation between COG2 and iCO, panel b COG3 and iCO, and panel c CCO and iCO

Separating datapoints according to the median TSVR value or even to extreme quartiles of TSVR showed that the percentage errors for COG3 were similar for all TSVR ranges explored (Table 2 in the ESM).

Detection of changes in CO measurements

The AUC of the ROC curves for detecting changes in iCO of more than 15% in the same direction were 0.79 (0.72–0.87) for COG3, 0.78 (0.69–0.86) for COG2, 0.75 (0.66–0.84) for CCO, and 0.51 (0.42–0.62) for MAP (Fig. 3 in the ESM). The AUCs for COG3, COG2, and CCO were significantly higher than 0.5 and than MAP (p < 0.001) but there were no differences among the different techniques. Of note, MAP was unable to detect changes in CO in the same direction (p = 0.80). The sensitivity, specificity, positive and negative predictive values, positive and negative likelihood ratios, and percentage of correct classification (with 95%CI) were, respectively, 0.78 (0.60–0.89), 0.78 (0.69–0.84), 0.50 (0.36–0.65), 0.93 (0.85–0.96), 3.5 (2.4–5.1), 0.3 (0.2–0.5), and 0.78 (0.61–0.90) for G3; 0.76 (0.57–0.88), 0.75 (0.66–0.82), 0.44 (0.31–0.57), 0.92 (0.85–0.96), 2.9 (2.1–4.3), 0.3 (0.2–0.6), and 0.75 (0.63–0.84) for G2; and 0.72 (0.55–0.85), 0.72 (0.64–0.80), 0.43 (0.30–0.56), 0.90 (0.83–0.95), 2.6 (1.9–3.7), 0.4 (0.2–0.6), and 0.73 (0.59–0.80) for CCO.

There were no differences in the magnitude of changes in CO detected by iCO and by the other techniques (Table 3 in the ESM).

Influence of radial versus femoral line site on the accuracy of CO measurements

The bias (−3.5 vs. −1.8%) and percentage error (30.4 vs. 26.6%) between COG3 and iCO were slightly greater for radial than for femoral sites, but these differences were not significant (ns). Similar trends were observed for bias (−13.4 and −11.6) and percentage errors (33.6 vs. 30.9%) between COG2 and iCO in radial and femoral sites (p = ns).

The difference between iCO and COG3 was significantly correlated with TSVR at the radial site (r 2 = 0.13, p < 0.0001) but not at the femoral site (r 2 = 0.01, p = 0.74). The difference between iCO and COG2 was significantly correlated with TSVR at both sites, but the correlation was greater at the radial (r 2 = 0.51, p < 0.0001) than at the femoral (r 2 = 0.19, p < 0.001) site.

Discussion

Our study shows that the third-generation FloTrac software is more accurate (lower bias), as precise (%error < 30), and much less affected by SVR than the second-generation software.

Several studies have shown that the second-generation FloTrac is accurate (bias ranging between −0.30 to +0.19 L/min) and precise (%error < 30) compared with thermodilution techniques [1013]. However, other studies showed that the second-generation FloTrac may underestimate CO in hyperdynamic and vasoplegic states [1417]. In patients undergoing liver transplantation, Biais et al. [15] were the first to report a (logarithmic) relationship between FloTrac accuracy and SVR, such that the lower the SVR, the greater the bias between the FloTrac value and the reference thermodilution value. These findings were recently confirmed by Biancofiore et al. [16] in the same clinical setting. Our results obtained in septic patients are in line with these findings in that we also observed a large bias between COG2 and iCO and a logarithmic relationship between TSVR and COG2.

It has been hypothesized that significant gradients between central and peripheral arterial pulse pressures may be responsible for the underestimation of CO with second-generation FloTrac software in patients with low SVR. Indeed, such a gradient, likely related to a significant decrease in peripheral reflection waves, has been reported in vasoplegic and/or septic states and was shown to be reversible as soon as patients recovered [2529]. Logically, this “decoupling” of aortic and radial pulse pressure may be responsible for an underestimation of CO when it is computed from a peripheral arterial pressure curve. In this regard, it is important to note that all pulse contour methods should be affected by this physiological phenomenon. Results from several studies using different pulse contour methods are consistent with this hypothesis. When comparing a pulse contour method to pulmonary thermodilution in patients with septic shock, Jellema et al. [30] showed that the limits of agreement were almost two times wider in patients with low SVR (<800 dyn/(s cm5)) than in other patients (SVR > 800 dyn/(s cm5)). More recently, in cardiac surgical patients, Yamashita et al. [31] showed that when SVR was decreased dose-dependently by prostaglandin E1 infusion, the PiCCO pulse contour method underestimated CO by up to 40% compared with the reference thermodilution method. The same authors observed similar influences of vasodilation on bias and precision between the LiDCO pulse contour method and thermodilution [32]. Another group recently reported that differences between femoral and radial arterial traces may develop during patient course and generate differences in CO up to 3.0 L/min [29]. Although we strongly suspect that a decoupling between central and peripheral pulse pressure may be responsible for the lack of accuracy of pulse contour methods in vasoplegic states, evidence to support this hypothesis is still lacking and our study was not designed to address this issue.

Consistent with this hypothesis, we nevertheless found that bias and percentage error tended to be higher in the radial than the femoral site for both COG2 and COG3. This was not related to factors specific to these patients, as bias and percentage errors between CCO and iCO did not differ between patients equipped with radial or femoral lines. Even though the percentage error with COG3 using a radial line was still acceptable in these septic patients, femoral arterial lines should be preferred in these patients, whenever feasible, for CO monitoring and for arterial pressure measurements.

The new third-generation FloTrac software was developed from a human database containing many recordings from septic and liver transplant patients who were often vasodilated. Some proprietary arterial pressure waveform characteristics were identified in this subset of patients and integrated as new “shape variables” in the multivariate equation used to calculate and update Κ every minute. As a result, precision has been preserved and accuracy has been improved and is now at least as good as that of semi-continuous pulmonary thermodilution (CCO), a method widely used and accepted in critically ill patients.

Our study has some limitations. First, the ideal reference method for measuring CO has not been described. However, the most commonly used technique is an averaged set of bolus thermodilution values taken from a pulmonary artery catheter, reported to have a precision of 10–20% [19, 20]. Ideally, we should have used a highly reliable reference standard to make comparisons, such as an aortic flow probe applied directly to the aorta, but this is obviously not possible in ICU patients. However, we did optimize the precision of our thermodilution measurements by using sets of measurements with a reproducibility less than 15% [19] and using bolus thermodilution measurements as a reference. Moreover, we report similar percentage error when CCO was used as an alternative reference technique. Second, we studied patients with sepsis-induced vasoplegia, most of them receiving vasoactive support, with low TSVR as intended. Whether our findings may be extrapolated to patients with drug-induced (e.g., during anesthesia induction) or ischemia/reperfusion-induced (e.g., post liver reperfusion during transplantation surgery) vasoplegia remains to be determined. One may argue that changing the algorithm could decrease the ability of the software to measure CO in normal or high TSVR states. Our study was not specifically designed to test this hypothesis, but several measurements were obtained later in the patients’ course when SVR had normalized. In these conditions, the bias was minimal with COG3, as with previous versions, suggesting that the software also performs adequately in high and normal TSVR states. This should be confirmed in another prospective trial. Third, TSVR was computed without CVP, inducing an overestimation of SVR of around 15%. Of note, TSVR provides a better estimate of arterial compliance, a key determinant of the relationship between stroke volume and arterial waveform, than SVR as CVP is not included in this relationship. Finally, we used both radial and femoral arterial lines. Even though these were mixed in the main analysis, a separate analysis allowed us to look at the specific behavior of each catheterization site, as reported above.

In conclusion, our study shows that in patients with sepsis, the third-generation FloTrac software is more accurate, as precise, and influenced much less by TSVR than the second-generation software. Our data also demonstrate that the overall performance of the third-generation FloTrac is comparable to semi-continuous pulmonary thermodilution, a technique already widely used in septic patients.