The accuracy of arterial lines (AL) using the flush test or stopcock test has not been described in children, nor has the difference between invasive arterial blood pressure (IABP) and non-invasive cuff blood pressure (NIBP).
After ethics approval and consent, we performed the flush test and stopcock test on AL (to determine over damping, under damping, and optimal damping), and determined the difference (NIBP–IABP) in systolic, diastolic, and mean blood pressure (ΔSBP, ΔDBP, and ΔMAP). The primary outcome was incidence (95 % CI) of optimally damped AL. Predictors of ΔBP (effect size (95 % CI)) were determined using multiple linear regression.
There were 147 AL tests in 100 enrolled patients with mean age 44.7 (SD 56) months, weight 16.8 (SD 18.3) kg, male 59 %, postoperative-cardiovascular 52 %, peripheral-AL 78 %, inotropes 29 %, vasodilators 15 %, and ventilated 73 %. The flush test performed in 66 patients (45 %) showed optimal damping in 30 (46 %; 95 % CI 34, 57 %), over damping in 25 (38 %) and under damping in 11 patients (17 %). The stopcock test was over-damped in 128/146 patients (88 %), with the same damping as the flush test in 24/64 (38 %). In optimally damped (flush test) AL, ΔSBP, ΔDBP, and ΔMAP were 0.8 (SD 12.2), −5.2 (SD 8.7), and −4.9 (SD 7.6), respectively. A second set of AL tests was done 2 h later on the same day in 62 patients; AL damping often changed (10/28 flush tests) and ΔBPs correlated poorly (r = 0.31–0.55). Predictors (effect size) of ΔDBP were vasodilator infusion (15.6 (2.9 to 28.3); p = 0.016) and optimal damping (−7.2 (−12.2 to −2.2); p = 0.005); and of ΔMAP were vasodilator infusion (10.0 (−0.3 to 20.4); p = 0.057) and optimal damping (−4.0 (−8 to 0.1); p = 0.058). There were no independent predictors of damping category (n = 66 flush tests).
Optimally damped AL occur in half of critically ill children, and this is not predictable. There is much variability in ∆BP between NIBP and the gold standard IABP, and this varies even in the same patient on the same day, and is not easily predictable. In critically ill children, NIBP may not be accurate enough to guide management, and more attention to ensuring the AL is optimally damped is needed.
Blood pressure is a crucial vital sign in critically ill children. Accurate measurement of blood pressure is assumed in the diagnosis of hypovolemic, cardiogenic, vasodilatory, and obstructive shock, and of hypertension from any cause. Accurate measurement of blood pressure is also assumed in the (often urgent) management of any of these conditions with volume, vasoactive medication infusions, and even extracorporeal support. Even triage and resource allocation decisions, such as whether to transport and admit to the pediatric intensive care unit (PICU), are often based on the assumption of accurate blood pressure measurement. Nevertheless, there is no study we are aware of that determines the accuracy of blood pressure measurement in children in the PICU, whether invasive arterial blood pressure (IABP) is measured using an invasive arterial line (AL), or whether non-invasive cuff blood pressure (NIBP) is measured.
Accurate IABP measurement requires an optimally damped measurement system, and if the pressure is over or under damped the measured IABP is theoretically underestimated or overestimated, respectively [1, 2]. Determination of the damping condition of the AL can be done using a flush test, whereby a small volume of fluid is rapidly infused into the system, and the subsequent ‘ringing’ of the waveforms is recorded and used to calculate the natural frequency (how rapidly the system oscillates after a stimulus) and amplitude ratio (or damping coefficient; how quickly the system comes to rest due to frictional forces after a stimulus) [1, 2]. This test was first described in 1981, and has since been used in studies in adults in intensive care, and is suggested in standard anesthesia texts [1–4]. This test can easily be done when AL are set up with an Intraflo continuous flush element. Alternatively, it has been suggested that closing the stopcock to the continuously infusing AL for several seconds followed by quickly opening the stopcock will result in a similar rapid flush to the system. To our knowledge, the usefulness of this stopcock test has never been reported.
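The two quantities read from the flush-test ‘ringing’ can be illustrated with a minimal sketch. It assumes the usual model of the catheter-transducer system as an under-damped second-order system, and that the amplitude ratio is taken between two successive half-cycle excursions (a peak and the trough that follows it); if two successive peaks in the same direction are measured instead, π² in the formula becomes 4π². The function names and the 25 mm/s paper speed are hypothetical and are not taken from the study manual, and the sketch does not reproduce the published graph used to assign the damping category.

```python
import math


def natural_frequency_hz(paper_speed_mm_s, cycle_length_mm):
    """Natural frequency: paper speed divided by the length of one full oscillation."""
    if paper_speed_mm_s <= 0 or cycle_length_mm <= 0:
        raise ValueError("paper speed and cycle length must be positive")
    return paper_speed_mm_s / cycle_length_mm


def damping_coefficient(amplitude_ratio):
    """Damping coefficient from the ratio of two successive half-cycle amplitudes.

    Assumes an under-damped second-order system, for which
    zeta = -ln(r) / sqrt(pi^2 + ln(r)^2), with r = A2 / A1 and 0 < r < 1.
    """
    if not 0 < amplitude_ratio < 1:
        raise ValueError("amplitude ratio must lie strictly between 0 and 1")
    log_ratio = math.log(amplitude_ratio)
    return -log_ratio / math.sqrt(math.pi ** 2 + log_ratio ** 2)


if __name__ == "__main__":
    # Hypothetical printout: one oscillation spans 1.25 mm at 25 mm/s, ratio 0.5.
    print(natural_frequency_hz(25.0, 1.25))
    print(round(damping_coefficient(0.5), 3))
```

A smaller amplitude ratio means the oscillation dies away faster, so the computed damping coefficient is larger; an AL with no ‘ringing’ at all has no measurable ratio and cannot be passed to this calculation.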
In this study we aimed to determine the accuracy of AL measurement of IABP using the flush test and stopcock test. In addition, we aimed to determine the difference between IABP and NIBP in critically ill children, particularly when the AL is known to be accurate (optimally damped).
This study was approved by the Health Research Ethics Board of the University of Alberta. Although NIBP is routinely measured in the PICU, the ethics committee required signed informed consent from legal guardians prior to inclusion in the study because it was decided that the extra NIBP measurement may be of discomfort to the patients.
All patients in PICU at Stollery Children’s Hospital, and who had an AL, were eligible for the study from late June to mid October 2015. Exclusion criteria were: extracorporeal life support; abnormal aortic arch, including after subclavian flap repair of coarctation; known non-functioning AL (e.g., losing waveform and no longer thought to be accurate, or unable to withdraw bloodwork); ongoing patient agitation; AL in an umbilical site; or lack of signed consent. A case report form and study instruction manual were created prior to patient recruitment, with standard definitions, calculation instructions, and procedure instructions (Additional files 1 and 2). Demographic (age, sex, diagnostic category), severity of illness (inotrope infusion score, vasodilator infusion in use, ventilation in use), potentially confounding factors for NIBP (obesity defined as over the 90th percentile weight for age; severe edema in the limb used for NIBP; chronic hypertension; and obstructive airway disease), and site of the AL (peripheral or femoral) variables were recorded.
A flush test was done for children weighing ≥10 kg and the AL waveform printed for later calculation of natural frequency and amplitude ratio, and (using a published graph) determination of optimal, under, or over damping of the AL (see Additional file 3 for arterial line setup, and flush test demonstrations) [1, 2]. A stopcock test was then done and the AL waveform printed for later calculations. The flush test could only be done on patients ≥10 kg in weight because in our PICU the Intraflo continuous flush element is not used on smaller patients. Following the flush and stopcock tests, the NIBP was measured in a different limb to the one with the AL. The NIBP and IABP were recorded at the same time (i.e., the IABP at the end of deflation of the cuff), including systolic (SBP), diastolic (DBP), and mean (MAP) pressures. If the difference in SBP was >10 mmHg, the NIBP and IABP were repeated, and the closest (in SBP) of the two measurements was recorded in the case report form. These procedures were done on the day (d) of the AL as follows: d1–3, d4–6, and d7–10 as appropriate; on d1–3 a second set of procedures was done 2–3 hours later on the same day if this occurred during working hours.
The arterial line was set up as follows. The intra-arterial catheter (24 G (5/8 in), 22 G (1 in), or 20 G (1-1/4 in)) was connected to a straight connector (ICU Medical Smallbore Extension Set with MicroClave® clear; 7 in, 0.24 ml; San Clemente, CA, USA), a stopcock (Hi-Flo™ Smiths Medical, Brisbane, Australia), and high pressure tubing with disposable transducer (Edwards Lifesciences TruWave™ 3 cc/72 in (180 cm) pressure monitoring set). The transducer was connected by the invasive pressure cable into the blood pressure module of the Philips IntelliVue MP70 bedside monitor. In patients weighing at least 10 kg, the transducer set was connected to the 300 mmHg pressure bag to run at 3 ml/h. In patients under 10 kg, the transducer set was connected to the Alaris pump (Attach SmartSite® Burette Set, CareFusion, San Diego, CA, USA) to run at 1.5 ml/h. The pressure monitoring set has an Intraflo continuous flush element pigtail that can be pulled to allow rapid flush of the system, and this is functional when not on the Alaris pump. The NIBP was done using Philips EasyCare cuffs of appropriate size, and connected with the NIBP pressure cable into the blood pressure module of the Philips IntelliVue monitor.
The primary outcome was the proportion of AL that were non-optimally damped on the flush test, with adjusted Wald 95 % confidence intervals (CI). Assuming a similar prevalence of non-optimally damped AL as described recently in adults in intensive care (31 %), we estimated that a sample of 80 children would allow a reasonable 95 % CI of +/− 10 %. Secondary outcomes planned were: difference between NIBP and IABP (ΔSBP, ΔDBP, and ΔMAP) according to the damping category of the AL, described as mean (SD) and median (IQR) difference, and with Bland-Altman plots; comparison of the flush test and stopcock test, described as the proportion of tests having the same damping result; predictors of optimally damped AL using multiple logistic regression models; and predictors of the NIBP–IABP difference using multiple linear regression models. Pre-specified variables entered in the regression models were: gender, weight, inotrope in use, vasodilator in use, ventilation in use, peripheral site of the AL, day category of AL (d1–3, d4–6, or d7–10), diagnostic category of the patient (postoperative cardiovascular vs other), and optimally damped AL (for predicting NIBP–IABP). Finally, correlation between the first and second set of procedures on the same day (for the d1–3 category AL) was determined for ΔSBP, ΔDBP, and ΔMAP, and the difference between these variables on the two tests described. Sensitivity analyses were done using results from only the first testing of the AL that were in the d1–3 category.
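The interval and sample size reasoning above can be sketched as follows. This is a minimal illustration, assuming that "adjusted Wald" refers to the Agresti-Coull form (z²/2 pseudo-successes added to the numerator and z² pseudo-trials added to the denominator) and that the planning estimate used the simple Wald half-width; the function names are hypothetical and this is not the authors' analysis code.

```python
import math

Z_95 = 1.959964  # two-sided 95 % quantile of the standard normal distribution


def adjusted_wald_ci(successes, n, z=Z_95):
    """Adjusted Wald (Agresti-Coull) confidence interval for a proportion."""
    if n <= 0 or not 0 <= successes <= n:
        raise ValueError("need n > 0 and 0 <= successes <= n")
    n_adj = n + z ** 2
    p_adj = (successes + z ** 2 / 2) / n_adj
    half_width = z * math.sqrt(p_adj * (1 - p_adj) / n_adj)
    return max(0.0, p_adj - half_width), min(1.0, p_adj + half_width)


def wald_half_width(p, n, z=Z_95):
    """Half-width of the simple Wald interval, used here only for planning."""
    return z * math.sqrt(p * (1 - p) / n)


if __name__ == "__main__":
    # Planning assumption from the text: prevalence 31 %, 80 children.
    print(round(wald_half_width(0.31, 80), 3))
    # Counts reported for the flush test: 30 optimally damped of 66.
    low, high = adjusted_wald_ci(30, 66)
    print(round(100 * low), round(100 * high))
```

With a prevalence of 31 % and 80 children the half-width is close to 10 %, in line with the stated target, and under this assumed form of the interval the 30/66 count gives bounds that round to 34 % and 57 %.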
Description of the cohort
The inclusion/exclusion patient flow is shown in Fig. 1. The 147 patients having the AL tested had mean age of 44.7 (SD 56) months, median 13.5 (IQR 4–78) months and mean weight of 16.8 (SD 18.3) kg, median 8.6 (IQR 4.8–23.4) kg, and 86 patients (59 %) were male. Diagnostic categories included postoperative cardiovascular surgery (n = 76 (52 %)); non-operative cardiovascular (n = 10 (7 %)); postoperative general surgery (n = 16 (11 %)); and medical (n = 45 (30 %)). The AL were peripheral in 114 patients (78 %: radial in 87, ulnar in 12, brachial in 8, and dorsalis pedis in 7 patients) or femoral in 33 patients (22 %). The peripheral AL were size 24 G in 12 (13 %), 22 G in 66 (73 %), and 20 G in 12 (13 %) patients. Interventions included inotropes in use in 43 (29 %) (inotrope score 6.6 (SD 3.6), range 2–18.5), vasodilator in use in 22 (15 %), and ventilation in use in 107 patients (73 %) (invasive ventilation 78, non-invasive ventilation 25, and high-flow nasal cannula in 4 patients). There were few patients with potential confounders to NIBP: obesity (n = 13 (9 %)), severe arm edema (n = 7 (5 %)), chronic hypertension (n = 6 (4 %)), and obstructive airway disease (n = 5 (3 %)).
For the primary outcome, AL was optimally damped in 30/66 flush tests (46 %; 95 % CI 34, 57 %), over damped in 25 (38 %; 95 % CI 27, 50 %), and under damped in 11 (17 %; 95 % CI 9, 28 %); thus, the prevalence of non-optimally damped AL was 36 (55 %; 95 % CI 43, 66 %). The proportions in each damping category were virtually identical in d1–3 AL and other day categories of AL (optimally damped in 45 % vs 46 %, over damped in 38 % vs 38 %, and under damped in 17 % vs 17 %). For the AL that were tested for a second time on the same day, the proportions were also similar on the second flush test (n = 29), being optimally damped in 13 (45 %), over damped in 13 (45 %), and under damped in 3 patients (10 %); however, the same damping result was obtained in only 18/28 patients (64 %). When there was ‘ringing’ of the AL (n = 41; 62 %) the natural frequency and amplitude ratio were calculated and were 22 (SD 5; median 25 (IQR 17–25)) and 0.5 (SD 0.2; median 0.5 (IQR 0.37–0.62)), respectively. The stopcock test was done in 146 patients (99 %), and demonstrated an optimally damped AL in 5 (3 %), over damped AL in 128 (88 %; always because of absent ‘ringing’), and under damped AL in 12 patients (8 %). When both were done, the stopcock test had the same result as the gold standard flush test in 24/64 (38 %). For the AL that were tested for a second time on the same day, the proportions were also similar on the second stopcock test (n = 61), being optimally damped in 2 (3 %), over damped in 56 (92 %; all with no ‘ringing’), and under damped in 3 patients (5 %), and with the same result as the second flush test in 13/29 (45 %).
Difference in NIBP–IABP
The ΔSBP, ΔDBP, and ΔMAP for each category of the AL damping on the flush test were on average small, but the SDs and IQRs were wide (Table 1, Fig. 2), and the limits of agreement on Bland-Altman plots were also wide (Fig. 3). Because the stopcock test was so often over damped due to absent ‘ringing’, and was usually different from the flush test, we did not consider determining the difference between NIBP and IABP by stopcock damping category; rather we report the results for all 147 AL tested. Again, although the mean differences were small, the SDs and IQRs were wide (Table 1, Fig. 4), and the limits of agreement on Bland-Altman plots were also wide (Additional file 4: Figure S1). The second set of AL tests on d1–3 had similar results (Table 1). Although the correlations between the same day paired (n = 62) ΔSBP (r = 0.55, p < 0.001), ΔDBP (r = 0.32, p = 0.01), and ΔMAP (r = 0.31, p = 0.013) were statistically significant, the r values were low, and the differences between the paired ΔSBP (0.4 (SD 12.5); median −0.5 (IQR −8.0, 8.3)), ΔDBP (−2.3 (SD 10.1); median −2.5 (IQR −8.0, 3.0)), and ΔMAP (−1.2 (SD 9.1); median −0.5 (IQR −7.3, 5.0)) were on average small, but with wide SD and IQR (Additional file 4: Figure S2).
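To show why a small mean difference can coexist with wide limits of agreement, the conventional Bland-Altman calculation (bias ± 1.96 SD of the paired differences) [5] can be sketched as below. The function names are hypothetical, the differences follow the NIBP minus IABP convention used here, and the limits plotted in the figures may have been derived differently, so the numbers from this sketch are illustrative only.

```python
import statistics


def bland_altman(nibp, iabp):
    """Return (bias, lower limit, upper limit) for paired NIBP and IABP readings."""
    if len(nibp) != len(iabp) or len(nibp) < 2:
        raise ValueError("need two equally long series with at least two pairs")
    differences = [n - i for n, i in zip(nibp, iabp)]  # NIBP minus IABP
    bias = statistics.mean(differences)
    sd = statistics.stdev(differences)  # sample standard deviation
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


def limits_from_summary(mean_difference, sd_difference):
    """Approximate limits of agreement from a reported mean and SD of differences."""
    half_range = 1.96 * sd_difference
    return mean_difference - half_range, mean_difference + half_range


if __name__ == "__main__":
    # Reported summary for ΔSBP in optimally damped AL: mean 0.8, SD 12.2.
    lower, upper = limits_from_summary(0.8, 12.2)
    print(round(lower, 1), round(upper, 1))
```

Applied to the reported ΔSBP summary for optimally damped AL (0.8 (SD 12.2)), this gives approximate limits of about −23 to 25, which illustrates how a near-zero bias can still leave individual NIBP readings far from the IABP.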
Predictors of damping category
On multiple logistic regression models, there were no independent predictors of AL damping category on the flush test (n = 66). When the inotrope score was used instead of inotrope in use, there were still no independent predictors. Age was collinear with weight (r = 0.91); however, age was not a predictor on univariate regression.
Predictors of NIBP–IABP
Independent predictors of ΔSBP, ΔDBP, and ΔMAP are shown in Table 2, both for AL with the flush test (with damping category entered as a variable), and for all AL (damping category not entered as a variable). Having a vasodilator in use resulted in the NIBP overestimating DBP and MAP. The NIBP underestimated DBP and MAP in patients with an optimally damped AL.
The results for the first test on the AL in the d1–3 category were analyzed separately. For the 42 flush tests, the AL was optimally damped in 19 (45 %), over damped in 16 (38 %), and under damped in 7 patients (17 %). The NIBP–IABP differences according to damping category, and for all AL, are given in Table 1. There were no independent predictors of AL damping category on multiple logistic regression. Predictors of NIBP–IABP difference for flush-tested AL, and for all AL, are given in Table 2. The results of these analyses were similar to analysis including all day categories of AL.
This is the first study we are aware of that has determined the accuracy of AL in PICU, and the first to compare NIBP to IABP in critically ill children, in whom the accuracy of the AL is known. There are several important findings from this study. First, the AL were accurate (i.e., optimally damped) in 30/66 (46 %; 95 % CI 34, 57 %) of AL that were flush-tested. In fact, the damping category of the AL could change (in 36 % of patients) even within hours on the same day. Second, the stopcock test is not a useful method to determine AL damping, as most tests (88 %) do not cause ‘ringing’, and the test is often different (in 62 % of cases) from the gold-standard flush test. Third, although the mean difference between NIBP and IABP is usually small, and there is significant correlation between the two measurements, there is wide variability in the difference as evidenced by wide SD, IQR, and limits of agreement. Fourth, we identified no predictors of an optimally damped AL, suggesting that the AL mechanical setup is more important than patient-relevant variables. Finally, there were some predictors of a larger difference in NIBP–IABP; in particular, a vasodilator in use (where the NIBP overestimates DBP and MAP), and an optimally damped AL (where the NIBP underestimates the DBP and MAP). In addition, when inotropes are in use the NIBP may underestimate the SBP.
There are several implications of these findings for practice in the PICU. First, monitoring of IABP is often done with non-optimally damped AL, and how to address this problem should be a priority. We did not find any patient-relevant variables that predict this, and did not examine potential mechanical causes of this problem in this study (e.g., air bubbles, clots, excessive tubing, etc.). Nevertheless, this study brings attention to the problem, and suggests further study is needed to improve this situation. Second, when testing IABP accuracy, a flush test is required, as the stopcock test is not useful. Methods of flush testing in infants are currently being tested in our PICU. Third, there is clinically relevant variability in blood pressure measured by IABP and NIBP, even using optimally damped AL, and this applies to SBP, DBP, and MAP. Thus, in a patient suspected of having abnormal blood pressure, or on vasoactive infusions, the NIBP does not appear accurate enough to guide diagnosis and management.
To our knowledge, there has been little study of the accuracy of AL blood pressure monitoring in children. A recent study in adults undergoing major vascular and cardiac surgery found that 30.7 % of AL were under damped, resulting in clinically significant overestimation of SBP and MAP compared to NIBP. In the optimally damped AL, the differences in BP were on average small, but with wide ranges and wide limits of agreement. In the only pediatric study comparing NIBP and IABP (n = 40 children) it was found that NIBP “may seriously underestimate the severity of hypertension and hypotension in PICU patients potentially leading to undertreatment”. However, in that study the damping category of AL was not determined, the clinical characteristics of the children were not described, and the variance of measurements was not reported. Several studies of adults in intensive care have identified wide variability in NIBP–IABP [3, 4, 8–10], although one study (in which flush tests were not performed) suggested that NIBP is accurate enough to detect MAP <65 mmHg. Several studies in newborn babies (none reporting use of the flush test) also suggest significant variability between NIBP and IABP measurements predominantly using an umbilical AL [11–13]. One study in children using manual sphygmomanometry found the difference between this and optimally damped IABP was −1 (SD 12) for SBP and 7 (SD 12) for DBP; the patients’ clinical conditions were not reported. The results of our study are broadly similar to these previous studies.
There are limitations to this study. This was a single-center study, and we do not know if the results can be generalized to other PICUs. Although we included 100 patients and 147 AL tests, the numbers of patients having the flush test (n = 66 and 29, respectively) and thus, with proven optimally damped AL (n = 30 and 13, respectively) are fairly small. We did not examine the mechanical setup of the AL or NIBP. The multiple statistical testing may have identified spurious predictors of differences in NIBP–IABP. Despite these limitations, the findings from this study are similar to those from a recent adult study, and from the only other pediatric study, suggesting generalizability of the findings. Although the numbers are modest, this is the only study reporting the accuracy of AL in PICU, and the largest study examining differences in NIBP–IABP in critically ill children. Although the examination of the mechanical setup was not included in the study protocol, the instructions did specify that prior to the AL tests the bedside nurse determined the AL was “zeroed; levelled; free of bubbles” and was “working for blood draws”, and the “NIBP cuff is optimal: bladder 40 % of arm circumference”. The study limitations do not change the main findings, that is, that AL in PICU are often not optimally damped, and that there are clinically relevant differences between NIBP and IABP even when using optimally damped AL. Further study should confirm these findings in other PICUs.
In critically ill children, AL are often not optimally damped and thus, often give inaccurate measurements of IABP. The stopcock test is not useful to determine the damping condition of the AL, and a flush test is necessary. The NIBP may not be accurate enough to guide management compared to an optimally damped AL reading of IABP, particularly when the patient is on a vasodilator or inotrope infusion. These findings should be confirmed in a different PICU.
Optimally damped arterial lines occur in about half of critically ill children, and this is not predictable by demographic or clinical variables
There is much variability in ∆BP between NIBP and the gold standard IABP, and this varies even in the same patient on the same day, and is not easily predictable
In critically ill children, NIBP may not be accurate enough to guide management, and more attention is needed to ensuring the arterial line is optimally damped
Gardner RM. Direct blood pressure measurement-dynamic response requirements. Anesthesiology. 1981;54:227–36.
Schroeder B, Barbeito A, Bar-Yosef S, Mark JB. Chapter 45: Cardiovascular monitoring. In: Miller RD, editor. Miller’s Anesthesia. 8th ed. USA: Saunders; 2014. p. 1345–95.
Romagnoli S, Ricci Z, Quattrone D, Tofani L, Tujjar O, Villa G, et al. Accuracy of invasive arterial pressure monitoring in cardiovascular patients: an observational study. Critical Care. 2014;18:644.
Ribezzo S, Spina E, Bartolomeo SD, Sanson G. Noninvasive techniques for blood pressure measurement are not a reliable alternative to direct measurement: a randomized crossover trial in ICU. Sci World J. 2014;2014:Article ID 353628.
Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986;327(8476):307–10.
Holt TR, Withington DE, Mitchell E. Which pressure to believe? A comparison of direct arterial with indirect blood pressure measurement techniques in the pediatric intensive care unit. Pediatr Crit Care Med. 2011;12:e391–4.
Romagnoli S, Ricci Z, De Gaudio AR. Invasive arterial pressure: test it before believing in it. Pediatr Crit Care Med. 2012;13(2):248.
Manios E, Vemmos K, Tsivgoulis G, Barlas G, Koroboki E, Spengos K, et al. Comparison of noninvasive oscillometric and intra-arterial blood pressure measurements in hyperacute stroke. Blood Press Monit. 2007;12(3):149–56.
Rutten AJ, Ilsley AH, Skowronski GA, Runciman WB. A comparative study of the measurement of mean arterial blood pressure using automatic oscillometers, arterial cannulation and auscultation. Anaesth Intensive Care. 1986;14(1):58–65.
Lakhal K, Ehrmann S, Runge I, Legras A, Dequin P-F, Mercier E, et al. Tracking hypotension and dynamic changes in arterial blood pressure with brachial cuff measurements. Anesth Analg. 2009;109:494–501.
Kimble KJ, Darnall Jr RA, Yelderman M, Ariagno RL, Ream AK. An automated oscillometric technique for estimating mean arterial pressure in critically ill newborns. Anesthesiology. 1981;54(5):423–5.
Chia F, Ang AT, Wong T-W, Tan KW, Fung K, Lee J, et al. Reliability of the Dinamap non-invasive monitor in the measurement of blood pressure of ill Asian newborns. Clin Pediatrics. 1990;29(5):262–7.
Lang SM, Giuliano Jr JS, Carroll CL, Rosenkrantz TS, Eisenfeld L. Neonatal/infant validation study of the CAS model 740 noninvasive blood pressure monitor with the Orion/MaxIQ NIBP module. Blood Press Monit. 2014;19:180–2.
Clark JA, Lieh-Lai MW, Sarnaik A, Mattoo TK. Discrepancies between direct and indirect blood pressure measurements using various recommendations for arm cuff selection. Pediatrics. 2002;110(5):920–3.
RJ was funded by the Women and Children’s Health Research Institute (WCHRI) Summer Studentship Grant. The sponsor had no role in any of the design and conduct of the study; collection, management, analysis, and interpretation of the data; and preparation, review, or approval of the manuscript.
RJ contributed to conception and design, acquisition of data, and analysis and interpretation of data, drafted the article, and gave final approval of the version to be published. JD, GG, and ARJ contributed to conception and design, interpretation of data, revising the article critically for important intellectual content, and gave final approval of the version to be published. JP contributed to acquisition of data, interpretation of data, revising the article critically for important intellectual content, and gave final approval of the version to be published. ARJ had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis. RJ and ARJ conducted and are responsible for the data analysis. All authors read and approved the final manuscript.
The authors declare that they have no competing interests.
Ethics approval and consent to participate
This study was approved by the University of Alberta Health Research Ethics Board (Pro00056693), and the parents/guardians of all included patients gave signed informed consent to participate.
The case report form for the study. (PDF 100 kb)
The study manual for the study. This file provides the definitions used, and the details of methods for determining the damping condition of the arterial lines. (PDF 406 kb)
Demonstration of setting up an arterial line (Part 1), zeroing and performing the flush test on an arterial line (Part 2), and calculations for determining the damping condition of the arterial line (Part 3). (WMV 29043 kb)
Supplemental figures. Figure S1. Bland Altman plots for non-invasive blood pressure compared to the invasive arterial blood pressure for all 147 arterial lines. a Systolic blood pressure difference; b diastolic blood pressure difference; c mean blood pressure difference. Figure S2. Difference between blood pressure measurements on the same day. a Systolic blood pressure difference; b diastolic blood pressure difference; c mean blood pressure difference. (PDF 133 kb)
Joffe, R., Duff, J., Garcia Guerra, G. et al. The accuracy of blood pressure measured by arterial line and non-invasive cuff in critically ill children. Crit Care 20, 177 (2016) doi:10.1186/s13054-016-1354-x
- Arterial line
- Blood pressure
- Intensive care units