Major intraoperative hemorrhage is a foundational aspect of perioperative practice encountered daily by anesthesiologists and surgeons. Prevention and treatment of hemorrhage requires accurate and reliable measurement of intraoperative blood loss. Robust research aimed at improving patient outcomes also requires accurate methods to measure blood loss, so that surgical and anesthetic techniques can be effectively compared and volume-based physiologic consequences of blood loss can be studied. Blood loss is universally quantified within the intraoperative setting, and the estimated blood loss (EBL) value is a standard part of the intraoperative medical record. Nevertheless, methods of measuring blood loss are often complex, impractical in clinical practice, and may differ across institutions and disciplines.1

Clinical guidelines currently highlight the importance of monitoring EBL while acknowledging deficiencies in measurement techniques. In clinical practice, subjective visual quantification is most commonly utilized.1 The American Society of Anesthesiologists Task Force on Perioperative Blood Management Practice Guidelines recommends periodic visual assessments of the surgical field, suction canisters, surgical sponges, and surgical drains to assist with intraoperative blood loss monitoring. Nevertheless, the guidelines also note that the available literature is insufficient to assess the impact of EBL measurement on patient outcomes.2 Estimated blood loss is also an important component of widely used prognostic models, such as the surgical Apgar score for postoperative mortality and morbidity,3 the pancreatic fistula risk score for pancreatic fistula after a Whipple procedure,4 or the P-POSSUM score for mortality after a general surgery procedure.5 Longer-term outcomes are also predicted by EBL, including survival after colon cancer surgery6 and survival and recurrence rates after resection for hepatocellular carcinoma.7

Given the strong prognostic influence of EBL on postoperative outcomes, despite the heterogeneity and possible inaccuracy in its measurement, standard and robust methods for estimation are required to guide clinical care and research. In addition to visual estimation,8,9 other methods currently employed worldwide include formula-based estimation, incorporating changes in hemoglobin or hematocrit,10,11 and gravimetric-based estimation incorporating changes in dry and wet weight of surgical sponges,12,13 among others. Nevertheless, methods for comparison have not previously been systematically reviewed or adequately compared in terms of agreement and reliability. A nuanced understanding of the quantitative limitations of blood estimation techniques would allow a more representative interpretation of EBL across studies utilizing different techniques. Therefore, we conducted a systematic review and meta-analysis with the primary objective of comparing techniques of intraoperative quantitative blood loss estimation in adult patients undergoing major non-cardiac surgery. In particular, we aimed to highlight the comparison between visual estimation and lab value-based formula estimation, focusing on differences in generated values and intrinsic variation.

Methods

This systematic review was conducted according to the Preferred Reporting Items for Systematic Reviews and Meta Analyses (PRISMA) guidelines.14 The review protocol was registered in the PROSPERO register of systematic reviews (CRD42015029439).

Search strategy

We conducted a search of the Ovid Medline, Embase, and the Cochrane Library databases from inception to 22 March 2020. A search strategy was developed under the guidance of a health research librarian from the Ottawa Hospital Library services. The search was conducted using the terms “blood loss”, “blood volume”, “bleeding” and “hemorrhage” with database-specific search strategies (eAppendix 1, as Electronic Supplementary Material [ESM]). Additional articles were identified from a hand-search of included references. We reviewed the clinicaltrials.gov database to assess for any in-progress or unpublished studies.

Study selection

We included all English-language full-text articles describing retrospective and prospective observational studies and randomized-controlled trials. We included studies meeting the following criteria: 1) enrolled adult patients (≥ 16 yr), 2) conducted in the operative setting for major non-cardiac procedures, 3) generated EBL volumes using two or more different techniques, and 4) evaluated either visual estimation or lab value-based formula estimation techniques in at least one of the comparison arms (Table 1). We excluded case reports, case series, animal studies, and pediatric studies. We additionally hand-searched the reference lists of all included studies to identify relevant grey literature.

Table 1 Descriptions of blood loss estimation techniques

Eligible surgical procedures included open, laparoscopic, or robotic procedures within neurosurgery, otolaryngology, thoracic surgery, general surgery, urology, gynecology, and orthopedic surgery. Studies involving obstetrical, endoscopic, oral, or cardiac surgeries were excluded because of negligible blood loss or intrinsic differences in underlying patient physiology (including blood volume and induced coagulopathy). Eligible techniques of blood loss estimation were limited those that generated EBL volumes—those evaluating measures of association alone, such as correlation coefficients or linear regression coefficients, were excluded. Any disagreements were resolved by consensus or by the senior author (G.M.).

We screened studies using the Covidence online screening platform (Melbourne, Australia). We imported titles into Covidence directly from the search databases and removed duplicates. Two reviewers (A.T. and J.H.) independently completed title and abstract screening followed by full-text screening. Disagreements at each stage were resolved by discussion and reaching of consensus.

Data extraction

Two investigators (A.T., J.H.) independently extracted data into a pre-defined data collection form. Publication characteristics (study title, year, author, country of origin), population characteristics (sample size, surgical discipline), and method characteristics (type of blood estimation method) were included. The primary outcome included the mean EBL of each method, as well as its coefficient of variation (CV). The primary outcome included the mean EBL of each method, as well as its measure of dispersion about the mean and CV. In addition, the mean difference in EBL between techniques was retrieved, together with measures of dispersion (standard deviation [SD], standard error, or confidence interval) or additional data to calculate the measure of dispersion. The SD of the mean difference was back-calculated from limits of agreement, correlation coefficients, and P values for paired t tests.15,16 Where P values were less than 0.001, a conservative estimate of the SD was generated by assuming the P value to be 0.00099. All P values were assumed to be two-tailed unless otherwise reported by the authors. In one study reporting a P value less than 0.001 from a Wilcoxon test,17 a conservative estimate of the SD was generated by assuming that the P value from a comparable paired t-test would likely yield a lower P value than a Wilcoxon test. As such, a P value of 0.0005 was assigned to this study, as this was the lowest P value that would round up to 0.001. Where studies did not provide calculated summary estimates of EBL or provided insufficient data to calculate these, individual data points were extracted where feasible from correlation plots comparing blood estimation methods. Estimated blood loss volumes were extracted as or converted to millilitres for each technique.

Quality assessment

Methodological quality was assessed using the Methodological Index for Non-Randomized Studies (MINORS).18 As per the MINORS guidelines, studies were assessed for clarity of objective, eligibility criteria, data collection, study endpoints, follow-up, control group, and statistical analyses. Studies were assigned a score out of 24 possible points. All studies that met eligibility criteria were included in this review regardless of their risk of bias.

Evidence synthesis

For each study, we extracted or calculated the mean, SD, and CV for each blood estimation technique. The CV is the ratio of SD/mean and provides a measure of relative variability (or dispersion) for a given blood loss estimation technique. Interpreting the SD as a measure of variation is done within the context of the mean. Nevertheless, the CV is unitless and allows for direct comparison across multiple samples with different means.19 We additionally calculated a ratio of means between the two comparison arms of each study to facilitate relative comparisons across varying effect sizes.20 Similarly, mean differences and SD were obtained for each pair of blood estimation techniques. Mean differences, SD, and 95% confidence intervals (CI) were pooled using the methods described by Williamson et al. for the meta-analysis of method comparison studies.16 As suggested by the authors, a random-effects model with an inverse variance method was used for meta-analysis, under the assumption of significant between-study heterogeneity. We summarized agreement between groups of pairwise comparisons as visual estimation vs formula estimation, visual estimation vs other, and formula estimation vs other. For each of these comparisons, we described tendencies for higher or lower EBL values, consistency of findings, pooled mean differences, SD, and limits of agreement. Funnel plots were generated to examine the possibility of publication bias. All analyses were carried out using Comprehensive Meta-Analysis version 2.2 (Englewood, NJ, USA), the R statistical package, version 4.0.2 (R Core Team [2020], www.r-project.org), and Microsoft Excel 2010 (Redmond, WA, USA).

Results

Search results

We identified 2,002 studies. Following removal of duplicates, we screened 1,324 studies, of which 74 underwent full-text review. The PRISMA flow diagram is presented in eAppendix 2.14 We included 26 studies involving 3,297 patients.

Study characteristics

Summarized study characteristics are presented in Table 2. Of the studies, nine were from North America,8,11,17,21,22,23,24,25,26 11 were from Europe,10,12,13,27,28,29,30,31,32,33,34 and four were from Asia.9,35,36,37 Nine studies were published before 2000,11,21,25,27,29,30,32,35,38 two studies were published between 2000 and 2009,8,33 and 14 studies were published in 2010 or later.10,12,13,17,22,23,24,26,28,34,36,37,39 The most commonly studied operative procedures were orthopedic surgery (nine studies, 1,596 patients),13,23,26,28,30,33,36,37,39 general surgery (five studies, 687 patients),17,21,22,27,38 urology (four studies, 507 patients),8,11,12,31 and mixed (three studies, 336 patients).25,34,35 Individual study characteristics are presented in eAppendix 3.

Table 2 Characteristics of included studies (n = 26)

Quality assessment

Risk of bias assessments were completed using the MINORS tool and are presented in eAppendix 4. Strengths of the studies represented in this review included the use of appropriate, unbiased endpoints with excellent control group standards given that blood loss evaluation allows for comparison within a single patient. Weaknesses included lack of reporting on missingness of data and sample size calculation, and variable reporting of eligibility criteria.

Results of synthesis

Summarized findings for comparisons between visual estimation vs formula estimation, visual estimation vs other, and formula vs other are presented in Table 3. Summarized findings for intrinsic variation for each blood estimation technique are presented in Table 4.

Table 3 Agreement between blood estimation techniques
Table 4 Intrinsic variation among blood estimation techniques

Visual estimation vs formula estimation

We identified 12 studies8,9,10,11,17,23,26,31,32,33,36,37 involving 2,225 patients that compared visual and formula-based estimation techniques. Estimated blood loss values derived from visual techniques were lower than those derived from formula-based techniques, a finding observed in 12 of 13 (92%) studies. Six studies (n = 974) provided sufficient data for meta-analysis8,10,11,20,23,30 (Fig. 1A). The pooled mean blood loss difference was −462 mL (95% CI, −1,090 to 166; I2 = 99%). Statistical heterogeneity was primarily driven by two papers,11,23 as their removal yielded a mean difference of −303 mL (95% CI, −409 to −197; I2 = 17%). A funnel plot is presented in eAppendix 5. Egger’s test for publication bias was not significant (P = 0.480).

Fig. 1
figure 1

Meta-analysis comparing blood loss evaluation methods

Visual estimation versus other

We identified seven studies22,24,25,28,34,35,39 involving 1,322 patients that compared visual and other estimation techniques. Estimated blood loss values derived from visual techniques were lower than those derived from other techniques. Five studies (n = 327) provided sufficient data for meta-analysis22,24,25,28,35 (Fig. 1B). The pooled mean blood loss difference was −40.7 mL (95% CI, −136 to 55.0; I2 = 97%). Two of the pooled studies compared visual estimation with gravimetric estimation,28,35 two with colorimetric estimation,22,25 and one with pictographic estimation.24 Given the small number of pooled studies and these clear methodological differences, sensitivity analyses for statistical heterogeneity were not conducted. A funnel plot is presented in eAppendix 5. Egger’s test for publication bias was not significant (P = 0.746).

Formula estimation versus other

We identified six studies12,13,27,29,30,38 involving 329 patients that compared formula and other estimation techniques. In all six papers, formula estimation was compared with gravimetric estimation methods. Estimated blood loss values derived from formula-based techniques were higher than those derived from gravimetric techniques, a finding observed in all six studies. All six studies provided sufficient data for meta-analysis (Fig. 1C). The pooled mean blood loss difference was 386 mL (95% CI, 235 to 537; I2 = 86%). Statistical heterogeneity was primarily driven by two papers,13,29 as their removal yielded a mean difference of 262 mL (95% CI, 203 to 322; I2 = 16%). A funnel plot is presented in eAppendix 5. Egger’s test for publication bias was not significant (P = 0.0810).

Intrinsic variation

We used the CV to describe the dispersion of generated blood estimation values relative to the mean for each technique. The case-to-case variation was similar across all techniques with a median [interquartile range (IQR)] CV of 0.51 [0.43–0.67] for visual estimation, 0.57 [0.45–0.60] for formula-based estimation, and 0.51 [0.36–0.65] for other methods. In other words, when accounting for the mean blood loss value of a particular set of surgeries, similar variation in case-by-case EBL volumes can be expected for each technique.

Discussion

We performed a systematic review and meta-analysis to compare techniques for intraoperative quantitative blood loss estimation in adult patients undergoing major non-cardiac surgery. We found that visual estimation is the most popular technique studied. In addition, visual techniques tended to provide lower EBL values than formula-based estimation or other techniques, though this effect was not statistically significant in pooled analyses, likely because of sample size limitations. When accounting for the mean blood loss value, similar case-to-case variation was noted for all estimation techniques. Taken together, our study provides important evidence for the tendencies for higher or lower EBL values as well as intrinsic variation of existing techniques for quantitative blood loss estimation during surgery. Such findings can be used to contextualize relative EBL values across studies that utilize different techniques for blood loss assessment.

Visual blood loss estimation (i.e., visual inspection of the surgical field, canisters, and sponges) was the most popular method identified in this review and typically involves a coarse interpretation of the appropriateness or excessiveness of the blood loss. Advocates of this often-used approach note its intuitive ease and that, in the context of appropriate supporting laboratory and vital sign metrics, further granularity rarely provides additional short-term clinical insight.

The specific categorization threshold for what is deemed a meaningful level of blood loss depends upon its specific context and application. The threshold for blood loss necessitating immediate clinical action (i.e., a blood transfusion) for a single patient naturally should be different than the threshold utilized to evaluate short-term perioperative events5 or long-term cancer recurrences at a population outcomes level.6

This is why we advocate strongly for a reliable and reproducible blood loss estimation as a continuous variable to preserve its integrity as critical primary data. It is well proven that categorization results in meaningful data waste, reduced statistical power, underestimation of between-group variance, and loss of ability to evaluate non-linear relationships.40 Specifically, in prediction modelling, coarse categorization of predictors is strongly discouraged in the model development phase as valuable information is lost and the risk of overfitting is introduced.41 Preservation of this data integrity allows each stakeholder (whether that be the treating clinician, the quality improvement lead, or the large database methodologist) to utilize the data in the manner that is most appropriate for their needs.

In this study, we note that visual estimation tended to provide lower blood loss volumes than formula-based or other techniques, a finding that was consistently found across most included studies. Nevertheless, given the sample sizes in most studies and lack of a priori power calculations in all included studies, any conclusions drawn would be at risk of type II error.42 Our findings are consistent with the existing literature describing visual estimation in the simulation setting, which allow for evaluation against a known blood quantity. Previous evaluations of visual techniques in a simulated operative environment show low precision or accuracy.1,43 Healthcare practitioners frequently underestimate actual blood losses, a tendency that appears to be consistent across practitioner age, sex, and professional experience.1,43,44 While there is some evidence of potential improvement in blood estimation accuracy following didactic teaching in a simulated environment,1,43 these findings have never been reproduced in a clinical setting.

Strong calls exist to improve the consistency and quality of surgical blood loss estimation, with some authors suggesting that blood loss is a neglected quality of care indicator that requires advancement of techniques for minimization in surgery.45 Nevertheless, as Stahl et al. note, the evaluation of any new blood loss estimation method in the operative setting is limited by the lack of a gold standard, and we must therefore make comparisons between potentially imperfect techniques.46 This is reflected in the common theme that we documented across estimation techniques and studies, where large SDs and CVs for EBL values were observed. This is partly influenced by significant differences between patients even within a similar cohort undergoing a similar surgery, a reality well known to practitioners. Nevertheless, the significant variability is likely also influenced by poor intrinsic reliability of the blood loss estimation techniques themselves. Interestingly, we found no consistent trend when comparing the CV across different techniques.

Despite the widespread use of EBL in clinical practice to guide perioperative blood management2 and as a quality of care or prognostic variable in the surgical literature, we have shown in this review that little standardization exists for its methodology.45 In addition, there is significant disagreement between techniques and these techniques are limited by significant case-to-case variation. While many perioperative risk models incorporate blood loss as an important predictor variable, their categorization as quantity-based cutoffs3,4,5 places a great deal of importance on the accuracy and reliability of blood loss measurement. In the era of evidence-based medicine, we cannot continue to ignore blood loss as a metric and must acknowledge the need for a fundamental change in the manner in which it is valued and assessed.

Based on our appraisal of the literature, we suggest formula-based techniques as the preferred standard methodology for estimating blood loss. The ideal blood loss formula should be simple to conduct, interpret, and reproduce. Robust formulae should incorporate patients’ body habitus and fractional changes in hemoglobin or hematocrit, which account for the influence of hemodilution. A recent randomized-controlled trial by Ross et al.,47 identifying expected changes in hemoglobin based on simultaneous controlled blood loss and crystalloid replacement in healthy blood donors offers considerable promise in this regard. The authors provide proof of hemodilution following blood loss and resuscitation, deriving a mathematical model based on Nadler’s original work on blood volume estimates48 and using changes in hemoglobin to calculate blood loss. We provide a sample calculation in Fig. 2 for a 70 kg male patient with a 1,000-mL crystalloid resuscitation and a hemoglobin drop from 15 g·dL−1 to 12.5 g·dL−1 for which an EBL of 833 mL is generated.

Fig. 2
figure 2

Sample blood loss calculation

While it would be virtually impossible to assess accuracy against a true gold standard, the minimization of dependence of inter-observer reliability confers a potential advantage for formula-based techniques. In addition, the practical, reproducible, and non-resource intensive nature of the methodology would allow the surgical research community to speak a common language and interpret the EBL variable in the same manner worldwide. Nevertheless, given the substantial variation that was identified in our review even with formula-based techniques, optimization of this technique is still required.

This review has a number of strengths. It was performed using a comprehensive search with clear eligibility criteria, examines multiple blood estimation techniques, and adheres to modern best practice guidelines for the conduct of systematic reviews.14 Given the paucity of literature, we employed a pragmatic approach to data collection and presentation to emphasize clinical relevance.

This review has the following limitations, which should be considered when appraising and contextualizing these findings. Because of the expected paucity of literature, we sought to maximize inclusiveness and potential sample size by including a variety of blood loss estimation techniques, patient populations, surgical techniques, and surgical disciplines. As such, our findings are notably limited by significant clinical heterogeneity. While our pooled analyses provided point estimates that suggest visual estimation techniques provide lower EBL values than other techniques, these findings were not statistically significant. Importantly, our findings are limited by the modest sample sizes and lack of power calculations in the included studies, thus introducing potential type II errors.42 Nevertheless, as previously suggested by the Bradford Hill criteria,49 existence of a meaningful association is elucidated not only by strength but also by consistency among other factors, which we show across most studies included in this review. Our findings are also reliant on observational data, although it should be noted that the unique nature of blood loss evaluation allows for comparison within a single patient, thus eliminating the selection bias of intervention vs control group differences. While the proposed standardization of blood loss estimation offers the potential to improve its utility at a population outcome level by reducing statistical noise, this may not necessarily translate to improved accuracy or increased prognostic significance. Specifically, the intended focus of this review is to summarize and advocate for improved methodology for blood loss estimation. Identifying meaningful clinical thresholds for subsequent action is not within the scope of this study. We acknowledge that a change in blood loss estimation methodology is unlikely to significantly alter short-term clinical decision-making for any one patient. Decisions to deviate from typical care pathways are often based on a more holistic evaluation of patient status, including hemodynamics and laboratory values because subtle differences in precision of blood loss estimation alone are less likely to be clinically relevant.

Conclusion

In summary, this systematic review and meta-analysis evaluated techniques for intraoperative blood loss estimation in major non-cardiac surgery. We found that visual techniques tended to provide lower EBL values than formula-based estimation or other techniques, though this effect was not statistically significant in pooled analyses, likely due to sample size limitations. We found significant case-by-case variation for all methods of blood loss evaluation and significant disagreement between techniques. Given the importance placed on EBL values, particularly for perioperative prognostication models, clinicians should consider the universal adoption of a practical and reproducible method for blood loss evaluation.