1 Introduction

Estimating intraoperative blood loss is a daily challenge of clinicians and until now a standardized method is not being used routinely. Despite the knowledge of the inaccuracy of visual estimation, intraoperative blood loss is still recorded visually [1, 2]. However, the quantified blood loss plays a key role in blood transfusion decisions, along with other information such as hemoglobin values and individual transfusion triggers. Inappropriate transfusion of blood products is associated with risks and influences patient´s outcome [3, 4]. Changes in vital and laboratory parameters recorded during routine monitoring only occur in the event of imminent or manifest hemodynamic instability and organ perfusion diminution.

Especially in obstetrics, estimating blood loss is challenging, as amniotic fluid erroneously causes the impression of high blood loss. In addition, the volume of external blood loss is also difficult to estimate [5, 6]. It is noteworthy that loss of lower blood volumes is estimated more correctly than loss of higher blood volumes. However, large blood loss is life-threatening and therefore more relevant in transfusion decisions [7,8,9]. Other methods for recording intraoperative blood loss such as pictograms or direct measurement, are rarely used [1]. As part of various programs, the weighing of contaminated material, the so-called gravimetric method, was introduced as a supplement [2, 10,11,12,13]. However, the direct measurement by using calibrated collection bags is not used in the operating theatre, whereas this method has long been practiced during childbirth [14].

Within the framework of Patient Blood Management (PBM) as a multidisciplinary and evidence-based treatment concept [4], the recording of blood loss is becoming increasingly important. PBM is based on three main pillars: (1) Anaemia management, (2) Minimizing blood loss and increased use of donor blood saving strategies, (3) Rational use of blood reserves [15]. Reducing blood loss and optimizing patient care starts with the measurement of intraoperative blood loss.

Recently developed methods, such as photometric analysis, are becoming more popular in clinical practice. Three systematic reviews [2, 14, 16] have been published on blood loss estimation in obstetrics. Since then, new approaches and methods have been developed, however, none of these reviews addressed the setting of the operation theatre. This review and meta-analysis are intended to show a range of measurement methods and to highlight the strengths and weaknesses of the methods used within surgery.

2 Methods

The protocol of this systematic review was registered in PROSPERO 2020 CRD42020166803 (https://www.crd.york.ac.uk/PROSPERO).

2.1 Data sources and search strategy

The research was carried out using PubMed and Google Scholar databases. Only studies published between 01.01.2000–11.11.2019 in English or German language were considered. This period has been chosen to include primarily current literature with more recent reference methods. The results were merged using the reference management software Citavi 6 (Version 6.3.0.0, Swiss Academic Software GmbH, Wädenswil, Schweiz). For the advanced search on PubMed, synonyms of the word blood loss (blood loss, haemoglobin loss, fluid management) were combined with synonyms of the word measurement (estimate*, measurement, evaluation, determinat*, quantification, quantify, assessment, monitoring). With the feature “Show similar articles” the search has been extended. For the Google Scholar Search, the above stated method refining the PubMed search was also used. Two independent reviewers (LG and FP) screened all abstracts for inclusion and exclusion criteria.

Studies were included investigating the accuracy of techniques for quantifying blood loss in vivo and in vitro. We excluded nonhuman trials and studies using only monitoring parameters to estimate blood loss. Finally, we excluded comments or letters to the editor. Full texts of all selected studies were read and analyzed.

2.2 Assessment of bias

Two reviewers (LG and FP) independently assessed the quality of the included primary studies using the Acrobat-NRSI tool published by the Cochrane Collaboration (https://www.cochrane.de/de/rob-manual). This quality assessment tool evaluates non-randomized studies for bias due to confounders (1), by selecting participants into study groups, (2), by recording the intervention of the performance (3), by deviations in the intervention phase (4), by missing data (5), by endpoint survey (6) and by selective reporting of endpoints (7). After answering the predefined questions for each of the domains, the reviewers have assessed the bias potential for each domain according to the following classification: low Risk of Bias (RoB) “++” (study is comparable to a randomized study); moderate RoB “+” (study correctly performed represents a non-randomized study); significant RoB “−” (study has some serious problems); critical RoB “−−” (study has too many serious problems); unclear RoB “?” (due to lack of information, no assessment is possible). The overall rating of a non-randomized study is based on the domain with the highest RoB.

2.3 Meta-analytical procedures

A meta-analysis was performed to evaluate systematic measurement errors of the different methods. All studies were screened using reference method “hemoglobin extraction assay”. Included in the meta-analysis were studies that used this valid method as reference. The hemoglobin extraction assay is a laborious and therefore an accurate method for quantifying blood volume. In this procedure, blood-soaked products are rinsed and examined to determine the hemoglobin concentration spectrophotometrically [17]. Studies with incomplete data concerning a Bland–Altman analysis were secondarily excluded.

2.4 Statistical analysis

Statistical analyses were carried out with the metafor Package: A Meta-Analysis Package for R (version 2.4-0, Free Software Foundation Inc., Boston, USA) [18]. For the individual studies, the systematic bias ± standard deviation (SD) was extracted from the given Bland Altman analyses. Confidence intervals of bias in the individual studies were calculated. The total bias estimated value with total confidence interval was calculated. The upper and lower tolerance limits were determined. I2 statistics were computed to quantify the heterogeneity. If significant heterogeneity of results was found (defined as I2 > 50%), we investigated the relationship between this result and the predefined variables outlined above using meta-regression analysis.

Given the heterogeneity of the studies examined in the review, the approach of modelling random effects was used to merge the result data.

3 Results

3.1 Study identification

The advanced search on PubMed produced a total of 7877 results. The function “Shows similar articles” identified another 158 potentially relevant studies. After reviewing the titles of 8035 studies, 7747 studies were primarily excluded. Of the 288 potentially relevant studies, all studies involving children or animals were excluded secondarily, as well as those studies that used only vital parameters as an indication of anaemia. Therefore, a total of 140 studies were left for full-text review. Finally, 50 further studies were excluded due to: no availability of the full text; letters or comments to the editor; young participants under 18 years of age; the measurement of intraoperative blood loss not primarily addressed and primarily investigation of measurement accuracy of an instrument compared to the laboratory. A total of 90 studies were included in the systematic review (Supplement 1) and six studies were included in the meta-analysis (Fig. 1). The main reason for exclusion was the lack of comparison with a validated reference method.

Fig. 1
figure 1

PRISMA flow diagram. From: Moher D, Liberati A, Tetzlaff J, Altman DG, The PRISMA Group (2009). Preferred Reporting Items for Systematic Reviews and Meta-Analyses: The PRISMA Statement. PLoS Med 6(7): e1000097. 10.1371/journal.pmed1000097. For more information, visit www.prisma-statement.org

3.2 Risk of bias assessment

The assessment regarding the risk of bias of the individual studies is presented in detail in Table 1. The risk of bias due to confounders was increased in the majority of the studies. The selection of participants in study groups showed an increased risk of bias in four studies [19,20,21,22] and an unclear risk of bias in two studies [23, 24]. The documentation and endpoint assessment were adequately described in most of the studies. A deviation in the intervention phase was not described in any of the studies. The handling of missing data was not adequately described in most studies, so that an unclear risk of bias was found. Double-blinding was found in only a few studies. The risk of selective reporting was low in all except four studies [25,26,27,28]. The overall methodological quality of the studies reviewed was rather low and the overall risk of bias across the studies was high.

Table 1 Risk of bias of the individual studies

3.3 Used methods within surgery or obstetrics

Based on the studies, we examined the methods according to various criteria. The advantages and disadvantages of the individual methods are summarized in Fig. 2.

Fig. 2
figure 2

Pros and cons of blood loss estimation methods. Pros and cons of blood loss estimation methods are presented. Red = extremely poor, yellow = moderate; green = excellent

3.3.1 Visual estimation

The visual estimation of blood loss by clinicians is not only one of the most widely used methods, but also the most examined one. This includes the estimation of blood volumes in sponges and suction containers but also the recording of external blood losses. Forty-eight studies dealt with the accuracy and improvement of visual assessment and 29 of these were performed in obstetrics. The study results are heterogeneous, so there are different results about the influence of different factors of professional experience, gender, age on the accuracy of the estimate. Even the use of pictograms and other measures does not lead to clear improvements (for a detailed description of the various studies, please refer to Supplement 2) [29,30,31,32,33,34,35,36,37,38,39].

3.3.2 Gravimetric

The gravimetric method is an indirect measurement of blood loss. Blood loss can be deduced by weighing the surgical material contaminated with blood and subtracting the dry weights. By summing up the measured weight of the blood and estimating the amount (ml) of mixed liquids (e.g. blood, rinse liquid) in the suction container, the blood loss can be calculated with a conversion of 1 g = 1 ml blood [38, 66]. The study results for gravimetric methods show a higher degree of correlation but are nevertheless heterogenic. Especially the factor of increased dilution by amniotic fluid or rinsing are relevant factors that lead to inaccuracy in the calculation [27, 67, 68](for a detailed description of the various studies, please refer to Supplement 3).

3.3.3 Direct measurement

The direct measurement of blood loss is a simple and long-established method that is mainly used in the field of obstetrics. Nine studies focused on calibrated collection bags specially designed for vaginal deliveries. The collector bag is placed under the woman's buttocks immediately after the birth of the child and collects all mixed liquids (e.g. blood or amniotic fluid). At the bottom of the plastic foil there is a calibrated collector bag with a scale on which the current blood loss can be read. This method is easy to use and, especially in resource-poor areas, in combination with a visual assessment can somewhat improve the quantification of the total blood volume, e.g. during a birth. However, study results still show significant deviations from real blood volume when used [71,72,73,74,75] (for a detailed description of the various studies, please refer to supplement 4).

3.3.4 Calculated blood loss

Currently, various mathematical approaches are used in clinical practice to evaluate blood loss. To calculate the most exact intraoperative blood loss, the formula has been modified over time. Thus, three different formulas are used in the literature to calculate the total blood volume of patients. The Moore formula [75, 76], which, like Nadler's formula [75,76,77,78,79,80,81] takes into account height, weight and sex for the calculation, and the ICSH formula [75, 76], which uses gender and body surface area to calculate the total blood volume. The formula according to Nadler was used most often in the literature. All blood loss estimation formulas showed a significant tendency to overestimate blood loss. (For a detailed description of the various studies, please refer to supplement 5).

3.3.5 Colorimetric blood loss estimation

A smartphone application (Triton™) developed by Gauss Surgical Inc. is able to calculate blood loss by taking photographs of used surgical gauze and canisters. The colorimetric technique analyses photographic and geometric information from relevant areas, with the aim of automatically filtering out the effects of non-blood components mixed in each sponge and canister and calculating the Hb mass present in the gauze or canister from the image. By entering the preoperative Hb-level, the blood loss can then be calculated. In the studies analyzed, high degrees of correlation with the reference blood volumes were found [22, 24, 26,27,28, 39, 67, 68, 70, 83] (For a detailed description of the various studies, please refer to supplement 6).

3.3.6 Miscellaneous methods

Other methods for the intraoperative recording of blood loss are rarely described and have not yet been sufficiently tested. Ultrasound of the inferior vena cava [20, 84, 85], contrast enhanced ultrasound (CEUS) [86], hemodynamic esophageal Doppler monitoring [19], near-infrared spectroscopy [87] or continuous non-invasive intraoperative Hb monitoring [23, 88, 89] for intraoperative detection of blood loss have been investigated. None of these methods represents a valid technique for blood loss detection. (For a detailed description of the various studies, please refer to supplement 7).

3.4 Meta-analysis

Six studies [26,27,28, 67, 68, 70] used Hb assay as a validated reference method and Bland Altman results were available for meta-analysis. The mixed effect meta-analysis showed the highest correlation to the reference for colorimetric methods (0.93 95% CI 0.91–0.96), followed by gravimetric (0.77 95% CI 0.61–0.93) and finally visual methods (0.61 95% CI 0.40–0.82) (Table 2). The bias for estimated blood loss (ml) was lowest for colorimetric methods (57.59 95% CI 23.88–91.3) compared to the reference, followed by gravimetric (326.36 95% CI 201.65–450.86) and visual methods (456.51 95% CI 395.19–517.83) (Table 2).

Table 2 Results of meta-analysis

Figure 3 shows the bias of the blood loss [ml] measurements with the colorimetric method compared to the reference. The overall bias estimate is significantly different from zero (p = 0.0008). There is no significant increasing trend toward a higher bias for higher mean or median blood loss used in the individual studies (p = 0.2317). The results of the meta-regression analysis show significant heterogeneity I2 = 87.9%, p < 0.0001 and R2 = 15.24% of the heterogeneity considered in the meta-regression model.

Fig. 3
figure 3

Bias of Hb [g] measurements with colorimetric method. Bias of Hb [g] measurements with colorimetric method compared to reference. Diamonds show the bias estimates from the single studies (size of diamonds is inversely related to standard error). Small vertical bars show confidence intervals of bias in the single studies. The darkest grey shaded area is the overall confidence interval for the estimated bias. The thick joined line shows the overall bias estimate. The medium shaded area reaches from the overall estimate of the lower tolerance limit to the overall estimate to the upper tolerance limit, both limits are also indicated by a dotted line. The lightest shaded area reaches from the lower confidence interval limit of the lower tolerance limit to the upper confidence interval limit of the upper tolerance limit

The overall bias of colorimetric estimated blood loss is significantly different from 0 (p = 0.026). The bias of the Hb mass [g] measurements with the colorimetric system compared to the reference is shown in Fig. 4. The correlation for colorimetric Hb measurement correlated strong (0.930 95% CI 0.96–0.94) with reference Hb. There is a significantly increasing trend toward higher distortion for higher mean or median blood volume used in the individual studies (p = 0.020). The results of the meta-regression analysis still show a significant heterogeneity I2 = 99.9%, p < 0.0001 and R2 = 53.26% of the heterogeneity considered in the meta-regression model.

Fig. 4
figure 4

Bias of blood volume [ml] estimates with colorimetric method. Bias of blood volume [ml] estimated by colorimetric method compared to reference. Diamonds show the bias estimates from the single studies (size of diamonds is inversely related to standard error). Small vertical bars show confidence intervals of bias in the single studies. The darkest grey shaded area is the overall confidence interval for the estimated bias. The thick joined line shows the overall bias estimate. The medium shaded area reaches from the overall estimate of the lower tolerance limit to the overall estimate to the upper tolerance limit, both limits are also indicated by a dotted line. The lightest shaded area reaches from the lower confidence interval limit of the lower tolerance limit to the upper confidence interval limit of the upper tolerance limit

4 Discussion

We were able to show in our meta-analysis that the colorimetric blood estimation provides a strong correlation to the reference blood volume. In contrast, gravimetric method or visual estimation shows only a medium correlation. It was shown that the bias for blood loss in the colorimetric system is significantly lower than in the other methods.

Visual estimation is the most widely used method for recording intraoperative blood loss and can be performed directly in the operating theatre without the need for additional equipment. However, regardless of specialty, professional experience or education level, visual estimation is not accurate. The estimation of lost blood volume depends on the physicians and is hard to reproduce. Especially in scenarios with larger bleeding the visual estimation often fails. A clear tendency to over- or underestimation cannot be detected, but either one can lead to wrong decisions in patient care.

The gravimetric method considers external blood loss. Frequently used conversion in literature “1 g equals 1 ml of blood” is only an approximation. Blood, depending on the current Hct, does not have the same density as water. Both lead to measurement inaccuracies. The calculation of the lost blood volume is a cost-effective way to quickly provide information about the suspected blood loss. Many of the formulas are based on Hb-value or Hct changes and require normovolaemia to calculate the patient's total blood volume. This assumption can lead to measurement inaccuracies. It is known, that the measurement of the Hb-level through blood gas analysis only shows indications of blood loss after adequate volume therapy and is therefore delayed. This can also lead to measurement inaccuracies. The result is independent of the user and easy to apply, especially if a user-friendly program is used for support.

The latest innovative method to detect blood loss is the colorimetric estimation technique. Our meta-analysis revealed a strong correlation to reference blood volume. The overall bias estimate shows that the systematic bias is within the 30 g Hb mass, which was defined as the clinically relevant limit. Thirty grams Hb-mass corresponds to approximately 1/10 of a whole blood unit. External blood loss cannot be taken into account by the system. If a cell saver or the heart–lung machine is involved, the system can only be used partially because the calibration of the canisters required for the colorimetric system is not sterile.

4.1 Limitations

We have identified several limitations for this review and meta-analysis. The selected reference methods differ widely and are usually not considered to be sufficiently valid themselves. There is no gold standard reference for recording intraoperative blood loss. The sample size of the individual studies was mostly small. Most of the studies were prospective observational studies without double-blinding and control group. The risk of bias and heterogeneity for the individual studies was high. Although many studies are available in this area, one weakness of most studies is that no validated reference was used for comparison. Many studies compare themselves with other equally inaccurate methods. This underlines the need for more high-quality large-scale studies in this area.

5 Conclusion

The recording of intraoperative blood loss plays a very central role in the daily routine of clinicians. Based on these estimations, patient’s treatment and transfusion decisions are made. Consequently, for the patient safety, we should aim fort the highest possible accuracy of measurement. Visually and gravimetric blood loss estimation measurements show a high degree of bias, so its usage cannot be recommended. Colorimetric technology offers real-time measurement and has a high degree of correlation.