1 Introduction

Assessment of the inferior vena cava (IVC) is a central part of many ultrasound-guided fluid management protocols. It has been mainly used for the assessment of fluid responsiveness (FR). However, after an initial enthusiasm due to relatively fast learning curve, it appeared that its predictive value in critically ill patients is suboptimal [1,2,3]. More recently, the study of the IVC has been included in algorithms attempting the quantification of venous congestion. The main goal of volume expansion with intravenous fluid in critically ill patients is to increase their cardiac output and ultimately optimize perfusion and the delivery of oxygen to the organs, without causing additional harm related to the fluid overload. Both Hypovolemia might be responsible for a reduced preload, stroke volume and ultimately decreases organ perfusion [4,5,6]; however, an excess of intravenous fluid administration may promote venous congestion—interstitial edema and congestion at pulmonary and/or systemic level [7,8,9], hence, both conditions are associated with increased mortality in the intensive care unit (ICU) [10]. Recent evidence highlights that a non-negligible proportion of fluid-responder patients may be not fluid tolerant, due to signs of venous congestion [11]. The IVC assessment, whether it is performed for the evaluation of FR and/or venous congestion, is feasible in most clinical settings. However, there are several limitations when using the IVC indexes of FR, namely the IVC collapsibility index (CI) in case of spontaneous breathing with negative pressure, and/or the IVC distensibility index (DI) under positive pressure ventilation. The subcostal approach (SC, also known as sagittal view) is the traditional method to visualize the IVC, though it may not be practical in several scenarios, such as in morbidly obese patients, individuals after laparotomy surgical incision or with in situ chest drains, or those with dilated bowel. Alternatively, the trans-hepatic method (TH, also known as coronal or right lateral) approach has been proposed, and it offers a latero-lateral IVC visualization. There is conflicting evidence on the interchangeability of use of these two views [12,13,14,15], and uncertainties have been confirmed by a systematic review [16]. Hence, the TH view remains not validated and the cut-offs used with data gathered from IVC in SC view cannot be adopted.

Meaningfully, the use of artificial intelligence is rapidly expanding across various medical contexts, particularly in echocardiography. Artificial intelligence has been incorporated to assess not only left and right ventricular function [17,18,19,20], but also in the assessment of valvular pathology [21] and congenital heart anomalies [22]. Preliminary data supports also the implementation of machine learning programs for the IVC assessment with automated border tracking (ABT) [23].

We previously studied healthy volunteers and ICU patients supported with pressure-controlled ventilation [14, 15]. Overall, comparison between ABT and M-mode (both in SC and TH windows) showed good accuracy but low precision in both studies. When comparing SC view with the TH window, we also found good accuracy and low precision in mechanically ventilated patients [14], but accuracy in healthy volunteers was not acceptable [15], suggesting greater challenges in the use of IVC in spontaneous breathing patients, as the breathing pattern itself may greatly influence results [24].

We conducted a pre-planned third study with prospective observational design on intubated ICU patients with assisted breathing on pressure-support ventilation (PSV) mode, aiming at comparison of differences in IVC measurements taken in M-mode or ABT approach, as well as those obtained from different anatomical sites (SC and TH). We hypothesized that SC and TH views do not provide interchangeable results; we also postulated that ABT may be a valid clinical asset with fair reproduction of M-mode imaging.

2 Materials and methods

This is a prospective observational single center study. Our research received approval from the local Ethical Committee (Reference protocol: 53/2022/PO) before patient enrolment. We evaluated differences between the dimensions of the IVC in two methods of visualization (SC and TH windows) in critically ill adult patients supported by PSV mode admitted to the Azienda Ospedaliera Universitaria Policlinico G Rodolico – San Marco”, Catania (Italy).

2.1 Participants

Patients were included if they were intubated and assisted in PSV with inspiratory support set at 6–8 ml/kg of predicted body weight and a positive end-expiratory pressure (PEEP) ≤ 8 cmH2O, and under stabilized and steady hemodynamic conditions (defined arbitrarily as no changes in vasopressor dose in the previous four hours). From neurological perspectives, patients with fluctuating sedation level were not included. Although we did not formally record the sedation score, most of them were still sedated at the time of imaging. We followed as much as possible the PRICES guidelines [25, 26].

2.2 Study procedure

At the time of the ultrasound examination, all patients were in a semi-recumbent position (30–40°). One operator with advanced echocardiography certification (FS) performed all scans acquiring four types of visualizations. In particular, the SC and TH windows were acquired at the respective anatomical sites, with two approaches for calculation of IVC diameters: standard M-mode or with ABT. Images were all acquired using the General Electric (GE) Venue Go R2 device holding a tracking functionality called “auto IVC”, automating the assessment of the vessel’s diameter and providing the IVC-DI and IVC-CI depending on patterns of ventilation (selection made by the operator). The sequence of imaging was not randomized, as M-mode image was always recorded before activating the ABT function, to avoid bias and influence from the values appearing on the screen when ABT mode is active. Despite the IVC variation is influenced by depth of inspiration [24] and by diaphragmatic excursion [27], images were taken within seconds so that it is reasonable to assume that patient’s contribution in the pattern of breathing was constant. The operator acquired images as close as possible to the right atrium and within 4 cm of the cavo-atrial intersection. Attempts were made by the operator to minimize errors due to the cranio-caudal displacements of the vessel during the respiratory cycle, aligning the target sector of the IVC in the center of the screen so the M-mode angle would be as close to 90° as possible.

2.3 Off-line calculation procedure

After the sonographic assessment, the images obtained were downloaded from the ultrasound machine and saved in a password-protected file. After the entire cohort of patients was recruited, in a single session the operator performed an off-line calculation of the IVC diameters and DI obtained with M-mode; when calculating the parameters from M-mode imaging, he was blinded from the ABT results. Conversely, the values of the IVC derived with ABT approach were directly obtained at the bedspace using the software tracking IVC boundaries, with each record lasting six seconds, with several images collected and stored. After the exam the operator performed a check for artifacts and errors before calculating the M-mode IVC diameters and DI, to avoid any bias.

2.4 Study groups and outcomes

We used a 2 × 2 methodological design, allowing to compare the interchangeability between: a) modalities of calculation (in M-mode or with ABT) from images obtained with a single sonographic window (SC or TH), or b) values calculated with the same modality (M-mode or ABT) from images obtained in different sonographic windows (SC vs TH).

Data obtained were divided in 4 different groups: a) SC on ABT; b) SC on M-mode; c) TH on ABT; and d) TH on M-mode. We planned as primary outcome to use the IVC-DI as the intrathoracic pressures of patients in PSV are positive throughout the respiratory cycle. Secondary outcomes evaluated were the interchangeability of IVC min and max diameters with measurements performed with M-Mode and ABT strategy.

2.5 Statistical analysis

Previous investigations evaluating the interchangeability of the SC/TH views showed highly variable results (Pearson coefficient, comprised between values of r from 0.14 [28] to 0.86 [29]). A systematic literature review highlighted that the agreement between the two views was poor to moderate [16]. Therefore, we calculated the necessary sample with a statistical power set of 80% and an alpha level at 0.05, estimating the correlation at r = 0.60. Estimated sample size was 19. Accounting for patients with no SC or no TH windows, we decided to increase the sample size to 23. Concerning the interchangeability between M-Mode and ABT calculations, literature on this correlation is rather limited. As a result, we did not determine a sample size specifically for that aspect of the study; instead, we relied on the recruited sample to perform comparisons between SC and TH windows. We computed the mean bias and limits of agreement [LoA] for IVC diameters and IVC-DI using the Bland–Altman method. The obtained plots were adjusted to account for the impact of multiple measures when comparing ABT modalities [30]. Our analyses are described with mean bias and LoA coupled with their 95% confidence interval (CI). For the comparison between ABT methods (SC vs TH), we used the Bland Altman with repeated measures calculation, and followed the CI95% calculation as suggested by Zou et al. [30].

As per methodology our previous study on mechanically ventilated patients [14], regarding the accuracy and the precision of the IVC-DI we defined as acceptable or good accuracy a mean bias of 4% and 2% (absolute values), respectively; similarly, for the precision we pre-established a LoA of 16% (acceptable) and 8% (good). To characterize the inter-rater variability among measures obtained using the same modality or approach, we computed the Spearman correlation and the intra-class correlation (ICC) coefficient. Correlation interpretation was conducted based on predefined thresholds [31].

3 Results

The baseline characteristics of the enrolled patients are outlined in Table 1. The primary admission diagnosis and individual severity scores are separately presented in the Additional material. In all 23 patients included the IVC was visualized in the SC method, whilst in four patients the TH window was not obtained (17.4%). The mean IVC-DIs were 16.3 ± 9.5% (SC) and 14.0 ± 9.2% (TH). The mean for the overall ABT calculation in SC and TH views were 26.6 ± 29.3% and 27.3 ± 30.3%, respectively.

Table 1 Baseline characteristics, ventilatory settings and hemodynamic variables of the population of patients studied whilst ventilated in pressure support ventilation (PSV) mode

The outcomes of the Bland–Altman analysis are detailed in Table 2, including mean bias, LoA (with lower and upper bound) and the Spearman rho and the ICC to convey the degree of resemblance between measurements. All values are indicated with their 95%CI. Overall, the lowest mean bias for IVC-DI (primary outcome) was found comparing the ABT images in the two anatomical regions (SC and TH), but with large LoA, as for all the other evaluations. The comparison of IVC-DI values obtained in ABT and M-mode yielded large discrepancies, as shown by a mean bias around 6% for both the SC and the TH windows of measurement, and again wide LoA.

Table 2 Summary of comparisons between measurement of the inferior vena cava (IVC) in adult patients mechanically ventilated in pressure support mode. In case of the IVC size analysis in M-mode (M), we analyzed a single measure which was the most reliable measure as decided by the experienced operator performing the calculations. In case of the analysis with automated border tracking (ABT), repeated measures were taken and saved in the database. Results of IVC distensibility index, minimum and maximum diameters (IVC-DI, IVC min and IVC max, respectively) are provided in term of mean Bias and limits of agreement (LoA) with their relative 95% confidence interval (CI), where appropriate. We also provide correlation with rho and intraclass coefficient (ICC) to describe how strong the measurements resemble each other

3.1 Different acquisition modalities

We found limited accuracy and precision for both SC and TH methods in both modalities of acquisition. Particularly, for the SC imaging of the IVC we found a DI mean bias of 5.9% [LoA − 18.4; 30.2] (Fig. 1), an IVCmax mean bias -1.9 mm, [LoA − 5.8; 2.0], and an IVCmin mean bias -2.9 mm [LoA − 7.1; 1.3]. Comparing M-mode and ABT in the TH, we found an IVC-DI bias 6.2% [LoA − 8.0; 20.4] (Fig. 2), an IVCmax mean bias -2.9 mm [LoA − 5.9; 0.2], and an IVCmin mean bias -3.5 mm [LoA − 7.8, 0.7].

Fig. 1
figure 1

Bland–Altman plot depicting the distensibility index (DI) of the inferior vena cava obtained at the subcostal site, using standard M-mode (SC-M) and artificial intelligence (SC-ABT)

Fig. 2
figure 2

Bland–Altman plot depicting the distensibility index (DI) of the inferior vena cava obtained at the transhepatic site, using standard M-mode (TH-M) and artificial intelligence (TH-ABT)

The analysis of correlation with Spearman showed poor-to-moderate reliability for the IVC-DI in SC imaging (0.43 [95%CI 0.01, 0.73]), and moderate for the TH visualization (0.60 [95%CI -0.01, 0.89]). Table 2 reports also the ICC coefficients, which were slightly higher.

3.2 Different acquisition sites

In the comparison of IVC assessment between SC and TH methods, we found that in M-mode assessment the IVC-DI showed a 1.9% mean bias [LoA − 13.7; 17.5] (Fig. 3). Likewise, the IVC diameters presented differences between sites of measurement (IVC max: mean bias 1.0 mm [LoA − 8.2; 10.3]; IVC min: mean bias 0.4 mm [LoA − 9.2; 10.1]). With the ABT evaluation, such differences between sites were: IVC-DI mean bias 1.1% [LoA − 19.6; 21.9] (Fig. 4), IVCmax mean bias 2.0 mm [LoA − 5.9; 9.8], and IVCmin mean bias 1.3 mm [LoA − 6.5, 9.2].

Fig. 3
figure 3

Bland–Altman plot depicting the distensibility index (DI) of the inferior vena cava obtained with standard M-mode, at two different sites: subcostal (SC-M) and transhepatic (TH-M)

Fig. 4
figure 4

Bland–Altman plot depicting the distensibility index (DI) of the inferior vena cava obtained with artificial intelligence mode, at two different sites: subcostal (SC-ABT) and transhepatic (TH-ABT)

Comparing the two anatomical sites, the analysis of correlations with Spearman coefficient between IVC-DIs were moderate for both M-mode (ICC 0.42 [95%CI -0.03, 0.73]) and ABT assessment (0.59 [95%CI 0.48, 0.69]). The ABT assessment showed greater correlation when measured with ICC (0.83 [95%CI 0.76–0.87]).

4 Discussion

Our investigation adds new insights to the growing literature on the interchangeability of IVC assessments conducted at two anatomical sites (SC and TH) and also with two different acquisition modalities (M-mode and ABT). The present study follows two previous studies conducted by our group [14, 15], and for ease of interpretation, we summarized the results of these three studies in Table 3. Balancing the findings of these studies it seems apparent that assessment of the IVC from different views or with different measurements approaches yields suboptimal precision, which hampers the interchangeability of results. Moreover, the correlation between calculations is generally poor to moderate.

Table 3 A summary of comparisons made between measurements of the inferior vena cava (IVC) in the three studies performed in healthy volunteers, and in patients intubated and ventilated in pressure control or pressure support mode (PCV and PSV, respectively)

The mean bias reported by the present study is similar to the one conducted in patients ventilated in controlled mode for the differences between images acquired at different anatomical sites (SC or TH). This suggests a similar influence of mechanical ventilation on direction of changes in the diameter of IVC. However, slightly greater mean bias (lower accuracy) was seen when comparing these findings acquired at the same anatomical site with different approach (m-mode or ABT). It cannot be excluded that this is due to greater fluctuations in patient’s contribution and influence on breathing, though images were recorded within seconds. The progressive stimulation produced by the pressure of the ultrasound probe during the course of the exam may have produced ups and downs in the level of consciousness and breathing contribution of the patient, with variable interaction with the ventilator.

In general, it is likely an irregular/anisotropic change of IVC diameters during the respiratory cycle, with an elliptical shape occurring in most cases due to the not proportional reduction in antero-posterior (seen in SC) and latero-lateral (visualized in TH) diameters [32, 33].

It must be noted that the use of the IVC-DI as primary outcome can be challenged. There is still ambiguity in the literature on which index should be applied to patients ventilated in PSV. In a post-hoc analysis comparing the IVC-CI and using the cut-off commonly adopted in spontaneously breathing patients, our whole cohort would have been identified as not being FR, which is clinically unlikely (patients in weaning and most commonly without noradrenaline support). Therefore, we think that choosing the IVC-DI was the most adequate approach. and Juhl-Olsen et al. demonstrated that IVC-CI cannot be used as an estimate of FR in both SC and TH views in patients in positive pressure ventilation [34].

We think our study has also the value on warning clinicians that IVC assessment in different projections and modalities is not interchangeable, and this assumes more importance in the evolving ICU landscape with shift towards reduced sedation and increased spontaneous breathing [35,36,37]. Until now, there was scarce evidence about the clinical utility of IVC assessment for FR in patients ventilated in PSV. However, it must be clear that other commonly utilized dynamic indexes as stroke volume variation and pulse pressure variation have shown poor prediction of FR in the transition from fully controlled mechanical ventilation to the PSV and spontaneous breathing modes [38].

We speculated that ABT technology might reproduce the standard M-mode IVC measurements, bringing the additional value in enhancing everyday clinical practice, with the option of faster and multiple measurements and reduced clinical workload; however, our results point in a different direction. One study reported that machine learning generated multiple models for the prediction of FR and these were comparable with the hemodynamic findings of passive leg raising. Among other parameters, the IVC CI was one of the variables supported by the model in addition to Doppler measurements such as velocity–time integral, S-wave, E/Ea ratio, and E-wave [39]. Despite deep learning algorithms enabled of IVC video classification with the aim to evaluate FR have demonstrated moderate performances by the trained algorithm (AUC 0.70) [23], the same group of authors demonstrated that algorithms used for implementation of ABT for IVC measurements are significantly influenced by the image definition, with worse results obtained with images of lower definition [40].

4.1 Strengths and limitations

The main strength of our study is its factorial 2 × 2 design, which allowed for cross-validity of our measurements with a reference method. This study was conducted in a homogenous cohort intubated and assisted in PSV with their own respiratory trigger and on stable hemodynamic conditions, with roughly one third of patients on a single drug for hemodynamic support (norepinephrine, mostly low dose).

Our study has several limitations. First, the study did not allow entire blinding of the operator as the ABT calculation is obtained in real-time. However, we minimized the impact of this issue as the M-mode calculations were performed off-line. Second, we did not assess FR and therefore we cannot extrapolate a cut-off regarding this aspect, and our study remains an investigation limited to the comparison of results between different acquisition sites (SC or TH) and calculation modes (M-mode or ABT). Third, despite the operator taking measurements has advanced echocardiography experience, the absence of other operators limits the external validity of the study. Fourth, this study was performed with a single manufacturer ABT equipment. There might be differences in other vendors' tracing technology, finally resulting in different precision and accuracy. Our findings hold true for the currently available technology, and we might expect a dynamic development in advanced ABT. Fifth, this study evaluated a standard approach to measuring IVC-DI with M-mode and ABT. Because IVC demonstrates an anisotropic change, single-point and single-axis measurements might not reflect the true IVC size changes, and the use of 2-D assessment with IVC area may produce better results. Although sounding, the evaluation of IVC area change has been less studied and deserve further investigation. Sixth, we did not perform an assessment of respiratory efforts. All measurements were taken during a period of a quite breathing with a standardized tidal volume of 6–8 ml/kg of predicted body weight, but no formal assessment of respiratory effort (i.e. P0.1), esophageal pressure or diaphragmatic excursion were utilized. Seventh, though patients were on steady hemodynamic and neurological condition, we did not record the sedation scores and depth of breathing may influence results [27].

5 Conclusions

In critically ill patients with assisted breathing in pressure support ventilation, we found that the measurements of the inferior vena cava distensibility index calculated with M-mode or automated border tracking software are not interchangeable, due to suboptimal precision. Moreover, the inferior vena cava distensibility index gathered from subcostal or transhepatic view produces not comparable results in regards to accuracy and precision.