Background

The process of weaning from mechanical ventilation remains one of the most critical challenges in patients undergoing mechanical ventilation in the intensive care unit (ICU) [1]; the multidisciplinary team must study the optimal time for weaning from the mechanical ventilator as premature weaning may lead to weaning failure and thus increase the risk of hospital acquired infections, costs in care, ICU length of stay, hospital length of stay and diaphragmatic dysfunction [2, 3].

Current guidelines recommend several indices applied at the bedside to predict successful weaning from mechanical ventilation. However, they have yet to prove ideal [4], probably due to the heterogeneity of critically ill patients, which limits the predictive ability of these indices in different patient subgroups [5]. A spontaneous breathing trial (SBT) is an appropriate way to prepare the patient for extubation [6]; however, even after successful SBT, failure rates and subsequent reintubation can exceed 20% in the highest-risk patients [7].

Patients on mechanical ventilation may have a multifactorial deterioration of diaphragmatic function that can lead to weaning failure and prolongation of invasive mechanical ventilation [8, 9]; therefore, assessing diaphragmatic function could help predict the patient's ability to maintain spontaneous breathing over time [10].

The use of diaphragmatic ultrasound in the intensive care unit is a technique of growing interest due to its portability, speed, and safety. Its use allows reporting on the structural and functional status of the diaphragm and can predict the probability of successful mechanical ventilator weaning [11]. Although some studies have demonstrated the usefulness of ultrasound in predicting the success of mechanical ventilator weaning, others have shown controversial results that continue to motivate continued research of this technique. This systematic review and meta-analysis aim to compile the best available evidence to elucidate the effectiveness of diaphragmatic ultrasound as a predictor of successful weaning from the mechanical ventilation.

Materials and methods

Search for studies

A systematic review and meta-analysis of observational studies involving intubated patients connected to mechanical ventilation who underwent uni- or bilateral diaphragmatic ultrasound to assess diaphragm function prior to extubation was performed to identify whether there is an association between diaphragm function and extubation success. The systematic review protocol was registered in the Prospective International Register of Systematic Reviews (PROSPERO) CRD42022316349 database, and the systematic review was conducted according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines.

Two investigators conducted independent systematic literature searches of PUBMED, TRIP, EMBASE, COCHRANE, SCIENCE DIRECT, and LILACS databases published between January 2016 through July 2022, discrepancies between the two investigators were resolved with the intervention of a third investigator. The terms diaphragm, diagnostic imaging, ultrasound, weaning, mechanical ventilation, extubation, ultrasonography, and articles in all languages were included.

Methodological quality assessment was performed independently by two investigators using the Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2) instrument, followed by the kappa coefficient to assess inter-investigator agreement; additionally, the certainty of the evidence is evaluated using the GRADE (Grading of Recommendations Assessment, Development, and Evaluation) methodology.

Selection criteria and outcome measures

The meta-analysis included studies published between 2016 and 2022, and weaning success was defined as maintaining spontaneous breathing for the next 48 h after extubation.

Inclusion criteria

Prospective or retrospective observational studies involving adult patients with more than 24 h of invasive mechanical ventilation in whom uni- or bilateral diaphragmatic ultrasound was performed during spontaneous breathing trial.

Exclusion criteria

Non-primary studies, studies with less than 20 participants, patients with neuromuscular disease, studies in pregnant patients, case reports, animal studies, and editorials.

Statistical analysis

Rev Man 5.4 the Cochrane Collaboration (2014) software was used for bias assessment, and Metadisc software (Hospital Ramón Y Cajal, Madrid, Spain) for meta-analysis [12]. Independent analyses were performed for diaphragm excursion (DE) and diaphragm thickening fraction (DTF); likewise, an independent analysis of the results according to hemi-diaphragm was assessed; weaning success was defined as the absence of disease in the 2 × 2 table. Sensitivity and specificity analysis was performed for the studies that evaluated DE and DTF, positive and negative likelihood ratios, and diagnostic odds ratios with their respective 95% confidence intervals (CI). A statistically significant value of P < 0.05 was considered.

Summary receiver operator characteristic curves (SROC) and area under the summary curve (AUSROC) were created to assess the accuracy of DE and DTF for predicting extubation success [13]. Publication bias was assessed using the funnel plot and Egger's statistic [14].

The Cochrane Q and I2 tests assessed heterogeneity, and the source of heterogeneity was assessed by meta-regression analysis and a subgroup analysis on both sensitivity and specificity. Study characteristics that could cause uncertainty related to the diagnostic accuracy of diaphragm ultrasound were examined; for example, the cutoff values used as a reference, the risk of bias assessed concerning index test and flow and times, the type of spontaneous breathing trial (pressure support (PS) or T-piece), the homogeneity versus heterogeneity of age, the prevalence of success, and the position of the patient. In addition, an analysis was performed to identify whether the cause of the requirement for mechanical ventilation could influence the diagnostic performance of ultrasound.

Results

In the initial search, 2845 articles were obtained in six databases, 85 duplicates and 2700 articles were eliminated by titles and abstracts, leaving 60 articles; subsequently, the full-text reading proceeded, eliminating 34 articles. (The detailed flow diagram is shown in Fig. 1.)

Fig. 1
figure 1

Study selection flowchart

Twenty-six studies were included in the qualitative analysis of which 19 were included in the quantitative analysis. The meta-analysis included 1204 subjects, of whom 908 had DE assessment, and 945 had DTF assessment; in several studies, both DE and DTF were measured.

Characteristics of the included studies

The different characteristics of the studies in the systematic review are listed in Table 1. Most of the studies were of the prospective cohort type, with the exception of [15], a retrospective observational study, and [16], a prospective randomized clinical trial.

Table 1 Studies included in the systematic review and meta-analysis

Those studies published between 2016 and 2022 were conducted in different countries such as eight from Egypt [1, 2, 1621]; one from the USA [22]; five from France [10, 2326]; one from Greece [27]; one from Pakistan [28]; two from China [29, 30]; one from Korea [15]; two from Thailand [31, 32]; one from Spain [33]; one from Norway [34]; one from Colombia [35]; one from Turkey [36]; one from Bangladesh [37].

Twelve studies evaluated DTF and DE [1, 2, 10, 15, 17, 20, 21, 26, 30, 35-37]; five studies evaluated only DE [24, 2729, 32]; six studies evaluated only DTF [18, 19, 22, 23, 31, 33]; one study measured index excursion time [34]; two studies measured diaphragmatic rapid shallow breathing index (DRSBI) [10, 16]; and two studies measured contraction velocity [21, 35]. Most studies evaluated both diaphragms (57.7%), and 11 studies evaluated only the right diaphragm (42%).

Ultrasound measurements were performed in different positions, the most prevalent being semi-sitting from 20 to 45° reported in 16 studies [1, 2, 16, 17, 1924, 24, 25, 31, 33, 35, 37]; three studies performed the measurement in a supine position [27, 28, 32], and six studies did not report the patient position [10, 15, 18, 26, 30, 36].

There was a variation of 24 to 48 h in the minimum duration of mechanical ventilation prior to inclusion in the studies; five studies did not describe the time on mechanical ventilation prior to the start of weaning; in one study, the duration of mechanical ventilation prior to inclusion was 11 days [37]; in two studies, it was seven days [24, 26]. Exclusion criteria for most studies were conditions affecting diaphragm function, predominantly phrenic nerve injury.

Different definitions of weaning were identified. Successful weaning was defined as the patient's ability to maintain spontaneous breathing within 48 h after extubation without requiring invasive or noninvasive ventilatory support or performance of tracheostomy. Failure of extubation was defined in most studies as the inability to maintain spontaneous breathing within 48 h after extubation with a requirement of noninvasive ventilation, high-flow nasal cannula, reintubation, or tracheostomy.

Regarding the weaning protocol, in all studies, patients were prepared by a spontaneous breathing trial, which was performed with pressure support (PS inspiratory pressure = 5–8 cmH2O and PEEP = 0–5 cmH2O) [1, 2, 10, 16, 18, 22, 25, 34], or using the T-tube [17, 19, 20, 24, 26, 30, 36, 37], some studies combined PS and T-tube [15, 23, 27, 29, 32, 33, 35], and two studies did not record the spontaneous breathing trial method [20, 28].

Evaluation of methodological quality

The results of the risk of bias assessment for the studies included in the meta-analysis are shown in Fig. 2. There was agreement among the authors Cohen's Kappa 0.67 (95% CI 0.54–0.73); in 68% of the studies, there was the unclear risk of bias regarding patient selection, in 5.2% of the studies, there is an unclear risk of bias regarding the index test, in 21% of the studies, there is an unclear risk of bias regarding the standard reference, and 53% in flow and timing; regarding applicability concerns in two studies (10.5%), there was severe concern regarding patient selection [15, 18] unclear concern in one (5.2%) study regarding the index test [18] and in five (26.3%) studies regarding the standard reference [17, 18, 25, 31, 33]. The evaluation of the certainty of the articles included in the meta-analysis shows a moderate accuracy of the test (diaphragmatic ultrasound) (see Additional file 1: Table S1).

Fig. 2
figure 2

Risk of bias and applicability issues with Quality Assessment of Diagnostic Accuracy Studies (QUADAS)

In most studies, diaphragmatic ultrasonography was performed during the spontaneous breathing trial with a variation of the exact timing of the test, and in other studies, diaphragmatic ultrasonography was performed before and after extubation. Most studies did not clearly report the elapsed time between diaphragmatic ultrasound and extubation [2, 1721, 25, 27, 28, 33, 34].

The method of patient selection was not reported in most studies. Several studies did not present the flowchart explaining in detail the patient selection and follow-up [1, 15, 17, 18, 21, 28, 30, 31, 35, 37].

The outcome of weaning varied according to the definition of extubation success or failure and, in some studies, needed to be clearly defined. In most of the included studies, the index test (ultrasound of the diaphragm) was interpreted without knowing the outcome of weaning.

No publication bias is evident for the studies that analyzed DE, funnel plot (see Fig. 3), and Egger's test 0.75, nor for those that analyzed the DTF Egger's test 0.73. (Additional file 1: Fig. S1).

Fig. 3
figure 3

Funnel plot for diaphragmatic excursion (A) and diaphragm thickening fraction (B)

Heterogeneity of studies

Nineteen studies were included in the meta-analysis [1, 2, 15, 1721, 25, 27-34, 36, 37], and seven studies were excluded from the meta-analysis. After all, their definition of success did not include the ability to maintain spontaneous breathing for 48 h after extubation, because they used other ultrasound measurements and because they did not present accurate data for quantitative analysis.

Figure 4 presents the forest plot of sensitivity 0.80 (95% CI 0.77–0.83) and specificity 0.80 (95% CI 0.75–0.84) for DE and sensitivity 0.85 (95% CI 0.82–0.87) and specificity 0.75 (95% CI 0.69–0.80) for the DTF; Fig. 5 shows the SROC curve illustrating the summary point and the estimation of the sensitivity and specificity of each of the studies; also, the prediction contours with 95% CI and for DE AUSROC 0.87 and DTF AUSROC 0.87. The likelihood ratios obtained in the bivariate analysis for DE were L.R. (+) 4.64 (95% CI 4.19–5.0) L.R. (-) 0.21 (95% CI − 0.08–0.5) and for DTF L.R. (+) 3.5 (95% CI 3.19–3.84) LR (−) 0.18 (95% CI − 0.17–0.54) (see Additional file 1: Table S2).

Fig. 4
figure 4

Forest plot of sensitivity and specificity (A) Diaphragmatic excursion. B Diaphragmatic thickening fraction. Flevari DE left > 10 mm*, **DE right < 10 mm; Huang*DE right; Yoo*DE ≥ 1.4 cm, DTF > 30%

Fig. 5
figure 5

SROC curves of sensitivity and specificity (A) diaphragmatic excursion (B) diaphragmatic thickening fraction

Additional file 1: Fig. S2 shows the forest plot for the diagnostic odds ratio (DOR) for DE 17.1 (95% CI 10.2–28.6) and for DTF 17.2 (95% CI 9.16–32.3). Heterogeneity was evidenced in the sensitivity and specificity for DE (I2 65.1% Chi-square 45.86 P 0.001; I2 67.8% Chi-square 49.7 P 0.001, respectively) and for DTF (I2 68.8% Chi-square 44.8 P 0.001; I2 59.4% Chi-square 34.4 P 0.001); the threshold effect measure was evaluated by Spearman correlation, obtaining a value of 0.125 (P 0.63) for DE and − 0.198 (P 0.47) for DTF.

A Fagan nomogram was constructed to illustrate diaphragm ultrasonography's pre- and post-test probability of predicting extubation success (Fig. 6). The pretest probability of ultrasonography predicting successful extubation was 67% for DE and 68% for positive DTF (above the cutoff point); the post-test probability for DE and DTF was 87% and 90%, respectively. The post-test probability of successful extubation for negative DE (below the cutoff point) was reduced to 26%, and for thickening, the DTF was reduced to 30%, respectively.

Fig. 6
figure 6

Fagan's nomogram for diaphragmatic excursion (A) and diaphragmatic thickening fraction (B)

Subgroup analysis and meta-regression

After eliminating studies with outlier DTF cutoff values [15, 20, 37], sensitivity increased to 0.86 (CI 0.83–0.89 I2 68.5% Chi2 34.9 P 0.002) with no changes in heterogeneity; specificity increased to 0.78 (0.72–0.83 I2 56.9% Chi2 25.5 P 0.01), with decreasing heterogeneity, and AUSROC increased to 0.90; continuing with the manual analysis by subgroups, studies with atypical values of sensitivity [15, 21] and specificity [32] for the DE were eliminated, showing an increase in sensitivity to 0.83 (95% CI 0.79–0.86) with a decrease in heterogeneity (I2 43.9%, Chi2 23.1 P 0.03); and decreased specificity to 0.77 (95% CI 0.71–0.83); likewise, decreased heterogeneity (I2 52.9%, chi2 27.5 P 0.01); the AUSROC increased to 0.88 (see Additional file 1: Fig. S3).

The exclusion of studies that presented a high risk of applicability according to the QUADAS 2 evaluation [15, 18] was performed, showing an increase in sensitivity to 0.87 (I2 66% Chi2 32.4 P 0.006) and specificity to 0.77 (I2 60% Chi2 27.7 P 0.003) without significant changes in the heterogeneity for DTF; concerning DE, there is an increase in sensitivity to 0.82 (I2 60% chi2 40 P 0.003) with no change in heterogeneity, no change in specificity for diaphragmatic excursion; the AUSROC was modified to 0.88 for a diaphragmatic excursion and 0.89 for DTF (see Additional file 1: Fig. S4).

A subgroup analysis was performed to evaluate whether the cause for which mechanical ventilation was required affects the diagnostic performance of ultrasound, eliminating studies where the main cause was not respiratory; there was no increase in sensitivity and specificity for DE 0.80 and 0.79, respectively, and AUSROC 0.86; similarly, there was no increase in sensitivity and specificity for thickening fraction 0.85 and 0.73, respectively, and AUSROC 0.86 (see Additional file 1: Fig. S5).

Finally, a subgroup analysis was performed for studies using PS and T-tube during the spontaneous breathing trial, with no evidence of statistically significant changes in sensitivity and specificity for DE and DTF (see Additional file 1: Fig. S6).

The meta regression analysis was performed, exploring possible sources of heterogeneity such as age, the prevalence of success, and patient position during the index test (diaphragmatic ultrasound); it was evidenced that patient position was a cause of heterogeneity for the diagnostic accuracy of the diaphragmatic thickening fraction in the studies included in the meta-analysis (coefficient of − 1,99 P = 0.012 DOR 0.14 CI 95% 0.03–0.6) (see Additional file 1: Table S3); no statistically significant differences were found between the age of patients who successfully weaned from mechanical ventilation and those who failed; likewise, there was no evidence that the prevalence of success was a source of heterogeneity.

Discussion

The results of this study suggest adequate accuracy of diaphragmatic ultrasound in predicting weaning success; the combined sensitivity and specificity of DE and the AUSROC were 0.85, 0.75, 0.87, respectively, and for DTF 0.80, 0.80, 0.87. Our data show a satisfactory diagnostic accuracy for predicting extubation success. It is relevant to report that there was heterogeneity in the sensitivity and specificity of the studies included in the meta-analysis; likewise, several studies presented significant methodological weaknesses, two studies with a high risk of applicability in patient selection, and several studies with unclear risk of bias in patient selection, flow and timing.

During the last few years, some systematic reviews and meta-analyses have been published on the usefulness of diaphragmatic ultrasound in predicting the success or failure of weaning in patients undergoing mechanical ventilation [3841]. The results of our study are consistent with most of the previously mentioned published studies. Li et al. evidenced in their study a satisfactory diagnostic accuracy in predicting the outcome of extubation; Llamas Alvarez concluded that DTF is by itself a modest predictor of weaning outcome; Garcia Sanchez et al. concluded that ultrasound dysfunction of the diaphragm is associated with an increased risk of extubation failure; Le Neindre et al. demonstrate that low values of diaphragmatic excursion and diaphragmatic thickening fraction predict the risk of extubation failure with moderate to high specificity.

This research included five new studies compared to the previous meta-analysis [41] that mainly analyzed DE and DTF, which allowed the number of subjects studied to be increased to 1204; likewise, an exhaustive subgroup analysis was performed to find sources of heterogeneity in the sensitivity and specificity of diaphragmatic excursion and diaphragmatic thickening fraction that could affect the ability of diaphragmatic ultrasound to predict extubation success, finding mainly factors such as cutoff values or atypical thresholds for each of the measurements and the risk of applicability found in the quality assessment of the studies; likewise, a bivariate meta-regression analysis was performed, finding the patient's position at the time of the test as the primary source of heterogeneity.

The high sensitivity values reported in the present study indicate that patients with values above approximately 29% for DTF and > 1 cm for DE have a high probability of successful extubation; however, it is essential to mention that weaning success may be influenced by additional factors such as nutritional status, respiratory and cardiovascular integrity and psychological conditions to mention a few [42].

We can evidence in the limitations of this study; the possible biases that may contain each of the studies included in the meta-analysis as randomized trials were not included; the absence of a common reference value for diaphragmatic thickening and excursion fraction may introduce biases that cause measurement imprecision; sex was not considered in the analysis by subgroups, nor the time on mechanical ventilation before the spontaneous breathing trial and ultrasound measurement; and these could influence the ultrasound result.

In contrast to the findings of this meta-analysis, the two studies mentioned here found no association between values below the cutoff point of diaphragmatic excursion and thickening fraction measured with ultrasound and the outcome of weaning from mechanical ventilation [24, 26]. Mariani et al. defined extubation failure as the need for intubation within 72 h after extubation, and Vivier et al. defined extubation failure as the need for intubation or death seven days after extubation, which differs from our investigation since the studies included in this meta-analysis evaluated extubation success 48 h after mechanical ventilator weaning.

Slight variations in the measurement between observers may affect the measurement result and cause heterogeneity; it is undoubtedly an observer-dependent technique; despite this, several studies have concluded that diaphragmatic ultrasound measurements are reproducible.

The results of this study have implications for clinical practice, showing that diaphragmatic ultrasound is a technique that can be used in the intensive care unit during the spontaneous breathing trial to contribute to objectively predict the success of weaning from mechanical ventilation; it is a portable, fast, noninvasive, simple and safe technique that does not emit any ionizing radiation that affects health-care personnel; However, given the high heterogeneity found, which is frequent in meta-analyses of diagnostic tests, the results of the pooled measurements should be interpreted with caution, especially in the different subgroups of critically ill patients, in order to achieve a personalized determination of the optimal result.

Therefore, access to diaphragmatic ultrasound in the intensive care unit should be generalized, and priority should be given to achieving its universal use, especially in routine respiratory monitoring, to guide the management of patients undergoing mechanical ventilation.

Conclusions

The results of this systematic review and meta-analysis show that measurement of diaphragmatic excursion and diaphragmatic thickening fraction predict the probability of successful weaning from mechanical ventilation with satisfactory diagnostic accuracy; however, significant heterogeneity was evident in the different included studies. Studies of high methodological quality in specific subgroups in intensive care unit patients are needed to evaluate the role of diaphragmatic ultrasound as a predictor of weaning from mechanical ventilation.