Introduction

Mechanical ventilation is used to treat 30–40% of patients admitted to critical care [1, 2]. Duration of invasive ventilation is associated with increased mortality [2]. Successful weaning and liberation from invasive mechanical ventilation is important to improve outcomes in critically ill patients [3]. Current international guidelines recommend daily assessment of readiness for extubation with a spontaneous breathing trial, regular breaks in sedation, early mobilisation and protocolised rehabilitation to help with weaning [4]. However, in common with many critical care interventions, successful liberation from ventilation is a complex process that requires patients to co-operate, breathe without mechanical aid, maintain their airway, expectorate secretions and tolerate ensuing physiological stress. The process of reliably identifying when a patient is ready to be extubated following invasive mechanical ventilation is clinically challenging [5]. Spontaneous breathing trials (SBT) are a commonly used test to assess patients’ readiness to wean [6, 7]. They assess the patients ability to breathe without positive airway pressure support. Most patients are successfully weaned off ventilation following the first SBT, but up to a third of patients fail one or more SBTs, requiring prolonged ventilation and are deemed ‘difficult to wean’ [3, 8]. Whilst modern ventilators have improved with advancing technology, there is a lack of consensus of how best to conduct weaning in this population [5].

Non-invasive ventilation has become a commonly used alternative to invasive ventilation [9]. Extubation to non-invasive ventilation (NIV) following a failed SBT may be an attractive weaning strategy. The benefits of this approach include avoidance of the injurious effects of invasive mechanical ventilation and reduction in sedation requirements and a lower risk of nosocomial pneumonia [3, 8]. The key risk is the potential need for re-intubation, which is associated with an increased risk of mortality [10]. Non-invasive ventilation could, however, prolong the period of weaning if it led to continuation of non-invasive mechanical ventilation longer than would have occurred with an invasive ventilation weaning strategy. Thus uncertainty exists about the most effective strategy. Current European Respiratory Society/American Thoracic Society guidelines recommend that NIV be used as a weaning strategy from invasive ventilation, but this recommendation is limited to hypercapnic respiratory failure patients [11]. These guidelines are based on the 2014 Cochrane review on the use of non-invasive weaning, which included 16 randomised studies and concluded that NIV weaning was superior to invasive weaning with significantly reduced mortality, weaning failures, ventilator associated pneumonia (VAP), intensive care and hospital length of stay and total duration of mechanical ventilation [12]. However, these studies were often conducted in single centres involving small number of patients, limiting the generalisability of their review findings.

On the basis of ongoing clinical uncertainty as to optimum weaning strategy, the UK National Institute of Health Research commissioned the Breathe study in 2013. As the largest randomised controlled trial published to date which addresses this clinical question, the recent publication of the Breathe study provides a timely opportunity to review clinical evidence in this area [13]. The aim of this systematic review and meta-analysis is to evaluate the effect of a weaning strategy of using non-invasive ventilation, compared with ongoing invasive mechanical ventilation, in adult patients that are considered clinically ready for weaning on mortality at hospital discharge and other clinically important outcomes.

Methods

We conducted this systematic review and meta-analysis in accordance with a protocol, registered with PROSPERO (CRD42017076522).

Study eligibility criteria

We included all randomised and quasi-randomised controlled trials that evaluated the use of non-invasive ventilation, compared to invasive ventilation, as a weaning strategy in adults with respiratory failure intubated for at least 24 h. We excluded studies of weaning in the immediate (up to 24 h) postoperative period, or where a comparator group was either standard oxygen therapy or continuous positive airway pressure (CPAP). Quasi-randomised controlled trials were defined as interventional trials where the group allocation was not truly random (e.g. group allocation by day of week). Studies of NIV use after unplanned extubation, as a rescue therapy after failed extubation, and to facilitate tracheostomy weaning were also excluded.

We included studies reported only as abstracts. No date or language restrictions were applied.

Information sources and search strategy

We searched EMBASE, MEDLINE and the Cochrane Central Register of Controlled Trials (CENTRAL) bibliographic databases (from inception to 2018) using a combination of keywords and MeSH terms in February 2018. An example search strategy is included in the electronic supplementary material (ESM). Additional citations were identified through citation tracking (forward and backward) of eligible studies and relevant systematic review papers (see Supplementary Appendix A in ESM for search strategy).

On search completion and following duplicate removal, two authors (JY/KC) independently assessed the title of each citation and excluded obviously irrelevant titles. A third author (SG) acted as adjudicator where agreement could not be reached. Following title screening, the same independent review process was adopted for the screening of abstracts and full texts.

Data extraction

Data were extracted onto a piloted proforma by one author and checked for accuracy by a second author (JY/KC). Extracted data included study characteristics (e.g. setting, publication year), population characteristics (e.g. proportion of patients with chronic pulmonary disease), characteristics of the intervention and comparator, and outcomes.

Outcomes

The primary review outcome was mortality at hospital discharge. Secondary outcomes were mortality (measured at 30, 60, 90 and 180 days), health-related quality of life, duration of invasive ventilation, weaning failure, intensive care unit (ICU) length of stay, hospital length of stay and adverse events.

We acknowledged that there would likely be variation in the definition of outcomes such as weaning failure, so elected to accept the outcome as defined in each study. For the adverse event of VAP, we excluded studies that reported non-specific pneumonia outcomes, such as nosocomial pneumonia or antibiotic use.

Risk of bias

We assessed risk of bias in individual studies using the Cochrane tool for bias assessment in randomised controlled studies. The assessment was undertaken by two authors (JY/KC). We assessed effects of publication bias on primary outcome of mortality by constructing and visually inspecting a funnel plot of study effect estimates and standard error of log RR.

Synthesis of results

Bayesian hierarchical models were used to perform the random effects meta-analysis to account for between-trial variations in treatment effects, as well as variability within a trial. Bayesian hierarchical models allow for partial pooling of information across trials, and so the Bayesian estimate of the treatment effect for each trial is informed by the results of other trials. This improves estimates of treatment effects where there is little information, and reduces the sensitivity of the estimates to studies with extreme results. We employed the Bayesian approach for its flexibility and ability to model a small number of studies and account for uncertainty in the treatment effect. Bayesian analyses allow previous information about the treatment effect to be incorporated via the prior distribution. This is then updated using the current data to become the posterior distribution, which provides the probability of various estimates of the treatment effect.

Posterior distributions were obtained for the treatment effect for each study and the posterior mean estimate and 95% highest posterior density interval (95% HDI) were used to summarise the Bayesian estimates. HDIs are Bayesian alternatives to confidence intervals and for a 95% HDI there is a 95% probability that the treatment effect falls in this interval. A posterior distribution for the treatment effect across the studies was also obtained from the Bayesian hierarchical model, where the mean represented the pooled effect and the variance of this distribution describes the between-study heterogeneity.

The Bayesian model requires specification of prior distributions for the pooled effect and between-study heterogeneity. We used minimally informative prior distributions for the meta-analysis since we incorporated all relevant previous studies into the meta-analysis and wanted the data from these trials to drive the final inferences. A normal distribution with a large variance, N(0, 106), was used for the pooled treatment effect. A uniform distribution with a lower bound of 1/1000 was used for the between-study standard deviation to ensure positive values; an upper bound of 10 was generally used, apart from the 30-, 60- and 90-day mortality which used an upper bound of 2 due to convergence issues and the small number of studies.

For each outcome, we assumed that the treatment effect for each study was normally distributed and had its own mean and variance (assumed to be equal to the study’s observed variance). For binary outcomes (i.e. hospital mortality, 30-, 60- and 90-day mortality, VAP), we assumed that the logarithms of the odds ratios (OR) were normally distributed. Since most of the studies have a small sample size, and there are zero counts for some studies, a value of 0.5 was added to each of the cells (in the 2 × 2 table that the data formed for each study) so that the log OR could be calculated [14]. For the duration of invasive ventilation and length of stay (LOS; hospital and ICU LOS) outcomes, means and standard deviations from each of the studies were used in the meta-analysis. Studies that report medians and interquartile ranges were not included in the meta-analyses. Effect sizes for the duration/LOS outcomes were calculated as the standardised difference in the means (SMD), i.e. Hedges’ g [15], and were assumed to be normally distributed. The Bayesian hierarchical models did not include covariate adjustment.

The Bayesian meta-analyses were performed using the Markov chain Monte Carlo (MCMC) algorithm and were conducted in R (version 3.4.1) using the rjags package (JAGS version 4.3.0). Three chains were run using the MCMC algorithm, each with 50,000 iterations and a burn-in of 1000 iterations (and a thinning of 10). Convergence of the chains was determined by examining the trace and density plots of the parameters, and the Gelman–Rubin diagnostic [16] (see Supplementary Appendix B in in ESM for code and Appendix C for datasets).

Analysis of subgroups or subsets

We performed subgroup analysis for the majority of outcomes to examine studies by patient population: chronic pulmonary disease (COPD) patients compared with the mixed ICU population. On the basis of a previous systematic review by Burns et al., there might be significantly higher mortality benefit for COPD patients in NIV weaning [17]. Clinical guidelines from Canadian Critical Care Trials Group/Canadian Critical Care Society Non-invasive Ventilation Guidelines Group also suggested that NIV should be used to facilitate early liberation from mechanical ventilation in COPD patients but no recommendation was made for the mixed ICU population due to lack of benefit [18]. We were unable to perform this subgroup analysis for 30-, 60- and 90-day mortality due to limited number of studies.

For the primary outcome, we also performed an additional subgroup analysis to examine studies by intervention type: studies that used protocolised weaning compared with studies where a weaning protocol was not used (or not stated). We defined protocolised weaning as a strategy where, provided there was no contraindication, ventilatory support was reduced by a set amount (e.g. 2 cmH2O) over a set time period (e.g. every 2 h).

We conducted two sensitivity analyses for the primary outcome, which were not defined a priori. Firstly, we repeated our meta-analysis with different priors to examine the impact of our choice of prior on the model and findings of our meta-analysis. We also examined the group of studies that used a failed SBT as a requirement for study entry as this reflects current standard practice.

Results

Database searches identified 1508 citations with a further nine citations identified through citation tracking (Fig. 1). Following removal of duplicates and ineligible citations, we included 25 relevant studies involving 1609 patients in the quantitative analysis. Characteristics of included studies are shown in Table 1. The majority of included studies were conducted in Asia (n = 11, 46%) and Europe (n = 8, 32%). Eighteen studies (72%) were reported as single-centre, with only three studies (12%) having more than five centres. Across all studies, the total sample size was 1609 with a median sample size per study of 50 (IQR 30–69). Most studies recruited only patients with chronic respiratory disease (n = 15, 60%), although these studies accounted for only 49% (n = 783) of the total sample. The remaining ten studies (40%) recruited a mixed ICU population. None of the included studies reported outcome data adjusted for baseline patient characteristics.

Fig. 1
figure 1

PRISMA flow diagram. ‘Qualitative’ refers to qualitative assessment of the study methodology and ‘quantitative’ is the number of studies included in meta-analysis. Studies included in quantitative synthesis (meta-analysis)

Table 1 Characteristics of included studies

Studies included patients that were considered potentially suitable for extubation (e.g. acceptable level of consciousness, oxygen requirement and haemodynamic status), although precise inclusion criteria varied between studies. Fourteen studies (56%) used a failed SBT as a trigger for study entry, which excluded patients who would be suitable for immediate extubation. Other studies used completion of the pulmonary infection control (PIC) window (20%) or a specified period of invasive mechanical ventilation. Only six studies described the use of a standardised weaning protocol whilst no details were provided in the other studies.

Risk of bias

The risk of bias across studies varied markedly (Fig. 2). There was a lack of description of randomisation in the majority of studies. The inability to blind caregivers to treatment allocation meant that all studies were considered to be at high risk of performance bias. The majority of the included studies did not describe any strategies to blind outcome assessors from group allocation. Studies were rarely registered, creating a risk of selective reporting. Visual inspection of funnel plot did not reveal important asymmetry or publication bias for mortality outcome.

Fig. 2
figure 2

Risk of bias

Outcomes

The primary outcome of hospital mortality was reported in 16 studies [13, 19,20,21,22,23,24,25,26,27,28,29,30,31,32,33], which included 1156 patients. Overall, hospital mortality was lower in the NIV weaning group compared to the invasive weaning group (pooled OR 0.58, 95% HDI 0.29–0.89) [27]. Subgroup analyses of studies by patient population showed that pooled hospital mortality rates were lower in the NIV arm than those in the control group in patients with COPD (pooled OR 0.43, 95% HDI 0.13–0.81). In the mixed ICU population, pooled hospital mortality rates were similar and there was no evidence of difference in survival observed between the two arms (pooled OR 0.88, 95% HDI 0.25–1.48) but a wide HDI for this outcome made the results less clear-cut for this population (Fig. 3).

Fig. 3
figure 3

Forest plot comparing hospital mortality rates for NIV and invasive weaning, by patient population (COPD vs. mixed ICU population). The estimated odds ratio (OR) from the posterior distribution for each study is shown as a circle, with 95% HDI represented by horizontal lines. The observed OR are given by crosses. The pooled OR estimates (and 95% HDI) are also displayed as the last row for each patient population, and the overall pooled estimate for all studies is displayed as the last row. An OR < 1 means that the intervention is superior

In a subgroup analysis on the primary outcome, we compared the results of studies which had a specified protocol for weaning and studies that did not specify a protocol for weaning. For the protocolised weaning studies, the pooled OR for hospital mortality was 0.59 (95% HDI 0.06, 1.25). For the studies that had no protocol for the weaning, the pooled OR for hospital mortality was 0.60 (95% HDI 0.23, 1.01) (Fig. 4).

Fig. 4
figure 4

Forest plot comparing hospital mortality rates for NIV and invasive weaning, by studies that used protocolised weaning and those that did not. The estimated odds ratio (OR) from the posterior distribution for each study is shown as a circle, with 95% HDI represented by horizontal lines. The observed OR are given by crosses. The pooled OR estimates (and 95% HDI) are also displayed as the last row for each patient population. An OR < 1 means that the intervention is superior

Other mortality outcomes were infrequently reported, such that meta-analyses included only three 30-day studies [13, 36, 37], three 60-day studies [34, 35, 39] and three 90-day studies [13, 19, 38]. In the meta-analyses, the pooled OR for 30-day, 60-day and 90-day mortality was lower in NIV group, but the 95% HDI crossed one for each of these analyses (Supplementary Fig. 5 in ESM). Only one study reported on survival at 180 days and did not find a survival benefit with NIV weaning [13].

Health-related quality of life was reported in only one study and was reported to be similar between NIV weaning and invasive weaning patients [11].

Duration of invasive ventilation was reported to be reduced by NIV weaning in all 19 studies (1171 patients) [13, 19,20,21,22,23,24, 28,29,30,31,32, 34, 36,37,38,39,40,41]. Overall NIV weaning reduced the duration of invasive ventilation duration in patients (SMD − 1.34, 95% HDI − 1.92 to − 0.77). In the subgroup analysis, the duration of invasive ventilation was reduced in both COPD patients (SMD − 1.63, 95% HDI − 2.52 to − 0.80) and the mixed ICU population (SMD − 0.78, 95% HDI − 1.44 to − 0.08) (Supplementary Fig. 6 in ESM).

ICU and hospital LOS were reported in 17 (1094 patients) [13, 19, 21,22,23,24,25, 28, 29, 31,32,33, 36,37,38,39,40,41] and 9 (780 patients) [13, 19, 20, 22, 23, 30, 31, 38, 40] studies, respectively. One study could not be included in the meta-analysis for ICU LOS due to data discrepancies within the paper [42]. Overall NIV weaning reduced LOS on ICU compared to invasive weaning (SMD − 0.70, 95% HDI − 0.94 to − 0.46). In the subgroup analysis, all 11 studies in COPD patients reported reduction in ICU LOS with the use of NIV weaning (SMD − 0.84 days, 95% HDI − 1.10 to − 0.56) (Supplementary Fig. 7 in ESM). Six studies reported reduced ICU LOS but the reduction was smaller (SMD − 0.47, 95% HDI − 0.98 to 0.03) and HDIs were wide in the mixed ICU population. There was also a reduction in hospital LOS overall (SMD − 0.76, 95% HDI − 1.60 to 0.03) which was less striking and the overall pooled estimates did not suggest a reduction in LOS in COPD or the mixed ICU population (Supplementary Fig. 8 in ESM).

Weaning failure was reported in 13 studies [13, 19, 24, 27,28,29, 31, 33, 34, 37, 38, 41, 43]. Marked heterogeneity across studies in the way that weaning failure was defined precluded meta-analysis. The reported OR in 11 studies favoured non-invasive ventilation, although the 95% confidence interval in nine of these studies transected one (Supplementary Fig. 9 in ESM).

For adverse events, we were able to compare VAP which was clearly defined and reported by 14 studies (722 patients) [20,21,22,23,24, 28, 30, 31, 33,34,35,36, 41, 43]. Reported rates of VAP were lower in patients that received NIV weaning across all studies and pooled estimates suggested that NIV lowered VAP rates in COPD and mixed ICU patients (Supplementary Fig. 10 in ESM). There were insufficient data to compare other reported adverse events.

Sensitivity analyses

For the majority of outcomes, there were little differences observed in the results of the meta-analyses from using different priors for the population standard deviation or precision (Supplementary Table 2 in ESM). There were convergence issues in the MCMC algorithms when gamma priors were used for the population precision for the 30-, 60- and 90-day mortality, most likely due to the small number of studies for these measures. More precise priors were also explored for the population mean, but these had little impact on the results (results not shown).

A sensitivity analysis of hospital mortality using results from the nine studies (788 patients) that used a failed SBT as a requirement for study entry found there was no evidence that use of NIV weaning led to a difference in hospital mortality but the wide HDI precluded a definitive statement of the effect on this outcome (pooled OR 0.84, 95% HDI 0.31–1.37) (Supplementary Fig. 11 in ESM).

Discussion

This systematic review included 25 studies with 1609 patients that compared NIV weaning to invasive weaning in a randomised controlled trial. Our systematic review included the largest randomised controlled study conducted to date that addressed this research question. We were able to confirm that mortality at hospital discharge was lower in the NIV weaning group compared to the invasive weaning group. In addition, NIV weaning reduced both the duration of invasive ventilation and ICU LOS. To add to evidence in this area, our results also demonstrated that NIV led to substantially lower mortality at hospital discharge in patients with COPD but there was considerable uncertainty about the effects in the mixed ICU population. In addition, NIV weaning reduced both the duration of invasive ventilation and ICU LOS. In contrast to a previous systematic review, we did not find any difference in survival between patients that received protocolised versus non-protocolised weaning [44].

Our results showed that NIV led to substantially lower hospital mortality in patients with COPD but there was considerable uncertainty about the effects in the mixed ICU population. Compared to patients without COPD, COPD patients are more prone to acute exacerbations and respiratory infections [45, 46]. Prolonged weaning, prolonged mechanical ventilation and tracheotomy use are more common in patients with COPD and even more common in COPD patients with respiratory failure [47]. Since the most common reported causes of death in critically ill patients were refractory multi-organ failure and non-respiratory organ dysfunction, we suggest that the large survival effect seen in COPD patients may reflect the importance of reducing the duration of ventilation and lowering the risk of secondary complications in those with COPD [46, 48, 49]. It is also possible that the magnitude of the effect was magnified by the fact that studies that exclusively recruited COPD patients were mostly single-centred, involved fewer patients and were at increased risk of bias.

A potential limitation of previous reviews was the pooling of studies that used and did not use (or did not report the use of) protocolised weaning strategies. This is important as the use of weaning protocols per se is associated with reduced duration of mechanical ventilation and ICU LOS. In contrast to a previous systematic review, separate subgroup analysis examining protocolised versus non-protocolised weaning (or not stated) did not find additional survival benefit in patients who were weaning using a protocol [50]. However, description of protocol and their adherence were not always well described (Table 1). The point estimates for the odds ratio are similar for studies which used protocolised weaning and those that did not. This finding suggests that the effects of non-invasive weaning are not limited to whether weaning protocols are used.

No long-term survival benefit of NIV was demonstrated at 30, 60 or 90 days but firm conclusions could not be drawn from a limited number of studies. NIV weaning also reduced the duration of invasive ventilation and LOS in intensive care in COPD and the mixed ICU populations. Our findings are in agreement with contemporary literature in that the benefits of NIV weaning may stem from avoiding the injurious effects of prolonged invasive ventilation such as VAP, a complication that can often lengthen intensive care stay [51, 52]. It might also explain the survival benefit seen in COPD patients in whom such injurious effects may have the biggest impact. NIV has been used extensively to treat acute respiratory failure in patients with exacerbation of COPD. The beneficial effects of improved gas exchange reduce the work of respiratory muscles in respiratory failure and also facilitate resting of respiratory muscle, which may prove to be crucial to patient outcomes in this cohort [53]. In contrast to the previous Cochrane review, our results did not find an overall reduction in hospital LOS. As a result of insufficient data, our review was unable to draw any inferences on the impact of NIV weaning on weaning failure and health-related quality of life.

Bayesian methods have previously been used to perform meta-analyses for clinical trials to overcome the limitations of traditional meta-analysis, such as accounting for missing data, a small number of studies, sparse event data, uncertainty in the unknown model parameters, incorporating external information and handling complex models (such as those which include covariates) [54, 55]. We employed the Bayesian approach for its flexibility and ability to model a small number of studies and account for uncertainty in model parameters.

The main limitation of our systematic review is the variable quality of included studies. As most of the studies were not on a clinical trials register and did not have published protocols, it was not possible to assess reporting bias. There was often a lack of detailed description of the intervention and process of weaning and heterogeneity between studies in the patients included. Other valuable information such as sedation guidelines and weaning protocols were not available. Inconsistencies and variation in the reporting of adverse events precluded further analyses of other complications including weaning failure in the patient cohort. The need for core outcome sets in mechanical ventilation studies has been addressed by Core Outcomes in Ventilation Trials (COVenT) but the lack of consistent use of defined adverse events will continue to impact on the ability of future systematic reviews to reliably pool adverse event data [56, 57].

Despite evidence that critical care survivors can suffer from poor physical and psychological outcomes and long-term impact on quality of life [58, 59], only one study reported on long-term outcomes [13]. The study did not find any significant difference in the patient-centred outcome of health-related quality of life at 90 and 180 days but was not powered to do so [13]. Prevention of long-term sequelae and supporting patients post critical care discharge remains a challenge and it is important that future critical care research examines patient-centred long-term outcomes [60].

It remains difficult to combine the results of nine studies that used a pulmonary infective control (PIC) window instead of an SBT to determine whether the patients are ready to be extubated. The PIC window uses clinical signs to determine whether the patient has recovered from pulmonary infection after receiving invasive ventilation and adequate antibiotics for 6–7 days. These include a significant decrease in infectious infiltrations demonstrated by chest radiograph; significantly decreased quantity and viscosity of sputum; normal temperature and normalising of leukocyte count. Whilst popular in China, the ability to breathe spontaneously with reduced ventilator support is not tested and the clinical efficacy of the PIC window as an assessment of readiness to wean remains unproven [61]. Results from these studies may not be generalisable to intensive care populations in other countries.

Conclusions

Our review demonstrates that the use of NIV in weaning from mechanical ventilation, compared with ongoing invasive ventilation, reduces hospital mortality, the incidence of VAP and length of ICU stay, particularly in patients with COPD. On this basis, extubation to NIV may be a reasonable clinical strategy in patients that fail an SBT, particularly in patients with COPD.