Background

Due to the complexity and variability of conditions, patient life is sustained in Intensive Care Units (ICUs) using mechanical ventilation. Approximately 39.0%–72.0% of ICU patients require invasive mechanical ventilation, and reintubation after mechanical ventilation is considered a potentially adverse event [1,2,3,4]. Previous studies have shown that 10%–20% of extubated patients require reintubation, and that 20–30% have a high risk of extubation failure [4,5,6]. Reintubation is associated with a longer hospital stays, poor prognosis, higher healthcare costs, and more complications [6,7,8].

To reduce reintubation, a prophylactic high-flow nasal cannula (HFNC) and non-invasive mechanical ventilation (NIV) after extubation are used to improve oxygenation in patients at high-risk of extubation failure. HFNC delivers a heated, humidified, and adjustable air–oxygen mixture through a large-caliber nasal cannula. NIV assists the patient’s breathing by applying different levels of positive pressure to the airway through an oral or nasal mask, without endotracheal intubation or laryngeal mask airway insertion. Nava et al. [9] and Ferrer et al. [10] demonstrated that NIV helped to reduce respiratory failure, reintubation, and mortality after extubation in high-risk patients. However, NIV can cause many side effects, such as lung damage, gastric distension, mask discomfort, and even claustrophobia, conditions that are difficult for patients endure [11, 12]. Numerous studies have shown that HFNC is not inferior to NIV in preventing the reintubation of high-risk patients. In addition, HFNC was better tolerated, more comfortable, and resulted in fewer adverse events than NIV [13,14,15]. The international clinical practice guidelines indicate that when compared with HFNC, NIV is more effective in preventing reintubation. However, HFNC does not lead to adverse events among high-risk patients [16].

Therefore, we conducted a systematic review and meta-analysis based on randomized-controlled trials (RCTs) to evaluate the role of HFNC in preventing reintubation, mortality, and adverse events as well as improving respiratory function, and shortening ICU length of stay (LOS).

Methods

The Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) [17] checklist was used to report the systematic review and meta-analysis (Materials Additional file 1), and we registered the study with PROSPERO (CRD42022311969).

Search strategy

Two investigators (W-Q and P-Y) independently searched the MEDLINE, Cochrane Library, EMBASE, CINAHL Complete, Web of Science, China National Knowledge Infrastructure, Wan-Fang database, and Chinese Biological Medical Database for relevant studies. The retrieval dates ranged from the establishment of the database until February 28, 2022. The retrieval strategy combined non-invasive ventilation (‘niv’ or ‘nippv’ or ‘noninvasive ventilation’ or ‘non invasive positive pressure ventilation’ or ‘noninvasive positive pressure ventilation’ or ‘non invasive ventilation’) with high-flow nasal cannula(‘hhfnc’ or ‘hhfn’ or ‘hfnc’ or ‘hfnct’ or ‘high flow nasal cannula’ or ‘high flow nasal cannula oxygen therapy’ or ‘nasal high flow oxygen therapy’ or ‘high flow nasal cannulae’ or ‘humidified high flow nasal cannula therapy’). See Material Additional file 2, for example, Medline search.

Study selection

The following inclusion criteria were used: (1) patients at high risk of extubation failure who fulfilled at least one of the following [9, 10, 13, 16, 18,19,20]: age older than 65 years, underlying cardiac or respiratory disease, mechanical ventilation time > 7 days, airway patency problems including ineffective cough or excessive tracheobronchial secretion, an Acute Physiology and Chronic Health Evaluation II (APACHE II) score > 12 on extubation day, body mass index (BMI) > 30 kg/m2, and 2 or more comorbidities; (2) HFNC and NIV were used as interventions; (3) outcomes including, but not limited to, reintubation, mortality, LOS, the incidence of adverse events (abdominal distension, aspiration, facial injury, delirium, pulmonary adverse events, and intolerance), and respiratory function indices (partial pressure of carbon dioxide [PaCO2], oxygenation index [PaO2/FiO2, P/F], respiratory rate [Rr]); (4) RCTs as the study type; (5) and studies either in Chinese or English language.

The exclusion criteria included insufficient information, incomplete data; age < 18 years; repeated publications; and high-risk bias literature (Grade C).

Data extraction and quality assessment

Data were independently extracted by two researchers (W-Q and X-S), and any discrepancies were resolved by a third researcher (P-Y). The following data were extracted from the selected studies: first author, year of publication, country, disease type, sample sizes, gender, age, intervention measures, reintubation, mortality, LOS, adverse events, and respiratory function-related indices. The quality of each RCT that satisfied the criteria was examined using the Cochrane bias risk assessment tool (Cochrane 5.1.0) [21]. The bias risk was classified into three categories: Grade A, items are all low risk of bias; Grade C, items are all high risk of bias; and Grade B, items are not all high or low risk of bias. In addition, two researchers independently assessed the credibility of the pooled results using GRADE guidelines. The results of the meta-analysis were assessed using five downgrades and three upgrades, and the credibility of the results was classified as high, moderate, low, or very low.

Outcome measures

The primary outcome was reintubation. The secondary outcomes were mortality, LOS, the incidence of adverse events (abdominal distension, aspiration, facial injury, delirium, pulmonary adverse events, and intolerance), PaCO2, P/F, and Rr.

Statistical analysis

RevMan5.4.1 software was used to create forest maps and merge data. Dichotomous data, such as reintubation, mortality, and incidence of adverse events, are expressed as a risk ratios (RR) and 95% confidence intervals (CI). Continuous data, such as LOS and respiratory function-related indices, are expressed as a mean difference (MD) and 95% CI. A P < 0.05 was considered statistically significant.

We used the Cochran’s Q test and I2 test statistics to test the heterogeneity of the studies. A P > 0.05 and I2 < 50%, indicated low heterogeneity, and a fixed-effects model (FD) was used. A P < 0.05 or I2 > 50% indicated high, and a random-effects model (RD) was used. Subgroup analysis of each outcome was carried out by language (English versus Chinese), extraction method (conventional versus non-conventional), NIV parameter settings (fixed expiratory positive airway pressure [EPAP] and inspiratory positive airway pressure [IPAP] versus non-fixed), and HFNC flow rate (fixed versus non-fixed) for comparison between HFNC and NIV. At the same time, we performed sensitivity analysis by removing one study at a time to ensure the stability of the results. Funnel plots were used applied to detect publication bias in more than ten trials.

Results

Study selection and characteristics

After removing the duplicate articles, a total of 4787 of 6282 articles were available for analysis. Of the total, 97 articles were selected for full-text review after reading the titles and abstracts of the publications, and finally, 13 RCTs [13, 15, 22,23,24,25,26,27,28,29,30,31,32], including 1457 patients, were eventually included. The screening process is shown in Fig. 1. Of the 13 RCTs included, 6 were published in English [13, 15, 27, 30,31,32] and 7 in Chinese [22,23,24,25,26, 28, 29]. The included study objects included chronic obstructive pulmonary disease (COPD) [15, 23,24,25,26,27,28, 32], APACHEII > 12 [30], elderly patients (> 65) with sepsis [31], patients at risk of extubation failure [13], acute pancreatitis with respiratory distress syndrome [29], and hypoxemia after cardiac surgery [22]. The mean age in 9 of the RCTs was > 65 years [15, 23, 24, 26, 27, 30,31,32], and 5 of the RCTs < 65 years [13, 22, 25, 28, 29]. One RCT was identified as Grade A, and 12 RCTs were identified as Grade B (Table 1). Nine of the RCTs [13, 22,23,24, 27,28,29,30,31] were on routine standard extubation, and four [15, 25, 26, 32] were on unconventional extubation following the pulmonary infection control window (PIC).

Fig. 1
figure 1

The PRISMA flow diagram of selected studies. WOS: Web of Science, CNKI: China National Knowledge Infrastructure, CBM: Chinese Biological Medical Database

Table 1 Basic characteristics of the study were included

The 13 included RCTs were assessed for risk of bias using the Cochrane risk assessment tool. Of the 13 total RCTs, 5 used computer number generators, 8 used the random table to ensure adequate random sequence generation, 6 described allocation concealment, 1 adopted the single-blind method, 5 did not adopt the blind method, and 7 did not know whether to apply the blind method. All the literature results and data were reported thoroughly. One RCT reported research findings selectively, and other sources of bias in the remaining 12 RCTs were unclear. Figure 2 shows a summary of the risk of bias. Figure 3 shows the risk map of bias. Blinding participants, interveners, and outcome measures in studies are challenging, and therefore, we classified most studies as having a high risk of bias, because they had not been blinded.

Fig. 2
figure 2

Risk of bias graph: each risk of bias item presented as percentages across studies

Fig. 3
figure 3

Risk of bias summary: judgements about each risk of bias item for each study. Green circles indicate a low risk of bias, yellow circles indicate an ambiguous risk of bias, and red circles indicate a high risk of bias

Efficacy of HFNC

Reintubation

The effect of HFNC versus NIV on the reintubation rate of patients at high risk for extubation failure was described in the 13 included RCTs. We found low heterogeneity across the studies, and the FD was performed. The meta-analysis revealed no significant difference in reintubation rates between the HFNC and NIV groups (n = 1457, I2 = 0%, RR = 1.10, 95%CI = 0.87–1.40, P = 0.42) Fig. 4.

Fig. 4
figure 4

Forest plots for reintubation between high-flow nasal cannula and non-invasive ventilation

Mortality

Nine RCTs described the effect of HFNC versus NIV on the mortality of patients at high risk of extubation failure. Low heterogeneity was found across the studies, and FD was performed. The meta-analysis revealed no significant differences in mortality between the HFNC and NIV groups (n = 1084, I2 = 0%, RR = 1.09, 95%CI = 0.82–1.42, P = 0.54) Fig. 5.

Fig. 5
figure 5

Forest plots for mortality between high-flow nasal cannula and non-invasive ventilation

ICU length of stay

The effect of HFNC versus NIV on the LOS of patients at a high risk of extubation failure was described in 9 of the 13 included RCTs. As high heterogeneity was found across the studies, and RD was performed. As a result, we found that HFNC had a significant advantage over NIV (n = 1032, I2 = 93%, RR = − 1.03, 95%CI = − 1.86–0.20, P = 0.02) Fig. 6.

Fig. 6
figure 6

Length of ICU stay forest plot between high-flow nasal cannula and non-invasive ventilation

Incidence of adverse events

In 11 of the 13 included RCTs, the effect of HFNC versus NIV on the incidence of adverse events, including abdominal distension, aspiration, facial injury, delirium, pulmonary complications, and intolerance in patients at a high risk of extubation failure, was described. We found low heterogeneity across the studies, and FD was performed. A meta-analysis revealed that HFNC presented clear advantages over NIV in abdominal distension, facial injury, pulmonary complications, and intolerance (abdominal distension: n = 315, I2 = 0%, RR 0.09, 95% CI 0.04–0.24, P < 0.01; facial injury: n = 309, I2 = 0%, RR = 0.27, 95% CI 0.09–0.88, P = 0.03; pulmonary complications: n = 764, I2 = 0%, RR 0.67, 95% CI 0.46–0.09, P = 0.05; intolerance: n = 147, I2 = 0%, RR 0.22, 95% CI 0.08–0.57, P < 0.01). No significant differences in aspiration or delirium were found between the HFNC and NIV groups (aspiration: n = 199, I2 = 0%, RR 0.30, 95% CI 0.09–1.07, P = 0.06; delirium: n = 105, I2 = 0%, RR 0.30, 95% CI 0.07–1.39, P = 0.12) Fig. 7.

Fig. 7
figure 7

Forest plots for abdominal distension A, aspiration B, facial injury C, delirium D, pulmonary complications E, and intolerance F

Respiratory function-related indices

In 11 of the 13 included RCTs, the effect of HFNC versus NIV on the PaCO2, P/F, and Rr of patients at high risk of extubation failure was described. Of the total, 11 RCTs described PaCO2, 8 described P/F, and 9 described Rr. High heterogeneity was found in all outcomes, and RD was performed. The results showed that HFNC provided several advantages over NIV (PaCO2: n = 1234, I2 = 81%, MD − 1.31, 95% CI − 2.76–0.13, P = 0.07; P/F: n = 997, I2 = 57%, MD − 2.18, 95% CI − 8.49–4.13, P = 0.50; Rr: n = 585, I2 = 80%, MD − 0.50, 95% CI − 1.88–0.88, P = 0.47) Fig. 8.

Fig. 8
figure 8

Forest plots for partial pressure of carbon dioxide A, oxygenation index B, and respiratory rate C

Subgroup analysis

Subgroup analyses of reintubation, mortality, LOS, adverse events, and respiratory function-related indices by language, extraction method, NIV parameter settings, and HFNC flow rate to compare HFNC versus NIV. Our results showed no significant differences or reductions in heterogeneity regarding reintubation, mortality, abdominal distension, aspiration, and Rr. In LOS, with exception of the Chinese-language group (n = 252, I2 = 96%, RR = − 1.32, 95%CI = − 2.56–− 0.09, P = 0.04), we did find statistically significant differences in the other subgroup analyses (English language: n = 780, I2 = 41%, RR = − 0.55, 95%CI = − 1.34–0.24, P = 0.17; conventional extubation standards: n = 911, I2 = 91%, RR = − 0.62, 95%CI = − 1.37–0.63, P = 0.11; unconventional extubation standards: n = 121, I2 = 88%, RR = − 2.48, 95%CI = − 5.22–0.24, P = 0.08; unfixed NIV setting: n = 128, I2 = 94%, RR = − 1.65, 95%CI = − 3.49—0.18, P = 0.08; fixed NIV setting: n = 904, I2 = 64%, RR = − 0.59, 95%CI = − 1.25–− 0.06, P = 0.08; unfixed HFNC flow rate: n = 225, I2 = 90%, RR = − 1.15, 95%CI = − 2.99–0.70, P = 0.08; fixed HFNC flow rates: n = 807, I2 = 95%, RR = − 0.85, 95%CI = − 1.85–− 0.15, P = 0.10). The P/F in the subgroup analysis of language revealed no significant heterogeneity in the pooled results (English language: n = 780, I2 = 25%, RR = 2.91, 95%CI = − 4.48—10.29, P = 0.44; Chinese-language: n = 217, I2 = 25%, RR = − 1.18, 95%CI = − 12.27–0.47, P = 0.11). However, significant changes in the pooled results for PaCO2 were found in the English-language group, fixed NIV parameter settings, and fixed HFNC flow rate (English language: n = 780, I2 = 0%, RR = − 1.01, 95%CI = − 1.46–− 0.56, P < 0.01; fixed NIV setting: n = 991, I2 = 84%, RR = − 2.19, 95%CI = − 3.89–− 0.48, P = 0.01; fixed HFNC flow rates: n = 807, I2 = 0%, RR = − 1.00, 95%CI = − 1.46–− 0.54, P < 0.01). See Additional file 3 for details.

Sensitivity analysis

Sensitivity analyses were performed by removing one study each for reintubation, mortality, LOS, adverse events, and respiratory function-related indices. Reintubation, mortality, LOS, and adverse events showed no significant changes in the pooled results or heterogeneity. However, heterogeneity was significantly reduced after excluding the study by Yang et al. for respiratory function-related indices (PaCO2: I2 = 40%, P = 0.14; P/F: I2 = 21%, P = 0.60; Rr: I2 = 57%, P = 0.80). See Table 2 for details.

Table 2 Sensitivity analysis on effects of HFNC on respiratory function indices

Assessment of publication bias

A funnel plot analysis was performed for meta-analyses of more than ten studies to assess publication bias. The funnel plots of the reintubation were symmetrical, suggesting a slight publication bias. The funnel plot for PaCO2 showed asymmetry, indicating a risk of publication bias Fig. 9.

Fig. 9
figure 9

Funnel figure Reintubation A and partial pressure of carbon dioxide B

Certainty assessment

The certainty of the comprehensive assessment was evaluated using GRADE guidelines (Table 3). Heterogeneity and the risk of bias were the key factors that decreased the certainty of the evidence for most outcomes. Reintubation and facial injuries exhibited high certainty, abdominal distension, and pulmonary complications exhibited moderate certainty, and LOS exhibited low certainty.

Table 3 Summary of findings based on the GRADE guidelines for HFNC

Discussion

HFNC is a relatively new respiratory support technology that consisting of an air/oxygen blender, an actively heated humidifier, and a nasal cannula delivering up to 60 L/min of heated, humidified oxygen [33]. HFNC not only washes out the anatomical dead space, delivers positive end expiratory pressure (PEEP), maintains constant inspired oxygen concentration (FiO2), and provides sufficient humidification, but also it offers exceptional comfort and good applicability in the clinic [33,34,35]. Furthermore, it has been shown to be superior to traditional oxygen therapy. However, whether HFNC is preferable to NIV remains unclear in some patients, notably those with a high risk of extubation failure. Consequently, we conducted a systematic review and meta-analysis of the efficacy of HFNC therapy in patients at high risk of extubation failure. The findings demonstrated that the reintubation, mortality, and improvement in respiratory function in high-risk patients using HFNC as a preventive intervention were not inferior to those of NIV when the same parameters were compared. Moreover, fewer adverse effects, such as facial injuries and abdominal distension, were observed in patients using HFNC.

We analyzed the data from a total of 13 RCTs and found that HFNC was not inferior to NIV in preventing reintubation in patients at a high risk of extubation failure. This result was consistent with the conclusions of Hernández et al. [13] and Zhou et al. [36] but inconsistent with the recommendations of the European Respiratory Society guidelines published by Oczkowski et al. [16]. A possible reasons for this inconsistency could be the differing inclusion criteria for the interventions. The RCTs in our study excluded combined HFNC and NIV interventions; instead, we included only RCTs incorporating simple HFNC or simple NIV. Therefore, we excluded the publications by Thille et al. [37] who included large-sample multicentre RCT of up to 631 patients with a high risk of extubation failure, accounting for a significant weight in the meta-analysis. As a result, our findings are at odds with the recommendations. For mortality, our results showed no difference between the two groups, which is consistent with the studies by Oczkowski et al. [16] and Chang et al. [38].

Although consistent with the findings of Oczkowski et al. [16] and Hernández et al. [13], we found substantial heterogeneity and low certainty that HFNC would shorten the LOS. In our study, when subgroup analysis was carried out according to language, extraction method, NIV parameter settings, and HFNC flow rate, we discovered that with exception of the Chinese-language group, HFNC still demonstrated an advantage NIV. The English-language group and other subgroup analyses showed no statistical differences between HFNC and NIV. This finding may be related to a higher risk of literature bias in the Chinese-language group. Therefore, further research is necessary to determine whether HFNC therapy has a beneficial effect on the LOS.

Regarding adverse events, we discovered that HFNC reduced the incidence of abdominal distension, facial injury, pulmonary problems, and intolerance. Delirium and the aspiration were not significantly different between the HFNC and NIV groups; however, the HFNC group displayed an optimization trend. This result is in agreement with the findings of Hernández et al. [13] and Stéphan et al. [39].

Moreover, HFNC had the same effect on respiratory function indices as NIV, and improved PaCO2, P/F, and RR, which is consistent with the findings of Hernández et al. [13] and Jing et al. [27]. However, the results of those three indices revealed high heterogeneity. We found heterogeneity of P/F associated with language using subgroup analysis. The results of the Chinese-language group revealed no statistically significant differences between the two interventions, whereas those of the English-language group demonstrated that NIV was superior to HFNC. The reason for this discrepancy might be that the period of HFNC use in the Chinese-language articles was longer than that in the English-language articles. We also found that with respect to PaCO2, HFNC outperformed NIV when the HFNC flow rate was adjusted to the patient's condition; however, when the NIV parameters were adjusted, the two interventions were equivalent. Therefore, the personalized settings for the HFNC flow rate appeared to have a more significant effect on reducing CO2 retention than those of the NIV parameters. In addition, we carried out a sensitivity analysis, and found that the heterogeneity was dramatically reduced, and the pooled effect did not change significantly when the study by Yang et al. [22] was removed. We hypothesize that this finding was related to the study population, which included patients undergoing cardiac surgery, whereas other studies included patients with respiratory diseases. In patients with cardiac disease, increased flow in HFNC produces a degree of PEEP that causes a reduction in inferior vena cava collapse, subsequently reducing cardiac preload [40,41,42]. However, HFNC mainly decreases CO2 rebreathing by washing out the anatomical dead space and providing a steady oxygen concentration, which has an effective respiratory management effect in patients with respiratory diseases, especially COPD [43, 44]. Therefore, we believe that the heterogeneity was probably due to the different mechanisms of action between patients.

To ensure the certainty of the evidence, we used the GRADE system to rate each outcome. We also performed subgroup analyses due to the inclusion of English-language and Chinese-language articles, conventional extubation (satisfying extubation criteria), and unconventional extubation (intentional extubation in the presence of PIC). However, we did not have uniform inclusion criteria for the flow and temperature of HFNC and the inspiratory and expiratory air pressures of NIV. It is worth mentioning that although we performed subgroup analyses to determine whether the NIV parameters or HFNC flow rate changed, we could not perform more accurate subgroup analyses (e.g., comparing both NIV parameter settings and HFNC flow rate change, no change in both NIV parameter settings and HFNC flow rate, change in NIV setting parameters, and change in HFNC flow rate), because we included only 13 publications. Although our subgroup analysis confirmed that the NIV parameter settings and HFNC flow rate did not affect most study outcomes, it is undeniable that this does not completely eliminate the effect of such confounding factors. Therefore, future studies will require more refined settings for NIV parameters and HFNC flow rates. In addition, we only compared how HFNC alone and NIV alone affected high-risk patients of extubation failure with no discussion on whether NIV combined with HFNC is preferable to either one alone. According to Thille et al. [6, 45], HFNC combined with NIV significantly outperformed HFNC alone in reducing reintubation in high-risk patients with extubation failure. Therefore, future studies should focus on the effect of HFNC combined with NIV on patient comfort, durability, and prevention. Finally, the large-sample and multicentre RCT by Hernández et al. [13] holds sizable weight in our meta-analysis, which results in partial reliance on their findings and certain restrictions on our findings.

Currently, it is unclear whether HFNC and NIV are equally effective in preventing reintubation and how to define and diagnose high-risk patients with extubation failure. Consequently, more reliable clinical data are required to validate this finding.

Conclusions

Among adult patients at high risk of extubation failure, HFNC is not inferior to NIV in preventing reintubation, mortality, and respiratory failure, and may shorten LOS, reduce adverse reactions, and increase patient comfort. However, heterogeneity of PaCO2, Rr, and LOS was evident in our results. Thus, a high-quality RCT is required to confirm whether HFNC therapy can provide an advantage in LOS and respiratory function in such patients.