1 Introduction

High-flow nasal cannula (HFNC) is a form of non-invasive respiratory support able to deliver humidified and heated oxygen at up to 80 L per minute, compared to traditional low-flow nasal cannulas at up to 15 L per minute. It was developed to provide high FiO2 to patients with non-hypercapnoeic hypoxemic respiratory failure. HFNC’s implementation for acute hypoxemic respiratory failure has been widely promoted during the COVID-19 pandemic to mitigate the need for invasive mechanical ventilation. Reported benefits of HFNC include improved ventilation and oxygen delivery, better removal of secretions and decreased work of breathing [1]. Clinical applications of HFNC in the context of acute hypoxemic respiratory failure include pneumonia, acute respiratory distress syndrome (ARDS) and acute cardiogenic pulmonary oedema [2].

However, randomized trials demonstrate inconsistent clinical outcomes such as intubation rates and mortality benefit. While some meta-analyses of trials report a reduction in the need for escalation of respiratory support with HFNC use, particularly intubation rates, compared to conventional oxygen therapy [3], other studies report no associated benefits for intubation, mortality or length of stay [4]. Overall, HFNC has been met with variable success. Ait Hamou et al. reported an association between HFNC failure (defined as requiring escalation to invasive mechanical ventilation) and increased intensive care unit (ICU) mortality [5]. Kang et al. reported an association between delayed intubation following HFNC use and increased mortality rates and poor clinical outcomes [6].

Differing practices regarding indications and thresholds for HFNC initiation could possibly account for the inconsistent clinical outcomes. As an example, an observational study surveying ICU practices found HFNC initiated in various clinical contexts (e.g., pneumonia, post-extubation support) using different clinical parameters [7]. By extension, these variable clinical practices are likely reflected in the designing of randomized controlled trials (RCTs) comparing HFNC with other forms of respiratory support. We hypothesize that HFNC’s variable success is in part due to variable stopping criteria in RCTs, which would lead to differing practices regarding indications and thresholds for HFNC use. We therefore aim to review these stopping criteria and investigate if these have any associations with HFNC outcomes.

2 Methods

The protocol for this systematic review was registered on PROSPERO (ID: CRD42024519794). It is available in full on the https://www.crd.york.ac.uk/prospero/display_record.php?RecordID=519794.

2.1 Search strategy

We based our review on the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) statement [8] (Supplementary 1), and searched two databases (PubMed and EMBASE) for all English-language papers published from January 1, 2007, to December 31, 2022, using the following search terms: (("high-flow" OR "high flow") AND ("nasal" OR "oxygen" OR "therapy")) AND ("acute respiratory failure" OR "respiratory failure" OR "respiratory insufficiency" OR "acute respiratory distress" OR "hypoxemia" OR "hypoxaemia") AND ("Randomized Controlled Trial"[pt] OR "Controlled Clinical Trial"[pt] OR "Clinical Trials as Topic"[mh] OR randomized[tiab] OR placebo[tiab] OR randomly[tiab] OR trial[tiab] OR groups[tiab]) NOT (Animals [mh] NOT Humans [mh]).

2.2 Inclusion and exclusion

Both authors performed independent screening and selection of RCTs comparing the use of HFNC with conventional oxygen therapy/non-invasive ventilation/continuous positive airway pressure in patients with acute hypoxemic respiratory failure. Any disagreements were resolved by consensus. Decisions on inclusion were recorded on Microsoft Excel (Redmont, WA) and included in the PRISMA flow chart (Fig. 1).

Fig. 1
figure 1

PRISMA flow diagram. COPD chronic obstructive pulmonary disease, HFNC High-flow nasal cannula, RCT randomized controlled trial

2.3 Data analysis

Data extracted included trial design, study design, clinical setting, patient population, comparators, number of participants, number immunocompromised/with COVID-19/Do-Not-Resuscitate, mean age and PaO2/FiO2 (P/F) ratio, cause of hypoxemic respiratory failure, study inclusion and exclusion criteria, starting criteria, success and failure criteria, HFNC protocol, effect on need for intubation, ventilator-free days (VFDs), length of stay, mortality, desaturation and patient comfort (Tables 1, 2). The quality of included studies was assessed using the Cochrane risk of bias tool (Table 3). Preplanned subgroup analysis was done based on variability of trial stopping criteria, which was analyzed qualitatively.

Table 1 Characteristics of included studies
Table 2 Patient characteristics of included studies. SD standard deviation
Table 3 Risk of bias of included studies

Most RCTs defined HFNC treatment failure as a need for escalation of therapy to intubation and mechanical ventilation, utilizing different thresholds for escalation of oxygen therapy (e.g., vital signs, physical examination findings). In this study, we focused on respiratory rate as an objective threshold for escalation of therapy. Respiratory rate is an important objective measure for clinical severity, holding diagnostic and prognostic significance, particularly in critically ill patients. Garrido et al. found that increases in respiratory rate were significantly associated with ICU admissions, and could guide management and prevent morbidity and mortality by predicting clinical deterioration necessitating intervention [9]. Additionally, respiratory rate is used in the CURB-65 Score for Pneumonia Severity to risk stratify community-acquired pneumonia [10]. External validation of the CURB-65 demonstrated its ability to predict mortality, need for ICU admission [11] and need for intubation [12].

Subgroup analysis was conducted based on the trial failure criteria (categorical variable) used by the included studies. These criteria were designated as 'subjective criteria' (i.e., did not indicate a cut-off respiratory rate in their failure criteria), 'objective criteria with a low threshold to intubate' (cut-off respiratory rate < 35), and 'objective criteria with a high threshold to intubate' (respiratory rate ≥ 35). A respiratory rate of 35 was used as the threshold as several RCTs had used this as a marker of worsening respiratory failure for early termination [13, 14]. The studies were eventually categorized into 'high threshold' and 'low threshold or subjective'.

2.4 Outcome measures

Clinical outcomes of interest were intubation benefits (Table 4) (measured by number of patients intubated and prevalence of intubation by the end of the trial) and mortality benefit (measured by mortality rate) (Table 5). Effect measures extracted included odds ratio (OR), relative risk (RR) and hazard ratio (HR). Intubation benefits were recorded as categorical variables (designated as either 'none', ‘benefit’ and 'harm'). 'Benefit' was defined as RCTs demonstrating statistically significant reduction in intubation/ventilation/mortality rates or longer VFD, and the converse was true for 'harm'. Studies that reported no significant relationship between mode of oxygen therapy and the relevant clinical outcomes or with unknown/unreported outcomes were labeled as 'none'. Studies designated 'none' or 'harm' were ultimately categorized as 'no benefit'. Since both failure criteria and clinical outcomes were binary categorical variables, Fisher’s exact test was performed to determine the statistical significance, following a 5% level of significance (i.e., results were considered statistically significant if P < 0.05). Subgroup analysis based on the type of non-HFNC comparator used was conducted (Tables 6, 7 and 8

Table 4 Intubation benefit (i.e., prevention of escalation to intubation) and failure criteria of included studies. RR relative risk, OR odds ratio, HR hazard ratio
Table 5 Mortality benefit (i.e., prevention of escalation to non-invasive ventilation, or greater ventilator free days) and failure criteria of included studies
Table 6 Fisher exact test result for intubation benefit
Table 7 Fisher exact test result for intubation benefit in trials with COT comparator
Table 8 Fisher exact test result for intubation benefit in trials with NIV or CPAP comparator

3 Results

3.1 Study selection

Of 1566 articles, 22 RCTs were included for review (Fig. 1) [13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34]. RCTs concerning HFNC for treatment of acute hypoxaemic respiratory failure in an adult population were included if they reported any of the outcomes of interest (i.e., intubation rates, mortality, length of stay) and included a non-HFNC comparator (e.g., conventional oxygen therapy (COT), non-invasive ventilation (NIV) and continuous positive airway pressure (CPAP)). Non-RCTs (i.e., case reports, case series, observational, retrospective and cohort studies, and reviews), as well as RCTs concerning HFNC indicated for reasons other than acute hypoxemic respiratory failure (e.g., hypercapnoeic respiratory failure, perioperative and periprocedural support, pre-intubation and post-extubation respiratory support) were excluded from the review.

3.2 Study characteristics

Among the included studies, 12 studies (54.5%) were conducted in an ICU setting, followed by 7 (31.5%) in an Emergency Department setting (Table 1). Non-HFNC comparators used included COT, NIV and CPAP (Tables 1, 2).

3.3 Study analysis

Of the 22 RCTs included, 5 (22.7%) had high threshold failure criteria, while 17 (77.3%) did not have a high threshold failure criterion (of which 2 (9.1%) were objective with a low respiratory rate threshold, and 15 (68.2%) had subjective criteria). 4 trials (18.2%) reported significant intubation reduction. The presence of objective failure criteria with a prespecified high respiratory rate was found to have a significant effect on intubation rate reduction (P = 0.024). Subgroup analysis by non-HFNC comparator showed non-significant results with the same direction of benefit: COT comparators (P = 0.055) (Table 7); NIV and CPAP comparators (P = 0.378) (Table 8). The study by Frat et al. was excluded from this subgroup analysis due to the presence of multiple treatment arms in the study design. In addition, a secondary analysis excluding studies on COVID-19 patients showed non-significant results with the same direction of benefit (P = 0.097) (Table 9).

Table 9 Fisher exact test result for intubation benefit in trials excluding patients with COVID-19

While the study by Frat et al. reported a significant reduction in mortality when using HFNC, all 21 (95.5%) other RCTs did not report any significant mortality benefits associated with HFNC use. Thus, mortality was not included in the statistical testing.

4 Discussion

When HFNC is applied as a means for acute hypoxemic respiratory failure support, having objective trial failure criteria with a high respiratory rate threshold for intubation confers significant benefits in intubation reduction, but without any significant reduction in overall mortality, as compared to subjective or less stringent trial failure criteria, which were neither associated with intubation or mortality benefit.

An objective high respiratory rate threshold may serve as a double-edged sword, resulting in no net mortality benefit despite reducing the risk of intubation. In this review, high thresholds were associated with a reduction of intubation without increasing mortality. We hypothesize that among patients given HFNC who were eventually not intubated, the avoidance of intubation led to better clinical outcomes. Higher intubation thresholds allow the benefits of HFNC to be actualized, and serve as a resource-saving measure, especially in the ICU setting. Conversely, among patients who are eventually intubated, delays in intubation can lead to worse outcomes. This was demonstrated in the trial by Andino et al., which reported intubation benefits with HFNC using objective failure criteria. Despite identical intubation criteria, intubation occurred significantly later in patients treated with HFNC as compared to COT, leading to a lack of an improvement trend in P/F ratio from the first 8 h among HFNC-treated patients who were eventually intubated [27]. Prospective cohort studies (e.g., Kang et al.) have similarly observed an association between delayed intubation post-HFNC and increased mortality [6]. Gonzales et al. also found that increased time to intubation from first respiratory support (HFNC and NIV) was associated with an increased risk of in-hospital mortality and worsened diffusion capacity for carbon monoxide (DLCO) during subsequent follow-up of surviving patient [35]. The proposed mechanism for this is patient self-induced lung injury, where spontaneous vigorous inspiratory effort results in high transpulmonary pressures that damage the lungs [36]. By extension, sedation and endotracheal intubation minimizes the progression of lung injury [37]. However, this was not consistently observed across all the included RCTs. For instance, Elagamy et al. reported intubation benefits with HFNC using objective failure criteria, but no significant difference (P = 0.785) in time to intubation was noted between HFNC-treated patients (51.17 ± 14.94 h) and NIV-treated patients (52.18 ± 6.67 h) [30].

This review’s data suggest that to optimize clinical outcomes for patients on HFNC, clinicians must reconcile adherence to a high respiratory rate threshold for intubation and not delaying necessary escalation to invasive mechanical ventilation. This requires identification of patients at risk of poor outcomes following delayed intubation. In a propensity score analysis of the RISC-19-ICU cohort of critically-ill COVID-19 patients, the authors suggested a closely monitored trial of HFNC therapy, followed by rapid intubation in patients with a high risk of failure and mortality (identified with prognostic scores) [38]. Since this did not account for prior respiratory support initiation in general wards, which has proven to be feasible [39], Gonzalez et al. suggested intubation within the first 48 h from the first respiratory support (e.g., HFNC) [35].

More recently, the respiratory oxygenation (ROX) index, which incorporates respiratory rate, SpO2 and FiO2, was shown to predict the need for escalation of respiratory support to intubation post-HFNC [40]. Patients started on HFNC were reassessed at 2-, 6- and 12-h intervals after HFNC initiation, and ROX scores ≥ 4.88 correlated with lower intubation risk. Conversely, lower ROX scores (< 2.85, < 3.47, and < 3.85 at 2, 6, and 12 h respectively) consistently predicted HFNC failure. However, the ROX index requires measurement of SpO2, which may differ from SaO2, particularly in patients with dark skin pigmentation [41]. The respiratory rate itself could be more objective than the ROX index as a stopping criterion for HFNC, and both could be prospectively compared in future HFNC trials.

4.1 Strengths and limitations

The strength of this review is the inclusion of trials that report both HFNC benefits and no benefits across various clinical outcomes. At the same time, we included RCTs that studied patient populations with acute hypoxemic respiratory failure secondary to COVID-19, which is one of the main areas of interest for HFNC use. While COVID-19 associated ARDS has been shown to be substantially similar to non-COVID associated ARDS in terms of alveolar damage and mechanisms of hypoxaemia [42], a secondary analysis of trials not involving COVID-19 patients showed the effect of objective failure criteria was not significant. This is likely due to a lack of sufficient studies. Respiratory mechanics, response to treatment and prognosis are similar in both COVID-19 and non-COVID-19 associated ARDS [43, 44], and thus they were included in the main analysis. A limitation of the study is that non-English databases were not studied, however PubMed and EMBASE contained most of the important HFNC trials. Due to the lack of consistently reported continuous data in the included trials, other parameters such as P/F ratio, SF ratio and PaCO2 could not be included in our analysis. However, there is merit to examining these clinical outcomes in future studies. Regression analysis was considered, but was not feasible as the failure thresholds analysed were discrete criteria as opposed to continuous variables, as well as due to a lack of sufficient trials. While the time to intubation will be a useful clinical outcome to ascertain the association between delays to intubation post-HFNC and poor clinical outcomes, this was not consistently reported across the included RCTs, albeit an interesting outcome to evaluate in future. The effect of objective failure criteria was not significant when subgroup analysis was performed by type of non-HFNC comparator and is likely due to lack of sufficient studies. While high respiratory rate thresholds might be associated with a reduction of intubation, this result may be limited by the heterogeneity of the included studies and a lack of consistency in the data reported. As such, further RCTs are required to ascertain this association.

5 Conclusion

In order to actualise the benefits of HFNC, a balance must be struck between adhering to high thresholds for intubation and avoiding unnecessary delays to escalation of respiratory support. The respiratory rate is an important objective threshold to reduce intubation rates without increasing mortality, and could potentially be used as a simple but objective stopping criterion for HFNC.