Introduction

Recent recommendations have been published favoring the use of heated humidifiers (HH) during noninvasive ventilation (NIV) [1]. There are few physiologic studies that support these recommendations but no study reporting the impact of the humidification device use during NIV for acute respiratory failure on patient outcome [2]. Although the upper airways are not bypassed, several arguments plead for the humidification of delivered gases during NIV. The inspiratory gases are dry with ICU ventilators, respiratory rates are high, and mouth breathing is frequent during NIV [3, 4]. Mucosal dryness is also a frequent complaint during NIV [5, 6] and a good tolerance of the technique is necessary for its success [7, 8]. Most of all, in patients receiving NIV, bronchial hyperreactivity may be increased by dry medical gases [911].

It was shown in short-term studies that HH can deliver gases with higher water content than heat and moisture exchangers (HME) during NIV, especially in the case of leaks [12]. It was also shown with short-term physiologic studies that HH improved alveolar ventilation and CO2 elimination [13, 14] and reduced the work of breathing [15] when compared with HME. These results are related to the additional instrumental dead space on the circuit with HME that must be placed after the Y-piece on the patient’s side [15]. Interestingly, the lack of CO2 removal during NIV in COPD patients was found to be a major criterion for NIV failure [1620].

Therefore, the combined physiologic data plead for the use of HH during NIV. However, taking into account the economic implications and ergonomic aspects, the widespread use of HH can not be promoted without obtaining clinical studies demonstrating an impact on important outcome data such as the rate of intubation, the ICU length of stay, or the mortality. The purpose of this randomized controlled multicenter study was to test the hypothesis that NIV delivered via ICU ventilators with HH is associated with a reduced rate of intubation in comparison to HME [21].

Methods

The study was conducted in 15 medical, surgical, or mixed ICUs, in university hospitals in France, Tunisia, Italy, and Canada. The participating centers were experienced in the use of NIV, with implementation of this technique on average 8 ± 3 years before the start of the study. The study centers received approval from an ethics committee. Written informed consent was obtained from all participants or their next of kin.

Patients

Inclusion criteria were age between 18 and 85 years, exacerbation of dyspnea for less than 2 weeks, and presence of at least two of the following criteria: respiratory rate above 25 breaths/min, SaO2 below 90 % (ambient air or oxygen), arterial pH less than 7.35. The exclusion criteria were need for immediate intubation, major facial deformity, high probability of a surgical procedure, prior episode of invasive mechanical ventilation of more than 48 h during the same hospital stay, pneumothorax, refusal of intubation by the patient, decision not to resuscitate or not to intubate the patient already known at inclusion, pregnancy, organ failure other than lungs based on absence of organ dysfunctions and/or infection (ODIN) score [22]. Registration of inclusion and exclusion was carried out using data from each site concerning patients requiring NIV during the year of the study.

Study protocol

Patients were prospectively randomized into the HH arm (MR850, Fisher & Paykel) or HME arm (Hygrobac, Tyco Healthcare; dead space 95 ml [23]). Central randomization was conducted by phone with a voice recognition system and stratification was performed on the basis of the study center and on the type of respiratory failure as follows: “respiratory acidosis”, pH less than 7.35 and PaCO2 greater than 45 mmHg at baseline; or “hypoxemia”, SaO2 no greater than 90 % in ambient air or oxygen. If respiratory acidosis was present with hypoxemia, respiratory acidosis was prioritized.

The humidification device allocated at randomization was used during all NIV sessions, until intubation or NIV cessation. All other technical aspects of NIV (including same oronasal mask and the absence of flex tube use) were standardized for each group. Only ICU ventilators were used to deliver NIV. The recommended ventilatory mode was pressure support with the aim of obtaining an expired tidal volume of at least 7 ml/kg. It was also recommended to perform more than 3 h of NIV within the first 6 h following inclusion and more than 6 h during the first 24 h.

Endpoints

The primary endpoint of the study was the intubation rate. Criteria for intubation were standardized and based on previous publications [24] (criteria detailed in the electronic supplementary material).

The secondary endpoints were intubation rate in the hypoxemia and respiratory acidosis groups, physiological parameters (respiratory rate, hemodynamic parameters, arterial blood gas), NIV tolerance (mucosal dryness), the total length of mechanical ventilation, the length of intensive care unit and hospital stay, and intensive care unit and hospital mortality rates.

Data collection

The demographic characteristics (age, sex, BMI, comorbidities), the indication for NIV, the severity scores (SAPS II [25], LODS [26], McCabe [27]), and the physiologic characteristics were recorded at baseline.

Daily collected data were standard respiratory and hemodynamic physiologic data, as well as an assessment of upper respiratory tract mucosal dryness and duration of NIV.

Statistical analysis

The expected rate of intubation for patients undergoing NIV was around 40 % [7, 8]. A sample of 125 patients in each group was thus selected to detect an absolute reduction of 15 % in intubation rate, with a β risk of 0.2 and an α risk of 0.05, with a bilateral test. The analysis was performed on an intention to treat basis. Results are given as median (25th–75th interquartiles). Proportions were compared using the Chi square test or Fisher’s exact test. To compare the continuous variables, the Mann–Whitney U test was used for non-normally distributed variables or a t test was used when appropriate. Survival curves were estimated by the Kaplan–Meier method and compared using log-rank tests. Administrative censoring was performed at day 28. We performed an adjusted analysis with the Cox regression model with treatment arm and stratification group (hypoxemia and hypercapnia) as categorical covariables with interactions evaluation. p values of 0.05 or less were considered significant.

Results

Demographic characteristics and baseline physiologic data

Patients were included between November 2002 and December 2003. During the 13 months of the study, 249 patients were randomized. Two patients were excluded from the analysis, one for inclusion error and one because inclusion criteria were not met. Thus, data from 247 patients were analyzed, 119 in the HH arm and 128 in the HME arm (Fig. 1). Centers included an average of 16.6 ± 13.8 patients during this period, which represents an average of 1.4 inclusions per month per center.

Fig. 1
figure 1

Flow chart of the study with included and non-included patients and stratification subgroups. HH heated humidifiers, HME heat and moisture exchangers, NIV noninvasive ventilation. *Estimation based on the number of patients managed with NIV during the study time in the participating centers

The characteristics of the included patients are depicted in Table 1. Many of the baseline characteristics of the two study arms such as age, severity scores, comorbidities, and indications for NIV were similar. In the hypoxemia subgroup, LODS was significantly greater in the HH arm (Table 1).

Table 1 Main demographic characteristics of included patients and main indication for NIV for the entire population and for both study subgroups

Main arterial blood gases values and physiological characteristics were similar in the two arms at baseline (Table 2). In the hypoxemia subgroup, median PaCO2 was higher in the HME arm and the bicarbonate levels were lower in the HH arm (Table 2).

Table 2 Main physiologic characteristics at inclusion for the entire population and for both study subgroups

Primary endpoint

There was no significant difference in the intubation rate between HH and HME for the entire population (36.9 vs. 29.7 % respectively, p = 0.28, Table 3). Criteria for intubation met in both groups are reported in Table 4.

Table 3 Impact of humidification device on intubation rate (main endpoint), ICU and hospital length of stay, and mortality for the entire population and for both subgroups
Table 4 Presence of major and minor criteria for intubation (predefined criteria)

Secondary endpoints

In the respiratory acidosis group, the intubation rate was very similar between HH and HME, whereas there was a nonstatistically significant difference in the rate of intubation with HH in the hypoxemia group (41 vs. 26 %, p = 0.14, Table 3).

The NIV duration was the same in HH and HME groups. No significant difference was found regarding either ICU or hospital length of stay or mortality (Table 3, Fig. 2). Mortality was increased at day 28 with HH in the hypoxemia subgroup but the difference did not reach statistical significance. Five patients in the HH group vs. one in the HME group died before reintubation (p = 0.11).

Fig. 2
figure 2

Probability of survival at day 28 in all patients (log-rank p = 0.30) and the hypoxemia (log-rank p = 0.09) and respiratory acidosis groups (log-rank p = 0.94). Solid and dotted lines represent HH and HME, respectively. ICU intensive care unit, HH heated humidifiers, HME heat and moisture exchanger

There was no difference in PaCO2 between the two arms, even in the respiratory acidosis group. In this group, the PaCO2 was lower in the HH arm at 3 h but the difference did not reach statistical significance (66 vs. 72 mmHg, p = 0.08). In the same arm, the diastolic blood pressure was significantly lower in the HH arm at 3 h of NIV. The main ventilator settings (pressure support level, PEEP, and FiO2) did not differ (see electronic supplementary material).

There was no significant difference with regard to the mucosal dryness in both groups from day 1 through day 7 with median values between 0 and 1. The percentage of scores of 2 or 3 (witnessed significant mucosal dryness) was similar and moderate in both groups from day 1 through day 7. From day 1 to day 7, the mean number of patients with significant mucosal dryness was 13.8 % in the HH group and 13.2 % in the HME group.

Discussion

This is the first multicenter randomized study that assessed the impact of the type of humidification system used on the success rate of NIV delivered for acute respiratory failure with ICU ventilators. Surprisingly, the expected physiological effects of the two systems were not present and, overall, the intubation rate was not influenced by the humidification system used. The hypothesis of the superiority of heated humidifiers relying on previous physiological studies was not confirmed. The results of the present study do not support recent recommendations favoring the use of HH during NIV [1].

Our initial hypothesis was a reduction in the rate of intubation with HH in comparison with HME, in line with the data obtained during several previously conducted physiologic studies [1215]. Indeed, the lower dead space involved in HH is associated with a reduction in respiratory effort [15], minute ventilation, and an increase in alveolar ventilation (decrease in PaCO2) compared to HME [13, 14]. Our hypothesis was also supported by data showing that the lack of CO2 removal is a predictor of NIV failure in COPD patients [1620]. In addition, these studies demonstrated a higher level of gas humidification with HH [12]. Yet, no impact on the rate of intubation was noted in our study even in the group of hypercapnic patients.

There is currently no clinical study assessing the impact of humidification device used during NIV for acute respiratory failure that can be compared with the present study. One 12-month cross-over pilot study compared HH to HME during chronic NIV in 16 COPD patients. The authors did not find any differences for NIV compliance, tolerance, rate of hospitalization, or for most of the complications related to dry gases [2]. The authors noted more dry throat (50 vs. 36 %) and slightly but significantly increased levels of PaCO2 (52.3 ± 5.4 vs. 51.1 ± 4.2 mmHg) with HME in comparison with HH [2].

Several issues might explain the lack of impact of humidification devices on NIV success rate in this study. It may be more difficult to demonstrate under the conditions of a multicentric study some findings demonstrated within well-controlled physiological studies such as increased work of breathing, reduced CO2 elimination with HME, and higher humidification performances with HH. In this regard, we should consider the impact of the application of PEEP and of leaks, which may differ between well-controlled physiological studies and real-life administration. During real-life NIV administration, a PEEP level is frequently needed, thereby reducing the impact of the difference in dead space between humidification systems as previously shown in physiological studies. It was shown that increased work of breathing related to the additional dead space generated by HME was less marked with the application of a PEEP level of 5 cmH2O [15], possibly explained by a dead space washout due to leaks. In the present study, the median level of PEEP during the first NIV session was 5 cmH2O in both groups; thus the difference in respiratory work of breathing may have been small between the two arms. We analyzed the rate of “NIV failure” (counted as death before reintubation or intubation), given that five patients in the HH group vs. one in the HME group died before intubation. The conclusions are not modified when death without intubation is considered as a NIV failure. We also performed an adjusted analysis with the Cox regression model with the treatment arm and stratification group as categorical covariables which did not modify the conclusions. We observed no significant impact of the humidification device on the arterial gas changes, contrasting with previous physiological studies [14, 15] and with a clinical study [2]. In the respiratory acidosis arm, there was only a nonsignificant difference towards greater PaCO2 clearance at the third hour following inclusion with HH as compared to HME (66 vs. 72 mmHg, p = 0.08) (see electronic supplementary material). It was previously shown that the correction of respiratory acidosis was a major determinant of the NIV success in COPD patients [1620]. However, we did not observe a faster CO2 elimination as could be obtained by reducing the dead space with HH [13, 14]. Again, the most likely explanation is the presence of end expiratory leaks generating a washout effect of any CO2 in the mask. In addition, we observed that a decrease in diastolic arterial pressure and hemodynamic effects could in part explain the poor results obtained with HH. It is difficult, however, to incriminate a decrease of PaCO2 which has been shown to be associated with a decrease in cardiac output during NIV in COPD patients [28].

In the subgroup of patients with hypoxemia, a nonstatistically significant difference for increased intubation and mortality rates with HH was found. The subgroup of hypoxemic patients with HH had the highest LODS scores, a lower PaCO2 suggesting a higher minute ventilation, a lower bicarbonate level, and a higher need for FiO2—all these differences being significant. The higher apparent severity of this subgroup may therefore explain the results. These results rely on analyses conducted on small numbers of patient. Consequently the differences may be due to chance and the sample size for these subgroup analyses does not allow a definitive conclusion. However, these findings raise some questions and at least do not support the routine use of heated humidifiers in all patients during NIV as recently recommended [8].

Another interesting result of this study was that no significant difference was observed between the two groups regarding mucosal dryness, despite presumed higher humidification performances of HH. Indeed, severe dryness frequency was low, ranging from 8.5 to 17.5 % between day 1 and day 7. The better humidification performances of the HH have been shown in favorable conditions [29]. These performances may have been reduced as a result of suboptimal conditions which exist in a real-life setting, especially the high ambient temperatures related to the lack of air conditioning in several participating centers [29]. In a previous study conducted in healthy subjects moderate levels of humidity obtained with HME (even in the case of leaks) were as comfortable as higher humidity obtained with HH [12]. These data show that both humidification devices seem capable under the study conditions of reducing this frequently reported complication [5, 6].

The strength of the current study was the control for usual confounding factors that could bias the study favoring therefore one arm over the other. The median level of pressure support was not significantly different between both arms: 12 (11–16) cmH2O in HME arm vs. 14 (12–15) cmH2O in HH arm (p = 0.24). Moreover, NIV daily duration was equivalent in both groups, except for the second study day during which patients received a few minutes extra NIV within the HH arm (8.2 vs. 8.0 h, p = 0.02). Such a difference is probably not clinically relevant. It is also important to note that participating centers were homogeneous and had several years of experience of NIV. However, it was not possible to control other factors that may have a prominent role on NIV outcome such as the level of leaks, comfort, or patient–ventilator asynchronies, which may be more important than humidification in explaining NIV success or failure. Of note, none of the ventilators used had an “NIV” mode.

There are several limitations to this study. We acknowledge that the study was underpowered and definitive conclusions are not possible; this is especially true for subgroups analysis. Only two types of humidification systems were used. Other systems, especially HME with low dead space, may influence the impact on PaCO2 clearance [30]. Similarly, we did not include a third arm without a humidification system [31]. This option might have been feasible for short NIV periods using turbine ventilators and at low levels of FiO2; however, turbine ventilators are infrequently used in Europe [8], as confirmed by the present study (see electronic supplementary material). Thus, the use of a humidification system to avoid dry gas delivery is well justified, especially in NIV patients that frequently feature bronchial hyperreactivity [9, 10] and which can be further increased by the use of dry gases [11]. In the present study, we used an HME with a relatively large dead space, and these results may not be extrapolated to HMEs with a lower dead space. We did not record ventilator settings such as inspiratory or expiratory triggers and pressurization ramp which may potentially have a clinical impact. We thus do not know if these settings differed between groups. Finally, another potential limitation is that this was an open label study. Indeed, it is impossible to envisage driving this type of study blindly. To reduce the possible bias related to this methodology, we used well-defined criteria for our primary endpoint of intubation.

In conclusion, the humidification device used during NIV with ICU ventilators did not have a statistically significant impact on the success rate of this technique in this study. In addition, no difference in the patients’ mucosal dryness was reported with HH in comparison with HME. Our results suggest that despite a strong physiologic rationale supporting the use of heated humidifiers during NIV, this device cannot be recommended as a first-line treatment in all patients with acute respiratory failure. The use of an HME (ideally with a low dead space) while removing the additional dead space (flex tubing) seems to be acceptable in light of the results of this study. In the presence of persistent high PaCO2 levels associated with threatening encephalopathy, the reduction of dead space with an HH may be considered.