Background

The need for respiratory support is the most frequent cause of admission to intensive care unit among patients with COVID-19[1]. The optimal first-line approach for respiratory support in acute hypoxemic respiratory failure is debated [2, 3].

In hypoxemic patients, high-flow nasal oxygen is recommended as the first-line intervention due to its effectiveness, accessibility and simplicity of use [4]. Noninvasive ventilation (NIV) with helmet interface and specific settings has been proposed as an alternative tool to manage patients with moderate-to-severe hypoxemia [5,6,7,8,9,10]. Its putative benefits include the possibility of providing continuous treatments with good tolerability, improved oxygenation, and potential avoidance of lung injury progression [11, 12]. However, its use is limited to highly selected environments with expert personnel, due to the lack of conclusive evidence regarding its efficacy.

Recently, the first head-to-head randomized trial (HENIVOT) compared helmet NIV, eventually followed by high-flow nasal oxygen, versus high-flow oxygen alone as the first-line treatment of patients with COVID-19 and moderate to severe hypoxemic respiratory failure in the intensive care unit [13]. Results showed no significant difference in the number of days free of respiratory support within 28 days from randomization, but a reduced need for endotracheal intubation and invasive mechanical ventilation in the helmet group, suggesting a potential advantage to explore.

Long-term health consequences of COVID-19 have been widely described. These include impaired pulmonary function, physical health and neuropsychological sequelae [14,15,16,17,18,19,20,21,22,23]. In patients with acute hypoxemic respiratory failure and acute respiratory distress syndrome, avoidance of endotracheal intubation improves long-term outcome and quality of life [24, 25].

Limited evidence is available regarding the functional outcomes of patients who receive helmet noninvasive ventilation (NIV) in acute hypoxemic respiratory failure [26, 27], and there are concerns about the possible risk of self-inflicted lung injury and pulmonary sequelae due to the effect of spontaneous breathing [28, 29]. To date, no study has ever compared helmet NIV with specific settings vs. high-flow nasal oxygen in terms of long-term outcomes.

In this pre-planned analysis of the HENIVOT trial, we compared the 6-month clinical status, respiratory function, and quality of life of intensive care unit survivors treated with helmet NIV or high-flow nasal oxygen. We also assessed the effects of the need for endotracheal intubation on these outcomes.

Methods

Patients enrolled in the “HElmet NonInvasive Ventilation versus high-flow Oxygen Therapy in acute hypoxemic respiratory failure (HENIVOT)” were followed up 6 months after randomization. HENIVOT was an investigator-initiated, two-arm, open-label multicentre randomized trial conducted in 4 intensive care units between October and December 2020. The study was supported by the acute respiratory failure study group of the Italian Society of Anesthesia, Analgesia and Intensive Care Medicine (SIAARTI) and was approved by the ethics committee of all participating centres. All patients provided written informed consent for study participation and data analysis.

One hundred-nine patients with moderate-to-severe hypoxemic respiratory failure (ratio of arterial oxygen partial pressure to inhaled oxygen fraction equal to or below 200 mmHg) due to COVID-19 at intensive care unit were randomly assigned in a 1:1 ratio to receive helmet NIV, eventually followed by high-flow nasal oxygen (N = 54) or high-flow nasal oxygen alone (N = 55). The full study protocol and statistical analysis plan are available elsewhere [13].

Study design, setting, and participants

All patients discharged alive from hospital were contacted 6 months after enrolment to assess their clinical status, respiratory function, and quality of life. Follow-up visits were performed according to the study protocol from April 2021 to June 2021 in the Italian centers involved in the trial, which are equipped with post-acute outpatient services for individuals discharged from hospital after recovery from COVID-19.

Variables and data collection

The primary outcome of the study was the percentage of patients with any symptoms at the 6-month follow-up.

Follow-up visits consisted of a global medical assessment including medical history, physical examination, blood tests and pulmonary function test.

Patients were interviewed face-to-face with a set of questionnaires that measure physiological and psychological status, including: the modified British Medical Research Council (mMRC, a clinical tool to stratify dyspnea) [30], the EuroQoL five dimensions five levels (EQ-5D-5L) questionnaire (an instrument to measure health-related quality of life) [31], the EuroQol Visual Analogue Scale (EQ-VAS, a self-rated health record) [31], the Medical Outcomes Study 36 Items Short form (SF-36, a patient-report questionnaire on quality of life) [32, 33], the Post Traumatic Stress Disorder Checklist for the Diagnostic and Statistical Manual of mental disorder (DMS-5) (PCL-5, that assesses the symptoms of post-traumatic stress disorder) [34] and a self-reported symptoms questionnaire (Additional file 4: Material, E-Appendix).

The physical examination included the six minutes walking test [35] and 30-s chair stand test [36].

Venous blood samples were collected and analyzed to determine complete blood cell count (hemoglobin, platelet count, white blood cell count), hepatic function (total bilirubin and transaminases) and renal function (creatinine, blood urea nitrogen and estimated glomerular filtration rate).

Pulmonary function test consisted of global spirometry with diffusion capacity for carbon monoxide (DLCO) assessment and arterial blood gas analysis.

All data were recorded in an electronic database.

Statistical analysis

Categorical variables were summarized as absolute and percentage frequencies and continuous variables as median and interquartile range. Between-group comparisons were performed using Fisher’s or Chi-square test for categorical variables, Mann–Whitney test for ordinal variables or non-normal quantitative variables and t test for normally distributed quantitative variables.

To estimate the effect of endotracheal intubation on physical and psychological performance status at 6 months, we compared the study variables between patients who required endotracheal intubation and who did not require endotracheal intubation, independently from the assigned treatment.

Some patients had missing data for some outcomes: the exact number of patients with complete data for a determined group of variables is specified in the tables. Given that the data were missing at random, no imputation was performed. The significance level was set at p < 0.05. All statistical tests were two-tailed.

We performed a Random Forest analysis and the respective MDS plot with as independent variables the allocation treatment and the endotracheal intubation, to explore their effects on a number of outcomes (Additional file 12). The model was trained to optimize the number of trees.

Statistical analysis was performed with SPSS (IBM Corp. Released 2019. IBM SPSS Statistics for Windows, Version 26.0. Armonk, NY), GraphPad Prism (GraphPad Software, San Diego, California USA, version 7.0.0 for Windows) and R software (Rstudio Team (2020). RStudio: integrated Development for R. RStudio, PBC, Boston, Version 2023.03.0 + 386).

Results

Eighty-two of the 109 patients enrolled in the HENIVOT trial were discharged alive from the hospital and were contacted at 6 months; among these patients, 2 were deceased, 7 refused to participate and 2 were lost at follow-up. Of remaining 71, who successfully completed the follow-up, 35 received helmet NIV and 36 high-flow oxygen (Fig. 1). Table 1 shows the patient’s demographics.

Fig. 1
figure 1

Study flow-chart

Table 1 Characteristics of patients, according to the study group.*

Symptoms and physical performance test

There was no difference in the primary outcome: the majority of patients reported at least one symptom at the follow-up: 28 (90%) in the helmet group and 28 (85%) in the high-flow group (p = 0.71). Overall, the most common symptoms were dyspnea and fatigue, with a median mMRC, respectively, of 2 [IQR: 1–2] in the helmet group vs. 2 [IQR: 2–3] in the high-flow group (p = 0.25). Patients in the helmet group reported a significantly lower frequency of arthralgia, meant as joint pain or stiffness (16% vs. 55%, OR = 0.16 [95% CI 0.05–0.52], (p = 0.002). No significant difference between groups was found in the incidence of dyspnea, fatigue, dry cough, sore throat, rhinitis, smell and taste disorders and decreased visual acuity (Table 2).

Table 2 Outcomes at 6 months, according to study group*

Vital signs, laboratory tests and arterial blood gas were similar between the two groups and did not present significant abnormalities (Additional file 3: Table S1).

Pulmonary function parameters were impaired in the overall cohort, with no significant differences between groups (Fig. 2, Table 2, Additional file 3: Table S2). DLCO impairment was the most frequent abnormality: 78% [70–85] of the predicted value among patients treated with helmet and 76% [67–88] among those treated with high-flow nasal oxygen (p = 0.67).

Fig. 2
figure 2

Radar chart showing pulmonary function tests at 6 months in our cohort. Both patients treated with Helmet NIV and high-flow had reduced performance at 6 months, but there was no difference between the two groups (upper panel). Patients that did not require endotracheal intubation had pulmonary function tests at 6 months more similar to physiological values, compared to patients that required endotracheal intubation (lower panel)

Median distance at six minutes walking test was 490 m [IQR: 420–540] in the helmet group and 510 m [IQR: 438–540] in the high-flow group (p = 0.82), without significant differences in terms of dyspnea assessed by the BORG scale (4 [IQR: 2–6] vs. 3 [IQR: 3–5], respectively, p = 0.91). One (4%) patient in the helmet group vs. 4 (15%) in the high flow group were unable to walk the predicted distance (p = 0.18).

The 30-s chair stand test was successfully completed by all participants, with 23 cycles in 30 s [20–29] in the helmet group and 23 [19–25] in the high flow group (p = 0.65). (Additional file 3: Table S2).

Health-related quality-of-life assessment

Full details of the EQ-VAS, EQ-5D-5L, PCL5, SF-36 questionnaires are presented in Table 2 and in Additional file 3: Table S2.

The two groups had similar SF-36 scores in all domains; the most impaired domains were “role limitation due to physical health”, “general health” and “health change”, without differences between groups (all p > 0.05) (Fig. 3, Table 2).

Fig. 3
figure 3

Radar chart showing health-related quality-of-life assessment in our cohort. For the SF-36 test the higher score represents a better subjective health experience. The top left panel shows no difference in subjective health experience at 6 months between the high-flow group and the helmet group; however, patients that required endotracheal intubation reported worse subjective health experience compared to those who avoided endotracheal intubation (top right panel). The lower panel shows the EuroQoL five dimension five levels (EQ-5D-5L). The radar chart shows the percentage of patients reporting problems in each of the five dimensions. There was no difference between the high-flow and the helmet group (bottom left panel); however, a higher percentage of patients that needed endotracheal intubation reported impairment in each of the five dimensions, compared to the patients that avoided endotracheal intubation (bottom right panel)

Many patients reported decreased quality of life in the EQ-5D-5L, including pain (21 patients (58%) in the high-flow oxygen group vs. 19 (54%) vs. the helmet group, p = 0.81) and anxiety or depression (14 patients (39%) in the high-flow oxygen group vs. 12 (34%) in the helmet group, p = 0.81) (Fig. 3).

The EQ-VAS score was 70 [60–80] in the high-flow group vs. 78 [69–80] in the helmet group (p = 0.27). The median PCL-5, assessing post-traumatic stress disorder, did not differ significantly between the two groups (8 [2–17] vs. 11 [3–17], p = 0.71), and in the majority of the cohort it was far below the cutoff score of 33 that defines post-traumatic stress disorder.

Effect of endotracheal intubation on outcomes

We compared 17 patients who required endotracheal intubation (3 on Helmet NIV, 14 on HFNC) and 54 patients who did not (32 on Helmet NIV, 22 on HFNC). Besides the type of respiratory support, no other significant differences were found between patients who required endotracheal intubation vs. those who did not (Additional file 3: Table S3).

Patients who required endotracheal intubation, compared to those who did not, showed a higher incidence of fatigue (13 (76%) vs. 22 (47%), OR = 3.69 [95%CI 1.05–13.00], p = 0.05), arthralgia (11 (65%) vs. 12 (26%), OR = 5.35 [95%CI 1.63–17.60], p = 0.007) (Table 3) and decreased estimated glomerular filtration rate (80 [58–91] vs. 92 [IQR 78–101] ml/min/1.73m2, mean difference 14.44 [95%CI 0.80–28.09], p = 0.04) (Additional file 3: Table S4).

Table 3 Outcomes at 6 months according to intubation*

Compared to patients who successfully avoided intubation with noninvasive support, a higher proportion of intubated patients showed DLCO < 80% (OR = 6.27 [95%CI 1.26–31.29], p = 0.03), forced vital capacity < 80% (OR = 5.6 [95%CI 1.40–22.44], p = 0.02) and total lung capacity (OR = 7.43 [95%CI 1.97–28.08], p = 0.003) (Table 3, Additional file 3: Table S4, Fig. 3).

Median EQ-VAS was significantly lower in intubated patients, compared to those who had not (70 [53–70] vs. 80 [70–83], with a mean difference of 9.26 [95%CI 1.89–16.64], p = 0.01) (Fig. 3, Table 3, Additional file 3: Table S4).

Compared to non-intubated patients, a higher proportion of intubated patients reported mild problems (EQ-D5-L5 > 1) doing usual activities of the daily living (65% vs. 31%, OR = 3.99 [95%CI 1.27–12.58], p = 0.02) and in the mobility (59% vs. 26%, OR = 4.08 [95%CI 1.30–12.78], p = 0.02).

Compared to non-intubated patients, patients who had received invasive mechanical ventilation reported lower SF-36 scores, showing limitations in physical functioning (p = 0.01), limitations due to decreased physical health (p = 0.05), chronic pain (p = 0.03), and worsen overall health (p = 0.05) (Fig. 3, Table 3).

Discussion

The results of our study on the functional outcomes of patients affected by COVID-19 moderate-to-severe respiratory failure treated with helmet noninvasive ventilation vs. high-flow nasal oxygen can be summarized as follows:

  • The incidence of symptoms was high in both groups, with no significant difference between them, except for a lower incidence of arthralgia in the helmet group.

  • Both groups demonstrated decreased physical status, functionality for pulmonary tests, and quality of life.

  • Patients who required endotracheal intubation had worse physical and psychological outcome at 6 months compared to those successfully treated with non-invasive respiratory support.

Exploring long-term consequences of these treatments is crucial to inform the ongoing debate regarding the optimal non-invasive management of acute respiratory failure.

Preserving spontaneous breathing in moderate-to-severe hypoxemic patients has benefits and risks.

It can help avoid complications related to invasive mechanical ventilation and prevent diaphragm dysfunction [28, 37], but may also exacerbate lung injury due to the combination of dysregulated respiratory drive and lung inhomogeneities causing patient self-inflicted lung injury (P-SILI) [29, 38,39,40].

P-SILI can occur during any noninvasive support strategy, but most concerns exist when mechanical increases in airway pressure are applied during NIV [41, 42]. In this context, high PEEP (of at least 10 cmH2O) can reduce lung inhomogeneity, improve lung mechanics, and decrease inspiratory effort, potentially mitigating the detrimental effects related to P-SILI [43,44,45,46]. However, if the inspiratory effort is low, helmet NIV leads to increased transpulmonary pressure and tidal volumes, which possibly worsen outcomes [41, 44, 47]. These risks underscore the importance of examining long-term outcomes, to rule out possible adverse effects on lung function caused by noninvasive support.

In our study, a considerable proportion of severe COVID-19 survivors experienced persistent symptoms, decreased respiratory performance and reduced quality of life 6 months after the infection, with no significant difference between helmet NIV and high-flow group. This overall rate of long-term sequalae reported in our study is consistent with current evidence [17, 20,21,22,23, 48, 49]. The only statistically significant difference between groups was an increased incidence of arthralgia in the high-flow group (55% vs. 16%, p = 0.002). Arthralgia is one of the most common long-term symptoms of COVID-19 [50]. The underlying pathophysiological mechanism is not fully understood, but likely involves biochemical and inflammatory response pathways, such as cytokine increase and dysregulation [51]. The higher incidence of arthralgia in the high-flow group may reflect the more frequent use of invasive mechanical ventilation, prone position sessions and muscle paralysis in these patients. Notably, all intubated patients received at least one session of prone positioning and muscle paralysis for a minimum of 24–48 h.

A significant proportion of patients had respiratory impairment 6 months after COVID-19, which is consistent with previous findings [19, 52,53,54,55,56,57,58]. Decreased diffusing capacity and gas exchange impairment, reported in over half of the patients, are related to pulmonary interstitial damage [59]. This may be linked to the reduction of pulmonary volumes, which was detected in up to one-third of our patients without difference between study groups: this may suggest a fibrotic evolution of the disease [60, 61].

Regarding health-related quality-of-life impairment, our findings align with other investigations and are linked to the psychological stress caused by intensive care unit stay, respiratory support and extended hospitalization and isolation periods [17, 48, 49, 56, 62]. Despite the general psychological impairment, our cohort did not present post-traumatic stress disorder symptoms, such as sleeping and behavioral issues, according to the PCL-5 test.

Health-related quality of life of patients treated with helmet NIV was also explored by the recent follow-up study of the Helmet-COVID randomized trial [27]. Consistently with our results, they found no difference in the EQ-VAS and the EQ-5D-5L questionnaire between helmet NIV and usual respiratory support: however, in the Helmet-COVID study, the control group included patients treated with high-flow oxygen, facemask NIV and conventional oxygen according to the clinical practice of participating centers. Differently, our study compared helmet NIV vs. high-flow nasal oxygen, which is the suggested first-line treatment for hypoxemic respiratory failure: this enhances the external validity and reproducibility of our findings.

Despite the difference in endotracheal intubation rate (9% in the helmet group vs. 28% in the high-flow group, p = 0.005), we did not find significant effects on long-term outcomes. This finding may be considered as surprising, since previous reports showed that avoiding intubation yields improved long-term outcomes, while endotracheal intubation, sedation, paralysis and invasive mechanical ventilation are associated with higher mortality and morbidity [24, 25, 63]. Lung protective ventilation bundles (including delivery of low tidal volumes and extensive use of prone position) and prevention measures for intensive care unit-acquired infections, which were rigorously applied in all patients. These may have contributed to limit the detrimental effects of invasive mechanical ventilation among patients who survived [37], mitigating any inter-group difference between patients treated with helmet NIV or high-flow oxygen.

To address this aspect, we conducted a secondary analysis, comparing study outcomes for patients who received endotracheal intubation and invasive ventilation vs. those who did not. We found significant impairment in most items concerning pulmonary function, physical performance, and quality of life in patients who had been intubated vs. those who had not (Table 3, Figs. 2 and 3).

Furthermore, we performed two Random Forest analyses and display the respective MDS plot: one with treatment using high-flow or helmet NIV as independent variables and the other with endotracheal intubation as the independent variable. While the treatment allocation had a high out-of-bag estimation rate (39%), yielding higher uncertainties in its predictive power, a cluster of a group of patients treated with high-flow is identifiable in the confusion matrix (Additional file 1: Figure S1a). Despite exploratory in nature, this finding suggests a possible protective effect of helmet NIV on analyzed outcomes. Conversely, endotracheal intubation had a lower out-of-bag estimation rate (19.72%), suggesting lower uncertainties in its predictive power. In this case, although clustering could not be observed (Additional file 1 and 2: Figure S1a–b), a clear asymmetry in the model is presented, suggesting a strong relationship between endotracheal intubation and our outcomes.

Our study has relevant clinical implications: while avoiding endotracheal intubation has benefits, physicians need to carefully balance the advantages of averting intubation with the risk of treatment failure, which leads to delayed intubation and worse clinical outcomes [64]. In this post hoc analysis of the HENIVOT trial, helmet NIV was not associated with poorer respiratory outcomes compared to high-flow, ruling out the possibility of long-term detrimental effects related to P-SILI caused by helmet NIV. This is likely due to the strict monitoring of patients and pre-specified criteria for endotracheal intubation, which limited intubation delay, with a median time between enrollment and intubation of 29 h [IQR 8–71] for the helmet group and 21 h [IQR 4–65] for high-flow group [13]. In addition, despite prolonged treatments (48 h continuously in 91% of patients) and lower comfort with helmet NIV compared to high-flow [13], no difference was observed in terms of psychological effects. Given comparable physical and psychological medium-term effects, and the possible beneficial effect on endotracheal intubation, helmet NIV as applied in the HENIVOT trial might be an effective and safe strategy in acute hypoxemic failure.

Our study has several strengths: a systematic approach with a well-defined follow-up protocol, a multicenter involvement, a comprehensive assessment of physical and mental health status, a small number of patients lost to follow-up and few missing data.

Our study has some limitations. First, despite a high level of adherence to follow-up (71 participants out of 80 patients alive at 6 months), the sample might be underpowered to detect whether the need for endotracheal intubation and invasive mechanical ventilation mediates the effect of applied interventions on analyzed outcomes. Second, many indices showed a difference between the two groups, although not significant, and we cannot exclude that a larger sample size might have revealed differences. Finally, the lack of pulmonary function tests before COVID-19 infection did not allow us to discriminate the effect of COVID-19 vs. the effects of P-SILI and ventilator-induced lung injury; however, the randomized design of the investigation should have made the two cohorts comparable.

Conclusion

In patients with COVID-19 requiring noninvasive respiratory support due to acute hypoxemic respiratory failure, the use of helmet noninvasive ventilation appears as effective and safe as high-flow oxygen, in terms of 6-month functional status and quality of life. Need for invasive mechanical ventilation after noninvasive support is strongly associated with worse quality of life, physical and psychological status at 6 months. These data indicate that helmet NIV, as applied in the HENIVOT trial, can be safely used in hypoxemic patients.