Introduction

The unmet need for accurate early predictors of treatment failure and death during noninvasive ventilation (NIV)

NIV is being increasingly used as first-line ventilatory support in de novo (i.e., not due to exacerbation of chronic lung disease or cardiac failure) acute hypoxemic respiratory failure (AHRF). However, concern has been raised that mechanical ventilation may exacerbate lung injury (i.e., ventilation-induced lung injury, VILI) and worsen outcome in spontaneously breathing patients with AHRF [1]. Different ventilatory variables have been proposed to contribute to VILI progression, including high tidal volume [2], minute ventilation [3] and inspiratory effort [4]: these parameters have been separately addressed in clinical studies, but data in noninvasively ventilated patients are scarce, inconsistent and inconclusive, at least in part due to the difficulty in controlling and monitoring time-varying individual ventilatory parameters in spontaneously breathing patients.

Potential contribution of Mechanical Power (MP) to clinical prediction in the framework of PPPM

Predictive, Preventive, and Personalized Medicine (PPPM) is an effective integrative approach, which has been promoted by the European Association for Predictive, Preventive and.

Personalized Medicine (EPMA, http://www.epmanet.eu/) [5]. It contains three aspects: individual predisposition prediction, targeted preventive measures and personalized treatment algorithms [6].

In recent years, a unifying patho-physiological theory, based on thermodynamic principles, has been proposed to underlie VILI: this theory attributes lung injury to the energy transfer from the ventilator to the pulmonary parenchyma, with energy dissipation within the lungs leading to heat production, cell integrity and extracellular matrix disruption, and inflammatory cell recruitment [7-9]. Consistently, mechanical power (MP), a measure of the energy transfer rate from the ventilator to the respiratory system, predicted mortality in invasively ventilated ARDS patients, irrespective of the combination of each ventilatory component [10-13].

Whether the energy delivered to the respiratory system during noninvasive ventilation (NIV) affects clinical outcomes in AHRF is unexplored. During COVID-19 pandemic, NIV intensity and duration were linked to an increased mortality in COVD-19-related AHRF [14-16], but no individual ventilatory parameter, including respiratory drive and inspiratory effort, was able to predict clinical outcomes [17-20]. Furthermore, clinical benefits from awake prone position (PP) during NIV have been reported by several, but not all studies, through still unclear mechanisms [21, 22].

Working hypothesis

We hypothesized the energy delivered by noninvasive ventilatory assistance to the respiratory system could contribute to physio-anatomical and clinical responses to NIV in Severe Acute Respiratory Syndrome Corona Virus 2 (SARS‐CoV‐2) pneumonia-related AHRF, and that the clinical benefits of awake PP in these patients could be at least in part mediated by a reduced MP delivery during prone NIV. We therefore investigated:

  1. 1)

    the effect of prone position (PP) on different measures of MP during NIV.

  2. 2)

    the contribution of MP measures delivered early during either supine or prone NIV to physio-anatomical and clinical outcomes in COVID-19 pneumonia

Methods

In this secondary analysis of the non-randomized, controlled Prone position in NonInvasive Ventilation (PRO-NIV) study(study ID: ISRCTN23016116) [23], we studied 216 SARS‐CoV‐2 pneumonia patients with acute (i.e. symptom onset < 14 days of hospital admission) moderate-to-severe AHRF(paO2/FiO2 ratio < 200 mmHg while on a FiO2 50% Venturi mask or a non-rebreather reservoir bag mask): 108 patients treated with NIV (CPAP or PSV) plus early PP and 108 matched controls treated with supine NIV) at HUMANITAS Gradenigo COVID Subintensive Care Unit between June 1st 2020 and June 30th 2021.

In both groups, NIV was initiated within 24 h of admission to Subintensive Care Unit and delivered continuously for ≥ 48 h or until discharge or death; full-face mask was the initial interface of choice. PP was initiated within 24 h of admission in the PP group. NIV and PP duration, equipment, settings, standard care, monitoring, treatment failure criteria were protocolized a priori before patients enrollment (see supplementary text, Supplementary Fig. 1, Supplementary Table 1) [23].

Measurements

Respiratory parameters

NIV duration, posture, ventilatory settings and parameters (spO2, RR, VTe, MV) were continuously monitored and recorded on an hourly basis on a predefined form.

ABGs were performed during NIV, every 24 h and ≥ 1 h after each postural change, after achievement of ventilatory stability (defined by a ≤ 10% variation in RR and VTe and air leaks < 10% for ≥ 30 min).

The following gas exchange parameters were calculated from ABG: paO2/FiO2 ratio and dead space indices (DSIs) [Ventilatory Ratio (VR) and corrected Minute Ventilation (MVcorr)]

The energy delivered by ventilatory assistance to respiratory system per time unit (mechanical power, MP) was estimated via Becher formula [24], which was already evaluated in noninvasively ventilated patients and was recommended to improve noninvasive ventilatory support monitoring in COVID-19 pneumonia [14, 25]. In this formula, ∆Pinsp was replaced by ∆Paw(airway pressure over PEEP) to reflect the energy imparted by ventilator during inspiration [1, 26].

Since NIV isa dynamic process, to reflect the power delivered during the entire ventilatory sessions, time weighted–average of hourly MP values during each NIV session (supine/prone) was calculated as the area under the parameter–versus–time plot (detailed in supplementary text) [27]. The same procedure was followed to calculate time weighted–average values of RR, VTe, and MV for each ventilator session and for each day.

For MV, we also planned to verify consistency between time-averaged values and those obtained at the time of ABGs, which were used to calculate DSIs.

The primary exposure variable of interest was MP normalized by the volume of well-aerated lung (MPWAL), to reflect the “intensity” of the power, i.e., the volume of well-aerated lung exposed to energy load during mechanical ventilation, during the 1st 24 h of NIV [MPWAL(day 1)].

We also normalized MP by predicted body weight (MPPBW) to account for individual lung size variation.

Lung imaging

All patients underwent a lung CT scan on admission: the nature and extent of parenchymal involvement were scored using a validated index [28] and the volume of well-aerated lung (WAL), of poorly aerated lung (PAL) and of non-aerated lung (NAL) were quantified on CT scans via a validated open-source software (3D Slicer ver.4.13.2) (see supplementary text) [29].

Lung ultrasound was performed daily from admission (day 0) to day 7 by three intensivists with expertise in lung and cardiac recording and interpretation (each operator having performed at least 50 supervised procedures and at least 200 non-supervised procedures) [30] using the same equipment (HM70A Samsung, Seoul, Korea), the same convex-array probe and the same setting.

The accuracy of ultrasound examinations in staging lung disease severity was preliminarily evaluated at baseline against the CT scan (double-blinded operators, LUS performed within 24 h of CT examination).

The severity and extent of parenchymal involvement of each of 6 lung regions (2 anterior, 2 lateral, 2 dorsal) were scored (range 0–3) [31] and recorded on a predefined form and the following indices were calculated (supplementary text):

  • regional and global lung ultrasound score (LUS);

  • regional and global LUS reaeration score, a validated index of lung recruitment (i.e., change from consolidated, non-aerated tissue to aerated tissue) [32, 33];

  • additionally, using software-based lung parenchyma segmentation and analysis function [29], each lung was divided into six areas to mirror as much as possible the regions explored by ultrasound, and regional and global LUS-derived WAL volume (i.e. lung volume with LUS score 0–1), PAL (i.e. lung volume with LUS score 2) and NAL (i.e. lung volume with LUS score 3) were derived from admitting LUS scan examinations as previously described in ARDS [34]. Global LUS-derived WAL volume, a predictor of COVID-19 pneumonia outcome [24], was then calculated from daily LUS scans through day 0–7(detailed in supplementary text).

The agreement between regional LUS score and regional CT classification was assessed with Cohen’s kappa coefficient, and the association between regional CT-derived gas/tissue content and regional LUS categories was assessed by simple linear regression and Spearman’s rank correlation (rs).

Details on full derivations are provided in supplementary text.

The PEEP at which each LUS examination was made was recorded.

  • Circulating biomarkers: eighteen circulating inflammatory and procoagulative biomarkers were measured daily from admission to discharge or death [35].

Definitions and timepoints

We adopted the following definitions:

  • day 0 (baseline): the time of NIV initiation;

  • day 1: first 24 h after NIV initiation;

  • day 7: day 7 after NIV initiation;

  • timepoint sp0: supine position. In PP group, the session preceded the first PP session. ABG was performed ≥ 1 h after supine NIV initiation, after achieving ventilatory stability;

  • timepoint pp1: first PP session in PP group. ABG was performed ≥ 1 h after prone NIV initiation, after achieving ventilatory stability;

  • timepoint sp1: supine position; ABG was performed ≥ 1 h after resupination following the first PP session in the PP group and 24 h after NIV initiation in the supine group, after achieving ventilatory stability;

This schedule allowed comparing all groups in the supine position, after 24 h of NIV, while taking into account the effect of the first PP session (pp1) in the PP group.

Further definitions are provided in legends to Table 1 and in a study scheme (Supplementary Fig. 1).

Table 1 Baseline, treatment-related parameters and clinical outcomes of supine and PP group; within each treatment group, patients were divided according to median values of Mechanical PowerWAL during the 1st 24 h of NIV (MPWAL at day 1) into High MPwal (MPwal ≥ 9.1 J/min/L) or Low MPwal (MPwal < 9.1 J/min/L)(n = 216)

Outcomes

Primary outcomes were the occurrence of.

  • NIV failure within 28 days of enrolment, defined as intubation OR death;

  • death, censored at 28 days after enrollment.

Secondary outcomes were:

  • endotracheal intubation (ETI) at 28 days (after excluding patients with a do-not-intubate, DNI, order);

  • 60-day death

  • O2-response: paO2/FiO2sp1–paO2/FiO2sp0 (or ∆paO2/FiO2sp0-1).

  • CO2-response: ∆VRsp0-1;

  • C-reactive protein (CRP) response: ∆CRP0-1;

  • global LUS response at day 1: ∆global LUS0-1;

  • global reaeration score at day 1: global reaeration score1;

  • change in LUS- assessed WAL (%) at day 1 (∆WAL0-1)

Statistical analyses

Sample size calculation and propensity score (PS)-matching of PP and supine group.

Sample size calculation and PS-matching of PP and supine groups for relevant baseline covariates are detailed in supplementary text.

Descriptive statistics

Data are given as median (IQR) or n (%) as appropriate.

We used chi-square test or Fisher´s exact test for categorical variables, T-Test for normally distributed and Kruskal–Wallis test for non-normal continuous variables.

Time change in continuous variables was assessed by computing the AUC with the trapezoid method [36].

Repeated measures two-factor (within subject and between group) ANOVA was used to compare continuous variables assessed at multiple timepoints (i.e., respiratory and biochemical parameters), after log-transformation of non-normal variables.

To explore the effect of early MPWAL delivery on clinical outcomes, the whole cohort was split into 2 groups according to median MPWAL values at day 1. The probability of 28-day NIV failure, death and ETI in low vs high MPWAL group at day 1 was compared using Kaplan–Meier procedure and log-rank test. Data were analyzed on an intention-to-treat basis.

Beside categorizing patients into low/high MPWAL, we explored dose–response relationship between early power delivery and clinical outcomes by dividing the entire cohort into quartiles of power measures at day 1. Comparison between quartiles was made by ANOVA with post-hoc comparison from the first quartile performed using the Tukey test. We planned to assess also reciprocal relationship between MP, MPPBW and MPWAL at day 1 by univariate analysis and Spearman correlation coefficient (rs).

Multivariable Cox proportional regression analysis adjusting for imbalanced covariates between groups was used to assess the effect of confounders on 28-d NIV failure, death and ETI in the whole cohort, with the maximum number of covariates allowed in each model set at (event rate x N)/10, where N is the sample size [37]. The allocation assignment (PP or supine) was entered as a predefined covariate into the models. Calendar month of admission was forced into all models to account for unmeasured temporal disease trends during the pandemic.

Anticipating high collinearity between ventilatory variables, in all (Cox and linear) multivariable models we used a combination of backward procedure and exclusion of highly collinear variables through model-dependent Variance Inflation Factor(VIF) cut-off values to select covariates [38].

MP, MPPBW and MPWAL at day 1 were included into all backward multivariate models to assess relative robustness of the association of each power measure (MP, MPPBW, MPWAL) with outcomes.

We also explored the predictive performance of each power measure and of other ventilatory variables at day 1 for 28-day NIV failure and death and the optimal cut-offs using the area under receiver operating characteristic curve (AUROC) analysis and Youden index.

Comparison between AUROC curves was made by DeLong’s method.

In the physio-anatomical analysis, we explored dose–response relationship between the power delivered at day 1 by the ventilator to the respiratory system and gas exchange, ultrasonographic and circulating biomarker changes after the 1st 24 h of NIV by univariable and multivariable regression analysis, after log transformation of skewed parameters; the best fit among four predictive models (linear, exponential, logarithmic, binomial) was searched using R2 values.

Two-tailed p values < 0.05 were considered statistically significant (MedCalc 19.7, Ostend, Belgium).

Results

Characteristics at Inclusion

We analyzed 216 COVID-19 patients (108 treated with PP and 108 with supine NIV) (Table 1, Supplementary Fig. 2). Pre-post-matching SMDs, PS density and logit(PS) distribution plots revealed good balance between PP and supine groups in selected covariates (Supplementary Fig. 3, Supplementary Table 2).

Baseline demographics, pharmacological therapies, clinical-radiological features, and respiratory parameters of patients treated with PP or supine NIV, subgrouped according to median MPWAL at day 1, were similar across groups; NRS (dyspnea), WOB score [39] and RR showed no evidence for increased respiratory drive, while lung CT scans showed largely predominant ground-glass opacities (Table 1).

LUS lung aeration assessment

The median (IQR) global LUS severity score was 25 (21, 30), closely correlating with global CT severity score (rs = 0.83; p < 0.001: Supplementary Fig. 4).

Regional LUS categorization closely correlated with CT-assessed gas/tissue density, with each step increase of LUS score being associated with a significant increase in lung tissue density (Fig. 1 panel A): CT-LUS agreement was almost complete for WAL (k = 0.91; 95%CI: 0.85–0.97), substantial for PAL(k = 0.61; 95%CI: 0.57–0.66) and moderate (k = 0.55; 95%CI: 0.51–0.60) for NAL (Supplementary Table 3A–B), consistent with previous reports in ARDS [34].

Fig. 1
figure 1

Panel A: categorization of individual lung regions with different CT-assessed gas/tissue density in relation to their LUS score. Panel B: correlation between WAL(%) assessed by software-based analysis of CT scans and WAL(%) as estimated by global lung ultrasound (LUS) score in the 216 COVID-19 pneumonia patients included in the study. The Spearman correlation coefficient (rs) with its 95%CI and statistical significance are shown. Abbreviations: CT: computed tomography; HU: Hounsfield Units; LUS: lung ultrasound; NAL: non-aerated lung volume; PAL: poorly aerated lung volume; WAL: well-aerated lung volume

Based on these findings, an equation was developed to predict WAL(%) from global LUS score (Fig. 1 panel B).

Treatment-related parameters and outcomes

No patient was lost to follow-up and there were no missing data for evaluated end-points.

Patients initiated NIV and PP within 24 h of admission to the Subintensive Care Unit. Over the initial 48 h of treatment, NIV was delivered continuously or until intubation and only brief interruptions were allowed for eventual adjustments and nursing care, lasting no more than few minutes; subsequently, daily breaks, lasting no more than 2 h, were allowed for meals and nursing care, depending on patient clinical condition and tolerance.

In PP and supine groups, ventilatory settings were similar among groups at baseline and remained unchanged from timepoints sp0 to sp1 (Table 1).

Daily hours of NIV and PP, the duration of the longest PP session, daily PP sessions and days of PP therapy are reported in Table 1.

Compared with supine group, PP therapy was associated with improved gas exchange and ultrasonographic parameters and clinical outcomes (Supplementary Fig. 56, Supplementary Table 4).

Effect of PP therapy on MPWAL

PP therapy was associated with a significant MPWAL reduction: MPWAL decreased by a median 34% (IQR: 30–38%) during the first PP session (timepoint pp1) as compared with pre-proning values (timepoint sp0) (Fig. 2 panel A–D). MPWAL reduction during 1st PP session was driven mainly by a RR decrease and secondly by a modest (5%; 2–7%) WAL increase (Fig. 2A–D). The magnitude of WAL % increase during 1st PP session was consistent with recent reports [40-42].

Fig. 2
figure 2figure 2figure 2

Panel AD: time course of MPWAL, MP and their components during the initial 7 days of NIV in the PP (n = 108) and supine (n = 108) group. N at risk and median (IQR) values of PEEP, Pressure support (Ps) and well-aerated lung volume (WAL) are shown at the bottom of panel A. # P < 0.01 vs. baseline (same group); § P < 0.01 vs. baseline (both groups) † P < 0.001 vs. baseline (both groups) ǁ P < 0.01 between PP and supine groups ¶ P < 0.001 between PP and supine groups. In the box plots the middle line represents the median observed value, boxes represent the interquartile range (IQR), whiskers extend to the most extreme observed values with 1.5 times the interquartile range of the nearer quartile, and dots represent observed values outside that range. The connecting line connects the mean values within each box plot at different time points. For VTe, RR, MV, MP, MPWAL time weighted–average of hourly values during each session (supine/prone) of NIV are presented. For MV, we verified consistency between time-average MV values and values obtained at the time of ABG (which were used to calculate Dead Space Indices): the comparison confirmed a good agreement between the two values, with p-value > 0.7 for comparison at all timepoints (not shown). The timepoints are indicated as follows: sp0: supine position timepoint 0 (baseline, after NIV initiation).In PP group, the session preceded the first PP session. ABGs were performed ≥ 1 h after supine NIV initiation, after achieving ventilatory stability; pp1: first PP session in PP group. ABGs were performed ≥ 1 h after prone NIV initiation, after achieving ventilatory stability; sp1: supine position timepoint 1 (after the initial 24 h of NIV); in the PP group this corresponded to the resupination after the 1st PP session) on day 1. pp2: during the 2nd PP session; sp2: supine position timepoint 2 (after the 2nd day of NIV, in the PP group this also corresponded to the resupination after the 2nd PP session on day 2; pp3 during the 3rd PP session; sp3 supine position timepoint 3 (after the 3rd day of NIV, in the PP group this also corresponded to the resupination after the 3rd PP session on day 3. pp4 during the 4th PP sessionx; sp4 supine position timepoint 4 (after the 4th day of NIV, in the PP group this also corresponded to the resupination after the 4th PP session on day 4. pp5 during the 5th PP session; sp5 supine position timepoint 5 (after the 5th day of NIV, in the PP group this also corresponded the 5th PP session on day 5. pp6 during the 6th PP session; sp6 supine position timepoint 6 (after the 6th day of NIV, in the PP group this also corresponded to the resupination after the 6th PP session on day 6. pp7 during the 7th PP session; sp7 supine position timepoint 7 (after the 7th day of NIV, in the PP group this also corresponded to the resupination after the 7th PP session on day 7. This schedule allowed comparing gas exchange responses among group at day 1, after 24 h of NIV, in the same (supine) position, after taking into account the effect of the first overnight PP session (timepoint pp1) in the PP groups

Daily MPWAL swings between prone and supine position subsided after day 5(timepoint pp5)(P supine vs. prone > 0.05), suggesting no additional effect on MPWAL from further PP days (Fig. 2A).

Relationship of early (day 1) power measures with clinical outcomes

We first assessed the relationship between different power measures at day 1 and clinical outcomes (NIV failure, death) (Fig. 3A–C): we observed a progressive, linear, statistically significant increase in NIV failure and death rates across each MPWAL quartile, but not across MP and MPPBW quartiles, which rather showed a steep increase in the rate of clinical outcomes in the 3-4th quartile as compared with the lower 2 quartiles.

The investigation of mutual relationship between MP, MPPBW and MPWAL at day 1 disclosed a the weak correlation between MPWAL and the other power measures, indicating for any given value of MP delivered by NIV to the respiratory system, the energy per unit volume of WAL varied widely (Fig. 3D–E).

Fig. 3
figure 3figure 3figure 3figure 3

Panel AB: relationship between MP, MPPBW and MPWAL during day 1 of NIV in the whole study population (n = 216). Panel CD: quartiles of MP (Panel A), MPPBW (Panel B) and MPWAL (Panel C) and 28-d NIV failure and death(%) in the whole study population (n = 216). Median (range) of MP, of components of MP and of well aerated lung (WAL) volumes are displayed at the bottom of each graph. ⁕p < 0.05 vs. other quartiles for NIV failure and death; ǂp < 0.001 vs other quartiles for NIV failure and death

Early (day 1) MPWAL is independently associated with clinical outcomes

We then categorized the whole study population based on median MPWAL delivered during the initial 24 h of NIV (day 1) into high MPWAL (i.e. MPWAL ≥ 9.1 J/min/L) or low MPWAL (i.e., MPWAL < 9.1 J/min/L): patients receiving a low MPWAL(day 1) showed lower 28-d NIV failure, intubation and death rates than those receiving a high MPWAL(day 1) (Table 1, Fig. 4).

Fig. 4
figure 4figure 4

Cumulative incidence of Noninvasive Ventilation (NIV) failure (Panel A), death (Panel B) and endotracheal intubation (Panel C) at 28 days after enrollment in patients grouped according to high (≥ 9.1 J/min/L) vs low (< 9.1 J/min/L) MPWAL at day 1. In the endotracheal intubation group, patients with a Do-Not-Intubate (DNI) disposition were excluded

In Cox proportional multivariable models, MPWAL (day 1) predicted 28-d NIV failure (HR 1.682; 95%CI: 1.147–2.412, p = 0.003), intubation (HR 1.521; 95%CI: 1.193–2.314, p = 0.009) and death (HR 1.697; 95%CI: 1.217–2.313, p = 0.002) independently of baseline illness and lung disease severity and of other ventilatory variables (Table 2).

Table 2 Cox multivariable analysis of predictors of 28-d NV failure, death and endotracheal intubation (ETI) (n = 216)

Predictive performance of MPWAL(day 1) for clinical outcomes

Each ventilatory variable at day 1 was regressed on NIV failure and mortality in the whole study population: MPWAL(day 1) showed the highest AUROC for predicting 28-d NIV failure (AUROC 0.89; 95%CI: 0.85–0.93, p < 0.001) and death (AUROC 0.89; 95%CI: 0.85–0.94, p < 0.001), outperforming all other ventilatory parameters and WAL (p < 0.011 for both comparisons by DeLong’s test) (Table 3, Fig. 5). Additionally, MPWAL (day 1) optimal thresholds for both NIV failure and death overlapped.

Table 3 Discrimination of ventilator parameters measured during the 1st 24 h of NIV)(day 1) in predicting 28-d NIV failure and death in the whole study population (n = 216)
Fig. 5
figure 5

Time-course of different ventilatory (Panel A-B), gas exchange(Panel C-F), ultrasonographic (Panel G-N) and inflammatory (Panel O-P) parameters during the initial 7 days of NIV in the PP and supine groups, grouped according to low MPWAL or high MPWAL during the 1st 24 h of NIV (day 1) (n = 216). * P < 0.05 vs baseline (same group) # P < 0.01 vs. baseline (same group) § P < 0.01 vs. baseline (both groups) † P < 0.001 vs. baseline (both groups) ǁ P < 0.01 between groups ¶ P < 0.001 between groups # P < 0.001 ‡

Time course of ventilatory and other physio-anatomical variables during the initial 7 days of NIV

Over the initial 7 days there were missing data in the physiological parameters due to the occurrence of NIV failure or success and subsequent Unit discharge. Because these data were not missing at random but due to the consequence of treatment effect, we did not perform multiple imputation and excluded missing values from analyses.

Figure 6 shows time course of different ventilatory, ultrasonographic and inflammatory parameters in PP and supine patients receiving either high or low MPWAL(day 1).

Fig. 6
figure 6figure 6figure 6figure 6figure 6figure 6figure 6figure 6

Dose–response relationship between Mechanical Power normalized to well-aerated lung volume during the 1st 24 h of NIV [MP WAL(day 1)] and physio-anatomical responses after the 1st 24 h of NIV(∆sp0-1) in the whole study population (n = 216). PP patients are represented by black full dots, supine patients are represented by empty triangles. In the plots, the green area denotes a positive gas exchange (O2 or CO2) response, the red area a negative gas exchange response. -O2 response: an increase in paO2/FiO2 ratio during NIV in supine position at day 1 (timepoint sp1) as compared with NIV supine at timepoint sp0: i.e., pO2/FiO2sp1—pO2/FiO2sp0 > 0, or ∆paO2/FiO2sp0-1 > 0. -CO2 response: an increase in CO2 clearance, defined by a reduction in dead space indices (VR and MVcorr, tested separately) at sp1 as compared with timepoint sp0 [VRsp1-VRsp0 < 0 (or ∆VRsp0-1 < 0)]

Within each treatment group (PP or supine), median daily MPWAL remained relatively constant during initial days of NIV (Fig. 6A–B); gas exchange parameters (paO2/FiO2 ratio, VR, MVcorr), ultrasonographic parameters (global LUS score, global reaeration score, WAL %) and inflammatory biomarkers (CRP, procalcitonin) improved significantly over the initial 24 h of NIV (day 1, timepoint sp0-1) only in patients receiving a low MPWAL, but not in those receiving a high MPWAL (Table 1, Fig. 6C–P). The differences between low and high MPWAL group observed at day 1 were magnified during the initial 7 days of NIV (Table 1, Fig. 6).

Impact of early (day 1) MPWAL on physio-anatomical responses

We next explored potential mechanisms underlying the association of MPWAL(day 1) with the observed clinical outcomes.

In the whole study population, early MPWAL(day 1) correlated significantly with O2 response, CO2 response ultrasonographic responses and C-reactive protein changes at resupination (timepoint sp1) in univariate analysis (Fig. 7). The observed associations were confirmed by multivariable analysis, where MPWAL(day 1) independently predicted ∆VRsp0-1(β = 0.43, p = 0.011), ∆global LUSsp0-1(β = 0.41, p = 0.018), global reaeration scoresp1(β = -0.49, p = 0.009), ∆WALsp0-1(β = 0.41, p = 0.018) and ∆C-reactive proteinsp0-1(β = 0.38, p = 0.029) (Table 4).

Fig. 7
figure 7figure 7figure 7

Dose–response relationship between Mechanical Power normalized to well-aerated lung volume during the 1st 24 h of NIV [MP WAL(day 1)] and physio-anatomical responses after the 1st 24 h of NIV(∆sp0-1) in the whole study population (n = 216). PP patients are represented by black full dots, supine patients are represented by empty triangles. Green area denotes a positive response, red area a negative one

Table 4 Multivariable linear regression analysis of predictors of CO2-response (by ventilatory ratio, VR), ultrasonographic response and C-reactive protein responses in the whole study population(n = 216)

Discussion

In the first report exploring MPWAL in COVID-19 AHRF treated with prone or supine NIV, we found that.

  1. 1)

    PP was associated with a significant MPWAL reduction, attributable mainly to an absolute MP reduction and secondly to a relief in the “intensity” of the power delivered to the aerated lung through enhanced lung reaeration (Fig. 2).

  2. 2)

    MPWAL delivered during the first 24 h of NIV [MPWAL(day 1)] outperformed other power measures and ventilatory variables in clinical and pathophysiological outcome prediction.

  3. 3)

    MPWAL(day 1) independently predicted different markers of VILI, including gas exchange parameters, ultrasonographic and inflammatory biomarkers, in both supine and proning patients.

Diferent mechanical ventilatory variables have been proposed to affect clinical outcomes and to contribute to VILI, all of which have been separately addressed in experimental or clinical studies [1-3]. The MP represents the energy delivered to the respiratory system by mechanical ventilation, which results from a combination of ventilatory variables. Therefore, MP might have a higher predictive value for patient-centered outcomes, as different combinations of its components may yield a similar damage to the lung [4-8]. The ability to combine the effect of time-varying ventilatory variables seems particularly attractive in spontaneously breathing patients.

Our findings suggest MP delivered early after NIV initiation predicts both NIV failure and mortality. However, the power should be normalized to the size of for the size of well-inflated lung to get an optimal cut-off that could be used in clinical practice, as VILI originates from the interaction between the MP delivered to the respiratory system by the ventilation and the anatomical and pathophysiological characteristics of the latter. Notably, we demonstrated that bedside LUS could be used to accurately estimates WAL size, consistent with data from nonCOVID-19 ARDS [29].

Lastly, we showed that early PP may reduce MPWAL while improving oxygenation and CO2 clearance during NIV. Therefore, early PP therapy may be a more “energetically” advantageous strategy to relieve hypoxemia than PEEP uptitration, which may overdistend the lung and increase MP and lung injury when it does not recruit atelectatic lung tissue [8, 35]. The association of PEEP levels and NIV duration with COVID-19 pneumonia severity and mortality is consistent with this view [11-13].

Our analysis may have several clinical implications for noninvasive AHRF management.

First, it suggests PP can effectively reduce MP delivery. Second, they suggest early MPWAL calculation may help define the trade-offs between the benefits and risks of noninvasive ventilatory support, helping to titrate NIV to the least possible energy to achieve oxygenation and enhancing prompt NIV up-grade to invasive mechanical ventilation (IMV) to prevent VILI progression. Both these issues warrant evaluation in future ad hoc trials enrolling different AHRF etiologies.

Strengths and limitations

Strengths of our report are the thorough patient assessment, the hourly ventilatory data collection, which reflect actual power delivery during each ventilatory session more accurately than once daily data, and the separate assessment of supine and prone ventilatory modality. Limitations of our analysis include its post hoc nature and the absence of randomization in the original study. Furthermore, we focused on ventilator-delivered energy and did not assess the energy transferred from respiratory muscles to the lung during inspiration. However, consistent with COVID-19 literature. our patients had no clinical-radiological evidence for increased inspiratory effort or reduced lung compliance [14-17], and Pes swings, an index of inspiratory effort, are unrelated to disease course and unaffected by PP [43], while MPWAL predicted physio-anatomical and clinical outcomes in our patient population.

Conclusions and expert recommendations

Concern has been raised that mechanical ventilation may promote VILI and worsen outcome in spontaneously breathing patients with AHRF. Different ventilatory variables have been proposed to contribute to VILI progression, but no effective predictive, preventive and patient-centered model has been developed [44-48].

Based on our findings we suggest early quantitation of well-aeraled lung volume before NIV initiation in each patient with AHRF candidate to NIV, through CT scan analysis or at least through lung ultrasound, which in our series was at least as accurate for WAL quantitation as free open source software.

After NIV initiation, the value of MP delivered to the patient by NIV could be estimated from routine ventilatory parameters using available formulas and adjusted for individual WAL volume to estimate MPWAL: if patient’s MPWAL exceeds suggested safey threshold (9.1 J/min/L), VILI could be prevented by implementing PP, which reduced MPWAL by an average of 34% in our series.

If PP fails to reduce MPWAL below safety threshold, NIV upgrade to invasive ventilation should be considered to achieve complete ventilatory control and reduce MPWAL and the risk of VILI.

The added value of this approach in noninvasively ventilated AHRF patients is to provide an accurate tool for early prediction of the risk of VILI and adverse outcomes(i.e., MPWAL at day 1), an effective mean to prevent or reduce the risk of VILI (i.e., prone position adoption). Furthermore, this approach should be personalized, i.e. the MP delivered by the ventilator to patient’s respiratory system should be adjusted for individual WAL size, which varies widely across AHRF patients and could be calculated by lung imaging techniques.

This integrated approach, which showed accuracy in our study, needs prospective validation in independent cohorts of patients with AHRF of diverse etiologies.