Background

The treatment of severe pneumonia and acute respiratory distress syndrome (ARDS) following the coronavirus disease (COVID-19) pandemic is becoming a challenge [1]. Progressive respiratory failure develops in many patients with severe COVID-19, soon after the onset of dyspnea and hypoxemia. These patients commonly meet the ARDS criteria, defined as acute onset of bilateral infiltrates, severe hypoxemia, and lung edema that is not fully explained by cardiac failure or fluid overload [2, 3]. In severe and critical COVID-19, it is clinically difficult to determine the appropriate timing for invasive mechanical ventilation, resulting in the provision of different treatments based on by physicians’ own experiences and preferences.

Some expert recommendations based on previous studies have suggested early intubation for patients with severe and critical COVID-19, aimed to protect health personnel from cross-infection, reduce complications from tracheal intubation, and prevent self-induced lung injury (SILI) [4,5,6,7]. SILI generates an early phase of ARDS; the high transpulmonary pressures associated with spontaneous vigorous inspiratory effort may contribute to lung damage before the patient becomes fatigued or is sedated [8,9,10]. To prevent lung injury, it is recommended that SILI is prevented in the early stage of ARDS through various approaches such as supplemental oxygen, continuous positive airway pressure (CPAP), non-invasive ventilation (NIV), high-flow nasal cannula (HFNC), awake prone positioning, and target non-vigorous breathing. This is the rationale for the need for early tracheal intubation [9, 10].

Early intubation may pose a risk of generating viral aerosols and self-induced lung injury. Moreover, the background for early intubation is still based on theoretical physiology, and the clinical evidence of early intubation is still not fully considered [11,12,13]. Although several studies on the prognostic difference between early and late intubations have been reported [14,15,16,17], there are inevitable differences in the primary outcome, definition of early and late intubations, and the study designs, resulting in inconsistent results. Therefore, international guidelines, including those of the National Institute of Health, World Health Organization, and Australia, reveal no clear timeline for the recommendation on the application of mechanical ventilation (MV) among patients with severe COVID-19 [18,19,20]. Therefore, we aimed to explore the effect of the timing of MV on the clinical course and prognosis of patients with severe and critical COVID-19 through a systematic review and meta-analysis.

Methods

This study was conducted according to the recommendations outlined in the Preferred Reporting Items for Systematic Reviews and Meta-Analyses 2020 guidelines [21] (Additional File 1). This study was registered in the Prospective Register of Systematic Reviews on 16 February, 2022 (registration number CRD42022311122).

Eligibility criteria

The inclusion criteria were as follows: (1) population: studies targeting patients with severe COVID-19; (2) intervention and comparator: studies comparing early intubation to late intubation; (3) outcomes: studies reporting clinical outcomes (in-hospital mortality, length of stay, duration of MV, etc.); (4) studies published after 2020; (5) a randomized clinical trial or an observational study with a comparator group; and (6) full-text articles in English or Korean. The exclusion criteria were as follows: (1) studies that did not target patients with confirmed COVID-19, (2) studies that did not compare early and late intubation, (3) studies that did not report our outcomes of interest, and (4) duplicated studies.

Information sources and search strategy

We searched the following electronic databases: international databases (Ovid MEDLINE, Ovid EMBASE, the Cochrane Central Register of Controlled Trials), and the Korean domestic database (KMBASE) on July 15, 2021. Since new evidence on early intubation of patients COVID-19 patients is continuously published, we performed the search monthly from September 10, 2021 to February 10, 2022 to update the included studies. We searched Ovid MEDLINE for updates and reference lists of previously published related reviews. We used both controlled terms and text words such as (2019-nCoV OR COVID-19 OR Wuhan) AND (intubation, intratracheal OR intubation, endotracheal OR early intubation OR early endotracheal intubation). The search strategy is presented in Additional File 2.

Selection process

Two pairs of authors (HJL, JoK, WIC, and JJ) screened the title and abstract of retrieved citations using Covidence (https://www.covidence.org/). Each author independently assessed the eligibility of the identified studies, and conflicts resolved by discussion. Full texts were assessed by two authors (HJL and JoK) for the final decision on inclusion or exclusion. Any disagreement between the two authors was resolved through a discussion with a third author (MC).

Data items and extraction

The following data were extracted from the eligible studies using an electronic spreadsheet (Microsoft Excel) of a pre-designed data extraction form: author, publication year, study design, study country, study setting, COVID-19 severity, number in each arm, timing of intubation, definition of early intubation, and outcomes of interest. One author (JP) extracted the data extraction, and another two authors (JJ and WIC) independently evaluated the data.

Study outcomes

The primary outcome was all-cause mortality during hospitalization. The secondary outcomes were length of stay (LOS) in the intensive care unit (ICU), duration of MV, hospital LOS, ICU-free days, and ventilator-free days.

Study risk of bias assessment

A validated tool was used to evaluate the risk of bias according to the study design. The Risk of Bias Assessment Tool for Non-randomized Studies (RoBANS) 2.0 [22], which is an update of RoBANS 1.0 [23] was used for nonrandomized studies. This tool contains eight domains, including the possibility of target group comparisons, target group selection, confounders, exposure measurement, blinding of assessors, outcome assessment, incomplete outcome data, and selective outcome reporting. Each domain was assessed as having low, high, or unclear risk of bias. Quality assessments of the studies were conducted by two authors (WIC and JJ) independently, and disagreements were resolved through a discussion with a third author (MC).

Effect measures and synthesis methods

Based on the data extraction results, a meta-analysis was performed as follows. Relative risks (RR) with 95% confidence interval (CI) for discrete outcome data and mean differences (MD) with 95% CI for continuous outcome data were calculated using the random-effects model because of the heterogeneity across studies. Statistical significance was set at p < 0.05. To assess between-study heterogeneity, we constructed forest plots and calculated I2 statistics, with a value of > 75% considered as high heterogeneity [24]. A subgroup analysis was performed based on the definition of early intubation (intubated before 24 h or 48 h from the index time [ARDS onset or ICU admission]) and based on the index time for intubation in each study group. To synthesize the data, we used Review Manager 5.4 (RevMan, The Cochrane collaboration, Oxford, UK) software for the meta-analysis.

Certainty of evidence assessment

We used the Grading of Recommendations, Assessment, Development, and Evaluation (GRADE) [25] to assess the certainty of evidence of in-hospital mortality, LOS in the ICU, MV duration, and hospital LOS. The GRADE includes five reasons for rating down the certainty in effect estimates (risk of bias, imprecision, inconsistency, indirectness, and publication bias) and three reasons for rating up certainty. Two authors assessed (WIC and JJ) the certainty of evidence as high, moderate, low, or very low, and discrepancies were resolved through discussions with a third author (MC).

Results

Study selection

A total of 1610 records were identified in July 2021 through the search strategy, and 153 duplicate records were removed before screening. Sixty-six records were subsequently included through monthly searches until February, 2022, totaling 1523 studies. Of the 1523 records, 1,469 were excluded after screening the titles and abstracts. Subsequently, the full texts of 54 reports were reviewed. After reviewing the eligibility of the original texts, 12 cohort studies [14,15,16,17, 26,27,28,29,30,31,32,33] were included, and there were no randomized controlled trials (Fig. 1). The list of excluded studies and reasons for exclusion are presented (Additional file 3).

Fig. 1
figure 1

PRISMA flowchart

Study characteristics

The characteristics of the included studies are summarized in Table 1. Four prospective cohort studies [16, 27, 31, 33] included 504 patients (sample size range, 32–205), and eight retrospective cohort studies [14, 15, 17, 26, 28,29,30, 32] included 2,339 patients (sample size range, 39–1618). Six studies [14, 15, 26, 28, 30, 32] were performed in the US, three [27, 31, 33] in Europe, two [17, 29] in Asia, and one [16] in Chile. Seven studies [14,15,16,17, 26, 30, 33] were single-center studies, while the others were multi-center studies. The index time for intubation and the definition of early intubation varied. The index time for intubation was defined according to the onset of acute respiratory failure in five studies [15, 26, 29, 30, 33] and ICU admission in three studies [17, 28, 31]. Five studies [14, 26, 28, 29, 31] defined early intubation as intubation within 24 h and five studies [16, 17, 27, 30, 32] as within 48 h from the index time.

Table 1 The basic characteristics of studies included in this review

Risk of bias in studies

Four studies [14, 16, 26, 27] were rated as having a high risk of bias in the domain of possibility of target group comparisons, ten studies [14,15,16,17, 26, 28,29,30, 32, 33] as high risk of bias in the domain of target group selection, and five studies [14, 15, 17, 30, 33] as high risk of bias in the domain of confounders (Fig. 2). Although there were some concerns in the domain of target group selection, serious problems did not occur because the domains of exposure measurement, blinding of assessors, and outcome assessment were assessed as having a low risk of bias.

Fig. 2
figure 2

Risk of bias of included studies. a Risk of bias graph: review authors' judgements about each risk of bias item presented as percentages across all included studies. b Risk of bias summary: review authors' judgements about each risk of bias item for each included study

Primary outcome

In-hospital mortality

Nine studies reported in-hospital mortality. Of the nine studies, eights studies [14, 15, 17, 26, 28,29,30,31] reporting numbers of death were included for quantitative synthesis and one study [32] reporting hazard ratio (HR) was not synthesized. Overall, the incidence of in-hospital mortality was 43.9% (215/490) in the early intubation group and 52.8% (161/305) in the late intubation group. In-hospital mortality was similar between the early and late intubation groups (RR, 0.91; 95% CI 0.75–1.10; P = 0.32; I2 = 33%; very low certainty of evidence; Fig. 3). Consistent results were obtained for subgroup analysis based on the definition of early intubation (< 24 h or < 48 h) (Fig. 4) and the index time (ARDS onset or ICU admission) except for the subgroup intubated before 48 h from the index time of ICU admission (Fig. 5). Parish et al. [32] reported that early intubation was not significantly associated with differences in mortality (HR, 1.09; 95% CI, 0.94 to 1.26; P = 0.26).

Fig. 3
figure 3

In-hospital mortality

Fig. 4
figure 4

Subgroup analysis of in-hospital mortality by definition of early intubation as < 24 h or < 48 h from index time

Fig. 5
figure 5

Subgroup analysis of in-hospital mortality by index time in studies defining early intubation as < 24 h (a) or < 48 h (b). a Subgroup analysis of in-hospital mortality by index time in studies defining early intubation as < 24 h. b Subgroup analysis of in-hospital mortality by index time in studies defining early intubation as < 48 h

Secondary outcomes

ICU length of stay

Nine studies reported LOS in the ICU [14,15,16,17, 26, 28,29,30,31]. We found no significant difference according to the timing of intubation (978 patients; MD, −1.77 days; 95% CI −4.61 to 1.07; P = 0.22; I2 = 78%; very low certainty of evidence; Fig. 6).

Fig. 6
figure 6

Length of stay in the intensive care unit

Duration of mechanical ventilation

Nine studies reported the duration of mechanical ventilation [14,15,16,17, 26,27,28,29,30]. We found no significant difference according to the timing of intubation (1066 patients; MD, −0.03 days; 95%, CI −1.79 to 1.72; P = 0.97; I2 = 49%; very low certainty of evidence; Fig. 7).

Fig. 7
figure 7

Duration of mechanical ventilation

Hospital length of stay

Six studies reported hospital LOS [14,15,16, 26, 27, 30]. Early intubation compared to late intubation reduced the hospital LOS (738 patients; MD, −4.32 days, 95% CI −7.20 to −1.44; P = 0.003; I2 = 45%; very low certainty of evidence; Additional file 4: Fig. S5). However, the subgroup analysis based on the definition of early intubation (< 24 h or < 48 h) showed no difference between early and late intubation (Additional file 4: Fig. S6).

ICU-free days and ventilator-free days

One study involving 32 patients reported ICU-free days [33]. Early intubation (median, 0 day; IQR, 0–16 days) was not significantly associated with lesser ICU-free days than that observed for late intubation (median, 0 day; IQR, 0–4 days) (P = 0.39). Four studies reported ventilator-free days [16, 29, 31, 33]. We found no evidence of a difference in ventilator-free days according to the timing of intubation (344 patients; MD, 0.94 day; 95% CI −4.56 to 6.43; P = 0.74; I2 = 54%; Additional file 4: Fig. S7).

The GRADE summary of findings table of in-hospital mortality, ICU LOS, duration of mechanical ventilation, and hospital LOS is reported in Table 2.

Table 2 GRADE summary of findings table of in-hospital mortality, ICU LOS, duration of mechanical ventilation, and hospital LOS

Discussion

In the present study, the key primary outcome, in-hospital mortality, did not differ between the early and late intubation groups. To our knowledge, this study is the most comprehensive meta-analysis of outcomes for the timing of intubation in patients with severe and critical COVID-19 who needed MV. To date, the rationale for early intubation in patients with COVID-19 remains unclear, and individual studies have assessed the appropriate timing of intubation. From the literature reviews, we found that the exact definition of the timing of MV differs across studies. Therefore, we adopted a specific classification in our study for the timing of MV based on the inconsistent results from a previous systematic review performed last year [34]. In addition, more recent studies were included in this review.

The definition of early intubation from the reviewed studies remains unclear. In clinical practice, prediction of clinical deterioration and the time when MV is required for patients is difficult. Moreover, the timing of ICU admission differed according to the nature of the clinical setting in medical facilities. However, the consideration of MV after respiratory failure, including ARDS, is a major predictive factor for ICU admission and potential MV.

According to ICU guidelines, ICU admission is required for patients requiring hourly and/or invasive monitoring or those with respiratory failure considering for MV. Among them, those with ARDS and severe pneumonia are typically admitted to the ICU [35]. As stated in ICU guidelines, despite differences in resources (available clinical expertise, bed availability, etc.), a triage for ICU admission is recommended. As a result, most cases coincide with the time of detection of ARDS and entrance to the ICU.

Management of ARDS is generally supportive, consisting of MV, prevention of stress ulcers and venous thromboembolism, and nutritional support while addressing the underlying etiology. ICU care is usually followed by the detection of ARDS or the potential for ARDS management [36,37,38,39]. For instance, a prospective study in Hong Kong investigated the impact on infection control and performance according to intubation time in critically ill patients with COVID-19. Since patients who required 10 L or more of oxygen at baseline were initially eligible and close monitoring was required, early or late intubation was decided after admission to the ICU [40]. As the Spo2/Fio2 ratio reflects the Pao2/Fio2 ratio in patients with ARDS, the study population who needed more than 10 L or more oxygen also met the criteria for hypoxemia in ARDS [41]. Moreover, applying NIV or HFNC was proposed in guidelines to be useful in the treatment of ARDS or acute respiratory failure in patients with COVID-19, requiring close monitoring in the ICU and should be progressed to intubation in case of no improvement within 2 h of NIV or HFNC [42, 43]. Therefore, we can assume that the approximating the time between ARDS detection and ICU admission as the primary index for the potential time for MV would be clinically reasonable, considering the COVID-19 pandemic.

However, in our study, we specified the studies that reported the estimation of the timing of intubation from ARDS occurrence [26, 29, 30] or from ICU admission [17, 28, 31]. There was no difference in the results for in-hospital mortality, the primary outcome, even when the index time for ARDS onset or ICU admission was specified in the subgroup analysis (Fig. 5). The strength of this study is that we included studies that reported a clear index time for estimating early or late intubation.

The present study also explored specific clinical outcomes. Heterogeneity among the studies regarding the medical circumstances and pandemic situation was inevitable, resulting in specific consideration of the critical care of patients and utilization of the ICU. In the subgroup analysis of the index time of intubation, there was no difference in in-hospital mortality according to the timing of intubation in critically ill patients with COVID-19. Of these, hospital LOS seemed to be shortened by early intubation, and subgroup analysis based on the definition of early intubation (< 24 h or < 48 h) showed no difference in hospital LOS between early and late intubation.

We also explored studies that reported ICU mortality and 28-day mortality in addition to in-hospital mortality. However, the number of studies that reported ICU [16, 27, 33] and 28-day mortalities [16, 33] was relatively low, and inconsistent indexes were applied in defining early and late intubation: one [33] was from the time of ARDS onset, one [16] was from the time of hospital admission, and another [27] was from the time of first respiratory support. The summarized results for ICU and 28-day mortality rates were difficult to interpret. (Additional file 4: Fig. S8 and Fig. S9).

This study had several limitations. First, relatively few studies were eligible for analysis, and the results showed heterogeneity between the studies. Therefore, further research is required. Second, the results were limited to observational studies because tracheal intubation as an intervention is practically difficult and impossible. Third, the differences in conditions of ICU care according to medical facilities or countries could not be fully considered in the analysis due to the lack of considerable studies in each identified country. Nevertheless, the results of this study will provide important implications for physicians to decide on the timing of intubation for critically ill patients with COVID-19 requiring ICU care.

Conclusions

This study showed that there was no difference in in-hospital mortality between early and late intubation groups. The secondary outcomes, including ICU LOS, duration of mechanical ventilation, hospital LOS, ICU-free days, and ventilator-free days, did not also differ between the two groups. The decision of physicians who determine the critical care for each critically ill patient with COVID-19 is still important. Further prospective studies would be necessary to support the results with strengthening the certainty of evidence.