Introduction

Supraglottic airway devices (SADs) have been widely used as alternative to tracheal intubation because they can be inserted conveniently and without tracheal intubation associated complications1. Since the introduction of the classic laryngeal mask airway (C-LMA)2, at least 10 types of SADs with novel materials and designs have been developed3. These SADs may have better efficacy than the C-LMA if they have a higher oropharyngeal leak pressure (OLP) and a lower risk of first-attempt insertion failure or airway complications2.

Randomized trials have compared only some of the existing SAD types. Moreover, few studies have directly compared all types of SADs, and those that did included a small number of patients. Conducting a trial to compare all types of SADs is exorbitant and impractical. Traditional meta-analyses on some SADs have been conducted, but they tended to pool different types of SADs into a single group, thus precluding the assessment of their clinical performance individually4,5,6. Therefore, which SAD has the best efficacy remains unclear.

A network meta-analysis incorporates direct and indirect evidence into one statistical framework to compare multiple treatments simultaneously, yielding consistent estimates of relative treatment efficacy7,8,9. We therefore conducted a systematic review and network meta-analysis to evaluate the efficacy of SADs in terms of OLP, the risk of first-attempt insertion failure, postoperative sore throat rate (POST) and other efficacy-associated outcomes, including overall insertion failure rate during induction, poor function after successful insertion, SAD failure during maintenance, hypoxia and aspiration.

Results

Figure 1 presents a flow chart of the trial selection process in 108 eligible randomized controlled trials met our inclusion criteria. Figures 2, 3 and 4 illustrate the network of included studies on the OLP, risk of first-attempt insertion failure, and POST. Table 1 summarizes the characteristics of the 17 types of SADs included in this study, and Supplementary Table S1 summarizes the clinical and methodological characteristics and the main outcomes of each trial.

Figure 1
figure 1

Summary of trial identification and selection. OLP, oropharyngeal leak pressure.

Figure 2
figure 2

Network of eligible supraglottic airway device comparisons for oropharyngeal leak pressure. The line indicates the number of trials and the dots indicate the number of patients. C-LMA, LMA Classic; U-LMA, Unique LMA; Protector, LMA Protector Airway; AuraGain, Ambu AuraGain Disposable Laryngeal Mask; P-LMA, Proseal LMA; I-gel, I-gel supraglottic airway; S-LMA, Supreme LMA; SLIPA, Streamlined Liner of the Pharynx Airway; Ambu-O, Ambu AuraOnce; Air-Q, Air-Q Masked Laryngeal Airway; SoftSeal, The Portex Soft Seal Laryngeal Mask; Solus, Solus Standard Laryngeal Mask Airway; Cobra, Cobra Perilaryngeal Airway; LT, Laryngeal Tube; LTSII, Laryngeal Tube Suction II; LTS-D, Laryngeal Tube Disposable; LTS, Laryngeal Tube Sonda.

Figure 3
figure 3

Network of eligible supraglottic airway device comparisons for the risk of first-attempt insertion failure. The line indicates the number of trials and the dots indicate the number of patients. C-LMA, LMA Classic; U-LMA, Unique LMA; Protector, LMA Protector Airway; AuraGain, Ambu AuraGain Disposable Laryngeal Mask; P-LMA, Proseal LMA; I-gel, I-gel supraglottic airway; S-LMA, Supreme LMA; SLIPA, Streamlined Liner of the Pharynx Airway; Ambu-O, Ambu AuraOnce; Air-Q, Air-Q Masked Laryngeal Airway; SoftSeal, The Portex Soft Seal Laryngeal Mask; Solus, Solus Standard Laryngeal Mask Airway; Cobra, Cobra Perilaryngeal Airway; LT, Laryngeal Tube; LTSII, Laryngeal Tube Suction II; LTS-D, Laryngeal Tube Disposable; LTS, Laryngeal Tube Sonda.

Figure 4
figure 4

Network of eligible supraglottic airway device comparisons for postoperative sore throat rate. The line indicates the number of trials and the dots indicate the number of patients. C-LMA, LMA Classic; U-LMA, Unique LMA; Protector, LMA Protector Airway; AuraGain, Ambu AuraGain Disposable Laryngeal Mask; P-LMA, Proseal LMA; I-gel, I-gel supraglottic airway; S-LMA, Supreme LMA; SLIPA, Streamlined Liner of the Pharynx Airway; Ambu-O, Ambu AuraOnce; Air-Q, Air-Q Masked Laryngeal Airway; SoftSeal, The Portex Soft Seal Laryngeal Mask; Solus, Solus Standard Laryngeal Mask Airway; Cobra, Cobra Perilaryngeal Airway; LT, Laryngeal Tube; LTSII, Laryngeal Tube Suction II; LTS-D, Laryngeal Tube Disposable; LTS, Laryngeal Tube Sonda.

Table 1 The characteristics of supraglottic airway devices.

Oropharyngeal leak pressure (OLP)

OLP data were obtained of 7784 adult patients in 75 trials, including 17 types of SADs. The pooled estimates of differences in OLP in the network meta-analysis are displayed in the upper triangle of Table 2, whereas the lower triangle of Table 2 presented the results of traditional pairwise meta-analyses. The Protector laryngeal mask airway (Protector) and Ambu AuraGain Disposable Laryngeal Mask (AuraGain) achieved the greatest highest and most improved OLP compared with that of the C-LMA, with mean differences (95% confidence interval) of 9.18 (5.60, 12.75) and 7.65 (3.63,11.67) cmH2O, respectively. The Proseal laryngeal mask airway (P-LMA), I-gel Supraglottic Airway (I-gel), Supreme laryngeal mask airway (S-LMA) and Streamlined Liner of the Pharynx Airway (SLIPA) had significantly higher OLP than that of the C-LMA, with mean differences ranging from 3.98 to 6.72 cmH2O. The Portex Soft Seal Laryngeal Mask (SoftSeal), Air-Q Masked Laryngeal Airway (Air-Q), and Cobra Perilaryngeal Airway (Cobra) also achieved higher OLP than did the C-LMA, with mean differences ranging from 4.55 to 6.72 cmH2O. In the laryngeal tube category, the Laryngeal Tube (LT) and the Laryngeal Tube Suction II (LTS-II), but not the Laryngeal Tube Disposable (LTS-D) or Laryngeal Tube Sonda (LTS), exhibited significantly greater OLP (5.94 to 7.25 cmH2O higher) than that of the C-LMA. The Unique laryngeal mask airway (U-LMA), Solus Standard Laryngeal Mask Airway (Solus), Ambu AuraOnce (Ambu-O), LTS-D and LTS exhibited non-significantly higher OLP than that of C-LMA, with mean differences ranging from − 0.46 to 4.16 cmH2O. The Protector had the highest surface under the cumulative ranking curve (SUCRA) values among the SADs (Supplementary Table S2), whereas the U-LMA and C-LMA had the lowest SUCRA values.

Table 2 Results for oropharyngeal leak pressure from network meta-analysis.

First-attempt insertion failure

Data on the first-attempt insertion failure were obtained for 10,191 adult patients in 102 trials with 17 types of SAD. The results of the network meta-analysis are presented in the upper triangle of in Table 3. The lower triangle in Table 3 presents the results of traditional pairwise meta-analyses. The risk of first-attempt insertion failure of each of the SADs was comparable to that of the C-LMA. The S-LMA and Ambu-O had significantly lower risks of first-attempt insertion failure than that of the P-LMA. The S-LMA also achieved a significantly lower risk than that of the I-gel. The P-LMA, Cobra and LTS achieved significantly higher risks than that of the U-LMA. The Cobra and LTS demonstrated significantly higher risks than that of the S-LMA. In terms of SUCRA values, the Ambu-O and S-LMA were ranked the highest, the AuraGain was ranked fifth and the Protector, was ranked seventh (Supplementary Table S4).

Table 3 Results for the risk of first-attempt insertion failure from network meta-analysis and pairwise meta-analysis.

Postoperative sore throat rate (POST) within 24 h and other secondary efficacy-related outcomes

Data on POST were obtained for 4125 adult patients in 37 trials, with 16 types of SADs. POST-related data on the Air-Q were unavailable. POST ranged from 0 to 50%. The results of the network meta-analysis are presented in the upper triangle of Table 4; none of the SAD exhibited a significant difference compared with the C-LMA. The S-LMA and Cobra had significantly higher POSTs than that of the AuraGain. The Cobra also attained a significantly higher POST than that of the Ambu-O. The lower triangle of Table 4 presents the results of the traditional pairwise analysis. The SADs with the highest ranking and lowest POST were the AuraGain and Cobra (Supplementary Table S7). The results regarding the overall insertion failure rate during induction, poor function after successful insertion, device failure during maintenance, hypoxia, and aspiration are presented in Supplementary Tables S10 to S14. Because the evidence on these outcomes is limited and includes many zero events, a network meta-analysis was considered infeasible.

Table 4 Results for postoperative sore throat rate from network meta-analysis and pairwise meta-analysis.

Meta-regression of the effect of using the neuromuscular blocking agents on OLP, risk of first-attempt insertion failure and POST, and effect of positive pressure ventilation on POST

Assessment of the transitivity assumption was performed for the use of neuromuscular blocking agents (NMBAs) (Supplementary Fig. S4). Meta-regression analysis revealed that using NMBAs was negatively associated with the risk of first-attempt insertion failure, and positively associated with OLP, but not associated with POST. The risk of first-attempt insertion failure of using NMBAs in combination with the P-LMA and I-gel was lower than the risk of first-attempt insertion failure of any of the SADs alone. The use of NMBAs in combination with the SLIPA, Solus Standard Laryngeal Mask Airway (Solus), Cobra and LTS-D achieved even higher OLP. Supplementary Tables S20 to S25 present the results of subgroup analysis stratified according to the use of NMBA. Meta-regression analysis indicated the positive pressure ventilation was not associated with POST (P = 0.97).

Publication bias

Comparison-adjusted funnel plots (Supplementary Figs. S8 to S10) and Egger’s regression test for study outcomes (OLP: P = 0.68; the risk of first-attempt insertion failure: P = 0.10; POST: P = 0.81) indicated that no small study bias was present.

Inconsistency

The design-by-treatment interaction model (P = 0.01) and loop inconsistency model (P = 0.002) were inconsistent within the network meta-analysis for the risk of first-attempt insertion failure. Significant inconsistencies between the results of direct and indirect comparisons were observed for the following comparisons: the C-LMA and LT (P < 0.001), the Protector and AuraGain (P = 0.001), the Protector and S-LMA (P = 0.004), the AuraGain and S-LMA (P = 0.01), the P-LMA and SLIPA (P = 0.03) and the P-LMA and LT (P = 0.004). For POST, we observed an inconsistency between direct and indirect comparisons of the C-LMA and P-LMA (P = 0.002) and the C-LMA and I-gel (P = 0.03). Finally, no evidence of inconsistency between direct and indirect comparisons was observed for OLP.

Other results

The value of kappa was 0.76. The results of sensitivity analysis of the risk of bias are presented in Supplementary Tables S3, S6, and S9. The results of the grading of recommendation, assessment, development, and evaluation (GRADE) are presented in Supplementary Tables 17, 18 and 19.

Discussion

Compared with the C-LMA, many SADs achieved a significantly higher OLP, with a mean difference ranging from 3.98 to 9.18 cmH2O. The Protector achieved the highest OLP and ranked the best among the SADs. All SAD exhibited similar risks of first-attempt insertion failure, and not significant differences in POST compared with those of the C-LMA. Evidences on other secondary efficacy-related outcomes was limited due to extremely low incidence rates. Although the Protector achieved the highest OLP, its risk of first-attempt insertion failure and POST were similar to those of the C-LMA.

The Protector had the highest OLP, followed by AuraGain, relatively newly developed SAD. The high OLP of the Protector may be attributable to the fact that it is made of medical grade silicone. Moreover, its curve airway tube and inflatable cuff may allow it to better conforms to the anatomical contours of an individual’s hypopharynx10. Other SADs also attained better OLP than that of the C-LMA, and this finding is consistent with that of a recent network meta-analysis of SADs for pediatric patients, which indicated that the I-gel, Cobra and P-LMA achieved 3.4 to 4.6 cmH2O higher OLP than that of the C-LMA11. Our results also revealed substantial differences in the efficacy of the SADs: this could not be observed in previous meta-analyses that pooled various SADs into one group4,5,6. In addition, some of the studies included in these meta-analyses used nonstandard methods of measuring OLP or did not clearly describe their measurement method. By contrast, our network meta-analysis evaluated these SADs separately and only included randomized controlled trials that measured OLP through plateau pressure measurement, thereby providing more accurate estimates.

Our results revealed that all SADs exhibited similar risks of first-attempt insertion failure to that of the C-LMA, consistent with the results of a previous network meta-analysis on pediatric patients11. Unlike previous studies6, we did not use the criterion of “ease of insertion” because it is a highly subjective, operator-dependent assessment; therefore, we used first-attempt insertion failure instead.

Our analysis indicated that all SADs had a POST comparable to that of the C-LMA in contrast to the results of previous meta-analyses4,5,12,13. This discrepancy may be because other meta-analyses have pooled data on patients who had received laparoscopic surgery and those from crossover studies when estimating the POST. We excluded crossover studies account for the carry-over effect. Moreover, patients receiving laparoscopic surgery were also excluded due to the higher OLP required, which would result in a greater risk of perilaryngeal tissue trauma.

Our results also demonstrated that adjunctive NMBAs had a significant effect. However, NMBAs exerted a significant effects on the risk of first-attempt insertion failure and OLP, which disagree with the findings of a previous network meta-analysis in which an NMBA was found to have no significant effects on OLP, the POST, or the risk of first-attempt insertion failure11.

Our study has several strengths. Rather than grouping various SADs into one group, we assessed the efficacy of each SAD individually and compares them within the same evidence base. Assessment of both OLP and the risk of first-attempt insertion failure according to SUCRA ranking may provide more complete information on SADs, which can help in selecting the appropriate SAD. This network meta-analysis also enabled us to compare therapies indirectly when no head-to-head trial had been performed and obtain more precise effect estimates by jointly assessing direct and indirect comparisons8. Furthermore, we also analysed major and efficacy-associated outcomes. The results of our analysis also provide updated evidence, with implications for using SADs in low-risk adult patients.

This study had some limitation. First, the data in our network meta-analysis were mostly derived from low risk, elective, nonobese patients with a low risk of aspiration. Second, some of our efficacy-related outcomes were reported by a limited number of studies with zero events, thereby resulting in greater uncertainty in our assessment of these outcomes. Third, some of the included studies did not report on NMBA use with SADs, affecting the strength of the meta-regression results on NMBA effects. Fourth, we performed a grey literature search but did not identify additional relevant studies that met our inclusion criteria. Fifth, many anesthetic agents have been developed, and anesthetic strategies have changed over the last 30 years since the C-LMA was developed. We included 108 studies in our network meta-analysis, and this large number of studies may have diluted the impact of the different anesthetic agents within our comparisons. Sixth, the number of patients in our network meta-analysis who were treated with the relatively newer SADs, such as the AuraGain, was much lower than number of patients treated with other SADs, such as the P-LMA or I-gel. A lower number of patients yields a wider confidence interval. More studies are therefore required to confirm the efficacy of these new devices.

In conclusion, SADs including the Proseal LMA, I-gel, Supreme LMA, Streamlined Liner of the Pharynx Airway, SoftSeal, Cobra Perilaryngeal Airway, Air-Q, Laryngeal Tube, and Laryngeal Tube Suction II (LTS-II), Laryngeal Tube Suction Disposable (LTS-D), Protector, and AuraGain, achieve significantly higher OLP and similar risks of first-attempt insertion failure compared with those of the C-LMA. Our data indicate that the Protector may be the best SAD because it achieves the highest OLP and a similar risks of first-attempt insertion failure and similar POST to those of C-LMA.

Methods

Data source and search strategy

The protocol for this network meta-analysis was registered with PROSPERO (number: CRD42017065273). We searched the EMBASE, Cochrane Central Register of Controlled Trials, and PubMed databases from their inception to December 2019 for randomized controlled trials that compared at least two types of SADs. We applied no language restrictions and performed a manual literature search. We searched for additional studies in the reference lists of all identified publications, including relevant meta-analyses and systematic reviews. We used the following keywords: I-gel, ProSeal LMA (P-LMA), Classic LMA (C-LMA), Supreme LMA (S-LMA), AuraOnce (Ambu-O), single-use Ambu LMA, Unique LMA (U-LMA), Streamlined Liner of the Pharynx Airway (SLIPA), Solus LMA (Solus), Portex Soft Seal LMA (SoftSeal), Air-Q LMA (Air-Q), Cobra Perilaryngeal Airway (Cobra), Laryngeal Tube (LT), Laryngeal Tube Suction (LTS-II), Laryngeal Tube Disposable (LTS-D), Laryngeal Tube Sonda (LTS), Ambu AuraGain Disposable Laryngeal Mask (AuraGain), LMA Protector Airway (Protector), oropharyngeal leak pressure (OLP), overall insertion failure during the induction, first-time insertion failure, insertion failure at first attempt, failure of the device during the maintenance, improper ventilation during the maintenance, ease of insertion, hypoxia, complications, sore throat, pulmonary aspiration, and aspiration. Details of the search strategy are provided in Supplementary Table S26 and Supplementary Fig. S11.

Study selection

We included randomized controlled trials on patients, who received elective surgeries under general anesthesia and had an indication for SAD insertion with an American Society of Anesthesiologists (ASA) score of I to III, body mass index (BMI) < 40, and age > 18 years. The SADs identified in the literature included the C-LMA, I-gel, P-LMA, S-LMA, Ambu-O, U-LMA, SLIPA, Solus, SoftSeal, Air-Q, Cobra, LT, LTS-II, LTS-D, LTS, AuraGain, and Protector. We used the C-LMA as the reference because it was the first SAD14. The included trials compared at least two SADs and reported one of the following primary or secondary outcomes. The primary outcomes were OLP, the risk of first-attempt insertion failure, and POST within 24 h. The secondary outcomes were overall insertion failure rate, successful insertion but with poor function, device failure during maintenance, aspiration and hypoxia. When OLP was assessed at several time points, we extracted the data recorded immediately after SAD insertion. When OLP was assessed in different head positions, we extracted the data recorded in the neutral head position. The overall insertion failure rate was measured during induction. Successful insertion but with poor function was defined as the SAD need for reinsertion, reposition, manipulation, or failure during intubation following successful insertion. Device failure during maintenance was defined as the need to change to another airway device due to laryngospasm or a hiccup, SAD of a different size was then used in case of poor sealing pressure and the SAD was inserted if necessary during maintenance. Hypoxia was regarded as any episode of hypoxia, hypoxemia or desaturation. The POST was evaluated simply in terms of the record of sore throat within 24 h of surgery. We excluded crossover studies in our analysis of the POST because the causative device could not be ascertained. When analyzing device failure, we excluded studies in which the SAD was not used for intraoperative airway management. We excluded studies in which OLP, rate of device failure during maintenance, hypoxia, aspiration, and POST were measured during laparoscopic surgery because the condition of such surgeries may affect these parameters. However, the OLP measured before the laparoscopic condition in the trials was still extracted in our meta-analysis. Higher OLP may be required for adequate sealing during laparoscopic surgery15, resulting in increased risks of device failure, hypoxia, and aspiration16. Laparoscopic position than in a supine position17, leading to a higher risk of ischemic injury at the oropharyngeal mucosa18. Therefore, we excluded trials that measured the POST after laparoscopic surgery. In addition, we extracted studies that measured OLP according to plateau pressure. To measure OLP, the expiratory valve of the circle system is closed at a fixed gas flow and the airway pressure is recorded.

The following studies were excluded: (1) duplicate publications; (2) animal or manikin experimental studies; (3) studies with unclear patient conditions (i.e., no clear definition of BMI or ASA); (4) studies in which the size of the selected SAD was not specified ( or in which clinicians did not follow the manufacturers' instructions; (5) studies with a lack of outcome data; (6) studies with unclear descriptions of outcome measurements; (7) studies involving patients with a higher aspiration risk, such as pregnant patients, patients with a history of gastroesophageal reflux, and those with an insufficient fasting time; and (8) studies of patients undergoing emergency surgery.

Data extraction and quality assessment

Two investigators (CJ Lai and YC Yeh) independently screened all titles and abstracts and evaluated relevant articles. If a study was deemed eligible by any reviewer, then it was included for full-text review. The two investigators then independently assessed the full texts of the studies, and any disagreement was resolved through consensus among the members of the study team. These two investigators extracted and entered information regarding the following aspects into an electronic database: study design, patient characteristics, interventions, comparisons and outcomes (OLP, the risk of first-attempt insertion failure, POST, insertion failure during induction, device failure during maintenance, poor device function after successful insertion, aspiration, and hypoxia). When the relevant information on design or outcomes was unclear, we contacted the original authors for clarifications. The two investigators also independently evaluated the methodological quality of eligible trials by using the Cochrane Collaboration’s tool for assessing the risk of bias19. Disagreements in the evaluation were resolved through consultation with another investigator (YKT). The kappa statistic was used to evaluate the interrater agreement20.

Data synthesis and analysis

We conducted network meta-analyses by using the random effects model proposed by Lu and Ades but implemented within the frequentist statistical framework21. Risk ratios were used to measure the relative treatment effect on the risk of first-attempt insertion failure (failed or successful) and the occurrence of postoperative sore throat (yes/no), weighted mean differences were used to measure changes in OLP (cmH2O). For crossover trials, we used the adjusting variance approach to address correlations between different procedures within trials22. We evaluated the transitivity assumption by comparing the distribution of NMBAs used across studies. We also conducted a meta-regression analysis to assess its impact on our network meta-analysis. We used meta-regression analysis to evaluate whether the injection of NMBAs affected the risk of first-attempt insertion failure, OLP, or POST outcomes, and we performed a subgroup analysis stratified according to NMBA use. We also used a meta-regression to evaluate whether positive pressure ventilation affected the POST. The restricted maximum likelihood method and DerSimonian-Laird method were used to estimate random effect models for network meta-analysis and traditional pairwise meta-analyses, respectively23. A risk of bias evaluation and sensitivity analysis were also performed24. We further applied the GRADE approach (grading of recommendation, assessment, development, and evaluation) to assess the quality of evidence regarding the primary outcomes (OLP, the risk of first-attempt insertion failure, POST)25.

For each outcome, we calculated the ranking probabilities for the different SADs. The SUCRA is a percentage of the mean rank of each treatment. The greater the area under the curve, the higher the ranking of the intervention; that is, the better its performance26.

Publication bias

A comparison-adjusted funnel plot and Egger’s regression test were used to test for publication bias. An asymmetrical funnel plot and the obtainment of P < 0.05 for Egger’s test suggested small study bias.

Inconsistency

To evaluate potential inconsistency between the results of direct and indirect comparisons within the network meta-analysis, we used three models: design-by-treatment interaction, the loop-specific inconsistency, and the node-splitting27,28,29.

All statistical analyses in this study were performed with Stata (version 15.0, StataCorp, College Station, TX).