Background

In the operating room, intensive care unit, and emergency department, the incidence of difficult airway (DA) is high, threatening the lives of patients [1,2,3,4,5,6]. DA is defined as difficulty in establishing artificial ventilation under general anesthesia and emergency conditions, which has been widely studied [7,8,9,10,11]. Even though the assessment of DA is necessary before intubation, the predictive ability of existing methods is very limited and cannot meet clinical needs, because of many affecting factors [4,5,6]. Thus, improving the DA prediction methods has great clinical significance.

At present, methods such as interincisor distance (IID), thyromental distance (TMD), modified Mallampati test (MMT) score, and upper lip bite test (ULBT) score are mostly used in clinical practice [12]. However, these indicators have limited performance, low sensitivity and specificity, and low positive predictive values [13, 14]. Recent studies have suggested that anatomical measurement of upper airway using ultrasonography could improve the DA prediction performance [15,16,17,18]. For examples, by use of ultrasonography, temporomandibular joint (TMJ) mobility [19] and tongue thickness (TT) [20] have been successfully used to predict DAs. However, DA associates with the whole anatomy of upper airway structure, and a single indicator is not sufficient to reflect the functionality of upper airway and cannot accurately predict DA. To overcome the predicting inability of single factor, some studies combined multiple indicators to improve DA prediction, for examples, the “3–3-2” rule [21], modified “look, evaluate, Mallampati score, obstruction, and neck mobility” (LEMON) criteria [22], and “Wilson” scores [23]. Although the combination of multiple indicators improves the sensitivity of DA prediction to a certain extent, at the same time these methods also reduce the specificity of DA prediction. Furthermore, the simple data processing method, ignoring the numerical size, has certain limitation in dealing with complicated clinical conditions, which may lead to loss of valuable information. Even in ultrasonography-guided methods, the absence of valuable information is still a major drawback of DA risk prediction.

Of the available risk assessment models, the nomogram predictive model combines the parameters by endowing different weights, and provides a highly accurate, evidence-based, individualized risk estimation, thus facilitating clinical decision-making [24,25,26,27,28]. Therefore, we proposed a hypothesis that nomogram can improve the prediction ability of difficult tracheal intubation (DTI) and difficult laryngoscopy (DL). In this study, using a clinical trial data, we established a nomogram model for DA prediction, which could significantly improve the DA prediction and reduce the harm of DA.

Methods

All participants were from a prospective observational DA study conducted at the Yijishan Hospital of Wannan Medical College, which has been described previously [20, 29]. In brief, the study enrolled 2254 patients who underwent selective surgery and general anesthesia with endotracheal intubation from May 2014 to May 2016 and aimed to explore risk factors of DTI and DL. The study was approved by the ethics committee of the First Affiliated Hospital of Wannan Medical College [No. (2013) 91] and registered at the China Clinical Trial Registration Center (Registration No. ChiCTR-RCS-14004539). Written informed consent was obtained from the patients before enrollment. All methods were performed in accordance with the relevant guidelines and regulations.

  1. 1)

    Inclusion and exclusion criteria:

Patients undergoing the elective operation, general anesthesia and tracheal intubation who were aged > 18 years old and had an American Society of Anesthesiologists (ASA) grade of I-III were included. We excluded patients with a confirmed history of DA or maxillofacial trauma and cervical spine injury; patients with maxillofacial, orolingual, or cervical tumors; patients with subglottic stenosis affecting tracheal intubation; patients with nasotracheal intubation or double-lumen endotracheal intubation; and patients with incomplete clinical information.

  1. 2)

    Airway assessment method

Classical predictors of DA:

Before the induction of general anesthesia, the IID, MMT score, ULBT score, TMD, height and weight of patients were evaluated or measured. Evaluation using classical methods was conducted by specialist staff, who with more than five years of clinical experience in performing intubations and airway assessment.

Measurement of TMJ mobility by ultrasonography [19]:

TMJ mobility was measured before anesthesia by a high-frequency linear array probe placed in front of the ear. The patients were asked to open and close their mouths, and the distance of anterior and inferior sliding of the mandibular condyle was measured dynamically.

Measurement of TT by ultrasonography [20]:

The patients were guided in the supine position, the head was stretched, the sniffing position was held, and the tongue was relaxed. A low-frequency convex array probe was placed in the middle sagittal plane of the chin; one end was placed flat in the middle of the mandible, and the other end was pointed at the thyroid cartilage. The direction of the probe was adjusted to be perpendicular to the skin. The distance between the submental skin and tongue surface, that is, the thickness of the tongue body, was measured by ultrasound.

The ultrasonography evaluation was conducted by a physician with ultrasound experience at least 40 times training, which was proved with good reliability among different sonographers [19].

  1. 3)

    General anesthesia induction program

The general anesthesia induction program included midazolam 0.02–0.05 mg/kg, fentanyl 3–4 μg/kg, vecuronium 0.1 mg/kg, propofol 1–2 mg/kg. Tracheal intubation was performed after administration of muscle relaxants by experienced anesthesiologists. The patient is in the supine sniffing position. A laryngoscope handle and a Macintosh laryngoscope blade (Sirius XL, Basildon, UK) were used for laryngoscopy. Endotracheal tube type and size were determined according to patient and surgical conditions. After tracheal intubation completed, mechanical ventilation was performed. If a DA is encountered, the anesthesiologist immediately seeks help from the DA management team. The following strategies were provided for difficult airways: video laryngoscope (UE, China)-assisted intubation, laryngeal mask endotracheal intubation or ventilation, fibrebronchoscopy-guided intubation, lighted stylets or light wands, and percutaneous thyrocricocentesis to establish a surgical airway rapidly [20, 29].

  1. 4)

    Study endpoints

The primary endpoint was DTI prediction capability and the secondary was DL prediction capability.

DL was defined as Cormack-Lehane grade III-IV laryngoscopy; that is, when laryngoscopy was performed, only a grade III or grade IV epiglottis could be observed. Cormack-Lehane grade is as follows: Grade I, most of the glottis is visible; Grade II, only the posterior extremity of glottis is visible; Grade III, no glottis can be seen but only the epiglottis; Grade IV, no epiglottis can be seen [30].

DTI was defined as intubation requiring up to three times or an intubation time of more than 10 min.

Endotracheal intubation was performed by anesthesiologists with more than five years of clinical experience who were unaware of the results of the ultrasonographic detection of the DA prediction indicators.

Statistical analysis

The χ2 test was used for categorical variables with Bonferroni correction for inter-group comparison. The continuous variables included in this study were skewed distribution by normality test, so Mann-Whitney U-test was used for the continuous variables. Restricted cubic spline (RCS) or the receiver operating characteristic (ROC) curve were used to selected different segmentation criteria for continuous data according to their quantile distribution in positive and negative groups of DL and DTI. And then all the influencing factors after pretreatment were used for the prediction models of DL and DTI respectively. We choose the variables with P value less than 0.1 of univariate logistic regression by the Akaike information criterion (AIC) stepwise analysis for the further multivariate logistic regression. Nomograms were constructed to predict DL and DTL based on the multivariate logistic regression results, and the corresponding scorecards were output. Finally, ROC curves were used to evaluate the accuracy of the nomograms. Statistical analyses were conducted with R (version 3.6.0, R Development Core Team) and SAS (version 9.4, SAS Institute Inc., Cary, NC).

Results

A total of 2254 patients, including 1059 males and 1195 females, were included in this study; 142 (6.30%) patients had DL, and 51 (2.26%) patients had DTI (Table S1). The distribution of continuous variables in DL and DTI was shown in Table S2 and Table S3. Results for RCS and ROC curves for continuous variables segmentation of DL and DTI were presented in Fig. S1, S2, S3, S4, S5, S6, S7, S8 and S9. Univariate logistic regressions of DL and DTI according to RCS were shown in Table S4 and Table S5.

According to the results of univariate logistic regression analysis, ULBT, MMT, sex, TMJ, age, BMI, TMD, IID, and TT were included in multivariate logistic regression analysis. As shown in Table 1, nine variables were considered as independent risk factors for DL, such as ULBT grade III [odds ratio (OR) = 4.114, 95% CI: 2.114–8.005, P < 0.001], MMT grade IV (OR = 2.461; 95% CI: 1.411–4.292; P = 0.002), TT > 67 mm (OR = 4.525; 95% CI: 1.918–10.672; P < 0.001). Five variables were considered as independent risk factors for DTI, such as ULBT grade III (OR = 5.078; 95% CI: 1.451–17.771; P < 0.001), TT > 62 mm (OR = 29.451; 95% CI: 6.212–139.622; P < 0.001) (Table 2).

Table 1 Univariate logistic regression and multivariate logistic analysis showing the association of variables with difficult laryngoscopy
Table 2 Univariate logistic regression and multivariate logistic analysis showing the association of variables with difficult tracheal intubation

The nomogram to predict DL was created based on the following 9 independent factors (Fig. 1): ULBT grade (I, II or III), MMT grade (I/II, III or IV), sex, TMJ (< 12 or ≥ 12 mm), age (< 36, 36–51, ≥ 52 y), BMI (< 18.5, 18.5–24, 24–27, 27–30, ≥ 30), TMD (< 65, 65–78, ≥ 78 mm), IID (< 40, ≥ 40 mm), and TT (< 60, 60–67, ≥ 67 mm). Nomogram for DTI prediction was shown in Fig. 3, including the 5 independent factors: ULBT grade (I, II or III), TMJ (< 11, ≥ 11 mm), age (< 32, 32–57, ≥ 58 y), IID (< 36, 35–40, ≥ 40 mm), and TT (< 55, 55–62, > 62 mm). Score cards of the DL and DTI was shown in Table S6. The higher total points obtained from the sum of the points for each factor in the nomogram means the greater possibility of DL or DTI. For example, a 60 years old (≥ 58 y) patient with ULBT grade II, TMJ 10 mm (< 11 mm), IID 42 mm (≥ 40 mm), TT 65 mm (> 62 mm) was given a total of 244 points for DTI (30 points for age, 25 points for ULBT grade II, 100 points for TMJ, 0 points for IID, and 89 points for TT), thus the predicted DTI risk was about 20% (Fig. 2, Table S6).

Fig. 1
figure 1

Nomogram predicting difficult laryngoscopy in patients underwent general anesthesia tracheal intubation. Abbreviations: ULBT, upper lip bite test; MMT, modified Mallampati test; TMJ, temporomandibular joint mobility by mandibular condylar mobility measured by ultrasonography; BMI, body mass index; TMD, thyromental distance; IID, interincisor distance; TT, tongue thickness measured by ultrasonography; DL, difficult laryngoscopy

Fig. 2
figure 2

Nomogram predicting difficult tracheal intubation in patients underwent general anesthesia tracheal intubation. Abbreviations: ULBT, upper lip bite test; MMT, modified Mallampati test; TMJ, temporomandibular joint mobility by mandibular condylar mobility measured by ultrasonography; IID, interincisor distance; TT, tongue thickness measured by ultrasonography; DTI, difficult tracheal intubation

To verify the performance of the model, the ROC curve was constructed. The areas under the ROC curves were 0.933 (95% CI: 0.912–0.954) and 0.974 (95% CI: 0.954–0.995) for DL and DTI, respectively (Fig. 3, Table S7).

Fig. 3
figure 3

The areas under the receiver operating characteristic curves (AUCs) of nomograms for difficult laryngoscopy and difficult tracheal intubation

Discussion

In this study, nomograms were developed to predict the risk of DL and DTI. These nomograms were constructed by combination of traditional DA assessment methods (such as ULBT, MMT, IID) and ultrasound airway assessment methods (such as TMJ, TT), which have achieved good performance with high AUC values. To our knowledge, this study is the first report that combines traditional DA assessment and ultrasonography-guided airway assessment by nomograms to predict DA.

In our nomogram models, the DL prediction nomogram incorporated 9 variables (ULBT, MMT, sex, TMJ, age, BMI, TMD, IID and TT), and the DTI prediction nomogram included 5 variables (ULBT, TMJ, age, IID and TT). Among these variables, ULBT, MMT, sex, age, BMI, TMD, IID were traditional DA assessment methods while TMJ and TT were ultrasonography-assisted airway assessment methods. In both DL and DTI nomograms, the performance of ultrasonography-assisted airway assessments was good. For example, TT has high nomogram scores in both DL (> 67 mm, 48 points) and DTI (> 62 mm, 89 points) prediction. In particular, TMJ has the highest scores in both DL and DTI nomograms. Our previous study [19, 20] found that TMJ and TT measured by ultrasound can be used to predict DA. TMJ measured by ultrasound performs better predication for DL, compared to traditional DA assessment methods such as IID, ULBT, TMD and MMT. However, due to small sample size (484 patients with 41 DL cases), the performance of TMJ to predict DTI was not assessed [19]. In current study, we showed that TMJ has the highest nomogram scores in both DL (< 12 mm, 100 points) and DTI (< 11 mm, 100 points) prediction, confirming the predictive performance for DA. In a previous study, we have shown that TT is another useful DA predictor [20]. Taken together, our findings indicated that the ultrasonography-assisted DA prediction methods are effective. Hence, we incorporated the TMJ and TT indicators measurement by ultrasound into the nomograms for DL and DTI prediction.

Many studies have attempted to combine indicators to improve predictive capability of DA. Most of these models were created by simple addition of indicators, such as the “3–3-2” rule [21], LEMON criteria [22], and “Wilson” scores [23]. In consistence with these studies, previously we found that the 3–3 rule (IID less than three fingers, a hyoidmental distance less than three fingers) is useful for DA prediction, with AUC 0.709 for DL and 0.822 for DTI [31]. In addition, by using the ratio of TT to TMD, the AUCs for DL and DTI could be significantly improved to 0.75 (95%CI, 0.73–0.76) and 0.86 (95%CI, 0.84–0.87), which are better than that of TT, TMD and MMT alone [20]. In current study, when nomograms were used, the AUCs were increased to 0.933 [95% CI, 0.912–0.954] and 0.974 (95% CI, 0.954–0.995) for DL and DTI, respectively. These data indicated that the nomograms are more powerful tools to predict DA.

The nomograms in this study are easy to implement in routinely clinical practice. As long as nine predictors (ULBT, MMT, Sex, TMJ, Age, BMI, TMD, IID, and TT) were collected, the incidence of DTI and DL in patients could be assessed by the nomograms. Among the nine predictive indicators, ULBT, MMT, Sex, Age, BMI, TMD and IID are all classic indicators commonly used in clinical practice, and the data are very easy to obtain. Furthermore, TT and TMJ can be well measured by conventional clinical ultrasound equipment, and previous studies have shown that the reliability of measurements between different sonographers is comparable [19]. In the actual use of the nomogram, we only need to convert the corresponding predictor value into the corresponding nomogram score value, and then add the score values to obtain the total score. Then the risk incidence corresponding to the total score is obtained, as described in the results section. The operation of nomogram is simple and intuitive, does not require complicated calculation, is less time-consuming, is very convenient to use, and can be quickly popularized.

This study has some limitations. Firstly, data were from a single center and the sample size of DTI and DL patients was relatively small. Secondly, as the patients in this work were all Asian, potential bias and influencing factors must be considered when the models are used for patients in Western countries. Therefore, multicenter, multiracial studies, especially international multicenter research are needed. Thirdly, the ultrasound measurement method adopted in this study may have certain obstacles in clinical application, and not every hospital has the conditions for ultrasound assessment of DA. Finally, more streamlined and efficient methods, and more advanced algorithms are needed to improve the models further.

Conclusions

Nomograms based on ultrasound airway assessment effectively predicted DL and DTI in patients. The nomogram is an ideal model for DA prediction which may be helpful for patients’ safety who underwent general anesthesia and tracheal intubation surgery.