Introduction

Obstructive Sleep Apnea Syndrome (OSAS) is a condition affecting 14 % of males and 5 % of females in adults 30–70 years of age [1]. OSAS is characterized by recurrent partial or complete obstruction of the upper airway (UA) during sleep, leading to cessation of airflow, intermittent and recurrent hypoxemia and sleep fragmentation. The sites of obstruction are generally divided in three anatomical areas, which are the velopharynx (VP), the base of the tongue (BOT) and the lateral pharyngeal walls (LPW).

A sleep study is required to diagnose obstructive sleep apnea syndrome and determine its severity. Unfortunately, sleep studies are labor-intensive and waiting times to get a sleep study in a hospital can be very long, even up to a few years. Therefore, diagnosis and treatment of the condition can be delayed significantly. This can be a real concern, especially if a patient has moderate to severe OSAS and is left untreated while waiting for the test. It is still a challenge for physicians to identify which patients would need a more urgent sleep study based on history or questionnaires alone. This study looked at the ability of the endoscopic Mueller maneuver (MM) to help predict the severity of OSAS based on UA collapse at its different levels.

The MM is a minimally invasive examination tool which has significant value due to its ease of use in clinical settings. The maneuver is done on an awake patient and can be conducted in less than a two-minute time span. The MM has mixed evidence regarding its effectiveness of predicting OSAS. Studies such as Woodson and Haganuma [2], as well as Friedman et al. [3] found no positive correlation between MM performance and OSAS. In contrast to these findings, Dreher et al. [4] found the Mueller maneuver to be predictive of OSAS based on the degree of obstruction at base of tongue and velum. The authors rationalized the divergent findings to be due to possible differences in patient cooperation as a result of the MM performed on awake patients. In the present study, the goal is to determine how well the MM maneuver can predict OSAS by observing how airway obstruction at the levels of the VP, BOT and LPW during the procedure relates to the severity of OSAS, as measured by AHI.

Methods

This study is a retrospective chart review of 506 consecutive patients referred to a McGill University Affiliated Teaching Site otolaryngology office in Montreal, Canada for snoring, and/or suspected obstructive sleep apnea syndrome. The review (CR14-62) has been approved by the Research Ethics Committee at the Sir Mortimer B. Davis-Jewish General Hospital.

All patients completed an Epworth Sleepiness Scale (ESS) [5] and had a full head and neck examination by one of three experienced physicians, including flexible nasopharyngolaryngoscopy and MM. Patients then underwent a level 3 cardiorespiratory polygraphy that was scored using American Academy of Sleep Medicine (AASM) criteria [6]. These studies were done using a Philips Stardust II Sleep Recorder. The recordings included monitoring of respiratory effort, oro-nasal pressure, oximetry, body position, snoring sound and pulse rate channels. The number of central apneas, obstructive apneas, mixed apneas and hypopneas were measured and added to form the apnea-hypopnea index. For hypopnea, the definition used was a reduction in nasal airflow ≥30 % with an associated desaturation ≥ 3 %. The severity of OSAS was graded following a scale based on the AHI: 5 ≤ AHI < 15 (mild), 15 ≤ AHI < 30 (moderate) and 30 ≤ AHI (severe).

The measures investigated were the relations between the three sites of obstruction based on the Mueller maneuver and BMI, and OSAS severity (AHI). The relation of OSAS severity with ESS was also investigated.

Data analysis was done using SAS, version 9.3 (Cary, NC). Continuous variables were presented as means with a standard deviation (SD). The correlation between the sites of obstruction with BMI, ESS, and AHI was done using a Spearman’s correlation test using a 95 % confidence interval (CI). A multivariate multiple logistic regression analysis was done to determine if the degree of collapse using the MM predicted OSA severity and the effects of BMI, ESS and age on these associations were analyzed.

Patients who underwent a Mueller maneuver and a sleep study were included in the study. No patients were excluded on the basis of gender, age, or BMI. Patients who did not meet criteria for OSAS diagnosis (AHI <5) were excluded from the study. Patients’ charts lacking data from the MM examination of airway obstruction were excluded.

There were no differences in demographic (age, BMI), ESS, or sleep characteristics (AHI) between the patients who lacked data on airway obstruction and those who were included in the study. There was also no bias towards taking measurements only in patients that had some degree of baseline airway narrowing at the UA levels.

Endoscopic Mueller maneuver

The Mueller maneuver attempts to emulate the collapse of the UA during sleep [7]. A flexible laryngoscope is inserted through the nose of the patient into his UA and the patient is asked to perform a series of reverse Valsalvas. The examination is of short duration (less than 2 min) and is recorded. All recordings of the examinations for each patient was saved and backed up on a hard drive for later use by the scorers. The scorers were able to go back to the recordings after the consultation with the patient to insure the accuracy of their scores on the Mueller maneuver. Assessors estimated a precise percentage score for the degree of obstruction of the VP, BOT, and LPW. The scores were then sorted semi-quantitatively following a scale based on the percentage of obstruction: ≤50 % (minimal), 51–75 % (moderate) and 76–100 % (severe).

Data analysis

The data from the sleep studies and MMs were collated along with the age, gender, height, weight, and BMI. A Spearman’s correlation coefficient was calculated to evaluate the association between the areas of collapse and the severity of OSAS. If a correlation were to be found, it would then be concluded that the area of obstruction that correlated would be considered as contributing to the severity of OSAS.

Results

There were 399 male (79 %) and 107 female (21 %) subjects. The mean age was 49 years old (SD = 12) and ages ranged from 16 to 90 years. The mean BMI was 29 kg/m2 (SD = 5) and ranged from 19 to 55 kg/m2. The mean ESS was 9 (SD = 5) and ranged from 0 to 24.

BMI and VP collapse correlate, r = 0.15 (95 % CI; 0.063, 0.24). BMI and BOT collapse correlate, r = 0.31 (95 % CI; 0.22, 0.38). BMI and LPW collapse correlate, r = 0.25 (95 % CI; 0.15, 0.34). Overall, BMI and AHI correlate, r = 0.42 (95 % CI; 0.35, 0.49).

At the VP, females had a mean collapse of 86 % (SD = 15), while males had a mean of 85 % (SD = 16). At the base of tongue, females had a mean collapse of 60 % (SD = 20), while males had a mean of 65 % (SD = 20). At the lateral pharyngeal wall, females had a mean collapse of 55 % (SD = 22), while males had a mean of 60 % (SD = 22). Overall, females had a mean AHI of 20 (SD = 18), while males had a mean AHI of 27 (SD = 20).

Sleep study screening findings

AHI values ranged from 5 to 122 events per hour with a mean of 26 (SD = 20) events per hour. Of the 506 total patients, 194 (38 %) had mild OSAS, 163 (32 %) had moderate OSAS, and 149 (30 %) had severe OSAS (Table 1).

Table 1 Distribution of the severity of OSAS at each UA level

Endoscopic Mueller maneuver findings

Of the 506 patient charts, 463 had data for VP collapse, 506 had data for BOT collapse, and 387 had data for LPW collapse. Of the 463 patients with data on VP level, 30 had a minimal airway collapse at that level, 41 had a moderate airway collapse and 392 had a severe airway collapse. Among the 506 patients with BOT collapse, 144 had a minimal collapse, 187 had a moderate collapse and 175 had a severe collapse. Of 387 patients with data for obstruction at the LPW, 158 had a minimal collapse, 109 had a moderate collapse and 120 had a severe collapse (Table 2).

Table 2 Mean AHI according to the level and degree of collapse

Correlation between areas of collapse and OSAS severity

At the VP, patients with minimal collapse had a mean AHI of 17 (SD = 10) events per hour; patients with moderate collapse had a mean AHI of 25 (SD = 15) events per hour; and patients with severe collapse had a mean AHI of 27 (SD = 21) events per hour. This correlation was r = 0.069 (95 % CI; −0.022, 0.16) (Table 2).

A positive correlation was observed under a comparative analysis between BOT percentages of collapse and AHI values, r = 0.26 (95 % CI; 0.18, 0.34). There was a significant rise in the mean AHI for every collapse degree. Patients with a minimal collapse at the BOT had a mean AHI of 19 (SD = 12) events per hour; patients with moderate collapse had a mean AHI of 24 (SD = 19) events per hour; patients with severe collapse had a mean AHI of 33 (SD = 24) events per hour (Table 2).

At the LPW, there was a correlation between degree of collapse and OSAS severity, r = 0.22 (95 % CI; 0.12, 0.31). Patients with minimal collapse had a mean AHI of 20 (SD = 16) events per hour; patients with moderate collapse had a mean AHI of 25 (SD = 19) events per hour; patients with severe collapse had a mean AHI of 33 (SD = 25) events per hour (Table 2).

Relation between OSAS measures and Epworth sleepiness scale score

The mean ESS score for all 604 patients was 9 (SD = 5). A positive correlation was found between BOT collapse and ESS scores, r = 0.18 (95 % CI; 0.096, 0.26), and between LPW collapse and ESS scores, r = 0.10 (95 % CI; 0.0022, 0.20). The association between VP collapse and ESS scores was r = 0.0091 (95 % CI; −0.082, 0.10). ESS scores and AHI were positively correlated, r = 0.19 (95 % CI; 0.11, 0.27).

Multiple logistic regression analysis of VP, BOT and LPW on AHI outcomes

Patients with severe compared to those with minimal VP collapse have an odds ratio (OR) of 2.75 (95 % CI; 0.99, 7.67) of being in the severe AHI category and OR = 1.64 (95 % CI; 0.68, 3.92) of being in the moderate AHI category. Patients with moderate compared to those with minimal VP collapse have OR = 2.35 (95 % CI; 0.64, 8.73) of being in the severe AHI category and OR = 3.10 (95 % CI; 1.04, 9.30) of being in the moderate AHI category (Table 3).

Table 3 VP odds ratio estimates

Patients with severe compared to those with minimal BOT collapse have OR = 5.39 (95 % CI; 2.98, 9.73) of being in the severe AHI category and OR = 1.77 (95 % CI; 1.03, 3.05) of being in the moderate AHI category. Patients with moderate compared to those with minimal BOT collapse have OR = 1.77 (95 % CI; 0.98, 3.19) of being in the severe AHI category and OR = 1.21 (95 % CI; 0.74, 1.99) of being in the moderate AHI category (Table 4).

Table 4 BOT odds ratio estimates

Patients with severe compared to those with minimal LPW collapse have OR = 3.91 (95 % CI; 2.12, 7.19) of being in the severe AHI category and OR = 1.57 (95 % CI; 0.87, 2.82) of being in the moderate AHI category. Patients with moderate compared to those with minimal LPW collapse have OR = 1.96 (95 % CI; 1.04, 3.68) of being in the severe AHI category and OR = 1.27 (95 % CI; 0.72, 2.24) of being in the moderate AHI category (Table 5).

Table 5 LPW odds ratio estimates

Multiple logistic regression analysis of VP, BOT and LPW on AHI outcomes and the effects of BMI, age, and ESS

Accounting for BMI, age and ESS, patients with severe compared to those with minimal VP collapse have OR = 1.77 (95 % CI; 0.60, 5.20) of being in the severe AHI category and OR = 1.44 (95 % CI; 0.59, 3.49) of being in the moderate AHI category. Patients with moderate compared to those with minimal VP collapse have OR = 1.66 (95 % CI; 0.42, 6.63) of being in the severe AHI category and OR = 2.83 (95 % CI; 0.93, 8.57) of being in the moderate AHI category (Table 6).

Table 6 VP odds ratio estimates accounting for BMI, age and ESS

Accounting for BMI, age and ESS, patients with severe compared to those with minimal BOT collapse have OR = 3.18 (95 % CI; 1.69, 5.99) of being in the severe AHI category and OR = 1.54 (95 % CI; 0.87, 2.72) of being in the moderate AHI category. Patients with moderate compared to those with minimal BOT collapse have OR = 1.40 (95 % CI; 0.75, 2.60) of being in the severe AHI category and OR = 1.15 (95 % CI; 0.70, 1.90) of being in the moderate AHI category (Table 7).

Table 7 BOT odds ratio estimates accounting for BMI, age and ESS

Accounting for BMI, age and ESS, patients with severe compared to those with minimal LPW collapse have OR = 2.53 (95 % CI; 1.32, 4.87) of being in the severe AHI category and OR = 1.40 (95 % CI; 0.77, 2.57) of being in the moderate AHI category. Patients with moderate compared to those with minimal LPW collapse have OR = 1.72 (95 % CI; 0.88, 3.33) of being in the severe AHI category and OR = 1.21 (95 % CI; 0.68, 2.16) of being in the moderate AHI category (Table 8).

Table 8 LPW odds ratio estimates accounting for BMI, age and ESS

Discussion

The Wisconsin Cohort Study found that the prevalence of OSAS in people aged 30–60 years was 9–24 % for males and 4–9 % for females. It is also widely recognized that BMI is a serious risk factor for OSAS. It was mentioned in the Wisconsin Cohort study that a one standard deviation difference in BMI was associated with a 4-fold increase in OSAS prevalence [8]. In our study, BMI was significantly associated with VP, BOT and LPW collapse as well as with AHI. With a predominance of males, the mean age of 49 years, and patients with high BMIs, our cohort is representative of patients with OSAS [9].

Various studies have been conducted in an attempt to find a correlation between the degree and area of UA collapse and the severity of OSAS using different modalities [7, 10]. These modalities include tri-dimensional magnetic resonance imaging (MRI) [11], cine and sleep-MRI [12], computed tomography (CT) [13], critical pressure measurement (Pcrit) [14], negative expiratory pressure technique (NEP) [15], drug induced sleep endoscopy (DISE) [16] and Mueller maneuver [24, 7, 10, 17, 18, 21]. Our study aimed at finding such a relationship using the Mueller maneuver.

In this study, the Mueller maneuver was investigated for its predictive value of the severity of OSAS diagnosis. This maneuver is simple, safe, low cost, and can be part of a routine flexible laryngoscopy done during the head and neck examination. Studies have divergent opinions concerning MM statistical reliability. A 1994 study by Petri et al. found no predictive value in the Mueller maneuver for uvulopalatopharyngoplasty (UPPP) success rate [17]. Patients who primarily had retropalatal obstruction as judged by MM had only a 40 % response to UPPP. On the other hand, Li et al. later found clinical value for MM in improving outcomes from UPPP [18]. This study is not looking at the correlation between the MM and surgical outcomes. We were evaluating if the degree of obstruction seen with the MM could correlate with the severity of OSAS to guide the physician and the patients on the urgency to test for OSAS. We believe the MM is an effective procedure to observe upper airway collapse and its degree and we showed that the greater the collapse in the UA, the greater the chance of having more severe OSAS.

It must be noted that because our study involves the subjective evaluations of three different physicians on the degree of upper airway obstruction, there may be inherent inter-observer variation. A recent study by Ramji et al. [19] tested inter- and intra-rater agreement on sixty-one recorded videos of children undergoing a sleep nasopharyngoscopy. The authors concluded that there was validation for this procedure with good inter- and intra-rater agreement. The study was done using non-expert raters who did not perform the sleep nasopharyngoscopy routinely and were at various stages in their otolaryngology career. In the present study, the three physicians who scored the degree of collapse during the Mueller maneuver would be considered expert raters. All three are trained to perform the endoscopy and have routinely performed the procedure over many years. This cannot ensure that inter-rater variation is reliable in the present study; however it gives more confidence that the results were not biased in this manner.

In our study, a scale similar to the one used to grade the degree of obstruction of the UAs in studies on drug-induced sleep endoscopy was used (≤50 % obstruction = minimal collapse, 51–75 % obstruction = moderate collapse, 76–100 % obstruction = significant collapse). The degree of UA obstruction correlated with AHI at each level studied, mainly at the BOT and LPW. The VP level discriminates weakly compared to the other two levels and this may be explained by the fact that the velum collapses in the majority of patients with OSAS.

From the multivariate multiple logistic regression analysis, it is clear that the strongest effects are seen at the BOT and LPW levels. More specifically, there is a higher likelihood that a patient with severe BOT collapse compared to one with minimal BOT collapse would have severe OSAS. The analysis shows that, while other variables such as BMI, age and ESS scores accentuate this effect, the conclusions still hold true above and beyond these variables. In addition, there is a higher likelihood that a patient with severe LPW collapse compared to one with minimal LPW collapse would have severe OSAS. The analysis also shows that this holds true when accounting for variables such as BMI, age, and ESS scores.

The regression analysis results show significant odds ratios for predicting severe OSAS based on BOT and LPW observations of collapse. These results are important to the purpose of the study, as it shows that the Mueller maneuver may be used as a clinical screening device for predicting the probability of having severe OSAS, and therefore be prioritized for a sleep study.

Other studies have shown correlations between upper airway and OSAS. Santiago-Recuerda et al. [20] conducted a study on 40 morbidly obese women to determine the relationship of upper airway changes and OSAS. They found that the oropharyngeal area at maximal inspiration was negatively correlated with AHI (r = −0.423, p = 0.044). This study used computed tomography assessments of the upper airway. More recently, Kum et al. [21] used the Mueller maneuver to measure collapse at the retrolingual level and found the LPW obstruction site to be correlated with AHI in OSAS patients. A similar obstruction grading scale was used in the present study; however, our results indicate an important relationship between the BOT and OSAS in addition to the relationship between the LPW and OSAS. To our knowledge, the present study is the first showing a relationship between BOT collapse, LPW collapse and OSAS severity using our graded obstruction scale based on the ease of the MM technique.

Conclusions

In this study, the Mueller maneuver helped identify patients with severe sleep apnea based on UA collapse. MM cannot be used to diagnose OSAS, but can be a valuable tool to help the physician estimate the severity of sleep apnea and the urgency to obtain a sleep study. This examination can be performed in the physician’s office as a part of the routine head and neck examination. It allows for physicians to identify patients with significant collapse at the BOT and LPW, and as a result prioritize them for a formal sleep study.