Introduction

In the neurocritical care unit (NCCU), a significant percentage of patients require intubation for airway management in the context of their neurologic injury. Prolonged intubation in an intensive care unit (ICU) is associated with an increased risk of pneumonia, overall morbidity, mortality, need for tracheostomy, length of stay, and cost [1]. In brain-injured patients, a depressed mental status is associated with longer duration of intubation, even after standard weaning parameters have been met [1]. In addition, extubation failure requiring reintubation has been linked to higher morbidity and mortality [2].

Multiple studies have examined parameters that may predict extubation failure in medical ICU patients [37], some of which utilize respiratory maneuvers requiring active patient participation [6, 8]. Applying these data to a neurocritical care population is challenging given the unique deficits that accompany neurologic injury. Studies examining extubation outcomes in patients with neurologic disease have produced conflicting results. For example, one study found that a poor level of consciousness was not independently associated with extubation outcome [1], while a later study found that a low Glasgow Coma Scale (GCS) score was associated with extubation failure [9]. Another study found that the ability to follow four commands was associated with a higher probability of extubation success [8]. More recently, it has been shown that the Four Score does not consistently predict extubation failure [10]. Further research is needed to address these discrepancies and determine the best method to assess extubation readiness in the NCCU.

We hypothesized that extubation readiness in NCCU patients is unique in comparison to medical or surgical ICU patients without brain injury, and is dependent on arousal and corticospinal/bulbar function. Our primary aim was to assess the association of extubation outcome with specific neurologic parameters, ascertained prior to extubation, while controlling for confounding by pulmonary, infectious, and other comorbid conditions. Furthermore, given the conflicting results of prior studies, we sought to compare the utility of GCS versus a focused neurologic examination in the prediction of extubation outcome.

Methods

Study Population

Individuals in this study constituted a prospective cohort of intubated and mechanically ventilated patients in the NCCU at Massachusetts General Hospital (MGH). A total of 339 individuals were enrolled from May 2007 through December 2009, through daily screening, to identify potential study patients. Overall, 378 decision events occurred, resulting in 285 extubations and 93 tracheostomies. Of extubations, 48 (16.8%) required reintubation for extubation failure. The clinical protocol approved by the MGH Institutional Review Board established eligibility for enrollment in this study as (1) patients admitted to the NCCU with the primary diagnosis of central nervous system (CNS) or peripheral nervous system (PNS) disease, including aneurysmal subarachnoid hemorrhage (SAH), stroke, intracerebral hemorrhage (ICH), traumatic brain injury (TBI), subdural hemorrhage (SDH), epidural hemorrhage (EDH), venous sinus thrombosis (VST), CNS infection, spinal cord injury (SCI), brain tumor, neuromuscular disease, seizure, toxidrome, elective neurosurgery, or other CNS/PNS process; (2) patients deemed to be suitable for extubation or tracheostomy by the NCCU attending physician; (3) intubation duration of >6 h. Patients were excluded from the study if they were being extubated in a terminal setting, or if they were admitted to the NCCU without primary CNS or PNS disease. No standardized pulmonary weaning protocol is currently in use in the NCCU at MGH, and so the decision to pursue a spontaneous breathing trial (SBT) and/or extubate was left to the discretion of the attending physician. In many cases, an SBT was not performed because the patient was already on minimal ventilator settings at the time of the decision to extubate, and had no underlying pulmonary disease. Neither universal decision rule regarding level of consciousness nor other metric was used to determine suitability for extubation versus tracheostomy.

Outcomes and Definitions

The primary outcome in this study was extubation failure. Patients were defined as failing extubation if they required reintubation within 72 h of the extubation event. Individuals reintubated for a planned procedure were not counted as failures, nor were individuals reintubated after 72 h. Data on late failures (after 72 h) were recorded, but these individuals were analyzed as successes. Re-analysis after omission of these eight individuals did not alter results significantly. A secondary outcome was the need for tracheostomy. Intubated patients for whom the NCCU attending recommended tracheostomy were enrolled at the time this decision was reached (i.e., not at the time of tracheostomy placement). The same variables were evaluated for this population as the extubation cohort, but these patients were not followed after the decision was made to perform tracheostomy.

Demographic, Infectious, and Pulmonary Data Collection

Baseline study data were collected immediately prior to extubation, or at the time of the decision to pursue tracheostomy. Nursing and/or respiratory therapy staff were questioned regarding the quantity (minimal, moderate, or profuse) and texture (thin, moderate, or thick) of suctioned endotracheal sputum. No attempts were made to quantify sputum texture or quantity, given a lack of standardized methods of evaluating these parameters. Biological and respiratory factors, including heart rate, blood pressure, respiratory rate, oxygen saturation, ventilator settings (mode, pressure support, PEEP, backup rate, FiO2), tidal volume, recent ABG results, recent chest X-ray (CXR) results, and recent sputum microbiology testing were recorded. The presence of a clinical diagnosis of pneumonia (defined by the primary service based on temperature, WBC count, sputum culture results, and CXR findings), was also recorded.

Other factors which were a priori predicted to be associated with extubation outcome were also obtained [e.g., past history of extubation failure, COPD, congestive heart failure (CHF) with EF <30%, smoking history, asthma and obstructive sleep apnea (OSA)]. Immediately prior to extubation, the presence or absence of an endotracheal cuff leak was ascertained. Cuff leak was assessed qualitatively by deflating the cuff of the endotracheal tube and ascultating the airway for breath sounds, and was determined to be absent if the airway sounds were not audible. Following extubation, the presence of inspiratory stridor was assessed by bedside evaluation. No attempts were made to resolve the diagnoses of stridor or stertor in this study.

Neurologic Examination Data Collection

A focused neurologic examination was performed at the time of enrollment, following the collection of the above data. This examination included GCS scoring (recorded as 3–11T), the ability to follow five verbal commands (“close your eyes,” “stick out your tongue,” “show two fingers,” “cough now,” and “wiggle your toes”), eye movements (recorded as normal or abnormal), cough with suctioning (recorded as present or absent), corneal reflexes (recorded as bilateral, unilateral, or absent) orbicularis strength (recorded as normal, asymmetric, or weak), lower facial strength (recorded as normal, asymmetric, or weak), gag reflex (recorded as present or absent), and gross limb strength (recorded on a scale of 0–5). Because the presence of the endotracheal tube, tape, and oro-lingual edema made it difficult in many cases to ascertain the ability to protrude tongue to command, these data were excluded from the final analysis, leaving results available for the remaining four commands.

Statistical Analysis

Data for this study were analyzed using the STATA v.10 software package for Windows (http://www.stata.com). P-values for all categorical data were computed using Chi-square or Fisher’s exact test. The comparison of continuous variables was performed with Wilcoxon’s non-parametric rank sum test for binary outcomes. The Kruskal–Wallis test was used for categorical outcomes. GCS was analyzed by Wilcoxon rank sum as well as categorically. Analysis of neurologic diagnosis by extubation outcome was performed by Fisher’s exact tests (row-by-column method) to search for statistically significant effects for individual diagnoses on extubation outcome.

For multivariate analyses, logistic regression was used to compare extubation success and failure. This regression analysis was performed with the inclusion of variables achieving P < 0.1 in univariate analysis. Two separate multivariate models were employed, one for GCS evaluation, and another for neurologic examination evaluation, due to significant collinearity between these observations [variance inflation factor (VIF) >2]. Variables entered into both models included duration of intubation, past medical history of CHF, COPD, or OSA, and secretion texture. The inclusion of neurologic diagnosis to the multivariate models (as a categorical predictor with mean effect across categories as reference for computation of effect sizes) did not change results (data not shown). Either GCS or a combination of neurologic examination findings were then added to this baseline model. The results of each multivariate regression were compared using a receiver operator characteristic (ROC) analysis by calculating the area under the curve (AUC) when plotting sensitivity by 1-specificity.

Results

Tracheostomy Versus Extubation Comparisons

In this study, 93 tracheostomy decisions were made during enrollment, compared with 285 extubations. Individuals sent for tracheostomy were younger (56.2 years ± 1.9 vs. 60.6 ± 1.1, P < 0.001), with lower GCS scores [10T (IQR 8–11T) vs. 11T (IQR 10–11T), P = 0.05]. The duration of intubation was longer for individuals referred for tracheostomy [5 days (IQR 3–9) vs. 2 days (IQR 1–4), P < 0.001], with a greater proportion qualifying for a clinical diagnosis of pneumonia (51 vs. 26%, P < 0.001). 24/93 (26%) tracheostomies were performed on individuals who had a previous record of failed extubation attempt. No individual with a GCS <7T was allowed a trial of extubation. Because the goal of this study was to identify predictors of extubation failure, individuals receiving tracheostomies were excluded from the remainder of analyses.

Demographic, Pulmonary, and Infectious Comparisons

The demographics, medical history, and primary diagnoses of extubated individuals are shown in Table 1. Patients with pulmonary comorbidities such as COPD or OSA were more likely to fail extubation (P = 0.02 and P = 0.05, respectively). There was no difference in extubation outcome by primary neurologic diagnosis (P = 0.12) (Table 1). Medications did not have a significant impact on extubation failure, with no significant difference seen in the use of benzodiazepines, antipsychotics, opioids, or antiepileptic medications between successes and failures (data not shown, all P = NS).

Table 1 Clinical characteristics, past medical history, and neurologic diagnoses

There was no significant difference in extubation outcome when comparing fever, clinical pneumonia, or CXR findings (Table 2). Individuals with thick secretions were more likely to fail extubation (32 vs. 17%, P = 0.02), but there was no difference between groups for secretion volume (P = 0.12). Pulmonary mechanical variables and ABG results on the day of extubation were also not associated with extubation outcome (Table 2). Rapid shallow breathing index (RSBI) values were all <105 in individuals allowed a trial of extubation, with no significant difference in RSBI between successes and failures (P = 0.48). There was likewise no difference between outcomes for individuals with PaO2/FiO2 ratio >300 (P = 0.33).

Table 2 Infectious signs, secretions, pulmonary, and ventilatory characteristics

Neurologic Examination Comparisons

Individuals who failed extubation had no significant difference in GCS scores compared with those who were successful (P = 0.21), but in categorical analysis, there was a trend for individuals with GCS 7–9T to be more likely to fail extubation than those with GCS 10T or 11T (P = 0.07) (Table 3). Individuals who failed extubation were significantly less likely to close their eyes or cough on command (P = 0.04 and P = 0.01), and were less likely to have a normal gag reflex (P = 0.02) or normal eye movements (P = 0.02). Conversely, the ability to follow each of four commands (close eyes, show two fingers, wiggle toes, and cough) was associated with extubation success (P = 0.01).

Table 3 Neurologic evaluation data

Multivariate Analyses

All the variables found to be associated with extubation failure with P < 0.1 in univariate analysis were included in two multivariate models comparing extubation outcomes (Table 4). Thick secretion texture and a diagnosis of COPD retained significance in both multivariate analyses (P < 0.05). In categorical multivariate analysis of GCS in our cohort, a GCS of 7–9T was associated with extubation failure (OR failure 2.91, 95% confidence interval (95% CI) 1.2–7.1, P = 0.02). The ability to follow all four selected commands (close eyes, cough to command, show two fingers, and wiggle toes) was associated with extubation success (OR failure 0.43, 95% CI 0.22–0.86, P = 0.02), while gag reflex and normal eye movements did not retain significance.

Table 4 Multivariate models for GCS score and neurologic examination parameters

The two multivariate models were compared in receiver–operator characteristic (ROC) analysis (Table 4). The model containing the elemental neurologic examination was found to be more predictive of extubation outcome than the model containing the GCS alone (AUC 0.75 vs. 0.71, P = 0.007).

Post-Extubation Comparisons

Data on pre-extubation endotracheal cuff leak and post-extubation stridor was available for 203/285 extubations. Of these, 25 (12%) had an absent cuff-leak, and 19 (9%) had post-extubation stridor. In univariate analysis, the presence of stridor was significantly associated with extubation failure (P = 0.01), while absence of cuff leak was not (P = 0.24). In a four-way multivariate analysis of cuff-leak and stridor, the absence of cuff-leak combined with presence of stridor was significantly associated with extubation failure (OR 10.2, 95% CI 1.8–58.6), P = 0.009) (Table 5).

Table 5 Multivariate analysis of pre-extubation cuff leak and post-extubation stridor

Discussion

In this prospective observational cohort of 285 extubations in the NCCU, a multivariate model incorporating specific aspects of the neurologic examination was more predictive of extubation outcome than a model utilizing GCS alone. The ability to follow all four commands (close eyes, cough to command, show two fingers, and wiggle toes) was associated with extubation success both in univariate and multivariate analysis. GCS was not associated with outcome when using a rank-sum based analysis, but in a categorical analysis, a GCS of 7–9T was associated with an increased risk of extubation failure.

The analysis of GCS was complicated in our study by the fact that no patient with a GCS score <7T was allowed a trial of extubation. Although no protocol was in place in our NCCU at the time of this study, this observation suggests that the treatment teams did not feel that individuals with GCS <7T would be likely to tolerate extubation. We are, therefore, unable to extend our findings of association between low GCS (7–9T) and extubation failure to even lower GCS scores. Our categorical analysis of GCS does support the concept that low GCS scores provide some predictive information for extubation outcome, but without inclusion of additional individuals with low GCS scores, a more definitive result cannot be ascertained.

The reintubation rate in our study (16.8%) was similar to the reintubation rates quoted by Coplin et al. [1] (17.2%) and Salam et al. [8] (15.9%). In the study by Namen et al. [9], the extubation failure rate was significantly higher, at 38%. A more recent study performed on in brain- and spine-injured patients in a neurocritical care wing of an emergency room showed a 12.5% reintubation rate in the control group, which dropped to 5% with institution of a weaning protocol, which primarily included RSBI and arterial blood gas results [11]. The protocol used in their analysis required a GCS score ≥8 for consideration of extubation, which may have excluded patients for whom the neurologic status was the primary limitation to extubation success. Namen et al. found an excess of extubation failures in patients with GCS scores <8, while Coplin et al. demonstrated an 80% extubation success rate in patients with GCS <8. This lack of consensus illustrates the need for further investigation of extubation rules for individuals with low GCS scores.

Salam et al. [5] assessed mental status by the ability to follow four commands: open eyes, track examiner, grasp hand, and protrude tongue. In our study, we prompted patients with five commands, four of which were different from those used by Salam et al. The specific commands used in the evaluation are likely important. For instance, opening eyes to command could reflect an arousal event to a novel stimulus or a startle reaction, rather than a volitional event. Regarding and following a face can occur without volition, as the human face is a strong stimulus for fixation. Grasping of a hand can be primitive reflex, not necessarily indicative of a patient following the command. In contrast, closing eyes to command, showing two fingers, wiggling toes, and coughing to command require relatively preserved consciousness, attention, intact long-tract motor pathways, and muscle coordination. Tongue protrusion was included in both our study and the study by Salam et al., but we feel that this command may be of limited utility because of practical ascertainment concerns due to the presence of the endotracheal tube and frequent oro-lingual edema.

A recent study followed respiratory rate, minute ventilation, tidal volume, RSBI, negative inspiratory force (NIF), and PaO2/FiO2 ratio, and found no correlation between any of these variables and extubation failure in a neurocritical care population [10]. Another study found that weaning parameters were more difficult to apply to patients intubated for neurologic reasons in comparison with patients intubated for primary pulmonary reasons [7]. Both studies correlate well with the lack of association between measured pulmonary variables and the outcome of extubation in our cohort. These results point toward a common issue of airway management rather than pulmonary mechanics as the primary cause of extubation failure in the NCCU. The results of the current study contribute to growing evidence supporting the utility of using basic, testable neurologic abilities to predict post-extubation airway maintenance.

The absence of sufficient endotracheal cuff leak has been associated with post-extubation stridor [12, 13]. The frequency of absent cuff leak (12%) and post-extubation stridor (9%) in our cohort compare favorably with previously quoted rates [13, 14]. While in prior studies stridor rarely led to reintubation [14, 15], we observed a strong association between stridor and extubation failure. While our results are not conclusive, they demonstrate the need for additional investigation into the causes and treatments of post-extubation inspiratory stridor and stertor in the NCCU.

There are several limitations to our study. As with any observational cohort study, unmeasured variables may have played a role in outcome. The study was unblinded, and reporting and observation biases could have impacted the results. There are also clinical data points which were not collected prospectively that could have increased the risk of extubation failure. For instance, a positive fluid balance in the 24 h before extubation has been shown to predict extubation failure [16]. This is possibly relevant to our population, as NCCU patients often receive aggressive hydration to maintain adequate cerebral perfusion. There may have been underreporting of potentially confounding prior medical conditions, such as COPD, smoking, or OSA, since many of the patients included in this study were not able to report their own medical history due to their neurological illness. We obtained this history from family members or the medical record when available, but this was not always possible. Quantitative measures were not used to estimate sputum thickness or volume, primarily because a standardized method for these measurements is not in clinical use. The qualitative estimates of these variables used in our study are commonplace, but their interpretation and generalizability are limited by the lack of a quantitative standard.

Our results suggest that a focused neurologic examination may be of higher utility than the GCS alone to predict extubation outcome. A prospective, protocol-driven study of extubation in the NCCU that takes into account elements of the neurologic examination in addition to commonly accepted pulmonary criteria is needed, both to validate the results of our observational study, and to build toward a prediction rule that improves extubation outcomes in our patient population.