The database of the International Mechanical Ventilation Study [19], which was collected in March, 1998, was analyzed. The following information was collected on each patient receiving mechanical ventilation: demographic data, type of problem (medical or surgical), date of initiation of mechanical ventilation, primary indication for mechanical ventilation and issues related to patient management. Development of the following events was assessed daily during the course of mechanical ventilation for a maximum of 28 days: acute respiratory distress syndrome (ARDS), barotrauma, pneumonia, sepsis, renal failure, hepatic failure and coagulopathy. Sepsis, pneumonia and ARDS were considered as events only if they appeared more than 48 h after the initiation of mechanical ventilation. A patient was considered to have any of the above conditions if it was present for at least two consecutive days. Each of these conditions has been previously defined [19]. The arterial blood gases correspond to the values obtained once daily at approximately 8:00 a.m. The ventilator variables correspond to the time that the arterial blood gases were obtained. The use of neuromuscular blockers, sedatives and vasoactive drugs (given for at least 3 h in a 24-h period) was recorded daily for a maximum of 28 days. Weaning refers to the discontinuation of mechanical ventilation. The onset for this was the time when the physician in charge considered the patient was likely to be able to resume spontaneous breathing.
In terms of statistical analysis, the results are expressed as means and standard deviations, median with the interquartile range and proportions as appropriate. Student’s t-test or Mann-Whitney U test were used to compare continuous variables and chi-square test or Fisher’s test were used to compare proportions. The association of mortality and age was quantified and tested using the Spearman rank correlation coefficient.
A recursive partitioning method [20] was first used to look for the threshold of age that best discriminated for ICU survival. Two statistical methods, recursive partitioning and logistic regression, were then used to analyze the data corresponding to the group older than 70 years in Europe (derivation group, n=849). The derivation models that provided the best fit for elderly patients in Europe were then applied as a validation set to patients from USA-Canada (n=498) and Latin-America (n=265). First, a classification tree was constructed using the recursive partitioning procedure (Answer Tree Software, Chicago, Illinois) and the following variables: age, Simplified Acute Physiological Score (SAPS) II, sex, previous functional status, principal reason for initiating the mechanical ventilation, variables associated with patient management (such as successful non-invasive ventilation, use of non-conventional techniques—prone position, inhaled nitric oxide, permissive hypercapnia, inverse ratio ventilation—need of neuromuscular blockers, need of sedatives, tidal volume, respiratory rate, applied positive end-expiratory pressure, peak pressure, plateau pressure, tracheostomy), complications while receiving mechanical ventilation (such as barotrauma, ARDS, sepsis, pneumonia, shock, acute renal failure, hepatic failure, coagulopathy, metabolic acidosis, respiratory acidosis, ratio of partial pressure of oxygen to fraction of inspired oxygen, PaO2:FIO2).
The recursive partitioning method identified the threshold value for each variable that provided the best separation of the study population according to survival. For continuous variables, potential threshold values are all the values represented in the data. For dichotomous variables, the threshold value is the integer value of the two categories. For non-ordered categorical variables that have more than two categories, all factorial arrangements of the category were evaluated. For each variable, the program selected the threshold value that produced two subsets of the greatest purity. The partitioning was started after evaluating each risk variable for its ability to separate cases from controls. The variable that achieved the most precise separation of dead from living patients was selected as the best predictor for the first branch of the tree. The recursive partitioning procedure was repeated for each of the two subgroups that resulted from the first split, again searching all the cut-off points of each separation of dead and alive groups. The process was repeated for subsequent descendant subsets until no further partitioning was feasible because the subgroup contained fewer than 25 patients or contained only dead or only alive patients. The purpose of this classification tree was to reveal the structure of the database with respect to distinct combinations of variables that jointly influence the risk of mortality.
Second, the distinct risk subgroups identified by the classification tree were modeled using logistic regression. A dummy variable with different risk subgroups represented by the subsets at the bottom of the classification tree was introduced in a logistic regression analysis to estimate the odds ratios for the mortality of each subgroup in relation to the subgroup with a lower mortality. This model was validated in the Latin-America and USA-Canada cohorts (validation group) and the goodness-of-fit of this model was checked with Hosmer-Lemeshow’s test.