Introduction

Computed tomographic (CT) densitometry of the lung is a reliable method to detect pathological changes in lung aeration. The quasi-linear relationship between X-ray attenuation and soft tissue density enables quantification of the amount of gas per unit of tissue with remarkable precision [1].

As both extremes of lung aeration, collapse and hyperinflation, seem to contribute to ventilator-associated lung injury [2], many studies of mechanically ventilated patients have used CT to quantify such pathological lung conditions. For instance, by treating voxels pertaining to a specific range of aeration as an independent lung compartment, researchers have calculated the size of the nonaerated or hyperinflated subvolumes. Recently, controversial conclusions about the physiological effects of positive end-expiratory pressure (PEEP) and tidal volumes have been drawn from such results [315]. These densitometric measurements, however, depend on the technical parameters of the CT scan protocol itself and the image reconstruction algorithms applied [1, 4, 5, 14, 1632]. Not often recalled, image reconstruction parameters have been shown to affect the calculated volume of the emphysematous lung compartment, and thresholds for emphysema quantification also depend on the reconstruction details [1629].

We hypothesized that CT image reconstruction could similarly affect the measurement of hyperinflation in mechanically ventilated patients to such an extent that clinical judgment might be influenced significantly. Therefore, we analyzed the influence of two parameters, slice thickness and reconstruction filter, on hyperinflation in mechanically ventilated patients with or without lung injury. For comparison, we also investigated a group of spontaneously breathing patients, with normal lungs.

Materials and methods

Patients

The institutional ethics committee approved this study. Because the observational analysis of CT data did not require patient manipulation, the need for informed consent was waived. Patients with routine thoracic CT scans were retrospectively assigned to three groups: (1) Acute respiratory failure (ARF): five polytrauma patients presenting with pulmonary infiltrates and impaired oxygenation; (2) normal-ventilated: six polytrauma patients with normal lung CT; (3) normal-spontaneous: five spontaneously breathing patients with normal lungs. Patients with pneumothoraces, known emphysematous lung disease, or significant respiratory artifacts in CT images were not included. Physiological data was gathered from the patient data management system of our ICU.

All patients with polytrauma were treated according to institutional guidelines, including a whole-body CT scan with contrast medium [8]. Initial pressure-controlled mechanical ventilation (Oxylog 3000, Dräger, Lübeck, Germany) during primary resuscitation and CT scanning was standardized, and it included the following ventilator settings: tidal volume of 6–8 ml/kg estimated body weight (Broca formula), a respiratory rate of 20 min−1, and PEEP of 10 cm H2O unless higher PEEP had been applied before CT.

CT scanning and image reconstruction

Each patient had one single multislice helical CT scan (Somatom Volume Zoom, Siemens, Erlangen, Germany) using the following protocol: 120 kV tube voltage, tube current between 100 and 200 mAs, 4 × 2.5 mm collimation, pitch 1.1, and 0.5 s gantry rotation. Spontaneously breathing patients were scanned during breath hold after full inspiration and mechanically ventilated patients were scanned during uninterrupted ventilation. Since landmark studies estimating thresholds for separating hyperinflation from normally aerated lung were performed using contrast medium, we included only patients who had received 120 ml of Iopamidol (Schering AG, Berlin, Germany) [4, 33]. Autocalibration for air, and calibration using the manufacturer’s standard phantom were performed daily and weekly, respectively.

From identical CT raw data, predefined image series covering the entire lung were reconstructed using an identical field of view and contiguous slices of effective thickness 3, 5, 7, and 10 mm. During routine clinical imaging, two reconstruction filters—a standard (manufacturer’s designation: B35f, medium heart) and a sharp body filter (B60f, medium sharp)—were applied (Table 1). The assignments of standard and sharp were made according to Boedeker et al. [22]. Sample volumes were calculated using the theoretical parameters of the filters provided by the manufacturer [20]. When referring to a particular reconstruction method, the abbreviation “filter_thickness” is used. For example, std_10 denotes reconstruction with standard filter and 10 mm slice thickness (Table 1).

Table 1 Details of the CT image series

Image segmentation

Image segmentation was performed on a 3D workstation (Virtuoso, Siemens) and involved a threshold-based region growing algorithm followed by interactive manual refinement [19]. Major pulmonary vessels, trachea, and main bronchi were excluded. All CT images were analyzed by two independent observers. Lung subvolumes were classified as hyperinflated (V hyper, −1,000 to −901 HU), normally aerated (V normal, −900 to −501 HU), poorly aerated (V poorly, −500 to −101 HU) and nonaerated (V non, −100 to +100 HU), and calculated as the sum over all voxels in the respective HU ranges [4, 10, 13, 14]. The sum of these subvolumes was defined as V total. The subvolume of voxels within the −1,000 to −950 HU range was also calculated. In addition to absolute values, all subvolumes were calculated as percentage of V total.

Statistical analysis

Data are presented as median (interquartile range). Volumes were rounded to the nearest milliliter. Age, body mass index, and PEEP differences were evaluated by Kruskal–Wallis and Mann–Whitney tests, respectively. Bland–Altman plots were used to evaluate the agreement of relative subvolumes obtained in 10 mm slices combined with different filters. The interobserver agreement of absolute subvolume readings was also assessed by Bland–Altman’s technique. In these plots, the mean volume obtained by two different reconstruction methods or observers, respectively, was plotted on the x axis. The difference between these values was calculated as percentage of the mean, and plotted on the y axis [34, 35]. To analyze the influence of thickness and filter on relative V hyper and V non by repeated measures ANOVA, the respective values from all patients were transformed (V hyper: square-root, V non: natural logarithm) to conform to normality assumptions and homoscedasticity. Histograms, and Shapiro–Wilk and Levene tests were used to confirm the normal distribution. Filter and thickness were used as within-subject, and group (i.e., ARF, normal-ventilated and normal-spontaneous) as between-subject factor. Interaction terms were also included in the model. Bonferroni t test was used for pairwise post-hoc testing. Because of the limited number of patients in each group, further intergroup comparison was omitted. Significance was set at P < 0.05. Statistical analyses were performed with SPSS 11.0 (SPSS GmbH, Munich, Germany).

Results

Patient characteristics are given in Table 2. Intergroup differences in age (P = 0.67) and body mass index (P = 0.60) were not significant. PEEP levels during CT did not differ between ARF and normal-ventilated patients (P = 1.0). Relative and absolute CT subvolumes for all groups are given in Tables 3 and 4, respectively.

Table 2 Patient data
Table 3 Influence of CT image reconstruction on relative pulmonary subvolumes
Table 4 Influence of CT image reconstruction on absolute pulmonary subvolumes

For all patients, V hyper varied significantly with every change of reconstruction method (all P values ≤ 0.001; Tables 3, 4; Figs. 1, 2). Extreme values for V hyper were observed for std_10 and shp_3, with median values over all patients of 0.3 (0–1.3)% and 8.8 (4–13.1)%, respectively. For ARF, normal-ventilated, and normal-spontaneous patients, the median absolute difference in V hyper between shp_3 and std_10 was 150 (61–208) ml, 426 (146–541) ml, and 372 (332–461) ml, respectively.

Fig. 1
figure 1

CT images after segmentation of the lung for three representative patients and three image reconstruction methods: std_10, shp_10, and shp_3, respectively. Voxels contributing to V hyper are marked yellow. Images are annotated with the absolute and relative V hyper of the entire lung. Relative V hyper are percentages of V total. Patient annotations (i.e., S1, V2, A2) correspond to Table 2. Normal-spontaneous patient S1 was scanned during end-inspiratory breath hold, normal-ventilated patient V2 and ARF patient A2 during uninterrupted mechanical ventilation

Fig. 2
figure 2

Influence of image reconstruction on relative V hyper (left column) and V non (right column), respectively, for ARF, normal-spontaneous, and normal-ventilated patients. For each individual, the respective subvolume measured at slice thickness = 10, 7, 5, and 3 mm in combination with two different filters (i.e., standard and sharp filter) was calculated as percentage of V total and plotted over the reconstruction method. Data points of individual patients were connected by dotted lines for better identification of individual changes

V hyper obtained with the sharp filter systematically exceeded that with the standard filter showing a median increment of 138 (62–272) ml corresponding to approximately 4% of V total (Figs. 2, 3). The influence of both, thickness and filter on V hyper was statistically significant (ANOVA P values < 0.001 for both relative and absolute values). Thinner slices caused more pronounced increments if combined with sharp filters (P < 0.001 for the interaction term).

Fig. 3
figure 3

Influence of reconstruction filter on relative V hyper, V normal, V poor, and V non for a slice thickness of 10 mm. Bland–Altman plots show the agreement between the relative subvolumes derived from the standard (V std) or sharp (V shp) reconstruction filter. Relative subvolumes are percentages of V total. Please note the logarithmic scale of the abscissa for V non

The impact of filter on all other subvolumes (V non, V normal, and V poor) was much smaller than that on V hyper (Tables 3, 4; Figs. 2, 3). Although there was a slight reduction in V non associated with the sharp filter in ARF patients (Figs. 2, 3), the overall effect of filter on V non was not significant (P = 0.35).

Independent of the filter used, V hyper increased markedly with decreasing thickness in all patients (P < 0.001; Tables 3, 4; Figs. 1, 2). Decreasing thickness also caused a small but statistically significant (P < 0.001) increment of V non (1% of V total). Nevertheless, the increment of V hyper amounted to 107 (49–252) ml when changing from 10 to 3 mm slices, whereas the corresponding median increment of V non was only 12 (7–16) ml. Moreover, the effect on V non was consistent only in normal-ventilated and normal-spontaneous patients (Tables 3, 4; Fig. 2). In ARF patients, decreasing thickness slightly increased V non only if combined with the standard filter.

Use of a more conservative threshold (−950 instead of −900 HU) significantly decreased V hyper in 3 mm slices for both filters (P < 0.001). V hyper in std_3 yielded 0 (0–0.1)%, 0.1 (0–0.2)%, and 0.1 (0–0.2)% for ARF, normal-ventilated, and normal-spontaneous patients, respectively. In shp_3, the corresponding values were 0.6 (0.4–1.3)%, 2.1 (0.5–3.2)%, and 1.7 (1.0–2.9)%.

V total changed very little with the reconstruction method. For ARF patients, the median V total over all reconstruction methods was 3,613 ml, showing a maximum relative variation of 1.9%. For normal-ventilated patients, the corresponding variation was 1.7% at 4,179 ml, and for normal-spontaneous patients, it was 2.5% at 4,184 ml.

The interobserver agreement was good for V hyper, V normal, and V poor. The worse agreement of V non readings resulted mainly from normal-spontaneous and normal-ventilated patients, in whom small absolute interobserver differences translated into considerable percentages. Interobserver bias (standard deviation) observed in the Bland–Altman analysis were −0.1 (1.5)%, 0.0 (0.1)%, −0.4 (2.2)%, and 1.1 (12.0)% for V hyper, V normal, V poor, and V non, respectively.

Discussion

Our work shows that the assessment of lung hyperinflation by CT densitometry is significantly affected by image reconstruction parameters. Estimates of V hyper, a variable originally conceived to identify truly pathological conditions of hyperinflation, were significantly magnified by using either sharp reconstruction filters or thin slices. While such an observation has already been made for spontaneously breathing patients with emphysema, this study confirmed the same also for mechanically ventilated patients with either acutely injured or normal lungs. The magnitude of these effects is certainly of clinical relevance, since the estimates of V hyper differed by up to 400 ml depending on the settings employed.

As both extremes of lung aeration – deaeration and hyperinflation – play a role in the pathogenesis of ventilator-associated lung injury [2], V hyper has frequently been assessed by CT to evaluate this known side effect of mechanical ventilation [315]. When 10 mm slices are analyzed, a −900 HU threshold has been suggested to assess V hyper in mechanically ventilated patients, which is similar to the threshold for emphysema (i.e., −910 HU) [4, 33].

Recently, however, some authors suggested that the use of thinner slices would increase spatial resolution and improve the detection of V hyper [5, 7, 14]. Accordingly, Vieira et al. demonstrated that V hyper assessed with the −900 HU threshold in 1.5 mm slices significantly exceeded that of 10 mm slices, concluding that the true V hyper caused by mechanical ventilation had thus far been underestimated due to partial-volume-artifacts [5].

Theoretically, the approach of reducing partial-volume-artifacts seems sound. Because of the finite spatial resolution of CT, the size of the scanned (sampled) elementary tissue volume usually exceeds the alveolar dimensions by far. This implies that, within an elementary volume, a few collapsed or edematous alveoli could obscure the presence of many hyperinflated ones [5, 14]. This search for an improved spatial resolution, however, may be misleading under certain conditions. High-resolution CT (HRCT) is currently the method of choice for a visual and qualitative assessment of pathologic changes within the lung such as emphysema [30, 31], but the situation seems quite different for a quantitative analysis of V hyper [19, 21, 27, 28, 36].

Even with HRCT, where thin slices are commonly used together with sharp filters, we are still far away from the spatial resolution required to display single (hyperinflated) alveoli [5, 20, 21, 31, 37]. Thus, instead of improving the detection of V hyper, we may be trespassing the limits of density resolution of current CT technology. Kemerink et al. demonstrated that different reconstruction protocols may result in unacceptable differences in densitometric measurements, especially in heterogeneous foam-like materials, and tried to overcome these limitations by introducing the concept of CT sample volume [20, 21]. While the voxel size represents an arbitrary volume within which the density information is displayed or interpolated, the sample volume is the minimum volume carrying truly independent information, not contaminated by neighborhood densities. It is the volume in a scanned section over which the X-ray attenuation is truly estimated and frequently encompasses a few contiguous voxels. According to Kemerink et al., comparability of densitometric results (and thus the threshold-based assessment of V hyper) requires sample volumes above a critical value of 8 mm3 [20, 21]. Discussion of the appropriateness of this value is beyond the scope of this work, but it is worth noting that we observed large increases in V hyper when sample volumes were below 8 mm3.

Whenever spatial resolution is increased, for instance by using sharp filters and/or thin slices, the sample volume decreases and the histogram of CT numbers broadens due to an increased number of voxels with extreme values [1923, 29, 31]. The key question is to what extent the broadened histogram and the associated increase in V hyper is related to less partial-volume-artifact (i.e., better reflecting reality), and to what extent it is an artifact due to a too small sample volume (i.e., an increased amount of reconstruction noise). Important information to answer this question can be obtained from analysis of V non—the subvolume at the other extreme of the histogram. According to Kemerink et al., the artifact caused by a too small sample volume produces a preferential overestimation of V hyper with little effects on the other compartments [20, 21]. In contrast, any decrease in partial-volume-artifacts should produce symmetric and similar effects in V non, because vessels, bronchial walls, edematous or collapsed alveoli are reproduced with less contamination by surrounding lower density tissues [20, 21]. Although increments of V non may not exactly mirror those of V hyper, since this would require a mean lung density around −500 HU, the increments of V non observed in our patients were much smaller than that expected. Thus, we must conclude that sharp filters were not appropriate for the assessment of V hyper, especially in ARF patients: whatever the slice thickness, V hyper from the sharp filter significantly exceeded the corresponding V hyper from the standard one, without any associated change in V non (in fact there was a slight decrease).

A similar observation has been established for the quantification of emphysema, a condition where sharp filters are no longer recommended for the assessment of V hyper, since they were shown to overestimate this subvolume [2024, 26, 36].

In contrast to this straightforward conclusion about the inappropriateness of sharp filters, the results of our study are less conclusive regarding the possible artifact caused by thin slices. Both absolute V hyper and V non increased with decreasing thickness (Table 4). This was paralleled by a decrease in V poor and particularly consistent for reconstructions using the standard filter. Such behavior is compatible with a reduction in partial-volume-artifacts, suggesting a true advantage of thinner slices. When V non was expressed as percentage of V total (Table 3), however, it remained almost unchanged (Fig. 2). Furthermore, the median relative variation of V non in normal-ventilated and normal-spontaneous was only about 1%, corresponding to an absolute volume of approximately 10 ml. Since V non is more susceptible to subjective segmentation errors than V hyper, such a small increase in V non may also be related to a higher rate of segmentation errors when analyzing a great number of thin slices. Nevertheless, our analyses suggest that thinner slices (especially in combination with standard filter) enable assessment of V hyper with less influence of partial-volume-artifacts. However, further studies seem necessary to prove this conclusively for the assessment of lung hyperinflation associated with mechanical ventilation.

Thin slices have been implemented in several protocols for the CT quantification of V hyper in emphysema, asthma or interstitial lung disease. To avoid an overestimation of V hyper, however, most of these studies used −950 HU or even lower thresholds to separate V hyper from nomal lung [16, 18, 24, 27, 28, 3032, 36, 38, 39]. Accordingly, the authors were able to reduce partial-volume-artifacts without the associated overestimation of V hyper. Unfortunately, this procedure has not been adopted in studies about mechanical ventilation, and this may explain the high values of V hyper reported in subjects scanned at low transpulmonary pressures [5, 6, 9, 15]. In all these studies, thin slices were analyzed with the traditional −900 HU threshold, sometimes in combination with sharp filters. In this context, it is interesting to note the significant decrease in V hyper in 3 mm slices observed in our study after applying the −950 HU threshold.

It is likely, however, that no single threshold for V hyper is universally appropriate, even when using intermediate slice thicknesses in combination with soft or standard filters [26]. Thresholds will not only vary with patient-specific factors [16, 36], but also with the severity of the lung pathology [5]. A gold standard for measurement of V hyper in mechanically ventilated patients is lacking, especially for conditions in which the average lung density is increased, such as acute lung injury. Finally, even if V hyper assessed by threshold-based CT densitometry might correlate with macroscopic and microscopic morphometry (as shown for emphysema [32, 33, 39]), it is still a challenge to understand how we could relate it to physiological indicators of alveolar hyperinflation such as functional dead-space [40], septal stress, or increased cytokine release [7, 41].

Some limitations of our retrospective study deserve attention. Although a fixed CT protocol was used for each group, different respiratory conditions (e.g., slightly different PEEP levels) may have resulted in uncontrolled lung volumes at which CT data were acquired. Also, age, body mass index, and smoking habits differed between groups. Finally, the radiation dose (current-time product) had been chosen by the CT technician according to indication for CT and patient factors and thus varied among our patients. All these factors may have biased intergroup comparisons of V hyper, but not the within-patient assessments, which were the focus of this study. Finally, two outliers can be recognized in the V hyper column of Fig. 2 (ARF and normal-ventilated). The lower radiation doses applied in these two patients (100 mAs) might be an explanation for the high values of V hyper, although such an artifact seemed to be limited in previous studies [30, 32]. Anyway, we tested the overall analysis without these two outliers. We found that the trend of increasing V hyper with higher image resolution remained, suggesting that our conclusions are robust.

In conclusion, CT-based assessment of the hyperinflated lung volume in mechanically ventilated patients is significantly affected by image reconstruction parameters. Sharp filters overestimate this volume whatever slice thickness they are combined with. Thin slices used in combination with standard filters seem to reduce partial-volume-artifacts and may improve the assessment of hyperinflation. This approach, however, would require the definition and validation of new HU thresholds (e.g., −950 HU in normal lungs). Quantitative data of different studies can only be compared if identical reconstruction parameters were used and comprehensively specified.