Background

The regenerative capacity of the liver has been known since ancient times. Liver volume can be of prognostic value prior to liver surgery in certain patients [1, 2]. This is one of the many reasons that liver volume measurement has attracted much attention, with more than 200 results in PubMed when searching for “liver volumetry.” Measuring liver volume, however, is time-consuming and cumbersome [3]. There are several semi-automated software programs available for accomplishing the task, and most are used for computerized tomography (CT) images. Knowledge of attenuation on CT is very helpful for delineating the liver in semi-automated programs. Measurement of attenuation is not possible on magnetic resonance imaging (MRI) [4] and its different sequences.

While volumetry involves measuring the whole area occupied by an object on each plane (usually axial) and then summing them, stereology employs statistical methods to arrive at an estimated area or volume [511]. Volumetry is a survey of all pixels, whereas stereology is a sample of pixels thought to represent the whole area or volume. The methods by which stereology can be used for sampling differ. One can use a systematic sampling method, usually employing a grid, in which the points on the grid are counted if they fall on the target. This is usually referred to Cavalier’s principle (for further information please see Appendix 1).

The aim of this study was to compare liver volume as measured by volumetry with that measured using stereology on several systematic grids and find which grid provides an accurate estimate of liver volume and its change over time, as well as to estimate the time required for a stereological study.

Materials and methods

Stereology

Based on a previous small pilot study with CT, we anticipated that a grid with a size somewhere between 3 and 5 cm would provide an accurate estimate of liver volume. In the pilot, we used several grids for measuring area.

The stereological measurements used in this study were performed using software developed in-house. The program was implemented in C++ as an offline private application using an open-source image processing platform for medical image analysis (http://code.google.com/p/platinum-image/), which can run on Windows, Mac OS, and Linux. The software allows use of stereology grids with the desired resolution in axial, sagittal, or coronal directions.

Each liver volume was measured on different occasions using 3-, 4-, 5-, and 6-cm grids. The measurements were made by a radiologist (with 11 years of experience in abdominal MRI and CT interpretation), blinded to the reference method and other stereological measurements (Fig. 1).

Fig. 1
figure 1

Screenshot of stereology software. The grid positioned on this slide is 40 x 40 mm (i.e., the grid resolution). The number of times the lines intersect within the boundaries of the liver is 13. Since each point of intersection represents 4 × 4 or 16 cm2, the total area occupied by the liver is estimated as 13 × 4 × 4 or 208 cm2. When an intersect falls on the border of the measured object, then a predefined corner is evaluated. Our radiologist opted for the upper right corner before the start of the study. If the right upper corner (or any corner chosen before the start of the study) is occupied by the object of interest, then the whole intersect is regarded as occupied by the liver. Otherwise, the intersection is considered to have fallen beyond the borders of the object of interest and is not included in the measurements

The liver volume for each subject was measured at three times: before and after low-calorie diet and after gastric bypass surgery. Differences between each of these states were calculated for each patient. These changes were measured by both stereological methods (3- to 6-cm grids) and volumetry. One subject had only two MRI examinations, and in this subject the change between these two available examinations was noted.

The subjects

The subjects were chosen from a previously published study [12], where the patients had undergone different treatments for obesity. All subjects were female. They had been imaged prior to and after treatments for weight reduction. The liver volumes were therefore expected to change due to these treatment regimens. The radiologist making stereological measurements did not have any information about the nature of that study. For details about the subjects, the reader is directed to the original study [12]. In summary, these patients were on average 34.7 years of age with an average weight of 121.3 kg at baseline dropping to 113.9 kg after 1 month of low-calorie diet and finally 105.8 kg 1 month after gastric bypass. The caloric intake comprised 30 % protein, 49 % carbohydrates, and 21 % fat. Total calorie intake was set to total energy expenditure (as calculated by Harris-Benedict) minus 1,000 kcal/day (minimum 860 kcal/day). The study had been approved by the local ethics committee.

MRI

The patients had been imaged before any treatment (first MRI), after treatment with low-calorie diet (second MRI), and after gastric bypass surgery (third MRI). MRI was performed on a 1.5T clinical scanner (Achieva; Philips Healthcare, Best, The Netherlands). Imaging parameters were as follows: T1 weighted images: TR 5.9 ms, TE 1.36/3.22/5.09 ms, flip angle 3°, elementary signal sampling field of view (FOV) (in motion direction) 112 mm, virtual FOV 530 × 377 × 2,000 mm3, and voxel size 2.07 × 2.07 × 8.00 mm3. Imaging was in supine position with the arms extended above the head.

Reference method

Volumetric measurements were made by two independent readers, and the average of their measurements was used as reference. The average time for measuring each liver volume was 17.2 ± 6.5 min.

Statistical methods

For statistical analysis, dedicated software (SPSS for Windows, version 20.0.0) and Microsoft Excel 2003 were used. The measured volumes were compared to the results of the reference method by using paired t-test. Correlation curves between different stereological measurements and the reference method were drawn based on calculation of correlation coefficient and coefficient of variation. The Pearson’s correlation coefficient (r) was calculated to assess the strength of linear dependency. Bland-Altman curves were drawn, and the percentage by which cases differed from the reference method was calculated.

Results

The average liver volume measured on volumetry was 1,998 mL with a standard deviation (SD) of 319 mL. The results of liver volume measured by the stereology method using different grids are shown in Table 1. The table also shows the average time needed for stereological measurements, the correlation coefficients and 95 % confidence intervals for difference in volume measurements. The 3- and 4-cm grids provided measurement results that differed from the volumetric measurements by less than 300 mL. The mean differences from volumetric measurements were 37 and 3 mL for 3- and 4-cm grids, respectively.

Table 1 Point liver volume measurements with different stereology methods based on grids 3 × 3, 4 × 4, 5 × 5 and 6 × 6 cm and their correlation to volumetric measurements (n = 41)

Scatterplots and Bland-Altman curves are shown for each grid (Figs. 2 and 3). When comparing the stereological method with volumetry, there was no statistically significant difference in measurements of liver volume or of changes in liver volume (p > 0.05).

Fig. 2
figure 2

Scatterplots demonstrate relations between measured liver volume by a 3-, b 4-, c 5-, and d 6-cm grid stereology. The linear regression lines are also drawn

Fig. 3
figure 3

Bland-Altman curves showing differences in liver volume measured with stereology and volumetry related to average liver volume on a 3-, b 4-, c 5-, d 6-cm grids

The changes in liver volume before and after treatment are shown in Table 2. Bland-Altman curves demonstrating differences in liver volume change for 3-, 4-, 5-, and 6-cm grids are shown in Fig. 4.

Table 2 Changes in liver volume before and after weight-loss treatment as measured by volumetry and stereology using 3-, 4-, 5-, and 6-cm grids
Fig. 4
figure 4

Bland-Altman curves demonstrating differences in liver volume changes as assessed by volumetry and by stereology on a 3-, b 4-, c 5-, and d 6-cm grids

Discussion

Our study shows that stereological methods employing 3- and 4-cm grids give accurate results for measuring both liver volume and changes in liver volume. Grids of 5 and 6 cm may still be useful in certain situations, e.g., when groups of patients are compared.

Stereology has the advantage of being done within a reasonable time. Although we used computer software to identify the points and to direct the process, this is not necessary as a simple overhead paper or any transparent paper placed on the monitor or hard copy image is sufficient. Stereology therefore entails small costs and can be used if there is no access to semi-automated or fully automated volumetry software.

Stereology also offers the advantage of being neither signal intensity-dependent nor density-dependent as several other available programs are. It can therefore be used in combination with ultrasound, CT, MRI with any sequence, etc. as long as there is no bias in cross-sectioning. One can choose to include or exclude lesions, vascular structures, or specific liver segments. Aydinli and colleagues have used stereology to evaluate response in hepatic alveolar echinococcosis and found that simple volumetric measurements were in some cases unable to replace stereology [13]. Mazonakis and colleagues used stereology for measurement of malignant liver lesions and compared their results to planimetry and found comparable values [14]. Stereology is particularly useful when the shape of an object is bizarre and unpredictable such as for the measurement of hepatic iron stores [15].

There are some drawbacks to our study. One could question our reference method. This method, however, is the basis of an already published study [12] and is a reliable and acceptable method [16]. In fact one study has found stereology better than volumetry when compared to intra-operative liver volume measurements [17]. In another study, Doherty and colleagues were able to show that stereology was superior to semi-automated volumetry for measurement of the temporal lobe [18].

To date there is only one study looking into stereology for measurement of liver volume [19]. These authors found stereology to be a more efficient approach than planimetry with an excellent agreement between the two. They, however, used 1.6-, 2.5-, and 3.3-cm grids and changed the number of sections studied. Interestingly, they found a larger difference (8.5 mL) between measurements when patients with hepatomegaly were studied compared to patients with normal liver volume (2.4 mL).

Our measurements were done on 8-mm-thick sections. Reiner and colleagues have found that the results of CT- and MR-based liver volumetry are dependent on slice thickness. Best volume measurements when time was factored in were 6- and 8-mm slices for CT and MRI, respectively [20].

Another aspect not studied to the best of our knowledge is measurement of liver volume in other imaging planes. We, like other authors before us, used axial sections. The liver has a special shape, with the lower segments of the right lobe tapering down vertically, much like the shape of a cone. Measuring cylinders or cones on planes perpendicular to their axes can cause bias. The sampling idea behind stereologic methods is that over- and underestimations cancel each other out.

Conclusion

Stereological methods can be used for accurate liver volume measurements with a 4 × 4 cm grid in less than 2 min.