INTRODUCTION

Breast cancer is the most common form of cancer and a leading cause of death for women in the West, where about one in ten women will develop it during her life. It is obvious that the chance for cure is highest if the breast cancer is identified at an early stage before the tumor has spread. That’s why most Western countries have adopted nationwide breast screening for postmenopausal women. X-ray mammography is currently the most effective imaging modality used in breast screening systems.

Often, two mammogram views are taken of each breast in a screening session. The two mammographic views are craniocaudal (CC), in which the X-ray projects from head to toe, and mediolateral Oblique (MLO), in which the X-ray direction is from one shoulder to the opposite hip. After the mammograms are taken, the radiologist attempts to detect the abnormalities in the mammographic views. If there is any suspicion of malignancy, a biopsy is carried out to confirm the diagnosis.

Breast abnormalities in X-ray mammograms include microcalcifications, masses, architectural distortion, and asymmetry. These abnormalities may be associated with either benign or malignant conditions, depending on their shapes and structures. Microcalcification clusters have more chance of being malignant and are deemed as one of the most important mammographic signs of early breast cancer. A mass is another indication of a tumor. There are three categories of mammographic appearance of a mass: circumscribed, irregular-shaped, and spiculated. A circumscribed mass usually has a round, oval, or lobular shape. A well-defined mass may represent a fibroadenoma or a cyst. Most circumscribed masses are benign, although there are rare instances in which a well-defined mass is malignant. An irregular-shaped mass represents one with a poorly defined margin, which indicates possible infiltration of cancer into surrounding tissue. Irregular-shaped masses are considered suspicious. A spiculated mass is characterized by an irregular central mass surrounded with radiating spicules. Spiculated masses have a very high risk of being malignant.

In computer-aided diagnosis (CAD), a manually or automatically detected mass has to be segmented before it can be classified as benign or malignant. After segmentation, the classification step is applied based on mass features, including shape, compactness, texture, etc. Shao and Brady1 proposed an iterative method for reconstructing the 3D masses. However, the 2D mass shapes are segmented manually and the reconstructed 3D mass shapes are rather coarse in their algorithm. In the following sections, we will present an approach based on automated mass segmentation and partial volume compensated mass shape reconstruction from the two mammographic views. It is believed that an accurate 3D reconstruction contains more information for classification of masses into benign or malignant, or at least is valuable to the clinician by visualization.

The remaining of this paper is organized as follows. “Automated Mass Segmentation” section introduces the segmentation of the 2D mass shapes in the two mammographic views based on a region growing approach. “Mass Shape Reconstruction” section describes the 3D-reconstruction technique. Evaluation results are reported in the “Experimental Results” section. Finally, “Conclusion” section concludes this paper.

AUTOMATED MASS SEGMENTATION

The mass segmentation step is crucial because the 3D mass reconstruction is highly dependent on the accuracy of the segmented mass shapes. Mammograms have very low contrast, which makes it impossible to use a pure thresholding method to segment the mass regions in the CC and MLO views. We use a seeded region growing technique instead for segmentation of the mass regions. Region growing begins by locating a set of “seeds” and attempts to merge neighboring pixels into this growing region until no more pixels can be added to it.2 This method first defines a homogeneity criterion. If the neighboring pixels satisfy the homogeneity criterion, they are merged into the target object region.

Adams and Bischof 3 proposed a seeded region growing algorithm, which takes an intensity image and a set of seeds as input. This algorithm is efficient, robust, and parameter-free. Unfortunately, this method is dependent on the order of the pixels being processed, which means the region growing result is different if the pixels are processed in different orders. Muller et al.4 proposed an automated 3D region-growing algorithm, which is based on an assessment function. The principle of this algorithm is to build a region-growing sequence by increasing the maximal homogeneity threshold from a very small value to a large one. On each segmented region, a 3D parameter that has been validated on a test image assesses the segmentation quality. This set of values called assessment function is used to determine the optimal homogeneity criterion. The main advantages of this algorithm are its capability of eliminating isolated points due to the noise and preserving connectivity of the target object structure. Revol and Jourlin2 developed a region-growing technique, which reconsiders the assignment of pixels at each step by a process minimizing variance through special dilations. Therefore, the region created by an initial seed can be nonconnected and possibly does not contain this seed. The homogeneity criterion of this method is based on an adequate tuning between spatial neighborhood and histogram neighborhood.

The drawback of the above methods is that they are rather time-consuming computationally and not effective for our mass segmentation problem. We propose a simple but efficient seeded region growing approach here. The seed is estimated from the manually segmented mass regions as used in the work of Shao and Brady.1 Assume that R is the manually segmented mass region, the seed is computed as:

$$ x_{s} = \frac{1} {k}{\sum\limits_{x_{i} \in R} {x_{i} } }\quad y_{s} = \frac{1} {k}{\sum\limits_{y_{i} \in R} {y_{i} } } $$
(1)

where (x i , y i ) represents the location of a pixel belonging to R and k is the number of pixels in R. A dilation operation of neighborhood N is applied to the seed (x s , y s ). N is obtained as follows:

$$ N = \min {\left( {x_{{\max }} - x_{s} ,x_{s} - x_{{\min }} ,y_{{\max }} - y_{s} ,y_{s} - y_{{\min }} } \right)} $$
(2)

where x max, x min are the maximal and minimal values of x i , and y max, y min are the maximal and minimal values of y i , respectively. The foreground measure I F is estimated from the intensity values within the dilated seed region S D as follows:

$$ I_{{\text{F}}} = \frac{1} {m}{\sum\limits_{{\left( {x_{i} ,y_{i} } \right)} \in S_{{\text{D}}} } {I{\left( {x_{i} ,y_{i} } \right)}} } $$
(3)

where m is the number of pixels belonging to the dilated seed region S D.

The background measure I B is estimated from the dilation of the manually segmented mass region R. A dilation operation of neighborhood L is applied to R, which results in a region R D. Typically, we set L to be 5 in our experiments. The background measure is computed as follows:

$$ I_{{\text{B}}} = \frac{1} {n}{\sum\limits_{{\left( {x_{i} ,y_{i} } \right)} \in R_{{\text{D}}} - R} {I{\left( {x_{i} ,y_{i} } \right)}} } $$
(4)

where I(x i , y i ) is the intensity value of the region R D −R, n is the number of pixels in the region R D −R.

The local signal dependent threshold can be defined as:

$$ T = I_{{\text{B}}} + \alpha {\left( {I_{{\text{F}}} - I_{{\text{B}}} } \right)} = \alpha I_{{\text{F}}} + {\left( {1 - \alpha } \right)}I_{{\text{B}}} $$
(5)

where α is a constant with values between zero and one. We set α as 0.5, i.e., T = 0.5(I F + I B), in our experiments.

Now, we have obtained the global limit of region growing. We also need the homogeneity condition for region growing. For each seed point S i belonging to S D and its neighboring pixels, the standard deviation of gray levels σ S i is calculated. Let σ mean be the mean value of σ S i :

$$ \sigma _{{{\text{mean}}}} = \frac{1} {{N_{{\text{S}}} }}{\sum\limits_{S_{i} \in S_{{\text{D}}} } {\sigma _{{{\text{Si}}}} } } $$
(6)

where N S is the number of pixels within the region of S D.

The region growing process is applied recursively. At each step, a homogeneity region R n (R 0=S D) is dilated by a neighborhood 1, yielding a region D n+1. For each pixel (x i , y i )∈D n+1 −R n, if the standard deviation (σ i ) of gray levels of (x i , y i ) and its neighbors is less than βσ mean, and the gray levels of I i exceeds T, it is marked as belonging to R n+1. That is, if σ i  < βσ mean and I(x i , y i ) > T, (x i , y i )∈R n+1; otherwise, (x i , y i )∉R n+1. The region growing procedure is iterated until the region R n can no longer be increased. We set β as 1.2 in our experiments.

To eliminate the possible holes in the segmented regions, a linkage process is applied. Any pixel with at least six of its eight neighboring pixels belonging to the segmented region is included to be part of the segmented region.

MASS SHAPE RECONSTRUCTION

The initial 3D mass shape, which is represented by a binary volume image, is reconstructed based on an algebraic reconstruction technique5,6, similar to that of Shao and Brady.1

Let v(i, j, k) be the 3-D reconstructed object image, i = 1, ..., n, j = 1, ..., n, k = 1, ..., n, which is a discrete three-dimensional image that represents a binary object. In this study, we assume that the mass lesion to be reconstructed is incompressible and that the attenuation coefficient within a mass is constant. For this reason, we represent the reconstructed object as a binary three-dimensional image.

Let P CC(r, c), P MLO(r, c)/P LM(r, c) represent the mass regions in the CC and MLO/LM views and the background, r = 1, ..., n, c = 1, ..., n. P CC(r, c), P MLO(r, c)/P LM(r, c) is equal to h m (h m is the background compensated intensity of a mass region), if (r, c) belongs to the mass region; and is equal to 0, if (r, c) belongs to the background.

Then, we convert v(i, j, k) into v j , j = 1, 2, ..., n 3. We also convert P CC(r, c) into P1 i , i = 1, ..., n 2, P MLO(r, c) into P2 i , i = 1, ..., n 2, respectively. We can write the linear equations as follows:

$$ w_{{11}} v_{1} + w_{{12}} v_{2} + w_{{13}} v_{3} + \cdots + w_{{1N}} v_{N} = P1_{1} $$
$$ w_{{21}} v_{1} + w_{{22}} v_{2} + w_{{23}} v_{3} + \cdots + w_{{2N}} v_{N} = P1_{2} $$
$$ w_{{n^{{\text{2}}} 1}} v_{1} + w_{{n^{{\text{2}}} {\text{2}}}} v_{{\text{2}}} + w_{{n^{{\text{2}}} {\text{3}}}} v_{3} + \cdots w_{{n^{{\text{2}}} N}} v_{N} = P1_{{n^{{\text{2}}} }} $$
$$ w_{{{\left( {n^{{\text{2}}} + 1} \right)}1}} v_{1} + w_{{{\left( {n^{{\text{2}}} + 1} \right)}2}} v_{2} + w_{{{\left( {n^{{\text{2}}} + 1} \right)}3}} v_{3} + \cdots w_{{{\left( {n^{2} + 1} \right)}N}} v_{N} = P2_{1} $$
$$ w_{{{\left( {2n^{{\text{2}}} } \right)}1}} v_{1} + w_{{{\left( {2n^{{\text{2}}} } \right)}2}} v_{2} + w_{{{\left( {2n^{2} } \right)}3}} v_{3} + \cdots w_{{{\left( {2n^{2} } \right)}N}} v_{N} = P2_{{n^{2} }} $$

w ij represents the influence of a voxel v j on a ray r i passing through pixel p i and it is determined by the length of ray r i in voxel v j .

We start by guessing the initial value of the voxel vector, V = V (0). Currently, we do that using the shape from silhouette method. At each iteration, we use a correction procedure to update the volume vector V. At iteration step k, the value of p i (k) is measured as projected from the present state of the volume vector V (k). The difference between p i and p i (k) is then back-projected to the 3-D volume object which generates V (k+1) to make sure that if p i (k+1) were computed from V (k+1), it would be closer to p i than p i (k). The correction procedure is only applied to the boundary voxels because our initially reconstructed object is a binary volume image and solid. Let b(V), b(\( \overline{V} \)) be the boundary voxels. When v j  = 1, and its neighboring voxels have both 0s and 1s, v j b(V). On the other hand, when v j  = 0, and its neighboring voxels have both 0s and 1s, v j b(\( \overline{V} \)). The correction procedure can be expressed as follows:

  • If v j b(V), and \(\frac{{{\sum\nolimits_{i = 1}^{2n^{2} } {{\left( {P_{i} - {\sum\nolimits_{n = 1}^N {\omega _{{in}} \upsilon _{n} } }} \right)}} }{\left( {\omega _{{ij}} > 0?1:0} \right)}}}{{{\sum\nolimits_{i = 1}^{2n^{2} } {\omega _{{ij}} } }}} < - {\text{Threshold}},\), change v j from 1 to 0;

  • If v j b(\( \overline{V} \)), and \( \frac{{{\sum\limits_{i = 1}^{2n^{2} } {\left( {P_{i} - } \right.} }{\sum\limits_{n = 1}^N {\left. {w_{{in}} v_{n} } \right){\left( {w_{{ij}} > 0?1:0} \right)}} }}} {{{\sum\limits_{i = 1}^{2n^{2} } {w_{{ij}} } }}} > {\text{Threshold}} \), change v j from 0 to 1.

We set Threshold as 0.5 in the current reconstruction procedure. The above procedure is applied recursively, until the reconstruction converges to a particular shape.

The above reconstructed binary 3D mass shape is rather coarse and is affected by the partial volume effect (PVE). PVE arises if more than one tissue type occupies a single voxel element. The intensity value of the partial volume pixel (or voxel) is a combination of the intensity values of each tissue type present. The accuracy of assigning these partial volume pixels (or voxels) to one single tissue type is inherently limited, especially when the image resolution is low.

To obtain a more accurate 3D mass shape, partial volume estimation is applied on the initial binary boundary as follows:·

  • If v j b(V), and \(\frac{{{\sum\nolimits_{i = 1}^{2n^{2} } {\left( {P_{i} - } \right.} }{\sum\nolimits_{n = 1}^N {\left. {\omega _{{in}} \upsilon _{n} } \right){\left( {\omega _{{ij}} > 0?1:0} \right)}} }}}{{{\sum\nolimits_{i = 1}^{2n^{2} } {\omega _{{ij}} } }}} < 0,\) \(\upsilon _{j} = 1 + \frac{{{\sum\nolimits_{i = 1}^{2n^{2} } {\left( {P_{i} - } \right.{\sum\nolimits_{n = 1}^N {\left. {\omega _{{in}} \upsilon _{n} } \right){\left( {\omega _{{ij}} > 0?1:0} \right)}} }} }}}{{{\sum\nolimits_{i = 1}^{2n^{2} } {\omega _{{ij}} } }}};\)

  • If v j b(\( \overline{V} \)), and \( \frac{{{\sum\nolimits_{i = 1}^{2n^{2} } {\left( {P_{i} - } \right.} }{\sum\nolimits_{n = 1}^N {\left. {w_{{in}} v_{n} } \right){\left( {w_{{ij}} > 0?1:0} \right)}} }}}{{{\sum\nolimits_{i = 1}^{2n^{2} } {w_{{ij}} } }}} > 0 \), \( v_{j} = \frac{{{\sum\nolimits_{i = 1}^{2n^{2} } {\left( {P_{i} - } \right.} }{\sum\nolimits_{n = 1}^N {\left. {w_{{in}} v_{n} } \right){\left( {w_{{ij}} > 0?1:0} \right)}} }}}{{{\sum\nolimits_{i = 1}^{2n^{2} } {w_{{ij}} } }}} \).

The initial three-dimensional binary boundary is replaced by partial volume voxels accordingly. The above procedure is applied recursively until the difference between the current reconstruction and the previous one is smaller than a threshold. Therefore, we have obtained a three-dimensional reconstruction with the inner voxels as 1 and the boundary voxels having the values between 0 and 1.

EXPERIMENTAL RESULTS

We apply our reconstruction algorithm on real mammographic images. Figure 1 shows the results of mass segmentation and reconstruction of a pair of CC and MLO views.

Fig 1
figure 1

(a) CC and MLO views; (b) results of automated mass segmentation; (c) results of reconstruction viewed at two different angles.

To evaluate the accuracy of the reconstructed algorithm, we project the reconstructed objects to two planes, which are parallel to the CC and MLO views. The mean deviation rate of the projections of the reconstructed masses from the original 2D mass regions is used as the error measure of the algorithm. The mean deviation rates of our proposed algorithm in comparison to the method in1 are shown in Table 1. The results are the average of reconstructions of 28 pairs of CC and MLO views. We can see that the proposed algorithm significantly reduces the reconstruction error.

Table 1 Average Mean Deviation Rates of Reconstructions of 28 Pairs of CC and MLO Views

CONCLUSION

In this paper, we developed a reconstruction algorithm for mass regions in mammograms. The 2D mass shapes are first segmented using an efficient region growing technique in CC and MLO views. The 3D mass shapes are then iteratively refined by an algebraic reconstruction technique. Partial volume estimation is applied on the boundary voxels to get a more accurate 3D representation of the mass. More accurately reconstructed mass shapes are useful for breast cancer diagnosis and treatment purposes. Radiologists can use the reconstructed three-dimensional mass coupled with the two-dimensional masses in the CC and MLO views to classify the lesion into malignant or benign. The reconstructed mass shapes can be also used as a visualization tool for medical education.