Improving the Quantitative Analysis of Breast Microcalcifications: A Multiscale Approach

Marasinou, Chrysostomos; Li, Bo; Paige, Jeremy; Omigbodun, Akinyinka; Nakhaei, Noor; Hoyt, Anne; Hsu, William

doi:10.1007/s10278-022-00751-3

Improving the Quantitative Analysis of Breast Microcalcifications: A Multiscale Approach

Open access
Published: 23 February 2023

Volume 36, pages 1016–1028, (2023)
Cite this article

Download PDF

You have full access to this open access article

Journal of Digital Imaging Aims and scope Submit manuscript

Improving the Quantitative Analysis of Breast Microcalcifications: A Multiscale Approach

Download PDF

Chrysostomos Marasinou¹,
Bo Li²,
Jeremy Paige²,
Akinyinka Omigbodun¹,
Noor Nakhaei³,
Anne Hoyt² &
…
William Hsu ORCID: orcid.org/0000-0002-5168-070X¹

1678 Accesses
4 Altmetric
Explore all metrics

Abstract

Accurate characterization of microcalcifications (MCs) in 2D digital mammography is a necessary step toward reducing the diagnostic uncertainty associated with the callback of indeterminate MCs. Quantitative analysis of MCs can better identify MCs with a higher likelihood of ductal carcinoma in situ or invasive cancer. However, automated identification and segmentation of MCs remain challenging with high false positive rates. We present a two-stage multiscale approach to MC segmentation in 2D full-field digital mammograms (FFDMs) and diagnostic magnification views. Candidate objects are first delineated using blob detection and Hessian analysis. A regression convolutional network, trained to output a function with a higher response near MCs, chooses the objects which constitute actual MCs. The method was trained and validated on 435 screening and diagnostic FFDMs from two separate datasets. We then used our approach to segment MCs on magnification views of 248 cases with amorphous MCs. We modeled the extracted features using gradient tree boosting to classify each case as benign or malignant. Compared to state-of-the-art comparison methods, our approach achieved superior mean intersection over the union (0.670 ± 0.121 per image versus 0.524 ± 0.034 per image), intersection over the union per MC object (0.607 ± 0.250 versus 0.363 ± 0.278) and true positive rate of 0.744 versus 0.581 at 0.4 false positive detections per square centimeter. Features generated using our approach outperformed the comparison method (0.763 versus 0.710 AUC) in distinguishing amorphous calcifications as benign or malignant.

Improved automated early detection of breast cancer based on high resolution 3D micro-CT microcalcification images

Article Open access 11 February 2022

False Positive Reduction by an Annular Model as a Set of Few Features for Microcalcification Detection to Assist Early Diagnosis of Breast Cancer

Article 18 June 2018

Segmentation and Classification of Microcalcifications Using Digital Mammograms

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Breast cancer is the most common cancer in women, accounting for 12% of cancer cases worldwide [1]. Studies have shown that early detection using mammography reduces breast cancer mortality [2]. In many countries, screening programs have been established with sensitivity levels ranging between 80 and 95% [3, 4]. However, screening also results in false positive outcomes, leading to patient anxiety, unnecessary biopsies, and the identification of clinically insignificant cancers, raising concerns about overdetection.

Microcalcifications (MCs), which are small calcium deposits, are common mammographic findings where they typically appear as high optical density structures. Nearly 50% of the biopsied MCs are associated with ductal carcinoma in situ (DCIS) [5], an early form of cancer but a nonobligate precursor to invasive cancer [6, 7]. MCs are reported by radiologists using a set of qualitative descriptors related to morphology (shape) and distribution, as defined by the American College of Radiology Breast Imaging Reporting and Data System (BI-RADS) using a combination of full-field digital mammograms (FFDMs) and magnification views. Descriptors correspond to varying levels of suspicion for cancer. For example, amorphous MCs are assigned a moderate suspicion level (i.e., BI-RADS 4B) with a positive predictive value (PPV₃)^{Footnote 1} of 21% [8]. However, the assigned level of suspicion can be open to interpretation and varies by radiologists due to subtle differences in MCs’ size, shape, texture, and heterogeneity in the background tissue [9]. Hence, determining whether a group of calcifications is associated with malignancy is challenging, and the current PPV₃ of biopsied suspicious MCs on 2D mammography is in the range of 20–41% [10].

Many computerized methods have been developed to aid radiologists in detecting MCs [9, 11,12,13,14,15]. These methods, generally categorized as computer-aided detection (CADe) systems, automatically mark groups of suspicious MCs in mammograms. Current CADe systems achieve high sensitivity but at the cost of a large number of false positive marks per mammogram, increasing the interpretation time. In work similar to ours, Wang et al. [9] developed a context-sensitive deep convolutional neural network, focusing on detecting MCs with low false positives. Their approach generated candidate locations using a difference of Gaussians (DoG) blob detection, filtering out non-MCs using a convolutional neural network. Our approach goes beyond detection by segmenting the boundaries of each MC.

In addition, studies have shown that using shape and intensity features from segmented MCs can improve malignancy classification [16,17,18,19,20]. Precise segmentation also allows for more accurate quantitative characterization of the shape and distribution of MCs and texture analysis of the surrounding breast parenchyma that could be used to classify cancerous regions better. Prior studied techniques include wavelet transform for isolating high-frequency components [21, 22], gray-level morphological operations [11, 23,24,25,26], fuzzy logic [27], and binary pixel classification using machine learning [17]. Although segmentation of MCs is performed in these studies, all but Ciecholewski [24] evaluated the performance of their algorithm as a detection task (using free-response operating characteristic analysis), not a segmentation task (measured by the overlap of delineated regions). Ciecholewski reported an intersection over the union of 70.8% between the segmented MCs and the radiologist annotations on a set of 200 regions, which we use as a basis for comparing our algorithm.

In this study, a quantitative morphology-based approach for characterizing MCs is demonstrated. Given a 2D digital mammogram, we initially identify bright salient structures using the DoG blob detection algorithm. Hessian analysis is then applied to segment these structures. Next, dense regression is employed to segment regions containing structures that are likely to be MCs. Dense regression has been used for similar tasks such as cell and nuclei detection [28, 29], retinal optical disc and fovea detection [30], and focal vascular lesion localization on brain MRI [31]. The idea is that human experts’ reference annotations are mapped to a smooth proximity function that reaches its maximum value when corresponding to the annotated points. Dense regression models are then trained to map the input mammogram to the proximity function. The proximity function method is advantageous when objects are annotated by a single pixel rather than their actual boundaries (e.g., many MCs are tiny and time-consuming to delineate). A fully convolutional network with pretrained weights is utilized to perform dense regression. The outputs of the dense regression model and the blob segmentation algorithm are combined to generate the final MC segmentation.

The contributions of our work are summarized as follows:

Obtaining precise annotations of all MCs is impractical, given the time and labor required. As a result, manual annotations of MC boundaries are often inconsistently drawn with high variability. To accommodate this uncertainty, we use proximity functions to represent individual MCs as part of regression model training.
A dense regression model and a novel blob segmentation algorithm are applied to generate MCs’ accurate segmentation while achieving fewer false positives than comparable state-of-the-art algorithms.
Our approach is trained and tested on a set of screening and diagnostic mammograms from two cohorts (INbreast and local data). We demonstrate the generalizability of our approach by applying our method to a set of magnification views.

Materials and Methods

Data

INbreast Dataset

For model training and internal validation, we utilized a public dataset called INbreast [32], a collection of 2D screening and diagnostic FFDMs, which were generated using a Siemens MammoNovation system. 115 screening cases with 410 images were collected at a 0.070 mm per pixel resolution and 14-bit greyscale. The dataset included detailed annotations provided by two experts for several types of lesions (i.e., masses, MCs, asymmetries, and distortions). Fifty-six cases had pathology-confirmed diagnoses, of which 45 were cancerous (DCIS and invasive). We used 294 images (147 craniocaudal (CC) and 147 mediolateral oblique (MLO) views) from 86 screening cases with annotations of individual MCs. MCs were annotated in two ways: (1) small MCs were annotated by a single pixel to denote their location, and (2) larger MCs were annotated using pixel-wise contours. It should be noted that a guideline of what was considered small versus larger MC was not reported.

Local Dataset

As an additional test set, data collected retrospectively from patients who had 2D diagnostic FFDMs performed at our institution, following an institutional review board (IRB)-approved protocol, was used. The dataset consisted of 79 diagnostic cases with 141 FFDM images (46 CC, 21 MLO, and 74 mediolateral (ML) views) where MCs were present. All images were acquired using Hologic Selenia full-field digital mammography equipment at a 0.070 mm per pixel resolution and 12-bit greyscale. After collecting the data, suspicious MCs were annotated by a breast fellowship-trained, board-certified radiologist with five years of experience. An open-source medical image viewer, Horos, was utilized to generate the annotations. Individual MCs were annotated by single pixels indicating their locations. A second board-certified radiologist annotated a sample of 5 cases to assess the annotation task’s interreader reliability. The index of specific agreement and the kappa statistic was determined. The two radiologists’ agreement was moderate, with an index of specific agreement of 0.664 (0.606–0.729, 95% confidence interval), see Supplementary Information.

Local Magnification View Dataset

We used magnification views obtained from 248 patients with amorphous calcifications seen at our institution to evaluate the performance of a malignancy classification. The model utilized features extracted from segmentations generated by our approach to classifying cases as benign or malignant (see the case study described in the Sect. “Case study: Identifying breast cancers among amorphous calcifications’’).

Overall Approach

The overall approach is illustrated in Fig. 1.

Blob Segmentation

The first stage is the segmentation of granular structures that are candidate MCs. To generate candidate MC segments, we developed Hessian DoG for blob segmentation. This module’s objective is the accurate segmentation of bright salient structures that are candidate MC objects, as shown in Fig. 1.

Scale-space theory is a framework formulated to represent signals at multiple scales. The Gaussian scale-space representation of an image I(x, y) is defined as [33]:

$$\begin{aligned} \begin{aligned} L(x, y; \sigma ) = G(x, y; \sigma ) *I(x, y), \end{aligned} \end{aligned}$$

(1)

where $*$ is the convolution and $G(x,y,\sigma )$ the two-dimensional Gaussian function

$$\begin{aligned} \begin{aligned} G(x,y; \sigma )&= \frac{1}{2\pi \sigma ^2} e^{-(x^2+y^2)/(2\sigma ^2)}. \end{aligned} \end{aligned}$$

(2)

In the DoG method, blobs with associated scale levels are detected from scale-space maxima of the scale-normalized DoG function. The normalized DoG function is defined as:

$$\begin{aligned} \begin{aligned} DoG(x,y;\sigma )&= \frac{\sigma }{\Delta \sigma } \left( L(x,y;\sigma +\Delta \sigma )-L(x,y;\sigma ) \right) \end{aligned} \end{aligned}$$

(3)

where $\Delta \sigma$ is the difference between two scales. To construct the DoG scale-space representation, a sequence of scales is considered $\sigma _n = k^n \sigma _{\min }$ where k is a constant multiplicative factor and $n=[0,1,\cdots , n_{\max } ]$. The DoG representations Eq. (3) are computed for all adjacent scales (i.e., $\Delta \sigma = \sigma _{n+1}-\sigma _n$) forming a 3-dimensional representation:

$$\begin{aligned} \begin{aligned} DoG(x,y,n)&= \frac{\sigma _n}{\sigma _{n+1}-\sigma _n} \left( L(x,y;\sigma _{n+1})-L(x,y;\sigma _n) \right) \end{aligned} \end{aligned}$$

(4)

with x, y the two spatial dimensions and $n=[0,1,\cdots , n_{\max -1} ]$ a scale dimension. Local maxima in the 3-dimensional representation are computed giving a blob set $(x^{(i)},y^{(i)},\sigma ^{(i)})$ where i identifies each blob. The number of blob detections is controlled by a threshold, $T_\mathrm {DoG}$, that is applied as a lower bound on the DoG representation before obtaining the local maxima. Moreover, in the case of overlapping blobs, the smaller blob is eliminated if the overlapping fraction is greater than the threshold $O_\mathrm {DoG}$.

The DoG algorithm outputs the location and scale of the detected blobs. To extract the blob shapes, we extended this method using Hessian analysis. The geometrical structure (i.e., convexity structure) of a blob-like object can be described by the eigenvalues of the Hessian [34]. In particular, a bright blob-like structure corresponds to two negative and large eigenvalues, whereas a bright tubular structure corresponds to one large negative eigenvalue and a small eigenvalue of an arbitrary sign. These structures correspond to the target MC candidates.

The Hessian DoG (H) representation at scale $\sigma$ is given by

$$\begin{aligned} \begin{aligned} \mathrm {H}(x,y;\sigma ) = \begin{pmatrix} \frac{\partial ^2 DoG(x,y;\sigma )}{\partial x^2} &{} \frac{\partial ^2 DoG(x,y;\sigma )}{\partial x \partial y}\\ \frac{\partial ^2 DoG(x,y;\sigma )}{\partial x \partial y} &{} \frac{\partial ^2 DoG(x,y;\sigma )}{\partial y^2} \end{pmatrix}. \end{aligned} \end{aligned}$$

(5)

H is computed across all scales in the sequence $\sigma _n$. At each scale, the following constraints are imposed:

$$\begin{aligned} \begin{aligned} \mathrm {tr}(H)< 0 \wedge \left( \det (H)<0\quad \vee \quad \frac{\det (H)}{\mathrm {tr}(H)^2} \le h_{\mathrm {thr}} \right) \end{aligned} \end{aligned}$$

(6)

where $h_{\mathrm {thr}}$ is a tunable parameter. The constraints ensure that the Hessian is either negative definite or has a small positive eigenvalue. In this way, only bright salient blob-like and tubular structures are segmented. The constraint generates a binary mask at each scale. Iterating over the blob set found in the DoG algorithm $(x^{(i)},y^{(i)},\sigma ^{(i)})$, the corresponding objects are found in the Hessian masks. More specifically, for the Hessian mask at scale $\sigma ^{(i)}$, the object spanning the location $(x^{(i)},y^{(i)})$ is found. The output of this step consists of all detected objects merged into a single binary mask.

Regression Convolutional Neural Network

While the blob segmentation step identifies objects that are candidate MCs, many will be false positives. This step identifies regions where MCs are most likely present by segmenting the MCs’ area to choose relevant MC objects from the previous stage. This task was performed using a fully convolutional neural network, commonly used in image segmentation, as the regression model. The model’s output is a smooth proximity map reaching a maximum value at the predicted MC locations.

The MC region segmentation task is analogous to cell and nuclei detection in microscopy images. The two tasks share the following characteristics: (1) they are highly imbalanced (i.e., the positive class captures a small region compared to the background within an image, and it often consists of many small structures), (2) the background is highly inhomogeneous, (3) individual objects exhibit large variation in sizes, shapes, and textures, (4) boundaries of the structures are often blurry, (5) the resolution of both types of images is large, and (6) the annotations are usually a mixture of individual points or exact boundaries. Inspired by this analogy, we adapted methods previously used in cell and nuclei detection [28, 29]. In [28], the authors used regression to detect cell centers. The human-annotated binary masks containing cell centers’ locations were transformed into a continuous function flat on the background with localized peaks at each cell’s center. These functions were then used to train a Random Forest Regression algorithm on a set of image patches. The cell centers were identified with local maxima in the model’s output. In [29], the authors showed that the same technique could be applied using a deep learning model. Their regression model was a fully convolutional neural network with a large receptive field capable of encoding high-resolution information.

Our MC segmentation model is formulated as follows: Given a mask generated from reference annotations $M(x,y) \in \{0,1\}$, the MC locations are given by $\{(x_i, y_i)\}$ where $M(x_i,y_i) = 1$. The proximity function is then defined as:

$$\begin{aligned} P(x,y) = \max _i g(x,y,x_i,y_i) \end{aligned}$$

(7)

$$\begin{aligned} \begin{aligned} g(x,y,x_i,y_i)&= {\left\{ \begin{array}{ll} (e^{\alpha (1-r/\xi )}-1)/(e^\alpha -1), &{} r \le \xi \\ 0, &{} r > \xi \end{array}\right. } \end{aligned} \end{aligned}$$

(8)

$$\begin{aligned} r = \sqrt{(x-x_i)^2+(y-y_i)^2} \end{aligned}$$

(9)

where $\alpha , \xi$ are tunable parameters. The function maps MC locations on an exponentially curved surface, expanding to a distance $\xi$ with decay rate $\alpha$ before it vanishes. An example of the transformation is illustrated in Fig. 2. This transformed mask compensated for the fact that we had mixed quality annotations (i.e., point-like and exact) and forced the model to learn information from the precise locations of MCs and the surrounding background.

We constructed a model which predicts the proximity function P(x, y) given the image I(x, y). A feature pyramid network (FPN) [35] was used with Inception-v4 [36] as the backbone. The FPN architecture was introduced for applications such as region proposal, object detection, and instance segmentation. It adopts a pyramidal shape structure similar to many segmentation networks, such as the U-net [37], with an encoder that produces semantic features at different scales and a decoder that combines the encoder features by upsampling them.

The FPN architecture is suitable because it allows features from all scales to independently contribute to the final prediction. The network is illustrated in Fig. 3. The network consists of an encoding and a decoding branch. In the encoding branch, the Inception-v4 architecture was adopted with weights pretrained on ImageNet [38]. Features were extracted at four scales (down-sampled compared to the original image by factors of 4, 8, 16, and 32). The features were then transferred to the decoding branch via skip connections. To match their spatial sizes, they were upsampled by factors of 1, 2, 4, and 8. The resulting features were aggregated using addition and further upsampled to match the image size. The number of output channels was set to 1 and passed through a sigmoid function to generate a value between 0 and 1. This value was thresholded to achieve the final segmentation.

The model was trained using a soft Dice loss function, which was introduced as an optimization objective in biomedical segmentation applications [39, 40]. The formulation in [39] was used:

$$\begin{aligned} \begin{aligned} L_{\mathrm {DICE}}(\hat{P}, P) = 1 - \frac{2 \sum \limits _{x,y} P(x,y) \hat{P}(x,y) +\epsilon }{\sum \limits _{x,y} (P(x,y)+\hat{P}(x,y)) +\epsilon } \end{aligned} \end{aligned}$$

(10)

with $\epsilon$ set to 1 where $\epsilon$ was introduced for numerical stability and P and $\hat{P}$ correspond to the target and predicted proximity map, respectively. A segmentation binary mask was generated by applying a cut-off on the resulting proximity mask, $\hat{P}(x,y) \ge p_{thr}$ where $p_{thr} \in [0,1]$.

The regression model was trained using patches extracted from the images and corresponding masks. We applied the sliding window approach with a patch size of 512 pixels and a stride of 480 to permit overlapping patches. Only patches with annotated MCs present were considered. From INbreast cases, 1045 patches were extracted from the training set and 329 from the validation set. From our local dataset, 252 patches from the training set were extracted.

The mask patches were transformed using the proximity function map Eq. (7). We set $\xi = \{ 6,8,10,12\}$ for the characteristic distance and $\alpha = \{ -1,-2, 10^{-4}, 1,2\}$ for the decay rate. The proximity function Eq. (7) is not well defined when $\alpha =0$.

Data augmentation was performed to enrich the training set by randomly applying horizontal flipping, magnification, spatial translations in both directions, cropping, contrast enhancement, brightness adjustment, and gamma correction (details are given in supplementary materials). The resulting patches are 320x320 pixels in size. The soft Dice loss was used to compute the error between target and predicted proximity functions Eq. (10). The model was trained for 40 epochs using the adaptive moment estimation (Adam) optimization method [41] with mini-batch size 8, learning rate $10^{-4}$, $\beta _1 = 0.9$, $\beta _2 = 0.999$ and $\epsilon =10^{-8}$. At the end of each epoch, the model was evaluated on the image patches of the INbreast validation set, with the average IoU per patch as the metric. The model achieving the highest IoU over all epochs was kept. The Inception-v4 weights were initialized with weights pretrained on ImageNet. The rest of the model weights were initialized randomly following He initialization [42]. The configuration $\xi = 10$, $\alpha =1$ achieved the highest performance for our approach on the validation set. We updated our model to work with local data by training the model with an additional 40 epochs using a combination of patches from the INbreast and local datasets.

Output Generation

Blob segmentation detects bright objects, whereas the regression CNN outputs a mask of relevant MC regions. The intersection of these outputs results in the final set of detected MCs and their segmentations. We retained the Hessian DoG objects that overlap $> o_{\mathrm {thr}}$ with the CNN region mask, where $o_{\mathrm {thr}}$ is a tunable parameter, representing percentage overlap.

Comparison Methods

We compared our approach against two state-of-the-art methods. For the MC detection task, our approach was compared to the paper by Wang and Yang [9], which used two subnetworks, one focusing on the local features and the other on features extracted from the background tissue around the location. They reported a detection performance of 80% true positive fraction (TPF) at a false positive rate of 1.03 FPs/cm². We implemented their context-sensitive deep neural network, classifying a location as MC or non-MC, training our implementation on the INbreast dataset. We implemented DoG based on their reported parameters adjusting the scales to the resolution of our dataset.

For the MC segmentation task, our approach was compared to Ciecholewski [24], where MC segmentation was performed using morphological operations. In the first step, morphological operators were applied to the original image to detect the MCs’ locations. Specifically, a morphological pyramid was generated using the closing-opening filter. Differences in the pyramid representations of the original image were obtained and combined using the extended maximum of the original image and morphological reconstruction. In the second step, the MC shapes were extracted using watershed segmentation, where the output of the first step was utilized as a marker.

Evaluation Metrics

The segmentation performance was assessed using Intersection over the Union (IoU). We defined IoU per object as the averaged IoU between each reference annotation object and the object with the most overlap within the prediction mask^{Footnote 2}. The mean IoU between the background and the positive MC class per image was computed to evaluate the image-wise segmentation. The IoU per MC object was measured to examine the performance of segmenting individual MCs.

The detection performance of our approach was evaluated using Free-Response Operating Characteristic (FROC) analysis, similar to prior work [9, 12]. In FROC analysis, the true positive rate (TPR) was plotted against false positive detections per image unit area (cm²). The analysis required the definition of localization rules to determine true positives. We defined a detected object as a true positive if its distance from a ground truth object was at most 5 pixels (0.35 mm)^{Footnote 3} or if it demonstrated an IoU value of at least 0.3 with a ground truth object.

Results

Training, Validation, and Test Sets

The INbreast dataset was partitioned into a training set with 51 cases (173 images), a validation set with 17 cases (56 images), and a test set with 18 cases (65 images). The local dataset was partitioned into a training set for fine-tuning the model with 112 images and a held-out test set with 29 images. We used the INbreast validation set to fine-tune our approach and the INbreast test set to assess the performance. Cases were kept independent (i.e., all images from an individual case were included within the same subset) to avoid potential bias.

Model Selection and Optimization

We evaluated our model across hyperparameters using the INbreast validation set. The FROC analysis is presented in Table 1, and the mean IoU per image and IoU per object are summarized in Table 2. For the FROC analysis, 100 bootstrap samples were used to find the partial area under the curve (pAUC) in each experiment. The pAUC was computed for the range between 0 and 1 FPs per unit area, and the 95% confidence interval was reported. For the computation of the segmentation metrics, a threshold on the predicted proximity function was applied. To determine the optimal threshold for each experiment, we referred to the corresponding FROC curve and found the point closest to a TPR of 1 and a false positive per unit area of 0. All configurations performed similarly in terms of the FROC analysis and segmentation metrics. We chose the model with the highest mean value of the FROC pAUC with $\xi =10$ and $\alpha =1$. We set $\sigma _{\min } = 1.18$, $\sigma _{\max }=3.1$, overlapping fraction $O_\mathrm {DoG}=1$, DoG threshold $T_\mathrm {DoG}=0.006$ and Hessian threshold $h_\mathrm { thr}=1.4$. To optimize the final output, we examined $o_\mathrm {thr} = \{0.2, 0.3, 0.4, 0.5, 0.6\}$ to determine the overlap threshold. Setting $o_\mathrm {thr}=0.3$ achieved the highest performance on the validation set.

Table 1 Detection performance for different regression models on the validation set. The highest pAUC is bolded

Full size table

Table 2 Segmentation results of different regression models on the validation set. The highest IoUs are bolded

Full size table

Detection and Segmentation Results

Figure 5 compares the detection performance between our method and the method of Wang and Yang [9]. The true positive detection rate on the y-axis was plotted against the false positive counts per unit area (1 $cm^2$). The FROC analysis was performed on the INbreast validation and test sets. Our method achieved FROC pAUC 0.819 ± 0.046 with a TPR of 0.852 at 0.4 false positives per unit area on the validation set. On the test set, the FROC pAUC was 0.697 ± 0.078 with a TPR of 0.744 at 0.4 false positives per unit area. In comparison, our implementation of [9] achieved FROC pAUC 0.703 ± 0.057 and 0.581 ± 0.072 in the validation and test sets, respectively.

Figure 6 shows the detection performance of our approach on the local dataset. We also compared the model’s performance trained solely on INbreast data and fine-tuned on local data. The performance of the two models was comparable since the original model achieved 0.313 ± 0.109 FROC pAUC, and the fine-tuned model achieved 0.420 ± 0.107. However, for the range of 0.2 to 0.6 FPs per unit area, the fine-tuned model outperformed the original based on TPR.

Table 3 reports the segmentation performance of our approach on the INbreast validation and test sets. For comparison, the segmentation results of the morphological method of Ciecholewski (see the Sect. “Comparison Methods’’ ) are also presented. Based on the paired Wilcoxon signed-rank test, we achieved superior performance in both mIoU per image and IoU per object for both subsets with $p<0.01$. Figure 4 presents a sampling of model outputs.

Table 3 Segmentation Results of Final Model on Validation and Test Sets

Full size table

Case Study: Identifying Breast Cancers Among Amorphous Calcifications

We present a case study that utilized features computed from regions segmented by our method to classify whether MCs identified as amorphous were benign or malignant. We compared the predictive value of these features with those computed from regions delineated by another “baseline” method [24].

Data

The local magnification view dataset consisted of diagnostic exams performed at our institution between 2017 and 2019. In particular, 284 mammographic cases with biopsied amorphous MCs were selected. The cases were chosen such that for the same case and laterality (left/right breast), all biopsy results were either benign or malignant (high-risk results were omitted). The cases corresponded to a total of 642 diagnostic images (318 ML/MLO/LM and 324 CC). A doctoral-trained researcher annotated the most suspicious regions, providing the regions of interest (ROIs) as bounding boxes. A board-certified radiologist validated these ROIs. In total, 674 ROIs were provided, i.e., 612 images with one ROI, 28 images with two ROIs, and 2 images with three ROIs. Incorporating the pathology information, 390 (57.9%) ROIs were benign and 284 (42.1%) malignant. To eliminate the annotation bias associated with the size and shape of ROIs, we transformed the bounding boxes of ROIs to a fixed size. The mean ROI height and width were 222 and 256 pixels, respectively. Therefore, we decided to transform each bounding box to a 256x256 pixel size retaining its center location. Another reason for adopting fixed-sized ROIs was to make the classification model focus on segmentation-related features.

MC Segmentation

MCs were segmented using two different methods:

1.
Our Approach: For the regression network, the FPN was trained on both local and INbreast data (see the Sect. “Data’’). For the Hessian DoG blob segmentation, we fine-tuned the parameters mentioned in Sect. “Blob Segmentation’’ to achieve the best classification performance.
2.
We used the method developed by Ciecholewski [24] as the “baseline” segmentation method, described in the Sect. “Comparison Methods’’.

Feature Extraction

Upon segmentation of the ROIs, relevant features were extracted (n=31). The extracted features are categorized into two main groups: (1) regional features describing all ROI MCs as a whole and (2) individual MC features. The regional features were: the area of the foreground (all MCs), the area of the convex hull enclosing all MCs, major and minor axis length, the orientation of the major axis with respect to the horizontal line, eccentricity, solidity, moments of inertia (n=2), Hu moments (n=7) and number of MCs. The MCs were also described individually by their area, major and minor axis length, maximum, minimum, and mean intensity within, and eccentricity. Individual MC features were statistically aggregated using mean and standard deviation per ROI.

Classification

Features were inputted into a gradient tree boosting classifier. Gradient boosting generates weak predictive models, i.e., in our case, decision trees, which are enhanced in each iteration, targeting residual errors, and are linearly combined to give the final model. We applied fivefold cross-validation to train and test our task. Partitioning the data into folds was performed patient-wise. For each training and test split, the data were pre-processed using imputation by mean value followed by standardization (i.e., subtracting the mean and scaling to unit variance). Imputation was needed for cases where the segmentation was empty. The parameters for both imputation and standardization were derived from the training set and applied to both training and test sets.

Evaluation

Classification was performed on different sets of features derived from different MC segmentation techniques. ROC analysis was performed to evaluate each model, and classification metrics were obtained, i.e., ROC AUC, accuracy, sensitivity, specificity, and positive predictive value. The mean and standard deviation of each metric was computed across folds.

The parameters of each segmentation method were fine-tuned, and the configurations that achieved the highest ROC AUC were kept (details are given in Supplementary Information). The best models obtained using our method and baseline segmentations were compared using ROC analysis. Figure 7 depicts the ROC curves of each model. Other classification metrics are presented in Table 4. We deliberately chose operating thresholds to achieve high sensitivity (close to 90%) for both models. The model trained on features generated from our method’s segmentations achieved superior values for all other classification metrics.

In Fig. 8, we compared the performance of our approach with respect to its components: FPN and Hessian DoG blob segmentation. Although the performance of blob segmentation was low (AUC=0.562 ± 0.056), the combination of blob segmentation and FPN resulted in the highest performance (AUC=0.763 ± 0.056).

Table 4 Fivefold cross-validation classification metrics compared between our method and baseline segmentations. Mean values and standard deviations across folds are presented. The threshold is chosen to achieve a sensitivity closest to 0.9

Full size table

Discussion

We presented an approach that combines DoG with Hessian analysis and dense regression to achieve precise MC segmentation in 2D digital mammograms. To our knowledge, this is one of the first works applying a fully convolutional architecture for MC segmentation, which permits concurrent prediction on multiple adjacent locations. The method was trained and validated on 435 mammograms from two separate datasets. The results show that our method outperforms comparable approaches that have been recently published. In the FROC analysis using the INbreast dataset, our method achieves a TPR of 0.744 at false positives per unit area of 0.4 in comparison with a TPR of 0.618 at the same level of false positives as what is reported in [9]. On the segmentation task, our approach achieves a mean IoU per image of 0.670 and IoU per object of 0.607 compared to 0.524 mean IoU per image and 0.363 IoU per object for the morphological approach presented in [24]. The addition of local data, even when coarsely annotated by a human reader, improved the performance of our method. The ability to utilize a mixture of annotations, ranging from precise segmentations of larger calcifications to point estimates representing the centroid of smaller calcifications, is a strength of our approach. As an indirect validation of our method, we conducted segmentation-based MC malignancy classification. In this downstream task, our method outperformed the baseline segmentation method with a ROC AUC of 0.763 versus 0.710. Also, our approach demonstrated incremental performance in terms of its constituents (i.e., the Hessian DoG blob segmentation and the regression model). We also showed the ability to generalize our approach to other mammographic views (e.g., magnification views).

While we achieved a lower number of false positives than other approaches, the overall number of false positives per image is still high. Our approach would benefit from a false positive reduction step. Most false positives occur near larger calcifications and correspond to more irregular shapes than actual MCs. The irregular detection can be attributed to the regression model, designed to segment regions containing calcifications. In the case of larger calcifications, the segmented regions span larger areas, increasing the likelihood of retaining false positive objects. Additional filtering based on size and shape criteria in areas where large calcifications are identified could lead to a substantial false positive reduction. Human annotation and confirmation of every MC on an image is an impractical task, and algorithms should emphasize identifying MCs that are at the highest risk of being associated with cancerous lesions. Moreover, our algorithm likely identified MCs missed by human readers, inflating the false positive count. Our approach also under-segments or over-segments in certain scenarios. Undersegmentation occurs most often in large objects due to: (1) interior regions of objects having lower intensities that are omitted and (2) incorrect delineation of boundaries due to subtle contrast differences between the MC and surrounding tissue. Nevertheless, large calcifications are typically considered benign and not clinically significant. Their undersegmentation will not affect quantitative features that may predict invasive cancers. Oversegmentation occurs primarily when bright objects identified with Hessian DoG are close together and erroneously combined into a single object when only part corresponds to an actual MC.

Several limitations of our approach exist. Labeling all MCs in full-field mammograms is time-consuming and prone to human error and inter-annotator variability. Hence, our work is limited by the dataset size and variations in how MCs are annotated, ranging from point-like annotations to detailed contours. Using a proximity function to reflect the uncertainty associated with MC annotations makes our approach robust to training data variations. Moreover, the inherent differences in mammograms acquired with equipment manufactured by different vendors present another challenge. The local dataset was obtained using equipment manufactured by Hologic, whereas the public dataset INbreast was obtained using Siemens equipment. The brightness and contrast levels of the images varied substantially between manufacturers. Given that the INbreast dataset had four times as many cases as the local dataset, our model was fine-tuned with a limited number of training patches. The method was trained and evaluated using existing data prone to selection bias, making the model susceptible to underspecification. Moreover, with the increased adoption of digital breast tomosynthesis, our method has not yet been evaluated on these scans. Ongoing work includes annotating additional cases from our institution that would allow us to fine-tune the model further and experiment with different training strategies to improve the generalizability of our approach.

Conclusions

We described a new quantitative approach for MC segmentation based on blob segmentation and dense regression. We showed that our method performs better than state-of-the-art MC segmentation and detection methods. In our case study, we evaluated the effect of the segmentation method on computed quantitative image features and classification performance. Our results suggested that our method has the potential to segment calcifications on a variety of images (FFDMs, magnification views) with minimal fine-tuning. Moreover, our method exhibited better performance than features generated using a comparison segmentation method. The case study also demonstrated the potential of quantitative characterization of MCs in improving the management of women with amorphous calcifications. Shape, intensity, and texture features can be extracted from individually segmented MCs to yield quantitative descriptors of MC morphology and distribution. While further studies are needed to evaluate the PPV₃ and reproducibility of our quantitative features, these features, enabled by accurate segmentation of MCs, may provide a basis for reducing false positives and unnecessary biopsies.

Data Availability

Our code (including trained models) is publicly available here: https://github.com/cmarasinou/HDoGReg.

Notes

PPV₃ is the proportion of cases that underwent biopsy due to abnormal breast imaging findings which resulted in a breast cancer diagnosis.
MC objects annotated by a single pixel were disregarded in computing IoU per MC object. IoU is not well defined in such cases.
Centroids of individual objects were used in computing their distance.

References

Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A (2018) Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 68(6):394–424 https://doi.org/10.3322/caac.21492
Article PubMed Google Scholar
Marmot MG, Altman DG, Cameron DA, Dewar JA, Thompson SG, Wilcox M, on Breast Cancer Screening TIUP (2013) The benefits and harms of breast cancer screening: an independent review. Br J Cancer 108(11):2205–2240. https://doi.org/10.1038/bjc.2013.177
Euler-Chelpin Mv, Lillholm M, Napolitano G, Vejborg I, Nielsen M, Lynge E (2018) Screening mammography: Benefit of double reading by breast density. Breast Cancer Res Treat 171(3):767–776, https://doi.org/10.1007/s10549-018-4864-1
Article Google Scholar
Kemp Jacobsen K, O’Meara ES, Key D, SM Buist D, Kerlikowske K, Vejborg I, Sprague BL, Lynge E, von Euler-Chelpin M (2015) Comparing sensitivity and specificity of screening mammography in the united states and denmark. Int J Cancer 137(9):2198–2207. https://doi.org/10.1002/ijc.29593
Farshid G, Sullivan T, Downey P, Gill PG, Pieterse S (2011) Independent predictors of breast malignancy in screen-detected microcalcifications: Biopsy results in 2545 cases. Br J Cancer 105(11):1669–1675, https://doi.org/10.1038/bjc.2011.466
Article CAS PubMed PubMed Central Google Scholar
Cox RF, Hernandez-Santana A, Ramdass S, McMahon G, Harmey JH, Morgan MP (2012) Microcalcifications in breast cancer: Novel insights into the molecular mechanism and functional consequence of mammary mineralisation. Br J Cancer 106(3):525–537, https://doi.org/10.1038/bjc.2011.583
Article CAS PubMed PubMed Central Google Scholar
Hofvind S, Iversen BF, Eriksen L, Styr BM, Kjellevold K, Kurz KD (2011) Mammographic morphology and distribution of calcifications in ductal carcinoma in situ diagnosed in organized screening. Acta Radiol 52(5):481–487, https://doi.org/10.1258/ar.2011.100357
Article PubMed Google Scholar
Sickles EA, D’Orsi CJ, Bassett LW, Appleton CM, Berg WA, Burnside ES, et al. (2013) ACR BI-RADS® ATLAS: Breast Imaging Reporting & Data System, Part I, vol 5. American College of Radiology Reston, VA
Google Scholar
Wang J, Yang Y (2018) A context-sensitive deep learning approach for microcalcification detection in mammograms. Pattern Recognition 78:12 – 22. https://doi.org/10.1016/j.patcog.2018.01.009
Wilkinson L, Thomas V, Sharma N (2017) Microcalcification on mammography: Approaches to interpretation and biopsy. Br J Radiol 90(1069). https://doi.org/10.1259/bjr.20160594
Dengler J, Behrens S, Desaga JF (1993) Segmentation of microcalcifications in mammograms. IEEE Trans Med Imaging 12(4):634–642, https://doi.org/10.1109/42.251111
Article CAS PubMed Google Scholar
El-Naqa I, Yang Y, Wernick MN, Galatsanos NP, Nishikawa RM (2002) A support vector machine approach for detection of microcalcifications. IEEE Trans Med Imaging 21(12):1552–1563, https://doi.org/10.1109/TMI.2002.806569
Article PubMed Google Scholar
Netsch T, Peitgen HO (1999) Scale-space signatures for the detection of clustered microcalcifications in digital mammograms. IEEE Trans Med Imaging 18(9):774–786, https://doi.org/10.1109/42.802755
Article CAS PubMed Google Scholar
Oliver A, Torrent A, Lladó X, Tortajada M, Tortajada L, Sentís M, Freixenet J, Zwiggelaar R (2012) Automatic microcalcification and cluster detection for digital and digitised mammograms. Knowl Based Syst 28:68–75, https://doi.org/10.1016/j.knosys.2011.11.021
Article Google Scholar
Yoshida H, Doi K, Nishikawa RM (1994) Automated detection of clustered microcalcifications in digital mammograms using wavelet processing techniques. Medical Imaging 1994: Image Processing 2167(May 1994):868–886, https://doi.org/10.1117/12.175126
Alam N, Denton ER, Zwiggelaar R (2019) Classification of microcalcification clusters in digital mammograms using a stack generalization based classifier. J Imaging 5(9), https://doi.org/10.3390/jimaging5090076
Bria A, Karssemeijer N, Tortorella F (2014) Learning from unbalanced data: A cascade-based approach for detecting clustered microcalcifications. Med Image Anal 18(2):241–252, https://doi.org/10.1016/j.media.2013.10.014
Article CAS PubMed Google Scholar
Cai H, Huang Q, Rong W, Song Y, Li J, Wang J, Chen J, Li L (2019) Breast microcalcification diagnosis using deep convolutional neural network from digital mammograms. Comput Math Methods Med 2019:1–10, https://doi.org/10.1155/2019/2717454
Article Google Scholar
Chen Z, Strange H, Oliver A, Denton ER, Boggis C, Zwiggelaar R (2015) Topological modeling and classification of mammographic microcalcification clusters. IEEE Trans Biomed Eng 62(4):1203–1214, https://doi.org/10.1109/TBME.2014.2385102
Article PubMed Google Scholar
Strange H, Chen Z, Denton ER, Zwiggelaar R (2014) Modelling mammographic microcalcification clusters using persistent mereotopology. Pattern Recognit Lett 47:157–163, https://doi.org/10.1016/j.patrec.2014.04.008
Article Google Scholar
Regentova E, Zhang L, Zheng J, Veni G (2007) Microcalcification detection based on wavelet domain hidden Markov tree model: Study for inclusion to computer aided diagnostic prompting system. Med Phys 34(6):2206–2219, https://doi.org/10.1118/1.2733800
Article PubMed Google Scholar
Strickland R, Hee Il Hahn (2002) Wavelet transforms for detecting microcalcifications in mammograms. IEEE Trans Med Imaging 15(2):218–229, https://doi.org/10.1109/42.491423
Article Google Scholar
Betal D, Roberts N, Whitehouse GH (1997) Segmentation and numerical analysis of microcalcifications on mammograms using mathematical morphology. Br J Radiol 70(SEPT.):903–917, https://doi.org/10.1259/bjr.70.837.9486066
Ciecholewski M (2017) Microcalcification segmentation from mammograms: A morphological approach. J Digit Imaging 30(2):172–184, https://doi.org/10.1007/s10278-016-9923-8
Article PubMed Google Scholar
Halkiotis S, Mantas J (2002) Automatic detection of clustered microcalcifications in digital mammograms. Stud Health Technol Inform 90:24–29, https://doi.org/10.3233/978-1-60750-934-9-24
Article PubMed Google Scholar
Xu S, Liu H, Song E (2011) Marker-controlled watershed for lesion segmentation in mammograms. J Digit Imaging 24(5):754–763, https://doi.org/10.1007/s10278-011-9365-2
Article PubMed PubMed Central Google Scholar
Heng-Da Cheng, Yui Man Lui, Freimanis R (2002) A novel approach to microcalcification detection using fuzzy logic technique. IEEE Trans Med Imaging 17(3):442–450, https://doi.org/10.1109/42.712133
Article Google Scholar
Kainz P, Urschler M, Schulter S, Wohlhart P, Lepetit V (2015) You should use regression to detect cells. In: Navab N, Hornegger J, Wells WM, Frangi AF (eds) Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015, Springer International Publishing, Cham, pp 276–283, https://doi.org/10.1007/978-3-319-24574-4_33
Xie Y, Xing F, Shi X, Kong X, Su H, Yang L (2018) Efficient and robust cell detection: A structured regression approach. Med Image Anal 44:245 – 254, https://doi.org/10.1016/j.media.2017.07.003
Article PubMed Google Scholar
Meyer MI, Galdran A, Mendonça AM, Campilho A (2018) A pixel-wise distance regression approach for joint retinal optical disc and fovea detection. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 11071 LNCS:39–47, https://doi.org/10.1007/978-3-030-00934-2_5
van Wijnen KMH, Dubost F, Yilmaz P, Ikram MA, Niessen WJ, Adams H, Vernooij MW, de Bruijne M (2019) Automated lesion detection by regressing intensity-based distance with a neural network. In: Shen D, Liu T, Peters TM, Staib LH, Essert C, Zhou S, Yap PT, Khan A (eds) Medical Image Computing and Computer Assisted Intervention – MICCAI 2019, Springer International Publishing, Cham, pp 234–242, https://doi.org/10.1007/978-3-030-32251-9_26
Moreira IC, Amaral I, Domingues I, Cardoso A, Cardoso MJ, Cardoso JS (2012) Inbreast: Toward a full-field digital mammographic database. Acad Radiol 19(2):236 – 248, https://doi.org/10.1016/j.acra.2011.09.014
Article PubMed Google Scholar
Lindeberg T (1998) Feature detection with automatic scale selection. Int J Comput Vis 30(2):79–116, https://doi.org/10.1023/A:1008045108935
Article Google Scholar
Frangi AF, Niessen WJ, Vincken KL, Viergever MA (1998) Multiscale vessel enhancement filtering. In: Wells WM, Colchester A, Delp S (eds) Medical Image Computing and Computer-Assisted Intervention — MICCAI’98, Springer Berlin Heidelberg, Berlin, Heidelberg, pp 130–137, https://doi.org/10.1007/BFb0056195
Lin T, Dollár P, Girshick R, He K, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 936–944
Szegedy C, Ioffe S, Vanhoucke V, Alemi AA (2017) Inception-v4, Inception-ResNet and the impact of residual connections on learning. 31st AAAI Conference on Artificial Intelligence, AAAI 2017 pp 4278–4284, 1602.07261
Ronneberger O, Fischer P, Brox T (2015) U-net: Convolutional networks for biomedical image segmentation. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 9351:234–241, https://doi.org/10.1007/978-3-319-24574-4_281505.04597
Yakubovskiy P (2020) Segmentation models. https://github.com/qubvel/segmentation_models.pytorch (accessed 1 November 2020)
Drozdzal M, Vorontsov E, Chartrand G, Kadoury S, Pal C (2016) The importance of skip connections in biomedical image segmentation. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 10008 LNCS:179–187, https://doi.org/10.1007/978-3-319-46976-8_191608.04117
Milletari F, Navab N, Ahmadi SA (2016) V-Net: Fully convolutional neural networks for volumetric medical image segmentation. Proceedings - 2016 4th International Conference on 3D Vision, 3DV 2016 pp 565–571, https://doi.org/10.1109/3DV.2016.791606.04797
Kingma DP, Ba JL (2015) Adam: A method for stochastic optimization. 3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings pp 1–15, 1412.6980
He K, Zhang X, Ren S, Sun J (2015) Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. In: Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), IEEE Computer Society, USA, ICCV ’15, p 1026-1034, https://doi.org/10.1109/ICCV.2015.123

Download references

Acknowledgements

This work was supported in part by the National Science Foundation under Grant No. 1722516 and the UCLA Department of Radiological Sciences through the generous support of the Iris Cantor Foundation. Access to data was supported by the National Center for Advancing Translational Science (NCATS) of the National Institutes of Health under the UCLA Clinical and Translational Science Institute grant number UL1TR001881 and the Integrated Diagnostics (IDx) Breast program. We gratefully acknowledge the support of NVIDIA Corporation with the donation of the Titan Xp GPU used for this research. Also, we would like to acknowledge the Breast Research Group at the University of Porto for making the INbreast dataset available.

Funding

Chrysostomos Marasinou, Akinyinka Omigbodun, Noor Nakhaei, and William Hsu were supported by National Science Foundation Grant No. 1722516. Akinyinka Omigbodun received support from the UCLA Department of Radiological Sciences through the generous support of the Iris Cantor Foundation.

Author information

Authors and Affiliations

Medical & Imaging Informatics, Department of Radiological Sciences, David Geffen School of Medicine at UCLA, 924 Westwood Blvd, Ste 420, Los Angeles, 90024, USA
Chrysostomos Marasinou, Akinyinka Omigbodun & William Hsu
Department of Radiological Sciences, David Geffen School of Medicine at UCLA, Los Angeles, 90095, CA, USA
Bo Li, Jeremy Paige & Anne Hoyt
Department of Computer Science, UCLA Samueli School of Engineering, Los Angeles, 90095, CA, USA
Noor Nakhaei

Authors

Chrysostomos Marasinou
View author publications
You can also search for this author in PubMed Google Scholar
Bo Li
View author publications
You can also search for this author in PubMed Google Scholar
Jeremy Paige
View author publications
You can also search for this author in PubMed Google Scholar
Akinyinka Omigbodun
View author publications
You can also search for this author in PubMed Google Scholar
Noor Nakhaei
View author publications
You can also search for this author in PubMed Google Scholar
Anne Hoyt
View author publications
You can also search for this author in PubMed Google Scholar
William Hsu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Chrysostomos Marasinou: conceptualization, methodology, software, validation, formal analysis, data curation, writing — original draft, visualization. Bo Li: conceptualization, data curation, writing — review and editing. Jeremy Paige: conceptualization, data curation, writing — review and editing. Akinyinka Omigbodun: conceptualization, writing — review and editing. Noor Nakhaei: conceptualization, writing — review and editing. Anne Hoyt: conceptualization, writing — review and editing, funding acquisition. William Hsu: conceptualization, methodology, resources, writing — review and editing, supervision, project administration, funding acquisition.

Corresponding author

Correspondence to William Hsu.

Ethics declarations

Ethics Approval

This retrospective study followed an IRB-approved protocol under a waiver of consent.

Consent to Participate

Not applicable

Consent for Publication

Not applicable

Conflict of Interest

Chrysostomos Marasinou, Bo Li, Jeremy Paige, Akinyinka Omigbodun, Noor Nakhaei, and Anne Hoyt do not have competing interests to declare. William Hsu was a recipient of a research grant from Siemens Medical Solutions unrelated to this work.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 518 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Marasinou, C., Li, B., Paige, J. et al. Improving the Quantitative Analysis of Breast Microcalcifications: A Multiscale Approach. J Digit Imaging 36, 1016–1028 (2023). https://doi.org/10.1007/s10278-022-00751-3

Download citation

Received: 23 November 2021
Revised: 04 December 2022
Accepted: 06 December 2022
Published: 23 February 2023
Issue Date: June 2023
DOI: https://doi.org/10.1007/s10278-022-00751-3

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Improving the Quantitative Analysis of Breast Microcalcifications: A Multiscale Approach

Abstract

Similar content being viewed by others

Improved automated early detection of breast cancer based on high resolution 3D micro-CT microcalcification images

False Positive Reduction by an Annular Model as a Set of Few Features for Microcalcification Detection to Assist Early Diagnosis of Breast Cancer

Segmentation and Classification of Microcalcifications Using Digital Mammograms

Introduction

Materials and Methods

Data

INbreast Dataset

Local Dataset

Local Magnification View Dataset

Overall Approach

Blob Segmentation

Regression Convolutional Neural Network

Output Generation

Comparison Methods

Evaluation Metrics

Results

Training, Validation, and Test Sets

Model Selection and Optimization

Detection and Segmentation Results

Case Study: Identifying Breast Cancers Among Amorphous Calcifications

Data

MC Segmentation

Feature Extraction

Classification

Evaluation

Discussion

Conclusions

Data Availability

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics Approval

Consent to Participate

Consent for Publication

Conflict of Interest

Additional information

Publisher’s Note

Supplementary Information

Supplementary file1 (DOCX 518 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation