1 Introduction

Although limited, mammography proves useful as a screening tool for early detection of breast cancer. The effectiveness of diagnosis of some radiological signs of cancer is moderate, especially in case of architectural distortions (ADs) and subtle spiculated masses (SMs) [1]. These pathologies are commonly misdiagnosed even by the most experienced radiologists. Hence, computer-aided diagnosis (CAD) systems have been undergoing improvements for years due to significantly lower efficiency in ADs detection compared to high-efficiency detection of both masses and calcifications. 56% sensitivity of detection was achieved for subtle SMs, whereas typical SMs were computer-detected with 86% sensitivity at 0.24 false positives per image (FPI) [2]. Simultaneously, two commercial CAD systems (R2 Image Checker [3] and Second Look CADx [4]) detect ADs with low sensitivity being less than 40% at approx. 1.0 FPI [2]. Moreover, the detection efficiency of pathological findings with the use of CAD systems decreases with increasing breast tissue density [5]. Both SMs and ADs are commonly misdiagnosed even by the most experienced radiologists in screening mammography. This is a problem in early detection of pathologies and, consequently, in full recovery from breast cancer. Spiculated findings in mammograms are manifested by radiating straight line structures in various directions from the hypothetical center. The lack of stable patterns of both normal and abnormal breast tissues tends to have an adverse effect on automatic detection of subtle spiculated lesions. However, local directional distribution of spiculated structures tends to be a dominant distinguishing factor for both ADs and SMs in computer-aided detection methods, based on the review of the literature.

Instead of a statistical texture analysis in the image domain, an analysis of spicule distribution in scalable transform domains was proposed, including wavelets [6] and their extensions to 2D kernels, complex domains and optimized filter set-based implementations [7,8,9,10,11]. The scalability of transforms makes it possible to obtain sparse representation which is more flexible for studying the dependence of the specific features of the spiculated pathologies (ADs and SMs) at different scales, as well as for determination of scale-invariant numerical descriptors. Consequently, several properties of normalized sparse representation are obtainable, i.e. signal energy compaction against energy dissipation of noise, shift (rotation) invariance, a sufficiently high directional selectivity, and low computational complexity. Therefore, the signal-adjusted tensor wavelets [6] and the extensions with non-separable 2D kernels: curvelets [7, 8], contourlets [9], and complex wavelets [10, 11] were considered due to the desired properties of content-oriented image representation. The selected set of transform domains reflects simplicity, flexibility of design, and popularity of tensor wavelets in medical imaging applications. Curvelets and contourlets increase description sparsity of smooth curved lines and multidirectional straight line structures, whereas complex wavelets assure shift invariance and an additional directional analysis in a phase domain. Nonlinear approximation of multidirectional structures was optimized in signal sparse domains. Precise determination of mammographic spicules can frequently be problematic due to their subtlety. Consequently, the radiologists’ assessments of these barely visible pathologies in mammograms are frequently unrepeatable [12].

2 Materials and methods

The main goal of the study is directional characteristics of local piecewise linear structures in image, which accurately manifest the mammographic signs of malignancy with the use of the most appropriate multiscale transform domain. For this purpose, the initial experiments were conducted with the use of the designed spicule phantom with close-lying linear structures of limited length and with a wide angular spectrum. Two criteria: angular selectivity and angular resolution, were proposed for the determination of the local multidirectional activity of piecewise linear objects, both related to the number of detected structures with at least half of their length in reconstructed images. The experiments, based on the multidirectional activity measures, were conducted with the use of a spicule phantom model with different noise levels and various density characteristics of linear structures distribution. Subsequently, the results of the phantom model-based optimization were verified on a test set of real mammograms.

The novelty of the paper method lies in all of the four components: spicule phantom model design, a proposal of directional activity measures, verification of the utility of wavelet-like transforms, and a proof of concept for the recognition of both ADs and SMs in real mammograms.

2.1 Directional activity measures

Two criteria of directional activity: angular resolution and angular selectivity, were proposed to identify the image transformation suitable for the effective (i.e. selective and discriminative) representation of specific orientation activity of piecewise linear structures. Hence for this purpose, angular resolution was specified as the minimum angular separation of two radiating lines, while the separation is perceived or computed along at least half of their length. The minimum angle of separation is modeled by a phantom of close-lying lines originating from a relatively small center with a controlled angular distance. Simultaneously, angular selectivity (also called directional resolution) was defined as the maximum number of lines or piecewise lines originating from a small center which propagate in different directions of a full angle and are perceptually or computationally separable along at least half of their length.

2.2 Spicule phantom model

Not only multiscale directional analysis, but also a shift (including rotation) invariance of image transforms, low transform-domain redundancy and low computational complexity play a role in determination of the ability to represent multidirectional structures. Selection of the optimal image transforms is based on as close as possible preservation of object orientations in transform domains in relation to original images. Considering those properties, the bases or frames of tensor wavelets, complex wavelets, contourlets and curvelets were selected to be experimentally verified, based on a spicule phantom model which contains object (line) value of 1 and slightly Gaussian blurred lines of approx. 9-pixel in width and 90-pixel in length which are estimated averaged sizes of spicules according to our previous study [13]. The angle range of line distribution was assumed as: 0\(^\circ \)–360\(^\circ \) and different center size of line convergence was adopted: the sparse phantom (Fig. 1a), the medium-density phantom (Fig. 1b), and the dense phantom (Fig. 1c). Realistic conditions for object-background relationship, based on [14], were generated with the value change of two parameters: the global contrast level and the noise level. Global contrast level (\(C_{\text{glob}}\)) was defined as follows:

$$\begin{aligned} C_{\text{glob}} = \frac{\text {mean}\big [Ph_{obj}(i,j)\big ]-\text {mean}\big [Ph_{\text{backgr}}(i,j)\big ]}{\max \big [\max \limits _{i,j}Ph_{obj}(i,j),\max \limits _{i,j}Ph_{\text{backgr}}(i,j)\big ]} \end{aligned}$$
(1)

where \(\text {mean}[Ph_{obj}(i,j)]\) is the mean pixel-value for the objects, \(\text {mean}[Ph_{obj}(i,j)]\) being the mean pixel-value for the spicule phantom background. For the phantom model with background pixel-value equal 0, \(C_{\text{glob}}\) was set to approx. 76% and decreased with the step size of approx. 1.5%, caused by increasing background pixel-value in the range of 0–0.9.

Fig. 1
figure 1

a Various forms of spicule phantom model with varying spicule density levels: sparse, medium-density, dense phantom of linear structures radiating from the convergence center of different size (left to right). b and c Various forms of spicule phantom model with multiplicative Gaussian noise. Noise level, being background density level, was defined on the basis of varying background pixel-value equal b 0.5, c 0.7, resulting in \(C_{\text{glob}}=7.5\) and 4.5, \({\text{PSNR}}=2.78\) and 1.46, respectively

Additionally, signal-dependent and zero-mean Gaussian noise \(\varepsilon \) with standard deviation \(\sigma = 1\) was added to the phantom model [14]. The phantom model with various noise level \(Ph_{\varepsilon }(i,j)\) (Fig. 1b, c) was calculated as follows:

$$\begin{aligned} Ph_{\varepsilon }(i,j) = Ph(i,j) + \varepsilon \sqrt{Ph(i,j)} \end{aligned}$$
(2)

where Ph(ij) is a spicule phantom model with structures which are diverse in terms of both the propagation density and the object–background relation. \(C_{\text{glob}}\) reached 16% for the phantom model with background pixel-value equal 0 and gradually decreased by 1.5% with increasing background pixel-value.

In this approach, the peak signal-to-noise ratio (PSNR) was defined as follows:

$$\begin{aligned} {\text{PSNR}} = 10\log {\frac{\Big (\max \limits _{i,j}{Ph(i,j)\Big )^2}}{\frac{1}{N\times M}\sum \nolimits _{1}^{N}\sum \nolimits _{1}^{M}\Big (Ph(i,j)-Ph_{\varepsilon }(i,j)\Big )^2}} \end{aligned}$$
(3)

where Ph(ij) is the appropriate spicule phantom of the selected \(C_{\text{glob}}\), with \(Ph_{\varepsilon }(i,j)\) being the phantom Ph(ij) with multiplicative noise \(\varepsilon \) and \(N\times M\) being the size of images, i.e. \(512\times 512\). PSNR decreased from near 12–0.5dB.

With such assumptions, both different size of pathological findings and the diversified density of breast tissue were simulated with the use of increasing background pixel-value and noise level in relation to the analyzed objects.

2.3 Directional activity of multiscale wavelet-like domains

It is challenging to objectively characterize and reliably compare both angular resolution and angular selectivity of competitive multiscale domains. It is probably caused by a number of diverse factors characterizing discrete transforms, frequently dependent on others, i.e. variable redundancy of transform domains, energy compaction, susceptibility to aliasing, limited invariance of structure representation, variable numbers of directional subbands, homogeneity of angular resolution, etc. Therefore, an initial experiment, based on noisy spicule phantom model, was conducted to assess the directional activity of multiscale transform domains. For this purpose, the MATLAB toolboxes of four selected image transform domains were used and both the type of filters and the number of scales and appropriate subbands of decomposition were adjusted to effectively extract the modeled spicules from the various background. Three types of image background were presented in this paper as a simulation of breast tissue: fat tissue, fat-glandular tissue and glandular tissue, respectively. As a result, there were computed:

  • 2-dimensional tensor wavelet transform [6] for three scales and three subbands with low-pass orthonormal quadrature mirror filters (QMF), called Battle-Lemarie, which generate spline orthogonal wavelet basis, the degree of the spline being 5 [15],

  • contourlet transform [9] for five scales, where the vector of numbers of directional filter bank decomposition levels at each of the five pyramidal levels (from coarse to fine scale) was determined as follows: \([2^0, 2^0, 2^0, 2^3, 2^4]\), and the 9-7 filter was used for the pyramidal decomposition step, whereas the pkva filter was used for the directional decomposition step [16];

  • complex-valued curvelet transform [7] for four scales, including the coarsest wavelet level, and three directional scales, where the coarsest scale was divided into eight angular ranges [7];

  • complex wavelet transform with six levels of decomposition, near-symmetric (13,19)-tap filters as biorthogonal filters and Q-Shift orthogonal (10,10)-tap filters [10, 11]; six complex highpass subimages for each of all levels were generated as the result of the transformation.

The main aim of the present paper was to verify the ability of wavelet-like transforms to diversify multidirectional structures. Based on the four decomposition settings and the two proposed measures of directional activity, the number of noticeable model-structures in the reconstructed images corresponds to the angular selectivity, whereas the analysis of degree of structure compression indicates the angular resolution.

3 Results and discussion

Our research was divided into two parts. Firstly, an attempt was made to verify the repeatability of linear structure orientations in the reconstructed images with the use of tensor wavelets, contourlets, curvelets, and complex wavelets. Based on the experiments with the spicule phantom model with no added noise (Fig. 1a), but with nonlinear approximation of transformation coefficients (with the criterion of its absolute value order) it may be noticed that most of the analyzed transformations distort orientations of spicule phantom model (Figs. 2, 3). These distortions are negligible and diagnostically unimportant merely in the case of the complex wavelet domains (the least significant deviations from the original structure directions). Additionally, the most accurate representation of structures was achieved in the complex wavelet domain. Accurate representation was construed as a smooth structure reconstruction, maintaining precise orientation of the analyzed structures (Figs. 2, 3). Continuous nature of the multidirectional piecewise linear structures was best preserved in the complex wavelet domain. Moreover, the angular selectivity of the analyzed transforms varies depending on the propagation density of the structures. The higher the propagation density was, the lower the angular selectivity was in all of the analyzed domains.

Fig. 2
figure 2

a The reconstructed images based on \(n=200\): tensor wavelet, contourlet, curvelet, and complex wavelet coefficients of the highest amplitude values for the medium-dense spicule phantom model, b the results of detection with the use of binarization of the reconstructed images with two thresholds: \(t_{b}=0.5\), and \(t_{ds}=0.5l_{obj}\), \(l_{obj}\)-linear structure length. The unidentified objects were color-marked (color figure online)

Fig. 3
figure 3

a The reconstructed images based on \(n=200\): tensor wavelet, contourlet, curvelet, complex wavelet coefficients of the highest amplitude values for the dense spicule phantom model, b the detection results with the use of binarization of the reconstructed images with two thresholds: \(t_{b}=0.5\), and \(t_{ds}=0.5l{obj}\), \(l_{obj}\)-linear structure length. The several structures detected as one object were color-marked (color figure online)

Secondly, an attempt was made to determine the directional activity, based on angular resolution and selectivity of tensor wavelets, contourlets, curvelets and complex wavelets relative to sparse representation of objects in the spicule phantom model. Sparse representation is a consequence of the coefficient reduction with the use of approximation techniques. The reconstruction images based on the significant decomposition coefficients are applied to determine the ability of the selected multiscale transforms to represent multidirectional structures. The minimum number of decomposition coefficients, sufficient to perceptually and computationally distinguish all linear structures in the phantom along at least half their length, depends on three factors: the used transform domain, the density of radiating linear structures, and the surrounding background (tissue density).

Fig. 4
figure 4

The results of sparse nonlinear approximation in the tensor wavelet, contourlet, curvelet and complex wavelet domains for the dense phantom of high background density \(C_{\text{glob}}=4.5\), \({\text{PSNR}}=1.46\) (Fig. 1c-right). The reconstructed images based on \(n=300\) and 600 coefficients of the highest amplitude values

Sparse approximation of coefficients tends to be a useful way to enhance linear structures, primarily in two of the four analyzed domains, i.e. the curvelet and the complex wavelet domain (Fig. 4). In both the tensor wavelet and the contourlet domains, the process of separation and differentiation of linear structures of various orientations becomes increasingly difficult when these objects are progressively compressed, whereas the detection of phantom objects tends to be more effective in the curvelet and complex wavelet domains.

The results of the initial experiments with the use of spicule phantom model with the added noise (Fig. 4) can be considered as another interesting aspect of multidirectional lines recognition. In this case, it is complex wavelets rather than curvelets that have a higher potential to detect multidirectional structures. Using curvelets, single linear structures may be detected as several smaller structures due to the lack of continuity across the analyzed linear structures. Hence, the number of detected objects in the reconstructed images may be distorted.

Fig. 5
figure 5

The results of sparse nonlinear approximation in the tensor wavelet, contourlet, curvelet and complex wavelet domains for mammograms [17] with architectural distortions (a, b) and normal breast tissue (c). The reconstructed images based on \(n=300\) and 600 coefficients of the highest amplitude values

Moreover, the higher was the observed background density level, the higher was the number of significant coefficients necessary for a complete signal description. Furthermore, the effective sparse approximation of transform coefficients depends on the type of image transformation rather than the density of piecewise linear structures in the curvelet and complex wavelet domains. Based on the experiment in the complex wavelet domain, it appears that the significant coefficients were mainly concentrated in the regions with the objects of interests, and subsequently around diagnostically irrelevant information of image content. Informative energy accumulation of image content was differentiated more fuzzily in other domains in view of the absolute value of coefficients. Hence, it was more possible to determine the threshold between the informative and uninformative image content in the complex wavelet domain than in other domains (Fig. 4—complex wavelets).

This observation was confirmed in the conducted tests with real mammograms, especially for a small number of approximation coefficients (Fig. 5—complex wavelets). Sparse representation of spicules in the complex wavelet domain tends to result in a dominant angularity distribution of linear spicules. The informative image content becomes less pronounced with an increased number of approximation coefficients due to a more accurate restoration of both noise and uninformative structures, together with irregular details of spicules, making them similar to other structures. Spicule discrimination appears less distinctive in case of the other domains (Fig. 5—tensor wavelets and contourlets). The superiority of complex wavelets in spicule representation, fairly noticeable in subjective assessment, was confirmed by the proposed objective assessment measure of spicule representation (Figs. 6, 7), defined as the product of correlation (CORR) and informative measure of correlation (|IMC|).

Correlation is defined as the marginal-probability over rows (\(p_{x}\)) and columns (\(p_{y}\)) of the normalized co-occurrence matrix (p(ij)) from image:

$$\begin{aligned} {\text{CORR}} = \frac{\sum \nolimits _{i}\sum \nolimits _{j}(ij)p(i,j)-\mu _{x}\mu _{y}}{\sigma _{x}\sigma _{y}} \end{aligned}$$
(4)

where \(\mu _{x}, \mu _{y}, \sigma _{x}, \sigma _{y}\)—the means and standard deviations of \(p_{x}\) and \(p_{y}\), \(i,j \in [0,I_{\text {max}}]\), \(I_{\text {max}}\)—image intensity [18].

Informative measure of correlation is defined as follows:

$$\begin{aligned} {\text{IMC}} = \frac{\max {(HX,HY)}}{HXY - HXY1} \end{aligned}$$
(5)

where

$$ HXY= -\sum \limits _{i}\sum \limits _{j}p(i,j)\log (p(i,j)) $$
(6)
$$ HXY1= -\sum \limits _{i}\sum \limits _{j}p(i,j)\log {p_{x}(i),p_{y}(j)} $$
(7)

and HXHY—entropies of \(p_{x}\) and \(p_{y}\) [18].

Fig. 6
figure 6

The proposed objective assessment measures of spicule representation for the dense phantom of high density \(C_{\text{glob}}=4.5\), \({\text{PSNR}}=1.46\) (Fig. 1c-right). The dependence of CORR, IMC and \({\text{CORR}}*{\text{IMC}}\) on the number of approximation coefficients

Fig. 7
figure 7

The proposed objective assessment measure of spicule representation for mammographic ROIs (Fig. 5) with architectural distortions (a, b) and normal breast tissue (c)

The \({\text{CORR}}*{\text{IMC}}\) measure concentrates on the calculation of correlation of restored neighbor pixels with averaged contexts. Higher correlations of restored data (due to the nonlinear approximation with significantly reduced numbers of the applied transform coefficients) mean above all

  • denoising and blurring of details,

  • sparse representation for smooth structures which are informative image content,

  • smoothness versus coarseness of the reconstructed structures,

  • continuous representation of the holistic structure core in the compact form of the approximated structures without splitting or irregularity.

In addition, lower entropy of the joint probability of the normalized co-occurrence matrix relative to the entropy, which is calculated for marginal probabilities for rows and columns, means an increase in compacted data dependence in a full spatial context of multidirectional analysis in the averaged 2D context.

Both subjective (Figs. 4, 5) and objective (Figs. 6, 7) assessments confirm practical usefulness of complex wavelets in the extraction of different oriented spicules in noisy real mammograms. To sum up, it can be assumed that the complex wavelet transform is quite an effective tool for analyzing and extracting the subtle piecewise linear structures in images due to shift invariant, appropriate angular resolution and selection and, consequently, extraction of diagnostically relevant image content.

4 Conclusions

This paper attempts to show that sparse representation of texture directionality in the transform domain is a desirable tool to extract directionality of piecewise linear structures (here, to extract mammographic spicules). However, precise determination of angular resolution and angular selectivity of four selected multiscale transformations is an interesting, yet unstudied issue in mammogram preprocessing due to various transform-domain redundancies. Nonetheless, the concept of sparse approximation of texture directionality in the complex wavelet domain may be a useful tool at the stage of image preprocessing or directional structure extraction.