Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Introduction

Breast cancer is a high-prevalence disease affecting 1/8 women in the USA. Current routine screening consists of X-ray mammography, which, however, shows low sensitivity to malign tumors in dense breasts, for which a large number of false positives leads to an unnecessary number of breast biopsies. Also, the use of ionizing radiation advises against a frequent utilization, for instance, to monitor the progress of a tumor. Finally, the compression of the breast down to a few centimeter may cause patient discomfort. For these reasons, latest recommendations restrict the general use of X-ray mammography to biennial examinations in women over 50 year old [13].

Ultrasound (US) is a safe, pain-free, and widely available medical imaging modality, which can complement routine mammographies. Conventional screening breast US (B-mode), which measures reflectivity and scattering from tissue structures, showed significantly higher sensitivity combined with mammography (97 %) than the latter alone (74 %) [8]. However, B-mode US shows poor specificity. A novel US modality, Ultrasound Computed-tomography (USCT), aims at mapping other tissue parameters, such as the speed-of-sound (SoS), which shows a high potential for tumor differentiation (e.g., fibroadenoma, carcinoma, cysts) [2]. However, this method requires dedicated and complex systems consisting of a large number of transducer elements located around the breast in order to measure US wave propagation paths along multiple trajectories, from which the SoS-USCT image is reconstructed [3, 5, 10, 11]. Low-cost extensions of conventional B-mode systems that only require a single multi-element array transducer are desirable for SoS-USCT for the daily clinical routine. There have been some early attempts to combine B-mode systems with X-ray mammography, using the back compression plate as a timing reference. Yet, the reconstruction suffers from strong limited-angle artifacts, which provide unsatisfactory image quality, unless detailed prior information of the screened inclusion geometry is available [6, 9].

Fig. 1.
figure 1

SoS imaging setup.

In this work we propose a novel SoS-USCT method, hand-held sound-speed imaging, which overcomes the above listed limitations. By transmitting US waves through tissue between a B-mode transducer and a hand-held reflector, a SoS-USCT image of sufficient quality for tumor screening is obtained (Fig. 1). A specific reflector design combined with dedicated image processing provides unambiguous measurement of US time-of-flight (ToF) between different transmitter/receiver elements of a transducer, from which local tissue SoS is derived as an image. Total-variation regularization overcomes the previously reported limited-angle artifacts and enables prior-less SoS imaging and precise delineation of piece-wise homogeneous inclusions. The proposed method only requires a small and localized breast compression, while allowing for flexible access to arbitrary imaging planes within the breast.

2 Methods

A 128-element 5 MHz linear ultrasound array (L14/5-38) was operated in multistatic mode (a), each element sequentially firing (Tx) and the rest receiving (Rx). For this purpose, a custom acquisition sequence was implemented on a research ultrasound machine (SonixTouch, Ultrasonix, Richmond, Canada). In a first implementation, a conventional ultrasound beamformer is adapted to the application by beamforming only a single element pair in Tx and Rx at a time, which requires the acquisition of 128 \(\times \) 128 RF lines for 40 mm depth in about 8 s. To keep the measurement scene stable during acquisition, a positioning frame was introduced to keep the orientation of transducer and reflector fixed with respect to each other (Fig. 5b). For each line, the raw (unmodulated) ultrasound data (RF lines) are recorded. Computations are then performed in Matlab®.

2.1 Reflector Delineation

The reflector consists of a thin Plexiglas stripe (50 mm \(\times \) 7 mm \(\times \) 5 mm), which limits the reflected echoes to the desired imaging plane, and allows for flexible access to different breast locations. The material choice ensures a coherent wave reflection along the tested angular range. The flat reflector geometry is simple for manufacture, and easy to identify in US data. Secondary echoes corresponding to wave reflections between reflector boundaries are well-separated from the main echo and filtered out (Fig. 2a, b).

In a real tissue scenario, a modulated ultrasound waveform with an oscillatory pressure pattern is recorded. The recorded signal shows multiple local maxima, with varying amplitudes depending on the wave path. Simply picking the peak response in each RF line yields incorrect ToF values, since different peaks may be selected for different transmit-receive (Tx-Rx) pairs. An adaptive amplitude-tracking measurement, which uses the current timing measurement as prior information for the adjacent Tx-Rx pairs, was shown for non-destructive testing of heterogeneous materials [12]. However, it requires manual initialization, which is not affordable for in-vivo scenarios and fails when, due to wave interference and scattering effects, the reflected wave-front falls below the system noise level (fading), as frequently observed in real tissue samples (Fig. 2c, d).

Fig. 2.
figure 2

Reflector identification for ex-vivo liver test (Fig. 5c). a) Setup details; b) RF lines acquired with overlapped DP delineation for the case of same Tx and Rx; c) the measured ToF matrix \(t_{i,o}\) ; and d) the relative path delays \({\varDelta }t_{i,o}\) after compensating for geometric effects. The proposed DP method outperforms independent RF line analysis and adaptive amplitude-tracking [12].

In this work a global optimization is introduced, which simultaneously considers the full Tx-Rx dataset. Based on Dynamic Programming (DP), which has been applied in US for the segmentation of bones [4] and vessel walls [1], an algorithm for detecting oscillatory patterns in RF lines is proposed. It consists of a global cost matrix \(C(l,t_l)\), which is cumulatively built along successive RF lines l (adjacent Tx-Rx pairs) for a list of N timing candidates \(t_l = {t_l^0, t_l^1 \ldots t_l^N}\), i.e., a list of possible time samples in the current RF line l, among which the optimum reflector timing can be found. Also, a memory matrix \(M(l,t_l)\) records discrete timing decisions for each line and candidate. The optimum reflector timing is then found, which minimizes the cumulative cost, and following \(M(l,t_l)\) backwards the optimum reflector delineation T(l) is drawn:

$$\begin{aligned} \begin{pmatrix} C(l, t_l) \\ M(l,t_l) \end{pmatrix}= & {} \begin{pmatrix} \min _{t_{l-1}} \left\{ C(l-1, t_{l-1}) + f_1(t_l, t_{l-1}) \right\} + f_0(t_l) \\ \mathop {\hbox {argmin}}_{t_{l-1}} \left\{ C(l-1, t_{l-1}) + f_1(t_l, t_{l-1}) \right\} \end{pmatrix} \\ T(l)= & {} \left\{ \mathop {\hbox {argmin}}\limits _{t_l} C(l,t_l), \ \ l = L; \quad M(l+1, T(l+ 1)), \quad l = 1 \ldots L-1 ;\right. \nonumber \end{aligned}$$
(1)

with \(f_0\) and \(f_1\) non-linear functions that incorporate ToF for current \(t_1\) and neighbouring \(t_{l-1}\) RF lines. The general formulation of Eq. 1 introduces regularization into the reflector timing problem, enabling the natural incorporation of available prior information (oscillatory pattern, smoothness, multiple echoes, path geometry) into the optimization. Moreover, the delineation does not require manual initialization and is parallelizable linewise. The currently not optimized Matlab code runs on a single-core of an Intel Core i7-4770K CPU in <100 s, but several future speed improvements are envisioned.

2.2 Total-Variation Sound-Speed Image Reconstruction

Once the time delay matrix for all transmit-receiver elements \(t_{i,o}\) has been obtained (Fig. 2c), a SoS image is reconstructed. First, the baseline geometrical delays \(\overline{t}_{i,o}\) due to different path lengths between different transmit-receiver elements are subtracted from \(t_{i,o}\) to isolate the relative delays induced by SoS inhomogeneities \({\varDelta }t_{i,o}\) (Fig. 2d):

$$\begin{aligned} {\varDelta }t_{i,o} = t_{i,o} - \overline{t}_{i,o} \qquad \overline{t}_{i,o} = (\overline{c})^{-1} \sqrt{4d^2 + p(i_o - i_i)^2} \quad \forall i,o \end{aligned}$$
(2)

where \(\overline{c}\) is the average tissue speed of sound (with a nominal value of 1540 ms \(^{-1}\)), d is the distance between transducer and reflector, p is array pitch (0.3 mm for our probe), and \(i_i\), \(i_o\) are the indices of the Tx i and Rx o elements considered (1..128). Note that d and \(\overline{c}\) are estimated with linear regression based on Eq. 2. In practice, a non-linear fit is performed to estimate both the reflector inclination and in-plane orientation.

Fig. 3.
figure 3

Formulation of the sound-speed reconstruction problem. (a) Ray tracing discretization. (b) Smoothness regularization with L2 and L1 norms. While the L2 norm favors the smooth sound-speed profile, the L1 norm (TV) equally weights smooth and sharp gradients.

The next step is the reconstruction of SoS distribution, which is expressed in slowness units \(\sigma \) [s/m], with \(c(x,y) = \overline{c}\left( 1 + \sigma (x,y)\right) ^{-1}\). The tissue region is discretized into cells c traversed by a finite set of ray paths p corresponding to different Tx-Rx pairs (Fig. 3a). With the known differential path lengths \(l_{p,c}\), the path delays \({\varDelta }t_p\) are calculated in function of the slowness increments \(\sigma _c\), i.e.,\({\varDelta }t_{p} = \sum _{c=1}^{C} l_{p,c} \sigma _c\), in matrix form \(\mathbf {{\varDelta }t}\) \(=\) \(\mathbf {L} \varvec{\sigma }\). Since reconstruction can be ill-posed, regularization becomes necessary. A conventional solution in X-ray Computed Tomography (CT) [7], is Filtered Backprojection (FBP), which averages the delays of all rays p propagating through cell c. Previous reflector-based US works [9] have used Algebraic Reconstruction (ART), in which \(\mathbf {{\varDelta }t}\) \(=\) \(\mathbf {L} \varvec{\sigma }\) is approximated via singular value decomposition, preserving only the largest singular values of \(\mathbf {L}\) (typically 5 % of the total). Both FBP and ART provide a stable SoS-image reconstruction, which however suffers from strong streak artifacts and a coarse resolution in the vertical direction (Fig. 4c). The reason is that, similarly as in limited-angle CBCT, reflector-based SoS-USCT is an ill-posed problem [7], every cell being traversed by only a limited set of path orientations; i.e., paths parallel to the reflector are missing. This is a main geometric limitation with respect to dedicated USCT systems, which incorporate complete angular path sets [3, 5, 10].

Fig. 4.
figure 4

Simulation of sound-speed image reconstruction with (top) single and (bottom) multiple inclusions: (a) in-silico phantom, (b-c) reconstruction with prior-art, and (d) our TV approach.

To overcome the limited-angle artifacts, we introduce additional regularizing assumptions for the smoothness of the SoS-image:

$$\begin{aligned} \hat{\varvec{\sigma }}_{TV} = \mathop {\hbox {argmin}}\limits _{\varvec{\sigma }} \left\{ \Vert \mathbf {{\varDelta }t} - \mathbf {L} \varvec{\sigma } \Vert _2 + \lambda \Vert \mathbf {D} \varvec{\sigma } \Vert _n \right\} \Vert \end{aligned}$$
(3)

where \(\Vert \mathbf {D} \varvec{\sigma } \Vert _n = \sum _{i,j} | \sigma _{i+1,j} - \sigma _{i,j} |^n + | \sigma _{i,j + 1} - \sigma _{i,j} |^n\) minimizes the sum of horizontal i and vertical j gradients of the reconstructed image, and \(\lambda \) is a constant. The norm of the smoothness term n critically influences the reconstruction results (Fig. 3b). For L2-norm, i.e., \(\Vert \varvec{x}\Vert _2 = \sum \left| x\right| ^2\), a closed linear solution (Tikhonov regularization) of Eq. 3 is found, but smooth gradients are favored with respect to sharp gradients. As a result, the reconstruction does not significantly improve with respect to ART. However, if the L1-norm \(n=1\) is used \(\Vert \varvec{x}\Vert _1 = \sum \left| x\right| \) (Total Variation (TV) Regularization), sharp and smooth gradients are equally weighted, which leads to the reconstruction of a minimum number of piecewise homogeneous inclusions. This concept has been previously applied to regularize sparse array apertures in full-angle 3D USCT [11]. We apply this here for the first time to the limited-angle ultrasound reflection tomography case. With \(n=1\), Eq. 3 becomes a convex problem, which is iteratively solved with off-the-shelf optimization packages.

The resulting SoS images (Fig. 4c) successfully filter out limited-angle artifacts and delineate closed inclusion geometries. However, they still show reduced axial resolution, due to the extremely reduced path orientation set (according to Fig. 3a, for a SoS image aspect ratio of 1:1, the largest available ray angle is 25\(^{\circ }\)). In order to compensate for this resolution loss we introduce Anisotropically-Weighted Total Variation (AWTV), which balances horizontal and vertical gradients with a constant \(\kappa \) according to the available ray information in each direction:

$$\begin{aligned} \hat{\varvec{\sigma }}_\mathrm {AWTV}\!=\!\mathop {\hbox {argmin}}\limits _{\varvec{\sigma }} \left\{ \Vert \mathbf {{\varDelta }t} - \mathbf {L} \varvec{\sigma } \Vert _2 + \lambda \sum _{i,j} \kappa | \sigma _{i+1,j} - \sigma _{i,j} | + (1 - \kappa ) | \sigma _{i,j + 1} - \sigma _{i,j} | \right\} \end{aligned}$$
(4)

With a reconstructed pixel size equal to the array pitch (p =0.3 mm), an optimum reconstruction performance was achieved with \(\lambda = 0.0008\) and \(\kappa = 0.9\) .

2.3 Tissue Phantoms, Ex-vivo and In-vivo Tests

A tissue-mimicking phantom was manufactured from gelatin (9 g/100 mL water), mixed with flour to simulate typical ultrasound reflectivity patterns (speckle). Hard inclusions simulating tumors (Fig. 5a) were introduced by using a higher amount of gelatin (13 g/100 mL water) in well-defined phantom regions. In order to make these inclusions invisible to conventional B-mode US, the same amount of scatterering was used in tissue background and hard inclusions, so that both exhibited the same echogenicity. To test the applicability of the method to breast mammography, a breast elastography phantom (Model 059, CIRS Inc., Norfolk, VA, USA) was tested. The phantom is fabricated with a tissue-mimicking material (Zerdine\(^{TM}\)) and shows a realistic breast geometry, incorporating both skin layers and glandular tissue, together with cystic (water) and dense lesions (with embedded microcalcifications) (Fig. 5b). Ex-vivo tests were performed in bovine liver samples. Hard inclusions were simulated by ablating small pieces of liver (submerged in 250 mL water for 6 min at 700 W microwave). These were afterwards inserted in the liver (Fig. 5c). Finally, a preliminary in-vivo test was carried out with a healthy volunteer with benign cysts. While the subject sat in tripod position (Fig. 5d), the sonographer placed the US probe on the region-of-interest. Then, the subject held the positioning frame closed with both hands, while the reflector USCT data was acquired. For all tests, B-mode US images were also generated from the multi-static datasets for comparison with the SoS images.

Fig. 5.
figure 5

Hand-held sound-speed mammography of gelatin phantom (a), breast-mimicking phantom (b), ex-vivo liver sample with hard inclusion (c), and in-vivo data for a benign cystic mass (d).

3 Results and Discussion

The proposed DP method clearly outperforms independent RF line analysis and adaptive amplitude-tracking, enabling the acquisition of a continuous ToF matrix for real tissues (Fig. 2c), in which small timing variations (<100 ns) caused by SoS inhomogeneities are successfully observed (Fig. 2d). Signal fading (Fig. 2b) was typically observed around the inclusion boundaries, where strong wave refraction occurs due to quasi-parallel incident ray paths. DP automatically filters out fading positions from the reconstruction. Calibration experiments in gradually-heated water provided quantitative SoS values with a sensitivity < 0.005 ms\(^{-1}\). The observed timing error of std = 15 ns results in a noise floor of 0.8 \(\upmu \)s m\(^{-1}\), corresponding to a <0.1 % sound-speed contrast.

The proposed AWTV SoS reconstruction achieves significant improvements in the delineation of inclusion geometry (Fig. 4). Often-problematic vertical elongation of inclusions is strongly reduced (14 %) compared to ART (>300 %) and TV (95 %), which enables a quantitative reconstruction of original SoS values (SoS error <0.3 %). Streak artifacts, which are typical in ART (CNR = 15 dB), are not visible in AWTV (CNR = 37 dB). Moreover, our novel approach successfully reconstructs multiple inclusions with different SoS values and geometries (Fig. 4). Not only are the inclusion positions correctly identified, but also are their SoS values and diameter satisfactorily estimated.

An excellent performance is observed in both phantom and ex-vivo tests. For the gelatin phantom, the hard inclusions were manufactured with a small SoS contrast (−3.5 \(\upmu \)s m\(^{-1}\), 0.5 % SoS increase), but nonetheless were successfully resolved (Fig. 5a). In the more heterogeneous breast phantom (Fig. 5b) both hard inclusions (−17 \(\upmu \)s m\(^{-1}\), 2.6 % SoS increase) and cysts (−6 \(\upmu \)s m\(^{-1}\), 0.9 % SoS increase) show a higher contrast and are well-separated from the background noise, which is around 0.6 %. These values are more representative of real breast tumors, as reported by [2]. The background noise is related to reconstruction artifacts (e.g., the gradient information is missing at image boundaries), and to a minor extent, to refraction effects not accounted for in the ray tracing model. The hard inclusion in the ex-vivo liver samples was invisible in the B-mode, but clearly delineated in the SoS image, with contrast similar to the breast phantom; see Fig. 5d. Despite movement artifacts, lower US signal-to-noise ratio (<10 dB), and imperfect coupling between reflector and breast tissue, the preliminary in-vivo test demonstrates a successful identification of cystic inclusion, with an expected lower SoS contrast (−8 \(\upmu \)s m\(^{-1}\)) than the ex-vivo hard inclusions.

4 Conclusions and Outlook

A novel hand-held sound-speed imaging modality has been proposed with minimum hardware modifications to conventional B-mode ultrasound systems. An accurate geometric delineation of hypoechogenic inclusions was achieved with a high SoS-contrast for hard inclusions in both breast elastography phantoms and ex-vivo liver samples. SoS values are known as potential quantitative imaging biomarkers for breast mass differentiation [2]. In our preliminary in-vivo test, even cystic inclusion, which are known to be of low SoS contrast, were successfully identified, indicating the future potential for detecting higher contrast cancerous tumors. The proposed method is radiation-free, painless, and can potentially complement routine screening for breast cancer. Prospective applications can be for other organs that allow reflector placement such as the testicles, limbs, skin, the prostate, and with catheters; or during open-surgery, e.g., for liver.