1 Introduction

Liver tumor ultrasound scanning is recently becoming increasingly recommended as a first diagnosis option for early prediction of response to chemotherapy treatment [1]. However, visual assessment of tumor response to chemotherapy is very challenging without monitoring longitudinally the tumor development. This is, in part, due to the intertwined tumor speckle variations, leading to formation of complex texture patterns. A robust approach to tackle this texture complexity is to assess the radio-frequency (RF) echoes – instead of B-mode images – which are not subjected to log-compression and proprietary filtering algorithms. This original data preservation allows for better statistical modeling of backscattering properties.

During the course of chemotherapy treatment, changes within tumor region may occur due to progression or regression of disease. The speckle patterns variations are heterogeneous as tumor angiogenesis can affect the complexity of the tissue spatial relationship. This heterogeneity in tumor tissue scatterers could span different scatterer densities. Regions within the tumor tissue that respond to treatment exhibit different statistical properties to that of the non-respondent counterpart [2-4,6]. Thus it is essential to improve the discriminative abilities of the employed statistical distribution model for better characterization of tumor heterogeneity.

A number of contributions employed the backscattered statistics from RF ultrasound signal for evaluating the early death of tumor cells in response to chemotherapy treatment; such as using the scatterer spacing and diameter, and acoustic concentration in combination with texture features from the gray-level co-occurrence matrix [3]. Others relied on calculating the power spectrum from the Fourier transform of raw RF data, and subsequently deriving the spectral slope, 0-MHz intercept, and mid-band fit quantitative parameters [2]. In another similar work, the maximum mean discrepancy features were extracted on histograms of quantitative ultrasound spectroscopic parametric maps [4]. However, analyzing tissue backscattering properties from a single resolution perspective is limiting, as substantial information that could assist in tumor tissue characterization can be hidden at different locations, orientations and scales of resolution. Refined statistical properties can be obtained from the RF envelope-detected signal performed in 2D [5], where a fractal-based representation that underpins a tumor model can be achieved [6].

For an improved ultrasound tissue characterization of tumor texture, the use of the fractal dimension (FD) as in [6] might not capture well all relevant tissue changes. Namely, the FD corresponds to scatterers spatial distribution and not scatterer number density, thus it is possible to get similar FD values for textures which might not look alike. As a result, this could overlook some of the important aspects of the statistics of the envelope detected from the RF signal. Tissue heterogeneity property previously proved to be useful in assessing tumor aggressiveness [7]. Therefore, the variation in scatterer number density within the tumor focal region – which corresponds to spatial heterogeneity – would be an interesting property to consider as well.

Here we propose a novel ultrasound texture analysis approach for tissue characterization. The assumption is made that quantifying tumor spatial heterogeneity could assist in revealing subtle cues (i.e., small changes in tissue texture) for tumors that responded to chemotherapy administration. A Nakagami statistical distribution is fitted locally to the envelope RF signal for estimating the backscattering parameters. Subsequently, circular harmonic wavelets frames are used to decompose the backscattered shape statistics into different spatial scales and local circular frequencies. Finally, a heterogeneity feature descriptor is successively constructed by mapping the circular harmonic wavelets frames on the fractal dimension space, and then quantitatively estimating tissue sparsity from the constructed fractal texture maps.

2 Materials and Methods

Although the backscattered signal from tumor tissue tends to show a stochastic pattern, the local concentration and spatial arrangement of progressive tumor tissue scatterers may follow a distribution different from regressive ones. To maximize the difference between the two conditions, the fractal signatures are derived from multi-scale circular frequency analysis of the acoustic properties of the envelope RF signal for assessing tissue heterogeneity.

2.1 Ultrasound RF Data Analysis

The amplitudes of the individual backscattered signals are assumed to be randomly distributed due to the random backscatter coefficient of each individual scatterer. The interference signals from the large number of randomly distributed scatterers give the echo signal its statistical nature. Many statistical models exist for the purpose of characterizing randomness in soft tissue; however, very few can estimate the model parameters with analytical simplicity and computational ease. The Nakagami distribution is an example of a simple bi-parametric model which can characterize tissue in various scattering conditions and scatterer densities [8]. The Nakagami density function is defined as

$$\begin{aligned} P_{n}\left( x|m,\varOmega \right) = \frac{2m^m}{\varGamma \left( m\right) \varOmega ^m}x^{2m-1}e^{-mx^{2}/\varOmega }, \end{aligned}$$
(1)

for \(x \ge 0\), where \(\varGamma \) is the Euler gamma function. The real numbers \(m > 0\) (related to the local concentration of scatterers) and \(\varOmega > 0\) (related to the local backscattered energy) are called the shape and scaling parameters, respectively. Similarly to the Rayleigh distribution, the envelope of the RF signal \(x^2\) follows a gamma distribution. By fine-tuning the shape of the distribution parameter m, other statistical distributions can be modeled, such as, an approximation of the Rician distribution (i.e., post-Rayleigh) for \(m > 1\), a Rayleigh distribution for the special case when \(m = 1\), and when \(m < 1\) a K-distribution (i.e., pre-Rayleigh). The envelope-detected RF signal based on the Nakagami m parameter was used subsequently for investigating tissue heterogeneity.

2.2 Circular Harmonic Wavelets

A natural way of assessing the echo signal f(xy) is to analyse its statistical properties at different spatial scales. An efficient way to systematically decompose f(xy) into successive dyadic scales is to use a redundant isotropic wavelet transform. The recursive projection of f on Simoncelli’s isotropic wavelet provides such a frame representation [9]. The radial profile of the wavelet function is defined in polar coordinates in the Fourier domain as

$$\begin{aligned} \hat{h}(\rho ) = \left\{ \begin{array}{ll} \cos \left( \frac{\pi }{2}\log _2\left( \frac{2\rho }{\pi }\right) \right) , &{} \frac{\pi }{4} < \rho \le \pi \\ 0, &{} \text {otherwise}. \end{array} \right. \end{aligned}$$
(2)

The scaling function is not used to ensure illumination invariance. The local structural properties of f(xy) can be well described in terms of the local circular frequencies, which was at the origin of the success of methods such as local binary patterns (LBP) [10]. In this work, local circular harmonics are computed on top of the wavelet frames to characterize circular frequencies at multiple scales [11], which is an extension of steerable Riesz wavelets [12]. Circular harmonic wavelets (CHW) of order n are constructed in the Fourier domain as

$$\begin{aligned} \hat{\phi }^{(n)}(\rho ,\varphi )=\hat{h}(\rho )\text {e}^{\text {j} n\varphi }. \end{aligned}$$
(3)

The representation obtained from the collection of the complex magnitudes of the scalar products \(|\langle f,\phi ^{(n)}\rangle |\) characterizes the local circular frequencies in f(xy) up to an order \(n=1:N\) and is rotation invariant [13].

2.3 Heterogeneity Quantification

After the projection of f on CHW frames, the self-similarity of each voxel from its surrounding neighborhood is determined via estimating its localized FD. This would serve as an estimated array of localized FD values (i.e. fractal texture map) for a multi-dimensional representation of tissue heterogeneity.

Fractal Texture Map Estimation. There are several methods to estimate the FD, however multiplicative speckle scale changes can effect the stability of parameter estimation. The fractal Brownian motion (fBm), which is known for its capability for describing random phenomena [14], can work well with ultrasound tissue characterization [6]. Both scale- and rotation-invariance properties of the non-stationary fBm model makes it a perfect candidate to be integrated with the Nakagami modeling and CHW decomposition. The fBm can be characterized by

$$\begin{aligned} E\left( \varDelta v \right) = K\varDelta r^H, \end{aligned}$$
(4)

where \(E\left( \varDelta v \right) = \left| q - p\right| \) is the mean absolute difference of voxel pairs \(\varDelta v\); \(\varDelta r = \sqrt{\sum ^{s}_{i=1}\left( q - p\right) ^2}\) (s = 3 for fBm texture surface enveloped in 3-D space) is the voxel pair distances; H is the Hurst exponent; and K > 0 is a constant.

Given a volume set V of constructed envelope-detected RF tumor regions \(f^{\mu }_{i}(x,y)\), where \(\mu \) stands for the Nakagami shape parameter and i is a certain slice in the acquired volume, tissue fractal characteristics from the backscattered envelope are investigated. A fractal texture map \(\mathcal {F}\), having a size \(m\times n\) and for k dimensions, can be defined as in (5) based on the CHW frames for all corresponding voxels \(v_{xy}^{k}\) of \(f^{\mu }_{i}(x,y)\), \(f \in V\). The k value empirically specifies the maximum convolution kernel size I used in estimating \(\varDelta v\) of (4). The slope of linear regression line of the log-log plot of (4) gives H from which the localized fractal dimension is estimated (\(FD = 3-H\)). This procedure is iterated covering all \(v_{xy}^{k}\) which yields a set of multi-dimensional fractal texture maps \(\mathcal {M}_f\) to be constructed for each V, where \(\mathcal {M}_f\) = \(\left\{ \mathcal {F}_1,\mathcal {F}_2,\dots ,\mathcal {F}_z\right\} \), and z is the total number of \(\mathcal {F}\) in \(\mathcal {M}_f\).

$$\begin{aligned} \mathcal {F}^{(N,J)}\left\{ f\right\} (x,y) = \left( \begin{array} {llllll} v^{k}_{11} &{} v^{k}_{12} &{} \cdots &{} v^{k}_{1y} &{} \cdots &{} v^{k}_{1n} \\ v^{k}_{21} &{} v^{k}_{22} &{} \cdots &{} v^{k}_{2y} &{} \cdots &{} v^{k}_{2n} \\ \vdots &{} \vdots &{} \ddots &{} \vdots &{} &{} \vdots \\ v^{k}_{x1} &{} v^{k}_{x2} &{} \cdots &{} v^{k}_{xy} &{} \cdots &{} v^{k}_{xn} \\ \vdots &{} \vdots &{} &{}\vdots &{} \ddots &{} \vdots \\ v^{k}_{m1} &{} v^{k}_{m2} &{} \cdots &{} v^{k}_{my} &{} \cdots &{} v^{k}_{mn} \\ \end{array}\right) \end{aligned}$$
(5)

The integration of the fBm model at different CHW orders N and scales J can contribute towards a better separability between the mixtures of speckle patterns within tissue. Such that \(\varDelta v\) and \(\varDelta r\) locally estimate the FD of each \(v_{xy}^{k}\) up to the resolution limits of \(f^{\mu }_{i}\) specified by k that best characterizes the speckle patterns for different scales and circular frequencies. This approach enables for further probing the resolution of CHW frames, and hence facilitates for assessing the speckle pattern heterogeneity.

Lacunarity Analysis.To further differentiate between textures having similar FD values, the lacunarity (\(\mathcal {L}\)) – a coefficient of variation that can measure the sparsity of the fractal texture – can assist in quantifying aspects of patterns that exhibit scale-dependent changes in structure [15]. Namely, it measures the heterogeneity of the fractal pattern, providing meta-information about the dimension of \(\mathcal {F}\). The lower the \(\mathcal {L}\) value, the more heterogeneous the examined tumor region \(f^{\mu }_{i}\) represented by \(\mathcal {F}\), and vice versa. \(\mathcal {L}\) can be defined in terms of the relative variance of the size distribution in \(\mathcal {F}\) as

$$\begin{aligned} \mathcal {L} = \frac{\frac{1}{mn}\sum _{x}\sum _{y}\mathcal {F}^{2}-\left( \frac{1}{mn}\sum _{x}\sum _{y}\mathcal {F}\right) ^{2}}{\left( \frac{1}{mn}\sum _{x}\sum _{y}\mathcal {F}\right) ^{2}} = \frac{E[\mathcal {F}^{2}]-E\left[ \mathcal {F}\right] ^{2}}{E\left[ \mathcal {F}\right] ^{2}} = \frac{Var\left[ \mathcal {F}\right] }{E\left[ \mathcal {F}\right] ^{2}}. \end{aligned}$$
(6)

3 Results and Discussion

3.1 Clinical Tumor Cross-Sectional Dataset

The approach has been validated on RF ultrasound data acquired using a diagnostic ultrasound system (Zonare Medical Systems, Mountain View, CA, USA) 4 MHz curvilinear transducer and 11 MHz sampling. The output 2-D image size was \(65 \times 160\) mm with a resolution of \(225 \times 968\) pixels. A total of 287 cross sectional images of 33 volumetric liver tumors manually segmented were obtained from the stacks of 2-D images, 117 were responsive and 170 did not responded to chemotherapy treatment. Response to treatment was determined based on conventional computed tomography follow up imaging as part of the patient standard clinical care based on the response evaluation criteria in solid tumors (RECIST) [16]. The baseline cross-sectional imaging was compared against those performed at the end of treatment according to the RECIST criteria to determine response to treatment for each target tumor. A tumor was classified as responsive if categorized as partial response and non-responsive if no change or disease demonstrated progression.

3.2 Statistical Analysis

To quantitatively assess the robustness of our approach, \(2\left( N+1\right) \times {J}\) features, where 2 stands for both the average \(\mathcal {F}\) and \(\mathcal {L}\) estimated at each N and J per slice of each of the acquired volumes, were fed into a support vector machine classifier to compare the overall modeling performance of classifying responsive versus non-responsive cases. Cross-validation was performed based on a leave-one-tumor-out (loo) approach, and further validated on independent test-set of 107 cross sectional images (69 responsive versus 38 non-responsive images). The convolution kernel size I used in estimating the localized FD of \(\mathcal {F}\) was initially optimized while having N and J fixed, see Fig. 1. Then the classification performance for different N and J values of the CHW representation was investigated in order to quantify \(\mathcal {L}\) extracted from \(\mathcal {F}\). Hence, when the optimized values of N, J and I are employed, 97.91 % (97.2 % for unseen data) best classification accuracy is achieved as compared to 92.1 % in the work of [6], and similarly applies to the 5- and 10-folds cross validation results (indicated in terms of mean ± standard deviation of the performance over 60 runs). Figure 2 shows the \(\mathcal {F}\) and corresponding \(\mathcal {L}\) for a non-responsive vs responsive case. A less heterogeneous texture (i.e. higher \(\mathcal {L}\) values colored in red in Fig. 2) is witnessed in the responsive case. This indicates tumor tissue texture is becoming more sparse, which could be signs of necrotic regions, and hence responding to treatment (Table 1).

Fig. 1.
figure 1

Classification accuracies for varying convolution kernel size (I) in pixels with fixed order (\(N=2\)) and scale (\(J=6\))

Table 1. Classification performance for the multidimensional heterogeneity analysis of clinical liver tumor dataset
Fig. 2.
figure 2

(1st column) Tumor B-mode images, (2nd column) fractal texture maps and (3rd column) corresponding tissue heterogeneity representation for a (1st row) non-responsive vs (2nd row) responsive case, respectively. Red regions in (c) and (f) indicate response to treatment according to RECIST criteria [16]. CHW decomposition was based on a 2nd order and up to the 8-th scale.

Characterizing the speckle patterns in terms of multi-scale circular harmonics representation could assist in better characterization of the backscattered signal, which adapts according to the varying nature of tissue structure. As changes in the scatterers’ spatial distribution and number density reflect in the ultrasound backscattering, the sensitivity of response to treatment of the envelope RF signal is implicitly linked to changes in FD and associated \(\mathcal {L}\) on the CHW frames. Finally, tumors with varying depth would decrease the amplitude of the RF data, and quantifying the tumor response to chemotherapy treatment under such conditions is planned for future work.

4 Conclusion

A novel approach has been presented for quantifying liver tumor response to chemotherapy treatment with three main contributions: (a) ultrasound liver tumor texture analysis based on a Nakagami distribution model for analyzing the envelope RF data is important to retain enough information; (b) a set of CHW frames are used to define a new tumor heterogeneity descriptor that is characterized at multi-scale circular harmonics of the ultrasound RF envelope data; (c) the heterogeneity is specified by the lacunarity measure, which is viewed as the size distribution of gaps on the fractal texture of the decomposed CHW coefficients. Finally the measurement of heterogeneity for the proposed representation model is realized by means of support vector machines.