1 Introduction

Cancer is not a bounded, self-organized system. Most malignant tumors have heterogeneous growth, leading to disorderly proliferation well beyond the surgical margins. In fact, in solid tumors, depending on the malignant phenotype, the impact of the tumor is observed not just within the visible tumor, but also in the immediate peritumoral, as well as in seemingly normal-appearing adjacent field. The phenomenon of tumor involvement outside of the visible surgical margins is known as “tumor field effect” [1]. One largely unexplored aspect of tumor field effect in solid tumors, is the impact on overall survival due to the pressure exerted on the surrounding normal parenchyma caused by active proliferation and tumor burden thereof. For instance, in Glioblastoma (GBM), the herniation or gross distortion of the brainstem (remote to the tumor location) was identified as the proximal cause of death in 60% of the studies [2].

In this work, we present a new prognostic image-based descriptor: radiographic-Deformation and Textural Heterogeneity (r-DepTH). r-DepTH attempts to comprehensively capture the systemic nature of the tumor by computing radiomic measures of intra-, and extra-lesional texture and structural heterogeneity. Specifically, r-DepTH computes measurements from the entire tumor field as observed on MRI scans by, (1) capturing uneven yet subtle tissue deformations in the normal-appearing parenchyma, and (2) combining these tissue deformations with 3D gradient-based texture features [3] computed within the tumor confines. We demonstrate that this combination of deformation and textural heterogeneity via r-DepTH enables improved prediction of disease outcome compared to radiomic descriptors extracted from within the visible tumor alone.

2 Previous Work and Novel Contributions

Multiple studies [4,5,6] have explored radiomic (co-occurrence, gray-level dependence, directional gradients, and shape-based) descriptors obtained from the tumor confines on radiographic imaging (i.e. MRI, CT), to capture intra-tumoral heterogeneity. Interestingly, a recent study in GBM demonstrated that radiomic features from peri-tumoral regions were significantly more prognostic of overall patient survival than the features from within the tumor confines [7]. Similarly, the tumor field effect in GBM has been shown to be manifested several millimeters distal to the visible tumor margins [8]. These findings then beg the question if there is prognostic information that could be mined from the subtle deformations due to tumor proliferation and burden, in the seemingly normal parenchyma distal to tumor boundaries. Similarly, one could further argue that these extra-tumoral deformations (Fig. 1(c), (g)), when combined with textural patterns (Fig. 1(b), (f)) from the tumor confines, could potentially allow for a more comprehensive characterization of tumor heterogeneity, as compared to features from tumor alone. This integrated descriptor could then serve as a powerful prognostic marker to reliably predict patient survival in solid tumors (Fig. 2).

Fig. 1.
figure 1

Textural differences within the tumor for two different patients with (a) STS, and (e) LTS are shown in (b) and (f). Corresponding deformation magnitudes in the surrounding normal parenchyma are shown in (c), (g), and are highlighted for a small region outside the tumor across STS (d) and LTS (h).

Fig. 2.
figure 2

Overview of r-DepTH and overall workflow.

Uniquely, r-DepTH descriptor captures heterogeneity in solid tumors, both from intra, as well as extra-tumoral field. Firstly r-DepTH captures the textural heterogeneity from the tumor (\(\mathbb {F}_{tex}^T\)) and peritumoral regions (\(\mathbb {F}_{tex}^P\)) using the method presented in [3]. Secondly, it captures the deformation heterogeneity (\(\mathbb {F}_{def}\)) within the normal parenchyma as a function of the distance from the tumor margins. The r-DepTH descriptor is then obtained as \(\mathbb {F}_{depth} = [\mathbb {F}_{tex}^T, \mathbb {F}_{tex}^P,\mathbb {F}_{def}]\). r-DepTH is modeled around the rationale that highly aggressive solid tumors (with worse outcome) likely proliferate in a more disorderly fashion, and hence lead to more heterogeneous deformations in the surrounding normal parenchyma [9] (Fig. 1(c)) and higher textural heterogeneity within the tumor confines (Fig. 1(b)), as compared to relatively less aggressive tumors with overall improved outcomes. In this work, we will evaluate the utility of the r-DepTH descriptor in the context of distinguishing long-term (LTS) versus short-term (STS) GBM survivors using a total of 79 T1-w MRI patient scans.

3 Methodology

3.1 Notation

We define an image scene \(\mathcal {I}\) as \(\mathcal {I}= (C,f)\), where \(\mathcal {I}\) is a spatial grid C of voxels \(c \in C\), in a 3-dimensional space, \(\mathbb {R}^{3}\). Each voxel, \(c \in C\) is associated with an intensity value f(c). \(\mathcal {I}_T\), \(\mathcal {I}_P\) and \(\mathcal {I}_N\) correspond to the intra-tumoral, peri-tumoral, and surrounding normal parenchyma sub-volumes within every \(\mathcal {I}\) respectively, such that \([\mathcal {I}_{T},\mathcal {I}_{P},\mathcal {I}_{N}] \subset \mathcal {I}\). We further divide the sub-volume \(\mathcal {I}_{N}\) into uniformly sized annular sub-volumes \(\mathcal {I}_{N}^{j}\), where j is the number of uniformly-sized annular bands, such that \(j \in \{1,\dots ,k\}\), and k is an user-defined proximity parameter dependent on the distance g from the tumor margin.

3.2 Radiographic-Deformation and Textural Heterogeneity (r-DepTH) Descriptor

  1. 1.

    Extraction of deformation heterogeneity descriptors from within the normal parenchyma: Healthy T1w MNI atlas (\(\mathcal {I}_{Atlas}\)) is used to measure the tissue deformation in the normal appearing brain regions of every patient volume \(\mathcal {I}\). \(\mathcal {I}_{Atlas}\) is first non-rigidly aligned to \(\mathcal {I}\) using mutual information based similarity measure provided in ANTs (Advanced Normalization Tools) SyN (Symmetric Normalization) toolbox [10]. The tumor mask \(\hat{\mathcal {I}}_{mask}\) is removed from \(\mathcal {I}\) during registration such that only the spatial intensity differences due to structural deformation caused by mass effect are recovered, when compared to \(\mathcal {I}_{Atlas}\). Given the reference (\(\mathcal {I}\)) and floating (\(\mathcal {I}_{Atlas}\)), the non-rigid alignment can be formulated as: \((\mathcal {I},\hat{\mathcal {I}}_{mask})=T(\mathcal {I}_{Atlas})\) where, T(.) is the forward transformation of the composite (including affine components) voxel-wise deformation field that maps the displacements of the voxels between the reference and floating volumes. This transformation also propagates the atlas brain mask (\(\hat{\mathcal {I}}_{Atlas}\)) to the subject space, thereby skull-stripping the subjects. As ANTs SyN satisfies the conditions of a diffeomorphic registration, an inverse \(T^{-1}(.)\) exists, that successfully maps \(\mathcal {I}\) to the \({\mathcal {I}}_{Atlas}\) space. This inverse mapping yields the tissue deformation of \(\mathcal {I}\) with respect to \(\mathcal {I}_{Atlas}\), representing the deformations exerted on every \(c\in C_N\), due to the tumor mass effect. Considering \((c_x',c_y',c_z')\) as new voxel positions of \(\mathcal {I}\) when mapped to \(\mathcal {I}_{Atlas}\), the displacement vector is given as \([\delta x,\delta y,\delta z]\) where vector \((c_x',c_y',c_z')=(c_x,c_y,c_z)+(\delta x,\delta y,\delta z)\), and the magnitude of deformation is given by: \(D(c)=\root \of {(\delta x)^{2}+(\delta y)^{2}+(\delta z)^{2}}\), for every \(c \in C_N^j\), and \(j \in \{1,\dots ,k\}\). First order statistics (i.e. mean, median, standard deviation, skewness, and kurtosis) are then computed by aggregating D(c) for every c within every sub-volume \(\mathcal {I}_N^j\) yielding a feature descriptor \(\mathbb {F}_{def}^j\) for every annular sub-region \(C_N^j\), where \(C_N^j \subset C_N\), \(j \in \{1,\dots ,k\}\).

  2. 2.

    Extraction of 3D gradient-based descriptors from tumor, and peri-tumoral regions: We used a 3D gradient-based texture descriptor presented in [3]. This texture descriptor captures tumor heterogeneity by computing higher order statistics from the gradient orientation changes computed across X, Y, and Z directions. These features has been shown to be successful in tumor characterization for a variety of applications in brain, lung and breast cancers. Briefly, for every \(c \in [C_P,C_T]\), gradients along the X, Y and Z directions are computed as, \(\nabla f(c)= \frac{\partial f(c)}{\partial X}\hat{i} + \frac{\partial f(c)}{\partial Y}\hat{j}+ \frac{\partial f(c)}{\partial Z}\hat{k}\), where \( \frac{\partial f(c)}{\partial q}\) is the gradient magnitude along the q axis, \(q \in \{X,Y,Z\} \). A \({N} \times {N} \times {N}\) window centered around every \(c \in C\) is selected to compute the localized gradient field. We then compute \(\partial f_X(c_t)\), \(\partial f_Y(c_t)\) and \(\partial f_Z(c_t)\), for every \(c \in [C_P,C_T]\), \({t} \in {\{1,2,\dots ,{N}^3\}}\). The vector gradient matrix \({\mathcal {F}}\) associated with every c is given by \({\mathcal {F}}=[\partial f_X(c_t)\;\;\partial f_Y(c_t)\;\;\partial f_Z(c_t) ]\), where \([\partial f_X(c_t)\;\;\partial f_Y(c_t)\;\;\partial f_Z(c_t) ]\), \({t} \in {\{1,2,\dots ,{N}^3\}}\) is the matrix of gradient vectors in the X, Y and Z directions for every \(c_t\) given by a \({N}^3 \) \(\times \) 3 matrix. Singular value decomposition of \(\mathcal {F}\) for a voxel \(c_t\) yields three dominant principal components \(\psi _{X}(c_{t})\), \(\psi _{Y}(c_{t})\) and \(\psi _{Y}(c_{t})\) in the X-, Y- and Z-directions respectively. Two principal orientations \(\theta (c_t)\) and \(\phi (c_t)\) can then be obtained to capture variability in orientations across (XY), and (XYZ) (in-plane and out-of-plane variability), given by \(\theta (c_t) = \tan ^{- 1}\frac{\psi _Y(c_{t})}{{\psi _X(c_{t})}}\) and \(\phi (c_t) = \tan ^{- 1}\frac{\psi _Z(c_{t})}{\sqrt{\psi _Y^2(c_{t}) + \psi _X^2(c_{t}})}\). Two separate \(\mathcal {N} \times \mathcal {N}\) co-occurrence matrices, \(\mathcal {M}^{\theta }\), and \(\mathcal {M}^{\phi }\) are computed, corresponding to \(\theta (c_t)\) and \(\phi (c_t)\), which capture the orientation pairs between voxels in a local neighborhood. We then individually compute 13 Haralick statistics as [\(\mathcal {S}_{\theta _b}\), \(\mathcal {S}_{\phi _b}\)], \(b \in [1,13] \) from \(\mathcal {M}^{\theta }\) and \(\mathcal {M}^{\phi }\), for every voxel \(c \in \{C_P, C_T\}\) as shown in [11]. For every b, first order statistics (i.e. mean, median, standard deviation, skewness, and kurtosis) are then computed by aggregating [\(\mathcal {S}_{\theta _b}\), \(\mathcal {S}_{\phi _b}\)] for every \(c \in \{C_P, C_T\}\) yielding a feature descriptor \(\mathbb {F}_{tex}^T\) for the tumor volume, and \(\mathbb {F}_{tex}^P\) for the peri-tumoral volume.

  3. 3.

    Computation of r-DepTH descriptor: The descriptor \(\mathbb {F}_{depth}\) is obtained as a feature vector by concatenation of the deformation descriptor, \(\mathbb {F}_{def}\), and the texture descriptors, \(\mathbb {F}_{tex}^T\), and \(\mathbb {F}_{tex}^P\).

4 Experimental Design

4.1 Data Description and Preprocessing

A total of 105 3-Tesla treatment-naive Gadolinium (Gd)-contrast T1w, T2w, and FLAIR MRI GBM studies were retrospectively obtained from the Cancer Imaging Archive [12]. We restricted our inclusion criteria to include short-term survivors with an overall survival (OS) of <240 days and long-term survivors with OS > 540 days. This resulted in a total of 68 patients in the training cohort, with an equal split of 34 STS and LTS cases respectively. An independent cohort of a total of 11 studies (4 LTS and 7 STS cases), with the same MRI sequences as the training set, was obtained from the collaborating institution. The T1w images were first bias-corrected using N4 bias correction [13]. The lesion masks were manually delineated by an expert radiologist as tumor, peri-tumoral, and normal parenchymal regions on T1w MRI scans.

Table 1. List of features computed from T1w scans to distinguish LTS from STS.

4.2 Implementation Details

The normal parenchymal region was divided into \(k =12\) annual bands, such that neighboring bands were equidistant to each other at 5 mm. Hence, each brain MRI volume \(\mathcal {I}\) is associated with a \(60\times 1\) deformation feature vector \(\mathbb {F}_{def}\), with a total of 5 statistics (mean, median, standard deviation, skewness, and kurtosis) obtained from each k, \(k\in [1, \dots , 12]\). Similarly for \(\mathbb {F}_{tex}^T\) and \(\mathbb {F}_{tex}^P\) respectively, the same 5 statistics are computed from [\(\mathcal {S}_{\theta _b}\), \(\mathcal {S}_{\phi _b}\)], \(|\mathcal {S}_{\theta _b}|\) = \(|\mathcal {S}_{\phi _b}| = 13\), resulting in a \(130\times 1\) feature vector, each. Following feature extraction, sequential forward feature selection [14] was employed to identify the most discriminating subset of features between STS and LTS from the training cohort. A total of 50 iterations of three-fold (one fold held-out for testing), patient-stratified, cross-validation scheme was used for constructing a linear discriminant analysis (LDA) classifier using the training set. The top 5 best performing features were obtained for each of the four feature sets, \(\mathbb {F}_{def}\), \(\mathbb {F}_{tex}^P\), \(\mathbb {F}_{tex}^T\), and \(\mathbb {F}_{depth}\) using the training cohort. Additionally, a total of 6 shape features (\(\mathbb {F}_{shape}\)) were also extracted for every \(\mathcal {I}\) for comparison with the other 4 feature sets. The top performing features from each of the 5 feature sets were used to lock down five different LDA classifiers, and independently evaluated on the N = 11 test cases. Kaplan-Meier (KM) survival analysis, along with log-rank test, was independently employed for each of the 5 feature sets, to compare survival times between the two groups (STS versus LTS). The horizontal axis on the KM curve shows the time in days from initial diagnosis, and the vertical axis shows the probability of survival. Any point on the curve reflects the probability that a patient in each group would remain alive at that instance. Labels assigned by the LDA classifier were used for KM-curve generation (Fig. 4).

5 Results and Discussion

5.1 Distinguishing LTS vs STS Using r-DepTH

The analysis on the training dataset on \(\mathbb {F}_{def}\) demonstrated that the skewness of deformation magnitude across LTS (Fig. 3(a)) and STS (Fig. 3(b)) was consistently statistically significantly different (\(p \le 0.05\)) for annular regions \(g \le 30\) millimeters proximal to the tumor (Fig. 3(c), (d)). However, the significance did not hold for \(g >30\) mm across LTS and STS studies. Higher values of skewness are shown in red while lower values are shown in dark blue in Fig. 3. Deformation magnitudes were found to be highly positively skewed (shown in red) in STS as compared to LTS (3(e)) (shown in green). Our results corroborate with recent findings in [15], suggesting that there may be prognostic impact due to tumor burden in certain cognitive areas because of the structural deformation heterogeneity, eventually affecting survival. Further, the top 5 features on the training set (N = 68) across \(\mathbb {F}_{def}\), \(\mathbb {F}_{tex}\) and \(\mathbb {F}_{depth}\), yielded an AUC of \(0.71\pm 0.08\), \(0.77\pm 0.08\) and \(0.83 \pm 0.07\) respectively via a 3-fold cross-validation.

Fig. 3.
figure 3

T1w scans of two different STS (a) and LTS patients (b). Figures 3(c), (d) show the corresponding deformation skewness statistics within 5 mm-annular regions in the normal parenchyma. Histograms of the deformation magnitudes in the first annular band between LTS and STS are shown in (e). Deformation magnitudes were found to be highly positively skewed in STS as compared to LTS. Box plots of deformation skewness across 4 different annular bands, \(g\le 5\), \(5<g\le 10\), \(30<g \le 35\), \(35<g \le 40\) (in mm) are shown in (f).

5.2 Evaluation on Independent Validation Set

Figure 4(a) shows the ideal “ground truth” KM curve for STS and LTS patients obtained on an independent cohort of (N = 11) studies. Figures 4(b)–(d) show the KM curves obtained using the assigned labels from the LDA classifier using \(\mathbb {F}_{def}\), \(\mathbb {F}_{tex}\) and \(\mathbb {F}_{depth}\) respectively. KM curves using \(\mathbb {F}_{def}\) (p=0.176), \(\mathbb {F}_{tex}\) (p = 0.81), \(\mathbb {F}_{shape}\) (p = 0.1) alone to distinguish LTS and STS patients, were not found to be significant. However, interestingly, \(\mathbb {F}_{depth}\) descriptor, yielded a statistically significant survival curve for distinguishing STS versus LTS with p = 0.038. Additionally, the classifier trained on \(\mathbb {F}_{depth}\) could correctly predict the survival group in 9 out of the 11 studies (accuracy = 81%), while \(\mathbb {F}_{tex}^T\) achieved an accuracy of 64%, and \(\mathbb {F}_{tex}^P\) of 54% in predicting the survival group.

Fig. 4.
figure 4

KM curves obtained from the validation set (N = 11) are shown for ground truth (a), \(\mathbb {F}_{def}\) (b), \(\mathbb {F}_{tex}\) (c), and (d) \(\mathbb {F}_{depth}\) respectively.

6 Concluding Remarks

In this study, we present a new radiomics approach, r-DepTH, which comprehensively captures the intra-, and extra-tumoral heterogeneity by measuring (a) the anatomical deformations in surrounding normal parenchyma, and (b) the gradient-based texture representations from within the tumor. The r-DepTH features demonstrated significant improvement in predicting overall survival in GBM patients using KM curve analysis (p = 0.038), over employing deformation and texture features alone. Future work will focus on validating r-DepTH on a larger cohort of studies to establish its efficacy as a new prognostic marker for GBM as well as other solid tumors. We will further employ r-DepTH in conjunction with other known clinical variables, to reliably predict patient outcome and improve treatment management in solid tumors.