Holistic Segmentation of Intermuscular Adipose Tissues on Thigh MRI

Yao, Jianhua; Kovacs, William; Hsieh, Nathan; Liu, Chia-Ying; Summers, Ronald M.

doi:10.1007/978-3-319-66182-7_84

Holistic Segmentation of Intermuscular Adipose Tissues on Thigh MRI

Jianhua Yao²¹,
William Kovacs²¹,
Nathan Hsieh²¹,
Chia-Ying Liu²¹ &
…
Ronald M. Summers²¹

Conference paper
First Online: 04 September 2017

11k Accesses
12 Citations

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 10433))

Abstract

Muscular dystrophies (MD) cause muscles to gradually degenerate into fat. In order to effectively study and track disease progression, it is important to quantify both muscle and fat volumes, especially the intermuscular adipose tissue (IMAT). Existing methods were mostly based on unsupervised pixel clustering and morphological models. We propose a method integrating two holistic neural networks (one for edges and one for regions) and a dual active contour model to accurately locate the fascia lata and segment multiple tissue types on thigh MRIs. The proposed method is robust to image artifacts and weak boundaries, and thus it performs well for severe MD cases. Our method was tested on 104 data sets and achieved Dice coefficients 0.940 and 0.943 for muscle and IMAT in challenging severe cases, respectively.

You have full access to this open access chapter, Download conference paper PDF

1 Introduction

Muscular dystrophies (MD) cause muscles to gradually degenerate into adipose tissue, which can affect the patient’s ability to walk and even breathe. Duchenne muscular dystrophy is the most common and severe type of MD and causes muscle weakness in the legs and upper arms [1]. In order to effectively plan the treatment and study disease progression, accurate segmentation of muscle and fat is desirable.

MRI has often been used to evaluate muscle distribution in the thighs. Figure 1 shows examples of T1-weighted thigh MRI of different MD severity levels. As shown in the examples, in severe cases, most muscle tissue has been converted to intermuscular adipose tissue (IMAT). IMAT has the same pixel intensity as subcutaneous adipose tissue (SAT). IMAT and SAT are separated by a boundary called the fascia lata, which normally forms as a stocking-like membrane around the muscles of the thighs. The fascia lata can be obscure on MRI and other fascia may interfere with its identification.

A number of investigations have been published on muscle segmentation on thigh MRI. Most of them worked on normal thighs or thighs with mild MD and were based on unsupervised pixel clustering and morphological models. Origiu et al. [2] developed an active contour model to detect the muscle boundary and a fuzzy c-means method to distinguish muscle from fat. Similarly, Positano et al. [3] also used fuzzy clustering and edge map to differentiate muscle and IMAT. Chambers et al. [4] used livewire to segment the muscle boundary and fuzzy c-means for pixel classification. Tan et al. [5] developed a random forest classifier to detect the fascia lata near the muscle boundaries, and thus only worked for mild MD cases. Kovacs et al. [6] presented the state-of-the-art method on severe MD tissue segmentation. They developed sophisticated filters and models to detect the fascia lata, remove outliers and incorporate anatomical knowledge. Fuzzy c-means was also adopted for pixel classification.

While existing methods would work well with mild disease cases, they may struggle with moderate to severe cases. Fuzzy clustering may fail when the pixel intensity is inconsistent and inhomogeneity artifacts exist (Fig. 1b), which are known problems affecting MR images. Furthermore, fascia lata detection in prior methods relied on the remaining muscles boundaries, which may be unreliable in severe cases.

To address these challenges, we propose a framework that combines deep learning techniques and traditional deformable models. Particularly, we adopt a holistically-nested edge detection (HED) approach for fascia lata detection. The motivation is that the fascia lata does not appear as isolated edges, but instead as a group of edges forming a specific shape, which can be taken advantage of by the holistic image-to-image training and prediction. Furthermore, multi-level output from HED can be used to derive robust segmentation. Our main contributions are threefold: (1) adopt two deep learning networks to classify both edges and regions; (2) integrate deep learning outputs with traditional deformable models; and (3) propose a dual active contour model for accurate fascia lata localization.

2 Method

Our holistic segmentation framework is outlined in Fig. 2. First, a preprocessing step is implemented to mask out the thighs and separate the left and right thighs. They are then fed into a holistic edge neural net for fascia lata detection. The output edge potential map drives a dual active contour model to accurately locate the fascia lata boundaries. A second holistic region neural net combined with the output of the fascia lata detection then classifies muscle, IMAT and SAT regions in the thighs. The algorithm is conducted on 2D slices and can be extended to 3D.

2.1 Preprocessing

In the preprocessing stage, an adaptive threshold is applied to mask thighs from the background. Then the two thighs are separated by connected component analysis. A region is cropped for each thigh for further processing. The cortical bones and bone marrows are segmented using thresholding and morphological closing. The current algorithm does not correct the inhomogeneity artifact that inherent to MRI images.

2.2 Holistic Fascia Lata Detection

As seen in Fig. 1, fascia lata boundaries can be weak. In this setting, traditional edge detectors such as Canny may not reliably detect the fascia lata without leading to over-detection. To extract semantically meaningful boundaries, an image-to-image holistic training and prediction is applied. Holistically-nested Edge Detection, HED, proposed by Xie and Tu [7], is a deep learning model that leverages fully convolutional networks and deeply supervised nets.

The network architecture is illustrated in Fig. 3, which consists of a sequence of deep network layers with side outputs at each stage. The network has 5 stages, with strides of 1, 2, 4, 8, and 16 respectively. The side outputs are fused to give the final prediction, which incorporates coherent contribution at different scales. Each side output is associated with a classifier with a class-balanced cross-entropy loss function,

$$ l_{side} \left( W \right) = - \beta \sum\nolimits_{{j \in Y^{ + } }} {logPr\left( {y_{j} = 1 |X} \right) - \left( {1 - \beta } \right)} \sum\nolimits_{{j \in Y^{ - } }} {logPr\left( {y_{j} = 0 |X} \right)} $$

(1)

Here X is the input image, Y is the ground truth edge map, Y⁺ is the set of edge pixels, Y⁻ is the set of non-edge pixels. W is the network layer parameters. β is the balancing weight between edge and non-edge samples, and β = |Y⁻|/|Y|. Pr is the probability of a pixel belonging to a certain class.

The fused output is a weighted combination of all side outputs and its loss function is the distance between the fused prediction and the ground truth edge map. The network parameters W and combination weight h are learned by minimizing the sum of the loss function at each layer via back-propagation and stochastic gradient decent.

Given a test image X_t, the edge map predictions can be obtained by forwarding the convolutional neural network (CNN) with fitted parameters and weights

$$ \left( {Y_{fused} ,Y_{side1} ,Y_{side2} ,Y_{side3} ,Y_{side4} ,Y_{side5} } \right) = CNN(X_{T} , W, h) $$

(2)

Examples of side outputs and fused output are shown in Fig. 3. The fused edge map indicates the probability that a pixel belongs to the fascia lata.

The network is trained using manually labelled fascia latae. Since it is very difficult to manually locate the precise boundary, we employ a Canny edge detector to assist ground truth generation. An operator first traces a contour close to the fascia lata. Then the Canny edges closest to the manual tracing are used for the HED training. If no Canny edge exists in the neighborhood, the pixel with the maximum gradient is used. Figure 4 shows the training sample generation.

2.3 Dual Active Contour Model for Fascia Lata Segmentation

The holistic edge image represents the probability of a pixel being on the fascia lata. As shown in Figs. 3 and 5b, it forms a thick band around the fascia lata and some weak boundaries may be missing. In order to obtain a continuous and precise fascia lata segmentation, we employ a dual active contour model (DACM) on the holistic edge potential map.

The DACMs are driven by internal forces, edge potential forces, and constraints between the exterior and interior contours. The energy function is written as,

$$ \begin{aligned} & {\rm E}(C_{E} ) = w_{i} I(C_{E} ) + w_{p} P^{ + } (C_{E} ) + w_{c} S(C_{E} ,C_{I} ) \\ & {\rm E}(C_{I} ) = w_{i} I(C_{I} ) + w_{p} P^{ - } (C_{I} ) + w_{c} S(C_{E} ,C_{I} ) \\ \end{aligned} $$

(3)

where C_E and C_I are exterior and interior contours, I(C) is the internal spline force, P⁺(C) and P⁻(C) are positive and negative edge potential forces, S(C_E,C_I) is the constraint between the two contours, and w_i, w_p and w_c are weights for the three forces. The internal forces keep the contour smooth and continuous. Directional gradient is applied to compute the edge potential force. For a point on the holistic edge image, the directional gradient is computed as the gradient along the line between that point and the center of the cortical bone (obtained in the preprocessing step). Positive P⁺(C) and negative P⁻(C) directional gradients are used for exterior and interior contour evolution respectively. The constraint between the dual contours is the thickness of DACM (the distance between two contours). If the thickness at some points is greater than one standard deviation beyond the mean, a constraint force will be exerted to pull the two contours closer. The weights for different forces are kept constant throughout the evolution. The initialization of the DACM is as follows. From the center of the femur bone, a ray is shot outwards in every direction (360°). The first point with probability >0.5 on the holistic edge image hit by the ray is set as one point on the initial interior contour. The initial interior contour is expanded outward until hitting a point with probability 0, which is then set as an initial exterior contour point. Outliers are removed if their distances to the bone are beyond two standard deviations from mean. Missing initial contour points are interpolated by neighbors.

The average of converged C_I and C_E is set as the final fascia lata contour. Figure 5 demonstrates the DACM for fascia lata segmentation.

2.4 Holistic Tissue Classification

All thigh pixels outside the fascia lata are considered to be subcutaneous adipose tissue (SAT). The regions inside the fascia lata are classified as either muscle or IMAT (cortical bones and bone mallows have already been masked out in the preprocessing stage). For now, we ignore other tissues such as vessel, tendon and nerve since they only count for a very small portion of the thigh.

Using architecture similar to the holistic fascia lata detector, a holistic region neural network is designed to classify muscle and IMAT regions in an image-to-image training and prediction manner. The training images are cropped around the fascia lata with manually delineated muscle regions as foreground and IMAT as background. The fused output returns the probability map of a pixel belonging to the muscle region. The images are normalized using the cortical bone intensity (lowest intensity) and 95% highest intensity in the image to address the intensity inconsistency issue with MRI images. A threshold of 0.5 is applied on the holistic output to differentiate muscle and IMAT. Figure 6 shows several side outputs of the holistic muscle neural net and the final tissue classification result.

2.5 Implementation Details

The holistic network is implemented using the publicly available Caffe library [8]. The network is first loaded with pre-trained VGG-16 model [9], and then fine-tuned with our thigh training data. The hyper-parameters include: learning rate $ ( 1 {\text{e}}^{ - 6} ) $, momentum (0.9 for edge, 0.5 for region), weight decay (0.0002). All images are resized to 400 * 400 for training and prediction. The left and right thigh images are also mirrored to augment the training set. The training takes about 4 h on two NVidia Titan X GPUs with 12 GB memory. The testing takes about 0.6 s.

3 Results

T1 weighted thigh MRIs from 104 patients (mean age:40 year, 38 male, 66 female) were acquired for this research. The MR parameters included TR 600 ms, TE 20 ms, flip angle 150, slice thickness 8 mm, and in-plane resolution 0.98 mm. Among the studies, there were 16 mild/normal, 23 moderate and 65 severe cases. We randomly selected 25 cases (9 mild, 8 moderate, and 8 severe) for quantitative training and testing, and the rest for visual inspection and qualitative assessment. For the 25 cases, an operator manually traced the fascia lata and classified the tissues. 15 cases (322 thigh images) were used for training and 10 cases (180 thigh images) for independent testing. For a subset of the test set (60 thigh images), a second operator conducted manual segmentation independently for the inter-observer variability assessment. The results reported in this section are for the test set performance.

Figure 7 shows results for fascia lata segmentation done via manual segmentation, active contour model, the state-of-the art model based method [6], and our proposed method. Table 1 compares the performance metrics including Dice coefficient of the region enclosed by the fascia lata, mean and max contour distance for active contour model, state-of-the-art model based method [6], inter-observer variability, and our proposed method. Classic active contour models work well on mild cases, but fail to detect the fascia lata in severe cases. The state-of-the-art model-based method misses a few boundaries and has trouble with severe inhomogeneity artifact.

Table 1. Fascia lata segmentation performance

Full size table

Figure 8 shows results of tissue classification. Table 2 compares the performance of the fuzzy C-means method, inter-observer variability and our proposed method. We note that our method works robustly for severe disease cases and cases with severe inhomogeneity artifact. Our method also performs comparably (sometimes better than) with inter-observer variability.

Table 2. Tissue classification performance (Dice Coefficient).

Full size table

4 Conclusion

The two (edge and region) holistic neural networks built on top of fully connected network and deeply-supervised network are capable of detecting weak boundaries such as the fascia lata and handle image artifact. Integrating the holistic network with dual active contour models allow us to achieve highly accurate classification for severe muscle dystrophy cases on MRI. The proposed method outperforms the state-of-the-art model-based methods and unsupervised clustering techniques.

References

Mercuri, E., Muntoni, F.: Muscular dystrophies. Lancet 381, 845–860 (2013)
Article Google Scholar
Orgiu, S., et al.: Automatic muscle and fat segmentation in the thigh from T1-Weighted MRI. J. MRI 43(3), 601–610 (2016)
Google Scholar
Positano, V., et al.: Accurate segmentation of subcutaneous and intermuscular adipose tissue from MR images of the thigh. J. MRI 29, 677–684 (2009)
Google Scholar
Chambers, O., et al.: Computer-based assessment for facioscapulohumeral dystrophy diagnosis. Comput. Methods Progr. Biomed. 120(1), 37–48 (2015)
Article Google Scholar
Tan, C., et al.: Accurate thigh inter-muscular adipose quantification using a data-driven and sparsity-constrained deformable model. In: IEEE International Symposium on Biomedical Imaging (2015)
Google Scholar
Kovacs, W., et al.: Identification of muscle and subcutaneous and intermuscular adipose tissue on thigh MRI of muscular dystrophy. In: IEEE International Symposium on Biomedical Imaging (ISBI) (2016)
Google Scholar
Xie, S., Tu, Z.: Holistically-nested edge detection. In: IEEE International Conference on Computer Vision (2015)
Google Scholar
Jia, Y., et al.: Caffe: convolutional architecture for fast feature embedding. In: The 22nd ACM International Conference on Multimedia (2014)
Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: International Conference on Learning Representations (2015)
Google Scholar

Download references

Acknowledgement

This research was supported by the National Institutes of Health, Clinical Center. The authors thank Nvidia for the GPU donation.

Author information

Authors and Affiliations

Clinical Image Processing Services, Radiology and Imaging Sciences, Clinical Center, National Institutes of Health, Bethesda, USA
Jianhua Yao, William Kovacs, Nathan Hsieh, Chia-Ying Liu & Ronald M. Summers

Authors

Jianhua Yao
View author publications
You can also search for this author in PubMed Google Scholar
William Kovacs
View author publications
You can also search for this author in PubMed Google Scholar
Nathan Hsieh
View author publications
You can also search for this author in PubMed Google Scholar
Chia-Ying Liu
View author publications
You can also search for this author in PubMed Google Scholar
Ronald M. Summers
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jianhua Yao .

Editor information

Editors and Affiliations

Université de Sherbrooke, Sherbrooke, QC, Canada
Maxime Descoteaux
DKFZ, Heidelberg, Germany
Lena Maier-Hein
Ulm University of Applied Sciences, Ulm, Germany
Alfred Franz
Université de Rennes 1, Rennes, France
Pierre Jannin
McGill University, Montreal, QC, Canada
D. Louis Collins
Université Laval, Québec, QC, Canada
Simon Duchesne

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yao, J., Kovacs, W., Hsieh, N., Liu, CY., Summers, R.M. (2017). Holistic Segmentation of Intermuscular Adipose Tissues on Thigh MRI. In: Descoteaux, M., Maier-Hein, L., Franz, A., Jannin, P., Collins, D., Duchesne, S. (eds) Medical Image Computing and Computer Assisted Intervention − MICCAI 2017. MICCAI 2017. Lecture Notes in Computer Science(), vol 10433. Springer, Cham. https://doi.org/10.1007/978-3-319-66182-7_84

Download citation

DOI: https://doi.org/10.1007/978-3-319-66182-7_84
Published: 04 September 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-66181-0
Online ISBN: 978-3-319-66182-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The Medical Image Computing and Computer Assisted Intervention Society (opens in a new tab)