1 Introduction

Brain tumor segmentation in multimodal Magnetic Resonance Imaging (MRI) is a vital process for surgical planning and simulation, treatment planning prior to radiation therapy, and therapy evaluation [1,2,3,4,5], as well as for intra-operative neuronavigation and image-guided neurosurgery [6,7,8]. However, segmenting brain tumors manually is not only challenging but also time-consuming, which has favored the emergence of computerized approaches.

Despite considerable research work and encouraging results in the medical imaging domain, fast and precise computerized 3D brain tumor segmentation remains a challenging task, because brain tumors vary widely in size, shape, location and image intensity [2,3,4,5]. Much recent research has adopted deep learning methods [9], specifically Convolutional Neural Networks (CNNs) [9,10,11,12]. CNNs have proved effective at automatically classifying normal and pathological brain MRI scans in the past few BraTS challenges, as well as in other semantic and medical segmentation problems.

This paper proposes an automated and efficient method for segmenting the whole tumor and intra-tumor structures, including enhancing tumor, edema and necrosis, in multimodal 3D-MRI. It is based on a 2D Deep Convolutional Neural Network (DNN) using a modified U-net architecture [10]. The proposed DNN model is trained to segment both High Grade Glioma (HGG) and Lower Grade Glioma (LGG) volumes.

The rest of the paper is organized as follows. First, Sect. 2 presents an overview of the proposed segmentation method. Experimental results with their evaluations are given in Sect. 3. Finally, a conclusion and future work are presented in Sect. 4.

2 The Proposed Method

The proposed segmentation system is entirely automated. The brain tumor segmentation process is based on deep learning, more precisely on 2D Convolutional Neural Networks. It includes the following main steps: pre-processing of the 3D-MRI data, training using a U-net architecture, and prediction of the brain tumoral structures.

2.1 Data and Pre-processing

The BraTS’2018 challenge training dataset consists of 210 pre-operative multimodal MRI scans of subjects with HGG and 75 scans of subjects with LGG, and the BraTS’2018 challenge validation dataset includes 66 additional multimodal 3D-MRI [13,14,15,16]. Images were acquired at 19 different centers using MR scanners from different vendors and with 3T field strength. They comprise co-registered native (T1) and contrast-enhanced T1-weighted (T1Gd) MRI, as well as T2-weighted (T2) and T2 Fluid Attenuated Inversion Recovery (FLAIR) MRI. All 3D-MRI of the BraTS’2018 dataset have a volume dimension of 240 × 240 × 155. They are distributed co-registered to the same anatomical template and interpolated to the same resolution (1 mm³). All MRI volumes were segmented manually by one to four raters, and the annotations were approved by experienced neuro-radiologists. Each tumor was segmented into edema, necrosis and non-enhancing tumor, and active/enhancing tumor.

First, a minimal pre-processing of the MRI data is applied, as in [11]. The 1% highest and lowest intensities are removed, and each modality of the MR images is then normalized by subtracting the mean and dividing by the standard deviation of the intensities within the slice. To address the class imbalance problem in the data, a data augmentation technique [17] was employed. It consists of adding new synthetic images obtained by applying transformations to the data and to the corresponding manual tumor segmentations produced by human experts (i.e., the ground truth). The transformations comprise rotation, translation, and horizontal flipping and mirroring; a minimal sketch of these steps is given below.
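The exact transformation parameters are not specified here, so the following Python sketch only illustrates the per-slice normalization and one possible augmentation; the rotation angle and translation range are assumed values, not settings from this work.

```python
import numpy as np
from scipy.ndimage import rotate, shift

def preprocess_slice(slice_2d, clip_pct=1.0):
    """Remove the 1% highest/lowest intensities, then normalize the
    slice to zero mean and unit standard deviation."""
    lo, hi = np.percentile(slice_2d, (clip_pct, 100.0 - clip_pct))
    clipped = np.clip(slice_2d, lo, hi)
    return (clipped - clipped.mean()) / (clipped.std() + 1e-8)

def augment(image, label, rng=np.random):
    """Apply a random rotation, translation, or horizontal flip to an
    image slice and its ground-truth label, keeping the two aligned."""
    if rng.rand() < 0.5:                          # horizontal flip / mirroring
        image, label = image[:, ::-1], label[:, ::-1]
    angle = rng.uniform(-10, 10)                  # assumed rotation range (deg)
    image = rotate(image, angle, reshape=False, order=1)
    label = rotate(label, angle, reshape=False, order=0)  # nearest for labels
    dx, dy = rng.randint(-10, 11, size=2)         # assumed translation range (px)
    image = shift(image, (dx, dy), order=1)
    label = shift(label, (dx, dy), order=0)
    return image, label
```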

2.2 Network Architecture and Training

The CNN used in this study has an architecture similar to that of U-net [10]; our network is shown in Fig. 1. It consists of a contracting path (left side) and an expanding path (right side). The contracting path consists of 3 pre-activated residual blocks, as in [18, 19], instead of the plain blocks of the original U-net. Each block has two convolution units, each of which comprises a Batch Normalization (BN) layer, a Parametric Rectified Linear Unit (PReLU) activation [20] instead of the ReLU used in the original architecture [10], and a convolutional layer with a 3 × 3 filter, padding of 2 and stride of 1. For down sampling, a convolution layer with a 2 × 2 filter and a stride of 2 is applied, as in [12], instead of the max-pooling used in [10]. At each down sampling step, the number of feature channels is doubled. The contracting path is followed by a fourth residual unit that acts as a bridge connecting both paths. Similarly, the expanding path is built from 3 residual blocks. Prior to each block, an upsampling operation increases the feature map size by a factor of 2, followed by a 2 × 2 convolution and a concatenation with the corresponding feature maps of the contracting path. In the last layer of the expanding path, a 1 × 1 convolution with the Softmax activation function maps the multi-channel feature maps to the desired number of classes.

Fig. 1. Architecture of the proposed Deep Convolutional Neural Network.

In total, the proposed network model contains 7 residual blocks, 25 convolution layers, 15 BN layers and 10,159,748 parameters to optimize.
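As an illustration, a minimal PyTorch sketch of one pre-activated residual block and the strided-convolution down-sampling is given below. Padding of 1 is used in the sketch so that a 3 × 3 convolution preserves the spatial size, and the channel counts are placeholders; this is a sketch of the building block under those assumptions, not the full trained model.

```python
import torch.nn as nn

class PreActResidualBlock(nn.Module):
    """Pre-activated residual block, as in [18, 19]: two units of
    BN -> PReLU -> 3x3 Conv, with an identity (or 1x1 projection) shortcut."""
    def __init__(self, in_ch, out_ch):
        super().__init__()
        self.unit1 = nn.Sequential(
            nn.BatchNorm2d(in_ch), nn.PReLU(),
            nn.Conv2d(in_ch, out_ch, kernel_size=3, stride=1, padding=1))
        self.unit2 = nn.Sequential(
            nn.BatchNorm2d(out_ch), nn.PReLU(),
            nn.Conv2d(out_ch, out_ch, kernel_size=3, stride=1, padding=1))
        # 1x1 projection on the shortcut when the channel count changes
        self.skip = (nn.Conv2d(in_ch, out_ch, kernel_size=1)
                     if in_ch != out_ch else nn.Identity())

    def forward(self, x):
        return self.skip(x) + self.unit2(self.unit1(x))

def downsample(channels):
    """Strided 2x2 convolution instead of max-pooling; doubles the channels."""
    return nn.Conv2d(channels, 2 * channels, kernel_size=2, stride=2)
```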

The designed network was trained with axial slices extracted from the training MRI set, including HGG and LGG cases, together with the corresponding ground truth segmentations. The goal is to find the network parameters (weights and biases) that minimize a loss function. In this work, this is achieved using the Stochastic Gradient Descent (SGD) algorithm [17]: at each iteration, SGD updates the parameters in the direction opposite to the gradients. In our network model, we use a loss function that combines Weighted Cross Entropy (WCE) [17] and the Generalized Dice Loss (GDL) [21] to address the class imbalance present in brain tumor data. The two components of the loss function are defined as follows:

$$ WCE = - \frac{1}{K} \sum\nolimits_{k} \sum\nolimits_{i=1}^{L} w_{i} \, g_{ik} \log \left( p_{ik} \right) $$
(1)
$$ GDL = 1 - 2 \, \frac{\sum\nolimits_{i=1}^{L} w_{i} \sum\nolimits_{k} g_{ik} \, p_{ik}}{\sum\nolimits_{i=1}^{L} w_{i} \sum\nolimits_{k} \left( g_{ik} + p_{ik} \right)} $$
(2)

where \( L \) is the total number of labels and \( K \) denotes the batch size. \( w_{i} \) represents the weight assigned to the \( i \)th label; as in [21], we set \( w_{i} = \frac{1}{{\left( {\sum\nolimits_{k} g_{ik} } \right)^{2} }} \). \( p_{ik} \) and \( g_{ik} \) represent the value of the \( (i, k) \)th pixel of the segmented binary image and of the binary ground truth image, respectively.
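A PyTorch sketch of this combined loss is shown below, assuming p and g are the softmax output and the one-hot ground truth for a batch, with k ranging over all pixels of the batch; the epsilon terms added for numerical stability are an implementation assumption.

```python
import torch

def wce_gdl_loss(p, g, eps=1e-6):
    """Combined Weighted Cross Entropy + Generalized Dice Loss (Eqs. 1-2).
    p: predicted softmax probabilities, shape (K, L, H, W);
    g: one-hot ground truth, same shape. Label weights w_i = 1/(sum_k g_ik)^2."""
    K = p.shape[0]
    p = p.reshape(K, p.shape[1], -1)                  # (K, L, H*W)
    g = g.reshape(K, g.shape[1], -1)
    w = 1.0 / (g.sum(dim=(0, 2)) ** 2 + eps)          # one weight per label i
    # Weighted Cross Entropy, normalized by the batch size K
    wce = -(w[None, :, None] * g * torch.log(p + eps)).sum() / K
    # Generalized Dice Loss over all labels and pixels
    inter = (w * (g * p).sum(dim=(0, 2))).sum()
    union = (w * (g + p).sum(dim=(0, 2))).sum()
    gdl = 1.0 - 2.0 * inter / (union + eps)
    return wce + gdl
```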

2.3 Brain Tumoral Structures Prediction

After network training, prediction can be performed. In this step, the four MRI modalities of an unsegmented volume that the network has never encountered before are provided as input, and the network must return a segmented image.
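Assuming a slice-wise 2D model, the prediction step might look like the following sketch; the function name and tensor layout are illustrative, and the slices are assumed to have already been pre-processed as in Sect. 2.1.

```python
import numpy as np
import torch

def segment_volume(model, volume):
    """Slice-wise prediction: volume has shape (4, 240, 240, 155), one
    channel per MRI modality (FLAIR, T1, T1Gd, T2). Each axial slice is
    fed to the trained 2D network and the per-slice label maps are
    stacked back into a 3D segmentation."""
    model.eval()
    out_slices = []
    with torch.no_grad():
        for z in range(volume.shape[-1]):
            x = torch.from_numpy(volume[..., z]).float().unsqueeze(0)  # (1, 4, 240, 240)
            probs = model(x)                       # (1, n_classes, 240, 240)
            out_slices.append(probs.argmax(dim=1).squeeze(0).numpy())
    return np.stack(out_slices, axis=-1)           # (240, 240, 155) label volume
```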

3 Experimental Results and Discussion

In this study, we have tested and evaluated our segmentation system on pre-operative multimodal MRI scans from both the training/testing and the validation datasets of the BraTS’2018 challenge [22]. The automatically segmented tumors, denoted by A, can be compared with the tumors segmented manually by human experts, denoted by B, which are considered the ground truth for evaluation. The results presented in the subsequent sections were obtained during the BraTS’2018 challenge, for training and validation [22]. In [23], the top 63 approaches are further compared on 191 cases in terms of overlap measures (Dice, Sensitivity and Specificity) and one surface measure based on the Hausdorff distance (HD). These measures assess the accuracy of the segmentation results as well as the similarity between the segmentations A and B [2, 24]. The Dice metric measures the similarity between the two volumes A and B, corresponding to the output segmentation of the model and the clinical ground truth annotations, respectively. The Sensitivity metric measures the proportion of positive voxels of the real brain tumor that are correctly segmented as such, while the Specificity metric indicates how well true negatives are predicted; together, they provide a good assessment of the segmentation result. The HD metric indicates the segmentation quality at the tumor’s border by evaluating the greatest distance between the two segmentation surfaces A and B, and is independent of the tumor size.
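For illustration, these four metrics can be computed from two binary masks as in the sketch below; note that the Hausdorff distance is computed here on all voxel coordinates of the masks rather than on extracted surfaces, which is a simplifying assumption.

```python
import numpy as np
from scipy.spatial.distance import directed_hausdorff

def evaluate(A, B):
    """Metrics between a boolean automatic segmentation A and the
    boolean ground truth B (arrays of the same shape)."""
    tp = np.logical_and(A, B).sum()       # true positives
    fp = np.logical_and(A, ~B).sum()      # false positives
    fn = np.logical_and(~A, B).sum()      # false negatives
    tn = np.logical_and(~A, ~B).sum()     # true negatives
    dice = 2 * tp / (2 * tp + fp + fn)
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    # symmetric Hausdorff distance, approximated on voxel coordinates
    a_pts, b_pts = np.argwhere(A), np.argwhere(B)
    hd = max(directed_hausdorff(a_pts, b_pts)[0],
             directed_hausdorff(b_pts, a_pts)[0])
    return dice, sensitivity, specificity, hd
```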

3.1 Performance on 20% of BraTS’2018 Training Dataset (Testing Set)

Preliminary segmentation results for the 285 3D-MRI of the BraTS’2018 training dataset were obtained using 80% of this dataset (i.e., 228 subjects) for training and the remaining 20% (i.e., 57 subjects) for validation purposes. Results obtained by our automated system for 10 sample cases are shown in Figs. 2 and 3. Figure 2 shows segmentation results for 5 multimodal MRI where HGG tumors are present, and Fig. 3 shows segmentation results for 5 other MRI with LGG tumors. In these figures, each row represents one clinical case. In the first four columns from left to right, images show one axial slice of MRI acquired in FLAIR, T1, T1C and T2 modality, respectively, used as input channels to our CNN model. The fifth and sixth columns show the ground truth (GT) and the prediction labels, respectively, where intra-tumoral regions are distinguished by color code: enhancing tumor (yellow), peritumoral edema (green), and necrotic and non-enhancing tumor (red). As can be seen, the tumors in the brain MRI of the 10 cases vary in size, shape, position and intensity. By visual inspection, the proposed method usually generates segmentations (Prediction) that are visibly similar to the ones obtained by the experts (GT).

Fig. 2. Intra-tumoral structures segmentation results from 5 multimodal 3D-MRI with HGG of the BraTS’2018 training dataset, corresponding to 5 different subjects. (Color figure online)

Fig. 3. Intra-tumoral structures segmentation results from 5 other multimodal 3D-MRI with LGG of the BraTS’2018 training dataset, corresponding to 5 different subjects. (Color figure online)

A quantitative evaluation of the segmentation results for Enhancing Tumor (ET), Whole Tumor (WT) and Tumor Core (TC) using the four previously mentioned metrics is given in Tables 1 and 2. Mean, standard deviation, median, and 25th and 75th percentiles are given for the Dice and Sensitivity metrics in Table 1, and for the Specificity and Hausdorff distance in Table 2. The values in Table 1 show high performance on the Dice metric for the WT region, but lower performance for the ET and TC regions. Overall, the proposed method performed well for the segmentation of the three intra-tumoral regions; however, its effectiveness is more pronounced for HGG tumors than for LGG ones.

Table 1. Quantitative evaluation of segmentation results on 20% of BraTS’2018 training dataset (57 MRI scans) using Dice and Sensitivity metrics.
Table 2. Quantitative evaluation of segmentation results on 20% of BraTS’2018 training dataset (57 MRI scans) using Specificity and Hausdorff distance metrics.

3.2 Performance on BraTS’2018 Validation Dataset

For our participation in the BraTS’2018 competition, 100% of the training dataset, including the previous testing subset (i.e., all 285 subjects), was used for training. The performance on the BraTS’2018 validation dataset, which is composed of 66 subjects, is published on the challenge leaderboard website and presented with more statistics in Tables 3 and 4 and Figs. 4 and 5. In this context, we can compare the obtained segmentation results with those of other participants. The method achieved mean ET, WT, and TC Dice scores of 0.783, 0.868 and 0.805, respectively. These scores are close to those obtained by the top performing methods. Average HD scores of 3.728, 8.127 and 9.84 were obtained for ET, WT, and TC, respectively. In addition, our DNN model maintains similar WT scores on both the 20% of the BraTS’2018 training/testing set used for validation and the final validation dataset proposed for the competition. However, a slight increase in performance on the validation dataset was observed in the ET and TC compartments. It should be noted that this performance was not obtained by overfitting the validation data (i.e., our DNN model was not previously trained on MRI volumes of the BraTS’2018 validation dataset).

Table 3. Quantitative evaluation of segmentation results on BraTS’2018 validation dataset (66 MRI scans) by using Dice and Sensitivity metrics.
Table 4. Quantitative evaluation of segmentation results on BraTS’2018 validation dataset (66 MRI scans) by using Specificity and Hausdorff distance metrics.
Fig. 4. Dispersion of Dice and Sensitivity scores for the segmentation results of ET, WT, and TC in multimodal MRI scans of the 66 subjects of the BraTS’2018 validation dataset.

Fig. 5. Dispersion of Specificity and Hausdorff distance scores for the segmentation results of ET, WT, and TC in multimodal MRI scans of the 66 subjects of the BraTS’2018 validation dataset.

This performance can be explained by the fact that the training dataset used before segmenting the validation dataset is larger than the one used before segmenting the training/testing dataset: 285 versus 228 cases, respectively. It is also possible that the slight improvements obtained on the validation dataset are due to the fact that the latter contains more MRI with HGG tumors than with LGG tumors. Indeed, the segmentation efficiency of the proposed network is more evident on HGG volumes than on LGG ones.

Boxplots showing the dispersion of Dice and Sensitivity scores are represented in Fig. 4 and boxplots of the dispersion of Specificity and HD scores are represented in Fig. 5. In these figures, boxplots show quartile ranges of the scores; whiskers and dots ‘●’ indicate outliers; and ‘x’ indicates the mean score.

4 Conclusion and Future Work

In this paper, a fully automatic and accurate method for the segmentation of the whole brain tumor and intra-tumoral regions was proposed, using a 2D deep convolutional network based on U-net, a well-known architecture in medical imaging. The constructed DNN model was trained to segment both HGG and LGG volumes.

The proposed method was tested and evaluated quantitatively on both the BraTS’2018 training and challenge validation datasets. The total training time for the 285 multimodal MRI volumes of the BraTS’2018 training dataset was 185 h on a cluster machine with an Intel Xeon E5-2650 CPU @ 2.00 GHz (64 GB RAM) and an NVIDIA Quadro 4000 GPU with 448 CUDA cores (2 GB). The average time to segment a brain tumor and its components from a given MRI volume is about 62 s on the same GPU. The different tests showed that the segmentation results were very satisfactory, and the evaluation measures confirm that our results are very similar to those obtained manually by the experts, although the proposed method can be further improved.

As future work, we plan to use a more powerful GPU to further accelerate the training phase of the DNN, which would allow testing a larger number of CNN topologies as well as other data augmentation methods. Another interesting perspective is the use of ensemble learning methods, such as stacking and blending, to improve segmentation performance in the tumor core and active tumor regions. Finally, future work may focus on refining the segmentation results by reducing the false-positive rate with post-processing techniques, such as applying a conditional random field (CRF) or removing small connected components; a minimal sketch of the latter is given below.
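The sketch below illustrates only the connected-component post-processing; the minimum component size is an assumed threshold, not a value from this work.

```python
import numpy as np
from scipy.ndimage import label

def remove_small_components(seg, min_size=100):
    """Post-processing sketch: drop connected components of the predicted
    tumor label volume smaller than min_size voxels to reduce false positives."""
    mask = seg > 0                        # binary whole-tumor mask
    labeled, n = label(mask)              # label each connected component
    cleaned = seg.copy()
    for comp in range(1, n + 1):
        comp_mask = labeled == comp
        if comp_mask.sum() < min_size:
            cleaned[comp_mask] = 0        # remove the small component
    return cleaned
```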