Introduction

The hippocampus (HC) is a bilateral brain structure located in the medial temporal lobe, on both sides of the brainstem and close to the cerebellum. The HC is involved in many brain functions such as memory and spatial reasoning1. It also plays an important role in many neurodegenerative diseases such as Alzheimer's disease (AD)2. Furthermore, hippocampus volume estimation is considered a valuable tool for follow-up and treatment adjustment3,4,5.

In recent years, many HC segmentation methods have been proposed6,7,8. Most of them were restricted to considering the hippocampus as a single structure9 due to image resolution limitations. However, it is well known that, for example, AD affects the different HC subfields at different moments of the disease progression2,10. Thus, automatic and accurate HC subfield segmentation methods would be highly valuable to obtain early biomarkers of the disease.

Currently, advances in modern MR sequences allow acquiring high-resolution images, making it possible to divide the hippocampus into its constituent parts. In recent years, several delineation protocols have been proposed (some of them have been used to create manually labeled MRI datasets). However, there is still little consensus among the different HC subfield protocols, as shown in Ref.11, where 21 delineation protocols were compared. For example, in 2013, Winterburn et al. presented a new in-vivo high-resolution atlas12 dividing the hippocampus into five sub-regions: CA1, CA2-3, CA4/DG, Stratum and Subiculum. Later, in 2015, Kulaga-Yoskovitz et al. developed another segmentation protocol13 consisting of three structures: CA1-3, CA4/DG and Subiculum.

Several automatic methods for HC subfield segmentation have been developed in recent years14,15,16. One of the most well-known is ASHS17, which uses a multi-atlas approach combined with similarity-weighted voting and a boosting-based error correction. Unfortunately, this method took several hours to produce a segmentation due to the exhaustive use of non-linear registrations (an updated version of this software has greatly reduced this time to a few minutes). More recently, we proposed a method named HIPS18 that obtained state-of-the-art results on two different delineation protocols (Winterburn and Kulaga-Yoskovitz) with relatively low processing times thanks to the use of a fast multi-atlas label fusion method called OPAL19. Although these methods achieve promising results, their automatic measurements are not close enough to manual tracings in some cases20.

Recently, due to the expansion of deep learning in medical imaging, novel methods based on this technology have been proposed to further improve the accuracy of HC segmentation. For full hippocampus segmentation, many methods based on convolutional neural networks (CNN) have already been proposed21,22,23,24. Deep learning-based methods have also been proposed for hippocampus subfield segmentation. For example, UGNET25 uses an adversarial training approach, and variants of the well-known UNET architecture26, such as the Dilated Dense UNET27, have also been proposed. However, one of the major problems of supervised deep learning methods is their need for large amounts of training data to generalize to unseen data.

In this paper, we propose a novel deep learning-based segmentation method that benefits from a problem-specific preprocessing locating the data in a canonical geometric and intensity space, thereby simplifying the segmentation problem and reducing the need for large amounts of manually labeled data. The proposed method has been validated using two hippocampus subfield segmentation protocols with publicly available datasets.

Materials and methods

Training data

In this work, we have used two different datasets, following two manual hippocampus subfield labeling protocols, both with high-resolution (HR) T1w and T2w MR images (see Fig. 1). Details of these datasets are given below:

Figure 1

Examples from Winterburn and Kulaga-Yoskovitz datasets showing T1w, T2w and manual segmentations. Images generated using ITK-SNAP v 3.4.0.

Kulaga-Yoskovitz dataset

This dataset includes 25 subjects from a public repository (http://www.nitrc.org/projects/mni-hisub25) (31 ± 7 years, 12 males, 13 females) with manually segmented labels dividing the HC into three parts (CA1-3, DG-CA4 and Subiculum). The Ethics Committee of the Montreal Neurological Institute and Hospital approved the study; all participants gave their written informed consent prior to scanning, in accordance with the standards of the Declaration of Helsinki, and received a monetary compensation. MR data from each subject consist of an isotropic 3D-MPRAGE T1-weighted image (0.6 mm3) and anisotropic 2D T2-weighted TSE images (0.4 × 0.4 × 2 mm3). Images underwent automated correction for intensity non-uniformity and intensity standardization, and were linearly registered to the MNI152 space. T1w and T2w images were resampled to a resolution of 0.4 mm3. To reduce interpolation artifacts, the T2w data were upsampled using a non-local super-resolution method28. For more details about the labeling protocol see the original paper13.

Winterburn dataset

This dataset contains 5 subjects with 0.3 × 0.3 × 0.3 mm3 high-resolution T1-weighted and T2-weighted images obtained by 2 × interpolation of 0.6 × 0.6 × 0.6 mm3 acquisitions, and their corresponding manual segmentations. The HR images are publicly available at the CoBrALab website (http://cobralab.ca/atlases). These MR images were taken from 5 healthy volunteers (2 males, 3 females, aged 29–57). The study was conducted in keeping with the Declaration of Helsinki, was approved by the Centre for Addiction and Mental Health Research Ethics Board, and all subjects provided written, informed consent for data acquisition and sharing. High-resolution T1-weighted images were acquired using a 3D inversion-prepared fast spoiled gradient-recalled echo acquisition (TE/TR = 4.3 ms/9.2 ms, TI = 650 ms, α = 8°, 2 NEX and isotropic resolution of 0.6 mm3). High-resolution T2-weighted images were acquired using a 3D fast spin echo acquisition, FSE-CUBE (TE/TR = 95.3 ms/2500 ms, ETL = 100, 2 NEX, and isotropic resolution of 0.6 mm3). Reconstruction filters, ZIPX2 and ZIP512, were also used, resulting in final isotropic 0.3 mm3 voxels. The hippocampus and each of its subfields were segmented manually by an expert rater using 5 labels (CA1, CA2/3, CA4/DG, SR/SL/SM, and subiculum). For more details about the labeling protocol see the original paper12. All methods were performed in accordance with relevant guidelines and regulations.

Example images of these two protocols are shown in Fig. 1. Images and labels were visualized using ITK-SNAP v 3.4.0 software (http://www.itksnap.org).

Image preprocessing

The images were preprocessed using the following steps: (1) denoising using the Spatially Adaptive Non-Local Means filter29; (2) intensity inhomogeneity correction using the N4 bias field correction30; (3) affine registration to the Montreal Neurological Institute (MNI) space using the Advanced Normalization Tools (ANTs) package31 (the registration was estimated between the T1w images and the T1w MNI152 template at 0.5 mm3 resolution and applied to both T1w and T2w images; a rigid transformation from T2w to T1w was previously estimated and concatenated with the T1w transformation so that a single interpolation step was performed when registering both images); (4) cropping around the HC area to reduce memory requirements and computational cost; and (5) intensity normalization of the cropped images by subtracting the image mean and dividing by its standard deviation.
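
A minimal sketch of this pipeline for the T1w channel is shown below, assuming the ANTsPy bindings (the paper used the ANTs tools themselves; the function name, template file and crop bounds are hypothetical placeholders):

```python
import ants  # ANTsPy bindings (an assumption; the paper used the ANTs package)

def preprocess_t1(t1_path, template_path, lower, upper):
    """Sketch of steps (1)-(5); 'lower'/'upper' are hypothetical HC crop
    bounds in MNI152 voxel coordinates."""
    t1 = ants.image_read(t1_path)
    template = ants.image_read(template_path)            # MNI152 T1w at 0.5 mm
    t1 = ants.denoise_image(t1)                          # (1) adaptive non-local means
    t1 = ants.n4_bias_field_correction(t1)               # (2) N4 correction
    reg = ants.registration(fixed=template, moving=t1,
                            type_of_transform='Affine')  # (3) affine to MNI space
    crop = ants.crop_indices(reg['warpedmovout'], lower, upper)  # (4) crop HC area
    x = crop.numpy()
    return (x - x.mean()) / x.std()                      # (5) intensity normalization
```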

Proposed method

Our proposed method is based on a variant of the well-known UNET architecture26. The proposed UNET has 4 resolution levels (from 0.5 to 4 mm). At each resolution level we used three blocks of BatchNormalization, 3D convolution (kernel size of 3 × 3 × 3 voxels) and ReLU layers. We also used dropout layers (with 0.5 rate) in the encoding path of the UNET to minimize overfitting. The input of the network is a tensor with two channels (T1 and T2 images). The first resolution level has 64 filters and each subsequent level doubles this number to compensate for the loss of spatial resolution. Similarly, the number of filters is halved at each resolution level in the decoding path. The output is a tensor of nc channels representing the probabilities of each subfield and the background.
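
A minimal Keras sketch of one such resolution-level block follows (the function name and arguments are ours; it only illustrates the BatchNormalization + Conv3D + ReLU layout with encoder-side dropout described above):

```python
from keras.layers import Activation, BatchNormalization, Conv3D, Dropout

def conv_block(x, n_filters, encoder=False):
    # three BatchNormalization + Conv3D(3x3x3) + ReLU blocks per resolution level
    for _ in range(3):
        x = BatchNormalization()(x)
        x = Conv3D(n_filters, (3, 3, 3), padding='same')(x)
        x = Activation('relu')(x)
    if encoder:
        x = Dropout(0.5)(x)  # dropout only in the encoding path
    return x
```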

We also used a modified version of the deep supervision32 approach, which helps to train very deep networks by producing segmentations at different resolution levels. Deep supervision has been shown not only to counteract the adverse effects of gradient vanishing but also to speed up convergence and produce highly accurate results even with limited data. The main difference of our implementation compared to Dou et al.32 is that we also used the upsampled low-resolution outputs as inputs to the next level of the decoder (concatenated with the upsampled features and the encoder shortcut) to help at the next resolution level (only for the 1 and 2 mm resolution levels). The resulting network has 56 layers and 35,085,580 trainable parameters. The scheme of the proposed network is shown in Fig. 2. We will refer to this deep supervised variant of the UNET as DS-UNET3D.
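
The sketch below illustrates, under our own naming assumptions, how one decoder level could wire this feedback: the auxiliary softmax head is upsampled and concatenated with the upsampled features and the encoder shortcut of the next level:

```python
from keras.layers import Conv3D, UpSampling3D, concatenate

def ds_decoder_step(feats, skip, n_classes):
    # auxiliary segmentation head at the current (coarse) resolution;
    # it receives its own weighted loss term during training
    aux_out = Conv3D(n_classes, (1, 1, 1), activation='softmax')(feats)
    # feed the upsampled low-resolution output forward, together with the
    # upsampled features and the encoder shortcut of the next level
    merged = concatenate([UpSampling3D((2, 2, 2))(feats),
                          UpSampling3D((2, 2, 2))(aux_out),
                          skip])
    return merged, aux_out
```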

Figure 2

Scheme of the proposed deep supervised UNET CNN.

The loss function plays a major role in the training process, and an exhaustive search for the most suitable function for the proposed architecture and problem has to be done. One of the most common loss functions for classification is the categorical cross entropy. However, for segmentation purposes it is common to use the dice loss (DL)33 as it directly optimizes the most commonly used segmentation metric and is more robust to the class imbalance problem (1). Recently, a Generalized Dice Loss (GDL)34 was proposed to deal with the well-known dependency of the dice index on label size (2). Inspired by the GDL, we propose in this paper the Generalized Jaccard Loss (GJL) (4), a variant of the Jaccard loss (3) following the same idea of reducing label size dependency:

$$DL\left(p,t\right)=1-\frac{2}{NC}\sum_{c=1}^{NC}\frac{\sum_{i=1}^{N}p_{ci}\,t_{ci}}{\sum_{i=1}^{N}\left(p_{ci}+t_{ci}\right)},$$
(1)
$$GDL\left(p,t\right)=1-2\,\frac{\sum_{c=1}^{NC}w_{c}\sum_{i=1}^{N}p_{ci}\,t_{ci}}{\sum_{c=1}^{NC}w_{c}\sum_{i=1}^{N}\left(p_{ci}+t_{ci}\right)},$$
(2)
$$JL\left(p,t\right)=1-\frac{1}{NC}\,\frac{\sum_{c=1}^{NC}\sum_{i=1}^{N}p_{ci}\,t_{ci}}{\sum_{c=1}^{NC}\left(\sum_{i=1}^{N}\left(p_{ci}+t_{ci}\right)-\sum_{i=1}^{N}p_{ci}\,t_{ci}\right)},$$
(3)
$$GJL\left(p,t\right)=1-\frac{\sum_{c=1}^{NC}w_{c}\sum_{i=1}^{N}p_{ci}\,t_{ci}}{\sum_{c=1}^{NC}w_{c}\left(\sum_{i=1}^{N}\left(p_{ci}+t_{ci}\right)-\sum_{i=1}^{N}p_{ci}\,t_{ci}\right)},$$
(4)

where N is the number of voxels, NC is the number of classes, p is the predicted probability, t is the true probability and \(w_{c}=1/{\left(\sum_{i=1}^{N}t_{ci}\right)}^{2}\). In contrast, our proposed GJL loss does not use the squared volume to normalize but just the volume, i.e. \(w_{c}=1/\sum_{i=1}^{N}t_{ci}\).
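
A direct Keras implementation of Eq. (4) could look as follows (a minimal sketch, assuming one-hot labels and softmax predictions of shape batch × x × y × z × NC; the epsilon terms are ours, added for numerical stability):

```python
from keras import backend as K

def generalized_jaccard_loss(y_true, y_pred):
    axes = (0, 1, 2, 3)                                 # sum over batch and spatial dims
    intersection = K.sum(y_true * y_pred, axis=axes)    # per-class sum_i p_ci * t_ci
    union = K.sum(y_true + y_pred, axis=axes) - intersection
    w = 1.0 / (K.sum(y_true, axis=axes) + K.epsilon())  # linear volume weights
    return 1.0 - K.sum(w * intersection) / (K.sum(w * union) + K.epsilon())
```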

As the size of our training datasets is small (especially for the Winterburn dataset), we used several data augmentation approaches. We randomly smoothed and sharpened the images to simulate different image quality conditions during training. We also expanded the Winterburn dataset with automatic segmentations of the Kulaga-Yoskovitz dataset obtained using the HIPS method. Note that only training data was used as library to generate these segmentations. Finally, we also used mixup35 as a data-agnostic data augmentation method.
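
The random smoothing/sharpening could be implemented, for instance, as a Gaussian blur or an unsharp mask (a sketch with parameter ranges of our own choosing, since the text does not specify them):

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def random_smooth_or_sharpen(img, max_sigma=1.0):
    sigma = np.random.uniform(0.1, max_sigma)
    blurred = gaussian_filter(img, sigma)
    if np.random.rand() < 0.5:
        return blurred                 # smoothed: simulates a lower-quality scan
    return img + (img - blurred)       # unsharp mask: simulates a sharper scan
```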

Batch normalization is a highly effective way to speed up the training process and to improve the results by minimizing the internal covariate shift. However, we realized that when used with small batch sizes it behaves sub-optimally at test time. The reason for this issue is that the Batch Normalization layer behaves differently at training and test time. During training, the mean and standard deviation of the activation maps are computed for the whole batch, using a moving average estimation to enforce stability. At test time, however, the network does not process any batch of data and therefore cannot estimate the batch mean and standard deviation; as a result, it uses the historical mean and standard deviation stored during training. Unfortunately, when using small batch sizes (N = 1 in our case) the stored values do not work very well. Nevertheless, if we run the network in training mode we force it to use the current mean and standard deviation of the new case, and the results are significantly improved. We call this training time batch normalization (TTBN).
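
In Keras 2.x this behavior can be obtained by evaluating the model with the learning phase set to training (a sketch assuming a trained model and a preprocessed input x; note that the global learning phase also activates dropout, so a per-layer training=True call may be preferable in tf.keras):

```python
from keras import backend as K

# build a predict function that runs the graph in training mode (learning_phase=1),
# so BatchNormalization uses the statistics of the current case instead of the
# moving averages stored during training
predict_ttbn = K.function([model.input, K.learning_phase()], model.outputs)
probabilities = predict_ttbn([x, 1])[0]  # x: preprocessed (1, dx, dy, dz, 2) tensor
```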

Experiments and results

In this section, the analysis of the different options of the proposed method and their results are presented. To evaluate the segmentation accuracy, we used the DICE coefficient36 measured in the linear MNI152 space. All experiments were performed using tensorflow 1.2.0 and keras 2.2.4 on a Titan Xp Nvidia GPU with 12 GB RAM. To train the network, we used the Adam optimizer37 with default parameters during 200 epochs, and we tested different loss functions with multiscale loss weights of 0.1, 0.2 and 0.7 for the low, medium and high-resolution outputs respectively (see Fig. 2). A batch size of one was used in all our experiments.
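
In Keras, this training setup could be expressed as follows (a sketch assuming a model with three outputs ordered low, medium, high resolution, and the generalized_jaccard_loss sketched above):

```python
from keras.optimizers import Adam

model.compile(optimizer=Adam(),                # default parameters
              loss=generalized_jaccard_loss,   # applied to every output
              loss_weights=[0.1, 0.2, 0.7])    # low, medium, high resolution
model.fit(x_train, [y_low, y_mid, y_high], batch_size=1, epochs=200)
```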

Kulaga-Yoskovitz and Winterburn datasets were preprocessed as described in “Image preprocessing”. To increase the size of the training data, we pooled left and right crops, left–right flipping the left crops to generate right-oriented crops (see the sketch below). This yielded 50 right crops for the Kulaga-Yoskovitz dataset and 10 right crops for the Winterburn dataset. Since the size of both datasets is quite small, we used a K-fold cross-validation strategy (K = 5 in both datasets) to increase the relevance of our findings. For the Kulaga-Yoskovitz dataset this left each fold with 40 training images (5 of them used for validation) and 10 test images. For the Winterburn dataset each fold had 8 training images (2 of them used for validation) and 2 test images.
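
The left-right pooling amounts to a single array flip (a sketch assuming the first array axis is the left-right, i.e. sagittal, axis in MNI space):

```python
import numpy as np

def to_right_oriented(crop, is_left):
    # flip left-hippocampus crops along the sagittal axis so that all
    # training samples share the same (right) orientation
    return np.flip(crop, axis=0) if is_left else crop
```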

Analysis of the proposed method

There are many factors that affect the performance of deep learning methods, such as the architecture, the loss function and the data augmentation strategy. In this section we present experiments that show their effects on the proposed method.

The first option we tested was the loss function. We compared 5 different loss functions using the exact same network initialization. In Table 1 the validation DICE of each loss function is compared for both datasets. We included the categorical cross entropy (CCE) in the comparison as it is a loss commonly used in segmentation/classification. As can be noted, CCE performed worse than the dice loss. Curiously, GDL failed, yielding a very low DICE compared with the other losses. JL performed similarly to the dice loss. The proposed GJL was the best performing loss in both datasets and was therefore selected as the loss function of the proposed method.

Table 1 Average DICE in Kulaga-Yoskovitz (first row) and Winterburn (second row) datasets.

To study the impact of the proposed architecture, we ran the proposed network with deep supervision and compared it with the classic UNET. The results of the comparison are shown in Table 2. As can be seen, the proposed architecture was able to improve the results in both datasets.

Table 2 Comparison of our proposed deep supervised UNET vs classic UNET 3D.

It is well-known that in deep learning the amount of training data plays a major role (probably the biggest) in the quality of the network results. Unfortunately, manually labeled hippocampal subfield cases are a rare resource due to the difficulty of generating such data. Data augmentation has traditionally been used to artificially increase the number of training cases, usually by applying random transformations (rotation, scaling, etc.) to the available training data. In this project, we used a combination of different augmentation methods. For the Kulaga-Yoskovitz dataset, we randomly smoothed and sharpened the cropped images to generate low and high-quality images and improve the generalization capabilities of the network. We also used mixup35 to linearly combine inputs and outputs (α = 0.3; see the sketch below). Mixup is a data-agnostic data augmentation method that has been proven beneficial especially when using a small training dataset38. For the Winterburn dataset, we used the same approach but, in addition, we enlarged the training set with automatic segmentations of the Kulaga-Yoskovitz dataset obtained with the HIPS method18, a patch-based multi-atlas label fusion method (using as atlases the training cases of each fold). In Table 3 the results of the proposed method with and without data augmentation are shown. As expected, the data augmentation strategies helped to improve the results in both datasets. The improvement in the Winterburn dataset was larger given the small size of its training set (N = 6).
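
Mixup draws a mixing coefficient from a Beta distribution and forms convex combinations of pairs of training samples and their one-hot labels; a minimal sketch with the α = 0.3 used here:

```python
import numpy as np

def mixup(x1, y1, x2, y2, alpha=0.3):
    # lam ~ Beta(alpha, alpha); the same coefficient mixes image and label
    lam = np.random.beta(alpha, alpha)
    return lam * x1 + (1.0 - lam) * x2, lam * y1 + (1.0 - lam) * y2
```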

Table 3 Data augmentation results.

A last experiment was performed to evaluate the effect of the TTBN technique. The results for both datasets are shown in Table 4. As can be seen, TTBN helped in both datasets, but the improvement in the Winterburn dataset was relatively larger.

Table 4 Training time batch normalization results.

For the final results on both datasets, we computed the DICE of each structure, the average DICE across structures and the whole-hippocampus DICE. Table 5 shows the k-fold cross-validation results for both datasets. An example DeepHIPS result for both protocols is shown in Fig. 3.

Table 5 Mean DICE and standard deviation for each structure segmentation over the Kulaga-Yoskovitz and Winterburn datasets.
Figure 3

Example results of Winterburn and Kulaga-Yoskovitz protocol automatic segmentation using DeepHIPS. Images generated using ITK-SNAP v 3.4.0.

Standard resolution vs high resolution

The proposed method uses high-resolution MR images, but these sequences are not always available in research or clinical environments, and it would be desirable to be able to analyze legacy data. For this reason, we evaluated the proposed method using standard resolution (1 × 1 × 1 mm3) images upsampled to 0.5 × 0.5 × 0.5 mm3 using B-spline interpolation and a super-resolution technique (LASR)28. To do so, we reduced the resolution of the HR images by a factor of 2 and then upsampled them using the two described methods.
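
The B-spline branch of this experiment can be reproduced with standard tooling (a sketch using scipy; the LASR upsampling follows Ref.28 and is not shown):

```python
from scipy.ndimage import zoom

def simulate_standard_resolution(hr_image):
    # hr_image: 0.5 mm HR volume as a numpy array (a placeholder name)
    lowres = zoom(hr_image, 0.5, order=3)   # downsample by a factor of 2 (to 1 mm)
    return zoom(lowres, 2.0, order=3)       # cubic B-spline upsampling back to 0.5 mm
```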

Tables 6 and 7 show the results for both datasets. They confirm that the proposed method can produce competitive results when using standard resolution images. Note that the results using LASR are better than those using B-spline interpolation for both datasets and closely resemble those obtained using the original HR images. This important result shows that the proposed framework can efficiently process usual 1 × 1 × 1 mm3 MR data. Recent advances in deep learning-based super-resolution39 could further reduce the gap between the original HR results and the upsampled standard resolution images. However, this is beyond the scope of this work and will be studied in future research.

Table 6 Winterburn dataset: mean DICE and standard deviation for each structure segmentation using B-spline interpolation and LASR applied to the previously downsampled images. Best results in bold. Results using the HR images are also provided for comparison.
Table 7 Kulaga-Yoskovitz dataset: mean DICE and standard deviation for each structure segmentation using B-spline interpolation and LASR applied to the previously downsampled images. Best results in bold. Results using the HR images are also provided for comparison.

Method comparison

The proposed method was compared with state-of-the-art related methods. Specifically, for the Kulaga-Yoskovitz dataset we compared with the HIPS method18 and a recent deep learning-based method named ResDUnet dedicated to hippocampus subfield segmentation27. In both cases we used the results published in their papers for the comparison. Table 8 shows the results of the comparison; the inter- and intra-rater accuracies are also included for reference. As can be noticed, the proposed method outperformed previous state-of-the-art methods. It is also worth noting that the proposed method surpassed the inter-rater accuracy and got very close to the intra-rater accuracy.

Table 8 Mean DICE and standard deviation for each structure segmentation over the Kulaga-Yoskovitz dataset. Best results in bold.

For the Winterburn dataset, we compared with the HIPS method18, which represents the state of the art on this dataset. Table 9 shows the results of the comparison; the intra-rater accuracy is also included for reference. As can be noticed, the proposed method outperformed the HIPS method by a large margin and got very close to the intra-rater accuracy.

Table 9 Mean DICE in the MNI space and standard deviation for each structure segmentation using high resolution T1w, T2w and Multispectral respectively over the Winterburn dataset. Best results in bold.

Regarding execution time, the proposed network takes around 1 s to segment a new case. The whole DeepHIPS pipeline (including preprocessing) takes around 2 min, while the HIPS method takes around 20 min.

Discussion

In this paper, we have presented a new deep learning-based method for HR hippocampus subfield segmentation that we called DeepHIPS. We have validated the proposed method using two publicly available datasets (Winterburn and Kulaga-Yoskovitz).

Our proposed method first preprocesses the HR T1 and T2 images to improve their quality and locate them in a standard space (MNI152), and finally crops the region of interest to be processed. From the architecture point of view, our model is a 3D UNET variant that uses deep supervision and low-resolution feedback to ease the training process. We found that this variant worked better than the classic UNET.

We have also proposed a novel loss function (GJL) based on the Jaccard similarity index that improves the accuracy of the network, borrowing ideas from the GDL but using linear volume weights instead of quadratic ones. We further improved the results using classical data augmentation techniques, such as image mirroring and intensity transformations, as well as more modern ones such as mixup.

Finally, we improved the results of the network at test time by running the Batch Normalization layers in training mode instead of test mode. We found that when using small batch sizes (N = 1 in our case) batch normalization layers did not behave properly due to the use of the mean and standard deviation stored during training. Using the current sample statistics systematically improved the results in all our experiments despite the simplicity of the approach (especially in the Winterburn dataset, with an improvement of nearly 9%). We called this strategy Training Time Batch Normalization (TTBN).

We compared the results of the proposed method with state-of-the-art methods on two datasets. On the Kulaga-Yoskovitz dataset we compared with the HIPS method and a recent deep learning-based method named ResDUnet. The proposed method improved the results of both methods for all subfields and got closer to the intra-rater accuracy, which can be considered the upper bound of the method. For the Winterburn dataset, we compared with the HIPS method and, again, the proposed method improved the results for all subfields and the overall accuracy got very close to the intra-rater accuracy.

We also studied the accuracy of the proposed method using standard resolution images (1 × 1 × 1 mm3) upsampled to HR (0.5 × 0.5 × 0.5 mm3) with a super-resolution method (LASR). Although the accuracy slightly dropped compared to the HR results, we found it still very competitive, making the use of legacy data possible.

We are aware that the training libraries of the proposed method are too small to ensure good generalization (especially in the case of Winterburn), and our future efforts will be directed at increasing the size of these libraries by manually labeling new cases and using semi-supervised approaches to automatically extend the training dataset.

From an efficiency point of view, the proposed method is not only more accurate but also more efficient than the previous state of the art (HIPS), reducing the total execution time by a factor of 10.

Conclusion

In this work, we have presented a new method for HR hippocampus subfield segmentation based on a deep learning approach and validated it with two publicly available datasets (Winterburn and Kulaga-Yoskovitz), showing competitive results in both accuracy and efficiency. We plan to make the DeepHIPS pipeline fully accessible through the new release of our online image analysis service volbrain (http://volbrain.upv.es) so researchers around the world can use it without requiring complex pipeline installations or expensive hardware (GPUs, etc.).