AI-assisted automatic MRI-based tongue volume evaluation in motor neuron disease (MND)

Vernikouskaya, Ina; Müller, Hans-Peter; Ludolph, Albert C.; Kassubek, Jan; Rasche, Volker

doi:10.1007/s11548-024-03099-x

AI-assisted automatic MRI-based tongue volume evaluation in motor neuron disease (MND)

Original Article
Open access
Published: 27 March 2024

(2024)
Cite this article

Download PDF

You have full access to this open access article

International Journal of Computer Assisted Radiology and Surgery Aims and scope Submit manuscript

AI-assisted automatic MRI-based tongue volume evaluation in motor neuron disease (MND)

Download PDF

285 Accesses
Explore all metrics

Abstract

Purpose

Motor neuron disease (MND) causes damage to the upper and lower motor neurons including the motor cranial nerves, the latter resulting in bulbar involvement with atrophy of the tongue muscle. To measure tongue atrophy, an operator independent automatic segmentation of the tongue is crucial. The aim of this study was to apply convolutional neural network (CNN) to MRI data in order to determine the volume of the tongue.

Methods

A single triplanar CNN of U-Net architecture trained on axial, coronal, and sagittal planes was used for the segmentation of the tongue in MRI scans of the head. The 3D volumes were processed slice-wise across the three orientations and the predictions were merged using different voting strategies. This approach was developed using MRI datasets from 20 patients with ‘classical’ spinal amyotrophic lateral sclerosis (ALS) and 20 healthy controls and, in a pilot study, applied to the tongue volume quantification to 19 controls and 19 ALS patients with the variant progressive bulbar palsy (PBP).

Results

Consensus models with softmax averaging and majority voting achieved highest segmentation accuracy and outperformed predictions on single orientations and consensus models with union and unanimous voting. At the group level, reduction in tongue volume was not observed in classical spinal ALS, but was significant in the PBP group, as compared to controls.

Conclusion

Utilizing single U-Net trained on three orthogonal orientations with consequent merging of respective orientations in an optimized consensus model reduces the number of erroneous detections and improves the segmentation of the tongue. The CNN-based automatic segmentation allows for accurate quantification of the tongue volumes in all subjects. The application to the ALS variant PBP showed significant reduction of the tongue volume in these patients and opens the way for unbiased future longitudinal studies in diseases affecting tongue volume.

Deficits in tongue motor control are linked to microstructural brain damage in multiple sclerosis: a pilot study

Article Open access 08 October 2015

Toward More Accessible Fully Automated 3D Volumetric MRI Decision Trees for the Differential Diagnosis of Multiple System Atrophy, Related Disorders, and Age-Matched Healthy Subjects

Article Open access 26 September 2022

Deep learning based diagnosis of Parkinson’s Disease using diffusion magnetic resonance imaging

Article 14 March 2022

Introduction

Amyotrophic lateral sclerosis (ALS), the most common adult motor neuron disease (MND), is characterized by a progressive loss of motor neurons that leads to progressive pareses, respiratory failure, and death mostly within 3 to 5 years after its onset [1]. Bulbar dysfunction, characterized by tongue wasting and fasciculation, accompanied by flaccid dysarthria and dysphagia, is emerging in the vast majority of patients during the advanced phases of the disease [1, 2], as one major factor that determines a patient’s prognosis [3]. Prognostic biomarkers of ALS are needed, particularly of bulbar involvement, which is one of the key determinants of long-term prognosis and survival in this disorder [4]. Progressive bulbar palsy (PBP) is an ALS variant in which patients show an isolated bulbar onset with a progressive affection of the lower cranial nerves including tongue atrophy [5] before they develop spinal symptoms of MND.

Quantitative noninvasive imaging-based assessment of the severity of tongue volume loss requires conduction of longitudinal studies measuring several tongues features with specialized instruments including magnetic resonance imaging (MRI) or high-resolution ultrasound. Several case reports and few systematic imaging studies have suggested structural tongue measures in the course of ALS [3, 6, 7]. Specifically, T1 sequences were used to assess atrophy, fibrosis and fatty degeneration, and a previous large-scale study suggested that in vivo sonography and region-of-interest (ROI)-based MRI tongue measures could aid as biomarkers to reflect bulbar and motor function impairment in ALS [7]. Although it has been previously shown by ultrasound that the tongue thickness in a group of 18 ALS was lower than that of healthy controls [3], tongue size and shape can significantly vary across subjects and longitudinal studies need to be performed to investigate the tongue muscle atrophy in diseases like ALS.

In order to access measurements like volume, thickness, or shape of a given structure, the anatomy needs to be segmented, which is a time-consuming and error-prone process when performed manually. To reduce the time and subjectivity of medical segmentations and consequently improve reliability, automatic segmentation methods based on deep convolutional neural networks (CNNs) were used [8]. It has been shown by the authors of the nnU-Net, i.e., a framework relying on 2D and 3D U-Nets that automatically configure themselves [9] that plain end-to-end CNNs with U-Net like architectures perform exceptionally well in most biomedical image segmentation tasks.

After several studies based on CNNs for MRI and ultrasound images of the vocal tract for understanding speech production [10,11,12,13,14], more studies on using semantic segmentation based on deep CNNs for tongue segmentation have been recently conducted, and the effect is better than most of the traditional image segmentation methods [15, 16]. However, there are still limitations in those methods, e.g., including image preprocessing such as image enhancement [17] making the whole segmentation process more complex or brightness discrimination [18] reducing the ability of generalization as a deep learning-based model.

The aim of the present study was to adapt the CNN model of U-Net architecture to MRI data of the tongue (which are included in routinely acquired volume-rendering scans of the human head) with the final goal to obtain an automated pipeline for determination of tongue atrophy in neurologic diseases. In our approach, the T1-weighted MRI images from the 3D volumes are processed slice-wise across the axial, sagittal, and coronal planes with the CNN of U-Net like architecture, and the predictions from the three orthogonal orientations are merged using different voting strategies integrating more 3D information into the 2D model. Compared to the original triplanar U-Net approach where three orientation-specific U-Nets are trained [19], we utilize a single U-Net which is trained on axial, sagittal, and coronal slices [20], allowing to share common features across orientations. Furthermore, we investigate the sensitivity of different voting strategies for merging the predictions from different orientations. We developed our approach using 40 datasets available with reference segmentation of the tongue (20 from healthy controls and 20 from MND patients diagnosed with ALS) and applied it in a group comparison study comprising further 19 controls and 19 MND patients diagnosed with PBP.

Methods

MRI dataset

Seventy-eight T1-weighted whole head MRI datasets acquired on a 1.5 T MRI scanner (Symphony, Siemens Medical, Erlangen, Germany) with a T1-weighted 3D MPRAGE sequence as a standardized clinical MRI examination protocol for patients with MND were available for this study. Data were obtained from the MRI database of the Department of Neurology, University of Ulm, Germany. The respective ethics application includes the recording and the analysis of MRI data, irrespective of the analysis technique; no additional MRI scans have been performed for the current study. The field-of-view of the T1-weighted images of the head usually also covers the tongue so that these images could be used for segmentation of the tongue. T1-weighted scans that did not cover the tongue were not used in this study. In addition, motion artifacts due to tongue movement could not be excluded although subjects were not instructed to keep the tongue still. Figure 1 provides overview of all available datasets with the corresponding demographic data for each group.

Forty datasets including 20 healthy subjects without any neurologic/psychiatric disease or other medical condition and 20 patients with sporadic ALS who were diagnosed with definite, probable, or possible ALS according to revised E1 Escorial criteria [22] recruited in the outpatient and inpatient settings of the Department of Neurology, University of Ulm, Germany were available with the corresponding reference segmentation of the tongue. For methodological development, the data samples from controls and ALS patients were randomly split into training and test datasets at a ratio of 70%/30% at subject level, resulting in 14 training and 6 test datasets from each group.

Further, 19 patients diagnosed with PBP, who met the diagnostic criteria for PBP, and 19 controls were investigated in the group comparison study. The MRI data were part of a previous study with a different research focus [23]. All PBP patients showed an isolated bulbar onset with a progressive affection of the lower cranial nerves causing dysarthria and/or dysphagia, tongue wasting and fasciculation before they developed spinal MND symptoms. To be eligible, subjects had to fulfill the following criteria: no family history of MND, no clinical diagnosis of frontotemporal dementia, no other major systemic, psychiatric or neurologic illnesses, no history of substance abuse. Further mandatory criteria for inclusion were negative tests for other neuromuscular diseases and for infections of the central nervous system, and routine MRI scans excluded any brain abnormalities indicating a different etiology of the clinical symptoms. These data were available without the corresponding ground truth labels.

Preprocessing and generation of ground truth segmentations

For image preprocessing and creating a ground truth label the software package Tensor Imaging and Fiber Tracking (TIFT) was used, expanded by a volumetric extension package [24]. In the preprocessing pipeline, original 3D MPRAGE volumes were first rescaled to a 256 × 256 × 256 matrix with an isotropic resolution of 1.0 × 1.0 × 1.0 mm³. After rescaling, the data were reoriented with the palatal tip in the center of the matrix and with the nose pointing to the right in the sagittal image. Data were intensity-normalized with z-score normalization based on the mean and standard deviation of the means of all subjects who participated in the study. The mean intensity value for the individual subjects was determined in an area defined by a matrix of 128 × 128 × 128 voxels with the tongue as the center (note: this area was the same for all subjects). A visualization is provided in Fig. 2. The segmentation was then carried out in a rescaled matrix of 256 × 256 × 256 voxels with a resolution of 0.5 × 0.5 × 0.5 mm³. The segmentation of the ground truth data of the tongue was performed manually using a 3D intensity threshold-based marking tool. Data were displayed in parallel in axial, sagittal, and coronal views and the tongue was manually marked (Fig. 2, right) by a 3D painting/drawing tool implemented in the TIFT software platform by a trained operator (HPM) and controlled by a medical expert (JK). The number of slices covering the tongue varied between subjects and orientations with an average number of 120 slices in axial, 95 slices in sagittal, and 143 slices in coronal orientation.

Model training

For training we used U-Net model implemented from scratch. In contrast to original architecture with five convolutional blocks on each branch, the number of feature channels in the contracting path was reduced to 32, 64, 128, 256, and 512, respectively. To increase the network generalization and reduce overfitting, a dropout layer was applied after repeated 3 × 3 convolutional layers with ReLU activation in each downsampling step. The modified U-Net architecture with the corresponding layers’ settings is shown in Fig. 3.

The training was performed along 50 epochs using early stopping where the training was stopped when the validation loss was observed to have ceased improving for 10 consecutive epochs with a batch size of 16 images per pass. The loss function was based on the categorical cross entropy and Adaptive Moment Estimation (Adam) with the learning rate of 10^–4 and remaining hyperparameters kept with their default Keras values was used as the optimizer. Mean Intersection over Union (IoU) was used as metric to evaluate the model. Fivefold cross-validation strategy was applied for training, where 20% of available data was hold-out at each fold as validation set.

Inference

For inference, the weighted ensemble average of all fivefold models with the fixed weights, i.e., the validation IoU at each fold, was used for 2D slice-by-slice segmentation of axial, sagittal, and coronal images, respectively (Fig. 4).

Resulting axial, sagittal, and coronal predictions were merged using four different voting strategies to produce the final segmentation mask. In addition to softmax averaging with equal weights as a baseline approach [19], we compared three different voting strategies in order to find the optimal balance of recall and precision. For these approaches, we first thresholded the softmax scores of each of the three orientations to obtain hard predictions. Then, the following strategies were applied: the exact segmentation of a tongue was defined as the union of the corresponding positive voxels across (a) at least one orientation prediction (union); (b) at least two orientation predictions (majority); (c) all orientation predictions (unanimous voting).

Evaluation metrics

To evaluate the introduced approaches, performance metrics such as precision, recall, and the principal segmentation metric, i.e., Dice score which is equivalent to F1 score, were calculated via the true positives (TP), false positives (FP), and false negatives (FN). Further, ground truth tongue volumes and tongue volumes predicted by different approaches were compared applying a paired Student’s t test according to Shapiro–Wilk test for normality (p-value < 0.05 was assumed statistically significant). Finally, we accessed the differences between tongue volumes in the control and the PBP group applying unpaired t test or Mann–Whitney U rank test as appropriate depending on the results of Shapiro–Wilk test.

Results

While having the highest number of TP among all investigated prediction strategies, the most inclusive strategy, i.e., union, achieved the best recall (0.93 on average) due to significantly lower number of FN, but a very low precision (0.78) given by the large number of FP, resulting in an overall Dice score of 0.85. The Dice score for very restrictive, unanimous, voting was similar (0.85) having the highest precision of 0.92 (due to the lowest number of FP), but the lowest recall of 0.80 (due to the highest number of FN). Softmax averaging and majority voting performed best in terms of segmentation accuracy (Dice score of 0.88) due to similarly high precision (0.88) and recall (0.88), outperforming two other merging strategies and improving predictions on single orientations. These results are summarized in Table 1.

Table 1 True positives (TP), true negatives (TN), false positives (FP), and false negatives (FN), as well as calculated performance metrics (mean precision, mean recall, and mean F1 score) achieved with predictions on single axial, sagittal, and coronal orientations and after application of consensus models with different merging strategies (softmax averaging, union, majority, and anonymous voting) calculated on 12 test datasets consisting of 6 controls and 6 ALS patients

Full size table

Qualitative results from a single MRI slice of ALS patient confirmed results of the confusion matrices and are summarized in Fig. 5. All models provided similar number of true predictions (magenta overlay in Fig. 5). While predicting less false positives (blue overlay) than e.g., the axial model only or the consensus model with union voting and missing very little number of positives (yellow overlay) as compared to e.g., the consensus model with unanimous voting, consensus models with softmax averaging and majority voting performed best.

Differences in volume quantification between each approach and ground truth in the test dataset subdivided into control and ALS subgroups (6 subjects each) are demonstrated in Fig. 6a. Very good accordance was observed between ground truth tongue volumes and predictions in both ALS and controls with consensus models with softmax averaging and majority voting, which obviously outperformed predictions on individual orientations. Highly significant overestimation of tongue volumes in comparison to ground truth in both groups was observed for the consensus model with union voting and underestimation with unanimous voting, especially in the ALS group. The differences in tongue volumes between controls’ and ALS patients’ test groups with either approach were relatively small and statistically not significant.

The analysis at the group level using the best performing consensus models with softmax averaging and majority voting revealed a significant reduction in the quantified tongue volume at p = 0.002 in PBP patients (91 ± 16) versus controls (106 ± 8), even in this rather low sized data sample of 19 subjects per group (Fig. 6b).

Discussion

Accurate delineation of the tongue from low-contrast medical MR images of soft tissue remains a challenge, due to the lack of definitive boundary features separating many of the adjacent soft tissue [25]. Different from the conventional segmentation tasks in nature scene, tongue segmentation is more challenging because of the following issues: (1) large variations of tongue appearance for different patients while higher precision requirement; (2) data imbalance, e.g., small parts of foreground region (tongue body) compared with the background region; and (3) hard sample mining, e.g., lip pixels as the hard samples is hard to be segmented from tongue pixels because the similar appearances and close touch between them [15]. Recent studies demonstrated the applicability of AI methods in tongue segmentation [14, 26]. MRI is a useful modality for the noninvasive quantification of the tongue volume for longitudinal assessments of the muscle atrophy associated with the disease progression e.g., in patients with MND [7] who often present with tongue atrophy as a bulbar sign. Thus, an automatic method providing segmentation accuracy of the tongue comparable to that of an expert can be highly beneficial to reduce manual reading time and efforts.

In this methodological pilot study, we approach this challenge presenting a CNN-based method using single U-Net trained on three orthogonal orientations with consequent merging of respective orientations in a consensus model using different voting strategies. Training of a triplanar network as opposed to training of three orientation-specific networks requires that all slices have identical dimensions which we have ensured by resampling the volume to a regular cube in the preprocessing step. Further, we have observed that single orientation predictions tend to contain many erroneous detections, hence we applied different merging strategies of individual orientations in order to potentially reduce both the numbers of false positives and false negatives and to increase the segmentation accuracy. The most restrictive unanimous merging strategy implying that only pixels that had been confirmed in all three orientations are accepted has been previously suggested to be a key factor for the good performance of lesion segmentation [20]. In our study, unanimous voting strategy achieved the highest precision, i.e., most of the pixels predicted as tongue were true predictions. However, this precision gain was outweighed by the loss in recall showing that this approach missed pixels from the tongue. The opposite approach with union of all orientations achieved very high recall at the contrary, but obviously is limited by the low precision. As a result, both approaches yield significant deviation in segmented tongue volume compared to unbiased ground truth. Softmax averaging of predicted probabilities (which is equivalent to merging with majority voting in case of equal weights for the three respective orientations) performed best, balancing precision and recall better than other models and achieved an average Dice coefficient of 0.88. Very good accordance between ground truth tongue volume and tongue volume provided by consensus models with softmax averaging and major voting was achieved in our test dataset, outperforming all other strategies at the group level.

In our datasets from ALS patients with ‘classical’ spinal manifestation on the one hand and patients with the ALS variant PBP with prominent bulbar syndrome including hypoglossus nerve involvement with consecutive tongue affectation on the other hand, plausible results could be obtained. Using the introduced automatic approach, the assessments in the ‘classical’ spinal ALS showed no significant results versus controls with respect to tongue volumes. These data are in accordance with previous studies, including a study in 206 ALS patients in which the MRI analyses of the tongue for different parameters including sagittal tongue area resulted in only small effect sizes [7]. Obviously, the variability of the tongue involvement as one part of the bulbar syndrome is high in ‘classical’ spinal ALS. In contrast, a highly significant difference in tongue volume was obtained between PBP patients and healthy controls in our study, even though high variability of tongue volumes was also observed at cross-sectional level. In this group with prominent bulbar symptoms, tongue involvement is a major element of the clinical presentation.

Conclusions

A CNN model of U-Net like architecture was successfully adapted for segmentation of the tongue from routinely acquired MRI scans of the human head. The training on three orthogonal orientations with consequent merging of respective orientations in a consensus model allowed for an automated determination of atrophy of the tongue at the group level. That way, the added value of this study is the future use of the developed pipeline in longitudinal clinical studies for the detection of tongue atrophy in neurologic diseases. To this end, not only larger patient groups have to be investigated, but correlation analyses have to follow with clinical (and perhaps other technical) markers of disease severity and longitudinal progression. More specifically, the structure/volume of the tongue will have to be correlated with tongue function assessments including tongue movement ability and swallowing function.

References

Masrori P, Van Damme P (2020) Amyotrophic lateral sclerosis: a clinical review. Eur J Neurol 27(10):1918–1929
Article CAS PubMed PubMed Central Google Scholar
Romero-Gangonells E, Virgili-Casas MN, Dominguez-Rubio R, Povedano M, Pérez-Saborit N, Calvo-Malvar N, Barcelo MA (2021) Evaluation of dysphagia in motor neuron disease. review of available diagnostic tools and new perspectives. Dysphagia 36(4):558–573
Article PubMed Google Scholar
Nakamori M, Hosomi N, Takaki S, Oda M, Hiraoka A, Yoshikawa M, Matsushima H, Ochi K, Tsuga K, Maruyama H, Izumi Y, Matsumoto M (2016) Tongue thickness evaluation using ultrasonography can predict swallowing function in amyotrophic lateral sclerosis patients. Clin Neurophysiol 127(2):1669–1674
Article PubMed Google Scholar
Rosenbohm A, Peter RS, Erhardt S, Lulé D, Rothenbacher D, Ludolph AC, Nagel G, ALS Registry Study Group (2017) Epidemiology of amyotrophic lateral sclerosis in Southern Germany. J Neurol 264(4):749–757
Article PubMed Google Scholar
Chiò A, Calvo A, Moglia C, Mazzini L, Mora G, PARALS study group (2011) Phenotypic heterogeneity of amyotrophic lateral sclerosis: a population based study. J Neurol Neurosurg Psychiatry 82(7):740–746
Article PubMed Google Scholar
Lee E, Xing F, Ahn S, Reese TG, Wang R, Green JR, Atassi N, Wedeen WJ, El Fakhri G, Woo J (2018) Magnetic resonance imaging based anatomical assessment of tongue impairment due to amyotrophic lateral sclerosis: a preliminary study. J Acoust Soc Am 143(4):EL248
Article PubMed PubMed Central Google Scholar
Hensiek N, Schreiber F, Wimmer T, Kaufmann J, Machts J, Fahlbusch L, Garz C, Vogt S, Prudlo J, Dengler R, Petri S, Nestor PJ, Vielhaber S, Schreiber S (2020) Sonographic and 3T-MRI-based evaluation of the tongue in ALS. NeuroImage Clin 26:102233
Article PubMed PubMed Central Google Scholar
Rodrigues L, Rezende TJR, Wertheimer G, Santos Y, França M, Rittner L (2022) A benchmark for hypothalamus segmentation on T1-weighted MR images. Neuroimage 264:119741
Article PubMed Google Scholar
Isensee F, Jaeger PF, Kohl SAA, Petersen J, Maier-Hein KH (2021) nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation. Nat Methods 18(2):203–211
Article CAS PubMed Google Scholar
Valliappan C, Mannem R, Ghosh PK (2018) Air-tissue boundary segmentation in real-time magnetic resonance imaging video using semantic segmentation with fully convolutional networks. Proc Interspeech 2018:3132–3136
Google Scholar
Somandepalli K, Toutios A, Narayanan SS (2017) Semantic edge detection for tracking vocal tract air-tissue boundaries in real-time magnetic resonance images. Proc Interspeech 2017:631–635
Article Google Scholar
Eslami M, Neuschaefer-Rube C, Serrurier A (2020) Automatic vocal tract landmark localization from midsagittal MRI data. Sci Rep 10(1):1468
Article CAS PubMed PubMed Central Google Scholar
Zhu J, Styler W, Calloway IC (2018) Automatic tongue contour extraction in ultrasound images with convolutional neural networks. J Acoust Soc Am 143(3):1966
Article Google Scholar
Eslami M, Neuschaefer-Rube C, Serrurier A (2019) Automatic vocal tract segmentation based on conditional generative adversarial neural network. In: Studientexte Zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung, pp 263–270. http://www.essv.de/paper.php?id=90
Zhou C, Fan H, Li Z (2019) Tonguenet: accurate localization and segmentation for tongue images using deep neural networks. IEEE Access 7:148779–148789
Article Google Scholar
Li J, Zhang Z, Zhu X, Zhao Y, Ma Y, Zang J, Li B, Cao X, Xue C (2022) Automatic classification framework of tongue feature based on convolutional neural networks. Micromachines 13(4):501
Article CAS PubMed PubMed Central Google Scholar
Li J, Xu B, Ban X, Tai P, Ma B (2017) A tongue image segmentation method based on enhanced HSV convolutional neural network. In: Luo Y (ed) Cooperative design, visualization, and engineering. Lecture Notes in Computer Science. Springer International Publishing, Cham, pp 252–260
Chapter Google Scholar
Qu P, Zhang H, Zhuo L, Zhang J, Chen G (2017) Automatic tongue image segmentation for traditional Chinese medicine using deep neural network. In: Huang DS, Bevilacqua V, Premaratne P, Gupta P (eds) Intelligent computing theories and application. Lecture Notes in Computer Science. Springer International Publishing, Cham, pp 247–259
Google Scholar
Guha Roy A, Conjeti S, Navab N, Wachinger C (2019) QuickNAT: A fully convolutional network for quick and accurate segmentation of neuroanatomy. Neuroimage 186:713–727
Article PubMed Google Scholar
Hitziger S, Ling WX, Fritz T, D’Albis T, Lemke A, Grilo J (2022) Triplanar U-Net with lesion-wise voting for the segmentation of new lesions on longitudinal MRI studies. Front Neurosci 16:964250. https://doi.org/10.3389/fnins.2022.964250
Article PubMed PubMed Central Google Scholar
Cedarbaum JM, Stambler N, Malta E, Fuller C, Hilt D, Thurmond B, Nakanishi A (1999) The ALSFRS-R: a revised ALS functional rating scale that incorporates assessments of respiratory function. BDNF ALS Study Group (Phase III). J Neurol Sci 169(1–2):13–21
Article CAS PubMed Google Scholar
Ludolph A, Drory V, Hardiman O, Nakano I, Ravits J, Robberecht W, Shefner J, WFN Research Group On ALS/MND (2015) A revision of the El Escorial criteria-2015. Amyotroph Lateral Scler Front Degener 16(5–6):291–292
Article Google Scholar
Müller HP, Gorges M, Del Tredici K, Ludolph AC, Kassubek J (2019) The same cortico-efferent tract involvement in progressive bulbar palsy and in “classical” ALS: a tract of interest-based MRI study. NeuroImage Clin 24:101979. https://doi.org/10.1016/j.nicl.2019.101979
Article PubMed PubMed Central Google Scholar
Vernikouskaya I, Müller HP, Roselli F, Ludolph AC, Kassubek J, Rasche V (2023) AI-assisted quantification of hypothalamic atrophy in amyotrophic lateral sclerosis by convolutional neural network-based automatic segmentation. Sci Rep 13(1):21505. https://doi.org/10.1038/s41598-023-48649-6
Article CAS PubMed PubMed Central Google Scholar
Harandi NM, Abugharbieh R, Fels S (2015) 3D segmentation of the tongue in MRI: a minimally interactive model-based approach. Comput Methods Biomech Biomed Eng Imaging Vis 3(4):178–188
Article Google Scholar
Isaieva K, Laprie Y, Turpault N, Houssard A, Felblinger J, Vuissoz PA (2020) Automatic tongue delineation from MRI images with a convolutional neural network approach. Appl Artif Intell 34(14):1115–1123. https://doi.org/10.1080/08839514.2020.1824090
Article Google Scholar

Download references

Acknowledgements

The authors would like to thank the Ulm University Center for Translational Imaging MoMAN for its support.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Jan Kassubek and Volker Rasche have shared senior authorship.

Authors and Affiliations

Department of Internal Medicine II, Ulm University Medical Center, Albert-Einstein-Allee 23, 89081, Ulm, Germany
Ina Vernikouskaya & Volker Rasche
Department of Neurology, University of Ulm, Ulm, Germany
Hans-Peter Müller, Albert C. Ludolph & Jan Kassubek
German Center for Neurodegenerative Diseases (DZNE), Ulm, Germany
Albert C. Ludolph & Jan Kassubek
Core Facility Small Animal MRI, University of Ulm, Ulm, Germany
Volker Rasche

Authors

Ina Vernikouskaya
View author publications
You can also search for this author in PubMed Google Scholar
Hans-Peter Müller
View author publications
You can also search for this author in PubMed Google Scholar
Albert C. Ludolph
View author publications
You can also search for this author in PubMed Google Scholar
Jan Kassubek
View author publications
You can also search for this author in PubMed Google Scholar
Volker Rasche
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ina Vernikouskaya.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Ethical approval

No additional MRI scans have been performed for the current study. The study including the recording and the analysis of MRI data has been approved by the Ethics Committee of the University of Ulm (references #19/12 and #20/12) in accordance with the ethical standards laid down in the 1964 Declaration of Helsinki and its later amendments. Written informed consent was obtained from all individual participants included in the study. Previous studies on the analyses of MRI data have already been performed [23].

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Vernikouskaya, I., Müller, HP., Ludolph, A.C. et al. AI-assisted automatic MRI-based tongue volume evaluation in motor neuron disease (MND). Int J CARS (2024). https://doi.org/10.1007/s11548-024-03099-x

Download citation

Received: 22 December 2023
Accepted: 04 March 2024
Published: 27 March 2024
DOI: https://doi.org/10.1007/s11548-024-03099-x

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

AI-assisted automatic MRI-based tongue volume evaluation in motor neuron disease (MND)