Background & Summary

The air-filled middle ear cavity consists of the tympanic membrane (TM) and the ossicular chain that connects the TM to the inner ear. Functionally, it matches the impedance of air to that of the fluid-filled inner ear1. The function of the middle ear can be disrupted by a variety of conditions such as acute or chronic otitis media or trauma. Pathophysiologically, these conditions impair sound transmission through perforation of the TM, fixation or disruption of the ossicular chain, or middle ear effusion. Patients perceive this as conductive hearing loss. Current diagnostic modalities, including otoscopy, audiometry and tympanometry, each focus on a single aspect of the pathology: otoscopy provides a visual assessment of the TM, audiometry evaluates the frequency-dependent level of hearing, and tympanometry only assesses the pressure-dependent compliance of the TM.

As an innovative imaging technology, endoscopic optical coherence tomography (OCT)2,3,4 enables the assessment of both the morphology and function of the middle ear in vivo through the non-invasive acquisition of depth-resolved, high-resolution images. In recent years, several groups have therefore developed promising solutions towards in vivo middle ear diagnostics5,6,7. Nevertheless, intrinsic limitations of OCT, e.g. the loss of backscattered light intensity over tissue depth as well as the cumulative effect of preceding structures, reduce the signal quality of target structures that lie further away from the endoscopic probe, e.g. the stapes. Additionally, OCT volumetric data are usually noisy and often difficult to interpret, especially regarding the identification of deeper middle ear structures such as the incus and stapes (see Fig. 1). As a flourishing technique, deep learning facilitates medical image analysis tasks, e.g. segmentation8 and registration9. Applying machine learning to middle ear diagnostics is therefore promising, because it has the potential to simplify the classification of middle ear diseases. Nevertheless, the current bottleneck for the development and application of deep neural networks in middle ear diagnostics is the scarcity of publicly available OCT datasets.

Fig. 1
figure 1

Data acquisition system and data examples. (a) Endoscopic OCT device for middle ear data acquisition [adapted from Ref. 2, published under CC-BY 4.0 license]. (b) Example B-scans of the acquired image volume combined with the corresponding video image. The tympanic membrane is usually fully visible, but the ossicles, including malleus, incus, and stapes, are noisy and only partially visible; * marks the glass-air interface artifact of the endoscope. (c) The translucent 3D model is reconstructed from the OCT segmentation; the points in various colors are the sparse landmarks delineating the salient shape features of each structure. (d) Segmentation example of a B-scan: tympanic membrane, malleus, incus, stapes, and cochlear promontory are segmented and overlaid on the image slice. Scale bars on the bottom right in (b) and (d) correspond to 1 mm in air.

In this paper, we introduce the Dresden in vivo OCT Dataset of the Middle Ear, which features 43 OCT image volumes from healthy and pathological middle ears of 29 subjects (see Fig. 2). Five essential middle ear structures are segmented, including tympanic membrane, malleus, incus, stapes and cochlear promontory. Besides the segmentations, sparse landmarks depicting salient anatomical features are provided for evaluating algorithms for tasks such as detection and registration. Since voxel-wise annotation is a time-consuming task, even for an experienced clinician, we capitalized on the fact that structures like the tympanic membrane are well captured by the OCT volume and that the morphological deviation of such structures between healthy ears is slight.

Fig. 2: Overview of the comprehensive, segmented OCT dataset and corresponding video images, consisting of both healthy and pathological ear data covering six pathology types: sclerosis, retraction, cholesteatoma, perforation, otitis media and reconstruction.
figure 2

Pathological samples cover an extensive variety of middle ear morphology and topology. Scale bars correspond to 1 mm in air.

To this end, the Human-in-the-Loop10,11,12 (HITL) approach has been proposed and proven to significantly reduce annotation effort while yielding promising results. Following the HITL concept, we iteratively trained a segmentation neural network, namely nnUnet8, with an expert correcting the predictions of nnUnet at each iteration. Additionally, we included more pathological samples over the iterations, which vary considerably in morphology and are difficult for the network to segment, so that the learning challenges were reasonably distributed across the steps. At the end of the HITL process, the network was fully trained, is capable of segmenting OCT images from both healthy and pathological middle ears, and can be used as a pre-trained model for images from the same and other modalities. In this way, we alleviated the heavy annotation workload and included more samples with varied morphology in the same amount of time. The combination of the results from the two human raters and the trained neural network rater was checked by the expert and became the final output.

Methods

This dataset consists of 43 OCT image volumes from both healthy and pathological middle ears (see Table 1). For each image sample, the semantic segmentation of five anatomical structures, including tympanic membrane, malleus, incus, stapes, and promontory, is provided. In addition, sparse landmarks describing the shape and outline of the segmented structures are marked. From these, sparse point correspondences can be retrieved, which support performing or evaluating algorithms for various tasks, e.g. multi-modal image fusion.

Table 1 Sample distributions.

Image Acquisition

The OCT volumes were collected between November 2022 and August 2023 during daily clinical diagnostics at the University Hospital Carl Gustav Carus Dresden. The subjects comprise two main groups, healthy volunteers and patients, with ages ranging from 22 to 66 years. This study is covered by the approval of the local Institutional Review Board (IRB00001473) at the TU Dresden (EK 252062017). All patients provided written informed consent to data acquisition, scientific analyses and sharing. The data were anonymized to comply with ethical standards and the European Union’s General Data Protection Regulation.

The image acquisition was performed using a custom-built endoscopic OCT system based on the system of Kirsten et al.2 (see Fig. 1) with adaptations as described in Golde et al.13. A swept-wavelength laser source (SL132120, MEMS-VCSEL, Thorlabs) operates at a sweep rate of 200 kHz with a center wavelength of 1300 nm and a wavelength sweep range of 100 nm. The sample arm’s endoscopic probe comprises a collimator, two galvanometer scanners for beam guidance, a dichroic mirror for additional visual imaging, and a lens setup featuring GRIN rod lenses. This configuration provides a working distance of 10 mm and an image depth range of approximately 8 mm, corresponding to 1024 pixels and providing an axial resolution of around 15 μm. With the GRIN endoscope, most of the middle ear is accessed by scanning the proximal surface of the GRIN optics with 500 × 500 A-scans, of which approximately 450 A-scans in each lateral direction cover an angular field of view (FOV) of approximately ± 30°. This spans a FOV of around 10 mm at the working distance and thus yields an approximate lateral resolution of 45 μm. Due to the imaging geometry, the acquired data show a fan-shape distortion as visualized in Ref. 4. Note that, to preserve the original information content, the distorted images are stored and act as the target of annotation, rather than correcting the fan-shape distortion by geometrically rescaling the volumes using interpolation. Nevertheless, distortion correction can be applied to the data with the provided code, yielding an isotropic spatial sampling of 20 μm in each direction.
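As an illustration of such a correction, the following minimal Python sketch resamples a fan-distorted volume onto an isotropic 20 μm Cartesian grid by inverse mapping. The scan geometry assumed here (a single pivot point, scan angles linear in the A-scan index over the full ± 30° range) is a simplification, and the file name is a placeholder; for faithful results, the code provided with the dataset should be used.

```python
# Minimal sketch of fan-shape distortion correction by inverse mapping onto an
# isotropic Cartesian grid. The geometry (single pivot, angle linear in the
# A-scan index over +/- 30 deg) is a simplifying assumption, not a calibration.
import numpy as np
import nrrd
from scipy.ndimage import map_coordinates

vol, header = nrrd.read("sample.nrrd")      # assumed axes: (x_idx, y_idx, depth)
n_x, n_y, n_z = vol.shape
max_angle = np.deg2rad(30.0)                # +/- 30 deg angular FOV
dr_mm = 8.0 / n_z                           # ~8 mm depth range over n_z pixels
iso_mm = 0.020                              # 20 um isotropic target sampling

# Cartesian target grid in mm; z points along the probe axis
x = np.arange(-5.0, 5.0, iso_mm, dtype=np.float32)
z = np.arange(0.0, 8.0, iso_mm, dtype=np.float32)
X, Y, Z = np.meshgrid(x, x, z, indexing="ij")

# Inverse map: Cartesian voxel -> (angle_x, angle_y, radius) -> array indices
R = np.sqrt(X**2 + Y**2 + Z**2)
ix = (np.arctan2(X, Z) / max_angle + 1.0) / 2.0 * (n_x - 1)
iy = (np.arctan2(Y, Z) / max_angle + 1.0) / 2.0 * (n_y - 1)
iz = R / dr_mm
corrected = map_coordinates(vol, [ix, iy, iz], order=1, cval=0.0)
```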

The measured volumes were processed according to conventional swept-source OCT processing, i.e., background correction, zero-padding, compensation of the occurring dispersion mismatch, filtering with a Hann window and applying the inverse Fourier transform, using a custom MATLAB script (MATLAB R2022b, Mathworks). The acquired volumes were stored in the nearly raw raster data (NRRD) format.
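This processing chain can be summarized by the following minimal numpy sketch; the authors used a custom MATLAB script, so the function structure and the quadratic dispersion phase model here are illustrative assumptions rather than the original implementation.

```python
# Minimal numpy sketch of the described swept-source OCT processing chain.
# The quadratic dispersion phase model and all parameters are illustrative.
import numpy as np

def process_fringes(fringes, disp_coeff=0.0, pad_factor=2):
    """fringes: (n_ascans, n_samples) raw spectral interferograms."""
    n = fringes.shape[-1]
    # 1) background correction: subtract the mean spectrum (DC/reference term)
    sig = fringes - fringes.mean(axis=0, keepdims=True)
    # 2) dispersion mismatch compensation via a spectral phase term
    k = np.linspace(-1.0, 1.0, n)
    sig = sig * np.exp(-1j * disp_coeff * k**2)
    # 3) spectral shaping with a Hann window to suppress side lobes
    sig = sig * np.hanning(n)
    # 4) zero-padding (folded into the inverse FFT) and transform to depth
    ascans = np.fft.ifft(sig, n=pad_factor * n, axis=-1)
    # 5) log-scaled magnitude; keep the positive-depth half of the result
    return 20.0 * np.log10(np.abs(ascans[..., : pad_factor * n // 2]) + 1e-12)
```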

To support the manual image segmentation with less noisy and less speckled images, the OCT volumes were additionally processed beforehand with the tomographic non-local means despeckling (TNode) algorithm by Cuartas-Vélez et al.14. However, the despeckling calculation is time-consuming and thus not suitable for real-time application. Therefore, it was not applied to the data used for the neural network training.
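TNode itself is not reproduced here; as a simple stand-in that illustrates the idea of non-local means despeckling, a standard 2D non-local means filter from scikit-image can be applied slice-wise to the B-scans (the parameters below are illustrative and not those of Ref. 14).

```python
# Slice-wise non-local means despeckling as a simple stand-in for TNode;
# parameters are illustrative, not those of Cuartas-Velez et al.
import numpy as np
from skimage.restoration import denoise_nl_means, estimate_sigma

def despeckle_slicewise(volume):
    out = np.empty(volume.shape, dtype=np.float32)
    for i, bscan in enumerate(volume.astype(np.float32)):
        sigma = float(estimate_sigma(bscan))      # per-slice noise estimate
        out[i] = denoise_nl_means(bscan, h=0.8 * sigma, sigma=sigma,
                                  patch_size=5, patch_distance=6)
    return out
```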

Segmentation of Anatomical Structures

For semantic segmentation of the middle ear structures (tympanic membrane, malleus, incus, stapes and promontory), the open-source software 3D Slicer (version 5.2.2, https://www.slicer.org) with the Segmentation Editor tool was employed. Three raters, a medical student, a biomedical engineer and an experienced clinician (expert), conducted the segmentation following the provided segmentation protocol as a guideline (see Supplementary file 1).

In practice, voxel-wise segmentation of all image slices is time-consuming: for each OCT sample, segmenting the volume from scratch, including a quality check, took at least five hours. Thus, to reduce this workload, the Human-in-the-Loop approach was harnessed to train a deep neural network (nnUnet8) iteratively and to use the predictions of nnUnet as pre-segmentations for the human raters to work on.

The HITL procedure is depicted in Fig. 3 and consists of two main phases. In the initial phase, 14 OCT images from healthy ears were segmented manually by an experienced clinician and formed the initial training set for nnUnet. In each subsequent iteration, the clinician corrected the predictions of the network trained in the previous iteration on not-yet-segmented samples; these new image volumes contained more pathological samples than the previous iteration. Together with the corrected segmentation masks, they made up the training set for the next iteration. The loop was stopped when the segmentation loss on a test set of 5 samples was low and the predictions were qualitatively approved by the expert. The time required for segmenting each sample was thereby reduced from at least five hours to 20 minutes on average. The number of samples for each iteration is listed in Table 2. Thanks to the HITL process, the two human raters were able to use the predictions of nnUnet as pre-segmentations and perform corrections until the segmentation accorded with the real middle ear morphology. At the end of the HITL process, the segmentation results of the latest nnUnet, which acts as the third rater for all samples, were collected. The results were checked by an expert clinician and then merged with the segmentations from the two other raters using the STAPLE15 algorithm. The final segmentation mask was again checked by a clinician.
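As an illustration of the merging step, the sketch below fuses three label maps with SimpleITK's multi-label STAPLE filter; the file names are placeholders, and the call stands in for the merging pipeline described above.

```python
# Minimal sketch of fusing the three raters' label maps with multi-label
# STAPLE in SimpleITK; file names are placeholders.
import SimpleITK as sitk

raters = [sitk.ReadImage(f, sitk.sitkUInt8)
          for f in ("rater1.nrrd", "rater2.nrrd", "rater3.nrrd")]
# MultiLabelSTAPLE jointly estimates the underlying true segmentation and
# per-rater performance parameters from the input label maps
merged = sitk.MultiLabelSTAPLE(raters)
sitk.WriteImage(merged, "merged.nrrd")
```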

Fig. 3: Segmentation scheme.
figure 3

Left: nnUnet is trained following the Human-in-the-Loop strategy and acts as the third rater. The human raters, an experienced medical student and a biomedical engineer, further correct the predictions of nnUnet to save effort. The final product is the fusion of the segmentations from the three raters, checked by the expert. Right: scheme of the Human-in-the-Loop process; the expert creates the initial training set to train nnUnet and corrects the inference results on un-segmented samples. The corrected segmentations comprise the new training set for the next iteration.

Table 2 Sample distribution across HITL iterations.

Annotation of Sparse Landmarks

Image segmentation extracts the critical information and simplifies the analysis of OCT images. However, since the imaged structures are incomplete, approaches like multi-modal image fusion can be applied to reveal the absent parts and further facilitate the interpretation process. Landmarks that describe the morphology of the segmented structures can thus provide more fine-grained information and act as a performance measure for the results of fusion algorithms. In this paper, two biomedical data scientists annotated the sparse landmarks (see Fig. 1) of the segmented structures using 3D Slicer (version 5.2.2) with the Markup module. The annotation process followed the guidelines elaborated in Supplementary file 2. In a last step, the landmarks were checked and corrected by the expert and constitute the final output.

For the tympanic membrane, the annulus marking the boundary (about 20 points) and the umbo, which is the central point of maximum depression and marks the end of the manubrium, are annotated. Two landmarks are placed to mark the short process of the malleus and the malleus handle. The long process of the incus is usually only partially visible, so the corresponding landmark consists of two points: the most proximal visible point and the distal tip of the long process of the incus, right above the incudostapedial joint. Furthermore, a single point is marked on the stapes due to its limited visibility. Note that all these landmarks are placed on the outer side of the epithelium or bones and are annotated for the merged segmentation only.
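The landmarks are stored as 3D Slicer markups JSON files; a minimal sketch for reading the control points follows (the file name is a placeholder, and the key layout assumes Slicer's standard markups schema).

```python
# Minimal sketch for reading landmark control points from a 3D Slicer
# markups JSON file; the file name is a placeholder.
import json

with open("landmarks/tympanic_membrane.mrk.json") as f:
    markup = json.load(f)

for point in markup["markups"][0]["controlPoints"]:
    x, y, z = point["position"]   # coordinates in the markup's coordinate system
    print(point["label"], x, y, z)
```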

Data Records

The DIOME dataset is stored at OpARA (Open Access Repository and Archive, https://doi.org/10.25532/OPARA-279)16 and accessible without prior registration. The data folder structure is shown in Fig. 4. The first layer consists of 43 sub-folders, one for each OCT middle ear sample. Within each sample folder, three items are listed: a metadata YAML file describing the basic information of the sample (e.g. whether it is a left or right ear) and the OCT measurement settings, the OCT image volume in NRRD format, and an annotation folder containing all annotation-related items. Within each annotation folder, three NRRD files represent the segmentation results from the three raters, and their merged result is saved in a folder next to them. Since the landmarks come along with the merged segmentation, a folder named "landmarks" is placed next to it, which contains six JSON files for the sparse landmarks.

Fig. 4: Folder structure of the dataset.
figure 4

From the top to the lowest level: 1) dataset folder, 2) numbered sample folders, 3) sample data, 4) segmentation data, 5) merged segmentation and corresponding landmarks, 6) landmark data.
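A minimal sketch for loading one sample is given below; the concrete folder and file names are assumptions based on the structure above and should be adapted to the actual folder contents.

```python
# Minimal sketch for loading one sample; folder and file names are placeholders.
import nrrd
import yaml
from pathlib import Path

sample = Path("DIOME/01")                     # hypothetical sample folder
meta = yaml.safe_load(next(sample.glob("*.yaml")).read_text())
volume, vol_header = nrrd.read(str(next(sample.glob("*.nrrd"))))
seg, seg_header = nrrd.read(str(sample / "annotation" / "merged" / "merged.nrrd"))
print(meta, volume.shape, seg.shape)
```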

Technical Validation

To merge the segmentations from the three annotators for each image volume, comprising two human raters and one segmentation neural network, the STAPLE15 algorithm was employed. It takes a collection of segmentations of an image, simultaneously computes a probabilistic estimate of the underlying true segmentation and of each rater's performance, and is often applied in the biomedical field. To validate our segmentations from all three annotators, two metrics commonly used for measuring segmentation performance were calculated (a minimal computation sketch follows the list below):

  • The F1 score is a counting-based metric that measures the overlap between a reference mask and another input segmentation. Its value ranges from 0 to 1, where 0 means no overlap and 1 means full overlap.

  • The Hausdorff distance measures the maximum distance between two segmentation masks. As a distance-based metric, it usually complements the counting-based metric and focuses on the assessment of the segmentation boundary and shape. Here, we normalized the Hausdorff distance by the diagonal of the 3D image volume: values close to 0 indicate little separation between the two segmentations, while values close to 1 indicate a large distance.
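A minimal sketch of both metrics for a single structure, assuming binary numpy masks on the same voxel grid, is given below; the normalization by the volume diagonal follows the description above. Note that for binary masks the F1 score is identical to the Dice coefficient.

```python
# Minimal sketch of the two validation metrics for one binary structure mask.
import numpy as np
from scipy.spatial.distance import directed_hausdorff

def f1_score(a, b):
    # 2*|A n B| / (|A| + |B|), identical to Dice for binary masks
    inter = np.logical_and(a, b).sum()
    return 2.0 * inter / (a.sum() + b.sum())

def normalized_hausdorff(a, b):
    # symmetric Hausdorff distance on voxel coordinates, normalized by the
    # diagonal of the 3D image volume
    pa, pb = np.argwhere(a), np.argwhere(b)
    hd = max(directed_hausdorff(pa, pb)[0], directed_hausdorff(pb, pa)[0])
    return hd / np.linalg.norm(a.shape)
```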

The results of the segmentation evaluation are listed in Table 3, where the anatomical structures are ordered by their distance to the OCT probe. The values in each cell show the average F1 score and Hausdorff distance of each annotator for each anatomical structure. As the table values indicate, most of the anatomical structures do not show large discrepancies, and the annotators agree with the merged results. However, with increasing distance to the probe (from top to bottom), a decreasing tendency of the F1 score can be observed, which corresponds to the decrease of OCT image quality over depth, e.g. stronger noise around the stapes region. Although the F1 scores of stapes and promontory are lower than those of the other structures, particularly for annotator A1, the Hausdorff distances are low enough to support the plausibility of these segmentations.

Table 3 Comparison between the segmentation of all raters including two human raters (A1, A2) and one neural network rater (A3).

Interestingly, the third rater, i.e. the neural network, outperforms rater A1, the medical student. This demonstrates that the neural network segments OCT images with a capability comparable to that of a human rater.

Usage Notes

The dataset is published under the Creative Commons Attribution (CC-BY 4.0) license. It can facilitate algorithm development for various deep learning tasks, for example semantic segmentation, pathology detection or classification. On the one hand, it can be combined with OCT image datasets of other anatomies for fast learning on OCT data and to improve performance. On the other hand, integration with data from other modalities via image registration enables knowledge transfer that promotes the visibility and readability of the target structures. For easy processing of the dataset and evaluation of the developed methods, basic functions including 3D model reconstruction, visualization, and metrics calculation are provided (a reconstruction sketch is given below).
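As an example of the reconstruction step, the sketch below extracts a triangle mesh for one structure from the merged segmentation with marching cubes; the label value and file path are hypothetical placeholders, and the dataset's own utilities should be consulted for the actual label mapping.

```python
# Minimal sketch of 3D model reconstruction from one label of the merged
# segmentation via marching cubes; label index and path are placeholders.
import nrrd
import numpy as np
from skimage import measure

seg, header = nrrd.read("annotation/merged/merged.nrrd")
TM_LABEL = 1                                  # hypothetical label for the TM
mask = (seg == TM_LABEL).astype(np.uint8)
verts, faces, normals, _ = measure.marching_cubes(mask, level=0.5)
# verts/faces can be rendered or exported, e.g. with trimesh or PyVista
```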