Automated detection of enlarged extraocular muscle in Graves’ ophthalmopathy with computed tomography and deep neural network

Hanai, Kaori; Tabuchi, Hitoshi; Nagasato, Daisuke; Tanabe, Mao; Masumoto, Hiroki; Miya, Sakurako; Nishio, Natsuno; Nakamura, Hirohiko; Hashimoto, Masato

doi:10.1038/s41598-022-20279-4

Article
Open access
Published: 26 September 2022

Volume 12, article number 16036, (2022)

Scientific Reports

Kaori Hanai¹,
Hitoshi Tabuchi^2,3,
Daisuke Nagasato^2,3,4,
Mao Tanabe²,
Hiroki Masumoto²,
Sakurako Miya²,
Natsuno Nishio²,
Hirohiko Nakamura⁵ &
…
Masato Hashimoto¹

Abstract

This study aimed to develop a diagnostic software system that uses a deep neural network to evaluate enlarged extraocular muscles (EEM) in patients with Graves’ ophthalmopathy (GO). This prospective observational study involved 371 participants (199 EEM patients with GO and 172 controls with normal extraocular muscles) whose extraocular muscles were examined with orbital coronal computed tomography. A patient was classified as an EEM patient with GO when at least one rectus muscle (right or left superior, inferior, medial, or lateral) was 4.0 mm or larger. We used 222 images as the training data, 74 images as the validation data, and 75 images as the test data to train and evaluate a deep neural network that judges the thickness of the extraocular muscles on computed tomography. We then validated the performance of the network. In the test data, the area under the curve was 0.946 (95% confidence interval (CI) 0.894–0.998), and receiver operating characteristic analysis demonstrated 92.5% (95% CI 79.6%–98.4%) sensitivity and 88.6% (95% CI 73.3%–96.8%) specificity. The results suggest that the deep learning system with the deep neural network can detect EEM in patients with GO.

Introduction

Graves’ ophthalmopathy (GO) is a chronic autoimmune disorder that affects the retrobulbar tissues and extraocular muscles with strong etiological links to autoimmune thyroid disease. Extraocular muscle dysfunction reportedly occurs in approximately 40%–60% of patients with GO in actual clinical practice^1,2 and has significant negative effects on the quality of life³. Early detection of extraocular muscle abnormalities on orbital imaging might thus be necessary for managing thyroid myopathy successfully. In actual clinical practice, orbital imaging is not likely to be performed unless the patient complains of double vision. Additionally, radiologists may not always be available to interpret the findings, especially in regions with a shortage of doctors^4,5. In some regions of developing countries, facilities for adequate imaging might be scarcer than radiologists.

Supervised machine learning systems based on neural networks have been applied to medical research⁶. Many studies have examined the diagnostic and classification performance of deep learning (DL) systems with CT images^7–13. However, to the best of our knowledge, no report has described a DL system that distinguishes enlarged extraocular muscle (EEM) images of patients with GO from normal extraocular muscle (NEM) images of normal subjects using CT images.

This research aimed to develop a diagnostic software system in which a DL system could evaluate the EEM in patients with GO with orbital CT images.

Results

We used EEM images from 199 patients (56 men and 143 women) with GO (mean age, 55.9 ± 13.7 years) and NEM images from 172 controls (40 men and 132 women; mean age, 52.6 ± 18.4 years) in this analysis. We found no significant differences in age (p = 0.21) or gender (p = 0.85) between the two groups (Table 1).

Table 1 Participant characteristics.

Table 2 shows the right and left superior, inferior, medial, and lateral rectus muscles in the two groups. All right or left rectus muscle thicknesses differed significantly between the two groups (each p < 0.001).

Table 2 The difference in the maximum diameter between enlarged extraocular muscle (EEM) and normal extraocular muscle (NEM).

In the test data, the area under the curve (AUC) for diagnosis by the neural network was 0.946 (95% confidence interval [CI] 0.894–0.998), and receiver operating characteristic (ROC) analysis demonstrated 92.5% (95% CI 79.6%–98.4%) sensitivity and 88.6% (95% CI 73.3%–96.8%) specificity (Fig. 1). Analyzing the CT scans of the 75 patients in the test data took 276.2 s (3.6 s/patient).

Figure 2 shows composite images where the representative orbital CT images of patients with GO and healthy participants were layered with the heat maps. The right and left rectus muscles in the orbital CT images are displayed in blue, indicating the parts of the image where the DL model focuses on distinguishing between EEM and NEM.

Discussion

We investigated whether a DL system could evaluate EEM in patients with GO. This system was able to classify both EEM and NEM with high AUCs, sensitivity, and specificity, indicating that the system distinguished images as belonging to participants with EEM or those with NEM on orbital CT images with nearly the same level of accuracy as that of doctors.

Our study used a 4-mm extraocular muscle diameter as the cutoff for abnormality. This cutoff was based on Dutton's report of NEM thickness²⁴. However, Ozgen et al.¹⁴ reported that the mean maximum diameters of the extraocular muscles measured using conventional CT were MR 4.2 mm (range 3.3–5.0), LR 3.3 mm (range 1.7–4.8), SR 4.6 mm (range 3.2–6.1), and IR 4.8 mm (range 3.2–6.5). With conventional CT, the chin-up posture of participants during coronal section imaging varies between individuals, which may increase the variability of measured extraocular muscle thickness. In contrast, our study used spiral CT, in which coronal images are reconstructed from horizontal cross-sectional images that are captured at the same angle because participants hold a constant posture during imaging. Accordingly, our results showed less variation in extraocular muscle thickness in the control group than the findings of Ozgen et al. We therefore assumed that our extraocular muscle thickness results were consistent with Dutton’s, with an average thickness of less than 4 mm for each extraocular muscle.

A nationwide survey of patients with GO in the United Kingdom revealed delays in diagnosis, wide variability of access to specialist centers, appropriate treatment, and overall low patient satisfaction with treatment¹⁵. The same study revealed that only 25% of patients had referrals to a specialist GO clinic and that referrals were typically late. In several studies on general health-related questionnaires about quality of life among patients with GO, the scores of these patients were lower than those of the healthy reference population^16,17. Gerding et al. reported that quality-of-life scores among patients with GO were worse than those in patients with diabetes, emphysema, or heart failure¹⁶. In approximately 70% of adults with Graves’ hyperthyroidism, magnetic resonance imaging or CT scanning reveals EEM¹⁸. Physicians thus need to monitor patients for ocular signs, including lid edema, lid retraction, and proptosis on visual inspection, and EEM, as demonstrated on orbital imaging, in patients with Graves’ hyperthyroidism. We consider that early detection and treatment of thyroid myopathy may become possible if the DL software system evaluating EEM in GO plays a supporting role in the actual clinical practice.

The modified clinical activity score (CAS) is currently the most widely used index to determine the active phase of inflammation in GO¹⁹. However, recent studies of GO indicated that the CAS may not reflect the inflammatory activity of myopathy, especially in mild to moderate GO with low exophthalmos values and low scores on NOSPECS, a system that classifies the clinical severity of GO (no sign of thyroid disease, only eyelid signs, soft tissue involvement, proptosis, extraocular motility restriction, corneal involvement, and sight loss)^20,21. Nagy et al. reported that EEM does not imply the presence of edematous swelling, and the severity of diplopia is unrelated to the degree of ocular congestion and edema²⁰. Kim et al. reported that 44.4% of patients with GO and progressive diplopia had low CASs and no typical symptoms of inflammation²¹. These findings may have arisen because the CAS reflects primarily ocular muscle involvement and acute orbital congestion, which represents inflammatory changes within orbital connective and adipose tissues. Ophthalmologists thus must detect EEM early in the course of GO.

In our heat maps showing the focus of the DL system, color intensity increased around the rectus muscles on the orbital CT images. The areas in the orbital CT images that the DL system focused on were consistent with those that ophthalmologists focus on when they use CT images to confirm EEM. In other words, the generated heat maps suggest that DL systems can accurately detect EEM associated with GO on orbital CT images. Our DL software system may be helpful in the ophthalmological assessment of patients with GO.

Our system had several limitations. First, our study was conducted within a single facility, and the model’s robustness must be evaluated prospectively with data from multiple facilities. Second, from the perspective of radiation exposure to the participants, images with a slice thickness of 2 mm were used during CT imaging in this study. Using images with finer slice thickness may improve accuracy. Third, the judgment of EEM was based on measurements of the thickness of the muscles on two-dimensional CT images. The muscles’ volumetric measurement must be evaluated on three-dimensional CT or magnetic resonance images. Finally, DL’s performance and versatility should be evaluated extensively with larger samples and more images.

In conclusion, our results indicate that our DL system and orbital coronal CT had high accuracy for detecting EEM in GO. DL systems to screen orbital coronal CT images may yield useful information about early treatment for EEM patients with GO.

Methods

Patients

This prospective observational study complied with the Declaration of Helsinki. The study protocol followed the ethics committees of Nakamura Memorial and Tsukazaki Hospital. The patients provided written informed consent for the publication of this study and accompanying images. All experimental protocols were approved by the licensing committees of these hospitals.

In this study, we examined data from patients with GO and healthy normal subjects who had orbital CT scans at Nakamura Memorial Hospital between February 2017 and November 2019. An experienced neuro-ophthalmologist diagnosed GO using Bartley and Gorman's criteria²². Patients with orbital tumors, blowout fractures, immunoglobulin G4-associated ophthalmopathy, or idiopathic orbital inflammation were excluded from this study.

Extraocular muscles were analyzed with orbital images obtained using a whole-body CT system (SOMATOM Definition AS+; Siemens, Erlangen, Germany) without contrast. Axial scans were obtained at an angle of −10° to −15° to the orbitomeatal line, and coronal scans in a paraxial plane 90° to the orbital axis were reconstructed from the axial scans (slice thickness, 2 mm). We measured the diameter of all rectus muscles shown on six slices from the globe’s posterior margin to the orbital apex (Fig. 3). The maximum diameter was defined as the thickest diameter of each muscle on the six slices. The spindle-like spreading of the rectus muscles without tendon involvement was identified morphologically as EEM²³. Diameters of the superior, inferior, medial, and lateral rectus muscles were measured on coronal scans. The inferior and superior oblique muscles were excluded because their course is oblique to the coronal plane.

Anatomically, the rectus muscles are typically 2.5–4.0 mm thick at the midpoint²⁴. Therefore, we classified rectus muscles > 4.0 mm thick as enlarged. On this basis, this study involved 371 participants (199 patients with EEM and 172 controls with NEM). All 199 EEM patients were diagnosed with GO.
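The measurement rule described above (maximum diameter over six coronal slices, then a > 4.0 mm cutoff) can be sketched as follows. This is only an illustration, not the authors' code: the function names, the dictionary layout, and the example measurements are hypothetical.

```python
from typing import Dict, List

RECTUS_MUSCLES = ("superior", "inferior", "medial", "lateral")
EEM_CUTOFF_MM = 4.0  # cutoff stated in the text (> 4.0 mm counts as enlarged)


def max_diameters(slices: List[Dict[str, float]]) -> Dict[str, float]:
    """Return the thickest diameter (mm) of each rectus muscle across the slices."""
    return {m: max(s[m] for s in slices) for m in RECTUS_MUSCLES}


def is_enlarged(right: List[Dict[str, float]], left: List[Dict[str, float]]) -> bool:
    """True if any rectus muscle in either orbit exceeds the cutoff."""
    maxima = list(max_diameters(right).values()) + list(max_diameters(left).values())
    return any(d > EEM_CUTOFF_MM for d in maxima)


# Hypothetical measurements (mm) on six coronal slices per orbit.
normal_orbit = [
    {"superior": 3.1, "inferior": 3.4, "medial": 3.6, "lateral": 2.9}
    for _ in range(6)
]
enlarged_orbit = normal_orbit[:5] + [
    {"superior": 3.2, "inferior": 5.1, "medial": 3.8, "lateral": 3.0}
]

print(is_enlarged(normal_orbit, normal_orbit))    # no muscle above the cutoff
print(is_enlarged(normal_orbit, enlarged_orbit))  # one inferior rectus above the cutoff
```

A single muscle above the cutoff in either orbit is enough to label the participant, which mirrors the "at least one rectus muscle" wording of the classification rule.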

The DL model and its training

The DL algorithm consists of four main processes: (1) extraction of the retrobulbar region from the CT image; (2) trimming of the orbital area on the CT image; (3) classification of the presence or absence of hypertrophied extraocular muscle; and (4) evaluation of extraocular muscle abnormality in GO. For down-sampling and up-sampling, the neural network architecture for segmentation was obtained through Residual Network-50²⁵ (Supplementary Fig. S1). First, the globe was segmented on coronal CT slices, and the orbital region posterior to the segmented globe was segmented and trimmed using Residual Network-50 (Fig. 4). The code is provided in the supplemental data. Next, all rectus muscles judged by the neuro-ophthalmologist to be abnormal on coronal CT were tagged. For classification, we used the Visual Geometry Group-16²⁶ as the neural network and trained the DL system using the tag (Supplementary Fig. S2). The neural network generates the probability for each slice’s category (e.g., 0.1 for normal and 0.9 for abnormal). If the probability of “abnormal” exceeds a certain threshold, the slice is considered abnormal. We calculated this threshold from the validation data. Additionally, we calculated the proportion of slices considered abnormal by the neural network. If the proportion exceeded a certain threshold, the CT data as a whole was judged to reveal extraocular muscle abnormalities.
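The two-stage decision described above (a per-slice probability threshold, then a threshold on the proportion of abnormal slices) might look like the sketch below. The threshold values 0.5 and 0.3 are placeholders chosen for illustration; the paper derives its slice threshold from the validation data and does not report the numbers here. The function name is also hypothetical.

```python
from typing import Sequence, Tuple


def judge_scan(
    slice_probs: Sequence[float],
    slice_threshold: float,
    proportion_threshold: float,
) -> Tuple[bool, float]:
    """Aggregate per-slice 'abnormal' probabilities into one scan-level decision.

    Returns (scan_is_abnormal, proportion_of_abnormal_slices).
    """
    if not slice_probs:
        raise ValueError("at least one slice probability is required")
    # Step 1: a slice is abnormal if its probability exceeds the slice threshold.
    n_abnormal = sum(1 for p in slice_probs if p > slice_threshold)
    # Step 2: the scan is abnormal if the abnormal proportion exceeds its threshold.
    proportion = n_abnormal / len(slice_probs)
    return proportion > proportion_threshold, proportion


# Hypothetical network outputs for the slices of two scans.
scan_a = [0.9, 0.8, 0.2, 0.1, 0.7, 0.3]
scan_b = [0.1, 0.2, 0.6, 0.1, 0.2, 0.3]

print(judge_scan(scan_a, slice_threshold=0.5, proportion_threshold=0.3))
print(judge_scan(scan_b, slice_threshold=0.5, proportion_threshold=0.3))
```

Aggregating over slices in this way means a single noisy slice does not decide the outcome for the whole CT data set.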

For all model training, the loss function was the sum of binary cross-entropy and Dice loss, the batch size was 16, and the number of epochs was 100. These details are included in the supplemental code.
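As a rough illustration of a loss formed by adding binary cross-entropy and Dice loss, the sketch below works on flattened lists of labels and predicted probabilities. The smoothing constant, the clipping epsilon, and the function name are assumptions made for this example; the authors' exact formulation is in their supplemental code and may differ.

```python
import math
from typing import Sequence


def bce_dice_loss(
    y_true: Sequence[float],
    y_pred: Sequence[float],
    eps: float = 1e-7,
    smooth: float = 1.0,
) -> float:
    """Sum of mean binary cross-entropy and (1 - Dice coefficient)."""
    # Clip probabilities so that log() never receives 0.
    clipped = [min(max(p, eps), 1.0 - eps) for p in y_pred]
    bce = -sum(
        t * math.log(p) + (1.0 - t) * math.log(1.0 - p)
        for t, p in zip(y_true, clipped)
    ) / len(y_true)
    # Soft Dice coefficient with a smoothing term to avoid division by zero.
    intersection = sum(t * p for t, p in zip(y_true, y_pred))
    dice = (2.0 * intersection + smooth) / (sum(y_true) + sum(y_pred) + smooth)
    return bce + (1.0 - dice)


labels = [1.0, 0.0, 1.0, 0.0]
good_prediction = [0.9, 0.1, 0.8, 0.2]
poor_prediction = [0.4, 0.6, 0.3, 0.7]

print(bce_dice_loss(labels, good_prediction))
print(bce_dice_loss(labels, poor_prediction))
```

The cross-entropy term penalizes each element independently, while the Dice term rewards overlap between prediction and label as a whole, which is why the two are often combined.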

For training data, we used coronal scans from 120 patients with EEM and 102 controls with NEM; for validation data, we used scans from 39 patients with EEM and 35 controls with NEM; and for test data, we used scans from 40 patients with EEM and 35 controls with NEM.
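The split sizes above can be checked against the totals reported elsewhere in the article (222, 74, and 75 images; 199 EEM and 172 NEM participants). The short sketch below only restates the counts given in the text; the variable names are illustrative.

```python
# Counts taken from the text: participants per class in each split.
splits = {
    "training": {"EEM": 120, "NEM": 102},
    "validation": {"EEM": 39, "NEM": 35},
    "test": {"EEM": 40, "NEM": 35},
}

# Size of each split and of each class across splits.
split_totals = {name: sum(counts.values()) for name, counts in splits.items()}
eem_total = sum(counts["EEM"] for counts in splits.values())
nem_total = sum(counts["NEM"] for counts in splits.values())
grand_total = eem_total + nem_total

for name, total in split_totals.items():
    print(f"{name}: {total} scans ({total / grand_total:.1%} of all participants)")
print(f"EEM: {eem_total}, NEM: {nem_total}, total: {grand_total}")
```

The three splits come out at roughly 60%, 20%, and 20% of the 371 participants, with a similar EEM-to-NEM ratio in each.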

Statistical analysis

We used Fisher’s exact test and the unpaired t-test to compare differences between EEM and NEM. We constructed ROC curves from the proportion of CT slices judged as abnormal by the neural network and the diagnostic imaging data. Then, we calculated the AUC of the ROC curve, identified the point at which the ROC curve was closest to the upper left corner (100% sensitivity, 100% specificity), and calculated the sensitivity and specificity at that point. The 95% CI of the AUC was calculated assuming a normal distribution²⁷; the Clopper–Pearson method was used to calculate the 95% CIs for sensitivity and specificity²⁸.
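The interval calculations and the operating-point rule described above can be sketched with the standard library alone. Two assumptions are made here and are not stated in the text: the normal-approximation interval for the AUC is taken to be the Hanley and McNeil standard-error formula, and the counts 37 of 40 (sensitivity) and 31 of 35 (specificity) are inferred from the reported percentages and test-set sizes. All function names are hypothetical.

```python
import math
from statistics import NormalDist
from typing import Callable, List, Tuple


def auc_normal_ci(auc: float, n_pos: int, n_neg: int, level: float = 0.95) -> Tuple[float, float]:
    """Normal-approximation CI for an AUC using the Hanley-McNeil standard error."""
    q1 = auc / (2.0 - auc)
    q2 = 2.0 * auc ** 2 / (1.0 + auc)
    var = (auc * (1.0 - auc)
           + (n_pos - 1) * (q1 - auc ** 2)
           + (n_neg - 1) * (q2 - auc ** 2)) / (n_pos * n_neg)
    half = NormalDist().inv_cdf(0.5 + level / 2.0) * math.sqrt(var)
    return auc - half, auc + half


def _binom_cdf(k: int, n: int, p: float) -> float:
    """P(X <= k) for X ~ Binomial(n, p)."""
    return sum(math.comb(n, i) * p ** i * (1.0 - p) ** (n - i) for i in range(k + 1))


def _solve_increasing(g: Callable[[float], float], target: float) -> float:
    """Bisection on [0, 1] for an increasing function g with g(p) = target."""
    lo, hi = 0.0, 1.0
    for _ in range(100):
        mid = (lo + hi) / 2.0
        if g(mid) < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


def clopper_pearson(k: int, n: int, level: float = 0.95) -> Tuple[float, float]:
    """Exact (Clopper-Pearson) CI for a binomial proportion k/n."""
    alpha = 1.0 - level
    # Lower bound: P(X >= k | p) = alpha / 2. Upper bound: P(X <= k | p) = alpha / 2.
    lower = 0.0 if k == 0 else _solve_increasing(lambda p: 1.0 - _binom_cdf(k - 1, n, p), alpha / 2.0)
    upper = 1.0 if k == n else _solve_increasing(lambda p: 1.0 - _binom_cdf(k, n, p), 1.0 - alpha / 2.0)
    return lower, upper


def closest_to_upper_left(points: List[Tuple[float, float, float]]) -> Tuple[float, float, float]:
    """Pick the (threshold, sensitivity, specificity) point nearest to (100%, 100%)."""
    return min(points, key=lambda t: (1.0 - t[1]) ** 2 + (1.0 - t[2]) ** 2)


print(auc_normal_ci(0.946, n_pos=40, n_neg=35))  # AUC interval
print(clopper_pearson(37, 40))                   # sensitivity interval (assumed counts)
print(clopper_pearson(31, 35))                   # specificity interval (assumed counts)
```

Under these assumptions the functions give intervals close to those reported in the Results, which suggests, but does not confirm, that the analysis followed this route.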

All statistical analyses were performed using the Python library SciPy (https://www.scipy.org/). Statistical significance was defined as p < 0.05.

Heat map

The two main types of explainability in machine learning technology are intrinsic explainability and post hoc explainability²⁹. In this study, we used Score-CAM (score-weighted class activation mapping), a type of post hoc visual explanation method³⁰, to construct heat maps indicating the image areas on which the convolutional neural network focused. The target layer was the block5_conv2 layer of Visual Geometry Group-16. The heat maps revealed that the model focused more on the blue parts of the image.
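The core idea of Score-CAM can be sketched in a few lines: each activation map is normalized and used as a mask on the input, the model's score on the masked input becomes that map's weight after a softmax, and the weighted maps are summed and passed through a ReLU. The version below is a simplified toy on nested lists, not the authors' implementation. It assumes the activation maps are already upsampled to the input size, and `score_fn` is a hypothetical stand-in for the trained network's "abnormal" score.

```python
import math
from typing import Callable, List

Image = List[List[float]]


def _normalize(m: Image) -> Image:
    """Rescale a 2D map to [0, 1]; a constant map becomes all zeros."""
    lo = min(min(row) for row in m)
    hi = max(max(row) for row in m)
    if hi == lo:
        return [[0.0 for _ in row] for row in m]
    return [[(v - lo) / (hi - lo) for v in row] for row in m]


def score_cam(image: Image, activation_maps: List[Image],
              score_fn: Callable[[Image], float]) -> Image:
    """Simplified Score-CAM heat map for one image."""
    # 1. Normalize each activation map and use it as a soft mask on the input.
    masks = [_normalize(a) for a in activation_maps]
    scores = [
        score_fn([[px * mk for px, mk in zip(img_row, mask_row)]
                  for img_row, mask_row in zip(image, mask)])
        for mask in masks
    ]
    # 2. Softmax over the scores gives one weight per activation map.
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    weights = [e / sum(exps) for e in exps]
    # 3. Weighted sum of the maps followed by ReLU, then rescale for display.
    cam = [
        [max(0.0, sum(w * a[i][j] for w, a in zip(weights, activation_maps)))
         for j in range(len(image[0]))]
        for i in range(len(image))
    ]
    return _normalize(cam)


# Toy example: the stand-in model only responds to the top-left pixel.
toy_image = [[1.0, 1.0], [1.0, 1.0]]
toy_maps = [[[1.0, 0.0], [0.0, 0.0]], [[0.0, 0.0], [0.0, 1.0]]]
heat_map = score_cam(toy_image, toy_maps, score_fn=lambda img: 5.0 * img[0][0])
print(heat_map)
```

In the toy example the heat map concentrates on the top-left pixel, the only region the stand-in model responds to, which is the behavior the method is meant to expose.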

Data availability

The CT images and the image data sets used in this study are available upon reasonable request from the corresponding authors.

References

1. Kozaki, A. et al. Proptosis in dysthyroid ophthalmopathy: a case series of 10931 Japanese cases. Optom. Vis. Sci. 87, 200–204 (2010).
2. Hiromatsu, Y., Eguchi, H., Tani, J., Kasaoka, M. & Teshima, Y. Graves’ ophthalmopathy: Epidemiology and natural history. Intern. Med. 53, 353–360 (2014).
3. Son, B. J., Lee, S. Y. & Yoon, J. S. Evaluation of thyroid eye disease: Quality-of-life questionnaire (TED-QOL) in Korean patients. Can. J. Ophthalmol. 49, 167–173 (2014).
4. Gonçalves, A. C., Silva, L. N., Gebrim, E. M., Matayoshi, S. & Monteiro, M. L. Predicting dysthyroid optic neuropathy using computed tomography volumetric analyses of orbital structures. Clinics 67, 891–896 (2012).
5. Gonçalves, A. C., Gebrim, E. M. & Monteiro, M. L. Imaging studies for diagnosing Graves’ orbitopathy and dysthyroid optic neuropathy. Clinics 67, 1327–1334 (2012).
6. Jiang, J., Zhou, L., He, Y., Jiang, X. & Fu, Y. Using a stacked neural network to improve the auto-segmentation accuracy of Graves’ ophthalmopathy target volumes for radiotherapy. Sheng Wu Yi Xue Gong Cheng Xue Za Zhi 37, 670–675 (2020).
7. Ardila, D. et al. End-to-end lung cancer screening with three-dimensional deep learning on low-dose chest computed tomography. Nat. Med. 25, 954–961 (2019).
8. Shen, L., Zhao, W. & Xing, L. Patient-specific reconstruction of volumetric computed tomography images from a single projection view via deep learning. Nat. Biomed. Eng. 3, 880–888 (2019).
9. Huang, Z. et al. The correlation of deep learning-based CAD-RADS evaluated by coronary computed tomography angiography with breast arterial calcification on mammography. Sci. Rep. 10, 11532 (2020).
10. Pan, F. et al. A novel deep learning-based quantification of serial chest computed tomography in coronavirus Disease 2019 (COVID-19). Sci. Rep. 11, 417 (2021).
11. Chen, J. et al. Deep learning-based model for detecting 2019 novel coronavirus pneumonia on high-resolution computed tomography. Sci. Rep. 10, 19196 (2020).
12. Jaskari, J. et al. Deep learning method for mandibular canal segmentation in dental cone beam computed tomography volumes. Sci. Rep. 10, 5842 (2020).
13. Shi, Z. et al. A clinically applicable deep-learning model for detecting intracranial aneurysm in computed tomography angiography images. Nat. Commun. 11, 6090 (2020).
14. Ozgen, A. & Ariyurek, M. Normative measurements of orbital structures using CT. AJR Am. J. Roentgenol. 170, 1093–1096 (1998).
15. Estcourt, S., Hickey, J., Perros, P., Dayan, C. & Vaidya, B. The patient experience of service for thyroid eye disease in the United Kingdom: Results of a nationwide survey. Eur. J. Endocrinol. 161, 483–487 (2009).
16. Gerding, M. N. et al. Quality of life in patients with Graves’ ophthalmopathy is markedly decreased: Measurement by the medical outcomes study instrument. Thyroid 7, 885–889 (1997).
17. Estcourt, S., Quinn, A. G. & Vaidya, B. Quality of life in thyroid eye disease: impact of quality of care. Eur. J. Endocrinol. 164, 649–655 (2011).
18. Bahn, R. S. Graves’ ophthalmopathy. N. Engl. J. Med. 362, 726–738 (2010).
19. Bartalena, L. et al. Consensus statement of the European group on Graves’ orbitopathy (EUGOGO) on the management of Graves’ orbitopathy. Thyroid 18, 333–346 (2008).
20. Nagy, E. V. et al. Graves’ ophthalmopathy: Eye muscle involvement in patients with diplopia. Eur. J. Endocrinol. 142, 591–597 (2000).
21. Kim, J. W., Woo, Y. J. & Yoon, J. S. Is modified clinical activity score an accurate indicator of diplopia progression in Graves’ ophthalmopathy patients? Endocr. J. 63, 1133–1140 (2016).
22. Bartley, G. B. & Gorman, C. A. Diagnostic criteria for Graves’ ophthalmopathy. Am. J. Ophthalmol. 119, 792–795 (1995).
23. Le Moli, R. et al. Graves’ ophthalmopathy: Extraocular muscle/total orbit area ratio is positively related to the clinical activity score. Eur. J. Ophthalmol. 22, 301–308 (2012).
24. Dutton, J. J. Atlas of Clinical and Surgical Orbital Anatomy 16–17 (W. B. Saunders, 1994).
25. He, K., Zhang, X., Ren, S. & Sun, J. Deep Residual Learning for Image Recognition. https://arxiv.org/abs/1512.03385.pdf (2015).
26. Simonyan, K. & Zisserman, A. Very Deep Convolutional Networks for Large-Scale Image Recognition. https://arxiv.org/pdf/1409.1556.pdf (2014).
27. Hanley, J. A. & McNeil, B. J. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143, 29–36 (1982).
28. Clopper, C. J. & Pearson, E. S. The use of confidence or fiducial limits illustrated in the case of the binomial. Biometrika 26, 404–413 (1934).
29. Du, M., Liu, N. & Hu, X. Techniques for interpretable machine learning. Commun. ACM 63, 68–77 (2019).
30. Wang, H. et al. Score-CAM: score-weighted visual explanations for convolutional neural networks. in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 24–25 (2020).

Acknowledgements

We thank the staff at Nakamura Memorial Hospital for their support in collecting the images and data. The authors would like to thank Enago (www.enago.jp) for the English language review.

Author information

Authors and Affiliations

Department of Ophthalmology, Nakamura Memorial Hospital, Sapporo, Japan
Kaori Hanai & Masato Hashimoto
Department of Ophthalmology, Tsukazaki Hospital, 68-1 Aboshi-Waku, Himeji City, Hyogo Prefecture, 671-1227, Japan
Hitoshi Tabuchi, Daisuke Nagasato, Mao Tanabe, Hiroki Masumoto, Sakurako Miya & Natsuno Nishio
Department of Technology and Design Thinking for Medicine, Hiroshima University Graduate School, Hiroshima, Japan
Hitoshi Tabuchi & Daisuke Nagasato
Department of Ophthalmology, Institute of Biomedical Sciences, Tokushima University Graduate School, Tokushima, Japan
Daisuke Nagasato
Department of Neurosurgery, Nakamura Memorial Hospital, Sapporo, Japan
Hirohiko Nakamura

Contributions

K.H. and D.N. wrote the main manuscript text. K.H., H.T., and M.H. designed the research. H.T. and D.N. conducted the research. M.T. and H.M. undertook the DL methods and statistical analysis. S.M. and N.N. evaluated the data. H.N. and M.M. collected the data. All the authors reviewed the manuscript.

Corresponding author

Correspondence to Daisuke Nagasato.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Legends.

Supplementary Figure S1.

Supplementary Figure S2.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Hanai, K., Tabuchi, H., Nagasato, D. et al. Automated detection of enlarged extraocular muscle in Graves’ ophthalmopathy with computed tomography and deep neural network. Sci Rep 12, 16036 (2022). https://doi.org/10.1038/s41598-022-20279-4

Received: 04 February 2022
Accepted: 12 September 2022
Published: 26 September 2022
DOI: https://doi.org/10.1038/s41598-022-20279-4
Springer Nature Limited
