Abstract
Objective
We constructed and validated a machine learning-based malignancy risk estimation model using predefined clinicoradiological features, and evaluated its clinical utility for the management of thyroid nodules.
Methods
In total, 5708 benign (n = 4597) and malignant (n = 1111) thyroid nodules were collected from 5081 consecutive patients treated in 26 institutions. Seventeen experienced radiologists evaluated nodule characteristics on ultrasonographic images. Eight predictive models were used to stratify the thyroid nodules according to malignancy risk; model performance was assessed via nested 10-fold cross-validation. The best-performing algorithm was externally validated using data for 454 thyroid nodules from a tertiary hospital, then compared to the Thyroid Imaging Reporting and Data System (TIRADS)–based interpretations of radiologists (American College of Radiology, European and Korean TIRADS, and AACE/ACE/AME guidelines).
Results
The area under the receiver operating characteristic (AUROC) curves of the algorithms ranged from 0.773 to 0.862. The sensitivities, specificities, positive predictive values, and negative predictive values of the best-performing models were 74.1–76.6%, 80.9–83.4%, 49.2–51.9%, and 93.0–93.5%, respectively. For the external validation set, the ElasticNet values were 83.2%, 89.2%, 81.8%, and 90.1%, respectively. The corresponding TIRADS values were 66.5–85.0%, 61.3–80.8%, 45.9–72.1%, and 81.5–90.3%, respectively. The new model exhibited a significantly higher AUROC and specificity than did the TIRADS risk stratification, although its sensitivity was similar.
Conclusion
We developed a reliable machine learning–based predictive model that demonstrated enhanced specificity when stratifying thyroid nodules according to malignancy risk. This system will contribute to improved personalized management of thyroid nodules.
Key Points
• The area under the receiver operating characteristic (AUROC) curve, sensitivity, and specificity of our model were 0.914, 83.2%, and 89.2%, respectively (derived using the validation dataset).
• Compared to the TIRADS values, the AUROC and specificity are significantly higher, while the sensitivity is similar.
• An interactive version of our AI algorithm is at http://tirads.cdss.co.kr .
Similar content being viewed by others
Abbreviations
- AI:
-
Artificial intelligence
- AUROC:
-
Area under the receiver operating characteristic
- CI:
-
Confidence interval
- FNA:
-
Fine-needle aspiration
- LASSO:
-
Least Absolute Shrinkage and Selection Operator
- ML:
-
Machine learning
- NPV:
-
Negative predictive value
- PPV:
-
Positive predictive value
- RF:
-
Random forest
- RSS:
-
Risk stratification systems
- TIRADS:
-
Thyroid Imaging Reporting and Data System
- US:
-
Ultrasonography
- XGBoost:
-
Extreme gradient boosting
References
Ha EJ, Chung SR, Na DG et al (2021) 2021 Korean Thyroid Imaging Reporting and Data System and Imaging-Based Management of Thyroid Nodules: Korean Society of Thyroid Radiology Consensus Statement and Recommendations. Korean J Radiol 22:2094–2123
Tessler FN, Middleton WD, Grant EG et al (2017) ACR Thyroid Imaging, Reporting and Data System (TI-RADS): White Paper of the ACR TI-RADS Committee. J Am Coll Radiol 14:587–595
Gharib H, Papini E, Garber JR et al (2016) American Association of Clinical Endocrinologists, American College of Endocrinology, and Associazione Medici Endocrinologi Medical Guidelines for Clinical Practice for the Diagnosis and Management of Thyroid Nodules--2016 Update. Endocr Pract 22:622–639
Russ G, Bonnema SJ, Erdogan MF, Durante C, Ngu R, Leenhardt L (2017) European Thyroid Association Guidelines for Ultrasound Malignancy Risk Stratification of Thyroid Nodules in Adults: The EU-TIRADS. Eur Thyroid J 6:225–237
Ha EJ, Baek JH (2021) Applications of machine learning and deep learning to thyroid imaging: where do we stand? Ultrasonography 40:23–29
Ha EJ, Baek JH, Na DG (2017) Risk stratification of thyroid nodules on ultrasonography: current status and perspectives. Thyroid 27:1463–1468
Kwak JY, Jung I, Baek JH et al (2013) Image reporting and characterization system for ultrasound features of thyroid nodules: multicentric Korean retrospective study. Korean J Radiol 14:110–117
Choi YJ, Baek JH, Baek SH et al (2015) Web-based malignancy risk estimation for thyroid nodules using ultrasonography characteristics: development and validation of a predictive model. Thyroid 25:1306–1312
Li LR, Du B, Liu HQ, Chen C (2020) Artificial intelligence for personalized medicine in thyroid cancer: current status and future perspectives. Front Oncol 10:604051
Peng S, Liu Y, Lv W et al (2021) Deep learning-based artificial intelligence model to assist thyroid nodule diagnosis and management: a multicentre diagnostic study. Lancet Digit Health 3:e250–e259
Han M, Ha EJ, Park JH (2021) Computer-aided diagnostic system for thyroid nodules on ultrasonography: diagnostic performance based on the thyroid imaging reporting and data system classification and dichotomous outcomes. AJNR Am J Neuroradiol 42:559–565
Kim HL, Ha EJ, Han M (2019) Real-world performance of computer-aided diagnosis system for thyroid nodules using ultrasonography. Ultrasound Med Biol 45:2672–2678
Yoo YJ, Ha EJ, Cho YJ, Kim HL, Han M, Kang SY (2018) Computer-aided diagnosis of thyroid nodules via ultrasonography: initial clinical experience. Korean J Radiol 19:665–672
Nguyen DT, Kang JK, Pham TD, Batchuluun G, Park KR (2020) Ultrasound image-based diagnosis of malignant thyroid nodule using artificial intelligence. Sensors (Basel) 20:1822
Thomas J, Haertling T (2020) AIBx, artificial intelligence model to risk stratify thyroid nodules. Thyroid 30:878–884
Thomas J, Ledger GA, Mamillapalli CK (2020) Use of artificial intelligence and machine learning for estimating malignancy risk of thyroid nodules. Curr Opin Endocrinol Diabetes Obes 27:345–350
Ha EJ, Shin JH, Na DG et al (2021) Comparison of the diagnostic performance of the modified Korean Thyroid Imaging Reporting and Data System for thyroid malignancy with three international guidelines. Ultrasonography 40:594–601
Chung SR, Ahn HS, Choi YJ et al (2021) Diagnostic performance of the Modified Korean Thyroid Imaging Reporting and Data System for Thyroid Malignancy: a multicenter validation study. Korean J Radiol 22:1579–1586
Robin X, Turck N, Hainard A et al (2011) pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics 12:77
O'Brien R, Ishwaran H (2019) A random forests quantile classifier for class imbalanced data. Pattern Recognit 90:232–249
Friedman J, Hastie T, Tibshirani R (2010) Regularization paths for generalized linear models via coordinate descent. J Stat Softw 33:1–22
Tianqi Chen CG (2016) Xgboost: a scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data
Meyer D, Dimitriadou E, Hornik K, Weingessel A, Leisch F, Chang CC et al (2014) e1071: Misc Functions of the Department of Statistics (e1071), TU Wien. R package version 13
Kim EK, Park CS, Chung WY et al (2002) New sonographic criteria for recommending fine-needle aspiration biopsy of nonpalpable solid nodules of the thyroid. AJR Am J Roentgenol 178:687–691
Middleton WD, Teefey SA, Reading CC et al (2017) Multiinstitutional analysis of thyroid nodule risk stratification using the American College of Radiology Thyroid Imaging Reporting and Data System. AJR Am J Roentgenol 208:1331–1341
Chang Y, Paul AK, Kim N et al (2016) Computer-aided diagnosis for classifying benign versus malignant thyroid nodules based on ultrasound images: a comparison with radiologist-based assessments. Med Phys 43:554
Wu G, Lv W, Yin R et al (2021) Deep learning based on ACR TI-RADS can improve the differential diagnosis of thyroid nodules. Front Oncol 11:575166
Zhang S, Du H, Jin Z et al (2020) A novel interpretable computer-aided diagnosis system of thyroid nodules on ultrasound based on clinical experience. IEEE Access 8:53223–53231
Jin Z, Zhu Y, Zhang S et al (2021) Diagnosis of thyroid cancer using a TI-RADS-based computer-aided diagnosis system: a multicenter retrospective study. Clin Imaging 30:43–49
Funding
The authors state that this work was supported by the National Research Foundation of Korea (NRF) grant by the Korea government (MSIT) (#2021R1C1C100698711), by the 2018 intramural research fund of Ajou University Medical Center, and by grant No 03-2021-2230 from the SNUH Research Fund.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Guarantor
The scientific guarantor of this publication is Ji-hoon Kim.
Conflict of interest
The authors of this manuscript declare no relationships with any companies whose products or services may be related to the subject matter of the article.
Statistics and biometry
No complex statistical methods were necessary for this paper.
Informed consent
Written informed consent was waived by the Institutional Review Board.
Ethical approval
This study was approved by our institutional review board.
Methodology
-
retrospective
-
multicenter cohort study
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
ESM 1
(DOCX 36 kb)
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Ha, E.J., Lee, J.H., Lee, D.H. et al. Development of a machine learning-based fine-grained risk stratification system for thyroid nodules using predefined clinicoradiological features. Eur Radiol 33, 3211–3221 (2023). https://doi.org/10.1007/s00330-022-09376-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00330-022-09376-0