Abstract
The complex shape of the foot, consisting of 26 bones, variable ligaments, tendons, and muscles leads to misdiagnosis of foot fractures. Despite the introduction of artificial intelligence (AI) to diagnose fractures, the accuracy of foot fracture diagnosis is lower than that of conventional methods. We developed an AI assistant system that assists with consistent diagnosis and helps interns or non-experts improve their diagnosis of foot fractures, and compared the effectiveness of the AI assistance on various groups with different proficiency. Contrast-limited adaptive histogram equalization was used to improve the visibility of original radiographs and data augmentation was applied to prevent overfitting. Preprocessed radiographs were fed to an ensemble model of a transfer learning-based convolutional neural network (CNN) that was developed for foot fracture detection with three models: InceptionResNetV2, MobilenetV1, and ResNet152V2. After training the model, score class activation mapping was applied to visualize the fracture based on the model prediction. The prediction result was evaluated by the receiver operating characteristic (ROC) curve and its area under the curve (AUC), and the F1-Score. Regarding the test set, the ensemble model exhibited better classification ability (F1-Score: 0.837, AUC: 0.95, Accuracy: 86.1%) than other single models that showed an accuracy of 82.4%. With AI assistance for the orthopedic fellow, resident, intern, and student group, the accuracy of each group improved by 3.75%, 7.25%, 6.25%, and 7% respectively and diagnosis time was reduced by 21.9%, 14.7%, 24.4%, and 34.6% respectively.
Similar content being viewed by others
References
Naser A, Osama A (2016) A proposed expert system for foot diseases diagnosis. Am J Innov Res Appl Sci 2:160–173
Judd DB, Kim DH (2002) Foot fractures frequently misdiagnosed as ankle sprains. Am Fam Physician 66:785–794
Wei CJ, Tsai WC, Tiu CM, Wu HT, Chiou HJ, Chang CY (2006) Systematic analysis of missed extremity fractures in emergency radiology. Acta Radiol 47:710–717. https://doi.org/10.1080/02841850600806340
Bruno MA, Walker EA, Abujudeh HH (2015) Understanding and confronting our mistakes: the epidemiology of error in radiology and strategies for error reduction. Radiographics 35:1668–1676. https://doi.org/10.1148/rg.2015150023
Alhasan M, Hasaneen M (2021) Digital imaging, technologies and artificial intelligence applications during COVID-19 pandemic. Comput Med Imaging Graph 91:101933. https://doi.org/10.1016/j.compmedimag.2021.101933
Lalehzarian SP, Gowd AK, Liu JN (2021) Machine learning in orthopaedic surgery. World J Orthop 12:685–699. https://doi.org/10.5312/wjo.v12.i9.685
Zhang C, Lu Y (2021) Study on artificial intelligence: the state of the art and future prospects. J Ind Inf Integr 23:100224. https://doi.org/10.1016/j.jii.2021.100224
Gupta A, Anpalagan A, Guan L, Khwaja AS (2021) Deep learning for object detection and scene perception in self-driving cars: survey, challenges, and open issues. Array 10:100057. https://doi.org/10.1016/j.array.2021.100057
Pal SK, Pramanik A, Maiti J, Mitra P (2021) Deep learning in multi-object detection and tracking: state of the art. Appl Intell 51:6400–6429. https://doi.org/10.1007/s10489-021-02293-7
Hosny A, Parmar C, Quackenbush J, Schwartz LH, Aerts HJWL (2018) Artificial intelligence in radiology. Nat Rev Cancer 18:500–510. https://doi.org/10.1038/s41568-018-0016-5
Thian YL, Li Y, Jagmohan P, Sia D, Chan VEY, Tan RT (2019) Convolutional neural networks for automated fracture detection and localization on wrist radiographs. Radiol Artif Intell 1:e180001. https://doi.org/10.1148/ryai.2019180001
Raghavendra U, Bhat NS, Gudigar A, Acharya UR (2018) Automated system for the detection of thoracolumbar fractures using a CNN architecture. Futur Gener Comput Syst 85:184–189. https://doi.org/10.1016/j.future.2018.03.023
Kuang Z, Deng X, Yu L, Zhang H, Lin X, Ma H (2020) Skull R-CNN: a CNN-based network for the skull fracture detection. Med Imaging Deep Learn PMLR 382–392
Kim JH, Mo YC, Choi SM, Hyun Y, Lee JW (2021) Detecting ankle fractures in plain radiographs using deep learning with accurately labeled datasets aided by computed tomography: a retrospective observational study. Appl Sci 11. https://doi.org/10.3390/app11198791
Shelmerdine SC, White RD, Liu H, Arthurs OJ, Sebire NJ (2022) Artificial intelligence for radiological paediatric fracture assessment: a systematic review. Insights Imaging. https://doi.org/10.1186/s13244-022-01234-3
Dupuis M, Delbos L, Veil R, Adamsbaum C (2022) External validation of a commercially available deep learning algorithm for fracture detection in children. Diagn Interv Imaging 103:151–159. https://doi.org/10.1016/j.diii.2021.10.007
Guermazi A, Tannoury C, Kompel AJ, Murakami AM, Ducarouge A, Gillibert A, Li X, Tournier A, Lahoud Y, Jarraya M, Lacave E, Rahimi H, Pourchot A, Parisien RL, Merritt AC, Comeau D, Regnard NE, Hayashi D (2022) Improving radiographic fracture recognition performance and efficiency using artificial intelligence. Radiology 302:627–636. https://doi.org/10.1148/radiol.210937
Jones RM, Sharma A, Hotchkiss R, Sperling JW, Hamburger J, Ledig C, O’Toole R, Gardner M, Venkatesh S, Roberts MM, Sauvestre R, Shatkhin M, Gupta A, Chopra S, Kumaravel M, Daluiski A, Plogger W, Nascone J, Potter HG, Lindsey RV (2020) Assessment of a deep-learning system for fracture detection in musculoskeletal radiographs. npj Digit Med 3:1–6. https://doi.org/10.1038/s41746-020-00352-w
Lindsey R, Daluiski A, Chopra S, Lachapelle A, Mozer M, Sicular S, Hanel D, Gardner M, Gupta A, Hotchkiss R, Potter H (2018) Deep neural network improves fracture detection by clinicians. Proc Natl Acad Sci USA 115:11591–11596. https://doi.org/10.1073/pnas.1806905115
Alizadehsani R, Roshanzamir M, Hussain S, Khosravi A, Koohestani A, Zangooei MH, Abdar M, Beykikhoshk A, Shoeibi A, Zare A, Panahiazar M, Nahavandi S, Srinivasan D, Atiya AF, Acharya UR (2021) Handling of uncertainty in medical data using machine learning and probability theory techniques: a review of 30 years (1991–2020). Springer US
Varma M, Lu M, Gardner R, Dunnmon J, Khandwala N, Rajpurkar P, Long J, Beaulieu C, Shpanskaya K, Fei-Fei L, Lungren MP, Patel BN (2019) Automated abnormality detection in lower extremity radiographs using deep learning. Nat Mach Intell 1:578–583. https://doi.org/10.1038/s42256-019-0126-0
Lin YJ, Chung IF (2019) Medical data augmentation using generative adversarial networks: x-ray image generation for transfer learning of hip fracture detection. In: Proceedings—2019 international conference on technologies and applications of artificial intelligence (TAAI). pp 14–18. https://doi.org/10.1109/TAAI48200.2019.8959908
Kitamura G, Chung CY, Moore BE (2019) Ankle fracture detection utilizing a convolutional neural network ensemble implemented with a small sample, de novo training, and multiview incorporation. J Digit Imaging 32:672–677. https://doi.org/10.1007/s10278-018-0167-7
Hardalaç F, Uysal F, Peker O, Çiçeklidağ M, Tolunay T, Tokgöz N, Kutbay U, Demirciler B, Mert F (2022) Fracture detection in wrist X-ray images using deep learning-based object detection models. Sensors. https://doi.org/10.3390/s22031285
Kandel I, Castelli M, Popovič A (2021) Comparing stacking ensemble techniques to improve musculoskeletal fracture image classification. J Imaging. https://doi.org/10.3390/JIMAGING7060100
Kim JY, Kim JM (2020) Bearing fault diagnosis using grad-CAM and acoustic emission signals. Appl Sci. https://doi.org/10.3390/app10062050
Menikdiwela M, Nguyen C, Li H, Shaw M (2018) CNN-based small object detection and visualization with feature activation mapping. In: International conference on image and vision computing New Zealand 2017-December. pp 1–5. https://doi.org/10.1109/IVCNZ.2017.8402455
Zhang Y, Hong D, McClement D, Oladosu O, Pridham G, Slaney G (2021) Grad-CAM helps interpret the deep learning models trained to classify multiple sclerosis types using clinical brain magnetic resonance imaging. J Neurosci Methods 353:109098. https://doi.org/10.1016/j.jneumeth.2021.109098
Cheng CT, Chen CC, Fu CY, Chaou CH, Wu YT, Hsu CP, Chang CC, Chung IF, Hsieh CH, Hsieh MJ, Liao CH (2020) Artificial intelligence-based education assists medical students’ interpretation of hip fracture. Insights Imaging. https://doi.org/10.1186/s13244-020-00932-0
Meng XH, Wu DJ, Wang Z, Ma XL, Dong XM, Liu AE, Chen L (2021) A fully automated rib fracture detection system on chest CT images and its impact on radiologist performance. Skelet Radiol 50:1821–1828. https://doi.org/10.1007/s00256-021-03709-8
Ren Y, Zhang L, Suganthan PN (2016) Ensemble classification and regression-recent developments, applications and future directions [review article]. IEEE Comput Intell Mag 11:41–53. https://doi.org/10.1109/MCI.2015.2471235
Wang H, Wang Z, Du M, Yang F, Zhang Z, Ding S, Mardziel P, Hu X (2020) Score-CAM: score-weighted visual explanations for convolutional neural networks. In: IEEE computer society conference on computer vision and pattern recognition work 2020-June. pp 111–119. https://doi.org/10.1109/CVPRW50498.2020.00020
Zuiderveld K (1994) VIII.5.—contrast limited adaptive histogram equalization. In: Heckbert PS (ed) Graphics gems. Academic Press, San Diego, pp 474–485
Chollet F (2015) Keras: deep learning for humans. In: Keras. https://github.com/fchollet/keras
Deng J, Dong W, Socher R, Li L-J, Li K, Li F-F (2009) ImageNet: a large-scale hierarchical image database. pp 248–255. https://doi.org/10.1109/cvprw.2009.5206848
Szegedy C, Ioffe S, Vanhoucke V, Alemi AA (2017) Inception-v4, inception-ResNet and the impact of residual connections on learning. In: 31st AAAI conference on artificial intelligence (AAAI). pp 4278–4284
Howard AG, Zhu M, Chen B, Kalenichenko D, Wang W, Weyand T, Andreetto M, Adam H (2017) Mobilenets: efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition 2016-December. pp 770–778. https://doi.org/10.1109/CVPR.2016.90
Huertas-Tato J, Martín A, Camacho D (2020) Cloud type identification using data fusion and ensemble learning. Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics) 12490 LNCS. pp 137–147. https://doi.org/10.1007/978-3-030-62365-4_13
Rafik HD (2022) Classification and detection of covid-19 in human respiratory lungs using convolutional neural network architectures. pp 1–10. https://doi.org/10.1109/ai-csp52968.2021.9671158
Rezaeijo SM, Ghorvei M, Mofid B (2021) Predicting breast cancer response to neoadjuvant chemotherapy using ensemble deep transfer learning based on CT images. J Xray Sci Technol 29:835–850. https://doi.org/10.3233/XST-210910
Kingma DP, Ba JL (2015) Adam: a method for stochastic optimization. In: 3rd international conference on learning representations ICLR 2015—conference track proceedings. pp 1–15
Ju C, Bibaut A, van der Laan M (2018) The relative performance of ensemble methods with deep convolutional neural networks for image classification. J Appl Stat 45:2800–2818. https://doi.org/10.1080/02664763.2018.1441383
Huang G, Liu Z, Van Der Maaten L, Weinberger KQ (2017) Densely connected convolutional networks. In: Proceedings—30th IEEE conference on computer vision pattern recognition (CVPR) 2017-January. pp 2261–2269. https://doi.org/10.1109/CVPR.2017.243
Chollet F (2017) Xception: deep learning with depthwise separable convolutions. In: Proceedings—30th IEEE conference on computer vision pattern recognition (CVPR) 2017-January. pp 1800–1807. https://doi.org/10.1109/CVPR.2017.195
Zhang X, Yang Y, Shen YW, Zhang KR, Jiang Z, kun, Ma LT, Ding C, Wang BY, Meng Y, Liu H (2022) Diagnostic accuracy and potential covariates of artificial intelligence for diagnosing orthopedic fractures: a systematic literature review and meta-analysis. Eur Radiol 32:7196–7216. https://doi.org/10.1007/s00330-022-08956-4
Choi J, Hui JZ, Spain D, Su YS, Cheng CT, Liao CH (2021) Practical computer vision application to detect hip fractures on pelvic X-rays: a bi-institutional study. Trauma Surg Acute Care Open 6:1–4. https://doi.org/10.1136/tsaco-2021-000705
Hendrix N, Scholten E, Vernhout B, Bruijnen S, Maresch B, de Jong M, Diepstraten S, Bollen S, Schalekamp S, de Rooij M, Scholtens A, Hendrix W, Samson T, Ong LLS, Postma E, van Ginneken B, Rutten M (2021) Development and validation of a convolutional neural network for automated detection of scaphoid fractures on conventional radiographs. Radiol Artif Intell. https://doi.org/10.1148/ryai.2021200260
Avrahami D, Pajaczkowski JA (2012) Femoral neck stress fracture in a female athlete: a case report. J Chiropr Med 11:273–279. https://doi.org/10.1016/j.jcm.2012.05.010
Varoquaux G, Cheplygina V (2022) Machine learning for medical imaging: methodological failures and recommendations for the future. npj Digit Med. https://doi.org/10.1038/s41746-022-00592-y
Roberts M, Driggs D, Thorpe M, Gilbey J, Yeung M, Ursprung S, Aviles-Rivero AI, Etmann C, McCague C, Beer L, Weir-McCall JR, Teng Z, Gkrania-Klotsas E, Ruggiero A, Korhonen A, Jefferson E, Ako E, Langs G, Gozaliasl G, Yang G, Prosch H, Preller J, Stanczuk J, Tang J, Hofmanninger J, Babar J, Sánchez LE, Thillai M, Gonzalez PM, Teare P, Zhu X, Patel M, Cafolla C, Azadbakht H, Jacob J, Lowe J, Zhang K, Bradley K, Wassin M, Holzer M, Ji K, Ortet MD, Ai T, Walton N, Lio P, Stranks S, Shadbahr T, Lin W, Zha Y, Niu Z, Rudd JHF, Sala E, Schönlieb CB (2021) Common pitfalls and recommendations for using machine learning to detect and prognosticate for COVID-19 using chest radiographs and CT scans. Nat Mach Intell 3:199–217. https://doi.org/10.1038/s42256-021-00307-0
Acknowledgements
This work was supported by the Promoting AI Collaboration Project Research Fund (1.210129.01) of Ulsan National Institute of Science & Technology (UNIST) and the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (Grant Nos. 2021M2D2A1A01050059 and 2021R1F1A1046079) and Biomedical Research Institute Grant (20210017), Pusan National University Hospital.
Funding
This work was supported by Ulsan National Institute of Science and Technology (Grant No. 1.210129.01), Ministry of Science and ICT, South Korea (Grant Nos. 2021M2D2A1A01050059, 2021R1F1A1046079), Pusan National University Hospital (Grant No. 20210017), Ministry of SMEs and Startups, South Korea (Grant No. S3248116), Research Institute of Industrial Science and Technology (Grant No. 2.220971.01), and Ministry of Trade, Industry and Energy, South Korea (Grant No. 20017502).
Author information
Authors and Affiliations
Contributions
TK: Conceptualization, Methodology, Software, Validation, Formal analysis, Investigation, Data curation, Writing—Original Draft, Writing—Review & Editing. TSG: Conceptualization, Methodology, Validation, Formal analysis, Investigation, Resources, Data curation, Writing—Original Draft, Writing—Review & Editing. JSL: Formal analysis. JHL: Resources, Data Curation. HK: Writing—Original Draft, Writing—Review & Editing. IDJ: Conceptualization, Methodology, Software, Validation, Formal analysis, Investigation, Data curation, Writing—Original Draft, Writing—Review & Editing, Supervision.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethical approval
All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Kim, T., Goh, T.S., Lee, J.S. et al. Transfer learning-based ensemble convolutional neural network for accelerated diagnosis of foot fractures. Phys Eng Sci Med 46, 265–277 (2023). https://doi.org/10.1007/s13246-023-01215-w
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13246-023-01215-w