Addressing Dataset Shift for Trustworthy Deep Learning Diagnostic Ultrasound Decision Support

Zhu, Calvin; Noseworthy, Michael D.; Doyle, Thomas E.

doi:10.1007/978-3-662-67868-8_7

Part of the book series: Lecture Notes in Computer Science ((TCOMPUTATSCIE,volume 13850))

153 Accesses

Abstract

Ultrasound (US) is the most widely used medical imaging modality due to its low cost, portability, real time imaging ability and use of non-ionizing radiation. However, unlike other imaging modalities such as CT or MRI, it is a heavily operator dependent, requiring trained expertise to leverage these benefits.

Recently there has been an explosion of interest in AI across the medical community and many are turning to the growing trend of deep learning (DL) models to assist in diagnosis. However, due to possible differences in training and deployment, model performance suffers which can lead to misdiagnosis and operator hesitancy. This issue is known as dataset shift. Two aims to address dataset shift were proposed. The first was to quantify how US operator skill and hardware affects acquired images. The second was to use this skill quantification method to screen and match data to deep learning models to improve performance.

A CAE Healthcare BLUE phantom with mock lesions was scanned by three operators using three different US systems (Siemens S3000, Clarius L15, and Ultrasonix SonixTouch) producing 39013 images. DL models were trained on a specific set to classify the presence of a simulated tumour and tested with data from differing sets. Principle Component Analysis (PCA) for dimension reduction was applied, then K-Means clustering was used to separate images generated by operator and hardware into clusters. This clustering algorithm was then used to screen incoming images during deployment to best match input to an appropriate DL model which is trained specifically to classify that type of operator or hardware.

Results showed a noticeable difference when models were given data from differing datasets with the largest accuracy drop being 81.26% to 31.26%. Overall, operator differences more significantly affected DL model performance. Clustering models had much higher success separating hardware data compared to operator data. The proposed method reflects this result with a much higher accuracy across the hardware test set compared to the operator data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 49.99; Price excludes VAT (USA)

Softcover Book: USD 64.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Akkus, Z., et al.: A survey of deep-learning applications in ultrasound: artificial intelligence-powered ultrasound for improving clinical workflow. J. Am. College of Radiol. 16(9, Part B), 1318–1328 (2019). https://doi.org/10.1016/j.jacr.2019.06.004, https://www.sciencedirect.com/science/article/pii/S1546144019307112, special Issue: Quality and Data Science
Aldrich, J.E.: Basic physics of ultrasound imaging. Critical Care Med. 35(5) (2007). https://journals.lww.com/ccmjournal/Fulltext/2007/05001/Basic_physics_of_ultrasound_imaging.3.aspx
Carovac, A., Smajlovic, F., Junuzovic, D.: Application of ultrasound in medicine. Acta informatica medica: AIM: J. Soc. Med. Inform. Bosnia Herzegovina: casopis Drustva za medicinsku informatiku BiH 19(3), 168–171 (Sep 2011). https://doi.org/10.5455/aim.2011.19.168-171, https://pubmed.ncbi.nlm.nih.gov/23408755, 23408755[pmid]
Chan, V., Perlas, A.: Basics of Ultrasound Imaging, pp. 13–19. Springer, New York, New York, NY (2011). https://doi.org/10.1007/978-1-4419-1681-5_2
Géron, A.: Hands-on machine learning with Scikit-Learn and TENSORFLOW: concepts, tools, and techniques to build intelligent systems. O’Reilly Media, Inc. (2019)
Google Scholar
LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521(7553), 436–444 (2015). https://doi.org/10.1038/nature14539
Article Google Scholar
Liu, S., et al.: Deep learning in medical ultrasound analysis: a review. Engineering 5(2), 261–275 (2019). https://doi.org/10.1016/j.eng.2018.11.020, https://www.sciencedirect.com/science/article/pii/S2095809918301887
Moreno-Torres, J.G., Raeder, T., Alaiz-Rodríguez, R., Chawla, N.V., Herrera, F.: A unifying view on dataset shift in classification. Pattern Recogn. 45(1), 521–530 (2012). https://doi.org/10.1016/j.patcog.2011.06.019, https://www.sciencedirect.com/science/article/pii/S0031320311002901
Park, S.H.: Artificial intelligence for ultrasonography: unique opportunities and challenges. Ultrasonography (Seoul, Korea) 40(1), 3–6 (2021). https://pubmed.ncbi.nlm.nih.gov/33227844, 33227844[pmid]
Park, V.Y., et al.: Diagnosis of thyroid nodules: performance of a deep learning convolutional neural network model vs. radiologists. Sci. Rep. 9(1), 17843 (2019). https://doi.org/10.1038/s41598-019-54434-1
Pinto, A., et al.: Sources of error in emergency ultrasonography. Crit. Ultrasound J. 5(1), S1 (2013). https://doi.org/10.1186/2036-7902-5-S1-S1
Article MathSciNet Google Scholar
Rousseeuw, P.J.: Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20, 53–65 (1987). https://doi.org/10.1016/0377-0427(87)90125-7. https://www.sciencedirect.com/science/article/pii/0377042787901257

Download references

Author information

Authors and Affiliations

School of Biomedical Engineering, McMaster University, Hamilton, ON, Canada
Calvin Zhu, Michael D. Noseworthy & Thomas E. Doyle
Imaging Research Centre, St. Joseph’s Healthcare Hamilton, Hamilton, ON, Canada
Calvin Zhu & Michael D. Noseworthy
Department of Electrical and Computer Engineering, McMaster University, Hamilton, ON, Canada
Michael D. Noseworthy & Thomas E. Doyle
Vector Institute of Artificial Intelligence, Toronto, ON, Canada
Thomas E. Doyle
Department of Radiology, McMaster University, Hamilton, ON, Canada
Michael D. Noseworthy

Authors

Calvin Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Michael D. Noseworthy
View author publications
You can also search for this author in PubMed Google Scholar
Thomas E. Doyle
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Thomas E. Doyle .

Editor information

Editors and Affiliations

University of Calgary, Calgary, AB, Canada
Marina Gavrilova
Sardina Systems OÜ, Tallinn, Estonia
C. J. Kenneth Tan
McGill University, Montreal, QC, Canada
Mark Coates
University of Calgary, Calgary, AB, Canada
Yaoping Hu
University of Calgary, Calgary, AB, Canada
Henry Leung
Concordia University, Montreal, QC, Canada
Arash Mohammadi
University of Toronto, Toronto, ON, Canada
Konstantinos N. Plataniotis
University of Calgary, Calgary, AB, Canada
Helder Rodrigues de Oliveira

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Zhu, C., Noseworthy, M.D., Doyle, T.E. (2023). Addressing Dataset Shift for Trustworthy Deep Learning Diagnostic Ultrasound Decision Support. In: Gavrilova, M., et al. Transactions on Computational Science XL. Lecture Notes in Computer Science(), vol 13850. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-67868-8_7

Download citation

DOI: https://doi.org/10.1007/978-3-662-67868-8_7
Published: 21 August 2023
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-67867-1
Online ISBN: 978-3-662-67868-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Addressing Dataset Shift for Trustworthy Deep Learning Diagnostic Ultrasound Decision Support