The Effect of the Loss on Generalization: Empirical Study on Synthetic Lung Nodule Data

Baltatzis, Vasileios; Le Folgoc, Loïc; Ellis, Sam; Manzanera, Octavio E. Martinez; Bintsi, Kyriaki-Margarita; Nair, Arjun; Desai, Sujal; Glocker, Ben; Schnabel, Julia A.

doi:10.1007/978-3-030-87444-5_6

The Effect of the Loss on Generalization: Empirical Study on Synthetic Lung Nodule Data

Vasileios Baltatzis^15,16,
Loïc Le Folgoc¹⁶,
Sam Ellis¹⁵,
Octavio E. Martinez Manzanera¹⁵,
Kyriaki-Margarita Bintsi¹⁶,
Arjun Nair¹⁷,
Sujal Desai¹⁸,
Ben Glocker¹⁶ &
…
Julia A. Schnabel^15,19,20

Conference paper
First Online: 21 September 2021

1164 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 12929))

Abstract

Convolutional Neural Networks (CNNs) are widely used for image classification in a variety of fields, including medical imaging. While most studies deploy cross-entropy as the loss function in such tasks, a growing number of approaches have turned to a family of contrastive learning-based losses. Even though performance metrics such as accuracy, sensitivity and specificity are regularly used for the evaluation of CNN classifiers, the features that these classifiers actually learn are rarely identified and their effect on the classification performance on out-of-distribution test samples is insufficiently explored. In this paper, motivated by the real-world task of lung nodule classification, we investigate the features that a CNN learns when trained and tested on different distributions of a synthetic dataset with controlled modes of variation. We show that different loss functions lead to different features being learned and consequently affect the generalization ability of the classifier on unseen data. This study provides some important insights into the design of deep learning solutions for medical imaging tasks.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 44.99; Price excludes VAT (USA)

Softcover Book: USD 59.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Dou, Q., Castro, D.C., Kamnitsas, K., Glocker, B.: Domain generalization via model-agnostic learning of semantic features. Adv Neural Inf. Process. Syst. 32 (2019), https://github.com/biomedia-mira/masf. arXiv:1910.13580
Hadsell, R., Chopra, S., LeCun, Y.: Dimensionality reduction by learning an invariant mapping. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. vol. 2, pp. 1735–1742 (2006). https://doi.org/10.1109/CVPR.2006.100, https://ieeexplore.ieee.org/document/1640964
Kingma, D.P., Ba, J.L.: Adam: a method for stochastic optimization. In: 3rd International Conference on Learning Representations (ICLR 2015) - Conference Track Proceedings (2015)
Google Scholar
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2323 (1998). https://doi.org/10.1109/5.726791
McWilliams, A., Tammemagi, M.C., Mayo, J.R., et al.: Probability of cancer in pulmonary nodules detected on first screening CT. New Engl. J. Med. 369(10), 910–919 (2013). https://doi.org/10.1056/NEJMoa1214726, https://doi.org/10.1056/NEJMoa1214726
Paszke, A., Gross, S., Massa, F., et al.: PyTorch: an imperative style, high-performance deep learning library. In: Advances in Neural Information Processing Systems, vol. 32 (2019)
Google Scholar
Simonyan, K., Vedaldi, A., Zisserman, A.: Deep inside convolutional networks: visualising image classification models and saliency maps. In: 2nd International Conference on Learning Representations (ICLR 2014) - Workshop Track Proceedings (2014). http://code.google.com/p/cuda-convnet/
Winkens, J., Bunel, R., Roy, A.G., et al.: Contrastive training for improved out-of-distribution detection. arXiv preprint 2007.05566 (2020), arXiv:2007.05566

Download references

Acknowledgments

This work is funded by the King’s College London & Imperial College London EPSRC Centre for Doctoral Training in Medical Imaging (EP/L015226/1), EPSRC grant EP/023509/1, the Wellcome/EPSRC Centre for Medical Engineering (WT 203148/Z/16/Z), and the UKRI London Medical Imaging & Artificial Intelligence Centre for Value Based Healthcare. The Titan Xp GPU was donated by the NVIDIA Corporation.

Author information

Authors and Affiliations

School of Biomedical Engineering and Imaging Sciences, King’s College London, London, UK
Vasileios Baltatzis, Sam Ellis, Octavio E. Martinez Manzanera & Julia A. Schnabel
BioMedIA, Department of Computing, Imperial College London, London, UK
Vasileios Baltatzis, Loïc Le Folgoc, Kyriaki-Margarita Bintsi & Ben Glocker
Department of Radiology, University College London, London, UK
Arjun Nair
The Royal Brompton and Harefield NHS Foundation Trust, London, UK
Sujal Desai
Technical University of Munich, Munich, Germany
Julia A. Schnabel
Helmholtz Center Munich, Munich, Germany
Julia A. Schnabel

Authors

Vasileios Baltatzis
View author publications
You can also search for this author in PubMed Google Scholar
Loïc Le Folgoc
View author publications
You can also search for this author in PubMed Google Scholar
Sam Ellis
View author publications
You can also search for this author in PubMed Google Scholar
Octavio E. Martinez Manzanera
View author publications
You can also search for this author in PubMed Google Scholar
Kyriaki-Margarita Bintsi
View author publications
You can also search for this author in PubMed Google Scholar
Arjun Nair
View author publications
You can also search for this author in PubMed Google Scholar
Sujal Desai
View author publications
You can also search for this author in PubMed Google Scholar
Ben Glocker
View author publications
You can also search for this author in PubMed Google Scholar
Julia A. Schnabel
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Vasileios Baltatzis .

Editor information

Editors and Affiliations

University of Bern, Bern, Switzerland
Mauricio Reyes
CISUC, FCTUC, Coimbra, Portugal
Pedro Henriques Abreu
INESC, FEUP, Porto, Portugal
Jaime Cardoso
Santa Clara University, Santa Clara, CA, USA
Mustafa Hajij
National Institutes of Health, Bethesda, MD, USA
Ghada Zamzmi
Massachusetts General Hospital, Harvard, Boston, MA, USA
Paul Rahul
Broad Institute of MIT and Harvard, Cambridge, MA, USA
Lokendra Thakur

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Baltatzis, V. et al. (2021). The Effect of the Loss on Generalization: Empirical Study on Synthetic Lung Nodule Data. In: Reyes, M., et al. Interpretability of Machine Intelligence in Medical Image Computing, and Topological Data Analysis and Its Applications for Medical Data. IMIMIC TDA4MedicalData 2021 2021. Lecture Notes in Computer Science(), vol 12929. Springer, Cham. https://doi.org/10.1007/978-3-030-87444-5_6

Download citation

DOI: https://doi.org/10.1007/978-3-030-87444-5_6
Published: 21 September 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-87443-8
Online ISBN: 978-3-030-87444-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The Medical Image Computing and Computer Assisted Intervention Society (opens in a new tab)