Skip to main content

The Effect of the Loss on Generalization: Empirical Study on Synthetic Lung Nodule Data

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 12929))

Abstract

Convolutional Neural Networks (CNNs) are widely used for image classification in a variety of fields, including medical imaging. While most studies deploy cross-entropy as the loss function in such tasks, a growing number of approaches have turned to a family of contrastive learning-based losses. Even though performance metrics such as accuracy, sensitivity and specificity are regularly used for the evaluation of CNN classifiers, the features that these classifiers actually learn are rarely identified and their effect on the classification performance on out-of-distribution test samples is insufficiently explored. In this paper, motivated by the real-world task of lung nodule classification, we investigate the features that a CNN learns when trained and tested on different distributions of a synthetic dataset with controlled modes of variation. We show that different loss functions lead to different features being learned and consequently affect the generalization ability of the classifier on unseen data. This study provides some important insights into the design of deep learning solutions for medical imaging tasks.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   44.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   59.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Dou, Q., Castro, D.C., Kamnitsas, K., Glocker, B.: Domain generalization via model-agnostic learning of semantic features. Adv Neural Inf. Process. Syst. 32 (2019), https://github.com/biomedia-mira/masf. arXiv:1910.13580

  2. Hadsell, R., Chopra, S., LeCun, Y.: Dimensionality reduction by learning an invariant mapping. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. vol. 2, pp. 1735–1742 (2006). https://doi.org/10.1109/CVPR.2006.100, https://ieeexplore.ieee.org/document/1640964

  3. Kingma, D.P., Ba, J.L.: Adam: a method for stochastic optimization. In: 3rd International Conference on Learning Representations (ICLR 2015) - Conference Track Proceedings (2015)

    Google Scholar 

  4. LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2323 (1998). https://doi.org/10.1109/5.726791

  5. McWilliams, A., Tammemagi, M.C., Mayo, J.R., et al.: Probability of cancer in pulmonary nodules detected on first screening CT. New Engl. J. Med. 369(10), 910–919 (2013). https://doi.org/10.1056/NEJMoa1214726, https://doi.org/10.1056/NEJMoa1214726

  6. Paszke, A., Gross, S., Massa, F., et al.: PyTorch: an imperative style, high-performance deep learning library. In: Advances in Neural Information Processing Systems, vol. 32 (2019)

    Google Scholar 

  7. Simonyan, K., Vedaldi, A., Zisserman, A.: Deep inside convolutional networks: visualising image classification models and saliency maps. In: 2nd International Conference on Learning Representations (ICLR 2014) - Workshop Track Proceedings (2014). http://code.google.com/p/cuda-convnet/

  8. Winkens, J., Bunel, R., Roy, A.G., et al.: Contrastive training for improved out-of-distribution detection. arXiv preprint 2007.05566 (2020), arXiv:2007.05566

Download references

Acknowledgments

This work is funded by the King’s College London & Imperial College London EPSRC Centre for Doctoral Training in Medical Imaging (EP/L015226/1), EPSRC grant EP/023509/1, the Wellcome/EPSRC Centre for Medical Engineering (WT 203148/Z/16/Z), and the UKRI London Medical Imaging & Artificial Intelligence Centre for Value Based Healthcare. The Titan Xp GPU was donated by the NVIDIA Corporation.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Vasileios Baltatzis .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Baltatzis, V. et al. (2021). The Effect of the Loss on Generalization: Empirical Study on Synthetic Lung Nodule Data. In: Reyes, M., et al. Interpretability of Machine Intelligence in Medical Image Computing, and Topological Data Analysis and Its Applications for Medical Data. IMIMIC TDA4MedicalData 2021 2021. Lecture Notes in Computer Science(), vol 12929. Springer, Cham. https://doi.org/10.1007/978-3-030-87444-5_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-87444-5_6

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-87443-8

  • Online ISBN: 978-3-030-87444-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics