Knowledge Distillation for Semi-supervised Domain Adaptation

Orbes-Arteaga, Mauricio; Cardoso, Jorge; Sørensen, Lauge; Igel, Christian; Ourselin, Sebastien; Modat, Marc; Nielsen, Mads; Pai, Akshay

doi:10.1007/978-3-030-32695-1_8

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 11796)

Abstract

In the absence of sufficient data variation (e.g., scanner and protocol variability) in annotated data, deep neural networks (DNNs) tend to overfit during training. As a result, their performance is significantly lower on data from unseen sources than on data from the same source as the training data. Semi-supervised domain adaptation methods can alleviate this problem by tuning networks to new target domains without the need for annotated data from these domains. Adversarial domain adaptation (ADA) methods are a popular choice; they aim to train networks so that the generated features are domain agnostic. However, these methods require careful dataset-specific selection of hyperparameters, such as the complexity of the discriminator, to achieve reasonable performance. We propose to use knowledge distillation (KD), an efficient way of transferring knowledge between different DNNs, for semi-supervised domain adaptation of DNNs. It does not require dataset-specific hyperparameter tuning, making it generally applicable. The proposed method is compared to ADA for segmentation of white matter hyperintensities (WMH) in magnetic resonance imaging (MRI) scans generated by scanners that are not part of the training set. Compared with both the baseline DNN (trained on the source domain only, without any adaptation to the target domain) and with ADA for semi-supervised domain adaptation, the proposed method achieves significantly higher WMH Dice scores.
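
As a rough illustration of the idea, the sketch below computes a temperature-softened distillation loss in the style of Hinton et al.: a teacher network trained on the annotated source domain produces soft labels for an unannotated target-domain sample, and a student network is trained to match them. The abstract does not specify the loss, the temperature, or the architecture, so the function names, the temperature value, and the treatment of one voxel as a list of class logits are assumptions made for illustration, not the authors' implementation.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Numerically stable log-softmax of temperature-scaled logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    log_norm = peak + math.log(sum(math.exp(s - peak) for s in scaled))
    return [s - log_norm for s in scaled]


def softmax(logits, temperature=1.0):
    """Class probabilities; a higher temperature gives a softer distribution."""
    return [math.exp(lp) for lp in log_softmax(logits, temperature)]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """Cross-entropy between softened teacher and student distributions.

    teacher_logits: class scores from the (hypothetical) source-trained teacher
    student_logits: class scores from the student for the same target voxel
    temperature:    softening factor; 2.0 is an arbitrary illustrative value
    """
    p_teacher = softmax(teacher_logits, temperature)
    log_p_student = log_softmax(student_logits, temperature)
    return -sum(pt * lps for pt, lps in zip(p_teacher, log_p_student))


# Hypothetical two-class logits (background vs. WMH) for one target-domain voxel.
teacher = [2.0, 0.0]
agreeing_student = [2.0, 0.0]
disagreeing_student = [0.0, 2.0]

loss_agree = distillation_loss(teacher, agreeing_student)
loss_disagree = distillation_loss(teacher, disagreeing_student)
print(loss_agree < loss_disagree)
```

The loss is smallest when the student reproduces the teacher's softened distribution, so minimizing it over unannotated target-domain scans pulls the student toward the teacher's predictions without requiring target labels.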

Acknowledgements

This project has received funding from the EU H2020 under the Marie Skłodowska-Curie grant agreement No 721820.

Author information

Authors and Affiliations

Department of Computer Science, University of Copenhagen, Copenhagen, Denmark
Mauricio Orbes-Arteaga, Lauge Sørensen, Christian Igel & Mads Nielsen
Cerebriu A/S, Copenhagen, Denmark
Mauricio Orbes-Arteaga & Akshay Pai
Biomediq A/S, Copenhagen, Denmark
Mauricio Orbes-Arteaga, Lauge Sørensen & Mads Nielsen
King’s College London, London, UK
Mauricio Orbes-Arteaga, Jorge Cardoso, Sebastien Ourselin & Marc Modat

Corresponding author

Correspondence to Mauricio Orbes-Arteaga.

Editor information

Editors and Affiliations

University of Sydney, Sydney, NSW, Australia
Luping Zhou
University of Rennes 1, Rennes, France
Duygu Sarikaya
Radboud University Medical Center, Nijmegen, The Netherlands
Seyed Mostafa Kia
National Center for Tumor Diseases (NCT/UCC), Dresden, Germany
Stefanie Speidel
Malone Center for Engineering in Healthcare, Johns Hopkins University, Baltimore, MD, USA
Anand Malpani
Harvard Medical School, Massachusetts General Hospital, Boston, MA, USA
Daniel Hashimoto
University of Pennsylvania, Philadelphia, PA, USA
Mohamad Habes
Umeå University, Umeå, Sweden
Tommy Löfstedt
Charité-Universitätsmedizin Berlin, Berlin, Germany
Kerstin Ritter
IBM Research - Almaden, San Jose, CA, USA
Hongzhi Wang

About this paper

Cite this paper

Orbes-Arteaga, M. et al. (2019). Knowledge Distillation for Semi-supervised Domain Adaptation. In: Zhou, L., et al. OR 2.0 Context-Aware Operating Theaters and Machine Learning in Clinical Neuroimaging. OR 2.0 MLCN 2019. Lecture Notes in Computer Science, vol 11796. Springer, Cham. https://doi.org/10.1007/978-3-030-32695-1_8

DOI: https://doi.org/10.1007/978-3-030-32695-1_8
Published: 07 October 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-32694-4
Online ISBN: 978-3-030-32695-1
eBook Packages: Computer Science, Computer Science (R0)