Abstract
Deep learning models have achieved remarkable success in medical imaging analysis. However, existing methods are primarily focused on supervised learning, which requires a massive amount of training data. Recent studies have explored semi-supervised learning approaches to address this issue, where data augmentation was applied to unlabeled data. However, there are still two unsolved challenges in applying data augmentation to unlabeled medical images: it can i) result in the lesion features loss and ii) reduce the discriminability of prediction results. Thus, in this work, weak data augmentation is applied to unlabeled data to avoid losing lesions features. Also, we propose nuclear-norm maximization to achieve entropy minimization without losing prediction diversity. Experimental results on two public datasets show that the proposed method outperforms the compared models.
Similar content being viewed by others
References
Tschandl P, Rosendahl C, Kittler H (2018) The ham10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions. Nat Sci Data 5:180161
Al-Waisy AS, Al-Fahdawi S, Mohammed MA, Abdulkareem KH, Mostafa SA, Maashi MS, Arif M, Garcia-Zapirain B (2020) Covid-chexnet: hybrid deep learning framework for identifying covid-19 virus in chest x-rays images. Soft Computing, 1–16
De Fauw J, Ledsam JR, Romera-Paredes B, Nikolov S, Tomasev N, Blackwell S, Askham H, Glorot X, O’Donoghue B, Visentin D et al (2018) Clinically applicable deep learning for diagnosis and referral in retinal disease. Nat Med 24(9):1342–1350
Yarowsky D (1995) Unsupervised word sense disambiguation rivaling supervised methods. In: 33rd Annual meeting of the association for computational linguistics, pp 189–196
Blum A, Mitchell T (1998) Combining labeled and unlabeled data with co-training. In: Proceedings of the eleventh annual conference on computational learning theory, pp 92–100
GRANDVALET Y (2005) Semi-supervised learning by entropy minimization. Adv Neural Inf Process Syst 17:529–536
Lee D. -H., et al. (2013) Pseudo-label: the simple and efficient semi-supervised learning method for deep neural networks. Workshop on Challenges in Representation Learning (ICML), vol 3, pp 896
Oliver A, Odena A, Raffel C, Cubuk E, Goodfellow I (2018) Realistic evaluation of semi-supervised learning algortihms. In: International conference on learning representations, pp 1–15
Odena A, Oliver A, Raffel C, Cubuk ED, Goodfellow I (2018) Realistic evaluation of semi-supervised learning algorithms. In: International conference on learning representations workshop
Laine SM, Aila TO (2021) Temporal ensembling for semi-supervised learning. Google Patents. US Patent 11,068,781
Tarvainen A, Valpola H (2017) Mean teachers are better role models: weight-averaged consistency targets improve semi-supervised deep learning results. Advances in Neural Information Processing Systems, 30
Miyato T, Maeda S-i, Koyama M, Ishii S (2018) Virtual adversarial training: a regularization method for supervised and semi-supervised learning. IEEE Trans Pattern Anal Mach Intell 41(8):1979–1993
Sohn K, Berthelot D, Carlini N, Zhang Z, Zhang H, Raffel C, Cubuk ED, Kurakin A, Li C-L (2020) Fixmatch: simplifying semi-supervised learning with consistency and confidence. Adv Neural Inf Process Syst 33:596–608
Cui S, Wang S, Zhuo J, Li L, Huang Q, Tian Q (2020) Towards discriminability and diversity: batch nuclear-norm maximization under label insufficient situations. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 3941–3950
Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R (2014) Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res 15(1):1929–1958
Berthelot D, Carlini N, Goodfellow I, Papernot N, Oliver A, Raffel C (2019) Mixmatch: a holistic approach to semi-supervised learning. Adv Neural Inf Process Syst, 32
Berthelot D, Carlini N, Cubuk ED, Kurakin A, Sohn K, Zhang H, Raffel C (2019) Remixmatch: semi-supervised learning with distribution matching and augmentation anchoring. In: International conference on learning representations
Xie Q, Dai Z, Hovy E, Luong T, Le Q (2020) Unsupervised data augmentation for consistency training. Adv Neural Inf Process Syst 33:6256–6268
Zhu W, Peng B, Wu H, Wang B (2020) Query set centered sparse projection learning for set based image classification. Appl Intell 50(10):3400–3411
Qiao S, Shen W, Zhang Z, Wang B, Yuille A (2018) Deep co-training for semi-supervised image recognition. In: Proceedings of the European conference on computer vision (eccv), pp 135–152
Hu C, Wu X-J, Kittler J (2018) Semi-supervised learning based on gan with mean and variance feature matching. IEEE Trans Cogn Develop Syst 11(4):539–547
Jin W, Yang P, Tang P (2018) Double discriminator generative adversarial networks and their application in detecting nests built in catenary and semisupervized learning. Scientia Sinica Informationis 48 (07):888–902
Jin Y, Cheng K, Dou Q, Heng P-A (2019) Incorporating temporal prior from motion flow for instrument segmentation in minimally invasive surgery video. In: International conference on medical image computing and computer-assisted intervention, pp 440–448. Springer
Cheplygina V, de Bruijne M, Pluim JP (2019) Not-so-supervised: a survey of semi-supervised, multi-instance, and transfer learning in medical image analysis. Med Image Anal 54:280–296
Zhang Y, Yang L, Chen J, Fredericksen M, Hughes DP, Chen DZ (2017) Deep adversarial networks for biomedical image segmentation utilizing unannotated images. In: International conference on medical image computing and computer-assisted intervention, pp 408–416. Springer
Gu L, Zheng Y, Bise R, Sato I, Imanishi N, Aiso S (2017) Semi-supervised learning for biomedical image segmentation via forest oriented super pixels (voxels). In: International conference on medical image computing and computer-assisted intervention, p Springer
Singh S, Janoos F, Pécot T, Caserta E, Leone G, Rittscher J, Machiraju R (2011) Identifying nuclear phenotypes using semi-supervised metric learning. In: Biennial international conference on information processing in medical imaging. Springer, pp 398–410
Bai W, Chen C, Tarroni G, Duan J, Guitton F, Petersen SE, Guo Y, Matthews PM, Rueckert D (2019) Self-supervised learning for cardiac mr image segmentation by anatomical position prediction. In: International conference on medical image computing and computer-assisted intervention. Springer, pp 541–549
Li CH, Yuen PC (2001) Semi-supervised learning in medical image database. In: Pacific-Asia conference on knowledge discovery and data mining. Springer, pp 154–160
Filipovych R, Davatzikos C, Initiative ADN, et al. (2011) Semi-supervised pattern classification of medical images: application to mild cognitive impairment (mci). Neuroimage 55(3):1109–1119
Batmanghelich KN, Dong HY, Pohl KM, Taskar B, Davatzikos C et al (2011) Disease classification and prediction via semi-supervised dimensionality reduction. In: 2011 IEEE International symposium on biomedical imaging: from Nano to Macro. IEEE, pp 1086–1090
Batmanghelich NK, Taskar B, Davatzikos C (2011) Generative-discriminative basis learning for medical imaging. IEEE Trans Med Imag 31(1):51–69
Csurka G, Clinchant S, Jacquet G (2011) Xrce’s participation at medical image modality classification and ad-hoc retrieval tasks of image clef2011. CLEF (Notebook Papers/Labs/Workshop), vol 150
de Herrera AGS, Markonis D, Joyseeree R, Schaer R, Foncubierta-Rodríguez A, Müller H (2015) Semi–supervised learning for image modality classification. In: International workshop on multimodal retrieval in the medical domain. Springer, pp 85–98
Peikari M, Salama S, Nofech-Mozes S, Martel AL (2018) A cluster-then-label semi-supervised learning approach for pathology image classification. Sci Rep 8(1):1–13
Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y (2020) Generative adversarial networks. Commun ACM 63(11):139–144
Madani A, Moradi M, Karargyris A, Syeda-Mahmood T (2018) Semi-supervised learning with generative adversarial networks for chest x-ray classification with ability of data domain adaptation. In: 2018 IEEE 15th international symposium on biomedical imaging (ISBI 2018). IEEE, pp 1038–1042
Madani A, Ong JR, Tibrewal A, Mofrad MR (2018) Deep echocardiography: data-efficient supervised and semi-supervised deep learning towards automated diagnosis of cardiac disease. NPJ Digit Med 1 (1):1–11
Lecouat B, Chang K, Foo C -S, Unnikrishnan B, Brown JM, Zenati H, Beers A, Chandrasekhar V, Kalpathy-Cramer J, Krishnaswamy P (2018) Semi-supervised deep learning for abnormality classification in retinal images
Li X, Yu L, Chen12 H, Fu C-W, Heng P-A Semi-supervised skin lesion segmentation via transformation consistent self-ensembling model
Cui W, Liu Y, Li Y, Guo M, Li Y, Li X, Wang T, Zeng X, Ye C (2019) Semi-supervised brain lesion segmentation with an adapted mean teacher model. In: International conference on information processing in medical imaging. Springer, pp 554–565
Yu L, Wang S, Li X, Fu C-W, Heng P-A (2019) Uncertainty-aware self-ensembling model for semi-supervised 3d left atrium segmentation. In: International conference on medical image computing and computer-assisted intervention. Springer, pp 605–613
Grandvalet Y, Bengio Y (2004) Semi-supervised learning by entropy minimization. Advances in Neural Information Processing Systems, 17
Pogorelov K, Randel KR, Griwodz C, Eskeland SL, de Lange T, Johansen D, Spampinato C, Dang-Nguyen D. -T., Lux M, Schmidt PT, Riegler M, Halvorsen P, Kvasir A (2017) Multi-class image dataset for computer aided gastrointestinal disease detection. In: Proceedings of the 8th ACM on multimedia systems conference (mmsys). https://doi.org/10.1145/3083187.3083212, pp 164–169
Gyawali PK, Ghimire S, Bajracharya P, Li Z, Wang L (2020) Semi-supervised medical image classification with global latent mixing. In: International conference on medical image computing and computer-assisted intervention. Springer, pp 604–613
Wang X, Chen H, Xiang H, Lin H, Lin X, Heng P-A (2021) Deep virtual adversarial self-training with consistency regularization for semi-supervised medical image classification. Med Image Anal 70:102010
Sundararajan M, Taly A, Yan Q (2017) Axiomatic attribution for deep networks. In: Proceedings of the 34th international conference on machine learning, vol 70, pp 3319–3328
Acknowledgments
This work was supported by the Research Foundation of Yunnan Province No. 202002AD080001, 202001BB050043 and 2019FA044, National Natural Science Foundation of China under Grants No.62162065, Provincial Foundation for Leaders of Disciplines in Science and Technology No.2019HB121.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Liu, P., Qian, W., Cao, J. et al. Semi-supervised medical image classification via increasing prediction diversity. Appl Intell 53, 10162–10175 (2023). https://doi.org/10.1007/s10489-022-04012-2
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-022-04012-2