Abstract
Metric-based few-shot learning has gained considerable attention for simply and effectively addressing the few-shot classification problem. However, a huge number of the existing approaches focus only on the similarity or distance between features of the instances in the embedding space, neglecting the geometric structure of the samples. To remedy this, we propose a novel approach referred to as the discriminant space metric network (DSMNet) for few-shot image classification problem. DSMNet exploits the geometric structure of the samples within each episode to enhance the discriminative ability of the embedding space. Specifically, DSMNet aims to increase the distance between features belonging to different classes while making those from the same class more compact by maximizing the between-class scatter and minimizing the within-class scatter in the embedding space. Moreover, we developed a novel adaptation strategy for improving the model’s generalizing capability. Extensive experiments are conducted on four few-shot classification benchmark datasets to demonstrate the proposed DSMNet. We also performed several ablation studies to analyze its performance. The superiority of DSMNet over existing networks is indicated by the experimental results.
Similar content being viewed by others
Availability of data and materials
The datasets used during this study are available upon reasonable request to the authors.
Code Availability
Code availability: The code is publicly available at https://github.com/ylshitou/DSMNet.
References
Zhou H, Zhang S, Peng J, Zhang S, Li J, Xiong H, Zhang W (2021) Informer: Beyond efficient transformer for long sequence time-series forecasting. In: Proceedings of association for the advance of artificial intelligence
Chen Y, Chiang S-W, Wu M (2022) A few-shot transfer learning approach using text-label embedding with legal attributes for law article prediction. Appl Intell 52(3):2884–2902
Chen L, Min Y, Zhang M, Karbasi A (2020) More data can expand the generalization gap between adversarially robust and standard models. In: Proceedings of the international conference on machine learning, pp 1670–1680
Dvornik N, Schmid C, Mairal J (2019) Diversity with cooperation: Ensemble methods for few-shot classification. In: Proceedings of the IEEE international conference on computer vision, pp 3723–3731
Bronskill J, Gordon J, Requeima J, Nowozin S, Turner R (2020) Tasknorm: Rethinking batch normalization for meta-learning. In: Proceedings of the international conference on machine learning, pp 1153–1164
Yan L, Zhang L, Zheng X, Li F (2021) Deeper multi-column dilated convolutional network for congested crowd understanding. Neural Comput Appl:1–16
Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M et al (2015) Imagenet large scale visual recognition challenge. Int J Comput Vis 115(3):211–252
Lin T, Maire M, Belongie S, Hays J, Perona P, Ramanan D, Dollár P, Zitnick CL (2014) Microsoft COCO: Common objects in context. In: Proceedings of the European conference on computer vision, pp 740–755
Li X, Chang D, Ma Z, Tan Z, Xue J, Cao J, Yu J, Guo J (2020) OSLNet: Deep small-sample classification with an orthogonal softmax layer. IEEE Trans Image Process 29:6482–6495
Li X, Sun Z, Xue J, Ma Z (2021) A concise review of recent few-shot meta-learning methods. Neurocomputing 456:463–468
Yao H, Wu X, Tao Z, Li Y, Ding B, Li R, Li Z (2020) Automated relational meta-learning. In: Proceedings of the international conference on learning representations
Jiang M, Li F, Liu L (2022) Continual meta-learning algorithm. Appl Intell 52(4):4527–4542
Yin M, Tucker G, Zhou M, Levine S, Finn C (2020) Meta-learning without memorization. In: Proceedings of the international conference on learning representations
Jiang M, Li F (2022) Lie group continual meta learning algorithm. Appl Intell 52(10):10965–10978
Li L, Jin W, Huang Y (2022) Few-shot contrastive learning for image classification and its application to insulator identification. Appl Intell 52(6):6148–6163
Rakelly K, Shelhamer E, Darrell T, Efros AA, Levine S (2019) Few-shot segmentation propagation with guided networks. In: Proceedings of the international conference on machine learning
Finn C, Abbeel P, Levine S (2017) Model-agnostic meta-learning for fast adaptation of deep networks. In: Proceedings of the international conference on machine learning, pp 1126– 1135
Ravi S, Larochelle H (2017) Optimization as a model for few-shot learning. In: Proceedings of the international conference on learning representations
Li W, Wang L, Xu J, Huo J, Gao Y, Luo J (2019) Revisiting local descriptor based image-to-class measure for few-shot learning. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7260–7268
Chen Z, Fu Y, Zhang Y, Jiang Y-G, Xue X, Sigal L (2019) Multi-level semantic feature augmentation for one-shot learning. IEEE Trans Image Process 28(9):4594–4605
Antoniou A, Edwards H, Storkey A (2018) How to train your MAML. In: Proceedings of the international conference on learning representations
Bi S, Wang Y, Li X, Dong M, Zhu J (2022) Critical direction projection networks for few-shot learning. Appl Intell 52(5):5400–5413
Snell J, Swersky K, Zemel RS (2017) Prototypical networks for few-shot learning. In: Advances in neural information processing systems, pp 4077–4087
Chen W, Liu Y, Kira Z, Wang YF, Huang J (2019) A closer look at few-shot classification. In: Proceedings of the international conference on learning representations
Tseng H, Lee H, Huang J, Yang M (2020) Cross-domain few-shot classification via learned feature-wise transformation. In: Proceedings of the international conference on learning representations
Sung F, Yang Y, Zhang L, Xiang T, Torr PH, Hospedales TM (2018) Learning to compare: Relation network for few-shot learning. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1199–1208
Garcia V, Bruna J (2018) Few-shot learning with graph neural networks. In: Proceedings of the international conference on learning representations
Huang H, Wu Z, Li W, Huo J, Gao Y (2021) Local descriptor-based multi-prototype network for few-shot learning. Pattern Recogn 116:107935
Liu X, Zhou F, Liu J, Jiang L (2020) Meta-learning based prototype-relation network for few-shot classification. Neurocomputing 383:224–234
Yan L, Zhang L (2019) Unsupervised dimension reduction using supervised orthogonal discriminant projection for clustering. In: International conference on high performance computing and communications, pp 2239–2246
Shu X, Gao Y, Lu H (2012) Efficient linear discriminant analysis with locality preserving for face recognition. Pattern Recogn 45(5):1892–1898
Rusu AA, Rao D, Sygnowski J, Vinyals O, Pascanu R, Osindero S, Hadsell R (2019) Meta-learning with latent embedding optimization. In: Proceedings of the international conference on learning representations
Vinyals O, Blundell C, Lillicrap T, kavukcuoglu k, Wierstra D (2016) Matching networks for one shot learning. In: Advances in neural information processing systems, vol 29, pp 3630–3638
Xu R, Xing L, Shao S, Zhao L, Liu B, Liu W, Zhou Y (2022) GCT: Graph co-training for semi-supervised few-shot learning. IEEE Trans Circuits Syst Video Technol:1–1
Shao S, Xing L, Wang Y, Xu R, Zhao C, Wang Y, Liu B (2021) MHFC: Multi-head feature collaboration for few-shot learning. In: Proceedings of the 29th ACM international conference on multimedia, pp 4193–4201
Shao S, Xing L, Xu R, Liu W, Wang Y, Liu B (2022) MDFM: Multi-decision fusing model for few-shot learning. IEEE Trans Circ Syst Video Technol 32(8):5151–5162
Wang Y, Xu C, Liu C, Zhang L, Fu Y (2020) Instance credibility inference for few-shot learning. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 12836–12845
Finn C, Xu K, Levine S (2018) Probabilistic model-agnostic meta-learning. Adv Neural Inf Process Syst:31
Munkhdalai T, Yu H (2017) Meta networks. In: Proceedings of the international conference on machine learning, pp 2554–2563
Zhang R, Che T, Ghahramani Z, Bengio Y, Song Y (2018) MetaGAN: An adversarial approach to few-shot learning. Adv Neural Inf Process Syst:31
Hariharan B, Girshick R (2017) Low-shot visual recognition by shrinking and hallucinating features. In: Proceedings of the IEEE international conference on computer vision, pp 3018–3027
Gao H, Shou Z, Zareian A, Zhang H, Chang S (2018) Low-shot learning via covariance-preserving adversarial augmentation networks. Adv Neural Inf Process Syst:31
Wang Y-X, Girshick R, Hebert M, Hariharan B (2018) Low-shot learning from imaginary data. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7278–7286
Antoniou A, Storkey A, Edwards H (2017) Data augmentation generative adversarial networks. arXiv:1711.04340
Koch G, Zemel R, Salakhutdinov R et al (2015) Siamese neural networks for one-shot image recognition. In: ICML deep learning workshop, vol 2
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. In: Proceedings of the international conference on learning representations
Oreshkin BN, López PR, Lacoste A (2018) TADAM: Task dependent adaptive metric for improved few-shot learning. In: Advances in neural information processing systems
Qiao S, Liu C, Shen W, Yuille AL (2018) Few-shot image recognition by predicting parameters from activations. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7229–7238
Allen K, Shelhamer E, Shin H, Tenenbaum J (2019) Infinite mixture prototypes for few-shot learning. In: Proceedings of the international conference on machine learning, pp 232–241
Triantafillou E, Zhu T, Dumoulin V, Lamblin P, Evci U, Xu K, Goroshin R, Gelada C, Swersky K, Manzagol P-A et al (2019) Meta-dataset: A dataset of datasets for learning to learn from few examples. In: Proceedings of the international conference on learning representations
Wah C, Branson S, Welinder P, Perona P, Belongie S (2011) The Caltech-UCSD Birds-200-2011 dataset
Khosla A, Jayadevaprakash N, Yao B, Li F-F (2011) Novel dataset for fine-grained image categorization: Stanford dogs. In: Proceedings of the CVPR workshop on fine-grained visual categorization (FGVC), vol 2
Bertinetto L, Henriques JF, Torr PHS, Vedaldi A (2019) Meta-learning with differentiable closed-form solvers. In: Proceedings of the international conference on learning representations
Hilliard N, Phillips L, Howland S, Yankov A, Corley CD, Hodas NO (2018) Few-shot learning with metric-agnostic conditional embeddings. arXiv:1802.04376
Krizhevsky A, Hinton G et al (2009) Learning multiple layers of features from tiny images. University of Toronto, ON, Canada
Bertinetto L, Henriques JF, Valmadre J, Torr P, Vedaldi A (2016) Learning feed-forward one-shot learners. Adv Neural Inf Process Syst:29
Kingma DP, Ba J (2015) Adam: A method for stochastic optimization. In: Proceedings of the international conference on learning representations
Li W, Xu J, Huo J, Wang L, Gao Y, Luo J (2019) Distribution consistency based covariance metric networks for few-shot learning. In: Proceedings of the AAAI conference on artificial intelligence, vol 33, pp 8642–8649
Raghu A, Raghu M, Bengio S, Vinyals O (2020) Rapid learning or feature reuse? towards understanding the effectiveness of MAML. In: Proceedings of the international conference on learning representations
Qin Y, Zhang W, Zhao C, Wang Z, Zhu X, Shi J, Qi G, Lei Z (2021) Prior-knowledge and attention based meta-learning for few-shot learning. Knowl-Based Syst 213:106609
Li X, Wu J, Sun Z, Ma Z, Cao J, Xue J-H (2021) Bsnet: Bi-similarity network for few-shot fine-grained image classification. IEEE Trans Image Process 30:1318–1331
Xue Z, Xie Z, Xing Z, Duan L (2020) Relative position and map networks in few-shot learning for image classification. In: IEEE conference on computer vision and pattern recognition workshops, pp 4032–4036
Selvaraju RR, Cogswell M, Das A, Vedantam R, Parikh D, Batra D (2017) Grad-CAM: Visual explanations from deep networks via gradient-based localization. In: Proceedings of the IEEE international conference on computer vision, pp 618–626
Van der Maaten L, Hinton G (2008) Visualizing data using t-SNE. J Mach Learn Res 9(11)
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Sergey Z, Nikos K (2016) Wide residual networks. In: Proceedings of the British machine vision conference, pp 1–12
Acknowledgements
We would like to thank Chengxiang Hu, Meijuan Su and Mengjuan Jiang for their technical support. We would also like to thank the computer resources and other support provided by the Machine Learning Laboratory of Soochow University.
Funding
This work was supported in part by the Priority Academic Program Development of Jiangsu Higher Education Institutions, by the National Key R&D Program of China (2018YFA0701700; 2018YFA0701701) and by the National Natural Science Foundation of China under Grant No.61672364, No.62176172 and No.61902269.
Author information
Authors and Affiliations
Contributions
All authors contributed to the study conception and design. Leilei Yan: Conceptualization, Methodology, Software, Validation, Writing - original draft, Writing - review and editing. Fanzhang Li: Conceptualization, Methodology, Software, Writing - review and editing, Validation, Project administration, Funding acquisition. Li Zhang: Investigation, Software, Visualization, Writing - review and editing. Xiaohan Zheng: Investigation, Software, Visualization. All authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Ethics approval
Not applicable.
Consent for Publication
Not applicable.
Consent to participate
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Yan, L., Li, F., Zhang, L. et al. Discriminant space metric network for few-shot image classification. Appl Intell 53, 17444–17459 (2023). https://doi.org/10.1007/s10489-022-04413-3
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-022-04413-3