Discriminant space metric network for few-shot image classification

Yan, Leilei; Li, Fanzhang; Zhang, Li; Zheng, Xiaohan

doi:10.1007/s10489-022-04413-3

Published: 05 January 2023

Applied Intelligence, volume 53, pages 17444–17459 (2023)

Leilei Yan¹,
Fanzhang Li ORCID: orcid.org/0000-0003-4318-3081²,
Li Zhang¹ &
…
Xiaohan Zheng¹

Abstract

Metric-based few-shot learning has gained considerable attention as a simple and effective way to address the few-shot classification problem. However, many existing approaches focus only on the similarity or distance between instance features in the embedding space and neglect the geometric structure of the samples. To remedy this, we propose a novel approach, the discriminant space metric network (DSMNet), for few-shot image classification. DSMNet exploits the geometric structure of the samples within each episode to enhance the discriminative ability of the embedding space. Specifically, DSMNet maximizes the between-class scatter and minimizes the within-class scatter in the embedding space, which increases the distance between features of different classes while making features of the same class more compact. Moreover, we develop a novel adaptation strategy to improve the model's generalization capability. Extensive experiments on four few-shot classification benchmark datasets evaluate the proposed DSMNet, and several ablation studies analyze its performance. The experimental results indicate the superiority of DSMNet over existing networks.
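To make the scatter terminology concrete, the following is a minimal sketch of how within-class and between-class scatter could be measured for the embeddings of one episode. It is not the DSMNet objective, which the abstract does not specify: the function names `scatter_traces` and `discriminant_loss`, the trace-ratio form of the loss, and the `eps` stabilizer are all assumptions made here for illustration, in the spirit of classical Fisher discriminant analysis.

```python
import numpy as np


def scatter_traces(features, labels):
    """Return (within, between) scatter traces for one episode.

    Hypothetical helper: features is an (n, d) array of embeddings,
    labels is an (n,) array of class ids.
    """
    features = np.asarray(features, dtype=float)
    labels = np.asarray(labels)
    global_mean = features.mean(axis=0)
    within = 0.0
    between = 0.0
    for c in np.unique(labels):
        class_feats = features[labels == c]
        class_mean = class_feats.mean(axis=0)
        # Within-class scatter: spread of samples around their own class mean.
        within += np.sum((class_feats - class_mean) ** 2)
        # Between-class scatter: spread of class means around the global mean,
        # weighted by the number of samples in the class.
        between += len(class_feats) * np.sum((class_mean - global_mean) ** 2)
    return within, between


def discriminant_loss(features, labels, eps=1e-8):
    """Assumed trace-ratio loss: small when classes are compact and far apart."""
    within, between = scatter_traces(features, labels)
    return within / (between + eps)


# Toy 2-way 2-shot episode with 2-dimensional embeddings.
feats = np.array([[0.0, 0.0], [0.0, 2.0], [4.0, 0.0], [4.0, 2.0]])
labs = np.array([0, 0, 1, 1])
within, between = scatter_traces(feats, labs)
print(within, between, discriminant_loss(feats, labs))
```

In this toy episode the within-class trace is 4 and the between-class trace is 16, so the assumed loss is about 0.25. Moving the two class means further apart, or tightening each class around its mean, lowers the value, which is the behavior the abstract describes qualitatively.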

Availability of data and materials

The datasets used during this study are available upon reasonable request to the authors.

Code Availability

The code is publicly available at https://github.com/ylshitou/DSMNet.

Acknowledgements

We would like to thank Chengxiang Hu, Meijuan Su and Mengjuan Jiang for their technical support. We also thank the Machine Learning Laboratory of Soochow University for providing computing resources and other support.

Funding

This work was supported in part by the Priority Academic Program Development of Jiangsu Higher Education Institutions, by the National Key R&D Program of China (2018YFA0701700; 2018YFA0701701), and by the National Natural Science Foundation of China under Grants No. 61672364, No. 62176172 and No. 61902269.

Author information

Authors and Affiliations

School of Computer Science and Technology, Joint International Research Laboratory of Machine Learning and Neuromorphic Computing, Soochow University, No. 1 Shizi Street, Suzhou, 215006, Jiangsu, China
Leilei Yan, Li Zhang & Xiaohan Zheng
School of Computer Science and Technology, Soochow University, 215006, Suzhou, China
Fanzhang Li

Contributions

All authors contributed to the study conception and design. Leilei Yan: Conceptualization, Methodology, Software, Validation, Writing - original draft, Writing - review and editing. Fanzhang Li: Conceptualization, Methodology, Software, Writing - review and editing, Validation, Project administration, Funding acquisition. Li Zhang: Investigation, Software, Visualization, Writing - review and editing. Xiaohan Zheng: Investigation, Software, Visualization. All authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Fanzhang Li.

Ethics declarations

Ethics approval

Not applicable.

Consent for Publication

Not applicable.

Consent to participate

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Yan, L., Li, F., Zhang, L. et al. Discriminant space metric network for few-shot image classification. Appl Intell 53, 17444–17459 (2023). https://doi.org/10.1007/s10489-022-04413-3

Accepted: 14 December 2022
Published: 05 January 2023
Issue Date: July 2023
DOI: https://doi.org/10.1007/s10489-022-04413-3
