Skip to main content
Log in

A Multi-prototype Capsule Network for Image Recognition with High Intra-class Variations

  • Published:
Neural Processing Letters Aims and scope Submit manuscript

Abstract

Capsule networks are deep neural networks that perform a part-to-whole association and instantiate the parameters of a “whole” (e.g., a class) by searching the agreement of “parts”. These networks are based on grouping neurons into units called capsules. The activity of “part” capsules is propagated to the “whole” capsules in the next layer by a transformation matrix, instead of a scalar weight, as is typical in conventional networks. One of the challenges of capsule networks is training on complex data with high intra-class variations. The high intra-class variation in real-world datasets cannot be appropriately captured by considering just one representation for each class. In this paper, we present a new multi-prototype capsule network architecture and propose a soft competitive learning algorithm that guides the network towards learning the diversity that exists within classes. We show that by having multiple capsules for each class and using a soft competitive cross-entropy loss, specialized learning in capsule networks can occur in which each capsule represents a particular part of the feature space of the corresponding class. Discovering the intra-class prototypes makes it easier for the capsule network to deal with complex datasets. Our extensive experiments on MNIST, CEDAR, MCYT, C-Cube, and SVHN datasets show the superiority of the proposed approach. Results show that the proposed model achieves an average of 3.36% and 3.73% classification accuracy improvement compared to the original capsule networks and high-dimensional capsules, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  1. Afshar P, Plataniotis KN, Mohammadi A (2019) Capsule networks for brain tumor classification based on MRI images and coarse tumor boundaries. In: ICASSP 2019–2019 IEEE international conference on acoustics, speech and signal processing (ICASSP). IEEE, pp 1368–1372

  2. Bahadori MT (2018) Spectral capsule networks. In: International conference on learning representations workshop track, New Orleans, Louisiana, USA

  3. Chang MC, Chiang CK, Tsai CM, Chang YK, Chiang HL, Wang YA, Chang SY, Li YL, Tsai MS, Tseng HY (2020) Ai city challenge 2020-computer vision for smart transportation applications. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops, pp 620–621

  4. Cheng X, He J, He J, Xu H (2019) Cv-capsnet: complex-valued capsule network. IEEE Access 7:85,492-85,499

    Article  Google Scholar 

  5. Dougherty ER (2020) Digital image processing methods. CRC Press, Boca Raton

    Book  Google Scholar 

  6. Dubey AK, Jain V (2019) A review of face recognition methods using deep learning network. J Inf Optim Sci 40(2):547–558

    Google Scholar 

  7. de Rosario VM, Borin E, Breternitz M (2019) The multi-lane capsule network. IEEE Signal Process Lett 26(7):1006–1010

    Article  Google Scholar 

  8. do Rosario VM, Breternitz M, Borin E (2019b) Efficiency and scalability of multi-lane capsule networks (mlcn). In: 2019 31st international symposium on computer architecture and high performance computing (SBAC-PAD). IEEE, pp 152–159

  9. Ghiasi-Shirazi K (2019) Competitive cross-entropy loss: a study on training single-layer neural networks for solving nonlinearly separable classification problems. Neural Process Lett 50(2):1115–1122

    Article  Google Scholar 

  10. Gumusbas D, Yildirim T (2019) Offline signature identification and verification using capsule network. In: 2019 IEEE international symposium on INnovations in Intelligent SysTems and Applications (INISTA). IEEE, pp 1–5

  11. Hahn T, Pyeon M, Kim G (2019) Self-routing capsule networks. In: Advances in neural information processing systems, pp 7658–7667

  12. Hinton GE, Krizhevsky A, Wang SD (2011) Transforming auto-encoders. In: International conference on artificial neural networks. Springer, pp 44–51

  13. Hinton GE, Sabour S, Frosst N (2018) Matrix capsules with em routing. In: International conference on learning representations

  14. Jain DK, Jain R, Upadhyay Y, Kathuria A, Lan X (2020) Deep refinement: capsule network with attention mechanism-based system for text classification. Neural Comput Appl 32(7):1839–1856

    Article  Google Scholar 

  15. Jampour M, Abbaasi S, Javidi M (2021) Capsnet regularization and its conjugation with resnet for signature identification. Pattern Recognit 120(107):851

    Google Scholar 

  16. Javidi M, Abbaasi S, Naybandi Atashi S, Jampour M (2021) Covid-19 early detection for imbalanced or low number of data using a regularized cost-sensitive capsnet. Sci Rep 11(1):1–12

    Article  Google Scholar 

  17. Li D, Zhao X, Yuan G, Liu Y, Liu G (2021) Robustness comparison between the capsule network and the convolutional network for facial expression recognition. Appl Intell 51(4):2269–2278

    Article  Google Scholar 

  18. Mukhometzianov R, Carrillo J (2018) Capsnet comparative performance evaluation for image classification. arXiv:1805.11195

  19. Paik I, Kwak T, Kim I (2019) Capsule networks need an improved routing algorithm. arXiv:1907.13327

  20. Qian K, Tian L, Liu Y, Wen X, Bao J (2021) Image robust recognition based on feature-entropy-oriented differential fusion capsule network. Appl Intell 51(2):1108–1117

    Article  Google Scholar 

  21. Sabour S, Frosst N, Hinton GE (2017) Dynamic routing between capsules. In: Advances in neural information processing systems, pp 3856–3866

  22. Sun G, Ding S, Sun T, Zhang C, Du W (2022) A novel dense capsule network based on dense capsule layers. Appl Intell 52(3):3066–3075

  23. Xi E, Bing S, Jin Y (2017) Capsule network performance on complex data. arXiv:1712.03480

  24. Xiang C, Zhang L, Tang Y, Zou W, Xu C (2018) Ms-capsnet: a novel multi-scale capsule network. IEEE Signal Process Lett 25(12):1850–1854

    Article  Google Scholar 

  25. Xu Z, Lu W, Zhang Q, Yeung Y, Chen X (2019) Gait recognition based on capsule network. J Vis Commun Image Represent 59:159–167

    Article  Google Scholar 

  26. Zarei-Sabzevar R, Ghiasi-Shirazi K, Harati A (2022) Prototype-based interpretation of the functionality of neurons in winner-take-all neural networks. IEEE Trans Neural Netw Learn Syst. https://doi.org/10.1109/TNNLS.2022.3155174

    Article  Google Scholar 

  27. Zhang Q, Zhu SC (2018) Visual interpretability for deep learning: a survey. Front Inf Technol Electron Eng 19(1):27–39

    Article  Google Scholar 

  28. Zhao T, Liu Y, Huo G, Zhu X (2019) A deep learning iris recognition method based on capsule network architecture. IEEE Access 7:49,691-49,701

    Article  Google Scholar 

Download references

Acknowledgements

The authors would like to thank Mohammadreza Qaraei and anonymous reviewers for their valuable advice and discussions that greatly improved the manuscript.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kamaledin Ghiasi-Shirazi.

Ethics declarations

Coonflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Abbaasi, S., Ghiasi-Shirazi, K. & Harati, A. A Multi-prototype Capsule Network for Image Recognition with High Intra-class Variations. Neural Process Lett 55, 6697–6711 (2023). https://doi.org/10.1007/s11063-023-11155-x

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11063-023-11155-x

Keywords

Navigation