Abstract
Capsule networks are deep neural networks that perform a part-to-whole association and instantiate the parameters of a “whole” (e.g., a class) by searching the agreement of “parts”. These networks are based on grouping neurons into units called capsules. The activity of “part” capsules is propagated to the “whole” capsules in the next layer by a transformation matrix, instead of a scalar weight, as is typical in conventional networks. One of the challenges of capsule networks is training on complex data with high intra-class variations. The high intra-class variation in real-world datasets cannot be appropriately captured by considering just one representation for each class. In this paper, we present a new multi-prototype capsule network architecture and propose a soft competitive learning algorithm that guides the network towards learning the diversity that exists within classes. We show that by having multiple capsules for each class and using a soft competitive cross-entropy loss, specialized learning in capsule networks can occur in which each capsule represents a particular part of the feature space of the corresponding class. Discovering the intra-class prototypes makes it easier for the capsule network to deal with complex datasets. Our extensive experiments on MNIST, CEDAR, MCYT, C-Cube, and SVHN datasets show the superiority of the proposed approach. Results show that the proposed model achieves an average of 3.36% and 3.73% classification accuracy improvement compared to the original capsule networks and high-dimensional capsules, respectively.
Similar content being viewed by others
References
Afshar P, Plataniotis KN, Mohammadi A (2019) Capsule networks for brain tumor classification based on MRI images and coarse tumor boundaries. In: ICASSP 2019–2019 IEEE international conference on acoustics, speech and signal processing (ICASSP). IEEE, pp 1368–1372
Bahadori MT (2018) Spectral capsule networks. In: International conference on learning representations workshop track, New Orleans, Louisiana, USA
Chang MC, Chiang CK, Tsai CM, Chang YK, Chiang HL, Wang YA, Chang SY, Li YL, Tsai MS, Tseng HY (2020) Ai city challenge 2020-computer vision for smart transportation applications. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops, pp 620–621
Cheng X, He J, He J, Xu H (2019) Cv-capsnet: complex-valued capsule network. IEEE Access 7:85,492-85,499
Dougherty ER (2020) Digital image processing methods. CRC Press, Boca Raton
Dubey AK, Jain V (2019) A review of face recognition methods using deep learning network. J Inf Optim Sci 40(2):547–558
de Rosario VM, Borin E, Breternitz M (2019) The multi-lane capsule network. IEEE Signal Process Lett 26(7):1006–1010
do Rosario VM, Breternitz M, Borin E (2019b) Efficiency and scalability of multi-lane capsule networks (mlcn). In: 2019 31st international symposium on computer architecture and high performance computing (SBAC-PAD). IEEE, pp 152–159
Ghiasi-Shirazi K (2019) Competitive cross-entropy loss: a study on training single-layer neural networks for solving nonlinearly separable classification problems. Neural Process Lett 50(2):1115–1122
Gumusbas D, Yildirim T (2019) Offline signature identification and verification using capsule network. In: 2019 IEEE international symposium on INnovations in Intelligent SysTems and Applications (INISTA). IEEE, pp 1–5
Hahn T, Pyeon M, Kim G (2019) Self-routing capsule networks. In: Advances in neural information processing systems, pp 7658–7667
Hinton GE, Krizhevsky A, Wang SD (2011) Transforming auto-encoders. In: International conference on artificial neural networks. Springer, pp 44–51
Hinton GE, Sabour S, Frosst N (2018) Matrix capsules with em routing. In: International conference on learning representations
Jain DK, Jain R, Upadhyay Y, Kathuria A, Lan X (2020) Deep refinement: capsule network with attention mechanism-based system for text classification. Neural Comput Appl 32(7):1839–1856
Jampour M, Abbaasi S, Javidi M (2021) Capsnet regularization and its conjugation with resnet for signature identification. Pattern Recognit 120(107):851
Javidi M, Abbaasi S, Naybandi Atashi S, Jampour M (2021) Covid-19 early detection for imbalanced or low number of data using a regularized cost-sensitive capsnet. Sci Rep 11(1):1–12
Li D, Zhao X, Yuan G, Liu Y, Liu G (2021) Robustness comparison between the capsule network and the convolutional network for facial expression recognition. Appl Intell 51(4):2269–2278
Mukhometzianov R, Carrillo J (2018) Capsnet comparative performance evaluation for image classification. arXiv:1805.11195
Paik I, Kwak T, Kim I (2019) Capsule networks need an improved routing algorithm. arXiv:1907.13327
Qian K, Tian L, Liu Y, Wen X, Bao J (2021) Image robust recognition based on feature-entropy-oriented differential fusion capsule network. Appl Intell 51(2):1108–1117
Sabour S, Frosst N, Hinton GE (2017) Dynamic routing between capsules. In: Advances in neural information processing systems, pp 3856–3866
Sun G, Ding S, Sun T, Zhang C, Du W (2022) A novel dense capsule network based on dense capsule layers. Appl Intell 52(3):3066–3075
Xi E, Bing S, Jin Y (2017) Capsule network performance on complex data. arXiv:1712.03480
Xiang C, Zhang L, Tang Y, Zou W, Xu C (2018) Ms-capsnet: a novel multi-scale capsule network. IEEE Signal Process Lett 25(12):1850–1854
Xu Z, Lu W, Zhang Q, Yeung Y, Chen X (2019) Gait recognition based on capsule network. J Vis Commun Image Represent 59:159–167
Zarei-Sabzevar R, Ghiasi-Shirazi K, Harati A (2022) Prototype-based interpretation of the functionality of neurons in winner-take-all neural networks. IEEE Trans Neural Netw Learn Syst. https://doi.org/10.1109/TNNLS.2022.3155174
Zhang Q, Zhu SC (2018) Visual interpretability for deep learning: a survey. Front Inf Technol Electron Eng 19(1):27–39
Zhao T, Liu Y, Huo G, Zhu X (2019) A deep learning iris recognition method based on capsule network architecture. IEEE Access 7:49,691-49,701
Acknowledgements
The authors would like to thank Mohammadreza Qaraei and anonymous reviewers for their valuable advice and discussions that greatly improved the manuscript.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Coonflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Abbaasi, S., Ghiasi-Shirazi, K. & Harati, A. A Multi-prototype Capsule Network for Image Recognition with High Intra-class Variations. Neural Process Lett 55, 6697–6711 (2023). https://doi.org/10.1007/s11063-023-11155-x
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11063-023-11155-x