This work is devoted to fast image recognition techniques based on statistical sequential analysis. We examine the possibility to sequentially process principal components and organize a convolutional neural network with early exits. We pay special attention to sequentially learning a multi-task lightweight neural network model in order to predict several facial attributes (age, gender, and ethnicity) based on preliminary training on the face classification task. We specifically note that the entire above-mentioned model should be fine-tuned in order to deal with the emotion recognition problem. Experimental evaluation on several datasets demonstrates that the proposed approach has high accuracy and very low running time and space complexity compared to known state-of-the-art methods.
Similar content being viewed by others
References
Y. Benjaminiand Y. Hochberg, “Controlling the false discovery rate: a practical and powerful approach to multiple testing,” J. the Royal Statistical Society, Ser. B (Methodological), 57, No. 1, 289–300 (1995).
Q. Cao, Li Shen, W. Xie, O. M. Parkhi, and A. Zisserman, “VGGFace2: A dataset for recognising faces across pose and age,” in: Proceedings of the 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG) (2018), pp. 67–74.
A. Das, A. Dantcheva, and F. Bremond, “Mitigating bias in gender, age and ethnicity classification: A multi-task convolution neural network approach,” in: Proceedings of the European Conference on Computer Vision (ECCV), Springer (2018), pp. 573–585.
J. Deng, J. Guo, X. Niannan, and S. Zafeiriou, “Arcface: Additive angular margin loss for deep face recognition,” in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2019), pp. 4690–4699.
E. Eidinger, R. Enbar, and T. Hassner, “Age and gender estimation of unfiltered faces,” IEEE Transactions on Information Forensics and Security, 9, No. 12, 2170–2179 (2014).
I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning, MIT Press (2016).
S. Hung, J.-H. Lee, T. Wan, C.-H. Chen, Y.-M. Chan, and C.-S. Chen, “Increasingly packing multiple facial-informatics modules in a unified deep-learning model via lifelong learning,” in: Proceedings of the 2019 on International Conference on Multimedia Retrieval (ICMR) (2019), pp. 339–343.
A. Mollahosseini, B. Hasani, and M. H. Mahoor, “AffectNet: A database for facial expression, valence, and arousal computing in the wild,” IEEE Transactions on Affective Computing, 10, No. 1, 18–31 (2017).
P. Panda, A. Sengupta, and K. Roy, “Conditional deep learning for energy-efficient and enhanced pattern recognition,” in: Proceedings of IEEE Design, Automation & Test in Europe Conference & Exhibition (2016), pp. 475–480.
O. M. Parkhi, A. Vedaldi, and A. Zisserman, “Deep face recognition, in: Proceedings of the British Machine Vision Conference (BMVC), Vol. 3 (2015).
R. Rothe, R. Timofte, and L. Van Gool, “DEX: Deep expectation of apparent age from a single image,” in: Proceedings of the IEEE International Conference on Computer Vision Workshops (2015), pp. 10–15.
A. V. Savchenko, Search Techniques in Intelligent Classification Systems, Springer (2016).
A. V. Savchenko, “Maximum-likelihood dissimilarities in image recognition with deep neural networks,” Computer Optics, 41, No. 3, 422–430 (2017).
A. V. Savchenko, “Efficient facial representations for age, gender and identity recognition in organizing photo albums using multi-output ConvNet,” PeerJ Computer Science, 5, e197 (2019).
A. V. Savchenko, “Sequential three-way decisions in multi-category image recognition with deep features based on distance factor,” Information Sciences, 489, 18–36 (2019).
A. V. Savchenko, “Probabilistic neural network with complex exponential activation functions in image recognition,” IEEE Transactions on Neural Networks and Learning Systems, 31, No. 2, 651–660 (2020).
A. V. Savchenko, “Sequential analysis with specified confidence level and adaptive convolutional neural networks in image recognition,” in: Proceedings of the IEEE International Joint Conference on Neural Networks (IJCNN) (2020).
F. Schroff, D. Kalenichenko, and J. Philbin, “FaceNet: A unified embedding for face recognition and clustering,” in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2015), pp. 815–823.
S. C. Schwartz, “Estimation of probability density by an orthogonal series,” The Annals of Mathematical Statistics, 1261–1265 (1967).
D. F. Specht, “Probabilistic neural networks,” Neural Networks, 3, No. 1, 109–118 (1990).
S. Teerapittayanon, B. McDanel,and H. T. Kung, “BranchyNet: Fast inference via early exiting from deep neural networks,” in: Proceedings of the 23rd IEEE International Conference on Pattern Recognition (ICPR) (2016), pp. 2464–2469.
A. Wald, Sequential Analysis, Dover Publications, New York (2013).
Y. Y. Yao and X. F. Deng, “Sequential three-way decisions with probabilistic rough sets,” in: Proceedings of ICCI*CC, IEEE Computer Society (2011), pp. 120–125.
Z. Zhang, Y. Song, and H. Qi, “Age progression/regression by conditional adversarial autoencoder,” in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017), pp. 5810–5818.
Author information
Authors and Affiliations
Corresponding author
Additional information
Translated from Zapiski Nauchnykh Seminarov POMI, Vol. 499, 2021, pp. 267–283.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Savchenko, A.V. Fast Image Classification Algorithms Based on Sequential Analysis. J Math Sci 273, 628–638 (2023). https://doi.org/10.1007/s10958-023-06524-9
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10958-023-06524-9