Abstract
Nowadays, face recognition using video surveillance systems becomes one of the active research topics in security domains. Security plays a significant role in everyday life for secure and sustainable developments of smart cities. The conventional techniques provide efficient recognition results only when the faces are captured with complete face images. However, they suffer to handle large pose variation images extracted from video sequences. Therefore, to deal with this issue, this paper specially designed a multidimensional face recognition model to recognize faces under multiple pose variations and angles. Three face video databases, namely facesurv database, IARPA Janus benchmark database and McGill database, are utilized for experimental evaluation. The videos of these three databases are converted into number of image frames through background subtraction process. From the image frames, the large pose variation images with different angles are identified and selected to process further. The video recorded under dynamic environment conditions diminishes recognition performance, so the image frames are processed through several preprocessing pipelines. The preprocessed images are then fed into the proposed optimal mask region-based convolutional neural network with modified short-term memory (OMRCNN-MBiLSTM) model, which learns the facial patterns present in the images more efficiently. The feature vectors learned by the proposed classifier are matched with the input face database to determine the identity of the person. With the ability to handle multiview and large pose variations, the proposed model accurately recognizes faces. The simulation result manifests the superiority of proposed model over other existing methods.
Similar content being viewed by others
Data availability
The data that support the findings of this study are available from the corresponding author upon reasonable request.
References
Mliki H, Dammak S, Fendri E (2020) An improved multi-scale face detection using convolutional neural network. SIViP 14(7):1345–1353
Zhao H, Shi Y, Tong X, Wen J, Ying X, Zha H (2021) G-FAN: graph-based feature aggregation network for video face recognition. In: 2020 25th international conference on pattern recognition (ICPR) IEEE, pp 1672–1678
Loey M, Manogaran G, Taha MHN, Khalifa NEM (2021) A hybrid deep transfer learning model with machine learning methods for face mask detection in the era of the COVID-19 pandemic. Measurement 167:108288
Zhang L, Sun L, Yu L, Dong X, Chen J, Cai W, Wang C, Ning X (2021) ARFace: attention-aware and regularization for face recognition with reinforcement learning. IEEE Trans Biom, Behav Identity Sci 4(1):30–42
Jain DK, Zhang Z, Huang K (2020) Multi angle optimal pattern-based deep learning for automatic facial expression recognition. Pattern Recogn Lett 139:157–165
Gao G, Yu Y, Yang M, Chang H, Huang P, Yue D (2020) Cross-resolution face recognition with pose variations via multilayer locality-constrained structural orthogonal procrustes regression. Inf Sci 506:19–36
Li B, Lima D (2021) Facial expression recognition via ResNet-50. Int J Cognit Comput Eng 2:57–64
Ben Fredj H, Bouguezzi S, Souani C (2021) Face recognition in unconstrained environment with CNN. Vis Comput 37(2):217–226
Plichoski GF, Chidambaram C, Parpinelli RS (2021) A face recognition framework based on a pool of techniques and differential evolution. Inf Sci 543:219–241
Kumar T, Bhushan S, Jangra S (2021) An improved biometric fusion system of fingerprint and face using whale optimization. Int J Adv Comput Sci Appl. https://doi.org/10.14569/IJACSA.2021.0120176
Balasubramanian K, Nalligoundenpalayam Periyasamy A, Kishore R (2023) Modified spider monkey optimization algorithm based feature selection and probabilistic neural network classifier in face recognition. Expert Syst 40:e13088
Naseri RAS, Kurnaz A, Farhan HM (2023) Optimized face detector-based intelligent face mask detection model in IoT using deep learning approach. Appl Soft Comput 134:109933
Zhang Y, Yan L (2023) Face recognition algorithm based on particle swarm optimization and image feature compensation. SoftwareX 22:101305
Benradi H, Chater A, Lasfar A (2023) A hybrid approach for face recognition using a convolutional neural network combined with feature extraction techniques. IAES Int J Artif Intell 12(2):627
Yu C, Zhang Z, Li H, Sun J, Xu Z (2023) Meta-learning-based adversarial training for deep 3D face recognition on point clouds. Pattern Recogn 134:109065
Cheng WC, Hsiao HC, Li LH (2023) Deep learning mask face recognition with annealing mechanism. Appl Sci 13(2):732
Chandrasekhar T, Kumar CS (2023) Neural network optimization-based facial geometric key homomorphic cloud security. Int J Intell Syst Appl Eng 11(1):277–286
Bah SM, Ming F (2020) An improved face recognition algorithm and its application in attendance management system. Array 5:100014
Ahmed S, Frikha M, Hussein TDH, Rahebi J (2021) Optimum feature selection with particle swarm optimization to face recognition system using Gabor wavelet transform and deep learning. BioMed Res Int 2021:6621540
Mishra NK, Singh SK (2021) Face recognition using 3d cnns. In: Verma GK, Soni B, Bourennane S, Ramos ACB (eds) Data science. Springer, Berlin, pp 279–293
Shirley CP, Ram Mohan NR, Chitra B (2021) Gravitational search-based optimal deep neural network for occluded face recognition system in videos. Multidimens Syst Signal Process 32(1):189–215
Bahroun S, Abed R, Zagrouba E (2021) KS-FQA: keyframe selection based on face quality assessment for efficient face recognition in video. IET Image Proc 15(1):77–90
Zheng G, Xu Y (2021) Efficient face detection and tracking in video sequences based on deep learning. Inf Sci 568:265–285
Soni N, Sharma EK, Kapoor A (2021) Novel BSSSO-based deep convolutional neural network for face recognition with multiple disturbing environments. Electronics 10(5):626
Medapati PK, Tejo Murthy PHS, Sridhar KP (2020) LAMSTAR: For IoT-based face recognition system to manage the safety factor in smart cities. Trans Emerg Telecommun Technol 31(12):e3843
Gupta S, Gupta N, Ghosh S, Singh M, Nagpal S, Vatsa M, Singh R (2019) FaceSurv: a benchmark video dataset for face detection and recognition across spectra and resolutions. In: 2019 14th IEEE international conference on automatic face & gesture recognition IEEE, pp 1–7.
Klare BF, Klein B, Taborsky E, Blanton A, Cheney J, Allen K, Grother P, Mah A, Jain AK (2015) Pushing the frontiers of unconstrained face detection and recognition: Iarpa janus benchmark a. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1931–1939.
Demirkus M, Precup D, Clark JJ, Arbel T (2015) Hierarchical temporal graphical model for head pose estimation and subsequent attribute classification in real-world videos. Comput Vis Image Underst 136:128–145
Alkhalid FF (2020) Online preprocessing of gesture signs using background substructure and edge detection algorithms. Int J Simul Syst SciTechnol 21(2):1–6
Zhou Z, Zhang M, Chen J, Wu X (2020) Detection and classification of multi-magnetic targets using mask-RCNN. IEEE Access 8:187202–187207
Khan MA, Akram T, Zhang YD, Sharif M (2021) Attributes based skin lesion detection and recognition: a mask RCNN and transfer learning-based deep learning framework. Pattern Recogn Lett 143:58–66
Yu D, Wang L, Chen X, Chen J (2021) Using BiLSTM with attention mechanism to automatically detect self-admitted technical debt. Front Comp Sci 15(4):1–12
Enriquez EAT, Mendoza RG, Velasco ACT (2022) Philippine eagle optimization algorithm. IEEE Access 10:29089–29120
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Human and animal rights
This article does not contain any studies with human or animal subjects performed by any of the authors.
Informed consent
Informed consent was obtained from all individual participants included in the study.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Jayabharathi, P., Suresh, A. POC-net: pelican optimization-based convolutional neural network for recognizing large pose variation from video. Neural Comput & Applic 35, 24091–24107 (2023). https://doi.org/10.1007/s00521-023-08953-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-023-08953-8