Abstract
Identity-independent factors, such as variations of pose, expression, illumination, etc., are the key challenges in face recognition. To avoid the effects of these factors, existing face recognition methods usually adopt two approaches: pose-invariant face feature extracting and face normalization before feature extraction. Contrary to these, we propose a single deep model jointly performing face normalization and representation learning tasks for face recognition, named normalization and reconstruction general adversarial network (NRGAN). First, the unified NRGAN model can boost the performance of the two tasks for each other. Second, NRGAN can synthesize normalized face images without the requirement of paired data, which makes our method have better generalization ability to the uncontrolled environment. Third, a factor-invariant identity disentanglement training strategy is introduced to decouple the identity feature representation from other factors without using any of these factors’ labels. Extensive experiment results on four currently popular face datasets demonstrate the effectiveness of NRGAN on both normalized face synthesis and face recognition tasks.
Similar content being viewed by others
Data availability statement
The data that support the findings of this study are available on request from the corresponding author. The data are not publicly available due to privacy or ethical restrictions.
References
Taigman Y, Yang M, Ranzato MA, Wolf L (2014) Deepface: closing the gap to human-level performance in face verification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1701–1708
Schroff F, Kalenichenko D, Philbin J (2015) FaceNet: a unified embedding for face recognition and clustering. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 815–823
Li J, Zhao J, Zhao F, Liu H, Sim T (2016) Robust face recognition with deep multi-view representation learning. In: The 2016 ACM
Zhao J et al (2017) Dual-agent GANs for photorealistic and identity preserving profile face synthesis. In: Neural information processing systems (NIPS)
Wang H, Wang Y, Zhou Z, Ji X, Gong D, Zhou J, Li Z, Liu W (2018) CosFace: large margin cosine loss for deep face recognition. Presented at the CVPR
Deng J, Guo J, Zafeiriou S (2019) ArcFace: additive angular margin loss for deep face recognition. In: Presented at the 2019 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
Zhao J, Xing J, Xiong L, Yan S, Feng J (2020) Recognizing profile faces by imagining frontal view. Int J Comput Vis 128(2):460–478
Jian Z et al (2018) Towards pose invariant face recognition in the wild. In: 2018 IEEE/CVF conference on computer vision and pattern recognition, Salt Lake City, UT, USA, pp 2207–2216
Luan T, Yin X, Liu X (2017) Disentangled representation learning GAN for pose-invariant face recognition. In: Vision computer, recognition pattern. Honolulu , USA, HI, pp 1283–1292
Xiangyu Z, Lei Z, Junjie Y, Yi D, Li SZ (2015) High-fidelity pose and expression normalization for face recognition in the wild. In: IEEE conference on Computer vision and pattern recognition (CVPR), pp 787–796
Sagonas C, Panagakis Y, Zafeiriou S, Pantic M (2015) Robust statistical face frontalization. In: IEEE International conference on computer vision (ICCV), Santiago, Chile, pp 3871–3879
Huang R, Zhang S, Li T, He R (2017) Beyond face rotation: global and local perception GAN for photorealistic and identity preserving frontal view synthesis. In: 2017 IEEE international conference on computer vision (ICCV), Venice, Italy, pp 2458–2467
Yin X, Yu X, Sohn K, Liu X, Chandraker M (2017) Towards large-pose face frontalization in the wild. In: 2017 IEEE international conference on computer vision (ICCV), Venice, Italy, pp 4010–4019
Hu Y, Wu X, Yu B, He R, Sun Z (2018) Pose-guided photorealistic face rotation. In: 2018 IEEE/CVF conference on computer vision and pattern recognition, Salt Lake City, UT, USA, pp 8398–8406
Qian Y, Deng W, Hu J (2019) Unsupervised face normalization with extreme pose and expression in the wild. In: Presented at the CVPR
Zhang Z et al (2021) Semi-supervised face frontalization in the wild. IEEE Trans Inf Forensics Secur 16:909–922
Ahonen T, Hadid A, Pietikainen M (2006) Face description with local binary patterns: application to face recognition. IEEE Trans Pattern Anal Mach Intell 28(12):2037–2041
Tan H, Yang B, Ma Z (2013) Face recognition based on the fusion of global and local HOG features of face images. IET Comput Vis 8(3):224–234
Bicego M, Lagorio A, Grosso E, Tistarelli M (2006) On the use of SIFT features for face authentication. In: 2006 Conference on computer vision and pattern recognition workshop (CVPRW’06), pp 35–35
Weinberger KQ, Saul LK (2009) Distance metric learning for large margin nearest neighbor classification. J Mach Learn Res 10(1):207–244
Chen D, Cao X, Wen F, Sun J (2013) Blessing of dimensionality: high-dimensional feature and its efficient compression for face verification. In: Computer vision and pattern recognition, pp 3025–3032
Sun Y, Wang X, Tang X (2013) Deep learning face representation from predicting 10,000 classes. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1891–1898
Sun Y, Wang X, Tang X (2014) Deep learning face representation by joint identification-verification. arXiv preprint arXiv:1406.4773
Sun Y, Liang D, Wang X, Tang X (2015) DeepID3: face recognition with very deep neural networks. Comput Sci
Peng X, Yu X, Sohn K, Metaxas DN, Chandraker M (2017) Reconstruction-based disentanglement for pose-invariant face recognition. In: IEEE international conference on computer vision (ICCV), pp 1632–1641
Liu W, Wen Y, Yu Z, Li M, Raj B, Song L (2017) SphereFace: deep hypersphere embedding for face recognition. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 6738–6746
Huang Y et al (2020) CurricularFace: adaptive curriculum learning loss for deep face recognition. In: IEEE/CVF conference on computer vision and pattern recognition (CVPR), pp 5900–5909
Bao J, Chen D, Wen F, Li H, Hua G (2018) Towards open-set identity preserving face synthesis. In: 2018 IEEE/CVF conference on computer vision and pattern recognition, Salt Lake City, UT, USA, pp 6713–6722
Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y (2014) Generative adversarial nets. In: International conference on neural information processing systems, pp 2672–2680
Guo Y, Zhang L, Hu Y, He X, Gao J (2016) MS-Celeb-1M: a dataset and benchmark for large-scale face recognition. In: European conference on computer vision, pp 87–102
Radford A, Metz L, Chintala S (2015) Unsupervised representation learning with deep convolutional generative adversarial networks. Comput Sci 11/19
Gross R, Matthews I, Cohn J, Kanade T, Baker S (2008) Multi-PIE. In: 8th IEEE international conference on automatic face and gesture recognition (FG) pp 1–8
Yi D, Lei Z, Liao S, Li SZ (2014) Learning face representation from scratch. Comput Sci
Huang GB, Mattar M, Berg T, Learned-Miller E (2008) Labeled faces in the wild: a database for studying face recognition in unconstrained environments. In: Workshop on faces in ‘real-life’ images: detection, alignment, and recognition
Klare BF, Klein B, Taborsky E, Blanton A (2015) Pushing the frontiers of unconstrained face detection and recognition: IARPA Janus Benchmark A, pp 1931–1939
Sengupta S, Chen J, Castillo C, Patel VM, Chellappa R, Jacobs DW (2016) Frontal to profile face verification in the wild. In: IEEE winter conference on applications of computer vision (WACV), pp 1–9
Zhu Z, Luo P, Wang X, Tang X (2013) Deep learning identity preserving face space. In: Proceedings of the ICCV, vol 1, p 2
Jourabloo A, Liu X (2017) Pose-invariant face alignment via CNN-based dense 3D model fitting. Int J Comput Vis 124:187–203
Zhang K, Zhang Z, Li Z, Qiao Y (2016) Joint face detection and alignment using multi-task cascaded convolutional networks. IEEE Signal Process Lett (SPL) 23(10):1499–1503
Kingma D, Ba J (2014) Adam: a method for stochastic optimization. Comput Sci
Abadi M et al (2016) TensorFlow: a system for large-scale machine learning. arXiv:1605.08695
Heusel M, Ramsauer H, Unterthiner T, Nessler B, Hochreiter S (2017) GANs trained by a two time-scale update rule converge to a local nash equilibrium. ArXiv pre-print, p 6629
Zhou W, Bovik AC, Sheikh HR, Simoncelli EP (2004) Image quality assessment: from error visibility to structural similarity. IEEE Trans Image Process 13(4):600–612
Hu C, Feng Z, Wu X, Kittler J (2020) Dual encoder–decoder based generative adversarial networks for disentangled facial representation learning. IEEE Access 8:130159–130171
Yu X, Shiri F, Ghanem B, Porikli F (2020) Can we see more? Joint frontalization and hallucination of unaligned tiny faces. IEEE Trans Pattern Anal Mach Intell 42(9):2148–2164
Tu X et al (2022) Joint face image restoration and frontalization for recognition. IEEE Trans Circuits Syst Video Technol 32(3):1285–1298
Junho Y, Heechul J, ByungIn Y, Changkyu C, Dusik P, Junmo K (2015) Rotating your face using multi-task deep neural network. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 676–684
Luan X, Zheng J, Li W (2021) Learning unsupervised face normalization through frontal view reconstruction. IEEE Trans Circuits Syst Video Technol 32:5201–5212
Zhu Z, Luo P, Wang X, Tang X (2014) Multi-view perceptron: a deep model for learning face identity and view representations. In: Advances in neural information processing systems
Khan A, Wahab N (2016) Deep residual learning for image recognition. In: Presented at the CVPR
Hassner T, Harel S, Paz E, Enbar R (2015) Effective face frontalization in unconstrained images. In: 2015 IEEE conference on computer vision and pattern recognition (CVPR), Boston, MA, USA, pp 4295–4304
Zhang Z, Chen X, Wang B, Hu G, Zuo W, Hancock ER (2019) Face frontalization using an appearance-flow-based convolutional neural network. IEEE Trans Image Process 28(5):2187–2199
Rong C, Zhang X, Lin Y (2020) Feature-improving generative adversarial network for face frontalization. IEEE Access 4:1–11
Masi I, Rawls S, Medioni G, Natarajan P (2016) Pose-aware face recognition in the wild. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 4838–4846
Luo M, Cao J, Ma X, Zhang X, He R (2021) FA-GAN: face augmentation GAN for deformation-invariant face recognition. IEEE Trans Inf Forensics Secur 16:2341–2355
Chen J, Zheng J, Patel VM, Chellappa R (2016) Fisher vector encoded deep convolutional features for unconstrained face verification. In: IEEE international conference on image processing (ICIP), pp 2981–2985
Leibe B, Matas J, Sebe N, Welling M (2016) A discriminative feature learning approach for deep face recognition. In: Presented at the European conference on computer vision
Wang JCF, Liu W, Liu H (2018) Additive margin softmax for face verification. IEEE Signal Process Lett 25(7):926–930
Zhang L et al (2022) ARFace: attention-aware and regularization for face recognition with reinforcement learning. IEEE Trans Biom Behav Ident Sci 4(1):30–42
van der Maaten L, Hinton G (2008) Viualizing data using t-SNE. J Mach Learn Res 9:2579–2605
Funding
This research is sponsored by the Natural Science Foundation of Chongqing, China (Grant No. CSTB2022NSCQ-MSX0996), the key project of science and technology research program of Chongqing Education Commission of China (No. KJZD-K202301102), and the Natural Science Foundation of Chongqing, China (No. CSTB2023NSCQ-LZX0068).
Author information
Authors and Affiliations
Contributions
Yanfei Liu: Conceptualization, methodology, writing—original draft preparation. Junhua chen: Software, writing—review and editing. Yuanqian Li: Data curation, validation. Tianshu Wu: Investigation. Hao Wen: Writing—review and editing.
Corresponding author
Ethics declarations
Conflict of interest
We declare that we have no financial and personal relationships with other people or organizations that can inappropriately influence our work, there is no professional or other personal interest of any nature or kind in any product, service, and/or company that could be construed as influencing the position presented in, or the review of the manuscript entitled “Joint Face Normalization and Representation Learning for Face Recognition”.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Liu, Y., Chen, J., Li, Y. et al. Joint face normalization and representation learning for face recognition. Pattern Anal Applic 27, 64 (2024). https://doi.org/10.1007/s10044-024-01255-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10044-024-01255-2