Abstract
Face detection often plays the first step in various visual applications. Large variants of facial deformations due to head movements and facial expression make it difficult to identify appropriate face region. In this paper, a robust real-time face alignment system, including facial landmarks detection and face rectification, is proposed. A facial landmarks detection model based on regression tree is utilized in the proposed system. In face rectification framework, 2-D geometrical analysis based on pitch, yaw and roll movements is designed to solve the misalignment problem in face detection. The experiments on the two datasets verify the performance significantly improved by the proposed method in the facial recognition task and outperform than those obtained by other alignment methods. Furthermore, the proposed method can achieve robust recognition results even if the amount of training images is not large.
Similar content being viewed by others
References
Ahonen T, Hadid A, Pietikainen M (2006) Face description with local binary patterns: application to face recognition. IEEE Trans Pattern Anal Mach Intell 28(12):2037–2041
Brunelli R, Poggio T (1993) Face recognition: features versus templates. IEEE Trans Pattern Anal Mach Intell 15(10):1042–1052
Chen, Jun-Cheng, Vishal M. Patel, and Rama Chellappa. “Unconstrained face verification using deep cnn features.” In2016 IEEE winter conference on applications of computer vision (WACV), pp. 1–9. IEEE, (2016)
Cootes T, Taylor C, Cooper D, Graham J (1995) Active shape models their training and application. Comput Vis Image Underst 61(01):389
Cootes TF, Edwards GJ, Taylor CJ (2001) Active appearance models. IEEE Trans Pattern Anal Mach Intell 23(6):681–685
Cover T, Hart P (1967) Nearest neighbor pattern classification. IEEE Trans Inf Theory 13(1)
N Dalal and B Triggs, “histograms of oriented gradients for human detection,” in IEEE Conference on Computer Vision and Pattern Recognition, 2005
Dapogny, A., Bailly, K., and Cord, M. (2019). DeCaFA: deep convolutional Cascade for face alignment in the wild. In proceedings of the IEEE international conference on computer vision (pp. 6893-6901).
Ding C, Tao D (2015) Robust face recognition via multimodal deep face representation. IEEE Trans Multimedia 17(11):2049–2058
Y Freund and R E Schapire, “A decision-theoretic generalization of on-line learning and an application to boosting,” in Proceedings of the 2nd European Conference on Computational Learning Theory, 1995.
Gao W, Cao B, Shan S, Chen X, Zhou D, Zhang X, Zhao D (2008) The cas-peal large-scale chinese face database and baseline evaluations. IEEE Trans Syst Man Cybern Syst Hum 38(1):149–161
J. Ho, M.H. Y. M.H. Yang, J. L. J. Lim, K.C. L. K.C. Lee, and D. Kriegman, “Clustering appearances of objects under varying illumination conditions,” Proceedings. 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 11–18, (2003).
GB Huang, V Jain, and EG Learned-Miller, “Unsupervised joint alignment of complex images,” 2007 IEEE 11th International Conference on Computer Vision, pp. 1–8, (2007)
GB Huang, M Ramesh, T Berg, and E Learned-Miller, “Labeled faces in the wild: A database for studying face recognition in unconstrained environments,” University of Massachusetts, Amherst, Tech Rep 07-49, (2007)
Jiang, Huaizu, and Erik Learned-Miller. “Face detection with the faster R-CNN.” In 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017), pp. 650–657. IEEE, (2017)
V. Kazemi and J. Sullivan, “One millisecond face alignment with an ensemble of regression trees,” in 2014 IEEE Conference on Computer Vision and Pattern Recognition, June (2014), pp. 1867–1874.
C. Kotropoulos and I. Pitas, “Rule-based face detection in frontal views,” in 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 4, April (1997), pp. 2537–2540 vol.4.
A. Krizhevsky, I. Sutskever, and G. E. Hinton, “Imagenet classification with deep convolutional neural networks,” in Proceedings of the 25th International Conference on Neural Information Processing Systems - Volume 1, ser. NIPS’12. USA: Curran Associates Inc., (2012), pp. 1097–1105. [Online]. Available: http://dl.acm.org/citation.cfm?id=2999134.2999257
A. Lanitis, C. Taylor, and T. Cootes, “Automatic face identification system using flexible appearance models,” Image and Vision Computing, vol. 13, no. 5, pp. 393–401, (1995), 5th British Machine Vision Conference
Li H, Wang P, Shen C (2010) Robust face recognition via accurate face alignment and sparse representation. In: 2010 International Conference on Digital Image Computing: Techniques and Applications, pp 262–269
H. Li, P. Wang, and C. Shen, “Robust Face Recognition via Accurate Face Alignment and Sparse Representation,” 2010 International Conference on Digital Image Computing: Techniques and Applications, pp. 262–269, (2010).
Lowe DG (2004) Distinctive image features from scale-invariant key points. IJCV 60(2):91–110
Lu H, Yang F (2014) Active Shape Model and Its Application to Face Alignment. Springer, Berlin, Heidelberg, pp 1–31
AM Martinez and R Benavente, “The AR face database,” CVC Technical Report 24, (1998).
Murphy-Chutorian E, Trivedi MM (2009) Head pose estimation in computer vision: a survey. IEEE Trans Pattern Anal Mach Intell 31(4):607–626
Y. Taigman, M. Yang, M. Ranzato, and L. Wolf, “Deepface: closing the gap to human-level performance in face verification,” in The IEEE conference on computer vision and pattern recognition (CVPR), (2014)
Thomaz CE, Giraldi GA (2010) A new ranking method for principal components analysis and its application to face image analysis. Image Vis Comput 28(6):902–913 [Online]. Available: http://www.sciencedirect.com/science/article/pii/S0262885609002613
Viola P, Jones MJ (2004) Robust real-time face detection. Int J Comput Vis 57(2):137–154
Wright J, Yang AY, Ganesh A, Sastry SS, Ma Y (2009) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 31(2):210–227
Wu X, He R, Sun Z, Tan T (2018) A light cnn for deep face representation with noisy labels. IEEE Transactions on Information Forensics and Security 13(11):2884–2896
Xiao, Shengtao, Jiashi Feng, Junliang Xing, Hanjiang Lai, Shuicheng Yan, and Ashraf Kassim. “Robust facial landmark detection via recurrent attentive-refinement networks.” In European conference on computer vision, pp. 57–72. Springer, Cham, (2016)
Zhang K, Zhang Z, Li Z, Yu Q (2016) Joint face detection and alignment using multitask cascaded convolutional networks. IEEE Signal Processing Letters 23(10):1499–1503
Zhao Y, Tang F, Dong W, Huang F, Zhang X (2019) Joint face alignment and segmentation via deep multi-task learning. Multimed Tools Appl 78(10):13131–13148
Acknowledgements
This work was supported in part by the Australian Research Council (ARC) under Grant DP180100670 and Grant DP180100656, in part by the U.S. Army Research Laboratory under Agreement W911NF-10-2-0022, and in part by the Taiwan Ministry of Science and Technology under Grant MOST 106-2218-E-009-027-MY3.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Chou, K.P., Prasad, M., Yang, J. et al. A robust real-time facial alignment system with facial landmarks detection and rectification for multimedia applications. Multimed Tools Appl 80, 16635–16657 (2021). https://doi.org/10.1007/s11042-020-09216-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-020-09216-7