Abstract
Computer vision based on deep learning has developed rapidly in recent years. As an important branch of this area, face recognition has made great progress: the state of the art has achieved 99.77% [1] pair-wise verification accuracy on the LFW dataset. However, face data in real application environments, such as security checks at stations and bank account opening, are much more complex than LFW because of occlusion, pose variation, uneven illumination, differing resolutions, and so on. In addition, the LFW dataset consists mostly of Western faces and contains few faces from other regions. Because faces from different regions do not follow a consistent distribution, such methods often fail to achieve high recognition accuracy in practice. In this paper, focusing on Asian faces, we propose a multiple-step model training method based on a CNN for real-scene face recognition when large amounts of appropriate data are unavailable. Each step of the training process plays an important role. Step 1 mainly enhances the generalization ability of the model by using a large-scale dataset drawn from different sources. Step 2 improves the specificity of the model by using a smaller dataset whose distribution is closer to that of the real scene. In the final step, metric learning is used to make the model more discriminative and expressive. In addition, strategies including data cleaning, data augmentation, and data balancing are used to improve overall performance. Experiments show that this method achieves high performance for face recognition in real application scenes.
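As an illustration only, two of the ingredients named in the abstract can be sketched in a few lines of Python. The abstract does not state which metric-learning objective or which balancing scheme is used, so the triplet loss, the margin of 0.2, and the inverse-frequency weighting below are assumptions chosen as common examples, and all function names are hypothetical.

```python
import math
from collections import Counter


def l2_normalize(vec):
    """Scale a feature vector to unit Euclidean length."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def triplet_loss(anchor, positive, negative, margin=0.2):
    """Triplet loss on L2-normalized embeddings (assumed objective).

    The loss is zero once the anchor is closer to the positive
    (same identity) than to the negative (different identity)
    by at least `margin`; the margin value is an assumption.
    """
    a, p, n = (l2_normalize(v) for v in (anchor, positive, negative))
    return max(0.0, squared_distance(a, p) - squared_distance(a, n) + margin)


def balanced_sample_weights(labels):
    """Per-sample sampling weights that give every identity equal total mass.

    This inverse-frequency scheme is one possible reading of
    "data balancing"; the weights sum to 1 over the dataset.
    """
    counts = Counter(labels)
    return [1.0 / (len(counts) * counts[label]) for label in labels]
```

For instance, `balanced_sample_weights(["a", "a", "a", "b"])` gives each of the three "a" images a weight of 1/6 and the single "b" image a weight of 1/2, so both identities are drawn equally often. In a multiple-step schedule of the kind described, such a loss would typically be applied only in the last stage, after the classification-based training on the large and the scene-specific datasets.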
References
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. CoRR, https://arxiv.org/abs/1512.03385 (2015)
Redmon, J., Divvala, S.K., Girshick, R.B., Farhadi, A.: You only look once: unified, real-time object detection. CoRR, https://arxiv.org/abs/1506.02640 (2015)
Sun, Y., Chen, Y., Wang, X., Tang, X.: Deep learning face representation by joint identification-verification. Proc. Adv. Neural Inf. Process. Syst. 27, 1988–1996 (2014)
Sun, Y., Wang, X., Tang, X.: Deep convolutional network cascade for facial point detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3476–3483 (2013)
Sun, Y., Wang, X., Tang, X.: Deeply learned face representations are sparse, selective, and robust. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2892–2900 (2015)
Taigman, Y., Yang, M., Ranzato, M., Wolf, L.: Deepface: closing the gap to human-level performance in face verification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1701–1708 (2014)
Taigman, Y., Yang, M., Ranzato, M., Wolf, L.: Web-scale training for face identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2746–2754 (2015)
Wen, Y., Zhang, K., Li, Z., Qiao, Y.: A discriminative feature learning approach for deep face recognition. In: Proceedings of the European Conference on Computer Vision, pp. 499–515. Springer (2016)
Yi, D., Lei, Z., Liao, S., Li, S.Z.: Learning face representation from scratch. CoRR, https://arxiv.org/abs/1411.7923 (2014)
Chen, D., Cao, X., Wen, F., Sun, J.: Blessing of dimensionality: high-dimensional feature and its efficient compression for face verification. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3025–3032. IEEE (2013)
Cao, X., Wipf, D., Wen, F., Duan, G.: A practical transfer learning algorithm for face verification. In: International Conference on Computer Vision (ICCV) (2013)
Huang, G.B., Ramesh, M., Berg, T., Learned-Miller, E.: Labeled faces in the wild: a database for studying face recognition in unconstrained environments. Technical Report 07-49, University of Massachusetts, Amherst, October 2007
Parkhi, O.M., Vedaldi, A., Zisserman, A.: Deep face recognition. In: Proceedings of the British Machine Vision Conference (2015)
Schroff, F., Kalenichenko, D., Philbin, J.: Facenet: a unified embedding for face recognition and clustering. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 815–823 (2015)
Zhou, E., Cao, Z., Yin, Q.: Naive-deep face recognition: touching the limit of LFW benchmark or not? Technical report, arXiv:1501.04690 (2015)
Sukhbaatar, S., Fergus, R.: Learning from noisy labels with deep neural networks. CoRR, https://arxiv.org/abs/1406.2080 (2014)
Reed, S., Lee, H., Anguelov, D., Szegedy, C., Erhan, D., Rabinovich, A.: Training deep neural networks on noisy labels with bootstrapping. CoRR, https://arxiv.org/abs/1412.6596 (2014)
Wu, X., He, R., Sun, Z., et al.: A light CNN for deep face representation with noisy labels. Computer Science (2016)
Wu, R., Yan, S., Shan, Y., et al.: Deep image: scaling up image recognition. arXiv preprint arXiv:1501.02876, 22, 388 (2015)
Dai, W., Yang, Q., Xue, G.R., et al.: Boosting for transfer learning. In: International Conference on Machine Learning, pp. 193–200. ACM (2007)
Acknowledgements
The authors of this paper are members of the Shanghai Engineering Research Center of Intelligent Video Surveillance. Our research was sponsored by the following projects: the National Natural Science Foundation of China (61403084, 61402116); the Program of the Science and Technology Commission of Shanghai Municipality (Nos. 15530701300, 15XD15202000); the 2012 IoT Program of the Ministry of Industry and Information Technology of China; the Key Project of the Ministry of Public Security (No. 2014JSYJA007); the Project of the Key Laboratory of Embedded System and Service Computing, Ministry of Education, Tongji University (ESSCKF 2015-03); and the Shanghai Rising-Star Program (17QB1401000).
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Li, D., Zhang, X., Song, L., Zhao, Y. (2018). Multiple-Step Model Training for Face Recognition. In: Abawajy, J., Choo, KK., Islam, R. (eds) International Conference on Applications and Techniques in Cyber Security and Intelligence. ATCI 2017. Advances in Intelligent Systems and Computing, vol 580. Springer, Cham. https://doi.org/10.1007/978-3-319-67071-3_21
DOI: https://doi.org/10.1007/978-3-319-67071-3_21
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-67070-6
Online ISBN: 978-3-319-67071-3
eBook Packages: Engineering (R0)