Advertisement

A Deep Learning Approach to Joint Face Detection and Segmentation

Chapter

Abstract

Robust face detection and facial segmentation are crucial pre-processing steps to support facial recognition, expression analysis, pose estimation, building of 3D facial models, etc. In previous approaches, the process of face detection and facial segmentation are usually implemented as sequential, mostly separated modules. In these methods, face detection algorithms are usually first implemented so that facial regions can be located in given images. Segmentation algorithms are then carried out to find the facial boundaries and other facial features, such as the eyebrows, eyes, nose, mouth, etc. However, both of these tasks are challenging due to numerous variations of face images in the wild, e.g. facial expressions, illumination variations, occlusions, resolution, etc. In this chapter, we present a novel approach to detect human faces and segment facial features from given images simultaneously. Our proposed approach performs accurate facial feature segmentation and demonstrates its effectiveness on images from two challenging face databases, i.e. Multiple Biometric Grand Challenge (MBGC) and Labeled Faces in the Wild (LFW).

Keywords

Graphic Processing Unit Facial Feature Face Detection Convolutional Neural Network Facial Boundary 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

  1. 1.
    B. Hariharan, P. Arbeláez, R. Girshick, J. Malik, Simultaneous detection and segmentation, in European Conference on Computer Vision (ECCV) (2014)Google Scholar
  2. 2.
    P. Arbelaez, J. Pont-Tuset, J. Barron, F. Marques, J. Malik, Multiscale combinatorial grouping, in Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) (2014)Google Scholar
  3. 3.
    Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, R. Girshick, S. Guadarrama, T. Darrell, Caffe: Convolutional Architecture for Fast Feature Embedding (2014). arXiv preprint arXiv:1408.5093Google Scholar
  4. 4.
    D.M.J. Tax, One-class classification, Ph.D. thesis, Delft University of Technology, June 2001Google Scholar
  5. 5.
    K. Seshadri, M. Savvides, Robust modified active shape model for automatic facial landmark annotation of frontal faces, in Proceedings of the 3rd IEEE International Conference on Biometrics: Theory, Applications and Systems (BTAS) (2009), pp. 319–326Google Scholar
  6. 6.
    K. Luu, T.H.N. Le, K. Seshadri, M. Savvides, Facecut - a robust approach for facial feature segmentation, in ICIP (2012), pp. 1841–1848Google Scholar
  7. 7.
    V. Vezhnevets, V. Konouchine, “GrowCut” - interactive multi-label N-D image segmentation by cellular automata, in Proceedings of the International Conference on Computer Graphics and Vision (GRAPHICON) (2005)Google Scholar
  8. 8.
    P.J. Phillips, P.J. Flynn, J.R. Beveridge, W.T. Scrugs, A.J. O’Toole, D. Bolme, K.W. Bowyer, B.A. Draper, G.H. Givens, Y.M. Lui, H. Sahibzada, Joseph A. Scallan III, S. Weimer, Overview of the multiple biometrics grand challenge, in Proceedings of the 3 rd IAPR/IEEE International Conference on Biometrics (2009), pp. 705–714Google Scholar
  9. 9.
    P.N Belhumeur, D.W. Jacobs, D Kriegman, N. Kumar, Localizing parts of faces using a consensus of exemplars, in Computer Vision and Pattern Recognition (CVPR) (2011), pp. 545–552Google Scholar
  10. 10.
    Y.Y. Boykov, M.P. Jolly, Interactive graph cuts for optimal boundary and region segmentation of objects in N-D images, in Proceedings of the IEEE International Conference on Computer Vision (ICCV) (2001), pp. 105–112Google Scholar
  11. 11.
    C. Rother, V. Kolmogorov, A. Blake, “GrabCut”: interactive foreground extraction using iterated graph cuts, in Proceedings of ACM Transactions on Graphics (SIGGRAPH) (2004), pp. 309–314Google Scholar
  12. 12.
    L. Grady, Random walks for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI) 28(11), 1768–1783 (2006)Google Scholar
  13. 13.
    P. Viola, M. Jones, Robust real-time face detection. Int. J. Comput. Vis. 57, 137–154 (2004)CrossRefGoogle Scholar
  14. 14.
    C. Zhang, Z. Zhang, A survey of recent advances in face detection. Tech. Rep. MSR-TR-2010-66, June 2010Google Scholar
  15. 15.
    J. Li, Y. Zhang, Learning surf cascade for fast and accurate object detection, in 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2013), pp. 3468–3475Google Scholar
  16. 16.
    X. Zhu, D. Ramanan, Face detection, pose estimation, and landmark localization in the wild, in 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2012), pp. 2879–2886Google Scholar
  17. 17.
    P.F. Felzenszwalb, R.B. Girshick, D. McAllester, D. Ramanan, Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1627–1645 (2010)CrossRefGoogle Scholar
  18. 18.
    X. Yu, J. Huang, S. Zhang, W. Yan, D.N. Metaxas, Pose-free facial landmark fitting via optimized part mixtures and cascaded deformable shape model, in 2013 IEEE International Conference on Computer Vision (ICCV) (2013), pp. 1944–1951Google Scholar
  19. 19.
    D. Chen, S. Ren, Y. Wei, X. Cao, J. Sun, Joint cascade face detection and alignment, in Computer Vision ECCV 2014, eds. by D. Fleet, T. Pajdla, B. Schiele, T. Tuytelaars. Lecture Notes in Computer Science, vol. 8694 (Springer International, New York, 2014), pp. 109–122Google Scholar
  20. 20.
    G. Ghiasi, C. Fowlkes, Occlusion coherence: localizing occluded faces with a hierarchical deformable part model, in CVPR (2014)Google Scholar
  21. 21.
    M. Mathias, R. Benenson, M. Pedersoli, L. Van Gool, Face detection without bells and whistles, in Computer Vision ECCV 2014, eds. by D. Fleet, T. Pajdla, B. Schiele, T. Tuytelaars. Lecture Notes in Computer Science, vol. 8692 (Springer International, New York, 2014), pp. 720–735Google Scholar
  22. 22.
    C. Garcia, M. Delakis, Convolutional face finder: a neural architecture for fast and robust face detection. IEEE Trans. Pattern Anal. Mach. Intell. 26(11), 1408–1423 (2004)CrossRefGoogle Scholar
  23. 23.
    H. Jin, Q. Lin, H. Lu, X. Tong, Face detection using one-class SVM in color images, in Proceedings of the International Conference on Signal Processing (2004)Google Scholar
  24. 24.
    Y. LeCun, L. Bottou, Y. Bengio, P. Haffner, Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)CrossRefGoogle Scholar
  25. 25.
    T. Serre, L. Wolf, S. Bileschi, M. Riesenhuber, T. Poggio, Robust object recognition with cortex-like mechanisms. IEEE Trans. Pattern Anal. Mach. Intell. 29(3), 411–426 (2007)CrossRefGoogle Scholar
  26. 26.
    A. Krizhevsky, I. Sutskever, G.E. Hinton, Imagenet classification with deep convolutional neural networks, in NIPS (2012), pp. 1097–1105Google Scholar
  27. 27.
    X. Xiong, F. de la Torre, Supervised descent method and its applications to face alignment, in 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2013, pp. 532–539Google Scholar
  28. 28.
    R. Gross, I. Matthews, J.F. Cohn, T. Kanade, S. Baker, Multi-PIE, in Proceedings of the 8 th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008) (2008), pp. 1–8Google Scholar

Copyright information

© Springer International Publishing Switzerland 2016

Authors and Affiliations

  1. 1.CyLab Biometrics Center, Department of Electrical & Computer EngineeringCarnegie Mellon UniversityPittsburghUSA

Personalised recommendations