Cascaded Regressions of Learning Features for Face Alignment

Tran, Ngoc-Trung; Ababsa, Fakhreddine; Fredj, Sarra Ben; Charbit, Maurice

doi:10.1007/978-3-319-25903-1_61

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 9386)

Included in the following conference series:

International Conference on Advanced Concepts for Intelligent Vision Systems

Abstract

Face alignment is a fundamental problem in computer vision: localizing facial landmarks, such as points on the eyes, nose, and mouth, in 2D images. Our method for face alignment integrates three aspects not seen in previous approaches. First, it learns local descriptors with a Restricted Boltzmann Machine (RBM) to model the local appearance of each facial point independently. Second, it applies a coarse-to-fine regression to localize the landmarks after the shape configuration has been estimated via global regression. Third, it uses synthetic data for training, which lets the approach work better on profile views and avoids the need for additional training annotations. Our results on challenging datasets compare favorably with state-of-the-art results. Adding synthetic data allows our method to yield good results in profile alignment, which highlights the potential of synthetic data for in-the-wild face alignment.
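
The abstract does not give the regression equations, so the following is only a minimal sketch of the generic cascaded-regression idea it builds on, not the authors' implementation. It assumes a toy setting in which each "image" is simply its hidden landmark vector, and a hypothetical `sample_features` function (a saturating `tanh` of the offset) stands in for the learned RBM descriptors. Each stage fits a ridge regressor from the current features to the remaining landmark offset, then updates the shape estimate. All names, dimensions, and the data itself are assumptions made for illustration.

```python
import numpy as np

def sample_features(images, shapes):
    # Hypothetical stand-in for learned local descriptors: in this toy setup
    # an "image" is its hidden landmark vector, and the descriptor saturates
    # as the current estimate moves away from the true landmark.
    return np.tanh(images - shapes)

def add_bias(X):
    # Append a constant column so each stage can learn an offset term.
    return np.hstack([X, np.ones((X.shape[0], 1))])

def fit_ridge(X, Y, lam=1e-3):
    # Closed-form ridge regression: W = (X^T X + lam * I)^-1 X^T Y.
    Xb = add_bias(X)
    return np.linalg.solve(Xb.T @ Xb + lam * np.eye(Xb.shape[1]), Xb.T @ Y)

def rmse(shapes, targets):
    return float(np.sqrt(np.mean((shapes - targets) ** 2)))

def train_cascade(images, targets, init_shapes, n_stages=4):
    # Each stage regresses the remaining offset from features sampled
    # at the current shape estimate, then applies the predicted update.
    shapes, stages = init_shapes.copy(), []
    errors = [rmse(shapes, targets)]
    for _ in range(n_stages):
        X = sample_features(images, shapes)
        W = fit_ridge(X, targets - shapes)
        shapes = shapes + add_bias(X) @ W
        stages.append(W)
        errors.append(rmse(shapes, targets))
    return stages, errors

def run_cascade(images, init_shapes, stages):
    # Apply the learned stages in order, starting from the initial shape.
    shapes = init_shapes.copy()
    for W in stages:
        shapes = shapes + add_bias(sample_features(images, shapes)) @ W
    return shapes

# Synthetic toy data (assumed): 5 landmarks with (x, y) coordinates each.
rng = np.random.default_rng(0)
n_coords = 10
mean_shape = rng.normal(size=n_coords)
train_targets = mean_shape + rng.normal(size=(500, n_coords))
test_targets = mean_shape + rng.normal(size=(200, n_coords))

# Every sample starts from the mean shape, as is common in cascaded regression.
train_init = np.tile(mean_shape, (500, 1))
test_init = np.tile(mean_shape, (200, 1))

stages, train_errors = train_cascade(train_targets, train_targets, train_init)
test_error_before = rmse(test_init, test_targets)
test_error_after = rmse(run_cascade(test_targets, test_init, stages), test_targets)
```

In this sketch `train_errors` records the training RMSE before the first stage and after each subsequent one, and `test_error_before` and `test_error_after` compare the mean-shape initialization with the cascade output on held-out samples. A real system would replace `sample_features` with descriptors extracted from image patches around each landmark estimate.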

Author information

Authors and Affiliations

LTCI-CNRS, Telecom ParisTECH, 37-39, Rue Dareau, 75014, Paris, France
Ngoc-Trung Tran & Maurice Charbit
IBISC, University of Evry, 40, Rue du Pelvoux, 91020, Evry, France
Ngoc-Trung Tran & Fakhreddine Ababsa
UFE, Paris Observatory, 5, Place Jules Janssen, 92195, Meudon, France
Sarra Ben Fredj

Corresponding author

Correspondence to Ngoc-Trung Tran.

Editor information

Editors and Affiliations

Dipartimento di Matematica e Informatica, Università di Catania, Catania, Italy
Sebastiano Battiato
Arcueil CX, France
Jacques Blanc-Talon
Catania, Italy
Giovanni Gallo
Gent, Belgium
Wilfried Philips
CSIRO, Sydney, New South Wales, Australia
Dan Popescu
Vision Lab., University of Antwerp, Antwerpen, Belgium
Paul Scheunders

About this paper

Cite this paper

Tran, NT., Ababsa, F., Fredj, S.B., Charbit, M. (2015). Cascaded Regressions of Learning Features for Face Alignment. In: Battiato, S., Blanc-Talon, J., Gallo, G., Philips, W., Popescu, D., Scheunders, P. (eds) Advanced Concepts for Intelligent Vision Systems. ACIVS 2015. Lecture Notes in Computer Science, vol 9386. Springer, Cham. https://doi.org/10.1007/978-3-319-25903-1_61

DOI: https://doi.org/10.1007/978-3-319-25903-1_61
Published: 06 November 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25902-4
Online ISBN: 978-3-319-25903-1
eBook Packages: Computer Science, Computer Science (R0)