Learning the Face Shape Models for Facial Landmark Detection in the Wild

Wu, Yue; Ji, Qiang

doi:10.1007/978-3-319-13737-7_4


Part of the book series: Lecture Notes in Computer Science (LNIP, volume 8912)

Included in the following conference series:

International Workshop on Face and Facial Expression Recognition from Real World Videos

Abstract

Facial landmark detection in the wild is challenging because of the appearance and shape variations caused by facial expressions, head poses, illumination, and occlusion. To tackle this problem, we propose two probabilistic face shape models that capture face shape variations under different conditions. The first model is an undirected graphical model built on the Restricted Boltzmann Machine (RBM). It decouples the shape variations into expression-related and pose-related parts. The second model is a directed hierarchical probabilistic model that specifically uses the head pose and expression labels in model construction. It embeds the local shape variations of each facial component and automatically exploits the relationships among facial components, expressions, and head poses. Experiments on benchmark databases show the effectiveness of the proposed probabilistic face shape models for facial landmark detection in the wild.
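
For readers unfamiliar with the RBM mentioned above, the following is a minimal sketch of a generic textbook binary RBM trained with one-step contrastive divergence (CD-1). It is not the authors' face shape model: the abstract does not specify the formulation, and a model of landmark coordinates would need continuous visible units rather than the binary ones assumed here. The class name `BinaryRBM`, the helper `train_toy_rbm`, the toy data, and all hyperparameters are hypothetical and chosen only for illustration.

```python
import numpy as np


def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))


class BinaryRBM:
    """Generic binary RBM (illustrative only, not the chapter's model)."""

    def __init__(self, n_visible, n_hidden, seed=0):
        self.rng = np.random.default_rng(seed)
        self.W = 0.01 * self.rng.standard_normal((n_visible, n_hidden))
        self.b = np.zeros(n_visible)  # visible biases
        self.c = np.zeros(n_hidden)   # hidden biases

    def hidden_probs(self, v):
        # p(h_j = 1 | v)
        return sigmoid(v @ self.W + self.c)

    def visible_probs(self, h):
        # p(v_i = 1 | h)
        return sigmoid(h @ self.W.T + self.b)

    def cd1_step(self, v0, lr=0.1):
        """One CD-1 update on a batch; returns mean squared reconstruction error."""
        # Positive phase: hidden activations driven by the data.
        ph0 = self.hidden_probs(v0)
        h0 = (self.rng.random(ph0.shape) < ph0).astype(float)
        # Negative phase: one Gibbs step back to the visibles and up again.
        pv1 = self.visible_probs(h0)
        ph1 = self.hidden_probs(pv1)
        n = v0.shape[0]
        self.W += lr * (v0.T @ ph0 - pv1.T @ ph1) / n
        self.b += lr * (v0 - pv1).mean(axis=0)
        self.c += lr * (ph0 - ph1).mean(axis=0)
        return float(np.mean((v0 - pv1) ** 2))


def train_toy_rbm(epochs=200):
    # Hypothetical toy data: noisy-free copies of two binary prototypes.
    rng = np.random.default_rng(1)
    protos = np.array([[1, 1, 1, 0, 0, 0],
                       [0, 0, 0, 1, 1, 1]], dtype=float)
    data = protos[rng.integers(0, 2, size=64)]
    rbm = BinaryRBM(n_visible=6, n_hidden=4)
    errors = [rbm.cd1_step(data) for _ in range(epochs)]
    return rbm, errors
```

Calling `train_toy_rbm()` returns the trained model and the per-epoch reconstruction errors. The hidden units play the role that the abstract assigns to latent factors of shape variation, although how the chapter separates expression-related from pose-related parts is not described on this page.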

Author information

Authors and Affiliations

Department of ECSE, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA
Yue Wu & Qiang Ji

Corresponding author

Correspondence to Qiang Ji.

Editor information

Editors and Affiliations

Computer and Systems Engineering, Rensselaer Polytechnic Institute, Troy, New York, USA
Qiang Ji
Department of Architecture, Design, and Media Technology, Aalborg, Denmark
Thomas B. Moeslund
Department of Computer Science, Stevens Institute of Technology, Hoboken, New Jersey, USA
Gang Hua
Department of Architecture, Design, and Media Technology, Aalborg, Denmark
Kamal Nasrollahi

About this paper

Cite this paper

Wu, Y., Ji, Q. (2015). Learning the Face Shape Models for Facial Landmark Detection in the Wild. In: Ji, Q., Moeslund, T.B., Hua, G., Nasrollahi, K. (eds) Face and Facial Expression Recognition from Real World Videos. FFER 2014. Lecture Notes in Computer Science, vol 8912. Springer, Cham. https://doi.org/10.1007/978-3-319-13737-7_4

DOI: https://doi.org/10.1007/978-3-319-13737-7_4
Published: 19 March 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13736-0
Online ISBN: 978-3-319-13737-7
eBook Packages: Computer Science, Computer Science (R0)