Facial Landmark Detection Under Large Pose

Hao, Yangyang; Zhu, Hengliang; Shao, Zhiwen; Tan, Xin; Ma, Lizhuang

doi:10.1007/978-3-030-04212-7_60


Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 11304)

Included in the following conference series:

International Conference on Neural Information Processing


Abstract

Facial landmark detection is a necessary step in many vision tasks, and many effective methods have been proposed for it. However, under large pose and complex expression, the accuracy of these methods usually degrades. In this paper, we propose a two-stage cascade regression framework that uses patch-difference features to address this problem. In the first stage, we apply the patch-difference feature to the classical shape regression model and augment the large-pose training samples, so that salient landmarks (eye centers, nose, mouth corners) can be located precisely. In the second stage, we apply an enhanced feature section constraint to the patch-difference feature to achieve multi-landmark detection. Experimental results show that our algorithm significantly improves on the classical shape regression method and achieves superior results on the COFW dataset.
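The abstract does not define the patch-difference feature or the regressors, so the following is only a minimal sketch of the general idea under stated assumptions: a feature is taken to be the difference between the mean intensities of two small patches placed relative to the current landmark estimates, and each cascade stage is taken to be a linear map from those features to a shape increment. The names `patch_mean`, `patch_difference_features`, and `cascade_predict`, the `pairs` layout, and the linear stages `(W, b)` are hypothetical and are not taken from the paper.

```python
import numpy as np

def patch_mean(image, center, half):
    """Mean intensity of a square patch around (x, y), clipped to the image.
    Returns 0.0 when the patch lies entirely outside the image."""
    h, w = image.shape
    x, y = int(round(center[0])), int(round(center[1]))
    x0, x1 = max(x - half, 0), min(x + half + 1, w)
    y0, y1 = max(y - half, 0), min(y + half + 1, h)
    if x0 >= x1 or y0 >= y1:
        return 0.0
    return float(image[y0:y1, x0:x1].mean())

def patch_difference_features(image, shape, pairs, half=2):
    """Assumed feature: difference of two patch means, each patch indexed by
    a landmark of the current shape plus a fixed (dx, dy) offset."""
    feats = []
    for (i, offset_i), (j, offset_j) in pairs:
        a = patch_mean(image, shape[i] + offset_i, half)
        b = patch_mean(image, shape[j] + offset_j, half)
        feats.append(a - b)
    return np.array(feats)

def cascade_predict(image, init_shape, stages, pairs, half=2):
    """Generic cascade update S <- S + R_t(features(I, S)).
    Each stage is an assumed linear regressor (W, b), not the paper's model."""
    shape = init_shape.astype(float).copy()
    for W, b in stages:
        phi = patch_difference_features(image, shape, pairs, half)
        shape = shape + (W @ phi + b).reshape(shape.shape)
    return shape

# Toy usage: left half of the image is dark, right half is bright.
image = np.zeros((20, 20))
image[:, 10:] = 1.0
init_shape = np.array([[5.0, 5.0]])  # one landmark, stored as (x, y)
pairs = [((0, np.array([0.0, 0.0])), (0, np.array([10.0, 0.0])))]
stages = [(np.zeros((2, 1)), np.array([1.0, 2.0]))]  # one stage, bias only
features = patch_difference_features(image, init_shape, pairs)
refined = cascade_predict(image, init_shape, stages, pairs)
```

In a two-stage arrangement like the one described above, the first cascade would be run on a small set of salient landmarks and its output used to initialize a second cascade over the full landmark set; how the paper couples the stages is not stated in the abstract.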



Acknowledgments

This work is supported by the National Natural Science Foundation of China (No. 61472245) and the Science and Technology Commission of Shanghai Municipality Program (No. 16511101300).

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai, China
Yangyang Hao, Hengliang Zhu, Zhiwen Shao, Xin Tan & Lizhuang Ma
Department of Computer Science and Software Engineering, East China Normal University, Shanghai, China
Lizhuang Ma


Corresponding author

Correspondence to Lizhuang Ma.

Editor information

Editors and Affiliations

The Chinese Academy of Sciences, Beijing, China
Long Cheng
City University of Hong Kong, Kowloon, Hong Kong
Andrew Chi Sing Leung
Kobe University, Kobe, Japan
Seiichi Ozawa


Cite this paper

Hao, Y., Zhu, H., Shao, Z., Tan, X., Ma, L. (2018). Facial Landmark Detection Under Large Pose. In: Cheng, L., Leung, A., Ozawa, S. (eds) Neural Information Processing. ICONIP 2018. Lecture Notes in Computer Science, vol 11304. Springer, Cham. https://doi.org/10.1007/978-3-030-04212-7_60


DOI: https://doi.org/10.1007/978-3-030-04212-7_60
Published: 17 November 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-04211-0
Online ISBN: 978-3-030-04212-7
eBook Packages: Computer Science, Computer Science (R0)
