An Optimization Strategy for Efficient Facial Landmark Detection Based on Improved Pixel-in-Pixel Net Model

Li, Renhao; Yu, Yanan; Yin, Guanghua

doi:10.1007/978-3-031-47100-1_3

Part of the book series: Signals and Communication Technology ((SCT))

Included in the following conference series:

International Conference on Cloud Computing and Computer Networks

65 Accesses

Abstract

Efficient facial landmark detection has been applied to various fields, such as driverless driving, facial beautification technology, facial expression analysis, etc. However, in specific practical tasks, there are still some situations where facial expression cannot be correctly recognized or analyzed. This paper proposes an improved MobileNetV2_re method to improve the loss of accuracy of key points of the problem of pixel-in-pixel Net (PIPNet) in the existing facial landmark detection task. We use the ghost module to replace part of the inverted residual block from the original model, build a new MobileNetV2_re network, and improve the accuracy of the model. It is proved that the situation where high NME and low AUC of PIPNet in the original network MobileNetV2 can be effectively improved by comparing the tested normalized mean error (NME) and the area under the curve (AUC) value and selecting a better network. Compared with MobileNetV2, Resnet18, Resnet50, and Resnet101, NME of MobileNetV2_re in PIPNET is reduced by about 14.07%, AUC of MobileNetV2_re in PIPNET is increased by about 7.52%, and it shows higher accuracy in efficient facial landmark detection.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Jin H, Liao S, Shao L. Pixel-In-Pixel Net: Towards Efficient Facial Landmark Detection in the Wild[J]. 2020.
Google Scholar
Kipf T N, Welling M. Semi-Supervised Classification with Graph Convolutional Networks[J]. 2016.
Google Scholar
Li Q, Wang Y, Wang Y, et al. HDMapNet: An Online HD Map Construction and Evaluation Framework[J]. 2021.
Google Scholar
Hou Q, Zhou D, Feng J. Coordinate Attention for Efficient Mobile Network Design[J]. 2021.
Google Scholar
He K, Zhang X, Ren S, et al. Deep Residual Learning for Image Recognition[J]. IEEE, 2016.
Google Scholar
Wavelet Transform Time-Frequency Image and Convolutional Network-Based Motor Imagery EEG Classification[J]. IEEE Access, 2019, 7:6084–6093.
Google Scholar
Bae W, Yoo J, Ye J C. Beyond Deep Residual Learning for Image Restoration: Persistent Homology-Guided Manifold Simplification[J]. IEEE, 2017.
Google Scholar
He K, Zhang X, Ren S, et al. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification[J]. IEEE Computer Society, 2015.
Google Scholar
Howard A G, Zhu M, Chen B, et al. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications[J]. 2017.
Google Scholar
Sandler M, Howard A, Zhu M, et al. MobileNetV2: Inverted Residuals and Linear Bottlenecks[C]// 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2018.
Google Scholar
Li X, Ding L, Li W, et al. FPGA accelerates deep residual learning for image recognition[C]// 2017 IEEE 2nd Information Technology, Networking, Electronic and Automation Control Conference (ITNEC). IEEE, 2017.
Google Scholar
Galdran A, Chakor H, Alrushood A A, et al. Automatic classification and triage of diabetic retinopathy from retinal images based on a convolutional neural networks (CNN) method[J]. Acta Ophthalmologica, 2019, 97.
Google Scholar
Han K, Wang Y, Tian Q, et al. GhostNet: More Features From Cheap Operations[C]// 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2020.
Google Scholar
Krizhevsky A, Sutskever I, Hinton G. ImageNet Classification with Deep Convolutional Neural Networks[J]. Advances in neural information processing systems, 2012, 25(2).
Google Scholar
Guo X, Li S, Zhang J, et al. PFLD: A Practical Facial Landmark Detector[J]. 2019.
Google Scholar
Gao P, Lu K, Xue J, et al. A Coarse-to-Fine Facial Landmark Detection Method Based on Self-attention Mechanism[J]. IEEE Transactions on Multimedia, 2020, PP(99):1–1.
Google Scholar
Burgos-Artizzu X P, Perona P, P Dollár. Robust Face Landmark Estimation under Occlusion[C]// IEEE International Conference on Computer Vision. IEEE, 2014.
Google Scholar
Deng Z, Li K, Zhao Q, et al. Effective face landmark localization via single deep network[J]. 2017.
Google Scholar
Hang, Zhao, Orazio, et al. Loss Functions for Image Restoration With Neural Networks[J]. IEEE Transactions on Computational Imaging, 2017.
Google Scholar

Download references

Author information

Authors and Affiliations

School of Information Technology and Engineering, Tianjin University of Technology and Education, Tianjin, China
Renhao Li, Yanan Yu & Guanghua Yin

Authors

Renhao Li
View author publications
You can also search for this author in PubMed Google Scholar
Yanan Yu
View author publications
You can also search for this author in PubMed Google Scholar
Guanghua Yin
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Software, Shangdong University, Jinan, China
Lei Meng

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, R., Yu, Y., Yin, G. (2024). An Optimization Strategy for Efficient Facial Landmark Detection Based on Improved Pixel-in-Pixel Net Model. In: Meng, L. (eds) International Conference on Cloud Computing and Computer Networks. CCCN 2023. Signals and Communication Technology. Springer, Cham. https://doi.org/10.1007/978-3-031-47100-1_3

Download citation

DOI: https://doi.org/10.1007/978-3-031-47100-1_3
Published: 24 January 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-47099-8
Online ISBN: 978-3-031-47100-1
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics