Abstract
Efficient facial landmark detection has been applied to various fields, such as driverless driving, facial beautification technology, facial expression analysis, etc. However, in specific practical tasks, there are still some situations where facial expression cannot be correctly recognized or analyzed. This paper proposes an improved MobileNetV2_re method to improve the loss of accuracy of key points of the problem of pixel-in-pixel Net (PIPNet) in the existing facial landmark detection task. We use the ghost module to replace part of the inverted residual block from the original model, build a new MobileNetV2_re network, and improve the accuracy of the model. It is proved that the situation where high NME and low AUC of PIPNet in the original network MobileNetV2 can be effectively improved by comparing the tested normalized mean error (NME) and the area under the curve (AUC) value and selecting a better network. Compared with MobileNetV2, Resnet18, Resnet50, and Resnet101, NME of MobileNetV2_re in PIPNET is reduced by about 14.07%, AUC of MobileNetV2_re in PIPNET is increased by about 7.52%, and it shows higher accuracy in efficient facial landmark detection.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Jin H, Liao S, Shao L. Pixel-In-Pixel Net: Towards Efficient Facial Landmark Detection in the Wild[J]. 2020.
Kipf T N, Welling M. Semi-Supervised Classification with Graph Convolutional Networks[J]. 2016.
Li Q, Wang Y, Wang Y, et al. HDMapNet: An Online HD Map Construction and Evaluation Framework[J]. 2021.
Hou Q, Zhou D, Feng J. Coordinate Attention for Efficient Mobile Network Design[J]. 2021.
He K, Zhang X, Ren S, et al. Deep Residual Learning for Image Recognition[J]. IEEE, 2016.
Wavelet Transform Time-Frequency Image and Convolutional Network-Based Motor Imagery EEG Classification[J]. IEEE Access, 2019, 7:6084–6093.
Bae W, Yoo J, Ye J C. Beyond Deep Residual Learning for Image Restoration: Persistent Homology-Guided Manifold Simplification[J]. IEEE, 2017.
He K, Zhang X, Ren S, et al. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification[J]. IEEE Computer Society, 2015.
Howard A G, Zhu M, Chen B, et al. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications[J]. 2017.
Sandler M, Howard A, Zhu M, et al. MobileNetV2: Inverted Residuals and Linear Bottlenecks[C]// 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2018.
Li X, Ding L, Li W, et al. FPGA accelerates deep residual learning for image recognition[C]// 2017 IEEE 2nd Information Technology, Networking, Electronic and Automation Control Conference (ITNEC). IEEE, 2017.
Galdran A, Chakor H, Alrushood A A, et al. Automatic classification and triage of diabetic retinopathy from retinal images based on a convolutional neural networks (CNN) method[J]. Acta Ophthalmologica, 2019, 97.
Han K, Wang Y, Tian Q, et al. GhostNet: More Features From Cheap Operations[C]// 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2020.
Krizhevsky A, Sutskever I, Hinton G. ImageNet Classification with Deep Convolutional Neural Networks[J]. Advances in neural information processing systems, 2012, 25(2).
Guo X, Li S, Zhang J, et al. PFLD: A Practical Facial Landmark Detector[J]. 2019.
Gao P, Lu K, Xue J, et al. A Coarse-to-Fine Facial Landmark Detection Method Based on Self-attention Mechanism[J]. IEEE Transactions on Multimedia, 2020, PP(99):1–1.
Burgos-Artizzu X P, Perona P, P Dollár. Robust Face Landmark Estimation under Occlusion[C]// IEEE International Conference on Computer Vision. IEEE, 2014.
Deng Z, Li K, Zhao Q, et al. Effective face landmark localization via single deep network[J]. 2017.
Hang, Zhao, Orazio, et al. Loss Functions for Image Restoration With Neural Networks[J]. IEEE Transactions on Computational Imaging, 2017.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Li, R., Yu, Y., Yin, G. (2024). An Optimization Strategy for Efficient Facial Landmark Detection Based on Improved Pixel-in-Pixel Net Model. In: Meng, L. (eds) International Conference on Cloud Computing and Computer Networks. CCCN 2023. Signals and Communication Technology. Springer, Cham. https://doi.org/10.1007/978-3-031-47100-1_3
Download citation
DOI: https://doi.org/10.1007/978-3-031-47100-1_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-47099-8
Online ISBN: 978-3-031-47100-1
eBook Packages: EngineeringEngineering (R0)