Skip to main content

An Optimization Strategy for Efficient Facial Landmark Detection Based on Improved Pixel-in-Pixel Net Model

  • Conference paper
  • First Online:
International Conference on Cloud Computing and Computer Networks (CCCN 2023)

Abstract

Efficient facial landmark detection has been applied to various fields, such as driverless driving, facial beautification technology, facial expression analysis, etc. However, in specific practical tasks, there are still some situations where facial expression cannot be correctly recognized or analyzed. This paper proposes an improved MobileNetV2_re method to improve the loss of accuracy of key points of the problem of pixel-in-pixel Net (PIPNet) in the existing facial landmark detection task. We use the ghost module to replace part of the inverted residual block from the original model, build a new MobileNetV2_re network, and improve the accuracy of the model. It is proved that the situation where high NME and low AUC of PIPNet in the original network MobileNetV2 can be effectively improved by comparing the tested normalized mean error (NME) and the area under the curve (AUC) value and selecting a better network. Compared with MobileNetV2, Resnet18, Resnet50, and Resnet101, NME of MobileNetV2_re in PIPNET is reduced by about 14.07%, AUC of MobileNetV2_re in PIPNET is increased by about 7.52%, and it shows higher accuracy in efficient facial landmark detection.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Jin H, Liao S, Shao L. Pixel-In-Pixel Net: Towards Efficient Facial Landmark Detection in the Wild[J]. 2020.

    Google Scholar 

  2. Kipf T N, Welling M. Semi-Supervised Classification with Graph Convolutional Networks[J]. 2016.

    Google Scholar 

  3. Li Q, Wang Y, Wang Y, et al. HDMapNet: An Online HD Map Construction and Evaluation Framework[J]. 2021.

    Google Scholar 

  4. Hou Q, Zhou D, Feng J. Coordinate Attention for Efficient Mobile Network Design[J]. 2021.

    Google Scholar 

  5. He K, Zhang X, Ren S, et al. Deep Residual Learning for Image Recognition[J]. IEEE, 2016.

    Google Scholar 

  6. Wavelet Transform Time-Frequency Image and Convolutional Network-Based Motor Imagery EEG Classification[J]. IEEE Access, 2019, 7:6084–6093.

    Google Scholar 

  7. Bae W, Yoo J, Ye J C. Beyond Deep Residual Learning for Image Restoration: Persistent Homology-Guided Manifold Simplification[J]. IEEE, 2017.

    Google Scholar 

  8. He K, Zhang X, Ren S, et al. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification[J]. IEEE Computer Society, 2015.

    Google Scholar 

  9. Howard A G, Zhu M, Chen B, et al. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications[J]. 2017.

    Google Scholar 

  10. Sandler M, Howard A, Zhu M, et al. MobileNetV2: Inverted Residuals and Linear Bottlenecks[C]// 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2018.

    Google Scholar 

  11. Li X, Ding L, Li W, et al. FPGA accelerates deep residual learning for image recognition[C]// 2017 IEEE 2nd Information Technology, Networking, Electronic and Automation Control Conference (ITNEC). IEEE, 2017.

    Google Scholar 

  12. Galdran A, Chakor H, Alrushood A A, et al. Automatic classification and triage of diabetic retinopathy from retinal images based on a convolutional neural networks (CNN) method[J]. Acta Ophthalmologica, 2019, 97.

    Google Scholar 

  13. Han K, Wang Y, Tian Q, et al. GhostNet: More Features From Cheap Operations[C]// 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2020.

    Google Scholar 

  14. Krizhevsky A, Sutskever I, Hinton G. ImageNet Classification with Deep Convolutional Neural Networks[J]. Advances in neural information processing systems, 2012, 25(2).

    Google Scholar 

  15. Guo X, Li S, Zhang J, et al. PFLD: A Practical Facial Landmark Detector[J]. 2019.

    Google Scholar 

  16. Gao P, Lu K, Xue J, et al. A Coarse-to-Fine Facial Landmark Detection Method Based on Self-attention Mechanism[J]. IEEE Transactions on Multimedia, 2020, PP(99):1–1.

    Google Scholar 

  17. Burgos-Artizzu X P, Perona P, P Dollár. Robust Face Landmark Estimation under Occlusion[C]// IEEE International Conference on Computer Vision. IEEE, 2014.

    Google Scholar 

  18. Deng Z, Li K, Zhao Q, et al. Effective face landmark localization via single deep network[J]. 2017.

    Google Scholar 

  19. Hang, Zhao, Orazio, et al. Loss Functions for Image Restoration With Neural Networks[J]. IEEE Transactions on Computational Imaging, 2017.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Li, R., Yu, Y., Yin, G. (2024). An Optimization Strategy for Efficient Facial Landmark Detection Based on Improved Pixel-in-Pixel Net Model. In: Meng, L. (eds) International Conference on Cloud Computing and Computer Networks. CCCN 2023. Signals and Communication Technology. Springer, Cham. https://doi.org/10.1007/978-3-031-47100-1_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-47100-1_3

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-47099-8

  • Online ISBN: 978-3-031-47100-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics