Enhancing facial geometry analysis by DeepFaceLandmark leveraging ResNet101 and transfer learning

Sharma, Nand Kishore; Rahamatkar, Surendra; Rathore, Abhishek Singh

doi:10.1007/s41870-024-01872-4

Enhancing facial geometry analysis by DeepFaceLandmark leveraging ResNet101 and transfer learning

Original Research
Published: 12 May 2024

(2024)
Cite this article

International Journal of Information Technology Aims and scope Submit manuscript

Nand Kishore Sharma ORCID: orcid.org/0000-0001-8260-1739¹,
Surendra Rahamatkar³ &
Abhishek Singh Rathore²

8 Accesses
Explore all metrics

Abstract

The face recognition is the pivotal component in the surveillance system. To comprehensively assess facial structures, acquiring facial features becomes indispensable. This information is effectively represented by facial landmarks, which serve as specific locations of key characteristics. This manuscript proposes a novel DeepFaceLandmark algorithm by incorporating a transfer learning approach with the ResNet architecture on the Dlib iBUG 300-W dataset to detect the coordinates of 68 facial landmarks. This proposed work investigates the rationale behind encompassing DeepFaceLandmark in conjunction with facial feature extraction and face geometry analysis. The proposed work detects facial landmarks in challenging situations such as illumination, expression, head-pose, and occlusion to extract the facial features. In addition to these challenges, the proposed work also detected the facial landmarks on the face which are occluded via glass and partially visible due to the impact of light on glass. Furthermore, the author also specifically delves into elucidating the categories of face landmarks, detailing the employed algorithms, and evaluating their performance. Performance evaluation resulted in an impressive accuracy of 98.76%. The recorded training loss was 0.0003, with a validation loss of 0.0007 for the train-test split, and a minimum loss of 0.0006. Furthermore, the analysis extends its performance evaluation by conducting comparisons with other comparable state-of-the-art techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

SSD: Single Shot MultiBox Detector

Object detection using YOLO: challenges, architectural successors, datasets and applications

Article 08 August 2022

YOLO-based Object Detection Models: A Review and its Applications

Article 14 March 2024

Notes

The dataset is publicly available and can be downloaded via http://dlib.net/files/data/ibug_300W_large_face_landmark_dataset.tar.gz.

References

Murphy-Chutorian E, Trivedi MM (2009) Head pose estimation in computer vision: a survey. IEEE Trans Pattern Anal Mach Intell 31(4):607–626. https://doi.org/10.1109/TPAMI.2008.106
Article Google Scholar
Mallat K, Dugelay JL, Križaj J, Peer P, Štruc V, Dobrišek S, Chateau T (2019) A Survey on Transfer Learning. IEEE Trans Pattern Anal Mach Intell 22(1): 681–685. https://doi.org/10.1109/34.927467
Prakash SR, Singh PN (2023) Background region based Face orientation prediction through HSV skin color model and K-Means clustering. Int J Inf Technol 15(3):1275–1288. https://doi.org/10.1007/s41870-023-01174-1
Article Google Scholar
Wu Y, Ji Q (2019) Facial landmark detection: a literature survey. Int J Comput Vision 127(2):115–142. https://doi.org/10.1007/s11263-018-1097-z
Article Google Scholar
Zhang H, Li Q, Sun Z, Liu Y (2018) Combining data-driven and model-driven methods for robust facial landmark detection. IEEE Trans Inf Forensics Secur 13(10):2409–2422. https://doi.org/10.1109/TIFS.2018.2800901
Article Google Scholar
Jindal A, Priya DR (2019) Landmark points detection in case of human facial tracking and detection. Int J Eng Adv Technol 9(2): 3769–3776. https://doi.org/10.35940/ijeat.b3367.129219
Wu Y, Hassner T, Kim K, Medioni G, Natarajan P (2018) Facial landmark detection with tweaked convolutional neural networks. IEEE Trans Pattern Anal Mach Intell 40(12):3067–3074. https://doi.org/10.1109/TPAMI.2017.2787130
Article Google Scholar
Gondhi NK, Kour N, Effendi S, Kaushik K (2018) An efficient algorithm for facial landmark detection using haar-like features coupled with corner detection following anthropometric constraints. 2nd International Conference on Telecommunication and Networks, TEL-NET 2017, 2018-Janua, 1–6. https://doi.org/10.1109/TEL-NET.2017.8343517
Wu T, Turaga P, Chellappa R (2012) Age estimation and face verification across aging using landmarks. IEEE Trans Inf Forensics Secur 7(6):1780–1788. https://doi.org/10.1109/TIFS.2012.2213812
Article Google Scholar
Devries T, Biswaranjan K, Taylor GW (2014) Multi-task learning of facial landmarks and expression. Proc Conf Comput Robot Vision CRV 2014:98–103. https://doi.org/10.1109/CRV.2014.21
Article Google Scholar
Bodini M (2019) A review of facial landmark extraction in 2D images and videos using deep learning. Big Data Cognitive Computing 3(1):1–14. https://doi.org/10.3390/bdcc3010014
Article MathSciNet Google Scholar
Kim T, Mok J, Lee E (2021) Detecting facial region and landmarks at once via deep network. Sensors, 21(16). https://doi.org/10.3390/s21165360
Sagonas C, Antonakos E, Tzimiropoulos G, Zafeiriou S, Pantic M (2016) 300 faces in-the-wild challenge: database and results. Image Vis Comput 47:3–18. https://doi.org/10.1016/j.imavis.2016.01.002
Article Google Scholar
Zane E, Yang Z, Pozzan L, Guha T, Narayanan S, Grossman RB (2019) Motion-capture patterns of voluntarily mimicked dynamic facial expressions in children and adolescents with and without ASD. J Autism Dev Disord 49(3):1062–1079. https://doi.org/10.1007/s10803-018-3811-7
Article Google Scholar
Xing J, Niu Z, Huang J, Hu W, Zhou X, Yan S (2018) Towards robust and accurate multi-view and partially-occluded face alignment. IEEE Trans Pattern Anal Mach Intell 40(4):987–1001. https://doi.org/10.1109/TPAMI.2017.2697958
Article Google Scholar
Chavan SA, Chaudhari NM, Ramteke RJ, Pawar UB (2023) Mathematical analysis behind occlusion handling in image with deep learning. Int J Inf Technol 15(7):3741–3749. https://doi.org/10.1007/s41870-023-01408-2
Article Google Scholar
Cootes TF, Taylor CJ, Cooper DH, Graham J (1995) Active shape models-their training and application. Comput Vis Image Understanding, 61(1): 38–59. https://doi.org/10.1006/cviu.1995.1004
Cristinacce D, Cootes T (2007) Boosted regression active shape models. BMVC 2007—Proceedings of the British Machine Vision Conference 2007. https://doi.org/10.5244/C.21.79
Cootes TF, Edwards GJ, Taylor C J (2001) Active Appearance and Models tracking a single deforming object we match a model which can fit and a whole class of objects. Ieee Transactions on Pattern Analysis and Machine Intelligence, Vol. 23, No. 6, June, 23(6), 681–685.
Tzimiropoulos, G., & Pantic, M. (2013). Optimization problems for fast AAM fitting in-the-wild. Proceedings of the IEEE International Conference on Computer Vision, 593–600. https://doi.org/10.1109/ICCV.2013.79
Mallat K, Dugelay JL (2020) Facial landmark detection on thermal data via fully annotated visible-to-thermal data synthesis. IJCB 2020 - IEEE/IAPR International Joint Conference on Biometrics. https://doi.org/10.1109/IJCB48548.2020.9304854
Alabort-I-Medina J, Zafeiriou S (2014) Bayesian active appearance models. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 3438–3445. https://doi.org/10.1109/CVPR.2014.439
Wang X, Fu T, Liao S, Wang S, Lei Z, Mei T (2020) Exclusivity-consistency regularized knowledge distillation for face recognition. lecture notes in computer science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 12369 LNCS, 325–342. https://doi.org/10.1007/978-3-030-58586-0_20
Shin S, Lee J, Lee J, Yu Y, Lee K (2022) Teaching where to look: attention similarity knowledge distillation for low resolution face recognition. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 13672 LNCS, 631–647. https://doi.org/10.1007/978-3-031-19775-8_37
Yang C, An Z, Zhou H, Cai L, Zhi X, Wu J, Zhang Q (2022) MixSKD: Self-Knowledge Distillation from Mixup for Image Recognition. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 13684 LNCS, 534–551. https://doi.org/10.1007/978-3-031-20053-3_31
Liu J, Qin H, Wu Y, Guo J, Liang D, Xu K (2022) CoupleFace: Relation Matters for Face Recognition Distillation. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 13672 LNCS, 683–700. https://doi.org/10.1007/978-3-031-19775-8_40
Wen Y, Zhang K, B, Z. L., & Qiao, Y. (2016) A Discriminative Feature Learning Approach. Eccv 1:499–515
Google Scholar
Ling H, Wu J, Huang J, Chen J, Li P (2020) Attention-based convolutional neural network for deep face recognition. Multimedia Tools Appl 79(9–10):5595–5616. https://doi.org/10.1007/s11042-019-08422-2
Article Google Scholar
Ge H, Zhu Z, Dai Y, Wang B, Wu X (2022) Facial expression recognition based on deep learning. Comput Methods Programs Biomed 215:106621. https://doi.org/10.1016/j.cmpb.2022.106621
Article Google Scholar
Deng J, Guo J, Yang J, Xue N, Kotsia I, Zafeiriou S (2022) ArcFace: Additive Angular Margin Loss for Deep Face Recognition. IEEE Trans Pattern Anal Mach Intell 44(10):5962–5979. https://doi.org/10.1109/TPAMI.2021.3087709
Article Google Scholar
Chaurasiya R, Ganotra D (2023) Deep dilated CNN based image denoising. Int J Inf Technol 15(1):137–148. https://doi.org/10.1007/s41870-022-01125-2
Article Google Scholar
Chim S, Lee JG, Park HH (2019) Dilated skip convolution for facial landmark detection. Sensors (Switzerland) 19(24):1–21. https://doi.org/10.3390/s19245350
Article Google Scholar
Gangonda SS, Patavardhan PP, Karande KJ (2022) VGHN: variations aware geometric moments and histogram features normalization for robust uncontrolled face recognition. Int J Inf Technol 14(4):1823–1834. https://doi.org/10.1007/s41870-021-00703-0
Article Google Scholar
Hamid Y, Elyassami S, Gulzar Y, Balasaraswathi VR, Habuza T, Wani S (2023) An improvised CNN model for fake image detection. Int J Inf Technol 15(1):5–15. https://doi.org/10.1007/s41870-022-01130-5
Article Google Scholar
Shukla AK, Shukla A, Singh R (2023) Automatic attendance system based on CNN–LSTM and face recognition. Int J Inf Technol. https://doi.org/10.1007/s41870-023-01495-1
Article Google Scholar
Karanwal S (2023) Improved local descriptor (ILD): a novel fusion method in face recognition. Int J Inf Technol 15(4):1885–1894. https://doi.org/10.1007/s41870-023-01245-3
Article Google Scholar
Gokulakrishnan S, Chakrabarti P, Hung BT, Shankar SS (2023) An optimized facial recognition model for identifying criminal activities using deep learning strategy. Int J Inf Technol 15(7):3907–3921. https://doi.org/10.1007/s41870-023-01420-6
Article Google Scholar
Yadav R, Priyanka, Kacker P (2023) AutoMEDSys: automatic facial micro-expression detection system using random fourier features based neural network. Int J Inform Technol. https://doi.org/10.1007/s41870-023-01662-4
Revina IM, Emmanuel WRS (2021) A survey on human face expression recognition techniques. J King Saud Univ—Comput Inform Sci 33(6):619–628. https://doi.org/10.1016/j.jksuci.2018.09.002
Article Google Scholar
Li J, Jin K, Zhou D, Kubota N, Ju Z (2020) Attention mechanism-based CNN for facial expression recognition. Neurocomputing 411:340–350. https://doi.org/10.1016/j.neucom.2020.06.014
Article Google Scholar
Chen J, Lv Y, Xu R, Xu C (2019) Automatic social signal analysis: Facial expression recognition using difference convolution neural network. J Parallel Distributed Comput 131:97–102. https://doi.org/10.1016/j.jpdc.2019.04.017
Article Google Scholar
Ji Y, Hu Y, Yang Y, Shen F, Shen HT (2019) Cross-domain facial expression recognition via an intra-category common feature and inter-category Distinction feature fusion network. Neurocomputing 333:231–239. https://doi.org/10.1016/j.neucom.2018.12.037
Article Google Scholar
Zhang H, Huang B, Tian G (2020) Facial expression recognition based on deep convolution long short-term memory networks of double-channel weighted mixture. Pattern Recogn Lett 131:128–134. https://doi.org/10.1016/j.patrec.2019.12.013
Article Google Scholar
Jain DK, Zhang Z, Huang K (2020) Multi angle optimal pattern-based deep learning for automatic facial expression recognition. Pattern Recogn Lett 139:157–165. https://doi.org/10.1016/j.patrec.2017.06.025
Article Google Scholar
Wang Z, Zeng F, Liu S, Zeng B (2021) OAENet: Oriented attention ensemble for accurate facial expression recognition. Pattern Recogn 112:107694. https://doi.org/10.1016/j.patcog.2020.107694
Article Google Scholar
Xie, W., Jia, X., Shen, L., & Yang, M. (2019). Sparse deep feature learning for facial expression recognition. Pattern Recognition, 96. https://doi.org/10.1016/j.patcog.2019.106966
Zheng X, Guo Y, Huang H, Li Y, He R (2020) A survey of deep facial attribute analysis. Int J Comput Vision 128(8–9):2002–2034. https://doi.org/10.1007/s11263-020-01308-z
Article Google Scholar
Jaber AG, Muniyandi RC, Usman OL, Singh HKR (2022) A hybrid method of enhancing accuracy of facial recognition system using gabor filter and stacked sparse autoencoders deep neural network. Applied Sciences (Switzerland), 12(21). https://doi.org/10.3390/app122111052
Jain N, Kumar S, Kumar A, Shamsolmoali P, Zareapoor M (2018) Hybrid deep neural networks for face emotion recognition. Pattern Recogn Lett 115:101–106. https://doi.org/10.1016/j.patrec.2018.04.010
Article Google Scholar
Shao J, Qian Y (2019) Three convolutional neural network models for facial expression recognition in the wild. Neurocomputing 355:82–92. https://doi.org/10.1016/j.neucom.2019.05.005
Article Google Scholar
Weiss K, Khoshgoftaar TM, Wang DD (2016) A survey of transfer learning. Journal of Big Data (Vol. 3). Springer International Publishing. https://doi.org/10.1186/s40537-016-0043-6
Panigrahi S, Nanda A, Swarnkar T (2021) A Survey on transfer learning. Smart Innovation Syst Technol 194:781–789. https://doi.org/10.1007/978-981-15-5971-6_83
Article Google Scholar
Pan SJ, Yang Q (2010) A survey on transfer learning. IEEE Trans Knowl Data Eng 22(10):1345–1359. https://doi.org/10.1109/TKDE.2009.191
Article Google Scholar
Sharma NK, Rahamatkar S, Rathore AS (2023) Analysis of facial geometry to identify and extract face landmarks using transfer learning. Proceedings of the International Conference on Engineering Research and Application 2022 (Icera 2022), 2936, 020034. https://doi.org/10.1063/5.0175339
Agrawal S, Sahu SP (2023) Image-based Parkinson disease detection using deep transfer learning and optimization algorithm. Int J Inf Technol. https://doi.org/10.1007/s41870-023-01601-3
Article Google Scholar
Mosayyebi F, Seyedarabi H, Afrouzian R (2023) Gender recognition in masked facial images using EfficientNet and transfer learning approach. Int J Inf Technol. https://doi.org/10.1007/s41870-023-01565-4
Article Google Scholar
Ma J, Li X, Ren Y, Yang R, Zhao Q (2021) Landmark-Based Facial Feature Construction and Action Unit Intensity Prediction. Mathematical Problems in Engineering, 2021. https://doi.org/10.1155/2021/6623239
Križaj J, Peer P, Štruc V, Dobrišek S (2020) Simultaneous multi-descent regression and feature learning for facial landmarking in depth images. Neural Comput Appl 32(24):17909–17926. https://doi.org/10.1007/s00521-019-04529-7
Article Google Scholar
Le V, Brandt J, Lin Z, Bourdev L, Huang TS (2012) Interactive facial feature localization. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 7574 LNCS(PART 3), 679–692. https://doi.org/10.1007/978-3-642-33712-3_49
Pourramezan Fard A, Mahoor MH (2022) Facial landmark points detection using knowledge distillation-based neural networks. Comput Vis Image Underst 215(2015):1–15. https://doi.org/10.1016/j.cviu.2021.103316
Article Google Scholar
Cootes TF, Edwards GJ, Taylor CJ (2001) Active appearance models. IEEE Trans Pattern Anal Mach Intell 23(6):681–685. https://doi.org/10.1109/34.927467
Article Google Scholar
Khabarlak K, Koriashkina L (2022) Fast Facial Landmark Detection and Applications: A Survey. Journal of Computer Science and Technology(Argentina), 22(1), 12–41. https://doi.org/10.24215/16666038.22.e02
Sagonas C, Tzimiropoulos G, Zafeiriou S, Pantic M (2013) 300 faces in-the-wild challenge: The first facial landmark Localization Challenge. Proceedings of the IEEE International Conference on Computer Vision, 397–403. https://doi.org/10.1109/ICCVW.2013.59
Köstinger M, Wohlhart P, Roth PM, Bischof H (2011) Annotated facial landmarks in the wild: A large-scale, real-world database for facial landmark localization. Proceedings of the IEEE International Conference on Computer Vision, (March 2014), 2144–2151. https://doi.org/10.1109/ICCVW.2011.6130513
Zhu X, Ramanan D (2012) Face detection, pose estimation, and landmark localization in the wild. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2879–2886. https://doi.org/10.1109/CVPR.2012.6248014
Martinez AM (n.d.) AR Face Database. Retrieved from https://www2.ece.ohio-state.edu/~aleix/ARdatabase.html#:~:text=This face database was created,70 men and 56 women).
Messer K, Matas J, Kittler J, Luettin J, Maitre G (1999) XM2VTSDB : The Extended M2VTS Database University of Surrey 1 Introduction 2 Database Speci cation 3 The Database Acquisition System. Proceedings of the Second international conference on audio and video-based biometric person authentication, 1–6.
Gross R, Matthews I, Cohn J, Kanade T, Baker S (n.d.). Multi-PIE.
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2016-Decem, 770–778. https://doi.org/10.1109/CVPR.2016.90
Zhang Q (2022) A novel ResNet101 model based on dense dilated convolution for image classification. SN Applied Sciences, 4(1). https://doi.org/10.1007/s42452-021-04897-7
Asthana A, Zafeiriou S, Cheng S, Pantic M (2013) Robust Discriminative Response Map Fitting with Constrained Local Models. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition (pp. 3444–3451). https://doi.org/10.1109/CVPR.2013.442
Baltrušaitis T, Robinson P, Morency LP (2014) Continuous conditional neural fields for structured regression. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 8692 LNCS(PART 4), 593–608. https://doi.org/10.1007/978-3-319-10593-2_39
Tan M, Le QV (2019) EfficientNet: Rethinking model scaling for convolutional neural networks. 36th International Conference on Machine Learning, ICML 2019, 2019-June, 10691–10700.
Howard AG, Zhu M, Chen B, Kalenichenko D, Wang W, Weyand T, Adam H (2017) MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. Retrieved from http://arxiv.org/abs/1704.04861

Download references

Acknowledgements

The research is supported by Technosys Security System Pvt. Ltd. Bhopal (M.P.) India, under a research collaboration. The authors would like to express gratitude to Technosys Security System Pvt. Ltd. for their support and valuable suggestions.

Funding

There is no direct and indirect finance involved in it.

Author information

Authors and Affiliations

Amity School of Engineering & Technology, Amity University Chhattisgarh, Raipur, Chhattisgarh, India
Nand Kishore Sharma
Shri Vaishnav Vidyapeeth Vishwavidyalaya, Indore, Madhya Pradesh, India
Abhishek Singh Rathore
Avantika University, Ujjain, Madhya Pradesh, India
Surendra Rahamatkar

Authors

Nand Kishore Sharma
View author publications
You can also search for this author in PubMed Google Scholar
Surendra Rahamatkar
View author publications
You can also search for this author in PubMed Google Scholar
Abhishek Singh Rathore
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Nand Kishore Sharma.

Ethics declarations

Conflict of interest

The authors declare that there is no Conflict of Interest for publication of this manuscript. And the authors declare no competing interests.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Sharma, N.K., Rahamatkar, S. & Rathore, A.S. Enhancing facial geometry analysis by DeepFaceLandmark leveraging ResNet101 and transfer learning. Int. j. inf. tecnol. (2024). https://doi.org/10.1007/s41870-024-01872-4

Download citation

Received: 20 December 2023
Accepted: 08 April 2024
Published: 12 May 2024
DOI: https://doi.org/10.1007/s41870-024-01872-4

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Enhancing facial geometry analysis by DeepFaceLandmark leveraging ResNet101 and transfer learning

Abstract

Access this article

Similar content being viewed by others

SSD: Single Shot MultiBox Detector

Object detection using YOLO: challenges, architectural successors, datasets and applications

YOLO-based Object Detection Models: A Review and its Applications

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Enhancing facial geometry analysis by DeepFaceLandmark leveraging ResNet101 and transfer learning

Abstract

Access this article

Similar content being viewed by others

SSD: Single Shot MultiBox Detector

Object detection using YOLO: challenges, architectural successors, datasets and applications

YOLO-based Object Detection Models: A Review and its Applications

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation