Face Recognition Using 3D CNNs

Mishra, Nayaneesh Kumar; Singh, Satish Kumar

doi:10.1007/978-981-16-1681-5_18

Nayaneesh Kumar Mishra⁶ &
Satish Kumar Singh⁶

Part of the book series: Transactions on Computer Systems and Networks ((TCSN))

1485 Accesses

Abstract

The area of face recognition is one of the most widely researched areas in the domain of computer vision and biometric. This is because the non-intrusive nature of face biometric makes it comparatively more suitable for application in area of surveillance at public places such as airports. The application of primitive methods in face recognition could not give very satisfactory performance. However, with the advent of machine and deep learning methods and their application in face recognition, several major breakthroughs were obtained. The use of 2D convolution neural networks(2D CNN) in face recognition crossed the human face recognition accuracy and reached to 99%. Still, robust face recognition in the presence of real-world conditions such as variation in resolution, illumination and pose is a major challenge for researchers in face recognition. In this work, we used video as input to the 3D CNN architectures for capturing both spatial and time domain information from the video for face recognition in real-world environment. For the purpose of experimentation, we have developed our own video dataset called CVBL video dataset. The use of 3D CNN for face recognition in videos shows promising results with DenseNets performing the best with an accuracy of 97% on CVBL dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 149.00; Price excludes VAT (USA)

Softcover Book: USD 199.99; Price excludes VAT (USA)

Hardcover Book: USD 199.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Ahonen T, Hadid A, Pietikäinen M (2004) Face recognition with local binary patterns. In: Computer vision-ECCV 2004. Springer, pp 469–481
Google Scholar
Ahonen T, Rahtu E, Ojansivu V, Heikkila J (2008) Recognition of blurred faces using local phase quantization. In: International conference on pattern recognition
Google Scholar
Bilgazyev E, Efraty B, Shah SK, Kakadiaris IA (2011) Improved face recognition using super-resolution. In: 2011 international joint conference on biometrics (IJCB). IEEE, pp 1–7
Google Scholar
Brunelli R, Poggio T (1993) Face recognition: features versus templates. IEEE Trans Pattern Anal Mach Intell 15(10):1042–1052
Article Google Scholar
Chellappa R, Wilson CL, Sirohey S (1995) Human and machine recognition of faces: a survey. Proc IEEE 83(5):705–740
Article Google Scholar
CVBL Dataset. https://cvbl.iiita.ac.in/dataset.php. Last accessed 30 Dec 2018
Deng J, Guo J, Xue N, Zafeiriou S (2018) Arcface: Additive angular margin loss for deep face recognition. arXiv preprint arXiv:1801.07698
Donahue J, Hendricks LA, Guadarrama S, Rohrbach M, Venugopalan S, Saenko K, Darrell T (2015) Long-term recurrent convolutional networks for visual recognition and description. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2625–2634
Google Scholar
Gunturk BK, Batur AU, Altunbasak Y, Hayes MH, Mersereau RM (2003) Eigenface-domain super-resolution for face recognition. IEEE Trans Image Process 12(5):597–606
Article Google Scholar
Hadsell R, Chopra S, LeCun Y (2006) Dimensionality reduction by learning an invariant mapping. In: Null. IEEE, pp 1735–1742
Google Scholar
Hara K, Kataoka H, Satoh Y (2017) Can spatiotemporal 3D CNNs retrace the history of 2D CNNs and ImageNet?arXiv preprint arXiv:1711.09577
He K, Zhang X, Ren S, Sun J (2016) Identity mappings in deep residual networks. European conference on computer vision. Springer, Cham, pp 630–645
Google Scholar
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Google Scholar
He K, Zhang X, Ren S, Sun J (2016) Identity mappings in deep residual networks. In: Proceedings of the European conference on computer vision (ECCV), pp 630–645
Google Scholar
Huang G, Liu Z, van der Maaten L, Weinberger KQ (2017) Densely connected convolutional networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 4700–4708
Google Scholar
Jain AK, Duin RPW, Mao J (2000) Statistical pattern recognition: a review. IEEE Trans Pattern Anal Mach Intell 22(1):4–37
Article Google Scholar
Karpathy A, Toderici G, Shetty S, Leung T, Sukthankar R, Fei-Fei L (2014) Large-scale video classification with convolutional neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1725–1732
Google Scholar
Kuehne H, Jhuang H, Garrote E, Poggio T, Serre T (2011) HMDB: a large video database for human motion recognition. In: 2011 IEEE international conference on computer vision (ICCV). IEEE, pp 2556–2563
Google Scholar
Lin M, Chen Q, Yan S (2013) Network in network. arXiv preprint arXiv:1312.4400
Liu W et al (2017) Sphereface: deep hypersphere embedding for face recognition. In: The IEEE conference on computer vision and pattern recognition (CVPR), vol 1
Google Scholar
Liu W, Wen Y, Yu Z, Yang M (2016) Large-margin softmax loss for convolutional neural networks. In: ICML, pp 507–516
Google Scholar
PyTorch. https://pytorch.org/. Last accessed 25 Dec 2018
Schroff F, Kalenichenko D, Philbin J (2015) Facenet: a unified embedding for face recognition and clustering. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 815–823
Google Scholar
Soomro K, Zamir AR, Shah M (2012) UCF101: a dataset of 101 human action classes from videos in the wild. In: CRCV-TR-12-01, Nov (2012)
Google Scholar
Sun Y, W, Tang X (2015) Deeply learned face representations are sparse, selective, and robust. In: Proceedings of the IEEE conference on computer vision and pattern recognition
Google Scholar
Tran D, Bourdev L, Fergus R, Torresani L, Paluri M (2015) Learning spatiotemporal features with 3d convolutional networks. In: Proceedings of the IEEE international conference on computer vision, pp 4489–4497
Google Scholar
Turk M, Pentland A (1991) Eigenfaces for recognition. J Cogn Neurosci 3(1):71–86
Article Google Scholar
Varol G, Laptev I, Schmid C (2018) Long-term temporal convolutions for action recognition. IEEE Trans Pattern Anal Mach Intell 40(6):1510–1517
Article Google Scholar
Viola P, Jones MJ (2004) Robust real-time face detection. Int J Comput Vis 57(2):137–154
Article Google Scholar
Wen Y et al (2016) A discriminative feature learning approach for deep face recognition. In: European conference on computer vision. Springer, Cham
Google Scholar
Wibowo ME, Tjondronegoro D, Chandran V (2012) Probabilistic matching of image sets for video-based face recognition. In: International conference on digital image computing: techniques and applications (DICTA)
Google Scholar
Wibowo ME, Tjondronegoro D (2012) Face recognition across pose on video using eigen light-fields. International conference on digital image computing: techniques and applications (DICTA) 2011:536–541
Google Scholar
Wiskott L, Fellous J-M, Kruger N, Von Malsburg CD (1997) Face recognition by Elastic Bunch graph matching. IEEE Trans Pattern Anal Mach Intell 19(7):775–779
Article Google Scholar
Wolf L, Hassner T, Maoz I (2011) Face recognition in unconstrained videos with matched background similarity. In: CVPR
Google Scholar
Xie X, Zheng W-S, Lai J, Yuen PC, Suen CY (2011) Normalization of face illumination based on large-and small-scale features. IEEE Trans Image Process 20(7):1807–1821
Article MathSciNet Google Scholar
Xie S, Girshick R, Dollár P, Tu Z, He K (2017) Aggregated residual transformations for deep neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 1492–1500
Google Scholar
Yue-Hei Ng J, Hausknecht M, Vijayanarasimhan S, Vinyals O, Monga R, Toderici G (2015) Beyond short snippets: deep networks for video classification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4694–4702
Google Scholar
Zagoruyko S, Komodakis N (2016) Wide residual networks. In: Proceedings of the British machine vision conference
Google Scholar
Zhu X, Lei Z, Yan J, Yi D, Li SZ (2015) High-fidelity pose and expression normalization for face recognition in the wild. Proc IEEE Conf Comput Vis Pattern Recogn:787–796
Google Scholar

Download references

Author information

Authors and Affiliations

Computer Vision and Biometric Lab, IIIT Allahabad, Allahabad, India
Nayaneesh Kumar Mishra & Satish Kumar Singh

Authors

Nayaneesh Kumar Mishra
View author publications
You can also search for this author in PubMed Google Scholar
Satish Kumar Singh
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Engineering, National Institute of Technology Kurukshetra, Kurukshetra, India
Gyanendra K. Verma
Department of Computer Science and Engineering, National Institute of Technology Silchar, Silchar, India
Badal Soni
Multidimensional Signal Processing Group, Ecole Centrale Marseille, MARSEILLE, France
Salah Bourennane
Mathematics and Computing Institute, Universidade Federal de Itajuba, Itajuba, Brazil
Alexandre C. B. Ramos

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Mishra, N.K., Singh, S.K. (2021). Face Recognition Using 3D CNNs. In: Verma, G.K., Soni, B., Bourennane, S., Ramos, A.C.B. (eds) Data Science. Transactions on Computer Systems and Networks. Springer, Singapore. https://doi.org/10.1007/978-981-16-1681-5_18

Download citation

DOI: https://doi.org/10.1007/978-981-16-1681-5_18
Published: 20 August 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-1680-8
Online ISBN: 978-981-16-1681-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics