Employing Data Augmentation for Recognition of Hand Gestures Using Deep Learning

Kumar, Deepak; Aleem, Abdul; Gore, Manoj Madhava

doi:10.1007/978-981-33-4582-9_25

Deepak Kumar⁶,
Abdul Aleem⁶ &
Manoj Madhava Gore⁶

Part of the book series: Lecture Notes on Data Engineering and Communications Technologies ((LNDECT,volume 61))

Included in the following conference series:

Congress on Intelligent Systems

352 Accesses
1 Citations

Abstract

Hand gestures are a form of non-verbal communication. Apart from traditional input devices, hand gestures are used for interaction with computers too. Communication via hand gestures finds many applications in the real world. Different people have hands with different shapes and orientations, which is termed as nonlinearity. The nonlinearity affects the performance of hand gesture models. A convolutional neural network (CNN) is an approach of neural networks, specifically known as deep learning. CNN is used to recognize and classify images. Sometimes, CNN could not correctly understand the hand gesture due to nonlinearity. Data augmentation helps CNN to understand the nonlinearity and complexity of images better. Data augmentation generates enormous data from lesser data, thus increasing the data adversity. Data augmentation uses various operations like zooming, rotating, shifting, shearing, and scaling to generate more data from the existing data. This article executes a CNN model using augmented data for recognition of static hand gestures. The dataset consists of 10 different hand gestures. The experimented CNN model has been trained using 10000 images and tested using 1000 images. The changes in the output of CNN with and without data augmentation have been highlighted. The CNN model employing data augmentation achieved an accuracy of 98.10%, whereas the CNN model excluding the data augmentation process attained an accuracy of 94.90% only.

This work was done as a part of M. Tech. Thesis [1] during his stay at MNNIT Allahabad as a Master’s student.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 219.00; Price excludes VAT (USA)

Softcover Book: USD 279.99; Price excludes VAT (USA)

Hardcover Book: USD 279.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
the number of pixels shifts over the input matrix.

References

Kumar D (2020) M. Tech. Thesis: Employing data augmentation for recognition of hand gesture using deep learning. Tech Rep, MNNIT Allahabad, India
Google Scholar
Erhan D, Szegedy C, Toshev A, Anguelov D (2014) Scalable object detection using deep neural networks. In: The IEEE conference on computer vision and pattern recognition (CVPR)
Google Scholar
Zeiler MD., Fergus R (2014) Visualizing and understanding convolutional networks. In European Conference on Computer Vision. Springer, pp 818–833
Google Scholar
Li G, Tang H, Sun Y, Kong J, Jiang G, Jiang D, Tao B, Xu S, Liu H (2019) Hand gesture recognition based on convolution neural network. Cluster Comput 22(2):2719–2729
Article Google Scholar
Perez L, Wang J (2017) The effectiveness of data augmentation in image classification using deep learning. arXiv preprint arXiv:1712.04621
Mikołajczyk A, Grochowski M (2018) Data augmentation for improving deep learning in image classification problem. In: 2018 international interdisciplinary PhD workshop (IIPhDW). IEEE, pp 117–122
Google Scholar
Shijie J, Ping W, Peiyi J, Siping H (2017) Research on data augmentation for image classification based on convolution neural networks. In: 2017 Chinese automation congress (CAC). IEEE, pp 4165–4170
Google Scholar
Nutipalli P, Gudla SPK, Yogitha B, Rajesh G A comparative analysis on hand gesture recognition using deep learning
Google Scholar
Obaid F, Babadi A, Yoosofan A (2020) Hand gesture recognition in video sequences using deep convolutional and recurrent neural networks. Appl Comput Syst 25(1):57–61
Article Google Scholar
Molchanov P, Gupta S, Kim K, Kautz J (2015) Hand gesture recognition with 3d convolutional neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops pp 1–7
Google Scholar
Yingxin X, Jinghua L, Lichun W, Dehui K (2016) A robust hand gesture recognition method via convolutional neural network. In: 2016 6th international conference on digital home (ICDH), pp 64–67. IEEE
Google Scholar
Oyedotun OK, Khashman A (2017) Deep learning in vision-based static hand gesture recognition. Neu Comput Appl 28(12):3941–3951
Article Google Scholar
Islam MZ, Hossain MS, Ul Islam R, Andersson K (2019) Static hand gesture recognition using convolutional neural network with data augmentation. In: 2019 Joint 8th international conference on informatics, electronics and vision (ICIEV) and 2019 3rd international conference on imaging, vision and pattern recognition (iCIVPR), pp 324–329. IEEE
Google Scholar
KaewTraKulPong P, Bowden R (2002) An improved adaptive background mixture model for real-time tracking with shadow detection. In: Video-based surveillance systems, pp 135–144. Springer
Google Scholar
Zivkovic Z (2004) Improved adaptive gaussian mixture model for background subtraction. In Proceedings of the 17th international conference on pattern recognition, vol 2. ICPR 2004. IEEE, pp 28–31
Google Scholar
Grundland M, Dodgson NA (2007) Decolorize: fast, contrast enhancing, color to grayscale conversion. Pattern Recog 40(11):2891–2896
Article Google Scholar

Download references

Author information

Authors and Affiliations

CSE Department, MNNIT Allahabad, Prayagraj, India
Deepak Kumar, Abdul Aleem & Manoj Madhava Gore

Authors

Deepak Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Abdul Aleem
View author publications
You can also search for this author in PubMed Google Scholar
Manoj Madhava Gore
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Abdul Aleem .

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Rajasthan Technical University, Kota, Rajasthan, India
Harish Sharma
Department of Computer Science & Engineering and Information Technology, Jaypee Institute of Information Technology, Noida, Uttar Pradesh, India
Mukesh Saraswat
Department of Computer Science and Engineering, CHRIST (Deemed to be University), Bangalore, Karnataka, India
Sandeep Kumar
Department of Mathematics, South Asian University, New Delhi, Delhi, India
Jagdish Chand Bansal

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kumar, D., Aleem, A., Gore, M.M. (2021). Employing Data Augmentation for Recognition of Hand Gestures Using Deep Learning. In: Sharma, H., Saraswat, M., Kumar, S., Bansal, J.C. (eds) Intelligent Learning for Computer Vision. CIS 2020. Lecture Notes on Data Engineering and Communications Technologies, vol 61. Springer, Singapore. https://doi.org/10.1007/978-981-33-4582-9_25

Download citation

DOI: https://doi.org/10.1007/978-981-33-4582-9_25
Published: 20 May 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-33-4581-2
Online ISBN: 978-981-33-4582-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics