Abstract
Hand gestures are a form of non-verbal communication. Apart from traditional input devices, hand gestures are used for interaction with computers too. Communication via hand gestures finds many applications in the real world. Different people have hands with different shapes and orientations, which is termed as nonlinearity. The nonlinearity affects the performance of hand gesture models. A convolutional neural network (CNN) is an approach of neural networks, specifically known as deep learning. CNN is used to recognize and classify images. Sometimes, CNN could not correctly understand the hand gesture due to nonlinearity. Data augmentation helps CNN to understand the nonlinearity and complexity of images better. Data augmentation generates enormous data from lesser data, thus increasing the data adversity. Data augmentation uses various operations like zooming, rotating, shifting, shearing, and scaling to generate more data from the existing data. This article executes a CNN model using augmented data for recognition of static hand gestures. The dataset consists of 10 different hand gestures. The experimented CNN model has been trained using 10000 images and tested using 1000 images. The changes in the output of CNN with and without data augmentation have been highlighted. The CNN model employing data augmentation achieved an accuracy of 98.10%, whereas the CNN model excluding the data augmentation process attained an accuracy of 94.90% only.
This work was done as a part of M. Tech. Thesis [1] during his stay at MNNIT Allahabad as a Master’s student.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
the number of pixels shifts over the input matrix.
References
Kumar D (2020) M. Tech. Thesis: Employing data augmentation for recognition of hand gesture using deep learning. Tech Rep, MNNIT Allahabad, India
Erhan D, Szegedy C, Toshev A, Anguelov D (2014) Scalable object detection using deep neural networks. In: The IEEE conference on computer vision and pattern recognition (CVPR)
Zeiler MD., Fergus R (2014) Visualizing and understanding convolutional networks. In European Conference on Computer Vision. Springer, pp 818–833
Li G, Tang H, Sun Y, Kong J, Jiang G, Jiang D, Tao B, Xu S, Liu H (2019) Hand gesture recognition based on convolution neural network. Cluster Comput 22(2):2719–2729
Perez L, Wang J (2017) The effectiveness of data augmentation in image classification using deep learning. arXiv preprint arXiv:1712.04621
Mikołajczyk A, Grochowski M (2018) Data augmentation for improving deep learning in image classification problem. In: 2018 international interdisciplinary PhD workshop (IIPhDW). IEEE, pp 117–122
Shijie J, Ping W, Peiyi J, Siping H (2017) Research on data augmentation for image classification based on convolution neural networks. In: 2017 Chinese automation congress (CAC). IEEE, pp 4165–4170
Nutipalli P, Gudla SPK, Yogitha B, Rajesh G A comparative analysis on hand gesture recognition using deep learning
Obaid F, Babadi A, Yoosofan A (2020) Hand gesture recognition in video sequences using deep convolutional and recurrent neural networks. Appl Comput Syst 25(1):57–61
Molchanov P, Gupta S, Kim K, Kautz J (2015) Hand gesture recognition with 3d convolutional neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops pp 1–7
Yingxin X, Jinghua L, Lichun W, Dehui K (2016) A robust hand gesture recognition method via convolutional neural network. In: 2016 6th international conference on digital home (ICDH), pp 64–67. IEEE
Oyedotun OK, Khashman A (2017) Deep learning in vision-based static hand gesture recognition. Neu Comput Appl 28(12):3941–3951
Islam MZ, Hossain MS, Ul Islam R, Andersson K (2019) Static hand gesture recognition using convolutional neural network with data augmentation. In: 2019 Joint 8th international conference on informatics, electronics and vision (ICIEV) and 2019 3rd international conference on imaging, vision and pattern recognition (iCIVPR), pp 324–329. IEEE
KaewTraKulPong P, Bowden R (2002) An improved adaptive background mixture model for real-time tracking with shadow detection. In: Video-based surveillance systems, pp 135–144. Springer
Zivkovic Z (2004) Improved adaptive gaussian mixture model for background subtraction. In Proceedings of the 17th international conference on pattern recognition, vol 2. ICPR 2004. IEEE, pp 28–31
Grundland M, Dodgson NA (2007) Decolorize: fast, contrast enhancing, color to grayscale conversion. Pattern Recog 40(11):2891–2896
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Kumar, D., Aleem, A., Gore, M.M. (2021). Employing Data Augmentation for Recognition of Hand Gestures Using Deep Learning. In: Sharma, H., Saraswat, M., Kumar, S., Bansal, J.C. (eds) Intelligent Learning for Computer Vision. CIS 2020. Lecture Notes on Data Engineering and Communications Technologies, vol 61. Springer, Singapore. https://doi.org/10.1007/978-981-33-4582-9_25
Download citation
DOI: https://doi.org/10.1007/978-981-33-4582-9_25
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-33-4581-2
Online ISBN: 978-981-33-4582-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)