Employing Data Augmentation for Recognition of Hand Gestures Using Deep Learning

  • Conference paper
Intelligent Learning for Computer Vision (CIS 2020)

Part of the book series: Lecture Notes on Data Engineering and Communications Technologies (LNDECT, volume 61)

Abstract

Hand gestures are a form of non-verbal communication. Alongside traditional input devices, they are also used to interact with computers, and gesture-based communication has many real-world applications. Different people have hands of different shapes and orientations, a variation termed nonlinearity here, and this nonlinearity degrades the performance of hand gesture models. A convolutional neural network (CNN) is a type of deep neural network used to recognize and classify images. A CNN sometimes fails to recognize a hand gesture correctly because of this nonlinearity. Data augmentation helps a CNN cope with the nonlinearity and complexity of images. It generates a large amount of data from a smaller set, thereby increasing data diversity, by applying operations such as zooming, rotating, shifting, shearing, and scaling to the existing data. This article trains a CNN model on augmented data to recognize static hand gestures. The dataset consists of 10 different hand gestures. The CNN model was trained on 10,000 images and tested on 1,000 images. The differences in the CNN's output with and without data augmentation are highlighted. With data augmentation, the CNN model achieved an accuracy of 98.10%, whereas without it the model attained only 94.90%.

This work was done as part of the first author's M. Tech. thesis [1] during his stay at MNNIT Allahabad as a Master's student.
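The augmentation operations named in the abstract (zooming, rotating, shifting, shearing, and scaling) can all be expressed as one affine warp of the image. The following is a minimal, dependency-free sketch of that idea, not the authors' pipeline: the function names `warp` and `augment`, the parameter ranges, and the nearest-neighbour sampling are all assumptions made for illustration, and zooming and scaling are treated here as a single uniform scale factor.

```python
import math
import random


def warp(img, matrix, shift=(0, 0), fill=0):
    """Affine-warp a 2-D image (a list of rows) about its centre.

    `matrix` is the forward 2x2 transform ((a, b), (c, d)) acting on (x, y)
    offsets from the centre; `shift` is (dx, dy) in pixels. Each output pixel
    is filled by inverse mapping with nearest-neighbour sampling.
    """
    (a, b), (c, d) = matrix
    det = a * d - b * c
    if det == 0:
        raise ValueError("matrix must be invertible")
    # Inverse of the 2x2 matrix, used to look up the source pixel.
    ia, ib, ic, id_ = d / det, -b / det, -c / det, a / det
    h, w = len(img), len(img[0])
    cy, cx = (h - 1) / 2, (w - 1) / 2
    dx, dy = shift
    out = [[fill] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            u, v = x - cx - dx, y - cy - dy
            sx = round(cx + ia * u + ib * v)
            sy = round(cy + ic * u + id_ * v)
            if 0 <= sx < w and 0 <= sy < h:
                out[y][x] = img[sy][sx]
    return out


def augment(img, rng):
    """Return one randomly rotated, sheared, zoomed, and shifted copy.

    All ranges below are illustrative assumptions, not values from the paper.
    """
    angle = math.radians(rng.uniform(-15.0, 15.0))
    zoom = rng.uniform(0.9, 1.1)
    shear = rng.uniform(-0.1, 0.1)
    shift = (rng.randint(-2, 2), rng.randint(-2, 2))
    cos_a, sin_a = math.cos(angle), math.sin(angle)
    # Product of rotation and shear matrices, scaled by the zoom factor.
    matrix = (
        (zoom * cos_a, zoom * (cos_a * shear - sin_a)),
        (zoom * sin_a, zoom * (sin_a * shear + cos_a)),
    )
    return warp(img, matrix, shift)


if __name__ == "__main__":
    rng = random.Random(0)
    # A tiny 5x5 stand-in for a gesture image: a bright vertical bar.
    image = [[1 if x == 2 else 0 for x in range(5)] for _ in range(5)]
    batch = [augment(image, rng) for _ in range(4)]
    print(len(batch), "augmented copies generated from one image")
```

Each call to `augment` yields a differently transformed copy of the same image, which is how a small dataset can be expanded into a larger and more varied one. In practice an image-processing or deep learning library would normally perform these transforms with interpolation rather than nearest-neighbour lookup.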

Notes

  1. The number of pixels shifts over the input matrix.
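Assuming this note defines the stride of a convolution layer (the note itself does not name the term), the effect of the shift on the output size can be sketched with the standard formula floor((n + 2p - k) / s) + 1. The function name `conv_output_size` and its parameters are hypothetical and chosen only for illustration.

```python
def conv_output_size(n, k, stride=1, padding=0):
    """Output length along one axis for an n-wide input and a k-wide filter.

    `stride` is the number of pixels the filter shifts at each step;
    `padding` is the number of pixels added to each side of the input.
    """
    if stride < 1:
        raise ValueError("stride must be at least 1")
    if k > n + 2 * padding:
        raise ValueError("filter is wider than the padded input")
    return (n + 2 * padding - k) // stride + 1


if __name__ == "__main__":
    # A 3-wide filter over a 7-wide input at two different shifts.
    for s in (1, 2):
        print("stride", s, "->", conv_output_size(7, 3, stride=s))
```

A larger shift visits fewer positions, so the output shrinks as the stride grows.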

References

  1. Kumar D (2020) M. Tech. Thesis: Employing data augmentation for recognition of hand gesture using deep learning. Tech Rep, MNNIT Allahabad, India

  2. Erhan D, Szegedy C, Toshev A, Anguelov D (2014) Scalable object detection using deep neural networks. In: The IEEE conference on computer vision and pattern recognition (CVPR)

  3. Zeiler MD, Fergus R (2014) Visualizing and understanding convolutional networks. In: European conference on computer vision. Springer, pp 818–833

  4. Li G, Tang H, Sun Y, Kong J, Jiang G, Jiang D, Tao B, Xu S, Liu H (2019) Hand gesture recognition based on convolution neural network. Cluster Comput 22(2):2719–2729

  5. Perez L, Wang J (2017) The effectiveness of data augmentation in image classification using deep learning. arXiv preprint arXiv:1712.04621

  6. Mikołajczyk A, Grochowski M (2018) Data augmentation for improving deep learning in image classification problem. In: 2018 international interdisciplinary PhD workshop (IIPhDW). IEEE, pp 117–122

  7. Shijie J, Ping W, Peiyi J, Siping H (2017) Research on data augmentation for image classification based on convolution neural networks. In: 2017 Chinese automation congress (CAC). IEEE, pp 4165–4170

  8. Nutipalli P, Gudla SPK, Yogitha B, Rajesh G A comparative analysis on hand gesture recognition using deep learning

  9. Obaid F, Babadi A, Yoosofan A (2020) Hand gesture recognition in video sequences using deep convolutional and recurrent neural networks. Appl Comput Syst 25(1):57–61

  10. Molchanov P, Gupta S, Kim K, Kautz J (2015) Hand gesture recognition with 3D convolutional neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 1–7

  11. Yingxin X, Jinghua L, Lichun W, Dehui K (2016) A robust hand gesture recognition method via convolutional neural network. In: 2016 6th international conference on digital home (ICDH). IEEE, pp 64–67

  12. Oyedotun OK, Khashman A (2017) Deep learning in vision-based static hand gesture recognition. Neural Comput Appl 28(12):3941–3951

  13. Islam MZ, Hossain MS, Ul Islam R, Andersson K (2019) Static hand gesture recognition using convolutional neural network with data augmentation. In: 2019 Joint 8th international conference on informatics, electronics and vision (ICIEV) and 2019 3rd international conference on imaging, vision and pattern recognition (icIVPR). IEEE, pp 324–329

  14. KaewTraKulPong P, Bowden R (2002) An improved adaptive background mixture model for real-time tracking with shadow detection. In: Video-based surveillance systems. Springer, pp 135–144

  15. Zivkovic Z (2004) Improved adaptive Gaussian mixture model for background subtraction. In: Proceedings of the 17th international conference on pattern recognition (ICPR 2004), vol 2. IEEE, pp 28–31

  16. Grundland M, Dodgson NA (2007) Decolorize: fast, contrast enhancing, color to grayscale conversion. Pattern Recogn 40(11):2891–2896

Author information

Corresponding author

Correspondence to Abdul Aleem.

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Kumar, D., Aleem, A., Gore, M.M. (2021). Employing Data Augmentation for Recognition of Hand Gestures Using Deep Learning. In: Sharma, H., Saraswat, M., Kumar, S., Bansal, J.C. (eds) Intelligent Learning for Computer Vision. CIS 2020. Lecture Notes on Data Engineering and Communications Technologies, vol 61. Springer, Singapore. https://doi.org/10.1007/978-981-33-4582-9_25
