Abstract
There is a long drawn communication barrier between normal people and deaf-mute community. Sign language is a major tool of communication for hearing impaired people. The goal of this work is to develop a Convolutional Neural Network (CNN) based Indian sign language classifier. CNN models with combination of different hidden layers are analysed and the model giving highest accuracy is selected. Further synthetic data is generated using Conditional Generative Adversarial Network (CGAN), in order to improve classification accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
The dataset used in this work can be found here.
References
Huang J, Zhou W, Li H, Li W (2015) Sign Language Recognition using 3D convolutional neural networks. In: 2015 IEEE International conference on multimedia and expo (ICME), pp 1–6
Elakkiya R, Vijayakumar P, Kumar N (2021) An optimized generative adversarial network based continuous sign language classification. Expert Syst Appl, p 11527
Sharmaa P, Anand RS (2021) A comprehensive evaluation of deep models and optimizers for Indian sign language recognition. In: Graphics and visual computing
Salimans T, Goodfellow I, Zaremba W, Cheung V, Radford A, Chen X (2016) Improved techniques for training gans. Adv Neural Inf Process Syst 29:2234–2242
Rastgoo R, Kiani K, Escalera S, Sabokrou M (2021) Sign language production: a review. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 3451–3461
Camgoz NC, Hadfield S, Koller O, Ney H, Bowden R (2018) Neural sign language translation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7784–7793
LiaoY Xiong P, Min W, Min W, Lu J (2019) Dynamic sign language recognition based on video sequence with BLSTM-3D residual networks. IEEE Access 7:38044–38054
Miyato T, Kataoka T, Koyama M, Yoshida Y (2018) Spectral normalization for generative adversarial networks. arXiv preprint arXiv:1802.05957
Poorna SS, Ravi Kiran Reddy M, Akhil N, Kamath S, Mohan L, Anuraj K, Pradeep HS (2020) Computer vision aided study for melanoma detection: a deep learning versus conventional supervised learning approach. In: Advanced computing and intelligent engineering. Springer, Singapore, pp 75–83
Bharath Chandra BV, Naveen C, Sampath Kumar MM, Sai Bhargav MS, Poorna SS, Anuraj K (2021) A comparative study of drowsiness detection from Eeg signals using pretrained CNN models. In: 2021 12th International conference on computing communication and networking technologies (ICCCNT), pp 1–3. https://doi.org/10.1109/ICCCNT51525.2021.9579555
Aloysius N, Geetha M (2017) A review on deep convolutional neural networks. In: International conference on communication and signal processing (ICCSP), pp 588–592. https://doi.org/10.1109/ICCSP.2017.8286426
Geetha M, Manjusha C, Unnikrishnan P, Harikrishnan R (2013) A vision based dynamic gesture recognition of Indian Sign Language on Kinect based depth images. In: 2013 International conference on emerging trends in communication, control, signal processing and computing applications (C2SPCA), pp 1–7. https://doi.org/10.1109/C2SPCA.2013.6749448
Ioffe S, Szegedy C (2015) Batch normalization: Accelerating deep network training by reducing internal covariate shift. In: International conference on machine learning, pp 448–456
Lee KS, Town C (2020) Mimicry: towards the reproducibility of gan research. arXiv preprint arXiv:2005.02494
Miyato T, Koyama M (2018) cGANs with projection discriminator. arXiv preprint arXiv:1802.05637
Odena Augustus, Dumoulin Vincent, Olah Chris (2016) Deconvolution and checkerboard artifacts. Distill 1(10):e3
Nguyen A, Clune J, Bengio Y, Dosovitskiy A, Yosinski J (2017) Plug & play generative networks: conditional iterative generation of images in latent space. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4467–4477
Heusel M, Ramsauer H, Unterthiner T, Nessler B, Hochreiter S (2017) Gans trained by a two time-scale update rule converge to a local nash equilibrium. In: Advances in neural information processing systems, vol 30
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Charan, M.G.K.S. et al. (2022). Sign Language Recognition Using CNN and CGAN. In: Suma, V., Baig, Z., Kolandapalayam Shanmugam, S., Lorenz, P. (eds) Inventive Systems and Control. Lecture Notes in Networks and Systems, vol 436. Springer, Singapore. https://doi.org/10.1007/978-981-19-1012-8_33
Download citation
DOI: https://doi.org/10.1007/978-981-19-1012-8_33
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-1011-1
Online ISBN: 978-981-19-1012-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)