Abstract
Sign language is essential for natural and meaningful communication between the deaf community and the hearing population. Most studies on Arabic sign language recognition identify the signed action from feature descriptors. The limitation of this traditional approach is that the important features must be chosen for each particular sequence. To address this issue, we propose a novel approach based on deep learning architectures to classify video sequences of Arabic sign language, specifically Moroccan sign language. Two classification methods are applied: a 2D convolutional recurrent neural network (2DCRNN) and a 3D convolutional neural network (3DCNN). In the first method, the 2DCRNN model extracts features, with a recurrent network detecting the relationship between frames. In the second method, the 3DCNN model learns spatiotemporal features from small patches. After the 2DCRNN and 3DCNN models extract the features, a fully connected network classifies the video data into the sign classes. The proposed approach is trained on a collection of 224 videos of five individuals performing 56 different signs. The results obtained with fourfold cross-validation demonstrate the performance of the proposed approach in terms of recall, F1 score, and AUROC, with an accuracy of 92% for 2DCRNN and 99% for 3DCNN.
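The evaluation protocol named in the abstract (fourfold cross-validation scored by recall and F1) can be illustrated with a minimal sketch. It is not the authors' code. It assumes, hypothetically, that the 224 videos are spread evenly as four videos per sign, that folds are stratified by sign, and that recall and F1 are macro-averaged over classes; none of these details is stated in the abstract. The helper names `stratified_folds` and `macro_recall_f1` are illustrative only, and the model training step is left out.

```python
from collections import defaultdict


def stratified_folds(labels, k):
    """Split sample indices into k folds, round-robin within each class."""
    per_class = defaultdict(list)
    for idx, label in enumerate(labels):
        per_class[label].append(idx)
    folds = [[] for _ in range(k)]
    for label in sorted(per_class):
        for pos, idx in enumerate(per_class[label]):
            folds[pos % k].append(idx)
    return folds


def macro_recall_f1(y_true, y_pred):
    """Return (macro recall, macro F1) averaged over the classes in y_true."""
    classes = sorted(set(y_true))
    recalls, f1_scores = [], []
    for c in classes:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == c and p == c)
        fn = sum(1 for t, p in pairs if t == c and p != c)
        fp = sum(1 for t, p in pairs if t != c and p == c)
        recall = tp / (tp + fn) if tp + fn else 0.0
        precision = tp / (tp + fp) if tp + fp else 0.0
        if precision + recall:
            f1 = 2 * precision * recall / (precision + recall)
        else:
            f1 = 0.0
        recalls.append(recall)
        f1_scores.append(f1)
    return sum(recalls) / len(classes), sum(f1_scores) / len(classes)


# Assumed dataset layout (hypothetical): 56 signs x 4 videos = 224 videos.
NUM_SIGNS, VIDEOS_PER_SIGN, K = 56, 4, 4
labels = [sign for sign in range(NUM_SIGNS) for _ in range(VIDEOS_PER_SIGN)]
folds = stratified_folds(labels, K)

# Each round holds out one fold for testing and trains on the other three.
splits = []
for held_out in range(K):
    test_idx = folds[held_out]
    train_idx = [i for f in range(K) if f != held_out for i in folds[f]]
    splits.append((train_idx, test_idx))

# Toy predictions, only to show how the two metrics are computed.
toy_recall, toy_f1 = macro_recall_f1([0, 0, 1, 1], [0, 1, 1, 1])
```

Under these assumptions every round trains on 168 videos and tests on 56, one per sign. In practice, a classifier such as the 2DCRNN or 3DCNN would be fitted on `train_idx` and scored on `test_idx` in each round, with the per-fold metrics averaged at the end.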
Acknowledgements
The authors acknowledge the support of Achourouk, Association of the Deaf of Ouarzazate, for sharing the database of Moroccan sign language used in this work.
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
About this article
Cite this article
Boukdir, A., Benaddy, M., Ellahyani, A. et al. Isolated Video-Based Arabic Sign Language Recognition Using Convolutional and Recursive Neural Networks. Arab J Sci Eng 47, 2187–2199 (2022). https://doi.org/10.1007/s13369-021-06167-5
DOI: https://doi.org/10.1007/s13369-021-06167-5