Abstract
It is very challenging for establishing the communication with deaf people around the world. They need to get the assistance from others, and others need not be true always. To overcome this situation, a device we propose to develop an application which will provide the easy way of communicating using sign language without the help of others. The concept of this device development is a novel idea. It is intended to make the device as standalone using the recent development in embedded system technology. The proposed system is aimed to develop a pocket assistant for the deaf and hearing-impaired people in communicating with other people. All the functionality of the application is built around the organization of communication to establish a conversation between the user and his interlocutor. It is planned to develop a device to recognize the interlocutor’s speech in real time, query the related sign representation stored in database, and display the text or set of pictures in sign language on the screen. The device will be based on Raspberry Pi hardware. The technology involves capturing the audio using …. The audio will be processed by removing the noise and fed into the audio-to-text convertor to output the text message. Text information is detected using histogram of oriented gradients (HOG) and local binary pattern (LBP). The required information will be selected and queried to extract the desired sign representation from the database and provide the desired output to the user on the screen. Technological transfer of the proposed product will enable mass production that can be utilized in national and global market for the benefit of the elderly and deaf people. It has three modules, login, recording the information, and translating the information and storing it in the database.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Sirch LS, Palese A (2017) Communication difficulties experienced by deaf male patients during their in-hospital stay: findings from a qualitative descriptive study. Scand J Caring Sci 31(2):368–377
Sharma MV, Kumar NV, Masaguppi SC, Mn S, Ambika DR (2013) Virtual talk for deaf, mute, blind and normal humans. In: Proceedings of the 2013 1st Texas instruments India educators’ conference, TIIEC 2013, pp 316–320
Soltani F, Eskanderi F, Golestan S (2012) Developing a gesture-based game for deaf/mute people using microsoft kinect. In: Proceedings of sixth international conference on complex, intelligent, and software intensive systems, pp 491–495
Lowe DG (2004) Distinctive image features from scale-invariant key points. Int J Comput Vision 60(2):91–110
Liu H (2009) Skew detection for complex document images using robust borderlines in both text and non-text regions. Pattern Recogn Lett 29:1893–1900
Arafat Y, Muhammad Saleem S, Afaq Hussain S (2009) Comparative analysis of invariant schemes for logo classification. In: Proceedings of the international conference on emerging technologies (ICET), pp 256–261
Butzke, M, Silva, AG, Hounsell, MS, Pillon, MA (2008) Automatic recognition of vehicle attribute-color classification and logo segmentation. Hifen, Urugaiana, pp 32–62
Mehmood Z, Anwar SM, Ali N, Habib HA, Rashid M (2016) A novel image retrieval based on a combination of local and global histograms of visual words. Math Probl Eng 2016:1–12. Article ID 8217250
Anagnostopoulos CNE, Anagnostopoulos IE, Psoroulas ID, Loumos V, Kayafas E (2008) ‘License plate recognition from still Images and video sequences’: a survey. IEEE Trans Intell Transp Syst 9:377–391. https://doi.org/10.1109/TITS.2008.922938
Zhang C, Chen X, Chen W (2006) A PCA-based vehicle classification framework. In: 22nd international conference on data engineering workshops
Bagarinao E, Kurita T, Higashikubo M, Inayoshi H (2009) Adapting SVM image classifiers to changes in imaging conditions using incremental SVM: an application to car detection. In: Proceedings of the 9th Asian conference on computer vision (ACCV), pp 363–372
Kim KK, Kim KI, Kim JB, Kim HJ (2000) Learning-based approach for license plate recognition. In: Proceeding of IEEE workshop on neural networks for signal processing, vol 2, pp 614–623
Mikolajczyk K, Schmid C (2005) A performance evaluation of local descriptors. IEEE Trans Pattern Anal Mach Intell 27(10):1615–1630
Juan L, Gwun O (2010) A comparison of SIFT, PCA-SIFT and SURF. Int J Image Process (IJIP) 3(4):143–152
Sivaraman S, Trivedi MM (2010) A general active-learning framework for on-road vehicle recognition and tracking. IEEE Trans Intell Transp Syst 11(2):267–276
Vapnik V, Golowich S, Smola A (1997) Support vector method for function approximation, regression estimation, and signal processing. In: Advances in neural information processing systems. MIT Press, Cambridge, pp 281–287
Shao Y, Lunetta RS (2010) Comparison of support vector machine, neural network, and CART algorithms for the land cover classification using limited training data points. The U.S. Environmental Protection Agency
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Toshpulotov, S., Saidov, S., Shanmugam, S.K., Shyamala Devi, J., Ramkumar, K. (2021). A Novel Idea for Designing a Speech Recognition System Using Computer Vision Object Detection Techniques. In: Singh, V., Asari, V.K., Kumar, S., Patel, R.B. (eds) Computational Methods and Data Engineering. Advances in Intelligent Systems and Computing, vol 1257. Springer, Singapore. https://doi.org/10.1007/978-981-15-7907-3_28
Download citation
DOI: https://doi.org/10.1007/978-981-15-7907-3_28
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-7906-6
Online ISBN: 978-981-15-7907-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)