Skip to main content

A Novel Idea for Designing a Speech Recognition System Using Computer Vision Object Detection Techniques

  • Conference paper
  • First Online:
Computational Methods and Data Engineering

Abstract

It is very challenging for establishing the communication with deaf people around the world. They need to get the assistance from others, and others need not be true always. To overcome this situation, a device we propose to develop an application which will provide the easy way of communicating using sign language without the help of others. The concept of this device development is a novel idea. It is intended to make the device as standalone using the recent development in embedded system technology. The proposed system is aimed to develop a pocket assistant for the deaf and hearing-impaired people in communicating with other people. All the functionality of the application is built around the organization of communication to establish a conversation between the user and his interlocutor. It is planned to develop a device to recognize the interlocutor’s speech in real time, query the related sign representation stored in database, and display the text or set of pictures in sign language on the screen. The device will be based on Raspberry Pi hardware. The technology involves capturing the audio using …. The audio will be processed by removing the noise and fed into the audio-to-text convertor to output the text message. Text information is detected using histogram of oriented gradients (HOG) and local binary pattern (LBP). The required information will be selected and queried to extract the desired sign representation from the database and provide the desired output to the user on the screen. Technological transfer of the proposed product will enable mass production that can be utilized in national and global market for the benefit of the elderly and deaf people. It has three modules, login, recording the information, and translating the information and storing it in the database.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Sirch LS, Palese A (2017) Communication difficulties experienced by deaf male patients during their in-hospital stay: findings from a qualitative descriptive study. Scand J Caring Sci 31(2):368–377

    Article  Google Scholar 

  2. Sharma MV, Kumar NV, Masaguppi SC, Mn S, Ambika DR (2013) Virtual talk for deaf, mute, blind and normal humans. In: Proceedings of the 2013 1st Texas instruments India educators’ conference, TIIEC 2013, pp 316–320

    Google Scholar 

  3. Soltani F, Eskanderi F, Golestan S (2012) Developing a gesture-based game for deaf/mute people using microsoft kinect. In: Proceedings of sixth international conference on complex, intelligent, and software intensive systems, pp 491–495

    Google Scholar 

  4. Lowe DG (2004) Distinctive image features from scale-invariant key points. Int J Comput Vision 60(2):91–110

    Article  Google Scholar 

  5. Liu H (2009) Skew detection for complex document images using robust borderlines in both text and non-text regions. Pattern Recogn Lett 29:1893–1900

    Google Scholar 

  6. Arafat Y, Muhammad Saleem S, Afaq Hussain S (2009) Comparative analysis of invariant schemes for logo classification. In: Proceedings of the international conference on emerging technologies (ICET), pp 256–261

    Google Scholar 

  7. Butzke, M, Silva, AG, Hounsell, MS, Pillon, MA (2008) Automatic recognition of vehicle attribute-color classification and logo segmentation. Hifen, Urugaiana, pp 32–62

    Google Scholar 

  8. Mehmood Z, Anwar SM, Ali N, Habib HA, Rashid M (2016) A novel image retrieval based on a combination of local and global histograms of visual words. Math Probl Eng 2016:1–12. Article ID 8217250

    Google Scholar 

  9. Anagnostopoulos CNE, Anagnostopoulos IE, Psoroulas ID, Loumos V, Kayafas E (2008) ‘License plate recognition from still Images and video sequences’: a survey. IEEE Trans Intell Transp Syst 9:377–391. https://doi.org/10.1109/TITS.2008.922938

  10. Zhang C, Chen X, Chen W (2006) A PCA-based vehicle classification framework. In: 22nd international conference on data engineering workshops

    Google Scholar 

  11. Bagarinao E, Kurita T, Higashikubo M, Inayoshi H (2009) Adapting SVM image classifiers to changes in imaging conditions using incremental SVM: an application to car detection. In: Proceedings of the 9th Asian conference on computer vision (ACCV), pp 363–372

    Google Scholar 

  12. Kim KK, Kim KI, Kim JB, Kim HJ (2000) Learning-based approach for license plate recognition. In: Proceeding of IEEE workshop on neural networks for signal processing, vol 2, pp 614–623

    Google Scholar 

  13. Mikolajczyk K, Schmid C (2005) A performance evaluation of local descriptors. IEEE Trans Pattern Anal Mach Intell 27(10):1615–1630

    Google Scholar 

  14. Juan L, Gwun O (2010) A comparison of SIFT, PCA-SIFT and SURF. Int J Image Process (IJIP) 3(4):143–152

    Google Scholar 

  15. Sivaraman S, Trivedi MM (2010) A general active-learning framework for on-road vehicle recognition and tracking. IEEE Trans Intell Transp Syst 11(2):267–276

    Google Scholar 

  16. Vapnik V, Golowich S, Smola A (1997) Support vector method for function approximation, regression estimation, and signal processing. In: Advances in neural information processing systems. MIT Press, Cambridge, pp 281–287

    Google Scholar 

  17. Shao Y, Lunetta RS (2010) Comparison of support vector machine, neural network, and CART algorithms for the land cover classification using limited training data points. The U.S. Environmental Protection Agency

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to K. Ramkumar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Toshpulotov, S., Saidov, S., Shanmugam, S.K., Shyamala Devi, J., Ramkumar, K. (2021). A Novel Idea for Designing a Speech Recognition System Using Computer Vision Object Detection Techniques. In: Singh, V., Asari, V.K., Kumar, S., Patel, R.B. (eds) Computational Methods and Data Engineering. Advances in Intelligent Systems and Computing, vol 1257. Springer, Singapore. https://doi.org/10.1007/978-981-15-7907-3_28

Download citation

Publish with us

Policies and ethics