Silent Speech Recognition

Kandagal, Amaresh P.; Udayashankara, V.; Anusuya, M. A.

doi:10.1007/978-981-10-9059-2_13

Part of the book series: Communications in Computer and Information Science (CCIS, volume 801)

Included in the following conference series:

International Conference on Cognitive Computing and Information Processing

Abstract

Speech is essential for exchanging information, and speech recognition is one of the main interfaces for man-machine interaction. However, the performance of speech recognition systems is limited in noisy acoustic conditions. Silent speech, i.e., the visual dynamic features of speech, carries additional information that can be used for human-computer interaction. This paper presents lip localization and segmentation using the Otsu algorithm. The height and width of the lips during movement are captured as visual cues for silent speech recognition. We develop stochastic visual word models using an in-house database of 20 subjects. The performance of these models is measured by word error rate. The system achieves an accuracy of 84.6% for speaker-dependent female subjects and 65.8% overall.
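The pipeline outlined in the abstract (Otsu segmentation, lip height and width as features, word error rate as the metric) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `otsu_threshold`, `lip_height_width`, and `word_error_rate` are hypothetical, the input is assumed to be an 8-bit grayscale image in which the lip region is brighter than its surroundings, and the colour transform and stochastic word models used in the paper are not reproduced here.

```python
import numpy as np


def otsu_threshold(gray):
    """Return the Otsu threshold of an 8-bit grayscale image (uint8 array)."""
    hist = np.bincount(gray.ravel(), minlength=256).astype(float)
    prob = hist / hist.sum()
    omega = np.cumsum(prob)                 # probability of the "dark" class up to t
    mu = np.cumsum(prob * np.arange(256))   # cumulative mean up to t
    mu_total = mu[-1]
    denom = omega * (1.0 - omega)
    with np.errstate(divide="ignore", invalid="ignore"):
        sigma_b = (mu_total * omega - mu) ** 2 / denom  # between-class variance
    sigma_b[denom == 0] = 0.0               # thresholds that leave one class empty
    return int(np.argmax(sigma_b))


def lip_height_width(mask):
    """Height and width (in pixels) of the bounding box of a binary mask."""
    rows = np.flatnonzero(mask.any(axis=1))
    cols = np.flatnonzero(mask.any(axis=0))
    if rows.size == 0:
        return 0, 0
    return int(rows[-1] - rows[0] + 1), int(cols[-1] - cols[0] + 1)


def word_error_rate(reference, hypothesis):
    """Word-level edit distance divided by the number of reference words."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        cur = [i]
        for j, h in enumerate(hyp, 1):
            cur.append(min(prev[j] + 1,               # deletion
                           cur[j - 1] + 1,            # insertion
                           prev[j - 1] + (r != h)))   # substitution or match
        prev = cur
    return prev[-1] / len(ref)


# Synthetic frame (assumption): a bright 4 x 20 "lip" patch on a dark background.
frame = np.full((20, 30), 50, dtype=np.uint8)
frame[8:12, 5:25] = 200
t = otsu_threshold(frame)
height, width = lip_height_width(frame > t)
```

Applied frame by frame, the `(height, width)` pairs would form the kind of feature sequence a word model could be trained on, and `word_error_rate` compares a recognized word string against its reference transcript.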

Author information

Authors and Affiliations

Sri Siddhartha Institute of Technology, Tumkur, India
Amaresh P. Kandagal
Sri Jayachamarajendra College of Engineering, Mysuru, India
V. Udayashankara & M. A. Anusuya

Corresponding author

Correspondence to Amaresh P. Kandagal.

Editor information

Editors and Affiliations

Sri Jayachamarajendra College of Engineering, Mysuru, Karnataka, India
T. N. Nagabhushan
Sri Jayachamarajendra College of Engineering, Mysuru, Karnataka, India
V. N. Manjunath Aradhya
JSS Academy of Technical Education, Bengaluru, Karnataka, India
Prabhudev Jagadeesh
JSS Academy of Technical Education, Noida, Uttar Pradesh, India
Seema Shukla
JSS Academy of Technical Education, Bengaluru, Karnataka, India
Chayadevi M.L.

About this paper

Cite this paper

Kandagal, A.P., Udayashankara, V., Anusuya, M.A. (2018). Silent Speech Recognition. In: Nagabhushan, T., Aradhya, V.N.M., Jagadeesh, P., Shukla, S., M.L., C. (eds) Cognitive Computing and Information Processing. CCIP 2017. Communications in Computer and Information Science, vol 801. Springer, Singapore. https://doi.org/10.1007/978-981-10-9059-2_13

DOI: https://doi.org/10.1007/978-981-10-9059-2_13
Published: 07 April 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-9058-5
Online ISBN: 978-981-10-9059-2
eBook Packages: Computer Science, Computer Science (R0)
