Lipreading Using n–Gram Feature Vector

Singh, Preety; Laxmi, Vijay; Gupta, Deepika; Gaur, M. S.

doi:10.1007/978-3-642-16626-6_9

Preety Singh⁶,
Vijay Laxmi⁶,
Deepika Gupta⁶ &
…
M. S. Gaur⁶

Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 85))

592 Accesses
2 Citations

Abstract

The use of n-grams is quite prevalent in the field of pattern recognition. In this paper, we use this concept to build new feature vectors from extracted parameters to be used for visual speech classification. We extract the lip contour using edge detection and connectivity analysis. The boundary is defined using six cubic curves. The visual parameters are used to build n-gram feature vectors. Two sets of classification experiments are performed with the n-gram feature vectors: using the hidden Markov model and using multiple data mining algorithms in WEKA, a tool widely used by researchers. Preliminary results show encouraging results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Alizadeh, S., Boostani, R., Asadpour, V.: Lip Feature Extraction and Reduction for HMM-Based Visual Speech Recognition Systems. In: Proc. 9th International Conference on Signal Processing (ICSP 2008), pp. 561–564 (2008)
Google Scholar
Eveno, N., Caplier, A., Coulon, P.Y.: Accurate and Quasi-Automatic Lip Tracking. IEEE Transaction on Circuits and Video Technology 14(5), 706–715 (2004)
Article Google Scholar
Goldschen, A.J.: Continuous automatic speech recognition by lipreading. PhD thesis, George Washington University, Washington, DC, USA (1993)
Google Scholar
HTK Hidden Markov Model Toolkit home page, http://htk.eng.cam.ac.uk/
Matthews, I., Cootes, T.F., Bangham, J.A., Cox, S., Harvey, R.: Extraction of Visual Features for Lipreading. IEEE Trans. Pattern Analysis and Machine Intelligence 24(2), 198–213 (2002)
Article Google Scholar
Silveira, L.G., Facon, J., Borges, D.L.: Visual Speech Recognition: a solution from feature extraction to words classification. In: Proc. 16th Brazilian Symposium on Computer Graphics and Image Processing (SIBGRAPI 2003), Sao Carlos, Brazil, pp. 399–405. IEEE Computer Society, Los Alamitos (2003)
Chapter Google Scholar
Sumby, W.H., Pollack, I.: Visual Contribution to Speech Intelligibility in Noise. Journal of Acoustical Society of America 26(2), 212–215 (1954)
Article Google Scholar
University of Waikato. Open Source Machine Learning Software WEKA, http://www.cs.waikato.ac.nz/ml/weka/
Yau, W.C., Kumar, D.K., Arjunan, S.P.: Voiceless speech recognition using dynamic visual speech features. In: Proceedings of the HCSNet workshop on Use of vision in human-computer interaction (VisHCI 2006), Canberra, Australia, pp. 93–101. Australian Computer Society, Inc. (2006)
Google Scholar
Yu, K., Jiang, X., Bunke, H.: Lipreading: A classifier combination approach. Pattern Recognition Letters 18(11-13), 1421–1426 (1997)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Engineering, Malaviya National Institute of Technology, Jaipur, India
Preety Singh, Vijay Laxmi, Deepika Gupta & M. S. Gaur

Authors

Preety Singh
View author publications
You can also search for this author in PubMed Google Scholar
Vijay Laxmi
View author publications
You can also search for this author in PubMed Google Scholar
Deepika Gupta
View author publications
You can also search for this author in PubMed Google Scholar
M. S. Gaur
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Civil Engineering Department, Polytechnic School, University of Burgos, Francisco de Vittoria s/n, 09006, Burgos, Spain
Álvaro Herrero
Departamento de Informáca y Automática, Facultad de Biología, University of Salamanca, Plaza de la Merced s/n, 37008, Salamanca, Spain
Emilio Corchado
Castilla y León, Fundación Centro de Supercomputación, 24071, Leon, Spain
Carlos Redondo
Castilla y León, Fundación Centro de Supercomputación, 24071, León, Spain
Ángel Alonso

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Singh, P., Laxmi, V., Gupta, D., Gaur, M.S. (2010). Lipreading Using n–Gram Feature Vector. In: Herrero, Á., Corchado, E., Redondo, C., Alonso, Á. (eds) Computational Intelligence in Security for Information Systems 2010. Advances in Intelligent and Soft Computing, vol 85. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16626-6_9

Download citation

DOI: https://doi.org/10.1007/978-3-642-16626-6_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16625-9
Online ISBN: 978-3-642-16626-6
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics