Abstract
The use of n-grams is quite prevalent in the field of pattern recognition. In this paper, we use this concept to build new feature vectors from extracted parameters to be used for visual speech classification. We extract the lip contour using edge detection and connectivity analysis. The boundary is defined using six cubic curves. The visual parameters are used to build n-gram feature vectors. Two sets of classification experiments are performed with the n-gram feature vectors: using the hidden Markov model and using multiple data mining algorithms in WEKA, a tool widely used by researchers. Preliminary results show encouraging results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Alizadeh, S., Boostani, R., Asadpour, V.: Lip Feature Extraction and Reduction for HMM-Based Visual Speech Recognition Systems. In: Proc. 9th International Conference on Signal Processing (ICSP 2008), pp. 561–564 (2008)
Eveno, N., Caplier, A., Coulon, P.Y.: Accurate and Quasi-Automatic Lip Tracking. IEEE Transaction on Circuits and Video Technology 14(5), 706–715 (2004)
Goldschen, A.J.: Continuous automatic speech recognition by lipreading. PhD thesis, George Washington University, Washington, DC, USA (1993)
HTK Hidden Markov Model Toolkit home page, http://htk.eng.cam.ac.uk/
Matthews, I., Cootes, T.F., Bangham, J.A., Cox, S., Harvey, R.: Extraction of Visual Features for Lipreading. IEEE Trans. Pattern Analysis and Machine Intelligence 24(2), 198–213 (2002)
Silveira, L.G., Facon, J., Borges, D.L.: Visual Speech Recognition: a solution from feature extraction to words classification. In: Proc. 16th Brazilian Symposium on Computer Graphics and Image Processing (SIBGRAPI 2003), Sao Carlos, Brazil, pp. 399–405. IEEE Computer Society, Los Alamitos (2003)
Sumby, W.H., Pollack, I.: Visual Contribution to Speech Intelligibility in Noise. Journal of Acoustical Society of America 26(2), 212–215 (1954)
University of Waikato. Open Source Machine Learning Software WEKA, http://www.cs.waikato.ac.nz/ml/weka/
Yau, W.C., Kumar, D.K., Arjunan, S.P.: Voiceless speech recognition using dynamic visual speech features. In: Proceedings of the HCSNet workshop on Use of vision in human-computer interaction (VisHCI 2006), Canberra, Australia, pp. 93–101. Australian Computer Society, Inc. (2006)
Yu, K., Jiang, X., Bunke, H.: Lipreading: A classifier combination approach. Pattern Recognition Letters 18(11-13), 1421–1426 (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Singh, P., Laxmi, V., Gupta, D., Gaur, M.S. (2010). Lipreading Using n–Gram Feature Vector. In: Herrero, Á., Corchado, E., Redondo, C., Alonso, Á. (eds) Computational Intelligence in Security for Information Systems 2010. Advances in Intelligent and Soft Computing, vol 85. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16626-6_9
Download citation
DOI: https://doi.org/10.1007/978-3-642-16626-6_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16625-9
Online ISBN: 978-3-642-16626-6
eBook Packages: EngineeringEngineering (R0)