Lip contour extraction for language learning in VEC3D

Yang, Mau-Tsuen; You, Zhen-Wei; Shih, Ya-Chun

doi:10.1007/s00138-008-0139-x

Lip contour extraction for language learning in VEC3D

Original Paper
Published: 30 April 2008

Volume 21, article number 33, (2009)
Cite this article

Machine Vision and Applications Aims and scope Submit manuscript

Mau-Tsuen Yang¹,
Zhen-Wei You¹ &
Ya-Chun Shih²

113 Accesses
3 Citations
Explore all metrics

Abstract

Lip image analysis is a critical problem in many vision-based applications such as lip reading, speaker authentication, video conferencing and language learning. We design three dynamic probability maps based on the color, shape, and edge features, respectively. These probability maps are then utilized to extract both inner and outer lip contours using a grid-based gradient-ascent approach. The experiments show that the proposed algorithm can extract inner/outer lip contours efficiently and reliably. The proposed lip contour extraction algorithm is applied successfully in a language-learning application called VEC3D.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Aleksic, P., Williams, J., Wu, Z., Katsaggelos, A.: Audio-visual speech recognition using MPEG-4 compliant visual features. EURASIP J. Appl. Signal Process. 1213–1227 (2002)
Balci, K.: Xface: MPEG-4 based open source toolkit for 3d facial animation. In: Working Conference on Advanced Visual Interfaces, Italy, May 2004
Cootes, T., Edwards, G., Taylor, C.: Active appearance models. IEEE Trans. Pattern Anal. Mach. Intell. 23(6), 681–685 (2001)
Article Google Scholar
Dansereau, R., Li, C., Goubran, R.: Lip feature extraction using motion, color, and edge information. In: IEEE Internatioal Workshop on Haptic, Audio and Visual Environments and Their Applications, pp. 1–6, Sept. 2003
Delmas, P., Coulon, P., Fristot, V.: Automatic snakes for robust lip boundaries extraction. ICASSP, pp. 3069–3072 (1999)
Eveno, N., Caplier, A., Coulon, P.: A new color transformation for lips segmentation. Multimed. Signal Process. France, Oct. 2001
Eveno, N., Caplier, A., Coulon, P.: Key point based segmentation of lips. IEEE Int. Conf. Multimed. Expo 2, 125–128 (2002)
Google Scholar
Kass, M., Witkin, A., Terzopoulos, D.: Snakes: active contour models. Int. J. Comput. Vis. 1(4), 321–331 (1988)
Article Google Scholar
Lavagetto, F., Pockaj, R.: The facial animation engine: toward a high-level interface for the design of MPEG-4 compliant animated faces. IEEE Trans. Circuits Syst. Video Technol. (1999)
Leung, S., Wang, S., Lau, W.: Lip Image segmentation using fuzzy clustering incorporating an elliptic shape function. IEEE Trans. Image Process. 13(1), 51–62 (2004)
Article Google Scholar
Luettin, J., Tracker, N., Beet, S.: Active shape models for visual speech feature extraction. Electronic System Group Rep. Univ. of Sheffield, UK, 95/44 (1995)
Patterson, E., Gurbuz, S., Tufekci, Z., Gowdy, J.: Moving-talker, speaker-independent feature study and baseline results using the CUAVE multimodal speech corpus. EURASIP J. Appl. Signal Process. (2002)
Shih, Y., Yang, M.T.: A collaborative virtual environment for situated language learning using VEC3D. J. Educ. Technol. Soc. 11(1), 56–68 (2008)
Google Scholar
Terzopoulos, D., Waters, K.: Analysis and synthesis of facial image sequences using physical and anatomical models. IEEE Trans. Pattern Anal. Mach. Intell. 25, 569–579 (1993)
Article Google Scholar
Wakasugi, T., Nishiura, M., Fukui, K.: Robust lip contour extraction using separability of multi-dimensional distributions. In: IEEE International Conference on Automatic Face and Gesture Recognition, pp. 415–420, May 2004
Wang, G., Yang, M.T., Chiang, C., Tai, W.: A talking face driven by voice using hidden Markov model. J. Inf. Sci. Eng. 22, 1059–1075 (2006)
Google Scholar
Zhang, X., Mersereau, R., Clements, M., Broun, C.: Visual speech feature extraction for improved speech recognition. ICASSP 1993–1996 (2002)

Download references

Author information

Authors and Affiliations

Department of Computer Science and Information Engineering, National Dong Hwa University, Hualien, Taiwan
Mau-Tsuen Yang & Zhen-Wei You
Department of English, National Hualien University of Education, Hualien, Taiwan
Ya-Chun Shih

Authors

Mau-Tsuen Yang
View author publications
You can also search for this author in PubMed Google Scholar
Zhen-Wei You
View author publications
You can also search for this author in PubMed Google Scholar
Ya-Chun Shih
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mau-Tsuen Yang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yang, MT., You, ZW. & Shih, YC. Lip contour extraction for language learning in VEC3D. Machine Vision and Applications 21, 33 (2009). https://doi.org/10.1007/s00138-008-0139-x

Download citation

Received: 29 June 2007
Accepted: 01 April 2008
Published: 30 April 2008
DOI: https://doi.org/10.1007/s00138-008-0139-x

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Lip contour extraction for language learning in VEC3D

Abstract

Access this article

Similar content being viewed by others

Emotional Speech Recognition Based on Lip-Reading

Lip-Reading: Toward Phoneme Recognition Through Lip Kinematics

Lip segmentation using automatic selected initial contours based on localized active contour model

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Lip contour extraction for language learning in VEC3D

Abstract

Access this article

Similar content being viewed by others

Emotional Speech Recognition Based on Lip-Reading

Lip-Reading: Toward Phoneme Recognition Through Lip Kinematics

Lip segmentation using automatic selected initial contours based on localized active contour model

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation