Skip to main content
Log in

Lip contour extraction for language learning in VEC3D

  • Original Paper
  • Published:
Machine Vision and Applications Aims and scope Submit manuscript

Abstract

Lip image analysis is a critical problem in many vision-based applications such as lip reading, speaker authentication, video conferencing and language learning. We design three dynamic probability maps based on the color, shape, and edge features, respectively. These probability maps are then utilized to extract both inner and outer lip contours using a grid-based gradient-ascent approach. The experiments show that the proposed algorithm can extract inner/outer lip contours efficiently and reliably. The proposed lip contour extraction algorithm is applied successfully in a language-learning application called VEC3D.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Aleksic, P., Williams, J., Wu, Z., Katsaggelos, A.: Audio-visual speech recognition using MPEG-4 compliant visual features. EURASIP J. Appl. Signal Process. 1213–1227 (2002)

  2. Balci, K.: Xface: MPEG-4 based open source toolkit for 3d facial animation. In: Working Conference on Advanced Visual Interfaces, Italy, May 2004

  3. Cootes, T., Edwards, G., Taylor, C.: Active appearance models. IEEE Trans. Pattern Anal. Mach. Intell. 23(6), 681–685 (2001)

    Article  Google Scholar 

  4. Dansereau, R., Li, C., Goubran, R.: Lip feature extraction using motion, color, and edge information. In: IEEE Internatioal Workshop on Haptic, Audio and Visual Environments and Their Applications, pp. 1–6, Sept. 2003

  5. Delmas, P., Coulon, P., Fristot, V.: Automatic snakes for robust lip boundaries extraction. ICASSP, pp. 3069–3072 (1999)

  6. Eveno, N., Caplier, A., Coulon, P.: A new color transformation for lips segmentation. Multimed. Signal Process. France, Oct. 2001

  7. Eveno, N., Caplier, A., Coulon, P.: Key point based segmentation of lips. IEEE Int. Conf. Multimed. Expo 2, 125–128 (2002)

    Google Scholar 

  8. Kass, M., Witkin, A., Terzopoulos, D.: Snakes: active contour models. Int. J. Comput. Vis. 1(4), 321–331 (1988)

    Article  Google Scholar 

  9. Lavagetto, F., Pockaj, R.: The facial animation engine: toward a high-level interface for the design of MPEG-4 compliant animated faces. IEEE Trans. Circuits Syst. Video Technol. (1999)

  10. Leung, S., Wang, S., Lau, W.: Lip Image segmentation using fuzzy clustering incorporating an elliptic shape function. IEEE Trans. Image Process. 13(1), 51–62 (2004)

    Article  Google Scholar 

  11. Luettin, J., Tracker, N., Beet, S.: Active shape models for visual speech feature extraction. Electronic System Group Rep. Univ. of Sheffield, UK, 95/44 (1995)

  12. Patterson, E., Gurbuz, S., Tufekci, Z., Gowdy, J.: Moving-talker, speaker-independent feature study and baseline results using the CUAVE multimodal speech corpus. EURASIP J. Appl. Signal Process. (2002)

  13. Shih, Y., Yang, M.T.: A collaborative virtual environment for situated language learning using VEC3D. J. Educ. Technol. Soc. 11(1), 56–68 (2008)

    Google Scholar 

  14. Terzopoulos, D., Waters, K.: Analysis and synthesis of facial image sequences using physical and anatomical models. IEEE Trans. Pattern Anal. Mach. Intell. 25, 569–579 (1993)

    Article  Google Scholar 

  15. Wakasugi, T., Nishiura, M., Fukui, K.: Robust lip contour extraction using separability of multi-dimensional distributions. In: IEEE International Conference on Automatic Face and Gesture Recognition, pp. 415–420, May 2004

  16. Wang, G., Yang, M.T., Chiang, C., Tai, W.: A talking face driven by voice using hidden Markov model. J. Inf. Sci. Eng. 22, 1059–1075 (2006)

    Google Scholar 

  17. Zhang, X., Mersereau, R., Clements, M., Broun, C.: Visual speech feature extraction for improved speech recognition. ICASSP 1993–1996 (2002)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mau-Tsuen Yang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yang, MT., You, ZW. & Shih, YC. Lip contour extraction for language learning in VEC3D. Machine Vision and Applications 21, 33 (2009). https://doi.org/10.1007/s00138-008-0139-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s00138-008-0139-x

Keywords

Navigation