A visual speech model based on fuzzy-neuro methods

Bothe, Hans H.

doi:10.1007/3-540-60298-4_251

A visual speech model based on fuzzy-neuro methods

Hans H. Bothe¹

Neural Networks
Conference paper
First Online: 01 January 2005

293 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 974))

Abstract

This paper describes a new approach of modeling visual speech movements, based on a codebook of characteristic key-pictures and a complex fuzzy neural network (FNN). Goal is the development of a computer animation program as a training aid for learning lip-reading. The network architecture makes possible a fusion of linguistic expert knowledge into the FNN. The current PC version allows a synchronization of the animation program with a special stand-alone speech synthesis computer via a Centronics parallel interface.

Download to read the full chapter text

Chapter PDF

References

P. Menzerath and A. de Lacerda: Koartikulation, Steuerung und Lautabgrenzung, Berlin, (1933).
Google Scholar
G. Alich: Zur Erkennbarkeit von Sprachgestalten beim Ablesen vom Munde (Dissertation), Bonn, (1961).
Google Scholar
H.H. Bothe and N. v. Bötticher: Key-frame selection for the analysis of visual speech with fuzzy-c-means algorithm. In: B. Bouchon-Meunier & R. Yager & L.A. Zadeh (Eds.), Advances in Intelligent Computing, Springer-Verlag, Berlin-Heidelberg (to appear, 1995).
Google Scholar
H.H. Bothe, G. Lindner and F. Rieger The Development of a Computer Animation Program for the Teaching of Lipreading, In: E. Ballabio, I. Placencia-Porrero and R. Puig de la Bellacasa (Eds.), Technology and Informatics 9, Rehabilitation Technology: Strategies for the European Union, Amsterdam, (1993), 45–49.
Google Scholar
D. Storey and M. Roberts: Reading the Speech of Digital Lips: Motives an Methods for Audio-visual Speech Synthesis, Visible Language 22 (1989), 112–127.
Google Scholar
M.M. Cohen and D.W. Massaro: Synthesis of Visible Speech, Behaviour Research Methods, Instruments & Computers, (1990), 260–263.
Google Scholar
M. Saintourens, M.H. Tramus, H. Huitric, and M. Nahas: Creation of a Synthetic Face Speaking in Real Time with a Synthetic Voice, Proceedings of the ESCA Workshop on Speech Synthesis, Autrance, (1990), 381–393.
Google Scholar
F. Lavagetto: Converting Speech into Lip Movements: A Multimedia Telephone for Hard of Hearing People. Trans. Rehabilitation Engineering, (to appear; 1995).
Google Scholar
Bothe, H.H.: Fuzzy input coding for an artificial neural network. (ACM/SAC'95), Nashville, (1995).
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electronics, Technical University of Berlin, Einsteinufer 17, D-10587, Berlin
Hans H. Bothe

Authors

Hans H. Bothe
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Carlo Braccini Leila DeFloriani Gianni Vernazza

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bothe, H.H. (1995). A visual speech model based on fuzzy-neuro methods. In: Braccini, C., DeFloriani, L., Vernazza, G. (eds) Image Analysis and Processing. ICIAP 1995. Lecture Notes in Computer Science, vol 974. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-60298-4_251

Download citation

DOI: https://doi.org/10.1007/3-540-60298-4_251
Published: 02 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-60298-9
Online ISBN: 978-3-540-44787-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Societies and partnerships

The International Association for Pattern Recognition (opens in a new tab)