Abstract
This paper describes a new approach of modeling visual speech movements, based on a codebook of characteristic key-pictures and a complex fuzzy neural network (FNN). Goal is the development of a computer animation program as a training aid for learning lip-reading. The network architecture makes possible a fusion of linguistic expert knowledge into the FNN. The current PC version allows a synchronization of the animation program with a special stand-alone speech synthesis computer via a Centronics parallel interface.
Chapter PDF
References
P. Menzerath and A. de Lacerda: Koartikulation, Steuerung und Lautabgrenzung, Berlin, (1933).
G. Alich: Zur Erkennbarkeit von Sprachgestalten beim Ablesen vom Munde (Dissertation), Bonn, (1961).
H.H. Bothe and N. v. Bötticher: Key-frame selection for the analysis of visual speech with fuzzy-c-means algorithm. In: B. Bouchon-Meunier & R. Yager & L.A. Zadeh (Eds.), Advances in Intelligent Computing, Springer-Verlag, Berlin-Heidelberg (to appear, 1995).
H.H. Bothe, G. Lindner and F. Rieger The Development of a Computer Animation Program for the Teaching of Lipreading, In: E. Ballabio, I. Placencia-Porrero and R. Puig de la Bellacasa (Eds.), Technology and Informatics 9, Rehabilitation Technology: Strategies for the European Union, Amsterdam, (1993), 45–49.
D. Storey and M. Roberts: Reading the Speech of Digital Lips: Motives an Methods for Audio-visual Speech Synthesis, Visible Language 22 (1989), 112–127.
M.M. Cohen and D.W. Massaro: Synthesis of Visible Speech, Behaviour Research Methods, Instruments & Computers, (1990), 260–263.
M. Saintourens, M.H. Tramus, H. Huitric, and M. Nahas: Creation of a Synthetic Face Speaking in Real Time with a Synthetic Voice, Proceedings of the ESCA Workshop on Speech Synthesis, Autrance, (1990), 381–393.
F. Lavagetto: Converting Speech into Lip Movements: A Multimedia Telephone for Hard of Hearing People. Trans. Rehabilitation Engineering, (to appear; 1995).
Bothe, H.H.: Fuzzy input coding for an artificial neural network. (ACM/SAC'95), Nashville, (1995).
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1995 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bothe, H.H. (1995). A visual speech model based on fuzzy-neuro methods. In: Braccini, C., DeFloriani, L., Vernazza, G. (eds) Image Analysis and Processing. ICIAP 1995. Lecture Notes in Computer Science, vol 974. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-60298-4_251
Download citation
DOI: https://doi.org/10.1007/3-540-60298-4_251
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-60298-9
Online ISBN: 978-3-540-44787-0
eBook Packages: Springer Book Archive