A visual speech model based on fuzzy-neuro methods

  • Hans H. Bothe
Neural Networks
Part of the Lecture Notes in Computer Science book series (LNCS, volume 974)


This paper describes a new approach to modeling visual speech movements, based on a codebook of characteristic key-pictures and a complex fuzzy neural network (FNN). The goal is to develop a computer animation program that serves as a training aid for learning lip-reading. The network architecture allows linguistic expert knowledge to be incorporated into the FNN. The current PC version can synchronize the animation program with a dedicated stand-alone speech synthesis computer via a Centronics parallel interface.
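
The abstract does not specify how the codebook and the network interact, so the following is only a minimal illustrative sketch of one common way to combine a key-picture codebook with radial-basis-function style fuzzy memberships. Everything in it is an assumption made for illustration and is not taken from the paper: the function names `rbf_memberships` and `blend_key_pictures`, the Gaussian membership shape, the `width` parameter, and the idea that each key-picture is represented by a short vector of mouth-shape parameters.

```python
import math


def rbf_memberships(x, centers, width):
    """Return normalized Gaussian memberships of x in each codebook entry.

    x       : input feature vector (hypothetical, e.g. a phonetic context code)
    centers : one prototype vector per key-picture (hypothetical)
    width   : shared Gaussian width (assumed, not from the paper)
    """
    raw = []
    for c in centers:
        sq_dist = sum((xi - ci) ** 2 for xi, ci in zip(x, c))
        raw.append(math.exp(-sq_dist / (2.0 * width ** 2)))
    total = sum(raw)
    # Normalize so the memberships sum to 1.
    return [r / total for r in raw]


def blend_key_pictures(memberships, key_pictures):
    """Return the membership-weighted average of key-picture parameter vectors."""
    dim = len(key_pictures[0])
    return [
        sum(m * kp[d] for m, kp in zip(memberships, key_pictures))
        for d in range(dim)
    ]


# Two hypothetical key-pictures, each described by (lip opening, lip width).
centers = [[0.0], [1.0]]
key_pictures = [[0.0, 0.0], [1.0, 2.0]]

weights = rbf_memberships([0.5], centers, width=1.0)
frame = blend_key_pictures(weights, key_pictures)
```

An input halfway between the two prototypes receives equal membership in both, so the resulting frame lies midway between the two key-pictures. A trained network would learn the prototypes and widths from data and could encode expert rules in its structure; this sketch fixes them by hand.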


Radial Basis Function Network; Fuzzy Neural Network; Speech Synthesis; Facial Animation; Articulatory Movement



Copyright information

© Springer-Verlag Berlin Heidelberg 1995

Authors and Affiliations

  • Hans H. Bothe, Department of Electronics, Technical University of Berlin, Berlin
