Advertisement

Phonetic Sequence to Graphemes Conversion Based on DTW and One-Stage Algorithms

  • Rafael Teruszkin
  • Fernando Gil Vianna ResendeJr.
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3960)

Abstract

This work proposes an algorithm for converting phonetic sequences into graphemes using DTW on the recognition of isolated words or closed sentences, and using One-Stage on a continuous speech recognition task. Most speech recognition systems resolve the task of recognition on a single stage without having an intermediate phonetic sequence result. The proposed solution is hybrid in the sense that it uses HMMs and Viterbi Decoding for recognizing a phonetic sequence (actually, triphones) and then DTW or One-Stage to generate the corresponding graphemes. Experimental results showed an average accuracy rate of 100% on the recognition of closed sentences, and average word recognition rate of 84% for the continuous speech recognition task.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    HTK - Hidden Markov Model Toolkit, http://htk.eng.cam.ac.uk
  2. 2.
    Rabiner, L.R., Juang, B.: Fundamentals on Speech Recognition. New Jersey, Prentice Hall (1996)Google Scholar
  3. 3.
    Ney, H.: The Use of a One-Stage Dynamic Programming Algorithm for Connected Word Recognition. In: Proceedings of ICASSP (1984)Google Scholar
  4. 4.
    Alcaim, A., Solewicz e, J.A., Moraes, J.A.: Freqüência de ocorrência dos fones e listas de frases foneticamente balanceadas no português falado no Rio de Janeiro. Revista da Sociedade Brasileira de Telecomunicações, Rio de Janeiro, 7(1), 23–41 (1992)Google Scholar
  5. 5.
    Barbosa, F.L.F., et al.: Grapheme-phone transcription algorithm for a Brazilian Portuguese TTS. In: Mamede, N.J., Baptista, J., Trancoso, I., Nunes, M.d.G.V. (eds.) PROPOR 2003. LNCS, vol. 2721, pp. 23–30. Springer, Heidelberg (2003)CrossRefGoogle Scholar
  6. 6.
    Huang, X., Acero, A., Hon, H.: Spoken Language Processing: A Guide to Theory, Algorithm and System Development, ch. 11. Prentice Hall, Englewood Cliffs (2001)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Rafael Teruszkin
    • 1
  • Fernando Gil Vianna ResendeJr.
    • 1
    • 2
  1. 1.Programa de Engenharia ElétricaCOPPE, UFRJBrazil
  2. 2.Departamento de Engenharia Eletrônica e de ComputaçãoEscola Politécnica, UFRJBrazil

Personalised recommendations