Advertisement

AUDIMUS.MEDIA: A Broadcast News Speech Recognition System for the European Portuguese Language

  • Hugo Meinedo
  • Diamantino Caseiro
  • João Neto
  • Isabel Trancoso
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2721)

Abstract

Many applications such as media monitoring are experiencing a large expansion as a consequence of the different emerging media sources and can benefit dramatically by using automatic transcription of audio data. In this paper, we describe the development of a speech recognition engine, AUDIMUS.MEDIA used in the Broadcast News domain. Additionally we describe recent improvements that permitted a relative recognition error decrease of more than 20% and a 4x speed-up.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Amaral, R., Langlois, T., Meinedo, H., Neto, J., Souto, N., Trancoso, I.: The development of a portuguese version of a media watch system. In: Proceedings EUROSPEECH 2001, Aalborg, Denmark (2001)Google Scholar
  2. 2.
    Meinedo, H., Neto, J.: Combination of acoustic models in continuous speech recognition. In: Proceedings ICSLP 2000, Beijing, China (2000)Google Scholar
  3. 3.
    Mohri, M., Pereira, F., Riley, M.: Weighted finite-state transducers in speech recognition. In: ASR 2000 Workshop. (2000)Google Scholar
  4. 4.
    Caseiro, D., Trancoso, I.: Using dynamic wfst composition for recognizing broadcast news. In: Proc. ICSLP’ 2002, Denver, Colorado, USA (2002)Google Scholar
  5. 5.
    Caseiro, D., Trancoso, I.: On integrating the lexicon with the language model. In: Proc. Eurospeech’ 2001, Aalborg, Denmark (2001)Google Scholar
  6. 6.
    Caseiro, D., Trancoso, I.: Transducer composition for “on-the-fly” lexicon and language model integration. In: Proc. ICASSP’ 2003, Hong Kong, China (2003)Google Scholar
  7. 7.
    Caseiro, D., Silva, F.M., Trancoso, I., Viana, C.: Automatic alignment of map task dialogs using wfsts. In: Proc. PMLA, ISCA Tutorial and Research Workshop on Pronunciation Modelling and Lexicon Adaptation, Aspen, Colorado, USA (2002)Google Scholar
  8. 8.
    Renals, S., Hochberg, M.: Efficient search using posterior phone probability estimates. In: Proc. ICASSP’ 95, Detroit, MI (1995) 596–599Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2003

Authors and Affiliations

  • Hugo Meinedo
    • 1
  • Diamantino Caseiro
    • 1
  • João Neto
    • 1
  • Isabel Trancoso
    • 1
  1. 1.L2F — Spoken Language Systems LabINESC-ID / ISTLisboaPortugal

Personalised recommendations