Journal of the Brazilian Computer Society, Volume 17, Issue 1, pp 53–68

Free tools and resources for Brazilian Portuguese speech recognition

Authors

  • Nelson Neto
    • Federal University of Pará
  • Carlos Patrick
    • Federal University of Pará
  • Aldebaro Klautau
    • Federal University of Pará
  • Isabel Trancoso
    • IST/INESC-ID
Open Access
Original Paper

DOI: 10.1007/s13173-010-0023-1

Cite this article as:
Neto, N., Patrick, C., Klautau, A. et al. J Braz Comput Soc (2011) 17: 53. doi:10.1007/s13173-010-0023-1

Abstract

An automatic speech recognition system has language-dependent modules and, while many public resources exist for some languages (e.g., English and Japanese), the resources for Brazilian Portuguese (BP) are still limited. This work describes the development of free tools and resources for BP speech recognition, consisting of text and audio corpora, a phonetic dictionary, a grapheme-to-phone converter, and language and acoustic models. All of them are publicly available and, together with a proposed application programming interface, have been used to develop several new applications, including a speech module for the OpenOffice suite. Performance tests are presented that compare the developed BP system with a commercial software product. The paper also describes an application that combines speech synthesis and speech recognition with a natural language processing module dedicated to statistical machine translation. This application allows the translation of spoken conversations from BP to English and vice versa. The resources make it easier for other academic groups and industry to adopt BP speech technologies.

Keywords

Speech recognition; Brazilian Portuguese; Grapheme-to-phone conversion; Application programming interface; Speech-based applications

Copyright information

© The Brazilian Computer Society 2010