Advertisement

Cost-Efficient Cross-Lingual Adaptation of a Speech Recognition System

  • Zoraida Callejas
  • Jan Nouza
  • Petr Cerva
  • Ramón López-Cózar
Part of the Advances in Intelligent and Soft Computing book series (AINSC, volume 57)

Summary

In this paper, we describe a methodology that proved to be successful and cost-efficient for porting an existing speech recognition system to other languages. Our initial aim was to make a system called MyVoice developed for handicapped persons in Czechia, available also to users that speak other languages. The experimental results show that the proposed method can not only be used with languages with similar origins (in this case Czech and Slovak), but also with languages that belong to very different branches of the Indo-European language family (such as Czech and Spanish), obtaining in both cases accuracy rates above 70% with a 149k words lexicon.

Keywords

Minority Language Speech Recognition System Word Error Rate Speech Recognizer Speaker Adaptation 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Kirchhoff, K., Vergyri, D.: Cross-dialectal data sharing for acoustic modeling in Arabic speech recognition. Speech Communication 46, 37–51 (2005)CrossRefGoogle Scholar
  2. 2.
    Nouza, J., Nouza, T., Cerva, P.: A Multi-Functional Voice-Control Aid for Disabled Persons. In: Proceedings of 10th International Conference on Speech and Computer (SPECOM 2005), Patras, Greece, pp. 715–718 (2005)Google Scholar
  3. 3.
    Cerva, P., Nouza, J.: Design and Development of Voice Controlled Aids for Motor-Handicapped Persons. In: Proceedings of 11th International Conference on Spoken Language Processing (Interspeech 2007 - Eurospeech), Antwerp, Belgium, pp. 2521–2524 (2007)Google Scholar
  4. 4.
    Gauvain, J.L., Lee, C.H.: Maximum A Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains. IEEE Transactions on Speech and Audio Processing 2, 291–298 (1994)CrossRefGoogle Scholar
  5. 5.
    Gales, M.J.F., Woodland, P.C.: Mean and Variance Adaptation Within the MLLR Framework. Computer Speech and Language 10, 249–264 (1996)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Zoraida Callejas
    • 1
  • Jan Nouza
    • 2
  • Petr Cerva
    • 2
  • Ramón López-Cózar
    • 1
  1. 1.Dept. of Computer Languages and SystemsUniversity of GranadaGranadaSpain
  2. 2.Inst. of Information Technology and ElectronicsTechnical University of LiberecLiberecCzech Republic

Personalised recommendations