Developing a Basque TTS for the Navarro-Lapurdian Dialect

  • Eva Navas
  • Inma Hernaez
  • Daniel Erro
  • Jasone Salaberria
  • Beñat Oyharçabal
  • Manuel Padilla
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8854)

Abstract

This paper presents a new TTS system for the Navarro-Lapurdian dialect of Basque, built on an existing TTS system for standard Basque. A phonetically balanced corpus of 4000 sentences was designed and recorded by two speakers. The voice was built using a high-quality speech coder within an HMM-based speech synthesis framework. In a subjective evaluation, the new dialectal system was compared with the existing standard Basque system and with a mixed system that applies the dialect's phonetic transcription rules but uses the speech generation module of the standard Basque system. Adapting the front-end module alone, by adding new phonetic transcription rules and new sounds, is not enough to obtain a system that works better than the standard Basque system. With the new dialectal voice, however, the results indicate that users prefer the dialectal system to the standard Basque one.

Keywords

Dialectal TTS, Minoritarian Language, Basque TTS

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Eva Navas (1)
  • Inma Hernaez (1)
  • Daniel Erro (1, 2)
  • Jasone Salaberria (3)
  • Beñat Oyharçabal (3)
  • Manuel Padilla (3)

  1. Aholab (UPV/EHU), ETSI Bilbao, Bilbao, Spain
  2. IKERBASQUE, Bilbao, Spain
  3. IKER UMR 5478, Bayonne, France
