Abstract
Multilingual Automatic Speech Recognition (ASR) systems are of great interest in multilingual environments. We study the case of the Comunitat Valenciana, whose two official languages are Spanish and Valencian. These two languages share most of their phonemes, and their syntax and vocabulary are also quite similar, since they have influenced each other for many years. In this work, we present the design of the language models and the acoustic models for this bilingual situation. Acoustic models can be separate for each language or shared by both, and they can be trained directly from a corpus or obtained by adapting an existing set of acoustic models. Language models can be separate for each language (monolingual recognition) or mixed for both languages (bilingual recognition). We performed experiments with a small corpus to determine which option works best in this case.
Work partially supported by VIDI-UPV under the PAID06 program, by the EC (FEDER) and the Spanish MEC under grant TIN2006-15694-CO2-01, and by the Spanish research programme Consolider Ingenio 2010: MIPRCV (CSD2007-00018).
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Luján-Mares, M., Martínez-Hinarejos, C.D., Alabau, V. (2009). A Study on Bilingual Speech Recognition Involving a Minority Language. In: Vetulani, Z., Uszkoreit, H. (eds) Human Language Technology. Challenges of the Information Society. LTC 2007. Lecture Notes in Computer Science, vol 5603. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04235-5_4
DOI: https://doi.org/10.1007/978-3-642-04235-5_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04234-8
Online ISBN: 978-3-642-04235-5
eBook Packages: Computer Science (R0)