A Study on Bilingual Speech Recognition Involving a Minority Language

Luján-Mares, Míriam; Martínez-Hinarejos, Carlos-D.; Alabau, Vicente

doi:10.1007/978-3-642-04235-5_4

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 5603)

Included in the following conference series:

Language and Technology Conference

Abstract

Multilingual Automatic Speech Recognition (ASR) systems are of great interest in multilingual environments. We have studied the case of the Comunitat Valenciana, whose two official languages are Spanish and Valencian. These two languages share most of their phonemes, and their syntax and vocabulary are also quite similar, since they have influenced each other for many years. In this work, we present the design of the language models and the acoustic models for this bilingual situation. Acoustic models can be separate for each language or shared by both, and they can be obtained directly from a training corpus or by adapting a previous set of acoustic models. Language models can be separate for each language (monolingual recognition) or mixed for both languages (bilingual recognition). We performed experiments with a small corpus to determine which option was better for this case.

Work partially supported by VIDI-UPV under the PAID06 program, by the EC (FEDER) and the Spanish MEC under grant TIN2006-15694-CO2-01, and by the Spanish research programme Consolider Ingenio 2010: MIPRCV (CSD2007-00018).

Author information

Authors and Affiliations

Institut Tecnològic d’Informàtica, Universitat Politècnica de València, Camí de Vera, s/n., 46071, València, Spain
Míriam Luján-Mares, Carlos-D. Martínez-Hinarejos & Vicente Alabau

Editor information

Editors and Affiliations

Faculty of Mathematics and Computer Science, Adam Mickiewicz University in Poznań, ul. Umultowska 87, P.O. Box, 61614, Poznań, Poland
Zygmunt Vetulani
Language Technology Lab, German Research Center for Artificial Intelligence (DFKI), Campus D 3 1, Stuhlsatzenhausweg 3, D-66123, Saarbrücken, Germany
Hans Uszkoreit

Cite this paper

Luján-Mares, M., Martínez-Hinarejos, CD., Alabau, V. (2009). A Study on Bilingual Speech Recognition Involving a Minority Language. In: Vetulani, Z., Uszkoreit, H. (eds) Human Language Technology. Challenges of the Information Society. LTC 2007. Lecture Notes in Computer Science, vol 5603. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04235-5_4

DOI: https://doi.org/10.1007/978-3-642-04235-5_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04234-8
Online ISBN: 978-3-642-04235-5
eBook Packages: Computer Science, Computer Science (R0)