Abstract
The development of Multilingual Large Vocabulary Continuous Speech Recognition systems involves issues such as Language Identification, Acoustic-Phonetic Decoding, Language Modelling, and the development of appropriate Language Resources. The interest in Multilingual Systems arises because there are three official languages in the Basque Country (Basque, Spanish, and French), and there is much linguistic interaction among them, even though Basque has very different roots from the other two languages. This paper describes the development of a Language Identification (LID) system oriented to robust Multilingual Speech Recognition in the Basque context. The work presents hybrid strategies for LID that combine the selection of system elements by Support Vector Machine and Multilayer Perceptron classifiers with stochastic methods for speech recognition tasks (Hidden Markov Models and n-grams).
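As a rough illustration of how n-gram models can drive language identification, the sketch below trains one add-one-smoothed bigram model per language and picks the language whose model gives the input the highest log-likelihood. It is not the authors' system: the abstract gives no implementation details, so the class `NGramModel`, the function `identify`, the toy word lists in `TOY_DATA`, and the use of written characters as a stand-in for recognized phone-like units are all assumptions made for this example.

```python
import math
from collections import Counter


def ngrams(symbols, n=2):
    """Return the n-grams of a symbol sequence, padded with start/end markers."""
    padded = ["<s>"] * (n - 1) + list(symbols) + ["</s>"]
    return [tuple(padded[i:i + n]) for i in range(len(padded) - n + 1)]


class NGramModel:
    """Add-one-smoothed n-gram model over symbol sequences (hypothetical helper)."""

    def __init__(self, sequences, n=2):
        self.n = n
        self.ngram_counts = Counter()
        self.context_counts = Counter()
        self.vocab = {"</s>"}
        for seq in sequences:
            self.vocab.update(seq)
            for gram in ngrams(seq, n):
                self.ngram_counts[gram] += 1
                self.context_counts[gram[:-1]] += 1

    def log_prob(self, symbols, vocab_size):
        """Log-likelihood of a sequence under this model with add-one smoothing."""
        total = 0.0
        for gram in ngrams(symbols, self.n):
            numerator = self.ngram_counts[gram] + 1
            denominator = self.context_counts[gram[:-1]] + vocab_size
            total += math.log(numerator / denominator)
        return total


def identify(symbols, models):
    """Return (best_language, scores) for a symbol sequence."""
    # Share one vocabulary size across models so the scores are comparable.
    vocab_size = len(set().union(*(m.vocab for m in models.values())))
    scores = {lang: m.log_prob(symbols, vocab_size) for lang, m in models.items()}
    return max(scores, key=scores.get), scores


# Toy training data (assumption): a few written words per language,
# with characters standing in for phone-like units.
TOY_DATA = {
    "eu": ["kaixo", "eskerrik", "asko", "etxea", "euskara"],
    "es": ["hola", "gracias", "casa", "lengua", "muchas"],
    "fr": ["bonjour", "merci", "maison", "langue", "beaucoup"],
}
models = {lang: NGramModel(words) for lang, words in TOY_DATA.items()}

if __name__ == "__main__":
    best, scores = identify("eskerrik", models)
    print(best, scores)
```

In a hybrid setup of the kind the abstract describes, per-language scores like these could serve as input features to a discriminative classifier such as an SVM or MLP; that combination is likewise an assumption here, not a description of the paper's architecture.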
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Barroso, N., de Ipiña, K.L., Ezeiza, A., Barroso, O., Susperregi, U. (2010). Hybrid Approach for Language Identification Oriented to Multilingual Speech Recognition in the Basque Context. In: Graña Romay, M., Corchado, E., Garcia Sebastian, M.T. (eds) Hybrid Artificial Intelligence Systems. HAIS 2010. Lecture Notes in Computer Science, vol 6076. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13769-3_24
DOI: https://doi.org/10.1007/978-3-642-13769-3_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13768-6
Online ISBN: 978-3-642-13769-3
eBook Packages: Computer Science, Computer Science (R0)