Hybrid Approach for Language Identification Oriented to Multilingual Speech Recognition in the Basque Context

Barroso, N.; de Ipiña, K. López; Ezeiza, A.; Barroso, O.; Susperregi, U.

doi:10.1007/978-3-642-13769-3_24

N. Barroso²¹,
K. López de Ipiña²²,
A. Ezeiza²²,
O. Barroso²¹ &
…
U. Susperregi²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6076))

Included in the following conference series:

International Conference on Hybrid Artificial Intelligence Systems

1420 Accesses
3 Citations

Abstract

The development of Multilingual Large Vocabulary Continuous Speech Recognition systems involves issues as: Language Identification, Acoustic-Phonetic Decoding, Language Modelling or the development of appropriated Language Resources. The interest on Multilingual Systems arouses because there are three official languages in the Basque Country (Basque, Spanish, and French), and there is much linguistic interaction among them, even if Basque has very different roots than the other two languages. This paper describes the development of a Language Identification (LID) system oriented to robust Multilingual Speech Recognition for the Basque context. The work presents hybrid strategies for LID, based on the selection of system elements by Support Vector Machines and Multilayer Perceptron classifiers and stochastic methods for speech recognition tasks (Hidden Markov Models and n-grams).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Schultz, T., Kirchhoff, N.: Multilingual Speech Processing. Elsevier, Amsterdam (2006)
Google Scholar
Schultz, T., Waibel, A.: Multilingual and Crosslingual Speech Recognition. In: Proceedings of the DARPA Broadcast News, Workshop (1998)
Google Scholar
Seng, S., Sam, S., Le, V.B., Bigi, B., Besacier, L.: Which Units For Acoustic and Language Modeling For Khmer Automatic Speech Recognition. In: 1st International Conference on Spoken Language Processing for Under-resourced languages Hanoi, Vietnam (2008)
Google Scholar
Dau-Cheng, L., Ren-Yuan, L.: Language Identification on Code-Switching Utterances Using Multiple Cues. In: Proc. of Interspeech 2008 (2008)
Google Scholar
Lopez de Ipiña, K., Graña, M., Ezeiza, N., Hernández, M., Zulueta, E., Ezeiza, A., Tovar, C.: Selection of Lexical Units for CSR of Basque. In: Sanfeliu, A., Ruiz-Shulcloper, J. (eds.) CIARP 2003. LNCS, vol. 2905, pp. 244–250. Springer, Heidelberg (2003)
Chapter Google Scholar
Barroso, N., Ezeiza, A., Gilisagasti, N., Lopez de Ipiña, K., López, A., López, J.M.: Development of Multimodal Resources for Multilingual Information Retrieval in the Basque context. In: Proccedings of Interspeech 2007, Antwerp, Belgium (2007)
Google Scholar
Le, V.B., Besacier, L.: Automatic speech recognition for under-resourced languages: application to Vietnamese language. IEEE Transactions on Audio, Speech, and Language Processing 17(8), 1471–1482 (2009)
Article Google Scholar
Li, H., Ma, B.: A Phonotactic Language Model for Spoken LID. ACL (2005)
Google Scholar
Ma, B., Li, H.: An Acoustic Segment Modeling Approach to Automatic Language Identification. In: Proc. Interspeech 2005, Lisbon, Portugal, pp. 2829–2832 (2005)
Google Scholar
Matejka, P., Schwarz, P., Cernocky, J., Chytil, P.: Phonotactic LID using High Quality Phoneme Recognition. In: Proc. Interspeech 2005, Lisbon, Portugal, pp. 2237–2240 (2005)
Google Scholar
Nagarajan, T., Murthy, H.A.: Language Identification, Using Parallel Syllable-Like Unit Recognition. In: Proc. ICASSP, pp. I-401 – I-404 (2004)
Google Scholar
Vandecatseye, A., Martens, J.P., Neto, J., Meinedo, H., Garcia-Mateo, C., Dieguez, F.J., Mihelic, F., Zibert, J., Nouza, J., David, P., Pleva, M., Cizmar, A., Papageorgiou, H., Alexandris, C.: The COST278 pan-European Broadcast News Database. In: Proceedings of LREC 2004, Lisbon, Portugal (2004)
Google Scholar
Wheatley, B., Kondo, K., Anderson, W., Muthusamy, Y.: An evaluation of Cross-Language Adaptation for Rapid HMM Development in a New Language. In: International Conference on Acoustics, Speech, and Signal Processing, Adelaine, pp. 237–240 (1994)
Google Scholar
Padrell, J., Martín-Iglesias, D., Díaz-de-María, F.: Support Vector Machines for Continuous Speech Recognition. In: 14th European Signal Processing Conference (EUSIPCO 2006), Florence, Italy, September 4-8 (2006)
Google Scholar
Ganapathiraju, A., Hmaker, J., Picone, J.: Hybrid SVM/HMM architectures for speech recognition. In: Proc. of the International Conference on Spoken Language Processing, vol. 4, pp. 504–507 (2000)
Google Scholar
Smith, N., Gales, M.: Speech recognition using SVMs. In: Advances in Neural Information Processing Systems, vol. 14. MIT Press, Cambridge (2002)
Google Scholar
Cosi, P.: Hybrid HMM-NN architectures for connected digit recognition. In: Proc. of the International Joint Conference on Neural Networks, vol. 5 (2000)
Google Scholar
Ambikairajah, L., Choi, E.: Robust language identification based on fused phonotactic information with MLKSFM ICME. In: 2009 IEEE International Conference on pre-classifier, Multimedia and Expo. (2009)
Google Scholar
Graña, M., Torrealdea, F.J.: Hierarchically structured systems. European Journal of Operational Research 25, 20–26 (1986)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Irunweb Enterprise, Auzolan 2B – 2, Irun, 20303, Spain
N. Barroso, O. Barroso & U. Susperregi
Departamento de Ingeniería de Sistemas y Automática, Grupo de Inteligencia Computacional, Escuela Politécnica Universidad del País Vasco/Euskal Herriko Unibertsitatea, Plaza de Europa1, Donostia, 20008
K. López de Ipiña & A. Ezeiza

Authors

N. Barroso
View author publications
You can also search for this author in PubMed Google Scholar
K. López de Ipiña
View author publications
You can also search for this author in PubMed Google Scholar
A. Ezeiza
View author publications
You can also search for this author in PubMed Google Scholar
O. Barroso
View author publications
You can also search for this author in PubMed Google Scholar
U. Susperregi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Facultad de informatica UPV/EHU, San Sebastian, Spain
Manuel Graña Romay & M. Teresa Garcia Sebastian &
Universidad de Salamanca, Spain
Emilio Corchado

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Barroso, N., de Ipiña, K.L., Ezeiza, A., Barroso, O., Susperregi, U. (2010). Hybrid Approach for Language Identification Oriented to Multilingual Speech Recognition in the Basque Context. In: Graña Romay, M., Corchado, E., Garcia Sebastian, M.T. (eds) Hybrid Artificial Intelligence Systems. HAIS 2010. Lecture Notes in Computer Science(), vol 6076. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13769-3_24

Download citation

DOI: https://doi.org/10.1007/978-3-642-13769-3_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13768-6
Online ISBN: 978-3-642-13769-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics