Abstract
This paper presents a series of language identification (LID) experiments for Spanish, Basque and English. Spanish and Basque are both official languages in the Basque Country, a region located in northern Spain. We focused our research on some techniques based on phone decoding. We propose the use of phone segments as decoding units instead of just phones. We describe a simple procedure to obtain a set of phone segments that typically appear in the languages involved. In comparison with similar techniques that do not rely on phone segments, the choice of these segments as decoding units yields a remarkable improvement in terms of LID accuracy: from 93.02% using phones to 98.32% using phone segments, when applied to trilingual read speech.
This work was partially supported by the Spanish CICYT project TIN2005-08660-C04-03 and by the University of the Basque Country under grant 9/UPV 00224.310-15900/2004.
Chapter PDF
Similar content being viewed by others
References
Itakahashi, S., Du, L.: Language identification based on speech fundamental frequency. In: EUROSPEECH, Madrid, Spain, vol. 2, pp. 1359–1362 (1995)
Zissman, M.A., Singer, E.: Automatic language identification of telephone speech messages using phoneme recognition and n-gram modelling. In: ICASSP, Adelaide, Australia, vol. 1, pp. 305–308 (1994)
Navrátil, J., Zühlke, W.: An efficient phonotactic-acoustic system for language identification. In: ICASSP, Seattle, USA, vol. 2, pp. 781–784 (1998)
Singer, E., Torres-Carrasquillo, P.A., Gleason, T.P., Campbell, W.M., Reynolds, D.A.: Acoustic, phonetic and discriminative approaches to automatic language identification. In: EUROSPEECH, Geneva, Switzerland, pp. 1349–1352 (2003)
Schultz, T., Rogina, I., Waibel, A.: Lvcsr-based language identification. In: ICASSP, Atlanta, USA, pp. 781–784 (1996)
Martin, A.F., Le, A.N.: The current state of language recognition: Nist 2005 evaluation results. In: Proceedings of the IEEE Odyssey 2006, the Speaker and Language Recognition Workshop, San Juan, Puerto Rico (2006)
Li, H., Ma, B.: A phonotactic language model for spoken language identification. In: ACL 2005, Morristown, NJ, USA, pp. 515–522 (2005)
Guijarrubia, V., Torres, I.: Basque-spanish language identification using phonebased methods. In: Proceedings of International Conference of Spoken Language Processing, Pittsburgh, USA, pp. 1780–1783 (2006)
Young, S.R.: Detecting misrecognitions and out-of-vocabulary words. In: ICASSP, Adelaide, Australia, vol. 2, pp. 21–24 (1994)
Hieronymus, J.L., Kadambe, S.: Spoken Language Identification Using Large Vocabulary Speech Recognition. In: Proceedings of International Conference of Spoken Language Processing, Philadelphia, USA, pp. 1780–1783 (1996)
Guijarrubia, V., Torres, I., Rodríguez, L.J.: Evaluation of a Spoken Phonetic Database in Basque Language. In: LREC 2004, Lisbon, vol. 6, pp. 2127–2130 (2004)
Moreno, A., Poch, D., Bonafonte, A., Lleida, E., Llisterri, J., Mariño, J.B., Nadeu, C.: Albayzin speech database: Design of the phonetic corpus. In: EUROSPEECH, Lisbon (1993)
Pérez, A., Torres, I., Casacuberta, F., Guijarrubia, V.: A Spanish-Basque weather forecast corpus for probabilistic speech translation. In: 5th SALTMIL Workshop on Minority Languages, Genoa, Italy, pp. 99–101 (2006)
Torres, I., Varona, A.: K-TSS Language Model in a Speech Recognition System. Computer Speech and Language 15(2), 127–149 (2001)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Guijarrubia, V., Torres, M.I. (2007). Phone-Segments Based Language Identification for Spanish, Basque and English. In: Rueda, L., Mery, D., Kittler, J. (eds) Progress in Pattern Recognition, Image Analysis and Applications. CIARP 2007. Lecture Notes in Computer Science, vol 4756. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-76725-1_12
Download citation
DOI: https://doi.org/10.1007/978-3-540-76725-1_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-76724-4
Online ISBN: 978-3-540-76725-1
eBook Packages: Computer ScienceComputer Science (R0)