Abstract
This paper presents the application of morpheme-based and factored language models in an Amharic speech recognition task. Since the use of morphemes in both acoustic and language models often results in performance degradation due to a higher acoustic confusability and since it is problematic to use factored language models in standard word decoders, we applied the models in a lattice rescoring framework. Lattices of 100 best alternatives for each test sentence of the 5k development test set have been generated using a baseline speech recognizer with a word-based backoff bigram language model. The lattices have then been rescored by means of various morpheme-based and factored language models. A slight improvement in word recognition accuracy has been observed with morpheme-based language models while factored language models led to notable improvements in word recognition accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Junqua, J.-C., Haton, J.-P.: Robustness in Automatic Speech Recognition: Fundamentals and Applications. Kluwer Academic, London (1996)
Young, S., Evermann, G., Gales, M., Hain, T., Kershaw, D., Liu, X., Moore, G., Odell, J., Ollason, D., Povey, D., Valtchev, V., Woodland, P.: The HTK Book. Cambridge University Engineering Department (2006)
Vergyri, D., Kirchhoff, K., Duh, K., Stolcke, A.: Morphology-Based Language Modeling for Arabic Speech Recognition. In: ICSLP 2004, pp. 2245–2248 (2004)
Geutner, P.: Using Morphology towards Better Large-Vocabulary Speech Recognition Systems. IEEE International on Acoustics, Speech and Signal Processing I, 445–448 (1995)
Whittaker, E., Woodland, P.: Particle-Based Language Modeling. In: Proceeding of International Conference on Spoken Language Processing, pp. 170–173 (2000)
Byrne, W., Hajič, J., Ircing, P., Jelinek, F., Khudanpur, S., Krebc, P., Psutka, J.: On Large Vocabulary Continuous Speech Recognition of Highly Inflectional Language - Czech. In: Proceeding of the European Conference on Speech Communication and Technology, pp. 487–489 (2001)
Kirchhoff, K., Bilmes, J., Henderson, J., Schwartz, R., Noamany, M., Schone, P., Ji, G., Das, S., Egan, M., He, F., Vergyri, D., Liu, D., Duta, N.: Novel Speech Recognition Models for Arabic. In: Johns-Hopkins University Summer Research Workshop (2002)
Hirsimäki, T., Creutz, M., Siivola, V., Kurimo, M.: Morphologically Motivated Language Models in Speech Recognition. In: Proceedings of the International and Interdisciplinary Conference on Adaptive Knowledge Representation and Reasoning, pp. 121–126 (2005)
Abate, S. T.: Automatic Speech Recognition for Amharic. University of Hamburg (2006)
Tachbelie, M.Y., Menzel, W.: Sub-Word Based Language Modeling for Amharic. In: Proceedings of International Conference on Recent Advances in Natural Language Processing, pp. 564–571 (2007)
Tachbelie, M.Y., Menzel, W.: Morpheme-Based Language Modeling for Inflectional Language - Amharic. In: Nicolov, N., Angelova, G., Mitkov, R. (eds.) Recent Advances in Natural Language Processing Selected Papers from RANLP 2007, vol. V, pp. 301–310. John Benjamin’s Publishing, Amsterdam (2009)
Pellegrini, T., Lamel, L.: Investigating Automatic Decomposition for ASR in Less Represented Languages. In: Proceedings of INTERSPEECH 2006 (2006)
Pellegrini, T., Lamel, L.: Using Phonetic Features in Unsupervised Word Decompounding for ASR with Application to A Less-Represented Language. In: Proceedings of INTERSPEECH 2007, pp. 1797–1800 (2007)
Creutz, M., Lagus, K.: Unsupervised Morpheme Segmentation and Morphology Induction from Text Corpora Using Morfessor 1.1. A81, Neural Networks Research Center, Helsinki University of Technology (2005)
Kirchhoff, K., Bilmes, J., Das, S., Duta, N., Egan, M., Ji, G., He, F., Henderson, J., Liu, D., Noamany, M., Schone, P., Schwartz, R., Vergyri, D.: Novel Approaches to Arabic Speech Recognition: Report from the 2002 Johns-Hopkins Summer Workshop. In: Proceedings of International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 344–347 (2003)
Tachbelie, M.Y., Abate, S.T., Menzel, W.: Morpheme-Based Language Modeling for Amharic Speech Recognition. In: Proceedings of the 4th Language and Technology Conference, pp. 114–118 (2009)
Kirchhoff, K., Bilmes, J., Duh, K.: Factored Language Models - a Tutorial. Dept. of Electrical Eng., Univ. of Washington (2008)
Duh, K., Kirchhoff, K.: Automatic Learning of Language Model Structure. In: Proceeding of International Conference on Computational Linguistics (2004)
Bender, M.L., Bowen, J.D., Cooper, R.L., Ferguson, C.A.: Languages in Ethiopia. Oxford Univ. Press, London (1976)
Yimam, B.: yäamarIŋa säwasäw. 2nd. ed. EMPDE, Addis Ababa (2007)
Abate, S.T., Menzel, W., Tafila, B.: An Amharic Speech Corpus for Large Vocabulary Continuous Speech Recognition. In: Proceedings of 9th European Conference on Speech Communication and Technology (2005)
Stolcke, A.: SRILM - an Extensible Language Modeling Toolkit. In: Proceedings of International Conference on Spoken Language Processing, vol. II, pp. 901–904 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tachbelie, M.Y., Abate, S.T., Menzel, W. (2011). Morpheme-Based and Factored Language Modeling for Amharic Speech Recognition. In: Vetulani, Z. (eds) Human Language Technology. Challenges for Computer Science and Linguistics. LTC 2009. Lecture Notes in Computer Science(), vol 6562. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20095-3_8
Download citation
DOI: https://doi.org/10.1007/978-3-642-20095-3_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20094-6
Online ISBN: 978-3-642-20095-3
eBook Packages: Computer ScienceComputer Science (R0)