Abstract
Most of state-of-the-art large vocabulary continuous speech recognition systems use word-based n-gram language models. Such models are not optimal solution for inflectional or agglutinative languages. The Polish language is highly inflectional one and requires a very large corpora to create a sufficient language model with the small out-of-vocabulary ratio. We propose a syllable-based language model, which is better suited to highly inflectional language like Polish. In case of lack of resources (i.e. small corpora) syllable-based model outperforms word-based models in terms of number of out-of-vocabulary units (syllables in our model). Such model is an approximation of the morpheme-based model for Polish. In our paper, we show results of evaluation of syllable based model and its usefulness in speech recognition tasks.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bilmes, J., Kirchhoff, K.: Factored Language Models and Generalized Parallel Backoff. In: Human Language Technology Conference, Edmonton (2003)
Byrne, W., Hajič, J., Ircing, P., Krbec, P., Psutka, J.: Morpheme based language models for speech recognition of Czech. In: International Conference on Text Speech and Dialogue, Brno (2000)
Jurafsky, D., Martin, J.H.: Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Prentice-Hall, Englewood Cliffs (2000)
Larson, M., Eickeler, S.: Using Syllable-based Indexing Features and Language Models to improve German Spoken Document Retrieval. In: Proceedings of the 8th European Conference on Speech Communication and Technology (EUROSPEECH), Geneva, pp. 1217–1220 (2003)
Ogrodniczuk, M.(ed): Enhanced Corpus of Frequency Dictionary of Contemporary Polish, http://www.mimuw.edu.pl/polszczyzna/pl196x/index_en.htm
Rotovnik, T., Sepesy, M.M., Kačič, Z.: Large vocabulary continuous speech recognition of an inflected language using stems and endings. Speech Communication 49, 437–452 (2007)
Sawicka, I.: Fonologia. In: Gramatyka współczesnego jȩzyka polskiego, t. Fonetyka i fonologia, Instytut Jȩzyka Polskiego PAN, Kraków (1995)
Siivola, V., Hirsimäki, T., Creutz, M., Kurimo, M.: Unlimited Vocabulary Speech Recognition Based on Morphs Discovered in an Unsupervised Manner. In: Proceedings of the 8th European Conference on Speech Communication and Technology (EUROSPEECH), Geneva, pp. 2293–2296 (2003)
Stolcke, A.: SRILM – an extensible language modeling toolkit. In: Proc. Intl. Conf. on Spoken Language Processing, Denver (2002)
Xu, B., Ma, B., Zhang, S., Qu, F., Huang, T.: Speaker-independent Dictation of Chinese Speech with 32K Vocabulary. In: Proc. Intl. Conf. on Spoken Language Processing, Philadelphia, vol. 4, pp. 2320–2323 (1996)
Wierzchowska, B.: Fonetyka i fonologia jȩzyka polskiego. Ossolineum, Wrocław (1980)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Majewski, P. (2008). Syllable Based Language Model for Large Vocabulary Continuous Speech Recognition of Polish. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2008. Lecture Notes in Computer Science(), vol 5246. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87391-4_51
Download citation
DOI: https://doi.org/10.1007/978-3-540-87391-4_51
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87390-7
Online ISBN: 978-3-540-87391-4
eBook Packages: Computer ScienceComputer Science (R0)