Abstract
In this paper we propose a new type of syllable-based unit for recognition and language modeling, aimed at improving the recognition rate for Korean phones, syllables and characters. These ‘combined’ units take into account both Korean characters and the syllable units as realized in speech. With the proposed units, character, syllable and phone sequences can be obtained directly from the recognition results. We evaluate the approach in two experiments. First, we build language models for phones, characters, syllables and the proposed combined units from the same text corpus and compare the performance of each unit. Second, we run a vector space model based retrieval experiment using the proposed combined units.
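For readers unfamiliar with vector space retrieval, the following is a minimal sketch of the general technique, not the authors' implementation. It assumes each document and query has already been decoded into a sequence of recognition units; the unit labels (`u1`, `u2`, ...), the TF-IDF weighting, and the function names are illustrative assumptions, since the abstract does not specify them.

```python
import math
from collections import Counter


def tf_idf_vectors(docs):
    """Turn token lists into sparse TF-IDF vectors ({term: weight} dicts).

    The idf formula (log(N/df) + 1) is one common choice, assumed here.
    """
    n_docs = len(docs)
    doc_freq = Counter(term for doc in docs for term in set(doc))
    idf = {term: math.log(n_docs / df) + 1.0 for term, df in doc_freq.items()}
    vectors = [
        {term: count * idf[term] for term, count in Counter(doc).items()}
        for doc in docs
    ]
    return vectors, idf


def cosine(u, v):
    """Cosine similarity between two sparse vectors; 0.0 if either is empty."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank(query, docs):
    """Return document indices sorted by decreasing similarity to the query.

    Ties are broken by document index so the ordering is deterministic.
    """
    vectors, idf = tf_idf_vectors(docs)
    # Query terms never seen in the collection get weight 0.
    q_vec = {term: count * idf.get(term, 0.0) for term, count in Counter(query).items()}
    scored = [(cosine(q_vec, vec), idx) for idx, vec in enumerate(vectors)]
    return [idx for _, idx in sorted(scored, key=lambda pair: (-pair[0], pair[1]))]


# Hypothetical unit sequences standing in for recognizer output.
documents = [["u1", "u2", "u3"], ["u1", "u4"], ["u5", "u6"]]
print(rank(["u4"], documents))
```

Because the ranking only sees opaque tokens, the same code applies unchanged whether the tokens are phones, characters, syllables or combined units, which is what makes a per-unit comparison straightforward.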
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kim, BW., Um, Y., Lee, YJ. (2006). Syllable-Based Recognition Unit to Reduce Error Rate for Korean Phones, Syllables and Characters. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2006. Lecture Notes in Computer Science, vol 4188. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11846406_42
DOI: https://doi.org/10.1007/11846406_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-39090-9
Online ISBN: 978-3-540-39091-6
eBook Packages: Computer Science (R0)