Speech and Language Resources within Speech Recognition and Synthesis Systems for Serbian and Kindred South Slavic Languages
Unlike other new technologies, most speech technologies are heavily language dependent and have to be developed separately for each language. The paper gives a detailed description of speech and language resources for Serbian and kindred South Slavic languages developed during the last decade within joint projects of the Faculty of Technical Sciences, Novi Sad, Serbia and the company “AlfaNum”. It points out the advantages of simultaneous development of speech synthesis and recognition as complementary speech technologies, and discusses the possibility of reuse of speech and language resources across kindred languages.
Keywordsspeech technologies speech and language resources South Slavic languages
Unable to display preview. Download preview PDF.
- 1.Vitas, D., Popović, L., Krstev, C., Obradović, I., Pavlović-Lažetić, G., Stanojević, M.: The Serbian language in the digital age. White paper series. Springer (2012)Google Scholar
- 2.Brill, E.: Transformation-based error-driven learning and natural language processing: A case study in part-of-speech tagging. Comput. Linguist. 21(4), 543–566 (1995)Google Scholar
- 3.Merialdo, B.: Tagging English text with a probabilistic model. Comput. Linguist. 20, 155–172 (1994)Google Scholar
- 4.Obradović, R., Pekar, D.: C++ library for signal processing – SLIB. In: Digital Signal and Image Processing Conference DOGS, Novi Sad, Serbia (2000)Google Scholar
- 5.Pakoci, E., Mak, R.: HMM-based speech synthesis for the Serbian language. In: 56th ETRAN, Zlatibor, Serbia, vol. TE4, pp. 1–4 (2012)Google Scholar
- 7.Sečujski, M.: Obtaining prosodic information from text in Serbian language. In: EUROCON, Belgrade, Serbia, pp. 1654–1657 (2005)Google Scholar
- 8.Sečujski, M.: Automatic part-of-speech tagging in Serbian. PhD thesis, University of Novi Sad, Serbia (2009)Google Scholar
- 9.Sečujski, M., Jakovljević, N., Pekar, D.: Automatic prosody generation for Serbo-Croatian speech synthesis based on regression trees. In: Interspeech 2011, Florence, Italy, pp. 3157–3160 (2011)Google Scholar
- 10.Ostrogonac, S., Popović, B., Sečujski, M., Mak, R., Pekar, D.: Language model reduction for practical implementation in LVCSR systems. In: Infoteh, Jahorina, Bosnia and Herzegovina (accepted for publication, 2013)Google Scholar