International Journal of Speech Technology, Volume 6, Issue 3, pp 221–232

Spoken Language Resources at LUKS of the University of Ljubljana

  • France Mihelič
  • Jerneja Gros
  • Simon Dobrišek
  • Janez Žibert
  • Nikola Pavešić

Abstract

This paper presents the Slovene-language spoken resources that were acquired at the Laboratory of Artificial Perception, Systems and Cybernetics (LUKS) at the Faculty of Electrical Engineering, University of Ljubljana over the past ten years. The resources consist of:

• isolated-spoken-word corpora designed for phonetic research of the Slovene spoken language;

• read-speech corpora from dialogues relating to air flight information;

• isolated-word corpora, designed for studying the Slovene spoken diphthongs;

• Slovene diphone corpora used for text-to-speech synthesis systems;

• a weather forecast speech database, as an attempt to capture radio and television broadcast news in the Slovene language; and

• read- and spontaneous-speech corpora used to study the effects of the psychophysical condition of the speakers on their speech characteristics.

All the resources are accompanied by relevant text transcriptions, lexicons and various segmentation labels. The read-speech corpora relating to the air flight information domain are also annotated prosodically and semantically. The words in the orthographic transcription were automatically tagged for their lemma and morphosyntactic description. Many of these speech resources are freely available for basic research purposes in speech technology and linguistics. In this paper we describe all the resources in more detail and give a brief description of their use in the spoken language technology products developed at LUKS.

Keywords: speech corpora, Slovene language, read speech, spontaneous speech


Copyright information

© Kluwer Academic Publishers 2003

Authors and Affiliations

  • France Mihelič (1)
  • Jerneja Gros (1)
  • Simon Dobrišek (1)
  • Janez Žibert (1)
  • Nikola Pavešić (1)

  1. Faculty of Electrical Engineering, University of Ljubljana, Laboratory of Artificial Perception, Systems and Cybernetics, Ljubljana, Slovenia
