Indonesian Shallow Stemmer for Text Reading Support System
Our project involves the construction of a web-based system to facilitate the reading and comprehension of Indonesian text. The system helps users understand difficult words in a text by displaying dictionary information about those words in a window. Many Indonesian words are formed by combining root words with affixes and other combining forms, so finding the relevant dictionary entry requires a stemming program that extracts the root word. We therefore developed our own Indonesian stemming program. Because it is used only within a text reading system, the stemmer does not need to be perfect. In this paper, we describe the stemmer and present the results of a preliminary evaluation. We also describe a design for the text reading support system that uses the stemming program.
Keywords: Text Reading; Base Word; Baseline System; Input Word; Root Word
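To make the idea of dictionary-driven stemming concrete, the following is a minimal sketch of affix stripping followed by dictionary lookup. It is not the stemmer described in the paper: the affix lists, the minimum stem length of three letters, the function names `candidates` and `shallow_stem`, and the toy dictionary are all assumptions chosen for illustration, and the sketch ignores phenomena such as nasal assimilation and reduplication.

```python
# Illustrative only: a small, assumed subset of Indonesian affixes.
PREFIXES = ("meng", "meny", "mem", "men", "me",
            "peng", "peny", "pem", "pen", "pe",
            "ber", "ter", "di", "ke", "se")
SUFFIXES = ("kan", "nya", "lah", "kah", "an", "i")
MIN_STEM = 3  # assumed lower bound on stem length, to avoid over-stripping


def candidates(word):
    """Return the word itself, then forms with at most one suffix
    and at most one prefix removed, in that order, without duplicates."""
    word = word.lower()
    # Step 1: the word as given, plus every form with one suffix removed.
    stems = [word]
    for suf in SUFFIXES:
        if word.endswith(suf) and len(word) - len(suf) >= MIN_STEM:
            stems.append(word[:-len(suf)])
    # Step 2: for each of those, also try removing one prefix.
    out = []
    for s in stems:
        out.append(s)
        for pre in PREFIXES:
            if s.startswith(pre) and len(s) - len(pre) >= MIN_STEM:
                out.append(s[len(pre):])
    # Drop duplicates while preserving order.
    seen = set()
    unique = []
    for c in out:
        if c not in seen:
            seen.add(c)
            unique.append(c)
    return unique


def shallow_stem(word, dictionary):
    """Return the first candidate found in the dictionary, or None."""
    for c in candidates(word):
        if c in dictionary:
            return c
    return None


# Toy dictionary of root words (hypothetical; a real system would load
# the entries of the dictionary it displays to the reader).
ROOTS = {"baca", "makan"}

print(shallow_stem("membaca", ROOTS))
print(shallow_stem("dibacakan", ROOTS))
print(shallow_stem("makanan", ROOTS))
```

Because the unmodified word is checked first, a root such as "makan" that happens to end in a suffix-like string is returned as is rather than being stripped further. A word with no matching candidate yields `None`, which a reading support interface could treat as "no entry found".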