Multiple Sets of Rules for Text Categorization
- Cite this paper as:
- Bi Y., Anderson T., McClean S. (2004) Multiple Sets of Rules for Text Categorization. In: Yakhno T. (eds) Advances in Information Systems. ADVIS 2004. Lecture Notes in Computer Science, vol 3261. Springer, Berlin, Heidelberg
This paper concerns how multiple sets of rules can be generated using a rough sets-based inductive learning method and how they can be combined for text categorization by using Dempster’s rule of combination. We first propose a boosting-like technique for generating multiple sets of rules based on rough set theory, and then model outcomes inferred from rules as pieces of evidence. The various experiments have been carried out on 10 out of the 20-newsgroups – a benchmark data collection – individually and in combination. Our experimental results support the claim that “k experts may be better than any one if their individual judgements are appropriately combined”.
Unable to display preview. Download preview PDF.