Chapter

Advances in Information Systems

Volume 3261 of the series Lecture Notes in Computer Science pp 263-272

Multiple Sets of Rules for Text Categorization

  • Yaxin BiAffiliated withLancaster UniversitySchool of Computer Science, Queen’s University of BelfastSchool of Biomedical Science, University of Ulster
  • , Terry AndersonAffiliated withLancaster UniversityFaculty of Engineering, University of Ulster
  • , Sally McCleanAffiliated withLancaster UniversityFaculty of Engineering, University of Ulster

* Final gross prices may vary according to local VAT.

Get Access

Abstract

This paper concerns how multiple sets of rules can be generated using a rough sets-based inductive learning method and how they can be combined for text categorization by using Dempster’s rule of combination. We first propose a boosting-like technique for generating multiple sets of rules based on rough set theory, and then model outcomes inferred from rules as pieces of evidence. The various experiments have been carried out on 10 out of the 20-newsgroups – a benchmark data collection – individually and in combination. Our experimental results support the claim that “k experts may be better than any one if their individual judgements are appropriately combined”.