The Implementation of Text Categorization with ARC-BC Algorithm
Chapter
First Online:
Abstract
The Apriori based Associative Rule Classifier By Category (ARC-BC) algorithm is implemented to classify text documents. In ARC-BC algorithm, each individual category is considered separately and different. Rules are extracted from documents of each category independently. The experimental result shows that the performance of ARC-BC based text categorization is very pretty efficient and effective and it is comparable to Naïve Bayesian(NB) algorithm[2] based text categorization.
Keywords
text categorization Associative Rule Naïve Bayesian algorithm ARC-BC algorithmPreview
Unable to display preview. Download preview PDF.
References
- 1.Joachims, T.: Text Categorization with Support Vector Machines: Learning with Many Relevant Features. In: European Conference on Machine Learning (ECML), pp. 137–142 (1998)Google Scholar
- 2.Murata, M., Ma, Q., Uchimoto, K., Ozaku, H., Isahara, H., Utiyama, M.: Information retrieval using locaion and category information. Journal of the Association for Natural Language Processing 7(2) (2000)Google Scholar
- 3.Strzalkowski, T.: Natural language information retrieval: TIPSTER-2 final report. In: Proceedings of A Workshop on Held At Vienna, Virginia: Annual Meeting of the ACL, May 6-8, pp. 143–148. Association for Computational Linguistics, Morristown (1996)Google Scholar
- 4.Wai, L., C.Y. Ho.: Using a Generalized Instance Set for Automatic Text Categorization. SIGIR, 81–89 (1998)Google Scholar
Copyright information
© Springer-Verlag GmbH Berlin Heidelberg 2012