The Implementation of Text Categorization with ARC-BC Algorithm

  • Chen Zuyi
  • Zhao Taixiang
Chapter
Part of the Lecture Notes in Electrical Engineering book series (LNEE, volume 127)

Abstract

The Apriori based Associative Rule Classifier By Category (ARC-BC) algorithm is implemented to classify text documents. In ARC-BC algorithm, each individual category is considered separately and different. Rules are extracted from documents of each category independently. The experimental result shows that the performance of ARC-BC based text categorization is very pretty efficient and effective and it is comparable to Naïve Bayesian(NB) algorithm[2] based text categorization.

Keywords

text categorization Associative Rule Naïve Bayesian algorithm ARC-BC algorithm 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Joachims, T.: Text Categorization with Support Vector Machines: Learning with Many Relevant Features. In: European Conference on Machine Learning (ECML), pp. 137–142 (1998)Google Scholar
  2. 2.
    Murata, M., Ma, Q., Uchimoto, K., Ozaku, H., Isahara, H., Utiyama, M.: Information retrieval using locaion and category information. Journal of the Association for Natural Language Processing 7(2) (2000)Google Scholar
  3. 3.
    Strzalkowski, T.: Natural language information retrieval: TIPSTER-2 final report. In: Proceedings of A Workshop on Held At Vienna, Virginia: Annual Meeting of the ACL, May 6-8, pp. 143–148. Association for Computational Linguistics, Morristown (1996)Google Scholar
  4. 4.
    Wai, L., C.Y. Ho.: Using a Generalized Instance Set for Automatic Text Categorization. SIGIR, 81–89 (1998)Google Scholar

Copyright information

© Springer-Verlag GmbH Berlin Heidelberg 2012

Authors and Affiliations

  • Chen Zuyi
    • 1
  • Zhao Taixiang
    • 1
  1. 1.Department of FoundationThe First Aeronautic Institute of Air ForceXinyangChina

Personalised recommendations