Advertisement

Associative Classification in Text Categorization

  • Jian Chen
  • Jian Yin
  • Jun Zhang
  • Jin Huang
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3644)

Abstract

Text categorization has become one of the key techniques for handling and organizing text data. This model is used to classify new article to its most relevant category. In this paper, we propose a novel associative classification algorithm ACTC for text categorization. ACTC aims at extracting the k-best strong correlated positive and negative association rules directly from training set for classification, avoiding to appoint complex support and confidence threshold. ACTC integrates the advantages of the previously proposed effective strategies as well as the new strategies presented in this paper. An extensive performance study reveals that the improvement of ACTC outperform other rule-based classification approaches on accuracy.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Huang, D.-S.: Systematic Theory of Neural Networks for Pattern Recognition. Publishing House of Electronic Industry of China, Beijing (1996)Google Scholar
  2. 2.
    Liu, B., Hsu, W., Ma, Y.: Integrating Classification and Association Rule Mining. In: Proceedings of International Conference on Knowledge Discovery and Data Mining, pp. 80–86. AAAI, New York (1998)Google Scholar
  3. 3.
    Li, W., Han, J., Pei, J.: CMAR: Accurate and Efficient Classification Based on Multiple Class-Association Rules. In: Proceedings of the 2001 IEEE International Conference on Data Mining, San José, California, USA. IEEE Computer Society, Los Alamitos (2001)Google Scholar
  4. 4.
    Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo (1993)Google Scholar
  5. 5.
    Cohen, W.W.: Fast Effective Rule Induction. In: Proceedings of 12th International Conference on Machine Learning, Tahoe City, California, USA, pp. 115–123. Morgan Kaufmann, San Francisco (1995)Google Scholar
  6. 6.
    Yin, X., Han, J.: CPAR: Classification Based on Predictive Association Rules. In: Proceedings of SIAM International Conference on Data Mining (SDM 2003), San Fransisco, CA (May 2003)Google Scholar
  7. 7.
    Zhang, J., Henry, S.H., Chung, W.L.: LO: Adaptive Genetic Algorithms using Clustering Technique. IEEE Trans. Evolutionary Computation (accepted with revision)Google Scholar
  8. 8.
    Quinlan, J.R., Cameron-Jones, R.M.: FOIL: A midterm report. In: Brazdil, P.B. (ed.) ECML 1993. LNCS, vol. 667, pp. 3–20. Springer, Heidelberg (1993)Google Scholar
  9. 9.
    Piatetsky-Shapiro, G.: Discovery, Analysis, and Presentation of Strong Rules. In: Piatetsky-Shapiro, G., Frawley, W.J. (eds.) Knowledge Discovery in Databases, pp. 229–238. AAAI/MIT Press (1991)Google Scholar
  10. 10.
    Wu, X., Zhang, C., Zhang, S.: Efficient Mining of Both Positive and Negative Association Rules. ACM Transactions on Information Systems 22(3), 381–405 (2004)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Jian Chen
    • 1
  • Jian Yin
    • 1
  • Jun Zhang
    • 1
  • Jin Huang
    • 1
  1. 1.Department of Computer ScienceZhongshan UniversityGuangzhouChina

Personalised recommendations