Learning and Evaluation in the Presence of Class Hierarchies: Application to Text Categorization

  • Svetlana Kiritchenko
  • Stan Matwin
  • Richard Nock
  • A. Fazel Famili
Conference paper

DOI: 10.1007/11766247_34

Part of the Lecture Notes in Computer Science book series (LNCS, volume 4013)
Cite this paper as:
Kiritchenko S., Matwin S., Nock R., Famili A.F. (2006) Learning and Evaluation in the Presence of Class Hierarchies: Application to Text Categorization. In: Lamontagne L., Marchand M. (eds) Advances in Artificial Intelligence. AI 2006. Lecture Notes in Computer Science, vol 4013. Springer, Berlin, Heidelberg

Abstract

This paper deals with categorization tasks where categories are partially ordered to form a hierarchy. First, it introduces the notion of consistent classification which takes into account the semantics of a class hierarchy. Then, it presents a novel global hierarchical approach that produces consistent classification. This algorithm with AdaBoost as the underlying learning procedure significantly outperforms the corresponding “flat” approach, i.e. the approach that does not take into account the hierarchical information. In addition, the proposed algorithm surpasses the hierarchical local top-down approach on many synthetic and real tasks. For evaluation purposes, we use a novel hierarchical evaluation measure that has some attractive properties: it is simple, requires no parameter tuning, gives credit to partially correct classification and discriminates errors by both distance and depth in a class hierarchy.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Svetlana Kiritchenko
    • 1
    • 4
  • Stan Matwin
    • 1
    • 2
  • Richard Nock
    • 3
  • A. Fazel Famili
    • 4
  1. 1.University of OttawaCanada
  2. 2.Institute of Computer SciencePolish Academy of SciencesWarsawPoland
  3. 3.Université Antilles-GuyaneMartiniqueFrance
  4. 4.Institute for Information TechnologyNational Research CouncilCanada

Personalised recommendations