Machine Learning

, Volume 39, Issue 2, pp 135–168

BoosTexter: A Boosting-based System for Text Categorization

Authors

  • Robert E. Schapire
    • Shannon LaboratoryAT&T Labs
  • Yoram Singer
    • School of Computer Science & EngineeringThe Hebrew University
Article

DOI: 10.1023/A:1007649029923

Cite this article as:
Schapire, R.E. & Singer, Y. Machine Learning (2000) 39: 135. doi:10.1023/A:1007649029923

Abstract

This work focuses on algorithms which learn from examples to perform multiclass text and speech categorization tasks. Our approach is based on a new and improved family of boosting algorithms. We describe in detail an implementation, called BoosTexter, of the new boosting algorithms for text categorization tasks. We present results comparing the performance of BoosTexter and a number of other text-categorization algorithms on a variety of tasks. We conclude by describing the application of our system to automatic call-type identification from unconstrained spoken customer responses.

text and speech categorization multiclass classification problems boosting algorithms

Copyright information

© Kluwer Academic Publishers 2000