Abstract
Many real-world machine learning tasks must cope with small training sets, and the class distribution of the training set often does not match the target distribution. In this paper we compare the performance of many learning models on a substantial benchmark of binary text classification tasks having small training sets. We vary both the training-set size and its class distribution to examine the learning surface, as opposed to the traditional learning curve. The models tested comprise various feature selection methods, each coupled with four learning algorithms: Support Vector Machines (SVM), Logistic Regression, Naive Bayes, and Multinomial Naive Bayes. Different models excel in different regions of the learning surface, yielding meta-knowledge about which model to apply in which situation. This can guide researchers and practitioners when choosing a model and feature selection method in, for example, information retrieval settings.
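To make the experimental setup concrete, here is a minimal sketch (not the authors' code) of how such a learning surface could be traced: training sets are sampled over a grid of sizes and positive-class fractions, chi-squared feature selection is fit on each small sample, and three of the four learners are scored on a fixed test set. The synthetic term-count data, the grid values, the choice of k = 100 selected features, and the F1 metric are all illustrative assumptions rather than the paper's actual benchmark.

# A minimal sketch (NOT the authors' experimental code) of sweeping a
# "learning surface": performance as a function of both training-set size
# and class distribution. A synthetic count matrix stands in for a
# bag-of-words text corpus; all grid values are illustrative.
import numpy as np
from sklearn.feature_selection import SelectKBest, chi2
from sklearn.naive_bayes import MultinomialNB
from sklearn.linear_model import LogisticRegression
from sklearn.svm import LinearSVC
from sklearn.metrics import f1_score

rng = np.random.default_rng(0)

# Synthetic "documents": Poisson term counts, with the first 50 terms
# occurring more often in the positive class to provide signal.
n_docs, n_terms = 2000, 500
y_all = rng.integers(0, 2, n_docs)
X_all = rng.poisson(0.5, (n_docs, n_terms))
X_all[:, :50] += rng.poisson(2.0, (n_docs, 50)) * y_all[:, None]

# Fixed held-out test set; the rest is a pool to draw training sets from.
X_pool, y_pool = X_all[:1500], y_all[:1500]
X_test, y_test = X_all[1500:], y_all[1500:]

def sample_training_set(X, y, size, pos_fraction, rng):
    """Draw `size` examples containing the requested fraction of positives."""
    n_pos = int(round(size * pos_fraction))
    pos = rng.choice(np.where(y == 1)[0], n_pos, replace=False)
    neg = rng.choice(np.where(y == 0)[0], size - n_pos, replace=False)
    idx = np.concatenate([pos, neg])
    return X[idx], y[idx]

for size in (20, 50, 100, 200):            # training-set size axis
    for pos_frac in (0.1, 0.3, 0.5):        # class-distribution axis
        X_tr, y_tr = sample_training_set(X_pool, y_pool, size, pos_frac, rng)
        # Feature selection is fit only on the (small) training sample.
        sel = SelectKBest(chi2, k=100).fit(X_tr, y_tr)
        for name, clf in (("MultinomialNB", MultinomialNB()),
                          ("LogReg", LogisticRegression(max_iter=1000)),
                          ("LinearSVM", LinearSVC())):
            clf.fit(sel.transform(X_tr), y_tr)
            f1 = f1_score(y_test, clf.predict(sel.transform(X_test)))
            print(f"size={size:4d}  pos={pos_frac:.1f}  {name:13s}  F1={f1:.3f}")

Plotting the resulting F1 scores over the (size, class-distribution) grid gives one learning surface per model; comparing the surfaces shows which model dominates in which region.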
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Forman, G., Cohen, I. (2004). Learning from Little: Comparison of Classifiers Given Little Training. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) Knowledge Discovery in Databases: PKDD 2004. Lecture Notes in Computer Science, vol. 3202. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30116-5_17
Print ISBN: 978-3-540-23108-0
Online ISBN: 978-3-540-30116-5