Abstract
Feature selection is the task of systematically reducing the number of input features for a classification task. In natural language processing, basic feature selection is often achieved by removing common stop words. To reduce the number of input features more drastically, dedicated feature selection methods such as Mutual Information or Chi-Squared are applied to a count-based input representation. We suggest a task-oriented approach that selects features based on the weights learned by a Max Entropy classifier trained on the classification task. The remaining features can then be used by other classifiers to perform the actual classification. Experiments on different natural language processing tasks confirm that the weight-based method is comparable to count-based methods: the number of input features can be reduced considerably while maintaining classification performance.
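The described pipeline amounts to training a Max Entropy model (commonly implemented as multinomial logistic regression) on a count-based representation, ranking features by their learned weights, and handing the reduced feature set to a second classifier. The following is a minimal sketch of that idea, assuming scikit-learn; the feature budget k, the max-over-classes weight ranking, and the downstream Naive Bayes classifier are illustrative choices, not details taken from the paper.

# Sketch of weight-based feature selection via a Max Entropy classifier.
# Assumptions (not from the paper): scikit-learn, 20 Newsgroups data,
# budget k, and Naive Bayes as the downstream classifier.
import numpy as np
from sklearn.datasets import fetch_20newsgroups
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.naive_bayes import MultinomialNB

train = fetch_20newsgroups(subset="train")
test = fetch_20newsgroups(subset="test")

# Count-based input representation.
vectorizer = CountVectorizer()
X_train = vectorizer.fit_transform(train.data)
X_test = vectorizer.transform(test.data)

# Train the Max Entropy classifier on the classification task itself.
maxent = LogisticRegression(max_iter=1000)
maxent.fit(X_train, train.target)

# Rank features by their largest absolute weight across classes
# and keep the top k.
k = 10_000
scores = np.abs(maxent.coef_).max(axis=0)
selected = np.argsort(scores)[-k:]

# A different classifier then uses only the selected features.
clf = MultinomialNB()
clf.fit(X_train[:, selected], train.target)
print("accuracy:", clf.score(X_test[:, selected], test.target))

With a vocabulary of roughly 130,000 terms on this corpus, keeping k = 10,000 features would shrink the input by over 90 percent; whether accuracy is preserved at a given k is exactly the question the paper's experiments address.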
Copyright information
© 2019 Springer Fachmedien Wiesbaden GmbH, part of Springer Nature
About this paper
Cite this paper
Schnöll, M., Ferner, C., Wegenkittl, S. (2019). The Effectiveness of the Max Entropy Classifier for Feature Selection. In: Haber, P., Lampoltshammer, T., Mayr, M. (eds) Data Science – Analytics and Applications. Springer Vieweg, Wiesbaden. https://doi.org/10.1007/978-3-658-27495-5_4
DOI: https://doi.org/10.1007/978-3-658-27495-5_4
Publisher Name: Springer Vieweg, Wiesbaden
Print ISBN: 978-3-658-27494-8
Online ISBN: 978-3-658-27495-5