Abstract
We present a text classification method based upon maximum entropy optimization. Having a set of documents which must be classified into some given classes, a maximum entropy optimization problem is considered. In order to solve this problem, we consider its Lagrange dual and we derive, by means of strong duality, the optimality conditions.
After solving the dual problem, we obtain, as solution of the primal problem, a distribution of probabilities representing the chances of the documents to belong to each class.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Baeza-Yates, R., Ribeiro-Neto, B. (1999) Modern Information Retrieval. Addison-Wesley Verlag
Bapat, R.B., Raghavan, T.E.S. (1997) Nonnegative Matrices and Applications. Cambridge University Press, Cambridge
Berger, A.L., Delia Pietra, S.A., Delia Pietra, V.J. (1996) A Maximum Entropy Approach to Natural Language Processing. Comput. Ling. 22, 1, 1–36
Darroch, J.N., Ratcliff, D. (1972) Genelarized iterative scaling for log-linear models. Ann. Math. Stat. 43, 1470–1480
Delia Pietra, S., Delia Pietra, V., Lafferty, J. (1997) Inducing Features of Random Fields. IEEE Trans. Pattern Anal. Mach. Intell. 19, 4, 1–13
Elster, K.H., Reinhart, R., Schäuble, M., Donath, G. (1977) Einführung in die nichtlineare Optimierung. BSB B.G. Teubner Verlagsgesellschaft, Leipzig
Fang, S.C., Rajasekera, J.R., Tsao, H.-S.J. (1997) Entropy Optimization and Mathematical Programming. Kluwer Academic Publishers, Boston
Guiaşu, S., Shenitzer, A. (1985) The Principle of Maximum Entropy. The Math. Intell. 7, 1, 42–48
Jurafsky, D., Martin, J.H. (2000) Speech and Language Processing. Prentice-Hall Inc.
Kapur, J.N., Kesavan, H.K. (1992) Entropy Optimization Principles with Applications. Academic Press Inc., San Diego
Nigam, K., Lafferty, J., McCallum, A. (1999) Using Maximum Entropy for Text Classification. Proceedings of IJCAI-99 Workshop on Machine Learning for Information Filtering, 61–67
Wanka, G., Boţ, R.I. (2002) On the Relations Between Different Dual Problems in Convex Mathematical Programming. In Chamoni, P., Leisten, R., Martin, A., Minnemann, J., Stadtler, H., (eds.), Operations Research Proceedings 2001, Springer, Heidelberg, 255–262
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2003 Deutscher Universitäts-Verlag/GWV Fachverlage GmbH, Wiesbaden
About this chapter
Cite this chapter
Boţ, R.I., Grad, SM., Wanka, G. (2003). Maximum entropy optimization for text classification problems. In: Habenicht, W., Scheubrein, B., Scheubrein, R. (eds) Multi-Criteria- und Fuzzy-Systeme in Theorie und Praxis. Deutscher Universitätsverlag. https://doi.org/10.1007/978-3-322-81539-2_13
Download citation
DOI: https://doi.org/10.1007/978-3-322-81539-2_13
Publisher Name: Deutscher Universitätsverlag
Print ISBN: 978-3-8244-7864-4
Online ISBN: 978-3-322-81539-2
eBook Packages: Springer Book Archive