Text classification using lattice machine

Wang, Hui; Nguyen Hung Son

doi:10.1007/BFb0095109

Communications
Conference paper
First Online: 20 October 2006

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 1609)

Abstract

A novel approach to supervised learning, called the Lattice Machine, was proposed in [5]. The Lattice Machine assumes that data are structured as relations. In this paper we investigate the application of the Lattice Machine to text classification, where textual data are unstructured. We represent a set of textual documents as a collection of Boolean feature vectors, where each vector corresponds to one document and each entry in a vector indicates whether a particular term appears in the document. This is a common representation of textual documents. We show that, using this representation, the Lattice Machine's operations are simply set-theoretic operations. In particular, the lattice sum operation is set intersection and the ordering relationship is set inclusion. Experiments show that the Lattice Machine, under this configuration, is quite competitive with state-of-the-art learning algorithms for text classification.
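The set-theoretic reading of the representation described above can be illustrated with a minimal Python sketch. It is not the authors' implementation and does not reproduce the Lattice Machine's training or classification procedure; it only shows the two operations named in the abstract. The function names (`term_set`, `lattice_sum`, `covers`), the whitespace tokenizer, and the sample documents are all hypothetical. The direction of the ordering (a summary covers a document when the summary's terms are a subset of the document's terms) is an assumption, chosen so that the sum of two documents covers both of them.

```python
def term_set(text):
    """Boolean feature vector of a document, stored as the set of terms present.

    Assumption: a naive lowercase whitespace tokenizer stands in for real
    preprocessing (stop-word removal, stemming, and so on).
    """
    return frozenset(text.lower().split())


def lattice_sum(a, b):
    """Lattice sum under this representation: plain set intersection."""
    return a & b


def covers(summary, doc):
    """Ordering under this representation: plain set inclusion.

    Assumption: `summary` covers `doc` when every term of `summary`
    also appears in `doc`.
    """
    return summary <= doc


# Hypothetical sample documents, for illustration only.
doc1 = term_set("wheat prices rise")
doc2 = term_set("wheat exports fall")
doc3 = term_set("oil prices rise")

# The sum keeps only the terms shared by both documents.
shared = lattice_sum(doc1, doc2)

print(sorted(shared))
print(covers(shared, doc1), covers(shared, doc2), covers(shared, doc3))
```

Storing each Boolean vector as a `frozenset` of the terms that are present keeps both operations down to built-in set operators, which is the point the abstract makes about this representation.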

The authors gratefully acknowledge support by the KBN/British Council Grant No. WAR/992/151.

References

[1] William W. Cohen. Fast effective rule induction. In Machine Learning: Proceedings of the Twelfth International Conference. Morgan Kaufmann, 1995. http://www.research.att.com/~wcohen/.
[2] William W. Cohen and Haym Hirsh. Joins that generalize: Text classification using WHIRL. In Proc. KDD-98, New York, 1998. http://www.research.att.com/~wcohen/.
[3] M. Porter. An algorithm for suffix stripping. Program, 14(3):130–137, 1980.
[4] Ross Quinlan. C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo, 1993.
[5] Hui Wang, Ivo Düntsch, and David Bell. Data reduction based on hyper relations. In Proceedings of KDD98, New York, pages 349–353, 1998.
[6] Jinxi Xu and W.B. Croft. Corpus-based stemming using co-occurrence of word variants. ACM TOIS, 16(1):61–81, Jan. 1998.

Author information

Authors and Affiliations

School of Information and Software Engineering, University of Ulster, BT 37 0QB, Newtownabbey, N. Ireland
Hui Wang
Institute of Mathematics, Warsaw University, 02-095, Banacha 2 Str., Warsaw, Poland
Nguyen Hung Son

Editor information

Zbigniew W. Raś
Andrzej Skowron

About this paper

Cite this paper

Wang, H., Nguyen Hung Son (1999). Text classification using lattice machine. In: Raś, Z.W., Skowron, A. (eds) Foundations of Intelligent Systems. ISMIS 1999. Lecture Notes in Computer Science, vol 1609. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0095109

DOI: https://doi.org/10.1007/BFb0095109
Published: 20 October 2006
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65965-5
Online ISBN: 978-3-540-48828-6
eBook Packages: Springer Book Archive
