Intelligent Information Processing and Web Mining pp 489-498 | Cite as
Hierarchical Document Clustering Using Frequent Closed Sets
Abstract
Aerial archaeology plays an important role in the detection and documentation of archaeological sites, which often cannot be easily seen from the ground. It is a quick way to survey large areas, but requires a lot of error-prone human work to analyze it afterwards. In this paper we utilize some of the best-performing image processing and data mining methods to develop a system capable of an accurate automated classification of such aerial photographs. The system consists of phases of image indexing, rough image segmentation, feature extraction, feature grouping and building the classifier. We present the results of experiments conducted on a real set of archaeological and non-archaeological aerial photographs and conclude with perspectives for future work.
Keywords
Feature Vector Frequent Itemsets Initial Cluster Frequent Item Document ClusterPreview
Unable to display preview. Download preview PDF.
References
- 1.1. Agrawal R., Srikant R.: Fast Algorithms for Mining Association Rules, VLDB, Santiago, Chile, Morgan Kaufmann, 1994, 487–499Google Scholar
- 2.2. Beil F., Ester M., Xu X.: Frequent term-based text clustering. KDD 2002: 436–442Google Scholar
- 3.3. Fung B.C.M., Wan K., Ester M.: Hierarchical Document Clustering Using Frequent Itemsets, SDM'03, 2003Google Scholar
- 4.4. Ganter B., Wille R.: Formal Concept Analysis, Mathematical Foundations, Springer, 1999Google Scholar
- 5.5. Pasquier N., Bastide Y., Taouil R., Lakhal L.: Discovering Frequent Closed Itemsets for Association Rules, LNCS, Vol. 1540. Springer, 1999, 398–416Google Scholar
- 6.6. Steinbach M., Karypis G., Kumar V.: A comparison of Document Clustering Techniques, KDD Workshop on Text Mining, 2000Google Scholar
- 7.7. Xu X., Ester M., Kriegel H.P., Sander J.: A Distribution-Based Clustering Algorithm for Mining in Large Spatial Databases. In: Proc. of the 14th ICDE Conference (1998)Google Scholar
- 8.8. Wang K., Xu C., Liu B.: Clustering Transactions Using Large Items, CIKM, 1999, 483–490Google Scholar
- 9.9. http://fimi.cs.helsinki.fiGoogle Scholar
- 10.10. http://www-users.cs.umn.edu/~karypis/clutoGoogle Scholar