Clustering XML Documents Using Self-organizing Maps for Structures
Self-Organizing Maps capable of encoding structured information are used to cluster XML documents. Documents formatted in XML are naturally represented as graph data structures. It is shown that the Self-Organizing Maps can be trained in an unsupervised fashion to group XML-structured data into clusters, and that the time required for this task scales linearly with the size of the corpus. It is also shown that some simple prior knowledge of the data structures helps to group the XML documents efficiently.
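The graph representation of an XML document can be illustrated with a minimal sketch. It assumes that only element nesting matters (text content and attributes are ignored), which is a simplification and not necessarily the encoding used by the authors. The function name `xml_to_graph` and the sample document are hypothetical.

```python
import xml.etree.ElementTree as ET


def xml_to_graph(xml_text):
    """Return (labels, edges) for the element tree of an XML string.

    labels[i] is the tag name of node i; edges holds (parent, child)
    index pairs. Nodes are numbered in document (pre-order) order,
    so node 0 is always the root node.
    """
    root = ET.fromstring(xml_text)
    labels, edges = [], []

    def visit(element, parent_index):
        index = len(labels)          # next free node index
        labels.append(element.tag)   # node label = XML tag name
        if parent_index is not None:
            edges.append((parent_index, index))
        for child in element:        # children in document order
            visit(child, index)

    visit(root, None)
    return labels, edges


# Hypothetical sample document, used only for illustration.
doc = "<article><title/><body><section/><section/></body></article>"
labels, edges = xml_to_graph(doc)
print(labels)  # ['article', 'title', 'body', 'section', 'section']
print(edges)   # [(0, 1), (0, 2), (2, 3), (2, 4)]
```

Each node carries its tag as a label and the edges record the parent-child nesting, which is the kind of structured input a Self-Organizing Map for structures would consume in place of a flat feature vector.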
Keywords: Root Node, Compression Ratio, Codebook Vector, Display Space, Collapse Sequence