Evolution of Multiple Tree Structured Patterns from Tree-Structured Data Using Clustering
We propose a new genetic programming approach to extraction of multiple tree structured patterns from tree-structured data using clustering. As a combined pattern we use a set of tree structured patterns, called tag tree patterns. A structured variable in a tag tree pattern can be substituted by an arbitrary tree. A set of tag tree patterns matches a tree, if at least one of the set of patterns matches the tree. By clustering positive data and running GP subprocesses on each cluster with negative data, we make a combined pattern which consists of best individuals in GP subprocesses. The experiments on some glycan data show that our proposed method has a higher support of about 0.8 while the previous method for evolving single patterns has a lower support of about 0.5.
KeywordsPositive Data Edge Label Negative Data Single Pattern Tree Structure Data
Unable to display preview. Download preview PDF.
- 2.Doubet, S., Albersheim, P.: Clycobiology. Data Mining and Knowledge Discovery 2(6), 505 (1992)Google Scholar
- 3.Yao, X. (ed.): Evolutionary Computation: Theory and Applications. World Scientific, Singapore (1999)Google Scholar
- 8.Kuboyama, T., Hirata, K., Aoki-Kinoshita, K.F., Kashima, H., Yasuda, H.: A gram distribution kernel applied to glycan classification and motifextraction. Genome Informatics 17(2), 25–34 (2006)Google Scholar