An Evolutionary Approach for Sample-Based Clustering on Microarray Data
Sample-based clustering is one of the most common methods for discovering disease subtypes as well as unknown taxonomies. By revealing hidden structures in microarray data, cluster analysis can potentially lead to more tailored therapies for patients as well as better diagnostic procedures. In this work, we present a novel method for automatically discovering clusters of samples which are coherent from a genetic point of view. Each possible cluster is characterized by a fuzzy pattern which maintains a fuzzy discretization of relevant gene expression values. Noise genes are identified and removed from the fuzzy pattern based on their probability of appearance. Possible clusters are randomly constructed and iteratively refined by following a probabilistic search and an optimization schema. Experimental results over publicly available microarray data show the effectiveness of the proposed method.
Keywordssimulated annealing sample-based clustering discriminant fuzzy pattern microarray data
Unable to display preview. Download preview PDF.
- 4.Ding, C.: Analysis of gene expression profiles: class discovery and leaf ordering. In: Proceedings of the Six Annual International Conference on Computational Molecular Biology, pp. 127–136 (2002)Google Scholar
- 6.Ben-Dor, A., Friedman, N., Yakhini, Z.: Class discovery in gene expression data. In: Proceedings of the fifth Annual International Conference on Computational Biology, pp. 31–38 (2001)Google Scholar
- 12.Armstrong, S.A., Stauton, J.E., Silverman, L.B., Pieters, R., den Boer, M.L., Minden, M.D., Sallan, S.E., Lander, E.S., Golub, T.R., Korsmeyer, S.J.: MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia. Nature Genetics 20, 41–47 (2002)CrossRefGoogle Scholar