Hierarchical Signature Clustering for Time Series Microarray Data
Existing clustering techniques produce clusters from time series microarray data, but the distance metrics they use lack interpretability for this type of data. While some previous methods focus on matching levels, the genes of interest here are those that behave in the same manner but at varying levels. Such genes are not clustered together under a Euclidean metric and are indiscernible under a correlation metric, so we propose a more appropriate metric and a modified hierarchical clustering method to highlight them. Hashing and bucket sort allow for fast clustering, and the hierarchical dendrogram allows direct comparison, with the distance having an easily understood meaning. The method also extends well to k-means clustering when the desired number of clusters is known.
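As a rough illustration of the idea of grouping genes by behaviour rather than by level, the following sketch hashes each expression profile to a "signature" and buckets genes that share one. The abstract does not define the signature or the metric, so everything here is an assumption made for illustration: the signature is taken to be the direction of change between consecutive time points (up, down, flat), the distance is the number of time steps at which two signatures disagree, and the names `signature`, `signature_distance`, `bucket_by_signature`, and the toy profiles are hypothetical.

```python
from collections import defaultdict


def signature(series, tol=0.0):
    """Assumed signature: direction of each successive change.

    +1 = up, -1 = down, 0 = flat (within tol). This is an illustrative
    choice, not necessarily the signature used in the paper.
    """
    sig = []
    for prev, curr in zip(series, series[1:]):
        diff = curr - prev
        if diff > tol:
            sig.append(1)
        elif diff < -tol:
            sig.append(-1)
        else:
            sig.append(0)
    return tuple(sig)


def signature_distance(sig_a, sig_b):
    """Assumed distance: number of time steps where the signatures differ."""
    return sum(a != b for a, b in zip(sig_a, sig_b))


def bucket_by_signature(profiles):
    """Group gene names by signature using a hash table (dict of lists)."""
    buckets = defaultdict(list)
    for gene, series in profiles.items():
        buckets[signature(series)].append(gene)
    return dict(buckets)


# Toy profiles (made up): geneA and geneB rise, fall, rise at very
# different levels; geneC does the opposite.
profiles = {
    "geneA": [1.0, 2.0, 1.5, 3.0],
    "geneB": [10.0, 40.0, 12.0, 13.0],
    "geneC": [5.0, 4.0, 6.0, 2.0],
}

buckets = bucket_by_signature(profiles)
for sig, genes in buckets.items():
    print(sig, genes)
```

Under these assumptions, geneA and geneB land in the same bucket despite their different levels, and the distance between two buckets reads directly as "the number of time steps at which the genes move differently".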
Keywords: Gene pattern discovery and identification; Microarrays