Clustering of short time-course gene expression data with dissimilar replicates
Microarrays are used in genetics and medicine to examine large numbers of genes simultaneously through their expression levels under a given condition, such as a disease of interest. The information from these experiments can be enriched by following the expression levels over time and across biological replicates. The purpose of this study is to propose an algorithm that clusters genes according to the similarity of their behavior through time. The algorithm is also designed to highlight the genes that behave differently between replicates and to separate the constant genes, which keep their baseline expression levels throughout the study. Finally, we use cluster validation techniques to suggest a sensible number of clusters when it is not known a priori. The illustrations show that the proposed algorithm offers a fast approach to clustering genes by behavioral similarity, and that it separates the constant genes and the genes with dissimilar replicates without any need for pre-processing. It is also successful at suggesting the correct number of clusters when that number is not known.
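The abstract does not specify the distance or the thresholds used, so the following is only a minimal sketch of the kind of triage it describes, not the authors' algorithm. It assumes a slope-based (successive-difference) distance between short time profiles; the function names `slopes`, `shape_distance`, and `triage`, the tolerances `const_tol` and `rep_tol`, and the toy data are all hypothetical.

```python
from statistics import mean


def slopes(profile):
    """Successive differences of a time profile (its shape through time)."""
    return [b - a for a, b in zip(profile, profile[1:])]


def shape_distance(p, q):
    """Euclidean distance between the slope vectors of two profiles."""
    return sum((a - b) ** 2 for a, b in zip(slopes(p), slopes(q))) ** 0.5


def triage(genes, const_tol=0.5, rep_tol=1.0):
    """Split genes into constant, dissimilar-replicate, and clusterable groups.

    genes maps a gene name to a list of replicate profiles of equal length.
    Both tolerances are illustrative assumptions, not values from the paper.
    """
    constant, dissimilar, clusterable = [], [], {}
    for name, reps in genes.items():
        # Largest pairwise shape distance between replicates of this gene.
        spread = max(
            (shape_distance(r1, r2)
             for i, r1 in enumerate(reps)
             for r2 in reps[i + 1:]),
            default=0.0,
        )
        # Replicate-averaged profile, one value per time point.
        avg = [mean(vals) for vals in zip(*reps)]
        if spread > rep_tol:
            dissimilar.append(name)       # replicates disagree in shape
        elif max(avg) - min(avg) < const_tol:
            constant.append(name)         # stays near its baseline level
        else:
            clusterable[name] = avg       # passed on to a clustering step
    return constant, dissimilar, clusterable


# Toy data: three genes, two replicates each, four time points.
genes = {
    "g_up": [[0.0, 1.0, 2.0, 3.0], [0.1, 1.1, 2.0, 3.1]],
    "g_flat": [[1.0, 1.1, 1.0, 0.9], [1.0, 0.9, 1.0, 1.1]],
    "g_split": [[0.0, 1.0, 2.0, 3.0], [3.0, 2.0, 1.0, 0.0]],
}
constant, dissimilar, clusterable = triage(genes)
```

The averaged profiles left in `clusterable` could then be passed to any standard clustering routine using `shape_distance`, with a cluster validation index compared across candidate numbers of clusters.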
Keywords: Microarray gene expression, Short time-series, Replication, Distance, Clustering, Cluster validation
The authors would like to thank the Academic Writing Center at Middle East Technical University for their writing consultation and editorial help.