Mining Time-Delayed Coherent Patterns in Time Series Gene Expression Data

  • Linjun Yin
  • Guoren Wang
  • Keming Mao
  • Yuhai Zhao
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4093)


Unlike previous pattern-based biclustering methods that focus on grouping objects on the same subset of dimensions, in this paper, we propose a novel model of coherent cluster for time series gene expression data, namely td-cluster (time-delayed cluster). Under this model, objects can be coherent on different subsets of dimensions if these objects follow a certain time-delayed relationship. Such a cluster can discover the cycle time of gene expression, which is essential in revealing the gene regulatory networks. This work is missed by previous research. A novel algorithm is also presented and implemented to mine all the significant td-clusters. Experimental results from both real and synthetic microarray datasets prove its effectiveness and efficiency.


Time Sequence Gene Regulatory Network Slide Window Approach Pruning Rule Scaling Pattern 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Aggarwal, C.C., Yu, P.S.: Finding generalized projected clusters in high dimensional spaces. In: ACM SIGMOD Conference (2000)Google Scholar
  2. 2.
    Bar-Joseph, Z.: Analyzing time series gene expression data. Bioinformatics 20(16), 2493–2503 (2004)CrossRefGoogle Scholar
  3. 3.
    Chen, T., Filkov, V., Skiena, S.S.: Identifying gene regulatory networks from experimental data. In: Recomb (1999)Google Scholar
  4. 4.
    Cheng, Y., Church, G.M.: Biclustering of expression data. In: 8th Int’l Conference on Intelligent Systems for Molecular Biology (2000)Google Scholar
  5. 5.
    Cho, R.J., Campbell, M.J., Winzeler, E.A., Steinmetz, L., Conway, A., Wodicka, L., et al.: A genome-wide transcriptional analysis of the mitotic cell cycle. Mol. Cell 2, 65–73 (1998)CrossRefGoogle Scholar
  6. 6.
    Dhillon, I.S., Marcotte, E.M., Roshan, U.: Diametrical clustering for identifying anti-correlated gene clusters. Bioinformatics 19, 1612–1619 (2003)CrossRefGoogle Scholar
  7. 7.
    Eisen, M., Spellman, P., Brown, P., Botstein, D.: Cluster analysis and display of genome-wide expression patterns. Proceedings of the National Academy of Science, USA 95(25), 14863–14868 (1998)CrossRefGoogle Scholar
  8. 8.
    Erdal, S., Ozturk, O., Armbruster, D., Ferhatosmanoglu, H., Ray, W.: A time series analysis of microarray data. In: 4th IEEE Int’l Symposium on Bioinformatics and Bioengineering (May 2004)Google Scholar
  9. 9.
    Feng, J., Barbano, P.E., Mishra, B.: Time-frequency feature detection for timecourse microarray data. In: 2004 ACM Symposium on Applied Computing (2004)Google Scholar
  10. 10.
    Filkov, V., Skiena, S., Zhi, J.: Analysis techniques for microarray time-series data. In: 5th Annual Int’l Conference on Computational Biology (2001)Google Scholar
  11. 11.
    Hughes, T.R., Marton, M.J., Jones, A.R., Roberts, C.J., Stoughton, R., Armour, C.D., Bennett, H.A., Coffey, E., Dai, H., He, Y.D., Kidd, M.J., King, A.M., Meyer, M.R., Slade, D., Lum, P.Y., Stepaniants, S.B., Shoemaker, D.D., Gachotte, D., Chakraburtty, K., Simon, J., Bard, M., Friend, S.H.: Functional discovery via a compendium of expression profiles. Cell 102, 109–126 (2000)CrossRefGoogle Scholar
  12. 12.
    Liu, J., Wang, W., Yang, J.: Gene ontology friendly biclustering of expression profiles. In: Computational Systems Bioinformatics (2004)Google Scholar
  13. 13.
    Madeira, S.C., Oliveira, A.L.: Biclustering algorithms for biological data analysis: a survey. IEEE/ACM Transactions on Computational Biology and Bioinformatics 1(1), 24–45 (2004)CrossRefGoogle Scholar
  14. 14.
    Qian, J., Dolled-Filhart, M., Lin, J., Yu, H., Gerstein, M.: Beyond synexpression relationships: Local clustering of time-shifted and inverted gene expression profiles identifies new, biologically relevant interactions. Journal of Molecular Biology (2001)Google Scholar
  15. 15.
    Wang, H., Wang, W., Yang, J., Yu, P.S.: Clustering by pattern similarity in large data sets. In: ACM SIGMOD Conference (2002)Google Scholar
  16. 16.
    Yi, B.-K., Jagadish, H.V., Faloutsos, C.: Efficient retrieval of similar time sequences under time warping. In: Proc. of the 14th Intl. Conf. on Data Eng (ICDE 1998), Orlando, February 1998, pp. 201–208 (1998)Google Scholar
  17. 17.
    Yu, H., Luscombe, N., Qian, J., Gerstein, M.: Genomic analysis of gene expression relationships in transcriptional regulatory networks. Trends Genet. 19(8), 422–427 (2003)CrossRefGoogle Scholar
  18. 18.
    Zhao, L., Zaki, M.J.: Tricluster: An effective algorithm for mining coherent clusters in 3d microarray data. In: Proceedings of the 2005 ACM SIGMOD international conference on Management of data (2005)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Linjun Yin
    • 1
  • Guoren Wang
    • 1
  • Keming Mao
    • 1
  • Yuhai Zhao
    • 1
  1. 1.Northeastern UniversityChina

Personalised recommendations