Identification of K-Tolerance Regulatory Modules in Time Series Gene Expression Data Using a Biclustering Algorithm

  • Conference paper
Active Media Technology (AMT 2013)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 8210)

Abstract

Biclustering in general remains an intractable problem. In time series expression data, however, the biclusters can be restricted to those with contiguous columns, and this restriction makes the problem tractable. Existing contiguous-column biclustering algorithms can only find biclusters in which every row has the same value in each column, with no error tolerance. As a result, they overlook some patterns during clustering. We propose a suffix-tree-based algorithm that allows biclusters to have inconsistencies in at most k contiguous columns. This can reveal previously undiscoverable biclusters. Even with this additional feature, our algorithm still has a tractable run time.
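To make the idea of error tolerance concrete, the following is a minimal brute-force sketch, not the suffix-tree algorithm proposed in the paper. It assumes the expression matrix has already been discretized into symbols (here U, D and N, a hypothetical alphabet), and it reads "k-tolerance" as "at most k mismatching columns inside a contiguous column window", which is one possible interpretation of the abstract. The function names and the toy data are illustrative only.

```python
from typing import List, Sequence


def mismatches(a: Sequence[str], b: Sequence[str]) -> int:
    """Count positions where two equal-length symbol sequences differ."""
    return sum(x != y for x, y in zip(a, b))


def k_tolerant_rows(matrix: List[str], ref_row: int,
                    start: int, end: int, k: int) -> List[int]:
    """Return indices of rows whose symbols in columns [start, end)
    differ from those of ref_row in at most k positions.

    Assumption: tolerance is measured as a mismatch count within the
    contiguous window; the paper's exact definition may differ.
    """
    pattern = matrix[ref_row][start:end]
    return [i for i, row in enumerate(matrix)
            if mismatches(row[start:end], pattern) <= k]


# Hypothetical discretized matrix: rows are genes, columns are
# consecutive time points (U = up, D = down, N = no change).
genes = [
    "UUDNU",
    "UUDND",
    "UNDNU",
    "DDUUN",
]

# Columns 0..3 of gene 0 form the reference pattern "UUDN".
exact = k_tolerant_rows(genes, ref_row=0, start=0, end=4, k=0)
tolerant = k_tolerant_rows(genes, ref_row=0, start=0, end=4, k=1)

print(exact)     # [0, 1]
print(tolerant)  # [0, 1, 2]
```

With k = 0 only the rows that match the reference pattern exactly are grouped, which corresponds to the behavior of the existing algorithms described above. With k = 1 the third gene, which differs in a single column, joins the bicluster. This sketch checks one window and one reference row at a time, so it says nothing about the run time the paper claims for its suffix-tree approach.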

Copyright information

© 2013 Springer International Publishing Switzerland

About this paper

Cite this paper

Phukhachee, T., Maneewongvatana, S. (2013). Identification of K-Tolerance Regulatory Modules in Time Series Gene Expression Data Using a Biclustering Algorithm. In: Yoshida, T., Kou, G., Skowron, A., Cao, J., Hacid, H., Zhong, N. (eds) Active Media Technology. AMT 2013. Lecture Notes in Computer Science, vol 8210. Springer, Cham. https://doi.org/10.1007/978-3-319-02750-0_15

  • DOI: https://doi.org/10.1007/978-3-319-02750-0_15

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-02749-4

  • Online ISBN: 978-3-319-02750-0

  • eBook Packages: Computer Science, Computer Science (R0)
