Annals of Operations Research

, Volume 188, Issue 1, pp 77–110 | Cite as

Analysing DNA microarray data using Boolean techniques

Article
  • 121 Downloads

Abstract

We address in this manuscript a problem arising in molecular biology, namely a problem of discovering dependencies among gene expression levels. The problem is formulated in mathematical terms as a search for a fully defined three valued function defined on three valued variables which is partially specified by the DNA microarray measurements. This formulation as well as our solution methods are strongly motivated by results in the area of logical analysis of data (LAD) and in the area of partially defined Boolean functions (pdBfs), in particular by procedures for finding fully defined extensions of pdBfs. On one hand we present several algorithms which (under some assumptions) construct the desired three valued functional extension of the input data, and on the other hand we derive several proofs showing that (under different assumptions) finding such an extension is NP-hard.

Keywords

Partially defined Boolean function Extension problem Gene regulatory networks DNA microarray 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Akutsu, T., Miyano, S., & Kuhara, S. (1999). Identification of genetic networks from a small number of gene expression patterns under the Boolean network model. In Pac. symp. biocomput. (pp. 17–28). Google Scholar
  2. Akutsu, T., Miyano, S., & Kuhara, S. (2000a). Algorithms for identifying Boolean networks and related biological networks based on matrix multiplication and fingerprint function. In RECOMB ’00: Proceedings of the fourth annual international conference on computational molecular biology (pp. 8–14). New York: ACM. doi: 10.1145/332306.332317. CrossRefGoogle Scholar
  3. Akutsu, T., Miyano, S., & Kuhara, S. (2000b). Inferring qualitative relations in genetic networks and metabolic pathways. Bioinformatics, 16(8), 727–734. CrossRefGoogle Scholar
  4. Akutsu, T., Kuhara, S., Maruyama, O., & Miyano, S. (2003). Identification of genetic networks by strategic gene disruptions and gene over expressions under a Boolean model. Theoretical Computer Science, 298(1), 235–251. doi: 10.1016/S0304-3975(02)00425-5. CrossRefGoogle Scholar
  5. Alexe, G., Alexe, S., Bonates, T. O., & Kogan, A. (2007). Logical analysis of data—the vision of Peter L. Hammer. Annals of Mathematics and Artificial Intelligence, 49(1–4), 265–312. doi: 10.1007/s10472-007-9065-2. CrossRefGoogle Scholar
  6. Boros, E., Ibaraki, T., & Makino, K. (1996). Extensions of partially defined Boolean functions with missing data (Tech. Rep. 6-96, RUTCOR Research Report RRR). Rutgers University, New Brunswick, NJ. Google Scholar
  7. Boros, E., Ibaraki, T., & Makino, K. (1998). Error-free and best-fit extensions of partially defined Boolean functions. Information and Computation, 140, 254–283. CrossRefGoogle Scholar
  8. Boros, E., Ibaraki, T., & Makino, K. (1999). Fully consistent extensions of partially defined Boolean functions with missing bits (Tech. Rep. 99-30, RUTCOR Research Report RRR). Google Scholar
  9. Bussemaker, H., Li, H., & Siggia, E. (2001). Regulatory element detection using correlation with expression. Nature Genetics, 2(27), 167–171. CrossRefGoogle Scholar
  10. Chen, T., Filkov, V., & Skiena, S. S. (1999). Identifying gene regulatory networks from experimental data. In RECOMB ’99: Proceedings of the third annual international conference on computational molecular biology (pp. 94–103). New York: ACM. doi: 10.1145/299432.299462. CrossRefGoogle Scholar
  11. Cormen, T. H., Leiserson, C. E., Rivest, R. L., & Stein, C. (2001). Introduction to algorithms (2nd ed.). Cambridge: The MIT Press. Google Scholar
  12. Crama, Y., Hammer, P. L., & Ibaraki, T. (1988). Cause-effect relationships and partially defined Boolean functions. Annals of Operation Research, 16(1–4), 299–325. CrossRefGoogle Scholar
  13. DeRisi, J., Lyer, V., & Brown, P. (1997). Exploring the metabolic and genetic control of gene expression on a genomic scale. Science, 278. Google Scholar
  14. D’Haeseleer, P., Wen, X., Fuhrman, S., & Somogyi, R. (1999). Linear modeling of mRNA expression levels during CNS development and injury. In Pac. symp. biocomput. (pp. 41–52). Google Scholar
  15. Friedman, N., Linial, M., Nachman, I., & Pe’er, D. (2000). Using Bayesian networks to analyze expression data. Journal of Computational Biology, 7(3–4), 601–620. CrossRefGoogle Scholar
  16. Garey, M., & Johnson, D. (1979). Computers and intractability: a guide to the theory of NP-completeness. San Francisco: Freeman. Google Scholar
  17. Imade, H., Morishita, R., Ono, I., & Okamoto, M. (2003). A grid-oriented genetic algorithm for estimating genetic networks by s-systems. In Proc. SICE annual conference (Vol. 3, pp. 2750–2755). Google Scholar
  18. Kauffman, S. A. (1993). The origins of order: self-organization and selection in evolution. London: Oxford University Press. Google Scholar
  19. Liang, S., Fuhrman, S., & Somogyi, R. (1998). REVEAL, a general reverse engineering algorithm for inference of genetic network architectures. In Pacific symposium on biocomputing (Vol. 3, pp. 18–29). Google Scholar
  20. Murphy, K., & Mian, S. (1999). Modelling gene expression data using dynamic Bayesian networks. (Tech. rep.). University of California, Berkeley. http://citeseer.ist.psu.edu/murphy99modelling.html.
  21. Schena, M., Shalon, D., Davis, R. W., & Brown, P. (1995). Quantitative monitoring of gene expression pattern with a complementing DNA microarray. Science, 270, 467–470. CrossRefGoogle Scholar
  22. Shmulevich, I., Dougherty, E. R., Kim, S., & Zhang, W. (2002). Probabilistic Boolean networks: a rule-based uncertainty model for gene regulatory networks. Bioinformatics, 18(2), 261–274. CrossRefGoogle Scholar
  23. Toh, H., & Horimoto, K. (2002). Inference of a genetic network by a combined approach of cluster analysis and graphical Gaussian modeling. Bioinformatics, 18(2), 287–297. CrossRefGoogle Scholar
  24. van Someren, E. P., Wessels, L. F. A., & Reinders, M. J. T. (2000). Linear modeling of genetic networks from experimental data. In Proceedings of the eighth international conference on intelligent systems for molecular biology (pp. 355–366). Menlo Park: AAAI Press. Google Scholar
  25. Weaver, D. C., Workman, C. T., & Stromo, G. D. (1999). Modeling regulatory networks with weight matrices. In Pacific symposium on biocomputing. citeseer.ist.psu.edu/weaver99modeling.html.

Copyright information

© Springer Science+Business Media, LLC 2010

Authors and Affiliations

  1. 1.Institute of Finance and Administration (VŠFS)PragueCzech Republic
  2. 2.Department of Theoretical Computer Science and Mathematical Logic, Faculty of Mathematics and PhysicsCharles UniversityPragueCzech Republic

Personalised recommendations