Abstract
Identification of models of gene regulatory networks is sensitive to the amount of data used as input. Considering the substantial costs in conducting experiments, it is of value to have an estimate of the amount of data required to infer the network structure. To minimize wasted resources, it is also beneficial to know which data are necessary to identify the network. Knowledge of the data and knowledge of the terms in polynomial models are often required a priori in model identification. In applications, it is unlikely that the structure of a polynomial model will be known, which may force data sets to be unnecessarily large in order to identify a model. Furthermore, none of the known results provides any strategy for constructing data sets to uniquely identify a model. We provide a specialization of an existing criterion for deciding when a set of data points identifies a minimal polynomial model when its monomial terms have been specified. Then, we relax the requirement of the knowledge of the monomials and present results for model identification given only the data. Finally, we present a method for constructing data sets that identify minimal polynomial models.
Similar content being viewed by others
References
Adams W, Loustaunau P (1994) An introduction to Gröbner bases, graduate studies in mathematics. American Mathematical Society, Providence
Albert R, Othmer H (2003) The topology of the regulatory interactions predicts the expression pattern of the segment polarity genes in Drosophila melanogaster. J Theor Biol 223:1–18
Allman E, Rhodes J (2003) Mathematical models in biology: an introduction. Cambridge University Press, Cambridge
Cheng D, Li Z, Qi H (2010) Realization of boolean control networks. Automatica 46:62–69
Cheng D, Qi H, Li Z (2011) Model construction of boolean networks via observed data. IEEE Trans Neural Netw 22(4):525
Cheng D, Zhao Y (2011) Identification of boolean control networks. Automatica 47(4):702–710
Cho K-H, Choo S-M, Jung SH, Kim J-R, Choi H-S, Kim J (2007) Reverse engineering of gene regulatory networks. IET Syst Biol 1(3):149–163
Cox D, Little J, O’Shea D (1997) Ideals, varieties, and algorithms. Springer Verlag, New York
de Jong H (2002) Modeling and simulation of genetic regulatory systems: a literature review. J Comput Biol 9:67–103
Davidson E et al (2002) A genomic regulatory network for development. Science 295(5560):1669–1678
D’haeseleer P, Liang S, Somogyi R (2000) Genetic network inference: from co-expression clustering to reverse engineering. Bioinformatics 16(8):707–726
Dimitrova ES (2006) Polynomial models for systems biology: data discretization and term order effect on dynamics. PhD thesis, Virginia Polytechnic Institute and State University
Dimitrova ES, Garcia-Puente L, Hinkelmann F, Jarrah A, Laubenbacher R, Stigler B, Stillman M, Vera-Licona P (2011) Parameter estimation for boolean models of biological networks. Theor Comput Sci 412(26):2816–2826
Dimitrova ES, Jarrah A, Laubenbacher R, Stigler B (2007) A Gröbner fan method for biochemical network modeling. In: Proceedings of International Symposium on Symbolic and Algebraic Computation (ISSAC), pp 122–126
Dimitrova ES, McGee J, Laubenbacher R, Vera Licona P (2010) Discretization of time series data. J Comput Biol 17(6):853–868
Dimitrova ES, Stigler B (2013) Inferring the topology of gene regulatory networks: an algebraic approach to reverse engineering. In: Robeva R, Hodge T (eds) Mathematical concepts and methods in modernbiology. Using modern discrete models, 1st edn. Academic Press, Waltham
Eisenbud D (1995) Introduction to commutative algebra with a view towards algebraic geometry. Graduate texts in mathematics. Springer, New York
Hecker M, Lambeck S, Toepfer S, van Someren E, Guthke R (2009) Gene regulatory network inference: data integration in dynamic models: a review. Biosystems 96(1):86–103
Hickman G, Hodgman T (2009) Inference of gene regulatory networks using boolean-network inference methods. J Bioinform Comput Biol 7(6):1013–1029
Kauffman S (1969) Metabolic stability and epigenesis in randomly constructed genetic nets. J Theor Biol 22:437–467
Laubenbacher R, Pareigis B (2003) Decomposition and simulation of sequential dynamical systems. Adv Appl Math 30:655–678
Laubenbacher R, Stigler B (2004) A computational algebra approach to the reverse engineering of gene regulatory networks. J Theor Biol 229(4):523–537
Li F, Long T, Lu Y, Ouyang Q, Tang C (2004) The yeast cell-cycle network is robustly designed. PANS 11(14):4781–4786
Pachter L, Sturmfels B (2005) Algebraic statistics for computational biology. Cambridge University Press, Cambridge
Prill RJ, Saez-Rodriguez J, Alexopoulos LG, Sorger PK, Stolovitzky G (2011) Crowdsourcing network inference: the DREAM predictive signaling network challenge. Sci Signal 4(189):mr7
Robbiano L (1998) Gröbner bases and statistics. In: Buchberger B, Winkler F (eds) Gröbner Bases and Applications, volume 251 of London Mathematical Society Lecture Notes Series. Cambridge University Press, New York, pp 179–204
Robeva R, Hodge T (eds) (2013) Mathematical concepts and methods in modern biology: using modern discrete models. Academic Press, Waltham
Samal A, Jain D (2008) The regulatory network of E. coli metabolism as a boolean dynamical system exhibits both homeostasis and flexibility of response. BMC Syst Biol 2(21):1
Stigler B, Chamberlin H (2012) A regulatory network modeled from wild-type gene expression data guides functional predictions in Caenorhabditis elegans development. BMC Syst Biol 6:77
Thomas R (1991) Regulatory networks seen as asynchronous automata: a logical description. J Theor Biol 153:1–23
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Dimitrova, E., Stigler, B. Data Identification for Improving Gene Network Inference using Computational Algebra. Bull Math Biol 76, 2923–2940 (2014). https://doi.org/10.1007/s11538-014-9979-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11538-014-9979-x