Plant Gene Regulatory Networks pp 283-295 | Cite as
Computational Approaches to Study Gene Regulatory Networks
Abstract
The goal of the gene regulatory network (GRN) inference is to determine the interactions between genes given heterogeneous data capturing spatiotemporal gene expression. Since transcription underlines all cellular processes, the inference of GRN is the first step in deciphering the determinants of the dynamics of biological systems. Here, we first describe the generic steps of the inference approaches that rely on similarity measures and group the similarity measures based on the computational methodology used. For each group of similarity measures, we not only review the existing approaches but also describe specifically the detailed steps of the existing state-of-the-art algorithms.
Key words
Gene regulatory networks Gene expression profiles Similarity measures Correlation Gaussian graphical models Information theory Regression Bayesian networkReferences
- 1.Schäfer J, Strimmer K (2005) A shrinkage approach to large-scale covariance matrix estimation and implications for functional genomics. Stat Appl Genet Mol Biol 4:32CrossRefGoogle Scholar
- 2.Lee TI, Rinaldi NJ, Robert F et al (2002) Transcriptional regulatory networks in Saccharomyces cerevisiae. Science (New York, NY) 298:799–804CrossRefGoogle Scholar
- 3.Hempel S, Koseska A, Nikoloski Z et al (2011) Unraveling gene regulatory networks from time-resolved gene expression data - a measures comparison study. BMC Bioinformatics 12:292CrossRefPubMedPubMedCentralGoogle Scholar
- 4.Marbach D, Costello JC, Küffner R et al (2012) Wisdom of crowds for robust gene network inference. Nat Methods 9:796–804CrossRefPubMedPubMedCentralGoogle Scholar
- 5.Hartemink AJ (2005) Reverse engineering gene regulatory networks. Nat Biotechnol 23:554–555CrossRefPubMedGoogle Scholar
- 6.Huang Y, Tienda-Luna IM, Wang Y (2009) A survey of statistical models for reverse engineering gene regulatory networks. IEEE Signal Process Mag 26:76–97CrossRefPubMedPubMedCentralGoogle Scholar
- 7.Johnstone IM, Titterington DM (2009) Statistical challenges of high-dimensional data. Philos Trans A Math Phys Eng Sci 367:4237–4253CrossRefPubMedPubMedCentralGoogle Scholar
- 8.R Core Team (2013), R: a language and environment for statistical computing. http://www.r-project.org/
- 9.Butte J, Kohane IS (1999) Unsupervised knowledge discovery in medical databases using relevance networks. Proceedings/AMIA annual symposium, pp 711–715Google Scholar
- 10.Butte J, Kohane IS (2000) Mutual information relevance networks: functional genomic clustering using pairwise entropy measurements. Pacific symposium on biocomputing, pp 418–429Google Scholar
- 11.Song L, Langfelder P, Horvath S (2012) Comparison of co-expression measures: mutual information, correlation, and model based indices. BMC Bioinformatics 13:328CrossRefPubMedPubMedCentralGoogle Scholar
- 12.Brazhnik P, de la Fuente A, Mendes P (2002) Gene networks: how to put the function in genomics. Trends Biotechnol 20:467–472CrossRefPubMedGoogle Scholar
- 13.Han L, Zhu J (2008) Using matrix of thresholding partial correlation coefficients to infer regulatory network. Bio Systems 91:158–165CrossRefPubMedGoogle Scholar
- 14.Rice JJ, Tu Y, Stolovitzky G (2005) Reconstructing biological networks using conditional correlation analysis. Bioinformatics (Oxford) 21(6):765–773CrossRefGoogle Scholar
- 15.Yuan Y, Li C-T, Windram O (2011) Directed partial correlation: inferring large-scale gene regulatory network through induced topology disruptions. PLoS One 6:e16835CrossRefPubMedPubMedCentralGoogle Scholar
- 16.Opgen-Rhein R, Schäfer J, Strimmer K (2007) GeneNet: modeling and inferring gene networks. R package version 1Google Scholar
- 17.Schäfer J, Strimmer K (2005) An empirical Bayes approach to inferring large-scale gene association networks. Bioinformatics 21:754–764CrossRefPubMedGoogle Scholar
- 18.Steuer R, Kurths J, Daub C et al (2002) The mutual information: detecting and evaluating dependencies between variables. Bioinformatics 18(Suppl 2):S231–S240CrossRefPubMedGoogle Scholar
- 19.Moon Y, Rajagopalan B, Lall U (1995) Estimation of mutual information using kernel density estimators. Phys Rev E Stat Phys Plasmas Fluids Relat Interdiscip Topics 52(3):2318–2321PubMedGoogle Scholar
- 20.Cellucci C, Albano A, Rapp P (2005) Statistical validation of mutual information calculations: comparison of alternative numerical algorithms. Phys Rev E 71:066208CrossRefGoogle Scholar
- 21.Daub CO, Steuer R, Selbig J et al (2004) Estimating mutual information using B-spline functions—an improved similarity measure for analysing gene expression data. BMC Bioinformatics 5:118CrossRefPubMedPubMedCentralGoogle Scholar
- 22.Schreiber T (2000) Measuring information transfer. Phys Rev Lett 85:461–464CrossRefPubMedGoogle Scholar
- 23.Meyer PE, Lafitte F, Bontempi G (2008) Minet: a R/bioconductor package for inferring large transcriptional networks using mutual information. BMC Bioinformatics 9:461CrossRefPubMedPubMedCentralGoogle Scholar
- 24.Margolin A, Wang K, Lim WK et al (2006) Reverse engineering cellular networks. Nat Protoc 1:662–671CrossRefPubMedGoogle Scholar
- 25.Zoppoli P, Morganella S, Ceccarelli M (2010) TimeDelay-ARACNE: reverse engineering of gene networks from time-course data by an information theoretic approach. BMC Bioinformatics 11:154CrossRefPubMedPubMedCentralGoogle Scholar
- 26.Faith JJ, Hayete B, Thaden JT et al (2007) Large-scale mapping and validation of Escherichia coli transcriptional regulation from a compendium of expression profiles. PLoS Biol 5:e8CrossRefPubMedPubMedCentralGoogle Scholar
- 27.Friedman N, Linial M, Nachman I et al (2000) Using Bayesian networks to analyze expression data. J Comput Biology 7:601–620CrossRefGoogle Scholar
- 28.Tsamardinos I, Brown LE, Aliferis CF (2006) The max-min hill-climbing Bayesian network structure learning algorithm. Mach Learn 65:31–78CrossRefGoogle Scholar
- 29.T. Wang, C. Science, and J.W. Touchman (2004) Applying two-level simulated annealing on bayesian structure learning to infer genetic networksGoogle Scholar
- 30.Cooper GF (1990) The computational complexity of probabilistic inference using bayesian belief networks. Artif Intell 42:393–405CrossRefGoogle Scholar
- 31.Dagum P, Luby M (1993) Approximating probabilistic inference in Bayesian belief networks is NP-hard. Artif Intell 60:141–153CrossRefGoogle Scholar
- 32.D. Chickering (1996) Learning Bayesian networks is NP-complete, learning from dataGoogle Scholar
- 33.Werhli AV, Grzegorczyk M, Husmeier D (2006) Comparative evaluation of reverse engineering gene regulatory networks with relevance networks, graphical gaussian models and bayesian networks. Bioinformatics (Oxford) 22:2523–2531CrossRefGoogle Scholar
- 34.Beaumont MA, Zhang W, Balding DJ (2002) Approximate Bayesian computation in population genetics. Genetics 162:2025–2035PubMedPubMedCentralGoogle Scholar
- 35.Toni T, Stumpf MPH (2010) Simulation-based model selection for dynamical systems in systems and population biology. Bioinformatics (Oxford) 26:104–110CrossRefGoogle Scholar
- 36.Kim SY, Imoto S, Miyano S (2003) Inferring gene networks from time series microarray data using dynamic Bayesian networks. Brief Bioinform 4:228–235CrossRefPubMedGoogle Scholar
- 37.Yu J, Smith VA, Wang PP et al (2004) Advances to Bayesian network inference for generating causal networks from observational biological data. Bioinformatics (Oxford) 20:3594–3603CrossRefGoogle Scholar
- 38.Dondelinger F, Husmeier D, Lèbre S (2011) Dynamic Bayesian networks in molecular plant science: inferring gene regulatory networks from multiple gene expression time series. Euphytica 183:361–377CrossRefGoogle Scholar
- 39.Zou M, Conzen SD (2005) A new dynamic Bayesian network (DBN) approach for identifying gene regulatory networks from time course microarray data. Bioinformatics (Oxford) 21:71–79CrossRefGoogle Scholar
- 40.N Balov, P Salzman (2014) catnet: categorical Bayesian Network inference, R packageGoogle Scholar
- 41.Balov N (2013) A categorical network approach for discovering differentially expressed regulations in cancer. BMC Med Genet 6(Suppl 3):S1Google Scholar
- 42.Tibshirani R (1996) Regression shrinkage and selection via the lasso. J R Stat Soc Ser B Methodol 58:267–288Google Scholar
- 43.R. Bonneau, D.J. Reiss, P. Shannon, et al. (2006) The Inferelator: an algorithm for learning parsimonious regulatory networks from systems-biology data sets de novo. Genome Biology 7, R36.Google Scholar
- 44.Yuan M, Lin Y (2006) Model selection and estimation in regression with. J R Stat Soc B 68:49–67CrossRefGoogle Scholar
- 45.Friedman J, Hastie T, Tibshirani R (2008) Sparse inverse covariance estimation with the graphical lasso. Biostatistics (Oxford) 9:432–441CrossRefGoogle Scholar
- 46.Haury A, Mordelet F, Vera-Licona P et al (2012) TIGRESS: trustful inference of gene REgulation using stability selection. BMC Syst Biol 6:145CrossRefPubMedPubMedCentralGoogle Scholar
- 47.Villa-Vialaneix N, Vignes M, Viguerie N et al (2016) Inferring networks from multiple samples with consensus LASSO. Qual Technol Quant Manag 11:39–60CrossRefGoogle Scholar
- 48.J Ulbricht (2010) lqa: penalized likelihood inference for GLMs, R packageGoogle Scholar
- 49.Omranian N, Eloundou-Mbebi JMO, Mueller-Roeber B et al (2016) Gene regulatory network inference using fused LASSO on multiple data sets. Sci Rep 6:20533CrossRefPubMedPubMedCentralGoogle Scholar
- 50.Villaverde AF, Banga JR (2014) Reverse engineering and identification in systems biology: strategies , perspectives and challenges. J R Soc Interface 11:20130505CrossRefPubMedPubMedCentralGoogle Scholar
- 51.Lim WK, Wang K, Lefebvre C et al (2007) Comparative analysis of microarray normalization procedures: effects on reverse engineering gene networks. Bioinformatics (Oxford) 23:i282–i288CrossRefGoogle Scholar