Computational Approaches for Reconstruction of Time-Varying Biological Networks from Omics Data

Jethava, Vinay; Bhattacharyya, Chiranjib; Dubhashi, Devdatt

doi:10.1007/978-94-007-6803-1_7

Vinay Jethava³,
Chiranjib Bhattacharyya⁴ &
Devdatt Dubhashi³

3124 Accesses

Abstract

This chapter presents a survey of recent methods for reconstruction of time-varying biological networks such as gene interaction networks based on time series node observations (e.g. gene expressions) from a modeling perspective. Time series gene expression data has been extensively used for analysis of gene interaction networks, and studying the influence of regulatory relationships on different phenotypes. Traditional correlation and regression based methods have focussed on identifying a single interaction network based on time series data. However, interaction networks vary over time and in response to environmental and genetic stress during the course of the experiment. Identifying such time-varying networks promises new insight into transient interactions and their role in the biological process. A key challenge in inferring such networks is the problem of high-dimensional data i.e. the number of unknowns p is much larger than the number of observations n. We discuss the computational aspects of this problem and examine recent methods that have addressed this problem. These methods have modeled the relationship between the latent regulatory network and the observed time series data using the framework of probabilistic graphical models. A key advantage of this approach is natural interpretability of network reconstruction results; and easy incorporation of domain knowledge into the model. We also discuss methods that have addressed the problem of inferring such time-varying regulatory networks by integrating multiple sources or experiments including time series data from multiple perturbed networks. Finally, we mention software tools that implement some of the methods discussed in this chapter. With next generation sequencing promising yet further growth in publicly available -omics data, the potential of such methods is significant.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Abbreviations

PGM:: Probabilistic Graphical Model
GGM:: Gaussian Graphical Model
HMM:: Hidden Markov Model
BN:: Bayesian Network
DBN:: Dynamic Bayesian Network
MLE:: Maximum Likelihood Estimate
LASSO:: Least Absolute Shrinkage and Selection Operator
PML:: Penalized Maximum Likelihood
GLASSO:: Graphical LASSO (see LASSO)
KELLER:: KErnel-reweighted Logistic Regression
TESLA:: TEmporally Smoothed l1-regularized Logistic Regression
NETGEM:: Network Embedded Temporal GEnerative Model for gene expression data
ERGM:: Exponential Random Graph Model
PPI:: Protein-Protein Interaction

References

Ahmed A, Xing E (2009) Recovering time-varying networks of dependencies in social and biological studies. In: Proceedings of the National Academy of Sciences 106(29),11878–11883
Google Scholar
Alon U (2007) An introduction to systems biology: design principles of biological circuits, vol. 10. CRC press, Boca Raton, USA
Google Scholar
Ambroise C, Chiquet J, Matias C (2009) Inferring sparse Gaussian graphical models with latent structure. Electron J Stat 3:205–238
Article Google Scholar
Androulakis I, Yang E, Almon R (2007) Analysis of time-series gene expression data: methods, challenges, and opportunities. Annu Rev Biomed Eng 9:205–228
Article CAS PubMed Google Scholar
Arbeitman M, Furlong E, Imam F, Johnson E, Null B, Baker B, Krasnow M, Scott M, Davis R, White K (2002) Gene expression during the life cycle of drosophila melanogaster. Science 297(5590):2270–2275
Article CAS PubMed Google Scholar
Ashburner M, Ball C, Blake J, Botstein D, Butler H, Cherry J, Davis A, Dolinski K, Dwight S, Eppig J et al (2000) Gene ontology: tool for the unification of biology. Nat Genet 25(1):25
Article CAS PubMed Central PubMed Google Scholar
Banerjee O, El Ghaoui L, d’Aspremont A (2008) Model selection through sparse maximum likelihood estimation for multivariate Gaussian or binary data. J Mach Learn Res 9:485–516
Google Scholar
Barabási A, Oltvai Z (2004) Network biology: understanding the cell’s functional organization. Nat Rev Genet 5(2):101–113
Article PubMed Google Scholar
Barabasi L, Gulbahce N, Loscalso J (2011) Network medicine: a network-based approach to human disease. Nat Rev Genet 12:56–68
Article CAS PubMed Central PubMed Google Scholar
Boyd S, Vandenberghe L (2004) Convex optimization. Cambridge University Press, Cambridge
Google Scholar
Buhl S (1993) On the existence of maximum likelihood estimators for graphical Gaussian models. Scand J Stat 20(3):263–270
Google Scholar
Bühlmann P, Van De Geer S (2011) Statistics for high-dimensional data: methods theory and applications. Springer, New York Inc
Book Google Scholar
Candes E, Tao T (2007) The dantzig selector: statistical estimation when p is much larger than n. Ann Stat 35(6):2313–2351
Article Google Scholar
Carroll S (2005) Evolution at two levels: on genes and form. PLoS Biol 3(7):e245
Article PubMed Central PubMed Google Scholar
Cipollina C, van den Brink J, Daran-Lapujade P, Pronk J, Porro D, de Winde J (2008) Saccharomyces cerevisiae sfp1: at the crossroads of central metabolism and ribosome biogenesis. Microbiology 154(6):1686–1699
Article CAS PubMed Google Scholar
Clarke R, Ressom H, Wang A, Xuan J, Liu M, Gehan E, Wang Y (2008) The properties of high-dimensional data spaces: implications for exploring gene and protein expression data. Nat Rev Cancer 8(1):37–49
Article CAS PubMed Central PubMed Google Scholar
Davidson E (2001) Genomic regulatory systems: development and evolution. Academic Press, London, UK
Google Scholar
Dempster A (1972) Covariance selection. Biometrics, 28(1):157–175
Google Scholar
Donoho D (2000) High-dimensional data analysis: the curses and blessings of dimensionality. AMS Math Challenges Lect, 1–32. http://www-stat.stanford.edu/~donoho/Lectures/AMS2000/AMS2000.html
Donoho D (2006) Compressed sensing. Inf Theor, IEEE Trans on 52(4):1289–1306
Article Google Scholar
Duchi J, Shalev-Shwartz S, Singer Y, Chandra T (2008) Efficient projections onto the l 1-ball for learning in high dimensions. In: Proceedings of the 25th international conference on Machine learning, pp. 272–279. ACM
Google Scholar
Ernst J, Nau G, Bar-Joseph Z (2005) Clustering short time series gene expression data. Bioinformatics 21(suppl 1):i159–i168
Article CAS PubMed Google Scholar
Friedman J, Hastie T, Tibshirani R (2008) Sparse inverse covariance estimation with the graphical lasso. Biostatistics 9(3):432–441
Article PubMed Central PubMed Google Scholar
Hastie T, Tibshirani R, Friedman J (2008) The elements of statistical learning, 2 edn. Springer-Verlag, Springer series in statistics, 763 p
Google Scholar
Friedman N, Linial M, Nachman I, Pe’er D (2000) Using Bayesian networks to analyze expression data. J Comput Biol 7(3–4):601–620
Article CAS PubMed Google Scholar
Fu W, Song L, Xing E (2009) Dynamic mixed membership blockmodel for evolving networks. In: Proceedings of the 26th annual international conference on machine learning, pp 329–336. ACM
Google Scholar
Gitter A, Lu Y, Bar-Joseph Z (2010) Computational methods for analyzing dynamic regulatory networks. Methods in molecular biology (Clifton, NJ) 674, 419
Google Scholar
Glass L, Kaplan D (1993) Time series analysis of complex dynamics in physiology and medicine. Med Progr Technol 19:115–115
CAS Google Scholar
Guo F, Hanneke S, Fu W, Xing E (2007) Recovering temporally rewiring networks: a model-based approach. In: Proceedings of the 24th international conference on Machine learning, pp 321–328. ACM
Google Scholar
Guo J, Levina E, Michailidis G, Zhu J (2011) Joint estimation of multiple graphical models. Biometrika 98(1):1–15
Article PubMed Central PubMed Google Scholar
Hartemink A et al (2005) Reverse engineering gene regulatory networks. Nat Biotechnol 23(5):554–555
Article CAS PubMed Google Scholar
de Hoon M, Imoto S, Miyano S (2002) Inferring gene regulatory networks from time-ordered gene expression data using differential equations. In: Discovery science, 283–288. Springer
Google Scholar
Hu H, Yan X, Huang Y, Han J, Zhou X (2005) Mining coherent dense subgraphs across massive biological networks for functional discovery. Bioinformatics 21(suppl 1):i213–i221
Article CAS PubMed Google Scholar
Ideker T, Sharan R (2008) Protein networks in disease. Genome Res 18:644–652
Article CAS PubMed Central PubMed Google Scholar
Ito T, Chiba T, Ozawa R, Yoshida M, Hattori M, Sakaki Y (2001) A comprehensive two-hybrid analysis to explore the yeast protein interactome. In: Proceedings of the National Academy of Sciences 98(8):4569
CAS Google Scholar
Jethava V, Bhattacharyya C, Dubhashi D, Vemuri G (2011) Netgem:network embedded temporal generative model for gene expression data. BMC Bioinform 12(1):327
Article CAS Google Scholar
Kim S, Imoto S, Miyano S (2004) Dynamic Bayesian network and nonparametric regression for nonlinear modeling of gene networks from time series gene expression data. Biosystems 75(1):57–65
Article CAS PubMed Google Scholar
Koh K, Kim S, Boyd S (2007) An interior-point method for large-scale l1-regularized logistic regression. J Mach Learn Res 8(8):1519–1555
Google Scholar
Koller D, Friedman N (2009) Probabilistic graphical models: principles and techniques. The MIT Press, Cambridge, MA
Google Scholar
Lam C, Fan J (2009) Sparsistency and rates of convergence in large covariance matrix estimation. Ann Stat 37(6B), 4254
Google Scholar
Lauritzen S (1996) Graphical models, vol 17. Oxford University Press, USA
Google Scholar
Lin C, Weng R, Keerthi S (2008) Trust region newton method for logistic regression. J Mach Learn Res 9:627–650
Google Scholar
Luscombe N, Babu M, Yu H, Snyder M, Teichmann S, Gerstein M (2004) Genomic analysis of regulatory network dynamics reveals large topological changes. Nature 431(7006):308–312
Article CAS PubMed Google Scholar
Ma S, Gong Q, Bohnert H (2007) An arabidopsis gene network based on the graphical Gaussian model. Genome Res 17(11):1614–1625
Article CAS PubMed Central PubMed Google Scholar
Meinshausen N, Bühlmann P (2006) High-dimensional graphs and variable selection with the lasso. Ann Stat 34(3):1436–1462
Article Google Scholar
Mewes H, Frishman D, Gruber C, Geier B, Haase D, Kaps A, Lemcke K, Mannhaupt G, Pfeiffer F, Schüller C et al (2000) Mips: a database for genomes and protein sequences. Nucleic Acids Res 28(1):37–40
Article CAS PubMed Central PubMed Google Scholar
Parisi G, Shankar R (1988) Statistical field theory. Phys Today 41:110
Article Google Scholar
Peer D, Regev A, Elidan G, Friedman N (2001) Inferring subnetworks from perturbed expression profiles. Bioinformatics 17(suppl 1), S215–S224
Google Scholar
Perrin B, Ralaivola L, Mazurie A, Bottani S, Mallet J, dAlche Buc F (2003) Gene networks inference using dynamic Bayesian networks. Bioinformatics 19(suppl 2), ii138-ii148 .
Google Scholar
Przytycka T, Singh M, Slonim D (2010) Toward the dynamic interactome: it’s about time. Briefings Bioinform 11(1):15–29
Article CAS Google Scholar
Ravikumar P, Wainwright M, Lafferty J (2010) High-dimensional ising model selection using 1-regularized logistic regression. Ann Stat 38(3):1287–1319
Article Google Scholar
Robins G, Pattison P, Kalish Y, Lusher D (2007) An introduction to exponential random graph \(p^*\) models for social networks. Soc Netw 29(2):173–191
Article Google Scholar
Rothman A, Bickel P, Levina E, Zhu J (2008) Sparse permutation invariant covariance estimation. Electron J Stat 2:494–515
Article Google Scholar
Sachs K, Perez O, Pe’er D, Lauffenburger D, Nolan G (2005) Causal protein-signaling networks derived from multiparameter single-cell data. Science’s STKE 308(5721), 523
Google Scholar
Schadt E (2009) Molecular networks as sensors and drivers of common human diseases. Nature 416:218–223
Article Google Scholar
Schäfer J, Strimmer K (2005) An empirical Bayes approach to inferring large-scale gene association networks. Bioinformatics 21(6):754–764
Article PubMed Google Scholar
Schliep A, Schönhuth A, Steinhoff C (2003) Using hidden Markov models to analyze gene expression time course data. Bioinformatics 19(suppl 1):i255–i263
Article PubMed Google Scholar
Shermin A, Orgun M (2009) Using dynamic bayesian networks to infer gene regulatory networks from expression profiles. In: Proceedings of the 2009 ACM symposium on applied computing, 799–803. ACM
Google Scholar
Song L, Kolar M, Xing E (2009) Keller: estimating time-varying interactions between genes. Bioinformatics 25(12):i128–i136
Article CAS PubMed Central PubMed Google Scholar
Soranzo N, Bianconi G, Altafini C (2007) Comparing association network algorithms for reverse engineering of large-scale gene regulatory networks: synthetic versus real data. Bioinformatics 23(13):1640–1647
Article CAS PubMed Google Scholar
Speed T, Kiiveri H (1986) Gaussian Markov distributions over finite graphs. Ann Stat 14(1):138–150
Article Google Scholar
Tegner J, Yeung M, Hasty J, Collins J (2003) Reverse engineering gene networks: integrating genetic perturbations with dynamical modeling. In: Proceedings of the National Academy of Sciences 100(10):5944
CAS Google Scholar
Tibshirani R (1996) Regression shrinkage and selection via the lasso. J Roy Stat Soc. Series B (Methodological) 58(1):267–288
Google Scholar
Uetz P, Giot L, Cagney G, Mansfield T, Judson R, Knight J, Lockshon D, Narayan V, Srinivasan M, Pochart P et al (2000) A comprehensive analysis of protein-protein interactions in saccharomyces cerevisiae. Nature 403(6770):623–627
Article CAS PubMed Google Scholar
Wainwright M, Ravikumar P, Lafferty J (2007) High-dimensional graphical model selection using 1^~ 1-regularized logistic regression. Advances in neural information processing systems 19:1465
Google Scholar
Werhli A, Grzegorczyk M, Husmeier D (2006) Comparative evaluation of reverse engineering gene regulatory networks with relevance networks, graphical Gaussian models and Bayesian networks. Bioinformatics 22(20):2523–2531
Article CAS PubMed Google Scholar
Wille A, Zimmermann P, Vranová E, Fürholz A, Laule O, Bleuler S, Hennig L, Prelic A, Von Rohr P, Thiele L et al (2004) Sparse graphical Gaussian modeling of the isoprenoid gene network in arabidopsis thaliana. Genome Biol 5(11):R92
Article PubMed Central PubMed Google Scholar
Workman C, Mak H, McCuine S, Tagne J, Agarwal M, Ozier O, Begley T, Samson L, Ideker T (2006) A systems approach to mapping dna damage response pathways. Science’s STKE 312(5776):1054
CAS Google Scholar
Yeang C, Mak H, McCuine S, Workman C, Jaakkola T, Ideker T (2005) Validation and refinement of gene-regulatory pathways on a network of physical interactions. Genome Biol 6(7):R62
Article PubMed Central PubMed Google Scholar
Yeung M, Tegnér J, Collins J (2002) Reverse engineering gene networks using singular value decomposition and robust regression. In: Proceedings of the National Academy of Sciences 99(9):6163
CAS Google Scholar
Yuan M, Lin Y (2007) Model selection and estimation in the Gaussian graphical model. Biometrika 94(1):19–35
Article Google Scholar
Zhou S, Lafferty J, Wasserman L (2010) Time varying undirected graphs. Mach Learn 80(2):295–319
Article Google Scholar
Zou M, Conzen S (2005) A new dynamic Bayesian network (dbn) approach for identifying gene regulatory networks from time course microarray data. Bioinformatics 21(1):71–79
Article CAS PubMed Google Scholar

Download references

Author information

Authors and Affiliations

Chalmers University of Technology, Göteborg, Sweden
Vinay Jethava & Devdatt Dubhashi
Indian Institute of Science, Bangalore, India
Chiranjib Bhattacharyya

Authors

Vinay Jethava
View author publications
You can also search for this author in PubMed Google Scholar
Chiranjib Bhattacharyya
View author publications
You can also search for this author in PubMed Google Scholar
Devdatt Dubhashi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Vinay Jethava .

Editor information

Editors and Affiliations

Chemical and Biological Engineering, Vanderbilt University, Nashville, TN, USA
Aleš Prokop
Research Group on Process Network Engineering, Kaposvár University, Kaposvár, Hungary
Béla Csukás

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Jethava, V., Bhattacharyya, C., Dubhashi, D. (2013). Computational Approaches for Reconstruction of Time-Varying Biological Networks from Omics Data. In: Prokop, A., Csukás, B. (eds) Systems Biology. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-6803-1_7

Download citation

DOI: https://doi.org/10.1007/978-94-007-6803-1_7
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-6802-4
Online ISBN: 978-94-007-6803-1
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)

Publish with us

Policies and ethics