Integrating Microarray Data and GRNs

  • L. KoumakisEmail author
  • G. Potamias
  • M. Tsiknakis
  • M. Zervakis
  • V. Moustakis
Part of the Methods in Molecular Biology book series (MIMB, volume 1375)


With the completion of the Human Genome Project and the emergence of high-throughput technologies, a vast amount of molecular and biological data are being produced. Two of the most important and significant data sources come from microarray gene-expression experiments and respective databanks (e,g., Gene Expression Omnibus—GEO (, and from molecular pathways and Gene Regulatory Networks (GRNs) stored and curated in public (e.g., Kyoto Encyclopedia of Genes and Genomes—KEGG (, Reactome ( as well as in commercial repositories (e.g., Ingenuity IPA ( The association of these two sources aims to give new insight in disease understanding and reveal new molecular targets in the treatment of specific phenotypes.

Three major research lines and respective efforts that try to utilize and combine data from both of these sources could be identified, namely: (1) de novo reconstruction of GRNs, (2) identification of Gene-signatures, and (3) identification of differentially expressed GRN functional paths (i.e., sub-GRN paths that distinguish between different phenotypes). In this chapter, we give an overview of the existing methods that support the different types of gene-expression and GRN integration with a focus on methodologies that aim to identify phenotype-discriminant GRNs or subnetworks, and we also present our methodology.


Microarray Gene expression Gene regulatory networks Pathways Functional pathways Bioinformatics Systems biology 



This work was supported by the European Community’s Seventh Framework Programme (FP7/2007-2013) under grant agreement N° 270089 and by the European Union (European Social Fund—ESF) and by the European Union (European Social Fund—ESF) and Greek national funds through the Operational Program “Education and Lifelong Learning” of the National Strategic Reference Framework (NSRF)—Research Funding Program: Heracleitus II Investing in knowledge society through the European Social Fund.


  1. 1.
    Brown PO, Botstein D (1999) Exploring the new world of the genome with DNA microarrays. Nat Genet 21:33–37CrossRefPubMedGoogle Scholar
  2. 2.
    Huang Y, Zhao Z, Xu H, Shyr Y, Zhang B (2012) Advances in systems biology: computational algorithms and applications. BMC Syst Biol 6(3)Google Scholar
  3. 3.
    Hung J-H, Yang T-H, Zhenjun H, Weng Z, DeLisi C (2012) Gene set enrichment analysis: performance evaluation and usage guidelines. Brief Bioinform 13(3):281–291CrossRefPubMedPubMedCentralGoogle Scholar
  4. 4.
    Heckera M, Lambecka S, Toepferb S, van Somerenc E, Guthke R (2009) Gene regulatory network inference: data integration in dynamic models—a review. Biosystems 96(1):86–103CrossRefGoogle Scholar
  5. 5.
    Ein-Dor L, Kela I, Getz G, Givol D, Domany E (2005) Outcome signature genes in breast cancer: is there a unique set? Bioinformatics 21(2):171–178CrossRefPubMedGoogle Scholar
  6. 6.
    Iwamoto T, Pusztai L (2010) Predicting prognosis of breast cancer with gene signatures: are we lost in a sea of data? Genome Med 2(11):81CrossRefPubMedPubMedCentralGoogle Scholar
  7. 7.
    Shannon CEA (1948) Mathematical theory of communication. Bell Sys Tech J 27(3):379–423CrossRefGoogle Scholar
  8. 8.
    Potamias G, Koumakis L, Moustakis V (2004) Gene selection via discretized gene-expression profiles and greedy feature-elimination. Meth Appl Artif Intelligence 3025:256–266CrossRefGoogle Scholar
  9. 9.
    Li L, Weinberg CR, Darden TA, Pedersen LG (2001) Gene selection for sample classification based on gene expression data: study of sensitivity to choice of parameters of the GA/KNN method. Bioinformatics 17(12):1131–1142CrossRefPubMedGoogle Scholar
  10. 10.
    Kanehisa M, Araki M, Goto S, Hattori M, Hirakawa M, Itoh M, Yamanishi Y (2008) KEGG for linking genomes to life and the environment. Nucleic Acids Res 36:480–484CrossRefGoogle Scholar
  11. 11.
    Ott MA, Gert V (2006) Correcting ligands, metabolites, and pathways. BMC Bioinformatics 7(1):517CrossRefPubMedPubMedCentralGoogle Scholar
  12. 12.
    Khatri P, Draghici S (2005) Ontological analysis of gene expression data: current tools, limitations, and open problems. Bioinformatics 21:3587–3595CrossRefPubMedPubMedCentralGoogle Scholar
  13. 13.
    Kauffman SA (1993) The origins of order: self-organization and selection in evolution. Oxford University Press, New YorkGoogle Scholar
  14. 14.
    Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Ian H (2009) The WEKA data mining software: an update. SIGKDD Explorations 11(1)Google Scholar
  15. 15.
    Sutherland RL (2011) Endocrine resistance in breast cancer: new roles for ErbB3 and ErbB4. Breast Cancer Res 13(3):106CrossRefPubMedPubMedCentralGoogle Scholar
  16. 16.
    Hutcheson IR et al (2007) Heregulin beta1 drives gefitinib-resistant growth and invasion in tamoxifen-resistant MCF-7 breast cancer cells. Breast Cancer Res 9(4):50CrossRefGoogle Scholar
  17. 17.
    Geistlinger L, Csaba G, Küffner R, Mulde N, Zimmer R (2011) From sets to graphs towards a realistic enrichment analysis of transcriptomic systems. Bioinformatics 27(13):366–373CrossRefGoogle Scholar
  18. 18.
    Tarca AL, Draghici S, Khatri P, Hassan SS, Mittal P, Kim JS, Kim CJ, Kusanovic JP, Romero R (2009) A novel signaling pathway impact analysis. Bioinformatics 25(1):75–82CrossRefPubMedPubMedCentralGoogle Scholar
  19. 19.
    Judeh T, Johnson C, Kumar A, Zhu D (2013) TEAK: Topology Enrichment Analysis frameworK for detecting activated biological subpathways. Nucleic Acids Res 41(1):1425–1437CrossRefPubMedPubMedCentralGoogle Scholar
  20. 20.
    Nam S, Chang HR, Kim KT et al (2014) PATHOME: an algorithm for accurately detecting differentially expressed subpathways. Oncogene 33(41):4941–4951CrossRefPubMedPubMedCentralGoogle Scholar

Copyright information

© Springer Science+Business Media New York 2015

Authors and Affiliations

  • L. Koumakis
    • 1
    • 2
    Email author
  • G. Potamias
    • 2
  • M. Tsiknakis
    • 2
    • 3
  • M. Zervakis
    • 4
  • V. Moustakis
    • 1
  1. 1.Department of Production and Management EngineeringTechnical University of CreteChaniaGreece
  2. 2.Foundation for Research and Technology—Hellas (FORTH)Institute of Computer ScienceHeraklionGreece
  3. 3.Department of Applied Informatics and MultimediaTechnological Educational InstituteHeraklionGreece
  4. 4.Department of Electronic and Computer EngineeringTechnical University of CreteChaniaGreece

Personalised recommendations