Functional Analysis of OMICs Data and Small Molecule Compounds in an Integrated “Knowledge-Based” Platform

  • Yuri Nikolsky
  • Eugene Kirillov
  • Roman Zuev
  • Eugene Rakhmatulin
  • Tatiana Nikolskaya
Part of the Methods in Molecular Biology book series (MIMB, volume 563)


Analysis of microarray, SNPs, proteomics, and other high-throughput (OMICs) data is challenging because of its biological complexity and high level of technical and biological noise. One way to deal with both problems is to perform analysis with a high-fidelity annotated knowledge base of protein interactions, pathways, and functional ontologies. This knowledge base has to be structured in a computer-readable format and must include software tools for managing experimental data, analysis, and reporting. Here we present MetaDiscovery, an integrated platform for functional data analysis which is being developed at GeneGo for the past 8 years. On the content side, MetaDiscovery encompasses a comprehensive database of protein interactions of different types, pathways, network models and 10 functional ontologies covering human, mouse, and rat proteins. The analytical toolkit includes tools for gene/protein list enrichment analysis, statistical “interactome” tool for identification of over- and under-connected proteins in the data set, and a network module made up of network generation algorithms and filters. The suite also features MetaSearch, an application for combinatorial search of the database content, as well as a Java-based tool called MapEditor for drawing and editing custom pathway maps. Applications of MetaDiscovery include identification of potential biomarkers and drug targets, pathway hypothesis generation, analysis of biological effects for novel small molecule compounds, and clinical applications (analysis of large cohorts of patients and translational and personalized medicine).

Key words

Pathway analysis functional analysis systems biology signaling and metabolic networks biological networks “knowledge-based” platform interactome 


  1. 1.
    Salwinski L, Eisenberg D. Computational methods of analysis of protein–protein interactions. Curr Opin Struct Biol, 2003, 13, 377–382PubMedCrossRefGoogle Scholar
  2. 2.
    Kemmeren P, et al. Protein interaction verification and functional annotation by integrated analysis of genome-scale data. Mol Cell, 2002, 9, 1133–1143PubMedCrossRefGoogle Scholar
  3. 3.
    Subramaniam A, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A, 2005, 102(43),15545–15550CrossRefGoogle Scholar
  4. 4.
    Klebanov L, Glazko G, Salzman P, Yakovlev A, Xiao Y. A multivariate extension of the gene set enrichment analysis. J Bioinform Comput Biol, 2007, 5(5), 1139–1153PubMedCrossRefGoogle Scholar
  5. 5.
    Yook SH, Oltvai ZN, Barabási AL. Functional and topological characterization of protein interaction networks. Proteomics, 2004, 4(4), 928–942PubMedCrossRefGoogle Scholar
  6. 6.
    Barabasi AL, Oltvai Z. Network biology: understanding the cell’s functional organization. Nat Rev Gen, 2004, 5, 101CrossRefGoogle Scholar
  7. 7.
    Bader SKS, Gavin AC. Interaction networks for systems biology. FEBS Lett, 2008, 9, 582(8), 1220–1224.Google Scholar
  8. 8.
    Yip KY, Yu H, Kim PM, Schultz M, Gerstein M. The tYNA platform for comparative interactomics: a web tool for managing, comparing and mining multiple networks. Bioinformatics, 2006, 22(23), 2968–2970PubMedCrossRefGoogle Scholar
  9. 9.
    Yu H, Zhu X, Greenbaum D, Karro J, Gerstein M. ToPNet–an application for interactive analysis of expression data and biological networks. Bioinformatics, 2004, 20(9), 1470–1471CrossRefGoogle Scholar
  10. 10.
    Broder A, Kumar R, Maghoul F, Raghavan P, Rajagopalan S, Stata R, Tomkins A, Wiener J. Graph structure in the web. Computer Networks, 2000, 33, 309CrossRefGoogle Scholar
  11. 11.
    Wood LA, et al. The genomic landscapes of human breast and colorectal cancers. Science, 2007, 16, 318, 1108–1113CrossRefGoogle Scholar
  12. 12.
    Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res, 2003, 13(11), 2498–2504PubMedCrossRefGoogle Scholar
  13. 13.
    Galperin MY, Cochrane GR. Nucleic acids research annual database issue and the NAR online molecular biology database collection in 2009. Nucleic Acids Res, 2009 Jan, 37 (Database issue), D1-4. Epub 2008 Nov 25.Google Scholar

Copyright information

© Humana Press, a part of Springer Science+Business Media, LLC 2009

Authors and Affiliations

  • Yuri Nikolsky
    • 1
  • Eugene Kirillov
    • 2
  • Roman Zuev
    • 2
  • Eugene Rakhmatulin
    • 2
  • Tatiana Nikolskaya
    • 2
  1. 1.GeneGo, Inc.Saint JosephUSA
  2. 2.Vavilov Institute for General Genetics, Russian Academy of SciencesMoscowRussia

Personalised recommendations