Semantic Data Integration and Knowledge Management to Represent Biological Network Associations

  • Sascha Losko
  • Klaus Heumann
Part of the Methods in Molecular Biology book series (MIMB, volume 563)

Abstract

The vast quantities of information generated by academic and industrial research groups are reflected in a rapidly growing body of scientific literature and exponentially expanding resources of formalized data including experimental data from “-omics” platforms, phenotype information, and clinical data. For bioinformatics, several challenges remain: to structure this information as biological networks enabling scientists to identify relevant information; to integrate this information as specific “knowledge bases”; and to formalize this knowledge across multiple scientific domains to facilitate hypothesis generation and validation and, thus, the generation of new knowledge. Risk management in drug discovery and clinical research is used as a typical example to illustrate this approach. In this chapter we will introduce techniques and concepts (such as ontologies, semantic objects, typed relationships, contexts, graphs, and information layers) that are used to represent complex biomedical networks. The BioXM™ Knowledge Management Environment is used as an example to demonstrate how a domain such as oncology is represented and how this representation is utilized for research.

Key words

Knowledge management bioinformatics biomarkers biological networks semantic technologies data integration ontologies oncology 

References

  1. 1.
    Searls, D.B. (2005) Data integration: challenges for drug discovery. Nat. Rev. Drug Discov. 4 (1), 45–58.PubMedCrossRefGoogle Scholar
  2. 2.
    Mukherjea, S. (2005) Information retrieval and knowledge discovery utilizing a biomedical Semantic Web. Brief. Bioinform. 6 (3), 252–262.PubMedCrossRefGoogle Scholar
  3. 3.
    Kashyap (2003) The UMLS Semantic Network and the Semantic Web. AMIA Annu. Symp. Proc. 2003, 351–355.Google Scholar
  4. 4.
    Losko, S., Wenger, K., Kalus, W., Ramge, A., Wiehler, J. and Heumann, K. (2006) Knowledge networks of biological and medical data: an exhaustive and flexible solution to model life science domains. Lecture Notes in Computer Science 4075, 232–239.CrossRefGoogle Scholar
  5. 5.
    Spira, A., Beane, J., Shah, V., Liu, G., Schembri, F., Yang, X., Palma, J. and Brody, J.S. (2004) Effects of cigarette smoke on the human airway epithelial cell transcriptome. Proc. Natl. Acad. Sci. U S A 101 (27), 10143–10148.PubMedCrossRefGoogle Scholar
  6. 6.
    The Gene Ontology Consortium. (2000) Gene Ontology: tool for the unification of biology. Nature Genet. 25, 25–29.CrossRefGoogle Scholar
  7. 7.
    Sioutos, N., de Coronado, S., Haber, M.W., Hartel, F.W., Shaiu, W.L., Wright, L.W. (2007) NCI Thesaurus: a semantic model integrating cancer-related clinical and molecular information. J. Biomed. Inform. 40 (1), 30–43.PubMedCrossRefGoogle Scholar
  8. 8.
    Hubbard, S.M., Henney, J.E., DeVita, V.T. Jr. (1987) A computer data base for information on cancer treatment. N. Engl. J. Med. 316 (6), 315–318.PubMedCrossRefGoogle Scholar
  9. 9.

Copyright information

© Humana Press, a part of Springer Science+Business Media, LLC 2009

Authors and Affiliations

  • Sascha Losko
    • 1
  • Klaus Heumann
    • 1
  1. 1.Biomax Informatics AGMartinsriedGermany

Personalised recommendations