Abstract
This paper describes our experience with the first steps towards integrating pathway and protein interaction data with other data sets within the framework of a federated database system based on the functional data model. We have made use of DTD and XML files produced by the BIND project. The DTD provides a specification for information about biomolecular interactions, complexes and pathways, and can be translated semi-automatically to a database schema. The load utility uses metadata derived from this schema to help identify data items of interest when recursively traversing a Prolog tree structure representing the XML data. We also show how derived functions can be used to make explicit those relationships that are present in data sets but which are not fully described in DTD files.
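The metadata-guided traversal mentioned above can be illustrated with a minimal sketch. It is not the authors' load utility: that utility walks a Prolog tree structure, whereas this sketch substitutes Python's standard `xml.etree.ElementTree`. The element names, the `ELEMENTS_OF_INTEREST` set and the `collect` function are hypothetical stand-ins for metadata derived from a schema, and are not taken from the BIND DTD.

```python
import xml.etree.ElementTree as ET

# Hypothetical metadata: element names that a schema marks as of interest.
# These names are illustrative only and do not come from the BIND DTD.
ELEMENTS_OF_INTEREST = {"interaction-id", "molecule-name"}


def collect(element, wanted, found=None):
    """Recursively walk the tree, recording (tag, text) for wanted elements."""
    if found is None:
        found = []
    if element.tag in wanted:
        found.append((element.tag, (element.text or "").strip()))
    for child in element:
        # Descend into every child so nested items are also discovered.
        collect(child, wanted, found)
    return found


# Toy document, assumed for illustration.
SAMPLE = """
<submission>
  <interaction>
    <interaction-id>1</interaction-id>
    <participant><molecule-name>A</molecule-name></participant>
    <participant><molecule-name>B</molecule-name></participant>
    <comment>ignored</comment>
  </interaction>
</submission>
""".strip()

items = collect(ET.fromstring(SAMPLE), ELEMENTS_OF_INTEREST)
```

Keeping the set of wanted names separate from the traversal means the same recursive walk can, in principle, be reused when the schema-derived metadata changes.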
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kemp, G.J.L., Selpi, S. (2004). Pathway and Protein Interaction Data: from XML to FDM Database. In: Rahm, E. (eds) Data Integration in the Life Sciences. DILS 2004. Lecture Notes in Computer Science, vol 2994. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24745-6_15
DOI: https://doi.org/10.1007/978-3-540-24745-6_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21300-0
Online ISBN: 978-3-540-24745-6
eBook Packages: Springer Book Archive