DEXA 1999: Database and Expert Systems Applications pp 540-549 | Cite as
Database Challenges for Genome Information in the Post Sequencing Phase Moussouni
Abstract
Genome sequencing projects are making available to scientists complete records of the genetic make-up of organisms. The resulting data sets, along with the results of experiments that seek systematically to find new information on the functions of genes, will present numerous opportunities and challenges to biologists. However, the complexity and variety of both the data and the analyses required over such data sets also pose significant challenges to computer scientists charged with providing effective information management systems for use with genome data. This paper presents models for the sorts of information that are being produced on genomes and genome-wide experiments, and outlines a project developing an information management system aimed at supporting analyses over genomic data. This information management system replicates data from other sources, with a view to providing an integrated environment for performing complex analyses.
Keywords
Information Management System Object Database Expression Event Biological Data Source Interactive BrowserPreview
Unable to display preview. Download preview PDF.
References
- [BBB+98]P.G. Baker, A. Brass, S. Bechhofer, C.A. Goble, N.W. Paton, and R. Stevens. TAMBIS-Transparent Access to Multiple Biological Information Sources. In Proc. Int. Conf. on Intelligent Systems for Molecular Biology, pages 25–34. AAAI Press, 1998.Google Scholar
- [BDH+95]P. Buneman, S.B. Davidson, K. Hart, C. Overton, and L. Wong. A Data Transformation System for Biological Data Sources. In Proc. 21st VLDB, pages 158–169. Morgan Kaufmann, 1995.Google Scholar
- [C+97]R.G.G. Cattell et al. The Object Database Standard: ODMG 2.0. Morgan Kaufmann, 1997.Google Scholar
- [CKMS97]I-Min A. Chen, A.S. Kosky, V.M. Markowitz, and E. Szeto. Constructing and Maintaining Scientific Database Views in the Framework of the Object Protocol Model. In Proc. SSDBM. IEEE Press, 1997.Google Scholar
- [Dav98]B. Daviss. What silicon chips have done for computers, DNA chips may do for biological research. New Scientist, November:47–50, 1998.Google Scholar
- [DTM92]R. Durbin and J. Thierry-Mieg. ACeDB-A C.elegans Database. Technical report, December 1992. Available at http://probe.nalusda.gov:8000/acedocs/index.html.
- [Goo95]N. Goodman. An Object-Oriented DBMW War Story: Developing a Genome Mapping Database in C++. In Modern Database Systems. Addison-Wesley, 1995.Google Scholar
- [MAH+97]H.W. Mewes, K. Albermann, K. Heumann, S. Liebl, and F. Pfeiffer. MIPS: a database for protein sequences, homology data and yeast genome information. Nucleic Acids Research, 25(1):28–30, 1997.CrossRefGoogle Scholar
- [PS98]N.W. Paton and P.R. Sampaio. Extending the ODMG Architecture with a Deductive Object Query Language. In S.M. Embury et al., editors, Proc. 16th British National Conference on Databases, pages 149–164. Springer-Verlag, 1998.Google Scholar
- [Wid95]J. Widom. Research Problems in DataWarehousing. In Proc. 4th Int. Conf. on Information and Knowledge Management, pages 25–30, November 1995.Google Scholar