Designing a Global Information Resource for Molecular Biology (Short Paper)
Research in molecular biology is continuously producing an immense amount of data, but this information is spread over numerous heterogeneous data repositories. Their integration into a federated information system would drastically reduce the time a biologist has to spend browsing different WWW sites or databases in search for a particular piece of information.
In this study we point out the specific problems that molecular biology is posing to data integration. We present our approach to cope with these problems. It is based on a mediator architecture and uses query correspondence assertions (QCA) to describe sources in a flexible yet expressive manner. QCAs both capture content and query capabilities of arbitrary data sources with respect to a federated schema. Based on such QCAs a mediator can answer queries against the federated schema by constructing semantically equivalent combinations of source queries.
Unable to display preview. Download preview PDF.
- Hull, R. (1997). Managing Semantic Heterogeneity in Databases: A Theoretical Perspective. 16th ACM PODS.Google Scholar
- Leser, U. (1998). Combining Heterogeneous Data Sources through Query Correspondence Assertions. 1st Workshop on Web Information and Data Management, Washington, D.C.Google Scholar
- Leser, U. (1998). Maintenance and Mediation in Federated Databases. 8th WITS, Helsinki, Finland, to appear.Google Scholar
- Levy, A. Y., A. O. Mendelzon, et al. (1995). Answering Queries using Views. 14th ACM PODS, San Jose, CA pp. 95–104.Google Scholar
- Levy, A. Y., A. Rajaraman, et al. (1996). Querying Heterogeneous Information Sources Using Source Descriptions. 22th VLDB, Bombay, India pp. 251–262.Google Scholar
- Miller, R. J. (1998). Using Schematically Heterogenous Structures. ACM SIGMOD, Seattle, Washington pp. 189–200.Google Scholar
- Naumann, F., J. C. Freytag, et al. (1998). Quality driven Source Selection using Data Envelopment Analysis. Int. Conf. on Information Quality, MIT, Cambridge.Google Scholar
- Sheth, A. and J. A. Larson (1990). “Federated Database Systems for Managing Distributed, Heterogeneous and Autonomous Databases.„ ACM Computing Survey 22 (3).Google Scholar
- Wiederhold, G. (1992). “Mediators in the Architecture of Future Information Systems.„ IEEE Computer 25 (3): 38–49.Google Scholar