Integrating Scientific Data through External, Concept-Based Annotations

  • Michael Gertz
  • Kai-Uwe Sattler
Conference paper

DOI: 10.1007/3-540-36556-7_18

Part of the Lecture Notes in Computer Science book series (LNCS, volume 2590)
Cite this paper as:
Gertz M., Sattler KU. (2003) Integrating Scientific Data through External, Concept-Based Annotations. In: Bressan S., Lee M.L., Chaudhri A.B., Yu J.X., Lacroix Z. (eds) Efficiency and Effectiveness of XML Tools and Techniques and Data Integration over the Web. Lecture Notes in Computer Science, vol 2590. Springer, Berlin, Heidelberg


In several scientific application domains, such as the computational sciences, the transparent and integrated access to distributed and heterogeneous data sources is key to leveraging the knowledge and findings of researchers. Standard database integration approaches, however, are either not applicable or insufficient because of lack of local and global schema structures. In these application domains, data integration often occurs manually in that researchers collect data and categorize them using “semantic indexing”, in the most simple case through local bookmarking, which leaves them without appropriate data query, sharing, and management mechanisms.

In this paper, we present a data integration technique suitable for such application domains. This technique is based on the notion of controlled data annotations, resembling the idea of associating semantic rich metadata with diverse types of data, including images and text-based documents. Using concept like structures defined by scientists, data annotations allow scientists to link such Web-accessible data at different levels of granularity to concepts. Annotated data describing instances of such concepts then provide for sophisticated query schemes that researchers can employ to query the distributed data in an integrated and transparent fashion. We present our data annotation framework in the context of the Neurosciences where researchers employ concepts and annotations to integrate and query diverse types of data managed and distributed among individual research groups.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2003

Authors and Affiliations

  • Michael Gertz
    • 1
  • Kai-Uwe Sattler
    • 2
  1. 1.Department of Computer ScienceUniversity of CaliforniaDavisUSA
  2. 2.Department of Computer ScienceUniversity of MagdeburgGermany

Personalised recommendations