Converting and Annotating Quantitative Data Tables

  • Mark van Assem
  • Hajo Rijgersberg
  • Mari Wigham
  • Jan Top
Conference paper

DOI: 10.1007/978-3-642-17746-0_2

Volume 6496 of the book series Lecture Notes in Computer Science (LNCS)
Cite this paper as:
van Assem M., Rijgersberg H., Wigham M., Top J. (2010) Converting and Annotating Quantitative Data Tables. In: Patel-Schneider P.F. et al. (eds) The Semantic Web – ISWC 2010. ISWC 2010. Lecture Notes in Computer Science, vol 6496. Springer, Berlin, Heidelberg

Abstract

Companies, governmental agencies and scientists produce a large amount of quantitative (research) data, consisting of measurements ranging from e.g. the surface temperatures of an ocean to the viscosity of a sample of mayonnaise. Such measurements are stored in tables in e.g. spreadsheet files and research reports. To integrate and reuse such data, it is necessary to have a semantic description of the data. However, the notation used is often ambiguous, making automatic interpretation and conversion to RDF or other suitable format difficult. For example, the table header cell “f (Hz)” refers to frequency measured in Hertz, but the symbol “f” can also refer to the unit farad or the quantities force or luminous flux. Current annotation tools for this task either work on less ambiguous data or perform a more limited task. We introduce new disambiguation strategies based on an ontology, which allows to improve performance on “sloppy” datasets not yet targeted by existing systems.

Download to read the full conference paper text

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Mark van Assem
    • 1
  • Hajo Rijgersberg
    • 3
  • Mari Wigham
    • 2
    • 3
  • Jan Top
    • 1
    • 2
    • 3
  1. 1.VU University AmsterdamThe Netherlands
  2. 2.Top Institute Food and NutritionThe Netherlands
  3. 3.Wageningen University and Research CentreThe Netherlands