Chapter

The Semantic Web: Research and Applications

Volume 4011 of the series Lecture Notes in Computer Science pp 245-258

Extracting Instances of Relations from Web Documents Using Redundancy

  • Viktor de BoerAffiliated withHuman-Computer Studies Laboratory, Informatics Institute, Universiteit van Amsterdam
  • , Maarten van SomerenAffiliated withHuman-Computer Studies Laboratory, Informatics Institute, Universiteit van Amsterdam
  • , Bob J. WielingaAffiliated withHuman-Computer Studies Laboratory, Informatics Institute, Universiteit van Amsterdam

* Final gross prices may vary according to local VAT.

Get Access

Abstract

In this document we describe our approach to a specific subtask of ontology population, the extraction of instances of relations. We present a generic approach with which we are able to extract information from documents on the Web. The method exploits redundancy of information to compensate for loss of precision caused by the use of domain independent extraction methods. In this paper, we present the general approach and describe our implementation for a specific relation instance extraction task in the art domain. For this task, we describe experiments, discuss evaluation measures and present the results.