Semantic Web Evaluation Challenge

Semantic Web Evaluation Challenges pp 16-27

CETUS – A Baseline Approach to Type Extraction

  • Michael Röder
  • Ricardo Usbeck
  • René Speck
  • Axel-Cyrille Ngonga Ngomo
Conference paper

DOI: 10.1007/978-3-319-25518-7_2

Part of the Communications in Computer and Information Science book series (CCIS, volume 548)
Cite this paper as:
Röder M., Usbeck R., Speck R., Ngomo AC.N. (2015) CETUS – A Baseline Approach to Type Extraction. In: Gandon F., Cabrio E., Stankovic M., Zimmermann A. (eds) Semantic Web Evaluation Challenges. Communications in Computer and Information Science, vol 548. Springer, Cham

Abstract

The concurrent growth of the Document Web and the Data Web demands accurate information extraction tools to bridge the gap between the two. In particular, the extraction of knowledge on real-world entities is indispensable to populate knowledge bases on the Web of Data. Here, we focus on the recognition of types for entities to populate knowledge bases and enable subsequent knowledge extraction steps. We present CETUS, a baseline approach to entity type extraction. CETUS is based on a three-step pipeline comprising (i) offline, knowledge-driven type pattern extraction from natural-language corpora based on grammar-rules, (ii) an analysis of input text to extract types and (iii) the mapping of the extracted type evidence to a subset of the DOLCE+DnS Ultra Lite ontology classes. We implement and compare two approaches for the third step using the YAGO ontology as well as the FOX entity recognition tool.

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  • Michael Röder
    • 1
  • Ricardo Usbeck
    • 1
  • René Speck
    • 1
  • Axel-Cyrille Ngonga Ngomo
    • 1
  1. 1.University of LeipzigLeipzigGermany

Personalised recommendations