European Knowledge Acquisition Workshop

EKAW 2016: Knowledge Engineering and Knowledge Management pp 621-635

Detecting Meaningful Compounds in Complex Class Labels

  • Heiner Stuckenschmidt
  • Simone Paolo Ponzetto
  • Christian Meilicke
Conference paper

DOI: 10.1007/978-3-319-49004-5_40

Volume 10024 of the book series Lecture Notes in Computer Science (LNCS)
Cite this paper as:
Stuckenschmidt H., Ponzetto S.P., Meilicke C. (2016) Detecting Meaningful Compounds in Complex Class Labels. In: Blomqvist E., Ciancarini P., Poggi F., Vitali F. (eds) Knowledge Engineering and Knowledge Management. EKAW 2016. Lecture Notes in Computer Science, vol 10024. Springer, Cham

Abstract

Real-world ontologies such as, for instance, those for the medical domain often represent highly specific, fine-grained concepts using complex labels that consist of a sequence of sublabels. In this paper, we investigate the problem of automatically detecting meaningful compounds in such complex class labels to support methods that require an automatic understanding of their meaning such as, for example, ontology matching, ontology learning and semantic search. We formulate compound identification as a supervised learning task and investigate a variety of heterogeneous features, including statistical (i.e., knowledge-lean) as well as knowledge-based, for the task at hand. Our classifiers are trained and evaluated using a manually annotated dataset consisting of about 300 complex labels taken from real-world ontologies, which we designed to provide a benchmarking gold standard for this task. Experimental results show that by using a combination of distributional and knowledge-based features we are able to reach an accuracy of more than 90 % for compounds of length one and almost 80 % for compounds of length two. Finally, we evaluate our method in an extrinsic experimental setting: this consists of a use case highlighting the benefits of using automatically identified compounds for the high-end semantic task of ontology matching.

Copyright information

© Springer International Publishing AG 2016

Authors and Affiliations

  • Heiner Stuckenschmidt
    • 1
  • Simone Paolo Ponzetto
    • 1
  • Christian Meilicke
    • 1
  1. 1.Data and Web Science GroupUniversity of MannheimMannheimGermany