Automating the Generation of Semantic Annotation Tools Using a Clustering Technique

  • Vitór Souza
  • Nicola Zeni
  • Nadzeya Kiyavitskaya
  • Periklis Andritsos
  • Luisa Mich
  • John Mylopoulos
Conference paper

DOI: 10.1007/978-3-540-69858-6_10

Part of the Lecture Notes in Computer Science book series (LNCS, volume 5039)
Cite this paper as:
Souza V., Zeni N., Kiyavitskaya N., Andritsos P., Mich L., Mylopoulos J. (2008) Automating the Generation of Semantic Annotation Tools Using a Clustering Technique. In: Kapetanios E., Sugumaran V., Spiliopoulou M. (eds) Natural Language and Information Systems. NLDB 2008. Lecture Notes in Computer Science, vol 5039. Springer, Berlin, Heidelberg

Abstract

In order to generate semantic annotations for a collection of documents, one needs an annotation schema consisting of a semantic model (a.k.a. ontology) along with lists of linguistic indicators (keywords and patterns) for each concept in the ontology. The focus of this paper is the automatic generation of the linguistic indicators for a given semantic model and a corpus of documents. Our approach needs a small number of user-defined seeds and bootstraps itself by exploiting a novel clustering technique. The baseline for this work is the Cerno project [8] and the clustering algorithm LIMBO [2]. We also present results that compare the output of the clustering algorithm with linguistic indicators created manually for two case studies.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Vitór Souza
    • 1
  • Nicola Zeni
    • 1
  • Nadzeya Kiyavitskaya
    • 1
  • Periklis Andritsos
    • 1
  • Luisa Mich
    • 2
  • John Mylopoulos
    • 1
  1. 1.Dept. of Information Engineering and Computer Science  
  2. 2.Dept. of Computer and Management SciencesUniversity of TrentoItaly

Personalised recommendations