Aligning GermaNet Senses with Wiktionary Sense Definitions

  • Verena Henrich
  • Erhard Hinrichs
  • Tatiana Vodolazova
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8387)

Abstract

Sense definitions are a crucial component for wordnets and enhance the usability of wordnets for a wide variety of NLP applications. Many wordnets for languages other than English – including the German wordnet GermaNet – lack comprehensive coverage of such definitions. The purpose of this paper is to automatically align sense descriptions from the web-based dictionary Wiktionary to lexical units in GermaNet in order to extend GermaNet with sense descriptions. An alignment algorithm based on word overlaps is developed and different setups of the algorithm are compared. This algorithm yields as the best result an accuracy of 93.8 % and an F1-score of 84.3, which confirms the viability of the proposed method for automatically enriching GermaNet. This best result crucially involves the use of coordinated relations as a novel concept for calculating sense alignment.

Keywords

Word sense alignment Mapping lexical resources Germa-Net German wordnet Wiktionary Sense definitions 

References

  1. 1.
    Banerjee, S., Pedersen, T.: Extended gloss overlaps as a measure of semantic relatedness. In: Proceedings of the 18th International Joint Conference on Artificial Intelligence, IJCAI’03, pp. 805–810. Morgan Kaufmann Publishers Inc., San Francisco (2003)Google Scholar
  2. 2.
    Eisenberg, P.: Das Wort - Grundriss der Deutschen Grammatik, 3rd edn. Verlag J. B. Melzer, Stuttgart/Weimar, Germany (2006)Google Scholar
  3. 3.
    Fellbaum, C. (ed.): WordNet: An Electronic Lexical Database. MIT Press, Cambridge (1998)MATHGoogle Scholar
  4. 4.
    Fernando, S., Stevenson, M.: Mapping WordNet synsets to Wikipedia articles. In: Proceedings of the Eight International Conference on Language Resources and Evaluation, LREC’12, Istanbul, Turkey, pp. 590–596 (2012)Google Scholar
  5. 5.
    Gurevych, I.: Using the structure of a conceptual network in computing semantic relatedness. In: Dale, R., Wong, K.-F., Su, J., Kwong, O.Y. (eds.) IJCNLP 2005. LNCS (LNAI), vol. 3651, pp. 767–778. Springer, Heidelberg (2005) CrossRefGoogle Scholar
  6. 6.
    Hamp, B., Feldweg, H.: GermaNet - a lexical-semantic net for German. In: Proceedings of ACL Workshop Automatic Information Extraction and Building of Lexical Semantic Resources for NLP Applications, Madrid (1997)Google Scholar
  7. 7.
    Henrich, V., Hinrichs, E.: GernEdiT – the GermaNet editing tool. In: Proceedings of the 7th International Conference on Language Resources and Evaluation, LREC’10, Valletta, Malta, pp. 2228–2235 (2010)Google Scholar
  8. 8.
    Henrich, V., Hinrichs, E., Suttner, K.: Automatically linking GermaNet to Wikipedia for harvesting Corpus examples for GermaNet senses. J. Lang. Technol. Comput. Linguist. (JLCL) 27(1), 1–19 (2012)Google Scholar
  9. 9.
    Henrich, V., Hinrichs, E., Vodolazova, T.: Semi-Automatic extension of GermaNet with sense definitions from Wiktionary. In: Proceedings of the 5th Language and Technology Conference, LTC’11, Poznań, Poland, pp. 126–130 (2011)Google Scholar
  10. 10.
    Henrich, V., Hinrichs, E., Vodolazova, T.: WebCAGe – a web-harvested corpus annotated with GermaNet senses. In: Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, EACL ’12, pp. 387–396, Avignon, France (2012)Google Scholar
  11. 11.
    Kwong, O.Y.: Aligning wordnet with additional lexical resources. In: Proceedings of the COLING-ACL’98 Workshop on ‘Usage of WordNet in Natural Language Processing Systems’, Montreal, QC, Canada, pp. 73–79 (1998)Google Scholar
  12. 12.
    Lesk, M.: Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone. In: Proceedings of the 5th Annual International Conference on Systems Documentation, SIGDOC ’86, pp. 24–26. ACM, New York (1986)Google Scholar
  13. 13.
    Litkowski, K.C.: Towards a meaning-full comparison of lexical resources. In: Proceedings of the ACL Special Interest Group on the Lexicon Workshop on Standardizing Lexical Resources, College Park, MD, USA, pp. 30–37 (1999)Google Scholar
  14. 14.
    Matuschek, M., Gurevych, I.: Dijkstra-wsa: a graph-based approach to word sense alignment. Trans. Assoc. Comput. Linguist. (TACL) 1, 151–164 (2013)Google Scholar
  15. 15.
    Meyer, C.M., Gurevych, I.: What psycholinguists know about chemistry: aligning wiktionary and wordnet for increased domain coverage. In: Proceedings of the 5th International Joint Conference on Natural Language Processing, IJCNLP ’11, pp. 883–892 (2011)Google Scholar
  16. 16.
    Niemann, E., Gurevych, I.: The people’s web meets linguistic knowledge: automatic sense alignment of Wikipedia and WordNet. In: Proceedings of the 9th International Conference on Computational Semantics, IWCS ’11, pp. 205–214, Association for Computational Linguistics, Stroudsburg (2011)Google Scholar
  17. 17.
    Niles, I., Pease, A.: Linking Lexicons and ontologies: mapping WordNet to the suggested upper merged ontology. In: Proceedings of the IEEE International Conference on Information and Knowledge Engineering, IKE’03, pp. 412–416, Las Vegas, Nevada (2003)Google Scholar
  18. 18.
    Ponzetto, S.P., Navigli, R.: Knowledge-rich word sense disambiguation rivaling supervised systems. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, ACL ’10, pp. 1522–1531. Association for Computational Linguistics, Stroudsburg (2010)Google Scholar
  19. 19.
    Porter, M.: An algorithm for suffix stripping. Program 14(3), 130–137 (1980)CrossRefGoogle Scholar
  20. 20.
    Ruiz-Casado, M., Alfonseca, E., Castells, P.: Automatic assignment of wikipedia encyclopedic entries to wordnet synsets. In: Szczepaniak, P.S., Kacprzyk, J., Niewiadomski, A. (eds.) AWIC 2005. LNCS (LNAI), vol. 3528, pp. 380–386. Springer, Heidelberg (2005) CrossRefGoogle Scholar
  21. 21.
    Zesch, T.: What’s the difference? - comparing expert-built and collaboratively-built lexical semantic resources. In: FLaReNet Forum 2010, Barcelona, Spain (2010)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Verena Henrich
    • 1
  • Erhard Hinrichs
    • 1
  • Tatiana Vodolazova
    • 1
  1. 1.Department of General and Computational LinguisticsTübingenGermany

Personalised recommendations