Advertisement

Experiments on Extracting Semantic Relations from Syntactic Relations

  • Caroline Varaschin Gasperin
  • Vera Lúcia Strube de Lima
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2588)

Abstract

Thiswork presents the results of the application of a technique for automatic extraction of semantic relations among words from a corpus. The technique used is the one proposed by Grefenstette in [1]. We brought contributions to the syntactic context notion in [1], aiming to improve the identification of semantically related words. Then, we carried on three different experiments using a Portuguese language corpus: the first one compares the original Grefenstette’s technique with the technique modified with our contributions, the second experiment investigates which syntactic relation is more relevant when identifying semantic relations, and the last experiment investigates the influence of the parser errors on the quality of the extracted semantic relations. Results and their analyses are detailed in this article.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Grefenstette, G.: Explorations in Automatic Thesaurus Discovery. Kluwer Academic Publishers, USA (1994)zbMATHGoogle Scholar
  2. 2.
    Lin, D.: Automatic retrieval and clustering of similar words. In: Proceedings of the COLINGACL’ 98, Montreal (1998)Google Scholar
  3. 3.
    Ruge, G.: Automatic detection of thesaurus relations for information retrieval. In: Foundations of Computer Science. Springer, Berlin (1997) 499–506CrossRefGoogle Scholar
  4. 4.
    Park, Y.C., Han, Y.S., Choi, K.S.: Automatic thesaurus construction using bayesian networks. In: Proceedings of the 1995 International Conference on Information and Knowledge Management, Baltimore (1995) 212–217Google Scholar
  5. 5.
    Pacey, M.: The use of clustering techniques to reveal semantic relations between words. In Renouf, A., ed.: Explorations in Corpus Linguistics. Rodopi, Amsterdam (1998) 269–280Google Scholar
  6. 6.
    Thanopoulos, A., Fakotakis, N., Kokkinakis, G.: Automatic extraction of semantic relations from specialized corpora. In: Proceedings of COLING’2000, Saarbrücken (2000)Google Scholar
  7. 7.
    Bick, E.: The Parsing System Palavras: Automatic Grammatical Analysis of Portuguese in a Constraint Grammar Framework. PhD thesis, Århus University, Århus (2000)Google Scholar
  8. 8.
    Bolshakov, I., Gelbukh, A.: Heuristics-based replenishment of collocation databases. In Ranchhod, E., Mamede, N., eds.: Lecture Notes in Computer Science N 2389: Advances in Natural Language Processing. Springer-Verlag (2002)CrossRefGoogle Scholar
  9. 9.
    Gasperin, C.V.: Extração automática de relações semânticas a partir de relações sintáticas. Master’s thesis, PUCRS, Porto Alegre (2001)Google Scholar
  10. 10.
    A. Gelbukh, G. Sidorov, L.C.H.: Compilation of a spanish representative corpus. In: Lecture Notes in Computer Science N 2276: Computational Linguistics and Intelligent Text Processing. Springer-Verlag (2002) 285–288CrossRefGoogle Scholar
  11. 11.
    Bolshakov, I., Gelbukh, A.: A very large database of collocations and semantic links. In et al., M.B., ed.: Lecture Notes in Computer Science N 1959: Natural Language Processing and Information Systems. Springer-Verlag (2001) 103–114CrossRefGoogle Scholar
  12. 12.
    Lee, L.: Measures of distributional similarity. In: 37th Annual Meeting of the ACL. (1999) 25–32Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2003

Authors and Affiliations

  • Caroline Varaschin Gasperin
    • 1
  • Vera Lúcia Strube de Lima
    • 1
  1. 1.PUCRSFaculdade de Informática PPGCCPorto AlegreBrasil

Personalised recommendations