Extracting Equivalents from Aligned Parallel Texts: Comparison of Measures of Similarity

  • António Ribeiro
  • Gabriel Pereira Lopes
  • João Mexia
Conference paper

DOI: 10.1007/3-540-44399-1_35

Part of the Lecture Notes in Computer Science book series (LNCS, volume 1952)
Cite this paper as:
Ribeiro A., Pereira Lopes G., Mexia J. (2000) Extracting Equivalents from Aligned Parallel Texts: Comparison of Measures of Similarity. In: Monard M.C., Sichman J.S. (eds) Advances in Artificial Intelligence. Lecture Notes in Computer Science, vol 1952. Springer, Berlin, Heidelberg

Abstract

Extraction of term equivalents is one of the most important tasks for building bilingual dictionaries. Several measures have been proposed to extract translation equivalents from aligned parallel texts. In this paper, we will compare 28 measures of similarity based on the co-occurrence of words in aligned parallel text segments. Parallel texts are aligned using a simple method that extends previous work by Pascale Fung & Kathleen McKeown and Melamed but which, in contrast, does not use statistically unsupported heuristics to filter reliable points.
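As a rough illustration of what a similarity measure based on co-occurrence in aligned segments can look like, the sketch below scores word pairs with the Dice coefficient. This excerpt does not list the 28 measures compared in the paper, so the choice of Dice, the function name `dice_scores`, the whitespace tokenisation and the toy English-Portuguese segments are all assumptions made for illustration only; this is not the authors' implementation.

```python
from collections import Counter
from itertools import product


def dice_scores(aligned_segments):
    """Score every co-occurring (source word, target word) pair with Dice.

    Hypothetical helper: Dice is used here only as one well-known
    co-occurrence measure, not as the measure selected in the paper.
    """
    src_freq = Counter()   # number of segments containing each source word
    tgt_freq = Counter()   # number of segments containing each target word
    pair_freq = Counter()  # number of aligned segments containing both words

    for src, tgt in aligned_segments:
        # Assumption: naive lowercasing and whitespace tokenisation.
        src_words = set(src.lower().split())
        tgt_words = set(tgt.lower().split())
        src_freq.update(src_words)
        tgt_freq.update(tgt_words)
        pair_freq.update(product(src_words, tgt_words))

    # Dice(x, y) = 2 * f(x, y) / (f(x) + f(y))
    return {
        (s, t): 2 * joint / (src_freq[s] + tgt_freq[t])
        for (s, t), joint in pair_freq.items()
    }


# Toy aligned segments (invented for this example).
segments = [
    ("the house", "a casa"),
    ("the car", "o carro"),
    ("red house", "casa vermelha"),
]
scores = dice_scores(segments)
```

In this toy data, "house" and "casa" appear in exactly the same aligned segments, so the pair receives the maximum score, while "the" and "casa" share only one of their segments and score lower. Candidate equivalents would then be chosen by ranking pairs by score.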

Copyright information

© Springer-Verlag Berlin Heidelberg 2000

Authors and Affiliations

  • António Ribeiro (1)
  • Gabriel Pereira Lopes (1)
  • João Mexia (2)
  1. Departamento de Informática, Quinta da Torre, Universidade Nova de Lisboa, Faculdade de Ciências e Tecnologia, Monte da Caparica, Portugal
  2. Departamento de Matemática, Quinta da Torre, Universidade Nova de Lisboa, Faculdade de Ciências e Tecnologia, Monte da Caparica, Portugal
