Hybrid Algorithm for Word-Level Alignment of Parallel Texts

  • Eduardo Cendejas
  • Grettel Barceló
  • Alexander Gelbukh
  • Grigori Sidorov
Conference paper

DOI: 10.1007/978-3-642-12550-8_25

Part of the Lecture Notes in Computer Science book series (LNCS, volume 5723)
Cite this paper as:
Cendejas E., Barceló G., Gelbukh A., Sidorov G. (2010) Hybrid Algorithm for Word-Level Alignment of Parallel Texts. In: Horacek H., Métais E., Muñoz R., Wolska M. (eds) Natural Language Processing and Information Systems. NLDB 2009. Lecture Notes in Computer Science, vol 5723. Springer, Berlin, Heidelberg

Abstract

Given a text in two languages, word alignment task consists of identifying in the two variants of the text specific word occurrences that are mutual translations. The majority of existing text alignment systems follow either a linguistic or a statistical approach. We argue for that both approaches are insufficient when used separately, and suggest a flexible algorithm that combines statistical and linguistic techniques.

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Eduardo Cendejas
    • 1
  • Grettel Barceló
    • 1
  • Alexander Gelbukh
    • 1
  • Grigori Sidorov
    • 1
  1. 1.Center for Computing ResearchNational Polytechnic InstituteMexico CityMexico

Personalised recommendations