
A Positional Linguistics-Based System for Word Alignment

  • Conference paper
Text, Speech and Dialogue (TSD 2004)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3206)


Abstract

This paper describes an algorithm that is one of the few linguistics-based systems for word-to-word alignment. Most systems are purely statistical and rely on assumptions about the structure of texts that often do not hold. Our approach combines statistical methods with positional and linguistic ones so that it can be applied successfully to any kind of bitext, regardless of the internal structure of the texts. The linguistic part uses shallow parsing by regular expressions and relies on very general linguistic principles. However, a component of language-specific methods can be developed to improve results. Our word-alignment system was evaluated on a Romanian-English bitext.
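As a rough illustration of what shallow parsing by regular expressions can look like, the sketch below chunks noun phrases by matching a regular expression against a string of part-of-speech codes. It is not the paper's implementation: the tagset, the one-letter codes in `TAG_CODES`, the pattern `NP_PATTERN`, and the function `chunk_noun_phrases` are all assumptions made for this example.

```python
import re

# Hypothetical one-letter codes for a toy tagset (not the paper's tagset).
TAG_CODES = {"DET": "D", "ADJ": "A", "NOUN": "N", "VERB": "V", "PREP": "P"}

# Toy noun-phrase pattern: optional determiner, any adjectives, one or more nouns.
NP_PATTERN = re.compile(r"D?A*N+")


def chunk_noun_phrases(tagged):
    """Return noun-phrase chunks as lists of words from (word, tag) pairs."""
    # One character per token, so match offsets are also token indices.
    codes = "".join(TAG_CODES.get(tag, "x") for _, tag in tagged)
    return [
        [word for word, _ in tagged[m.start():m.end()]]
        for m in NP_PATTERN.finditer(codes)
    ]


sentence = [("the", "DET"), ("old", "ADJ"), ("system", "NOUN"),
            ("aligns", "VERB"), ("words", "NOUN")]
chunks = chunk_noun_phrases(sentence)
print(chunks)
```

Encoding each tag as a single character keeps the regular expression short and lets match positions map directly back to tokens. Chunks found this way on both sides of a bitext could then serve as units within which word correspondences are sought, though how the paper uses them is not stated in the abstract.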



Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Barbu, AM. (2004). A Positional Linguistics-Based System for Word Alignment. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2004. Lecture Notes in Computer Science, vol 3206. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30120-2_4


  • DOI: https://doi.org/10.1007/978-3-540-30120-2_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-23049-6

  • Online ISBN: 978-3-540-30120-2

  • eBook Packages: Springer Book Archive
