Morphology to the Rescue Redux: Resolving Borrowings and Code-Mixing in Machine Translation

  • Esmé Manandise
  • Claudia Gdaniec
Conference paper

DOI: 10.1007/978-3-642-23138-4_6

Part of the Communications in Computer and Information Science book series (CCIS, volume 100)
Cite this paper as:
Manandise E., Gdaniec C. (2011) Morphology to the Rescue Redux: Resolving Borrowings and Code-Mixing in Machine Translation. In: Mahlow C., Piotrowski M. (eds) Systems and Frameworks for Computational Morphology. SFCM 2011. Communications in Computer and Information Science, vol 100. Springer, Berlin, Heidelberg

Abstract

In the IBM LMT machine translation system, derivational morphological rules recognize and analyze words that are not found in its source lexicons, and generate default transfers for these unlisted words. Unfound words with no inflectional or derivational affixes are treated as nouns by default. These rules are now expanded to provide lexical coverage of a particular set of words created on the fly in emails by bilingual Spanish-English speakers. The approach is characterized by the generation of additional default parts of speech and by the use of morphological, semantic, and syntactic features from both source and target lexicons for analysis and transfer. A built-in rule-based strategy for handling language borrowing and code-mixing allows the system to recognize words whose frequency of occurrence is variable and unpredictable. Such words would otherwise remain unfound, reducing the accuracy of parsing and the quality of translation output.
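
The fallback order described in the abstract (lexicon lookup, then morphological analysis that consults the target-language lexicon, then a default noun reading) can be illustrated with a minimal sketch. This is not the LMT implementation: the toy lexicons, the suffix rules, the example words, and the names `Analysis` and `analyze` are all assumptions made for illustration only.

```python
from dataclasses import dataclass
from typing import Optional

# Toy lexicons. Every entry is an illustrative assumption, not LMT data.
SPANISH_LEXICON = {"enviar": "verb", "mensaje": "noun"}
ENGLISH_LEXICON = {"forward": "verb", "link": "noun", "chat": "verb"}

# Hypothetical Spanish suffix rules: (suffix, part of speech of the whole word).
SUFFIX_RULES = [("ear", "verb"), ("eo", "noun")]


@dataclass
class Analysis:
    word: str
    pos: str              # part of speech assigned to the word
    stem: Optional[str]   # English stem, if a borrowing rule applied
    source: str           # which step produced the analysis


def analyze(word: str) -> Analysis:
    """Analyze a Spanish-side token, falling back step by step."""
    word = word.lower()
    # Step 1: the word is listed in the source (Spanish) lexicon.
    if word in SPANISH_LEXICON:
        return Analysis(word, SPANISH_LEXICON[word], None, "source lexicon")
    # Step 2: strip a Spanish suffix and look the stem up in the
    # target (English) lexicon; the suffix decides the part of speech.
    for suffix, pos in SUFFIX_RULES:
        if word.endswith(suffix):
            stem = word[: -len(suffix)]
            if stem in ENGLISH_LEXICON:
                return Analysis(word, pos, stem, "borrowing rule")
    # Step 3: no rule applied, so the unfound word defaults to a noun.
    return Analysis(word, "noun", None, "default")


if __name__ == "__main__":
    for token in ["mensaje", "forwardear", "chateo", "blorp"]:
        print(analyze(token))
```

In this sketch a made-up form such as "forwardear" receives a verb reading because its stem is found in the English lexicon, whereas a token that matches no rule falls through to the default noun reading.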

Keywords

Unfound words, rule-based morphology, derivational morphology, parsing, code-mixing, code-switching, borrowing, scoring, unsupervised, email, machine translation, languages in contact, Spanish-English

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Esmé Manandise (1)
  • Claudia Gdaniec (2)
  1. IBM Thomas J. Watson Research Center, Yorktown Heights, USA
  2. South Westphalia University of Applied Sciences, Soest, Germany
