Efficient Integration of Maximum Entropy Lexicon Models within the Training of Statistical Alignment Models

Part of the Lecture Notes in Computer Science book series (LNAI, volume 2499)

Abstract

Maximum entropy (ME) models have been successfully applied to many natural language problems. In this paper, we show how to integrate ME models efficiently within a maximum likelihood training scheme for statistical machine translation models. Specifically, we define a set of context-dependent ME lexicon models and show how to train them efficiently within the conventional expectation-maximization (EM) training of statistical translation models. Experimental results demonstrate how these ME models improve on the results obtained with the traditional translation models. Results are reported in terms of alignment quality, measured by comparing the resulting alignments with manually annotated reference alignments.
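As a rough illustration of the idea summarized in the abstract, the following minimal sketch shows how a log-linear (maximum entropy) lexicon probability could feed the E-step of an EM alignment procedure. It is not the authors' implementation: the feature templates, the function names (`me_lexicon_prob`, `toy_features`, `alignment_posteriors`), the uniform alignment prior, and the toy vocabulary are all assumptions made for this example.

```python
import math


def toy_features(context, f):
    """Hypothetical binary features pairing a generated word f with its context.

    `context` is assumed to be (conditioning word, word to its left).
    """
    e, prev_e = context
    return {f"lex:{e}|{f}": 1.0, f"prev:{prev_e}|{f}": 1.0}


def me_lexicon_prob(weights, candidates, context):
    """Log-linear distribution over candidate words given a context.

    p(f | context) = exp(sum_i w_i * h_i(context, f)) / Z(context)
    """
    scores = {
        f: sum(weights.get(name, 0.0) * value
               for name, value in toy_features(context, f).items())
        for f in candidates
    }
    max_score = max(scores.values())  # subtract the max for numerical stability
    exp_scores = {f: math.exp(s - max_score) for f, s in scores.items()}
    z = sum(exp_scores.values())
    return {f: v / z for f, v in exp_scores.items()}


def alignment_posteriors(f, contexts, weights, candidates):
    """E-step for one generated word f: posterior over the positions it may
    align to, assuming a uniform alignment prior (an assumption of this sketch)."""
    likelihoods = [me_lexicon_prob(weights, candidates, ctx)[f] for ctx in contexts]
    total = sum(likelihoods)
    return [value / total for value in likelihoods]


# Toy usage: one weighted feature favours pairing "house" with "casa".
weights = {"lex:house|casa": 2.0}
candidates = ["casa", "perro"]
contexts = [("house", "the"), ("dog", "the")]
posteriors = alignment_posteriors("casa", contexts, weights, candidates)
```

In a full training loop, posteriors of this kind would typically be accumulated as fractional counts, and the ME weights would then be re-estimated from them (for example with generalized iterative scaling); that step is omitted here.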

Keywords

  • Target Word
  • Training Corpus
  • Translation Model
  • Statistical Machine Translation
  • Word Class

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

This work has been partially supported by the Spanish CICYT under grant TIC2000-1599-C02-01.

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Varea, I.G., Och, F.J., Ney, H., Casacuberta, F. (2002). Efficient Integration of Maximum Entropy Lexicon Models within the Training of Statistical Alignment Models. In: Richardson, S.D. (eds) Machine Translation: From Research to Real Users. AMTA 2002. Lecture Notes in Computer Science, vol 2499. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45820-4_6

  • DOI: https://doi.org/10.1007/3-540-45820-4_6

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44282-0

  • Online ISBN: 978-3-540-45820-3

  • eBook Packages: Springer Book Archive