Machine Translation, Volume 10, Issue 1–2, pp 59–92

Verb semantics for English-Chinese translation

  • Martha Palmer
  • Zhibiao Wu
Article

Abstract

A common practice in operational Machine Translation (MT) and Natural Language Processing (NLP) systems is to assume that a verb has a fixed number of senses and to rely on a precompiled lexicon to achieve large coverage. This paper demonstrates that this assumption is too weak to cope with two similar problems: lexical divergences between languages, and unexpected uses of words that give rise to cases outside the coverage of the precompiled lexicon. First, we examine the lexical divergences between English verbs and Chinese verbs, and then focus on a specific lexical selection problem: translating English change-of-state verbs into Chinese verb compounds. We show that an accurate translation depends not only on information about the participants, but also on contextual information. Selectional restrictions on verb arguments therefore lack the necessary power for accurate lexical selection. Second, we examine verb representation theories and practices in MT systems and show that, under the fixed sense assumption, the existing representation schemes are not adequate for handling these lexical divergences or for extending existing verb senses to unexpected usages. We then propose a method of verb representation based on conceptual lattices, which allows the similarities among different verbs in different languages to be quantitatively measured. A prototype system, UNICON, implements this theory and performs more accurate MT lexical selection for our chosen set of verbs. An additional lexical module for UNICON that handles sense extension is also provided.
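
The abstract does not state how similarity over the conceptual lattice is computed. As a rough illustration only, the sketch below assumes a depth-based measure of the kind commonly called Wu-Palmer similarity, sim(c1, c2) = 2·N3 / (N1 + N2 + 2·N3), applied to a small tree rather than a full lattice. The concept names, the `PARENT` table, and the helper functions are hypothetical and are not taken from UNICON.

```python
from typing import Dict, List, Optional

# Hypothetical toy hierarchy (child -> parent); None marks the root.
# A real conceptual lattice may give a concept several parents;
# a tree keeps the sketch short.
PARENT: Dict[str, Optional[str]] = {
    "event": None,
    "change-of-state": "event",
    "motion": "event",
    "break": "change-of-state",
    "melt": "change-of-state",
    "shatter": "break",
    "snap": "break",
    "run": "motion",
}


def path_to_root(concept: str) -> List[str]:
    """Return the chain [concept, parent, ..., root]."""
    path: List[str] = []
    node: Optional[str] = concept
    while node is not None:
        path.append(node)
        node = PARENT[node]
    return path


def similarity(c1: str, c2: str) -> float:
    """Depth-based similarity: 2*N3 / (N1 + N2 + 2*N3).

    Assumed conventions: N1 and N2 count edges from each concept up to
    the lowest common ancestor, and N3 counts nodes from that ancestor
    up to the root (so the root itself has depth 1).
    """
    p1, p2 = path_to_root(c1), path_to_root(c2)
    ancestors_of_c1 = set(p1)
    # The first node on c2's upward path that is also above c1
    # is the lowest common ancestor.
    lca = next(node for node in p2 if node in ancestors_of_c1)
    n1 = p1.index(lca)
    n2 = p2.index(lca)
    n3 = len(path_to_root(lca))
    return 2 * n3 / (n1 + n2 + 2 * n3)


if __name__ == "__main__":
    for a, b in [("shatter", "snap"), ("shatter", "melt"), ("shatter", "run")]:
        print(f"{a} vs {b}: {similarity(a, b):.3f}")
```

Under these assumptions, concepts that share a deep common ancestor score higher than concepts that only meet near the root, which is the property a lexical selection step would rely on when ranking candidate target verbs.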

Keywords

verb semantics, lexical divergences, lexical organization

Copyright information

© Kluwer Academic Publishers 1995

Authors and Affiliations

  • Martha Palmer (1)
  • Zhibiao Wu (2)
  1. Department of Computer and Information Science, University of Pennsylvania
  2. Linguistic Data Consortium, University of Pennsylvania
