Oxygen: A Language Independent Linearization Engine
This paper describes a language independent linearization engine, oxyGen. This system compiles target language grammars into programs that take feature graphs as inputs and generate word lattices that can be passed along to the statistical extraction module of the generation system Nitrogen. The grammars are written using a flexible and powerful language, oxyL, that has the power of a programming language but focuses on natural language realization. This engine has been used successfully in creating an English linearization program that is currently employed as part of a Chinese-English machine translation system.
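The pipeline described above, a feature graph in and a word lattice out, can be illustrated with a minimal sketch. This is not oxyL syntax and not the oxyGen implementation: the nested-dictionary feature graph, the fixed agent-verb-theme ordering rule, the list-of-slots lattice encoding, and all function names are assumptions made purely for illustration.

```python
from itertools import product

# Hypothetical feature graph: a head word plus role-labelled children.
feature_graph = {
    "head": "eat",
    "agent": {"head": "dog"},
    "theme": {"head": "bone"},
}

def alternatives(word):
    """Toy morphology: offer two surface forms and leave the choice
    to a later statistical ranking step (assumed, not modelled here)."""
    return [word, word + "s"]

def linearize(node):
    """Toy ordering rule: agent, then head, then theme.
    Returns a lattice encoded as a list of slots, where each slot
    is a list of alternative words."""
    lattice = []
    if "agent" in node:
        lattice += linearize(node["agent"])
    lattice.append(alternatives(node["head"]))
    if "theme" in node:
        lattice += linearize(node["theme"])
    return lattice

def paths(lattice):
    """Enumerate every word sequence encoded by the lattice."""
    return [" ".join(words) for words in product(*lattice)]

lattice = linearize(feature_graph)
candidates = paths(lattice)
```

In this sketch the lattice compactly encodes every candidate sentence; a statistical component such as the one the abstract attributes to Nitrogen would then pick the most fluent path rather than enumerating them all.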
Keywords: Machine Translation, Grammar Rule, Word Lattice, Thematic Hierarchy, Referential Function