Parameterizing and Eliciting Text Elements across Languages for Use in Natural Language Processing Systems

Machine Translation

Abstract

This paper analyzes the structure and meaning of text elements cross-linguistically and discusses how that information can be elicited from people in a way that is directly useful for NLP applications. We describe a recently developed computer-based linguistic knowledge elicitation system that initiates a new paradigm of knowledge acquisition methodologies for NLP. In particular, we describe the natural language phenomena the system seeks to cover, the approach to knowledge elicitation and its rationale, the elicitation modules themselves, and broader implications of this work.

About this article

Cite this article

McShane, M., Nirenburg, S. Parameterizing and Eliciting Text Elements across Languages for Use in Natural Language Processing Systems. Machine Translation 18, 129–165 (2003). https://doi.org/10.1023/B:COAT.0000021002.59161.82

  • DOI: https://doi.org/10.1023/B:COAT.0000021002.59161.82
