Abstract
This paper analyzes the structure and meaning of text elements cross-linguistically and discusses how that information can be elicited from people in a way that is directly useful for NLP applications. We describe a recently developed computer-based linguistic knowledge elicitation system that initiates a new paradigm of knowledge acquisition methodologies for NLP. In particular, we describe the natural language phenomena the system seeks to cover, the approach to knowledge elicitation and its rationale, the elicitation modules themselves, and broader implications of this work.
Cite this article
McShane, M., Nirenburg, S. Parameterizing and Eliciting Text Elements across Languages for Use in Natural Language Processing Systems. Machine Translation 18, 129–165 (2003). https://doi.org/10.1023/B:COAT.0000021002.59161.82
DOI: https://doi.org/10.1023/B:COAT.0000021002.59161.82