Abstract
Machine readable dictionaries (MRDs) contain knowledge about language and the world essential for tasks in natural language processing (NLP). However, this knowledge, collected and recorded by lexicographers for human readers, is not presented in a manner for MRDs to be used directly for NLP tasks. What is badly needed are machine tractable dictionaries (MTDs): MRDs transformed into a format usable for NLP. This paper discusses three different but related large-scale computational methods to transform MRDs into MTDs. The MRD used is The Longman Dictionary of Contemporary English (LDOCE). The three methods differ in the amount of knowledge they start with and the kinds of knowledge they provide. All require some handcoding of initial information but are largely automatic. Method I, a statistical approach, uses the least handcoding. It generates “relatedness” networks for words in LDOCE and presents a method for doing partial word sense disambiguation. Method II employs the most handcoding because it develops and builds lexical entries for a very carefully controlled defining vocabulary of 2,000 word senses (1,000 words). The payoff is that the method will provide an MTD containing highly structured semantic information. Method III requires the handcoding of a grammar and the semantic patterns used by its parser, but not the handcoding of any lexical material. This is because the method builds up lexical material from sources wholly within LDOCE. The information extracted is a set of sources of information, individually weak, but which can be combined to give a strong and determinate linguistic data base.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Alshawi, Hiyan (1987) Processing Dictionary Definitions with Phrasal Pattern Hierarchies. Computational Linguistics 13, 203–218.
Alshawi, H., Boguraev, B., and Briscoe, T. (1985) Towards a Dictionary Support Environment for Real Time Parsing. In Proceedings of the 2nd European Conference on Computational Linguistics, Geneva, pp. 171-178.
Amsler, R.A. (1980) The Structure of the Merriam-Webster Pocket Dictionary, Technical Report TR-164, University of Texas at Austin.
Amsler, R.A. (1981) A Taxonomy of English Nouns and Verbs. In Proceedings of ACL-19, Stanford, pp. 133-138.
Amsler, R.A. (1982) Computational Lexicology: A Research Program. In AFIPS Conference Proceedings, 1982 National Computer Conference, pp. 657-663.
Amsler, R.A. and White, J.S. (1979) Development of a Computational Methodology for Deriving Natural Language Semantic Structures via Analysis of Machine-Readable Dictionaries, NSF Technical Report MCS77-01315.
Binot, J.-L. and Jensen, K. (1987) A Semantic Expert Using an Online Standard Dictionary. In Proceedings of UCAl-87, Milan, pp. 709-714.
Boguraev, B.K. (1987) The Definitional Power of Words. In Proceedings of the 3rd Workshop on Theoretical Issues in Natural Language Processing (TINLAP-3), Las Cruces, pp. 11-15.
Boguraev, B.K. and Briscoe, T. (1987) Large Lexicons for Natural Language Processing: Exploring the Grammar Coding System of LDOCE, Computational Linguistics 13, 203–218.
Boguraev, B.K., Briscoe, T., Carroll, J., Carter, D., and Grover, C. (1987) The Derivation of a Grammatically Indexed Lexicon from the Longman Dictionary of Contemporary English. In Proceedings of ACL-25, Stanford, pp. 193-200.
Byrd, R.J. (1989) Discovering Relationships Among Word Senses. In Proceedings of the 5th Conference of the UW Centre for the New OED (Dictionaries in the Electronic Age), Oxford, pp. 67-79.
Carre, B. (1979) Graphs and Networks, Clarendon Press, Oxford.
Chodorow, M.S., Byrd, R.J., and Heidorn, G.E. (1985) Extracting Semantic Hierarchies from a Large On-Line Dictionary. In Proceedings of ACL-23, Chicago, pp. 299-304.
Cottrell, G.W. and Small, S.L. (1983) A Connectionist Scheme for Modelling Word-Sense Disambiguation, Cognition and Brain Theory 6, 89–120.
Dietterich, T.G. and Michalski, R. (1981) Inductive Learning of Structural Descriptions, Artificial Intelligence 16, 257–294.
Evens, M., and R.N. Smith (1983) Determination of Adverbial Senses from Webster’s Seventh Collegiate Definitions, Paper presented at Workshop on Machine Readable Dictionaries, SRI-International, April 1983.
Fass, D.C. (1986) Collative Semantics: An Approach to Coherence, Memorandum in Computer and Cognitive Science, MCCS-86-56, Computing Research Laboratory, New Mexico State University, Las Cruces.
Fass, D.C. (1988a) Collative Semantics: A Semantics for Natural Language Processing, Memorandum in Computer and Cognitive Science, MCCS-88-118, Computing Research Laboratory, New Mexico State University, Las Cruces.
Fass, D.C. (1988b) Metonymy and Metaphor: What’s the Difference? In Proceedings of COLING-88, Budapest, pp. 177-181.
Fass, D.C. (1988c) An Account of Coherence, Semantic Relations, Metonymy, and Lexical Ambiguity Resolution. In S.L. Small, G.W. Cottrell and M.K. Tanenhaus (eds.), Lexical Ambiguity Resolution in the Comprehension of Human Language, Morgan Kaufmann, Los Altos, pp. 151–178.
Fass, D.C. and Wilks, Y.A. (1983) Preference Semantics, Ill-Formedness and Metaphor, American Journal of Computational Linguistics 9, 178–187.
Guo, C. (1987) Interactive Vocabulary Acquisition in XTRA. In Proceedings of IJCAI-87, Milan, pp. 715-717.
Harary, F. (1969) Graph Theory, Addison-Wesley, Reading, MA.
Harris, Z. (1951) Structural Linguistics, University of Chicago Press, Chicago.
Hobbs, J.R. (1987) World Knowledge and World Meaning. In Proceedings of the 3rd Workshop on Theoretical Issues in Natural Language Processing (TINLAP-3), Las Cruces, pp. 20-25.
Jensen, K. and Binot, J.-L. (1987) Disambiguating Prepositional Phrase Attachments by Using On-Line Dictionary Definitions, Computational Linguistics 13, 251–260.
Johnson, S.C. (1967) Hierarchical Clustering Schemes, Psychometrika 32, 241–254.
Kegl, J. (1987) The Boundary Between Word Knowledge and World Knowledge. In Proceedings of the 3rd Workshop on Theoretical Issues in Natural Language Processing (TINLAP-3), Las Cruces, pp. 26-31.
Kucera, H. and Francis, W.N. (1967) Computational Analysis of Present-Day American English, Brown University Press, Providence, RI.
Lenat, D.B. and Feigenbaum. E.A. (1987) On The Thresholds of Knowledge. In Proceedings of IJCAI-87, Milan, pp. 1173-1182.
Lenat, D.B., Prakash, M., and Shepherd, M. (1986) CYC: Using Common Sense Knowledge to Overcome Brittleness and Knowledge Acquisition Bottlenecks, A1 Magazine 7(4), 65–85.
Lesk, M.E. (1986) Automatic Sense Disambiguation Using Machine Readable Dictionaries: How to Tell a Pine Cone from an Ice Cream Cone. In Proceedings of the ACM SIGDOC Conference, Toronto, pp. 24-26.
Lyons, J. (1977) Semantics, Volume 2, Cambridge University Press, Cambridge, MA.
McClelland, J., Rumelhart, D.E. and the PDP Research Group (eds.) (1986) Parallel Distributed Processing: Explorations in the Microstructure of Cognition. Two Volumes, Volume 2: Psychological and Biological Models, MIT Press/Bradford Books, Cambridge, MA.
McDonald, J.E., Plate, T., and Schvaneveldt, R.W. (1990) Using Pathfinder to Extract Semantic Information from Text. In R. Schvaneveldt (ed.), Pathfinder Associative Networks: Studies in Knowledge Organization, Ablex, New Jersey, pp. 197–211.
Markowitz, J., Ahlswede, T. and Evens, M. (1986) Semantically Significant Patterns in Dictionary Definitions. In Proceedings of ACL-24, New York, pp. 112-119.
Masterman, M. (1957) The Thesaurus in Syntax and Semantics. Mechanical Translation 4, 1–2.
Michiels, A., Mullenders, J., and Noel, J. (1980) Exploiting a Large Data Base by Longman. In Proceedings of COLING-80, Tokyo, pp. 374-382.
Miller, G.A. (1985) Dictionaries of the Mind. In Proceedings of ACL-23, Chicago, pp. 305-314.
Newell, A. (1973) Artificial Intelligence and the Concept of Mind. In R.C. Schank and K.M. Colby (eds.), Computer Models of Thought and Language, W.H. Freeman, San Francisco, pp. 1–60.
Ogden, C.K. (1942) The General Basic English Dictionary, W.W Norton, New York.
Procter, P. et al. (eds.) (1978) Longman Dictionary of Contemporary English, Longman, Harlow, Essex.
Pulman, S.G. (1985) Generalised Phrase Structure Grammar, Earley’s Algorithm, and the Minimisation of Recursion. In K. Sparck Jones and Y.A. Wilks (eds.), Automatic Natural Language Parsing, John Wiley and Sons, New York, pp. 117–131.
Pustejovsky, J. and Bergler, S. (1987) The Acquisition of Conceptual Structure for the Lexicon. In Proceedings of AAAI-87, Seattle, pp. 556-570.
Quillian, M.R. (1967) Word Concepts: A Theory and Simulation of Some Basic Semantic Capabilities, Behavioral Science 12, 410–430. Reprinted in R.J. Brachman and H.J. Levesque (eds.), Readings in Knowledge Representation, Morgan Kaufmann, Los Altos, 1985, pp. 98-118.
Quirk, R., Greenbaum, S., Leech, G. and Svartik, J. (1972) A Grammar of Contemporary English, Longman, Harlow, Essex.
Quirk, R., Greenbaum, S., Leech, G., and Svartik, J. (1985) A Comprehensive Grammar of English, Longman, Harlow, Essex.
St. John, M.R and McClelland, J.L. (1986) Reconstructive Memory for Sentences: A PDP Approach, Ohio University Inference Conference.
Sampson, G. (1986) A Stochastic Approach to Parsing. In Proceedings of CIOLING-86, Bonn, pp. 151-155.
Schvaneveldt, R.W. and Durso, F.T. (1981) Generalized Semantic Networks, Paper presented at the meeting of the Psychonomic Society, Philadelphia.
Schvaneveldt, R.W., Durso, F.T., and Dearholt, D.W. (1985) Pathfinder: Scaling with Network Structure, Memorandum in Computer and Cognitive Science, MCCS-85-9, Computing Research Laboratory, New Mexico State University, Las Cruces.
Shortliffe, E.H. (1976) Computer-Based Medical Consultation: MYCIN. Elsevier, New York.
Slator, B.M. (1988a) Lexical Semantics and a Preference Semantics Parser, Memorandum in Computer and Cognitive Science, MCCS-88-116, Computing Research Laboratory, New Mexico State University, Las Cruces.
Slator, B.M. (1988b) PREMO: The PREference Machine Organization. In Proceedings of the Third Annual Rocky Mountain Conference on Artificial Intelligence, Denver, pp. 258-265.
Slator, B.M. (1988c) Constructing Contextually Organized Lexical Semantic Knowledge-Bases. In Proceedings of the Third Annual Rocky Mountain Conference on Artificial Intelligence, Denver, CO, pp. 142-148.
Slator, B.M. and Wilks, Y.A. ( 1987) Toward Semantic Structures from Dictionary Entries. In Proceedings of the Second Annual Rocky Mountain Conference on Artificial Intelligence, Boulder, CO, pp. 85-96. Also, Memorandum in Computer and Cognitive Science, MCCS-87-96, Computing Research Laboratory, New Mexico State University, Las Cruces.
Slocum, J. (1985) Parser Construction Techniques: A Tutorial, Tutorial held at the 23rd Annual Meeting of the Association for Computational Linguistics, Chicago.
Slocum, J. and Morgan, M.G. (1993, forthcoming) The Role of Dictionaries and Machine Readable Lexicons in Translation. In D. Walker, A. Zampolli and N. Calzolari (eds.), Automating the Lexicon: Research and Practice in a Multilingual Environment, Cambridge University Press, Cambridge.
Sparck Jones, K. (1964) Synonymy and Semantic Classification, Ph.D. Thesis, University of Cambridge.
Sparck Jones, K. (1986) Synonymy and Semantic Classification: (Ph.D. thesis with new Foreword. Edinburgh Information Technology Series (EDITS). Edinburgh: Edinburgh University Press.
Walker, D.E. and Amsler, R.A. (1986) The Use of Machine-Readable Dictionaries in Sublanguage Analysis. In R. Grishman and R. Kittredge (eds.), Analyzing Language in Restricted Domains, Lawrence Erlbaum, Hillsdale, NJ, pp. 69–84.
Waltz, D.L. and Pollack, J.B. (1985) Massively Parallel Parsing: A Strongly Interactive Model of Natural Language Interpretation, Cognitive Science 9, 51–74.
Wilks, Y.A. (1972) Grammar, Meaning, and the Machine Analysis of Language, Routledge and Kegan Paul, London.
Wilks, Y.A. (1973) An Artificial Intelligence Approach to Machine Translation. In R.C. Schank and K.M. Colby (eds.), Computer Models of Thought and Language, W.H. Freeman, San Francisco, pp. 114–151.
Wilks, Y.A. (1975a) A Preferential Pattern-Seeking Semantics for Natural Language Inference, Artificial Intelligence 6, 53–74.
Wilks, Y.A. (1975b) An Intelligent Analyser and Understander for English, Communications of the ACM 18, 264–274.
Wilks, Y.A. (1977) Good and Bad Arguments about Semantic Primitives, Communication and Cognition 10, 182–221.
Wilks, Y.A. (1978) Making Preferences More Active, Artificial Intelligence 10, 75–97.
Wilks, Y.A., Fass, D.C., Guo, C, McDonald, J.E., Plate, T., and Slator, B.M. (1987) A Tractable Machine Dictionary as a Resource for Computational Semantics. Memorandum in Computer and Cognitive Science, MCCS-87-105, Computing Research Laboratory, New Mexico State University, Las Cruces. To appear in B. Boguraev and T. Briscoe (eds.), Computational Lexicography for Natural Language Processing, Longman, Harlow, Essex.
Wilks, Y.A., Fass, D.C., Guo, C, McDonald, J.E., Plate, T., and Slator, B.M. (1988) Machine Tractable Dictionaries as Tools and Resources for Natural Language Processing. In Proceedings of COLING-88, Budapest, pp.750-755.
Winston, P.H. (1975) Learning Structural Descriptions from Examples. In P.H. Winston (ed.), The Psychology of Computer Vision, McGraw-Hill, New York.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1993 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Wilks, Y. (1993). Providing Machine Tractable Dictionary Tools. In: Pustejovsky, J. (eds) Semantics and the Lexicon. Studies in Linguistics and Philosophy, vol 49. Springer, Dordrecht. https://doi.org/10.1007/978-94-011-1972-6_16
Download citation
DOI: https://doi.org/10.1007/978-94-011-1972-6_16
Publisher Name: Springer, Dordrecht
Print ISBN: 978-0-7923-2386-0
Online ISBN: 978-94-011-1972-6
eBook Packages: Springer Book Archive