Skip to main content

Providing Machine Tractable Dictionary Tools

  • Chapter
Book cover Semantics and the Lexicon

Part of the book series: Studies in Linguistics and Philosophy ((SLAP,volume 49))

Abstract

Machine readable dictionaries (MRDs) contain knowledge about language and the world essential for tasks in natural language processing (NLP). However, this knowledge, collected and recorded by lexicographers for human readers, is not presented in a manner for MRDs to be used directly for NLP tasks. What is badly needed are machine tractable dictionaries (MTDs): MRDs transformed into a format usable for NLP. This paper discusses three different but related large-scale computational methods to transform MRDs into MTDs. The MRD used is The Longman Dictionary of Contemporary English (LDOCE). The three methods differ in the amount of knowledge they start with and the kinds of knowledge they provide. All require some handcoding of initial information but are largely automatic. Method I, a statistical approach, uses the least handcoding. It generates “relatedness” networks for words in LDOCE and presents a method for doing partial word sense disambiguation. Method II employs the most handcoding because it develops and builds lexical entries for a very carefully controlled defining vocabulary of 2,000 word senses (1,000 words). The payoff is that the method will provide an MTD containing highly structured semantic information. Method III requires the handcoding of a grammar and the semantic patterns used by its parser, but not the handcoding of any lexical material. This is because the method builds up lexical material from sources wholly within LDOCE. The information extracted is a set of sources of information, individually weak, but which can be combined to give a strong and determinate linguistic data base.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Alshawi, Hiyan (1987) Processing Dictionary Definitions with Phrasal Pattern Hierarchies. Computational Linguistics 13, 203–218.

    Google Scholar 

  • Alshawi, H., Boguraev, B., and Briscoe, T. (1985) Towards a Dictionary Support Environment for Real Time Parsing. In Proceedings of the 2nd European Conference on Computational Linguistics, Geneva, pp. 171-178.

    Google Scholar 

  • Amsler, R.A. (1980) The Structure of the Merriam-Webster Pocket Dictionary, Technical Report TR-164, University of Texas at Austin.

    Google Scholar 

  • Amsler, R.A. (1981) A Taxonomy of English Nouns and Verbs. In Proceedings of ACL-19, Stanford, pp. 133-138.

    Google Scholar 

  • Amsler, R.A. (1982) Computational Lexicology: A Research Program. In AFIPS Conference Proceedings, 1982 National Computer Conference, pp. 657-663.

    Google Scholar 

  • Amsler, R.A. and White, J.S. (1979) Development of a Computational Methodology for Deriving Natural Language Semantic Structures via Analysis of Machine-Readable Dictionaries, NSF Technical Report MCS77-01315.

    Google Scholar 

  • Binot, J.-L. and Jensen, K. (1987) A Semantic Expert Using an Online Standard Dictionary. In Proceedings of UCAl-87, Milan, pp. 709-714.

    Google Scholar 

  • Boguraev, B.K. (1987) The Definitional Power of Words. In Proceedings of the 3rd Workshop on Theoretical Issues in Natural Language Processing (TINLAP-3), Las Cruces, pp. 11-15.

    Google Scholar 

  • Boguraev, B.K. and Briscoe, T. (1987) Large Lexicons for Natural Language Processing: Exploring the Grammar Coding System of LDOCE, Computational Linguistics 13, 203–218.

    Google Scholar 

  • Boguraev, B.K., Briscoe, T., Carroll, J., Carter, D., and Grover, C. (1987) The Derivation of a Grammatically Indexed Lexicon from the Longman Dictionary of Contemporary English. In Proceedings of ACL-25, Stanford, pp. 193-200.

    Google Scholar 

  • Byrd, R.J. (1989) Discovering Relationships Among Word Senses. In Proceedings of the 5th Conference of the UW Centre for the New OED (Dictionaries in the Electronic Age), Oxford, pp. 67-79.

    Google Scholar 

  • Carre, B. (1979) Graphs and Networks, Clarendon Press, Oxford.

    Google Scholar 

  • Chodorow, M.S., Byrd, R.J., and Heidorn, G.E. (1985) Extracting Semantic Hierarchies from a Large On-Line Dictionary. In Proceedings of ACL-23, Chicago, pp. 299-304.

    Google Scholar 

  • Cottrell, G.W. and Small, S.L. (1983) A Connectionist Scheme for Modelling Word-Sense Disambiguation, Cognition and Brain Theory 6, 89–120.

    Google Scholar 

  • Dietterich, T.G. and Michalski, R. (1981) Inductive Learning of Structural Descriptions, Artificial Intelligence 16, 257–294.

    Article  Google Scholar 

  • Evens, M., and R.N. Smith (1983) Determination of Adverbial Senses from Webster’s Seventh Collegiate Definitions, Paper presented at Workshop on Machine Readable Dictionaries, SRI-International, April 1983.

    Google Scholar 

  • Fass, D.C. (1986) Collative Semantics: An Approach to Coherence, Memorandum in Computer and Cognitive Science, MCCS-86-56, Computing Research Laboratory, New Mexico State University, Las Cruces.

    Google Scholar 

  • Fass, D.C. (1988a) Collative Semantics: A Semantics for Natural Language Processing, Memorandum in Computer and Cognitive Science, MCCS-88-118, Computing Research Laboratory, New Mexico State University, Las Cruces.

    Google Scholar 

  • Fass, D.C. (1988b) Metonymy and Metaphor: What’s the Difference? In Proceedings of COLING-88, Budapest, pp. 177-181.

    Google Scholar 

  • Fass, D.C. (1988c) An Account of Coherence, Semantic Relations, Metonymy, and Lexical Ambiguity Resolution. In S.L. Small, G.W. Cottrell and M.K. Tanenhaus (eds.), Lexical Ambiguity Resolution in the Comprehension of Human Language, Morgan Kaufmann, Los Altos, pp. 151–178.

    Google Scholar 

  • Fass, D.C. and Wilks, Y.A. (1983) Preference Semantics, Ill-Formedness and Metaphor, American Journal of Computational Linguistics 9, 178–187.

    Google Scholar 

  • Guo, C. (1987) Interactive Vocabulary Acquisition in XTRA. In Proceedings of IJCAI-87, Milan, pp. 715-717.

    Google Scholar 

  • Harary, F. (1969) Graph Theory, Addison-Wesley, Reading, MA.

    Google Scholar 

  • Harris, Z. (1951) Structural Linguistics, University of Chicago Press, Chicago.

    Google Scholar 

  • Hobbs, J.R. (1987) World Knowledge and World Meaning. In Proceedings of the 3rd Workshop on Theoretical Issues in Natural Language Processing (TINLAP-3), Las Cruces, pp. 20-25.

    Google Scholar 

  • Jensen, K. and Binot, J.-L. (1987) Disambiguating Prepositional Phrase Attachments by Using On-Line Dictionary Definitions, Computational Linguistics 13, 251–260.

    Google Scholar 

  • Johnson, S.C. (1967) Hierarchical Clustering Schemes, Psychometrika 32, 241–254.

    Article  Google Scholar 

  • Kegl, J. (1987) The Boundary Between Word Knowledge and World Knowledge. In Proceedings of the 3rd Workshop on Theoretical Issues in Natural Language Processing (TINLAP-3), Las Cruces, pp. 26-31.

    Google Scholar 

  • Kucera, H. and Francis, W.N. (1967) Computational Analysis of Present-Day American English, Brown University Press, Providence, RI.

    Google Scholar 

  • Lenat, D.B. and Feigenbaum. E.A. (1987) On The Thresholds of Knowledge. In Proceedings of IJCAI-87, Milan, pp. 1173-1182.

    Google Scholar 

  • Lenat, D.B., Prakash, M., and Shepherd, M. (1986) CYC: Using Common Sense Knowledge to Overcome Brittleness and Knowledge Acquisition Bottlenecks, A1 Magazine 7(4), 65–85.

    Google Scholar 

  • Lesk, M.E. (1986) Automatic Sense Disambiguation Using Machine Readable Dictionaries: How to Tell a Pine Cone from an Ice Cream Cone. In Proceedings of the ACM SIGDOC Conference, Toronto, pp. 24-26.

    Google Scholar 

  • Lyons, J. (1977) Semantics, Volume 2, Cambridge University Press, Cambridge, MA.

    Google Scholar 

  • McClelland, J., Rumelhart, D.E. and the PDP Research Group (eds.) (1986) Parallel Distributed Processing: Explorations in the Microstructure of Cognition. Two Volumes, Volume 2: Psychological and Biological Models, MIT Press/Bradford Books, Cambridge, MA.

    Google Scholar 

  • McDonald, J.E., Plate, T., and Schvaneveldt, R.W. (1990) Using Pathfinder to Extract Semantic Information from Text. In R. Schvaneveldt (ed.), Pathfinder Associative Networks: Studies in Knowledge Organization, Ablex, New Jersey, pp. 197–211.

    Google Scholar 

  • Markowitz, J., Ahlswede, T. and Evens, M. (1986) Semantically Significant Patterns in Dictionary Definitions. In Proceedings of ACL-24, New York, pp. 112-119.

    Google Scholar 

  • Masterman, M. (1957) The Thesaurus in Syntax and Semantics. Mechanical Translation 4, 1–2.

    Google Scholar 

  • Michiels, A., Mullenders, J., and Noel, J. (1980) Exploiting a Large Data Base by Longman. In Proceedings of COLING-80, Tokyo, pp. 374-382.

    Google Scholar 

  • Miller, G.A. (1985) Dictionaries of the Mind. In Proceedings of ACL-23, Chicago, pp. 305-314.

    Google Scholar 

  • Newell, A. (1973) Artificial Intelligence and the Concept of Mind. In R.C. Schank and K.M. Colby (eds.), Computer Models of Thought and Language, W.H. Freeman, San Francisco, pp. 1–60.

    Google Scholar 

  • Ogden, C.K. (1942) The General Basic English Dictionary, W.W Norton, New York.

    Google Scholar 

  • Procter, P. et al. (eds.) (1978) Longman Dictionary of Contemporary English, Longman, Harlow, Essex.

    Google Scholar 

  • Pulman, S.G. (1985) Generalised Phrase Structure Grammar, Earley’s Algorithm, and the Minimisation of Recursion. In K. Sparck Jones and Y.A. Wilks (eds.), Automatic Natural Language Parsing, John Wiley and Sons, New York, pp. 117–131.

    Google Scholar 

  • Pustejovsky, J. and Bergler, S. (1987) The Acquisition of Conceptual Structure for the Lexicon. In Proceedings of AAAI-87, Seattle, pp. 556-570.

    Google Scholar 

  • Quillian, M.R. (1967) Word Concepts: A Theory and Simulation of Some Basic Semantic Capabilities, Behavioral Science 12, 410–430. Reprinted in R.J. Brachman and H.J. Levesque (eds.), Readings in Knowledge Representation, Morgan Kaufmann, Los Altos, 1985, pp. 98-118.

    Article  Google Scholar 

  • Quirk, R., Greenbaum, S., Leech, G. and Svartik, J. (1972) A Grammar of Contemporary English, Longman, Harlow, Essex.

    Google Scholar 

  • Quirk, R., Greenbaum, S., Leech, G., and Svartik, J. (1985) A Comprehensive Grammar of English, Longman, Harlow, Essex.

    Google Scholar 

  • St. John, M.R and McClelland, J.L. (1986) Reconstructive Memory for Sentences: A PDP Approach, Ohio University Inference Conference.

    Google Scholar 

  • Sampson, G. (1986) A Stochastic Approach to Parsing. In Proceedings of CIOLING-86, Bonn, pp. 151-155.

    Google Scholar 

  • Schvaneveldt, R.W. and Durso, F.T. (1981) Generalized Semantic Networks, Paper presented at the meeting of the Psychonomic Society, Philadelphia.

    Google Scholar 

  • Schvaneveldt, R.W., Durso, F.T., and Dearholt, D.W. (1985) Pathfinder: Scaling with Network Structure, Memorandum in Computer and Cognitive Science, MCCS-85-9, Computing Research Laboratory, New Mexico State University, Las Cruces.

    Google Scholar 

  • Shortliffe, E.H. (1976) Computer-Based Medical Consultation: MYCIN. Elsevier, New York.

    Google Scholar 

  • Slator, B.M. (1988a) Lexical Semantics and a Preference Semantics Parser, Memorandum in Computer and Cognitive Science, MCCS-88-116, Computing Research Laboratory, New Mexico State University, Las Cruces.

    Google Scholar 

  • Slator, B.M. (1988b) PREMO: The PREference Machine Organization. In Proceedings of the Third Annual Rocky Mountain Conference on Artificial Intelligence, Denver, pp. 258-265.

    Google Scholar 

  • Slator, B.M. (1988c) Constructing Contextually Organized Lexical Semantic Knowledge-Bases. In Proceedings of the Third Annual Rocky Mountain Conference on Artificial Intelligence, Denver, CO, pp. 142-148.

    Google Scholar 

  • Slator, B.M. and Wilks, Y.A. ( 1987) Toward Semantic Structures from Dictionary Entries. In Proceedings of the Second Annual Rocky Mountain Conference on Artificial Intelligence, Boulder, CO, pp. 85-96. Also, Memorandum in Computer and Cognitive Science, MCCS-87-96, Computing Research Laboratory, New Mexico State University, Las Cruces.

    Google Scholar 

  • Slocum, J. (1985) Parser Construction Techniques: A Tutorial, Tutorial held at the 23rd Annual Meeting of the Association for Computational Linguistics, Chicago.

    Google Scholar 

  • Slocum, J. and Morgan, M.G. (1993, forthcoming) The Role of Dictionaries and Machine Readable Lexicons in Translation. In D. Walker, A. Zampolli and N. Calzolari (eds.), Automating the Lexicon: Research and Practice in a Multilingual Environment, Cambridge University Press, Cambridge.

    Google Scholar 

  • Sparck Jones, K. (1964) Synonymy and Semantic Classification, Ph.D. Thesis, University of Cambridge.

    Google Scholar 

  • Sparck Jones, K. (1986) Synonymy and Semantic Classification: (Ph.D. thesis with new Foreword. Edinburgh Information Technology Series (EDITS). Edinburgh: Edinburgh University Press.

    Google Scholar 

  • Walker, D.E. and Amsler, R.A. (1986) The Use of Machine-Readable Dictionaries in Sublanguage Analysis. In R. Grishman and R. Kittredge (eds.), Analyzing Language in Restricted Domains, Lawrence Erlbaum, Hillsdale, NJ, pp. 69–84.

    Google Scholar 

  • Waltz, D.L. and Pollack, J.B. (1985) Massively Parallel Parsing: A Strongly Interactive Model of Natural Language Interpretation, Cognitive Science 9, 51–74.

    Article  Google Scholar 

  • Wilks, Y.A. (1972) Grammar, Meaning, and the Machine Analysis of Language, Routledge and Kegan Paul, London.

    Google Scholar 

  • Wilks, Y.A. (1973) An Artificial Intelligence Approach to Machine Translation. In R.C. Schank and K.M. Colby (eds.), Computer Models of Thought and Language, W.H. Freeman, San Francisco, pp. 114–151.

    Google Scholar 

  • Wilks, Y.A. (1975a) A Preferential Pattern-Seeking Semantics for Natural Language Inference, Artificial Intelligence 6, 53–74.

    Article  Google Scholar 

  • Wilks, Y.A. (1975b) An Intelligent Analyser and Understander for English, Communications of the ACM 18, 264–274.

    Article  Google Scholar 

  • Wilks, Y.A. (1977) Good and Bad Arguments about Semantic Primitives, Communication and Cognition 10, 182–221.

    Google Scholar 

  • Wilks, Y.A. (1978) Making Preferences More Active, Artificial Intelligence 10, 75–97.

    Google Scholar 

  • Wilks, Y.A., Fass, D.C., Guo, C, McDonald, J.E., Plate, T., and Slator, B.M. (1987) A Tractable Machine Dictionary as a Resource for Computational Semantics. Memorandum in Computer and Cognitive Science, MCCS-87-105, Computing Research Laboratory, New Mexico State University, Las Cruces. To appear in B. Boguraev and T. Briscoe (eds.), Computational Lexicography for Natural Language Processing, Longman, Harlow, Essex.

    Google Scholar 

  • Wilks, Y.A., Fass, D.C., Guo, C, McDonald, J.E., Plate, T., and Slator, B.M. (1988) Machine Tractable Dictionaries as Tools and Resources for Natural Language Processing. In Proceedings of COLING-88, Budapest, pp.750-755.

    Google Scholar 

  • Winston, P.H. (1975) Learning Structural Descriptions from Examples. In P.H. Winston (ed.), The Psychology of Computer Vision, McGraw-Hill, New York.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1993 Springer Science+Business Media Dordrecht

About this chapter

Cite this chapter

Wilks, Y. (1993). Providing Machine Tractable Dictionary Tools. In: Pustejovsky, J. (eds) Semantics and the Lexicon. Studies in Linguistics and Philosophy, vol 49. Springer, Dordrecht. https://doi.org/10.1007/978-94-011-1972-6_16

Download citation

  • DOI: https://doi.org/10.1007/978-94-011-1972-6_16

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-0-7923-2386-0

  • Online ISBN: 978-94-011-1972-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics