Providing Machine Tractable Dictionary Tools

Wilks, Yorik

doi:10.1007/978-94-011-1972-6_16

Yorik Wilks⁵

Part of the book series: Studies in Linguistics and Philosophy ((SLAP,volume 49))

316 Accesses
9 Citations

Abstract

Machine readable dictionaries (MRDs) contain knowledge about language and the world essential for tasks in natural language processing (NLP). However, this knowledge, collected and recorded by lexicographers for human readers, is not presented in a manner for MRDs to be used directly for NLP tasks. What is badly needed are machine tractable dictionaries (MTDs): MRDs transformed into a format usable for NLP. This paper discusses three different but related large-scale computational methods to transform MRDs into MTDs. The MRD used is The Longman Dictionary of Contemporary English (LDOCE). The three methods differ in the amount of knowledge they start with and the kinds of knowledge they provide. All require some handcoding of initial information but are largely automatic. Method I, a statistical approach, uses the least handcoding. It generates “relatedness” networks for words in LDOCE and presents a method for doing partial word sense disambiguation. Method II employs the most handcoding because it develops and builds lexical entries for a very carefully controlled defining vocabulary of 2,000 word senses (1,000 words). The payoff is that the method will provide an MTD containing highly structured semantic information. Method III requires the handcoding of a grammar and the semantic patterns used by its parser, but not the handcoding of any lexical material. This is because the method builds up lexical material from sources wholly within LDOCE. The information extracted is a set of sources of information, individually weak, but which can be combined to give a strong and determinate linguistic data base.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Alshawi, Hiyan (1987) Processing Dictionary Definitions with Phrasal Pattern Hierarchies. Computational Linguistics 13, 203–218.
Google Scholar
Alshawi, H., Boguraev, B., and Briscoe, T. (1985) Towards a Dictionary Support Environment for Real Time Parsing. In Proceedings of the 2nd European Conference on Computational Linguistics, Geneva, pp. 171-178.
Google Scholar
Amsler, R.A. (1980) The Structure of the Merriam-Webster Pocket Dictionary, Technical Report TR-164, University of Texas at Austin.
Google Scholar
Amsler, R.A. (1981) A Taxonomy of English Nouns and Verbs. In Proceedings of ACL-19, Stanford, pp. 133-138.
Google Scholar
Amsler, R.A. (1982) Computational Lexicology: A Research Program. In AFIPS Conference Proceedings, 1982 National Computer Conference, pp. 657-663.
Google Scholar
Amsler, R.A. and White, J.S. (1979) Development of a Computational Methodology for Deriving Natural Language Semantic Structures via Analysis of Machine-Readable Dictionaries, NSF Technical Report MCS77-01315.
Google Scholar
Binot, J.-L. and Jensen, K. (1987) A Semantic Expert Using an Online Standard Dictionary. In Proceedings of UCAl-87, Milan, pp. 709-714.
Google Scholar
Boguraev, B.K. (1987) The Definitional Power of Words. In Proceedings of the 3rd Workshop on Theoretical Issues in Natural Language Processing (TINLAP-3), Las Cruces, pp. 11-15.
Google Scholar
Boguraev, B.K. and Briscoe, T. (1987) Large Lexicons for Natural Language Processing: Exploring the Grammar Coding System of LDOCE, Computational Linguistics 13, 203–218.
Google Scholar
Boguraev, B.K., Briscoe, T., Carroll, J., Carter, D., and Grover, C. (1987) The Derivation of a Grammatically Indexed Lexicon from the Longman Dictionary of Contemporary English. In Proceedings of ACL-25, Stanford, pp. 193-200.
Google Scholar
Byrd, R.J. (1989) Discovering Relationships Among Word Senses. In Proceedings of the 5th Conference of the UW Centre for the New OED (Dictionaries in the Electronic Age), Oxford, pp. 67-79.
Google Scholar
Carre, B. (1979) Graphs and Networks, Clarendon Press, Oxford.
Google Scholar
Chodorow, M.S., Byrd, R.J., and Heidorn, G.E. (1985) Extracting Semantic Hierarchies from a Large On-Line Dictionary. In Proceedings of ACL-23, Chicago, pp. 299-304.
Google Scholar
Cottrell, G.W. and Small, S.L. (1983) A Connectionist Scheme for Modelling Word-Sense Disambiguation, Cognition and Brain Theory 6, 89–120.
Google Scholar
Dietterich, T.G. and Michalski, R. (1981) Inductive Learning of Structural Descriptions, Artificial Intelligence 16, 257–294.
Article Google Scholar
Evens, M., and R.N. Smith (1983) Determination of Adverbial Senses from Webster’s Seventh Collegiate Definitions, Paper presented at Workshop on Machine Readable Dictionaries, SRI-International, April 1983.
Google Scholar
Fass, D.C. (1986) Collative Semantics: An Approach to Coherence, Memorandum in Computer and Cognitive Science, MCCS-86-56, Computing Research Laboratory, New Mexico State University, Las Cruces.
Google Scholar
Fass, D.C. (1988a) Collative Semantics: A Semantics for Natural Language Processing, Memorandum in Computer and Cognitive Science, MCCS-88-118, Computing Research Laboratory, New Mexico State University, Las Cruces.
Google Scholar
Fass, D.C. (1988b) Metonymy and Metaphor: What’s the Difference? In Proceedings of COLING-88, Budapest, pp. 177-181.
Google Scholar
Fass, D.C. (1988c) An Account of Coherence, Semantic Relations, Metonymy, and Lexical Ambiguity Resolution. In S.L. Small, G.W. Cottrell and M.K. Tanenhaus (eds.), Lexical Ambiguity Resolution in the Comprehension of Human Language, Morgan Kaufmann, Los Altos, pp. 151–178.
Google Scholar
Fass, D.C. and Wilks, Y.A. (1983) Preference Semantics, Ill-Formedness and Metaphor, American Journal of Computational Linguistics 9, 178–187.
Google Scholar
Guo, C. (1987) Interactive Vocabulary Acquisition in XTRA. In Proceedings of IJCAI-87, Milan, pp. 715-717.
Google Scholar
Harary, F. (1969) Graph Theory, Addison-Wesley, Reading, MA.
Google Scholar
Harris, Z. (1951) Structural Linguistics, University of Chicago Press, Chicago.
Google Scholar
Hobbs, J.R. (1987) World Knowledge and World Meaning. In Proceedings of the 3rd Workshop on Theoretical Issues in Natural Language Processing (TINLAP-3), Las Cruces, pp. 20-25.
Google Scholar
Jensen, K. and Binot, J.-L. (1987) Disambiguating Prepositional Phrase Attachments by Using On-Line Dictionary Definitions, Computational Linguistics 13, 251–260.
Google Scholar
Johnson, S.C. (1967) Hierarchical Clustering Schemes, Psychometrika 32, 241–254.
Article Google Scholar
Kegl, J. (1987) The Boundary Between Word Knowledge and World Knowledge. In Proceedings of the 3rd Workshop on Theoretical Issues in Natural Language Processing (TINLAP-3), Las Cruces, pp. 26-31.
Google Scholar
Kucera, H. and Francis, W.N. (1967) Computational Analysis of Present-Day American English, Brown University Press, Providence, RI.
Google Scholar
Lenat, D.B. and Feigenbaum. E.A. (1987) On The Thresholds of Knowledge. In Proceedings of IJCAI-87, Milan, pp. 1173-1182.
Google Scholar
Lenat, D.B., Prakash, M., and Shepherd, M. (1986) CYC: Using Common Sense Knowledge to Overcome Brittleness and Knowledge Acquisition Bottlenecks, A1 Magazine 7(4), 65–85.
Google Scholar
Lesk, M.E. (1986) Automatic Sense Disambiguation Using Machine Readable Dictionaries: How to Tell a Pine Cone from an Ice Cream Cone. In Proceedings of the ACM SIGDOC Conference, Toronto, pp. 24-26.
Google Scholar
Lyons, J. (1977) Semantics, Volume 2, Cambridge University Press, Cambridge, MA.
Google Scholar
McClelland, J., Rumelhart, D.E. and the PDP Research Group (eds.) (1986) Parallel Distributed Processing: Explorations in the Microstructure of Cognition. Two Volumes, Volume 2: Psychological and Biological Models, MIT Press/Bradford Books, Cambridge, MA.
Google Scholar
McDonald, J.E., Plate, T., and Schvaneveldt, R.W. (1990) Using Pathfinder to Extract Semantic Information from Text. In R. Schvaneveldt (ed.), Pathfinder Associative Networks: Studies in Knowledge Organization, Ablex, New Jersey, pp. 197–211.
Google Scholar
Markowitz, J., Ahlswede, T. and Evens, M. (1986) Semantically Significant Patterns in Dictionary Definitions. In Proceedings of ACL-24, New York, pp. 112-119.
Google Scholar
Masterman, M. (1957) The Thesaurus in Syntax and Semantics. Mechanical Translation 4, 1–2.
Google Scholar
Michiels, A., Mullenders, J., and Noel, J. (1980) Exploiting a Large Data Base by Longman. In Proceedings of COLING-80, Tokyo, pp. 374-382.
Google Scholar
Miller, G.A. (1985) Dictionaries of the Mind. In Proceedings of ACL-23, Chicago, pp. 305-314.
Google Scholar
Newell, A. (1973) Artificial Intelligence and the Concept of Mind. In R.C. Schank and K.M. Colby (eds.), Computer Models of Thought and Language, W.H. Freeman, San Francisco, pp. 1–60.
Google Scholar
Ogden, C.K. (1942) The General Basic English Dictionary, W.W Norton, New York.
Google Scholar
Procter, P. et al. (eds.) (1978) Longman Dictionary of Contemporary English, Longman, Harlow, Essex.
Google Scholar
Pulman, S.G. (1985) Generalised Phrase Structure Grammar, Earley’s Algorithm, and the Minimisation of Recursion. In K. Sparck Jones and Y.A. Wilks (eds.), Automatic Natural Language Parsing, John Wiley and Sons, New York, pp. 117–131.
Google Scholar
Pustejovsky, J. and Bergler, S. (1987) The Acquisition of Conceptual Structure for the Lexicon. In Proceedings of AAAI-87, Seattle, pp. 556-570.
Google Scholar
Quillian, M.R. (1967) Word Concepts: A Theory and Simulation of Some Basic Semantic Capabilities, Behavioral Science 12, 410–430. Reprinted in R.J. Brachman and H.J. Levesque (eds.), Readings in Knowledge Representation, Morgan Kaufmann, Los Altos, 1985, pp. 98-118.
Article Google Scholar
Quirk, R., Greenbaum, S., Leech, G. and Svartik, J. (1972) A Grammar of Contemporary English, Longman, Harlow, Essex.
Google Scholar
Quirk, R., Greenbaum, S., Leech, G., and Svartik, J. (1985) A Comprehensive Grammar of English, Longman, Harlow, Essex.
Google Scholar
St. John, M.R and McClelland, J.L. (1986) Reconstructive Memory for Sentences: A PDP Approach, Ohio University Inference Conference.
Google Scholar
Sampson, G. (1986) A Stochastic Approach to Parsing. In Proceedings of CIOLING-86, Bonn, pp. 151-155.
Google Scholar
Schvaneveldt, R.W. and Durso, F.T. (1981) Generalized Semantic Networks, Paper presented at the meeting of the Psychonomic Society, Philadelphia.
Google Scholar
Schvaneveldt, R.W., Durso, F.T., and Dearholt, D.W. (1985) Pathfinder: Scaling with Network Structure, Memorandum in Computer and Cognitive Science, MCCS-85-9, Computing Research Laboratory, New Mexico State University, Las Cruces.
Google Scholar
Shortliffe, E.H. (1976) Computer-Based Medical Consultation: MYCIN. Elsevier, New York.
Google Scholar
Slator, B.M. (1988a) Lexical Semantics and a Preference Semantics Parser, Memorandum in Computer and Cognitive Science, MCCS-88-116, Computing Research Laboratory, New Mexico State University, Las Cruces.
Google Scholar
Slator, B.M. (1988b) PREMO: The PREference Machine Organization. In Proceedings of the Third Annual Rocky Mountain Conference on Artificial Intelligence, Denver, pp. 258-265.
Google Scholar
Slator, B.M. (1988c) Constructing Contextually Organized Lexical Semantic Knowledge-Bases. In Proceedings of the Third Annual Rocky Mountain Conference on Artificial Intelligence, Denver, CO, pp. 142-148.
Google Scholar
Slator, B.M. and Wilks, Y.A. ( 1987) Toward Semantic Structures from Dictionary Entries. In Proceedings of the Second Annual Rocky Mountain Conference on Artificial Intelligence, Boulder, CO, pp. 85-96. Also, Memorandum in Computer and Cognitive Science, MCCS-87-96, Computing Research Laboratory, New Mexico State University, Las Cruces.
Google Scholar
Slocum, J. (1985) Parser Construction Techniques: A Tutorial, Tutorial held at the 23rd Annual Meeting of the Association for Computational Linguistics, Chicago.
Google Scholar
Slocum, J. and Morgan, M.G. (1993, forthcoming) The Role of Dictionaries and Machine Readable Lexicons in Translation. In D. Walker, A. Zampolli and N. Calzolari (eds.), Automating the Lexicon: Research and Practice in a Multilingual Environment, Cambridge University Press, Cambridge.
Google Scholar
Sparck Jones, K. (1964) Synonymy and Semantic Classification, Ph.D. Thesis, University of Cambridge.
Google Scholar
Sparck Jones, K. (1986) Synonymy and Semantic Classification: (Ph.D. thesis with new Foreword. Edinburgh Information Technology Series (EDITS). Edinburgh: Edinburgh University Press.
Google Scholar
Walker, D.E. and Amsler, R.A. (1986) The Use of Machine-Readable Dictionaries in Sublanguage Analysis. In R. Grishman and R. Kittredge (eds.), Analyzing Language in Restricted Domains, Lawrence Erlbaum, Hillsdale, NJ, pp. 69–84.
Google Scholar
Waltz, D.L. and Pollack, J.B. (1985) Massively Parallel Parsing: A Strongly Interactive Model of Natural Language Interpretation, Cognitive Science 9, 51–74.
Article Google Scholar
Wilks, Y.A. (1972) Grammar, Meaning, and the Machine Analysis of Language, Routledge and Kegan Paul, London.
Google Scholar
Wilks, Y.A. (1973) An Artificial Intelligence Approach to Machine Translation. In R.C. Schank and K.M. Colby (eds.), Computer Models of Thought and Language, W.H. Freeman, San Francisco, pp. 114–151.
Google Scholar
Wilks, Y.A. (1975a) A Preferential Pattern-Seeking Semantics for Natural Language Inference, Artificial Intelligence 6, 53–74.
Article Google Scholar
Wilks, Y.A. (1975b) An Intelligent Analyser and Understander for English, Communications of the ACM 18, 264–274.
Article Google Scholar
Wilks, Y.A. (1977) Good and Bad Arguments about Semantic Primitives, Communication and Cognition 10, 182–221.
Google Scholar
Wilks, Y.A. (1978) Making Preferences More Active, Artificial Intelligence 10, 75–97.
Google Scholar
Wilks, Y.A., Fass, D.C., Guo, C, McDonald, J.E., Plate, T., and Slator, B.M. (1987) A Tractable Machine Dictionary as a Resource for Computational Semantics. Memorandum in Computer and Cognitive Science, MCCS-87-105, Computing Research Laboratory, New Mexico State University, Las Cruces. To appear in B. Boguraev and T. Briscoe (eds.), Computational Lexicography for Natural Language Processing, Longman, Harlow, Essex.
Google Scholar
Wilks, Y.A., Fass, D.C., Guo, C, McDonald, J.E., Plate, T., and Slator, B.M. (1988) Machine Tractable Dictionaries as Tools and Resources for Natural Language Processing. In Proceedings of COLING-88, Budapest, pp.750-755.
Google Scholar
Winston, P.H. (1975) Learning Structural Descriptions from Examples. In P.H. Winston (ed.), The Psychology of Computer Vision, McGraw-Hill, New York.
Google Scholar

Download references

Author information

Authors and Affiliations

Computing Research Laboratory, New Mexico State University, Las Cruces, USA
Yorik Wilks

Authors

Yorik Wilks
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, Brandeis University, USA
James Pustejovsky

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Wilks, Y. (1993). Providing Machine Tractable Dictionary Tools. In: Pustejovsky, J. (eds) Semantics and the Lexicon. Studies in Linguistics and Philosophy, vol 49. Springer, Dordrecht. https://doi.org/10.1007/978-94-011-1972-6_16

Download citation

DOI: https://doi.org/10.1007/978-94-011-1972-6_16
Publisher Name: Springer, Dordrecht
Print ISBN: 978-0-7923-2386-0
Online ISBN: 978-94-011-1972-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics