Skip to main content
Log in

Automatic Acquisition and Expansion of Hypernym Links

  • Published:
Computers and the Humanities Aims and scope Submit manuscript

Abstract

Recent developments in computational terminology call for the design of multiple and complementary tools for the acquisition, the structuring and the exploitation of terminological data. This paper proposes to bridge the gap between term acquisition and thesaurus construction by offering a framework for automatic structuring of multi-word candidate terms with the help of corpus-based links between single-word terms. First, we present a system for corpus-based acquisition of terminological relationships through discursive patterns. This system is built on previous work on automatic extraction of hyponymy links through shallow parsing. Second, we show how hypernym links between single-word terms can be extended to semantic links between multi-word terms through corpus-based extraction of semantic variants. The induced hierarchy is incomplete but provides an automatic generalization of single-word terms relations to multi-word terms that are pervasive in technical thesauri and corpora.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  • R. Basili M.T. Pazienza P. Velardi (1993) ArticleTitleAcquisition of Selectional Patterns in Sublanguages Machine Tranlation 8 175–201

    Google Scholar 

  • Bourigault D. (1995) LEXTER, A Terminology Extraction Software for Knowledge Acquisition from Texts. In Proceedings, 9th Banff Knowledge Acquisition for Knowledge-Based Systems Workshop. Banff, Vol. 5, pp. 1–17.

  • K.W. Church P. Hanks (1990) ArticleTitleWord Association Norms, Mutual Information, and Lexicography Computational Linguistics 16 IssueID1 22–29

    Google Scholar 

  • A. Condamines J. Rebeyrolle (2001) Searching for and Identifying Conceptual Relationships via a Corpus-Based Approach to a Terminological Knowledge Base (CTKB): Method and Results D. Bourigault C. Jacquemin M.-C. L’Homme (Eds) Recent Advances in Computational Terminology John Benjamins Amsterdam 127–148

    Google Scholar 

  • B. Daille (1996) Study and Implementation of Combined Techniques for Automatic Extraction of Terminology J.L. Klavans P. Resnik (Eds) The Balancing Act: Combining Symbolic and Statistical Approaches to Language MIT press Cambridge, MA 49–66

    Google Scholar 

  • C. Fellbaum (Eds) (1998) WordNet: An Electronic Lexical Database MIT Press. Cambridge MA

    Google Scholar 

  • G. Grefenstette (1994) Explorations in Automatic Thesaurus Discovery Kluwer Academic Publisher Boston, MA

    Google Scholar 

  • Grishman R., Sterling J. (1992) Acquisition of Selectional Patterns. In Proceedings of the 14th International Conference on Computational Linguistics (COLING’92 ). Nantes, France, pp. 658–664.

  • Hamon T., Nazarenko A., Gros C. (1998) A Step Towards the Detection of Semantic Variants of Terms in Technical Documents. InProceedings of the 36th Annual Meeting of the Association for Computational Linguistics (COLING-ACL’98). Montreal, pp. 498–504.

  • M.A. Hearst (1992) Automatic Acquisition of Hyponyms from Large Text Corpora. In Proceedings of the 14th International Conference on Computational Linguistics (COLING’92). Nantes Francepp 539–545

    Google Scholar 

  • M.A. Hearst (1998) Automated Discovery of WordNet Relations C. Fellbaum (Eds) WordNet: An Electronic Lexical Database MIT Press Cambridge, MA 131–151

    Google Scholar 

  • D. Hindle (1990) Noun Classification from Predicate Argument Structures. In Proceedings of the 28th Annual Meeting of the Association for Computational Linguistics (ACL’90). Berkeley CA 268–275

    Google Scholar 

  • C. Jacquemin E. Tzoukermann (1999) NLP for Term Variant Extraction: A Synergy of Morphology, Lexicon, and Syntax T. Strzalkowski (Eds) Natural Language Information Retrieval Kluwer Academic Publishers Boston, MA 25–74

    Google Scholar 

  • C. Jacquemin (1996) A Symbolic and Surgical Acquisition of Terms Through Variation S. Wermter E. Riloff G. Scheler (Eds) Connectionist, Statistical and Symbolic Approaches to Learning for Natural Language Processing Springer Heidelberg 425–438

    Google Scholar 

  • Jacquemin C. (1999) Syntagmatic and Paradigmatic Representation of Term Variation. In Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics (ACL’99). University of Maryland, pp. 341–348.

  • C. Jacquemin (2001) Spotting and Discovering Terms through NLP. MIT Press Cambridge, MA

    Google Scholar 

  • Jacquemin C., Royautè J. (1994) Retrieving Terms and Their Variants in a Lexicalized Unification-Based Framework. In Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’94). Dublin, Springer Verlag, New York. pp. 132–141.

  • J.S. Justeson S.M. Katz (1995) ArticleTitleTechnical Terminology: Some Linguistic Properties and an Algorithm for Identification in Text. Natural Language Engineering 1 IssueID1 9–27

    Google Scholar 

  • E. Morin (1999a) ArticleTitleDes Patrons Lexico-Syntaxiques Pour Aider au Dépouillement Terminologique. Traitement Automatique des Langues 40 IssueID1 143–166

    Google Scholar 

  • Morin E. (1999b) Extraction de Liens Sémantiques Entre Termes à Partir de Corpus de Textes Techniques. PhD thesis in Computer Science, University of Nantes.

  • Rijsbergen van C.J. (1979) Information Retrieval. Butterworths.

  • Rijsbergen van C.J. (1979) Information Retrieval. Butterworths.

  • Riloff E. (1993) Automatically Constructing a Dictionary for Information Extraction Tasks. In Proceedings of the 11th National Conference on Artificial Intelligence (AAAI’93). Washington, DC, pp. 811–816.

  • G. Ruge (1991) Experiments on Linguistically Based Term Associations. In Proceedings of the Intelligent Multimedia Information Retrieval Systems and Management (RIAO’91). Barcelona Spainpp 528–545

    Google Scholar 

  • G. Salton M.J. McGill (1983) Introduction to Modern Information Retrieval McGraw-Hill New York

    Google Scholar 

  • H. Schu¨tze (1993) Word Space S.J. Hanson J.D. Cowan L. Giles (Eds) Advances in Neural Information Processing Systems 5 San Mateo CA, Morgan Kauffmann 895–902

    Google Scholar 

  • F. Smadja (1993) ArticleTitleRetrieving Collocations from Text: Xtract Computational Linguistics 19 IssueID1 143–177

    Google Scholar 

  • J. Vivaldi Palatresi Rodríguez Hontoria (2002) Medical Term Extraction using EWN ontology. In Proceedings of the Terminology and Knowledge Engineering (TKE’2002). Nancy France, Gesellschaft fu¨r Terminologie und Wissenstransfer

    Google Scholar 

  • F. Yoshikane K. Tsuji K. Kageura C. Jacquemin (1998) Detecting Japanese Term Variation in Textual Corpus. In Proceedings of the Fourth International Workshop on Information Retrieval with Asian Languages (IRAL’99). Academia Sinica Taipei, Taiwan 97–108

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Emmanuel Morin.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Morin, E., Jacquemin, C. Automatic Acquisition and Expansion of Hypernym Links. Comput Hum 38, 363–396 (2004). https://doi.org/10.1007/s10579-004-1926-2

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10579-004-1926-2

Keywords

Navigation