Abstract
Recent developments in computational terminology call for the design of multiple, complementary tools for acquiring, structuring and exploiting terminological data. This paper proposes to bridge the gap between term acquisition and thesaurus construction by offering a framework for the automatic structuring of multi-word candidate terms with the help of corpus-based links between single-word terms. First, we present a system for the corpus-based acquisition of terminological relationships through discursive patterns. This system builds on previous work on the automatic extraction of hyponymy links through shallow parsing. Second, we show how hypernym links between single-word terms can be extended to semantic links between multi-word terms through the corpus-based extraction of semantic variants. The induced hierarchy is incomplete, but it provides an automatic generalization of single-word term relations to the multi-word terms that are pervasive in technical thesauri and corpora.
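The second step described above, extending hypernym links from single-word terms to multi-word terms, can be illustrated with a minimal sketch. It is not the authors' system: it assumes that a link is projected whenever two multi-word candidate terms of equal length differ in exactly one word and that word pair is already linked. The function name `expand_links` and the sample terms are hypothetical, and the corpus-based extraction of semantic variants mentioned in the abstract is replaced here by plain word-for-word comparison.

```python
def expand_links(single_word_links, candidate_terms):
    """Project (hyponym, hypernym) links between single words onto multi-word terms.

    Assumption for this sketch: two multi-word terms are linked when they have
    the same length, differ in exactly one position, and the differing words
    form a known (hyponym, hypernym) pair.
    """
    links = set(single_word_links)
    terms = [tuple(term.split()) for term in candidate_terms]
    projected = set()
    for hypo in terms:
        for hyper in terms:
            # Only compare multi-word terms of identical length.
            if len(hypo) < 2 or len(hypo) != len(hyper):
                continue
            # Collect the positions where the two terms disagree.
            diffs = [(a, b) for a, b in zip(hypo, hyper) if a != b]
            if len(diffs) == 1 and diffs[0] in links:
                projected.add((" ".join(hypo), " ".join(hyper)))
    return projected


# Hypothetical input: one single-word link and three candidate terms.
single_word_links = {("apple", "fruit")}
candidate_terms = ["apple juice", "fruit juice", "orange juice"]
result = expand_links(single_word_links, candidate_terms)
```

With this input, only "apple juice" and "fruit juice" are linked. "orange juice" stays unattached because no single-word link involving "orange" is known, which is one way the induced hierarchy ends up incomplete.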
Cite this article
Morin, E., Jacquemin, C. Automatic Acquisition and Expansion of Hypernym Links. Comput Hum 38, 363–396 (2004). https://doi.org/10.1007/s10579-004-1926-2
DOI: https://doi.org/10.1007/s10579-004-1926-2