Abstract
In this paper we present a method for the automatic term clustering. The method uses a hybrid similarity measure to cluster terms automatically extracted from a corpus by applying the C/NC-value method. The measure comprises contextual, functional and lexical similarity, and it is used to instantiate the cell values in a similarity matrix. The clustering algorithm uses either the nearest neighbour or the Ward’s method to calculate the distance between clusters. The approach has been tested and evaluated in the domain of molecular biology and the results are presented.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Fasulo, D.: Analysis on Recent Work on Clustering Algorithms. Technical Report 01-03-02, University of Washington, Seattle (1999), p. 24.
Frantzi, K., Ananiadou, S., Mima, H.: Automatic Recognition of Multi-Word Terms. Int. J. on Digital Libraries 3/2 (2000), pp. 117–132.
Hearst, M.: Automatic Acquisition of Hyponyms From Large Text Corpora. Proc. of COLING 1992, Nantes, France (1992).
MEDLINE: National Library of Medicine. http://www.ncbi.nlm.nih.gov/PubMed/, (2002).
Mima, H., Ananiadou, S., Nenadić, G.: ATRACTWorkbench: An Automatic Term Recognition and Clustering of Terms. In: Matoušek, V. et al. (Eds.): Text, Speech and Dialogue — TSD 2001. LNAI 2166. Springer Verlag (2001), pp. 126–133.
Spasić, I., Nenadić, G., Manios, K., Ananiadou, S.: Supervised Learning of Term Similarities. Proc. of IDEAL 2002, Manchester, UK (2002).
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nenadić, G., Spasić, I., Ananiadou, S. (2002). Term Clustering Using a Corpus-Based Similarity Measure. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2002. Lecture Notes in Computer Science(), vol 2448. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46154-X_20
Download citation
DOI: https://doi.org/10.1007/3-540-46154-X_20
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44129-8
Online ISBN: 978-3-540-46154-8
eBook Packages: Springer Book Archive