A method for the computation of the semantic similarity and relatedness between natural language words

Anisimov, A. V.; Marchenko, O. O.; Kysenko, V. K.

doi:10.1007/s10559-011-9334-2

A method for the computation of the semantic similarity and relatedness between natural language words

Published: 03 August 2011

Volume 47, pages 515–522, (2011)
Cite this article

Cybernetics and Systems Analysis Aims and scope

A. V. Anisimov¹,
O. O. Marchenko¹ &
V. K. Kysenko¹

114 Accesses
3 Citations
Explore all metrics

Abstract

This paper develops methods for calculating the semantic similarity (closeness)-relatedness of natural language words. The concept of semantic relatedness allows one to construct algorithmic models for the context-linguistic analysis with a view to solving problems such as word sense disambiguation, named entity recognition, natural language text analysis, etc. A new algorithm is proposed for estimating the semantic distance between natural language words. This method is a weighted modification of the well-known Lesk approach based on the lexical intersection of glossary entries.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Dimensions of Semantic Similarity

A Semantic Similarity Measurement Tool for WordNet-Like Databases

An Analysis of Semantic Similarity Measures for Information Retrieval

References

M. Lesk, “Automatic sense disambiguation using machine readable dictionaries: How to tell a pine cone from an ice cream cone,” in: Proc. of the 5th Annu. Intern. Conf. on Syst. Document SIGDOC’86, ACM, New York (1986), pp. 24–26.
Chapter Google Scholar
S. Wubben, “Using free link structure to calculate semantic relatedness,” ILK Research Group Technical Report Series No. 08–01, Tilburq Univ., Tilburq (2008).
S. P. Ponzetto and M. Strube, “Knowledge deriver from Wikipedia for computing semantic relatedness,” Artif. Intell. Res., No. 30, 181–212 (2007).
Google Scholar
E. Gabrilovich and S. Markovitch, “Computing semantic relatedness using Wikipedia-based explicit semantic analysis,” in: Proc. 20th Intern. Joint Conf. on Artif. Intell. (Hyderabad, 2007), Morgan Kauffman, San Francisco (2007), pp. 1606–1611.
Google Scholar
P. Resnik, “Using information content to evaluate semantic similarity in a taxonomy,” in: Proc. Intern. Joint Conf. on Artif. Intell. (Montreal, 1995), Morgan Kauffman, San Francisco (1995), pp. 448–453.
Google Scholar
C. Leacock, M. Chodorow, and G. A. Miller, “Using corpus statistics and wordnet relations for sense identification,” Comput. Ling., 24, No. 1, 147–165 (1998).
Google Scholar
Z. Wu and M. Palmer, “Verb semantics and lexical selection,” in: Proc. 32nd. Annu. Meet. of the Assoc. for Comput. Ling. (Las Cruces, 1994), Morgan Kauffman, San Francisco (1994), pp. 133–138.
Google Scholar
M. Strube and S. P. Ponzetto, “WikiRelate! Computing semantic relatedness using Wikipedia,” in: Proc. 21st Nat. Conf. on Artif. Intell., AAAI, Boston, MA (2006), pp. 1419–1424.
Google Scholar
D. Milne and I. H. Witten, “An effective, low-cost measure of semantic relatedness obtained from Wikipedia links,” in: Proc. 1st AAAI Workshop on Wikipedia and Artif. Intell. (CIKM’2008) (Chicago, 2008), AAAI Press, Menlo Park (USA) (2008).
Google Scholar
E. Yeh, D. Ramage, C. D. Manning, et al., “WikiWalk: Random walks on Wikipedia for semantic relatedness,” in: ACL-IJCNLP TextGraphs-4 Workshop 2009, Singapore (2009).
S. Kirkpatrick, C. D. Gelatt, and M. P. Vecchi, “Optimization by simulated annealing,” Science, New Series, No. 220, 671–680 (1983).
S. Luke, Essentials of Metaheuristics (2009), http://cs.gmu.edu/!sean/book/metaheuristics/.
M. Odersky, Scala by Example, Progr. Meth. Lab., EPFL, Lausanne (2009).
M. Odersky, L. Spoon, and B. Venners, Programming in Scala, Artima Press, Montain View (2008).
Google Scholar
L. Finkelstein, E. Gabrilovich, Y. Matias, et al., “Placing search in context: The concept revisited,” ACM Trans. Inform. Systems, 20, No. 1, 116–131 (2002).
Article Google Scholar
T. Pedersen, S. Pathwardhan, and J. Michelizzi, “Wordnet::Similarity — Measuring the relatedness of concepts,” in: Proc. 19th Nat. Conf. on Artif. Intell. (San Jose, 2004), Springer, Berlin (2004), pp. 1024–1025.
Google Scholar

Download references

Author information

Authors and Affiliations

Taras Shevchenko National University, Kiev, Ukraine
A. V. Anisimov, O. O. Marchenko & V. K. Kysenko

Authors

A. V. Anisimov
View author publications
You can also search for this author in PubMed Google Scholar
O. O. Marchenko
View author publications
You can also search for this author in PubMed Google Scholar
V. K. Kysenko
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to A. V. Anisimov.

Additional information

Translated from Kibernetika i Sistemnyi Analiz, No. 4, pp. 18–27, July–August 2011.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Anisimov, A.V., Marchenko, O.O. & Kysenko, V.K. A method for the computation of the semantic similarity and relatedness between natural language words. Cybern Syst Anal 47, 515–522 (2011). https://doi.org/10.1007/s10559-011-9334-2

Download citation

Received: 10 March 2011
Published: 03 August 2011
Issue Date: July 2011
DOI: https://doi.org/10.1007/s10559-011-9334-2

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A method for the computation of the semantic similarity and relatedness between natural language words

Abstract

Access this article

Similar content being viewed by others

Dimensions of Semantic Similarity

A Semantic Similarity Measurement Tool for WordNet-Like Databases

An Analysis of Semantic Similarity Measures for Information Retrieval

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A method for the computation of the semantic similarity and relatedness between natural language words

Abstract

Access this article

Similar content being viewed by others

Dimensions of Semantic Similarity

A Semantic Similarity Measurement Tool for WordNet-Like Databases

An Analysis of Semantic Similarity Measures for Information Retrieval

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation