Abstract
Relational similarity measurement between word-pairs is important in many natural language processing tasks such as information extraction and information retrieval. The paper proposes a hybrid approach for relational similarity measurement based on various aspects including term co-occurrence, lexicon-syntactic patterns, as well as their combinations. In this approach, we first extract two relation-term sets from sentences of Wikipedia documents in which two words coincide, and compute the semantic relatedness score of each word-pair in the two relation-term sets. Second, we model the semantic relatedness value of two words together with their frequencies as a point in the three-dimensional space. Afterward, we apply DBSCAN - the classic density-based spatial clustering algorithm to group these 3D points. We finally calculate the similarity based on the clusters. We evaluate this hybrid approach using the well-known 374 SAT analogy questions. The experimental results show that our approach can significantly reduce computational time for measuring relational similarity with a relatively higher score of 52.9% compared to the state-of-the-art.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Turney, P., Pantel, P.: From frequency to meaning: Vector space models of semantics. Journal of Artificial Intelligence Research 37, 141–188 (2010)
Veale, T.: WordNet sits the sat: A knowledge-based approach to lexical analogy. In: 16th European Conference on Artificial Intelligence, pp. 606–612. IOS Press (2004)
Cao, Y., Lu, Z., Cai, S.: Relational Similarity Measure: An Approach Combining Wik-ipedia and WordNet. Applied Mechanics and Materials 55-57, 955–960 (2011)
Bollegala, D., Matsuo, Y., Ishizuka, M.: Www sits the sat: Measuring relational similarity on the web. In: 18th European Conference on Artificial Intelligience, pp. 333–337. IOS Press (2008)
Bollegala, D., Matsuo, Y., Ishizuka, M.: Measuring the similarity between implicit semantic relations from the web. In: 18th Int. World Wide Web Conference, pp. 651–660. ACM (2009)
Ester, M., Kriegel, H., Sander, J., Xu, X.: A density-based algorithm for discovering clus-ters in large spatial databases with noise. In: Second International Conference on Knowledge Discovery and Data Mining, pp. 226–231. AAAI Press (1996)
Bollegala, D., Matsuo, Y., Ishizuka, M.: Measuring the Degree of Synonymy between Words Using Relational Similarity between Word Pairs as a Proxy. In: IEICE Transaction on Information and System, vol. E95D, pp. 2116–2123 (2012)
Budanitsky, A., Hirst, G.: Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures. In: Workshop on WordNet and Other Lexical Resources, Second Meeting of the North American Chapter of the Association for Computational Linguistics, pp. 29–24 (2001)
Patwardhan, S., Pedersen, T.: Using WordNet-based context vectors to estimate the se-mantic relatedness of concepts. In: EACL 2006 Workshop, Making Sense of Sense: Bringing Computational Linguistics and Psycholinguistics Together, pp. 1–8. IOS Press (2006)
Kato, M., Ohshima, H., Oyama, S., Tanaka, K.: Query by analogical example: relational search using web search engine indices. In: 18th ACM Conference on Information and Knowledge Management, pp. 27–36. ACM Press (2009)
Bollegala, D., et al.: Improving relational similarity measurement using symmetries in proportional word analogies. In: Information Processing and Management (2012)
Yan, D., Lu, Z.: Relational Similarity Measurement Between Word-pairs using Multi-Task Lasso. In: International Conference on Cloud and Service Computing (2012)
Baroni, M., Bisi, S.: Using cooccurrence statistics and the web to discover synonyms in a technical language. In: Fourth International Conference on Language Resources and Evaluation, pp. 1725–1728 (2004)
Nakov, P., Hearst, M.: Solving relational similarity problems using the web as a corpus. In: ACL 2008-HLT, pp. 452–460 (2008)
Duc, N., et al.: Using relational similarity between word pairs for latent relational search on the web. In: Intl. Conf. on Web Intelligence, pp. 196–199 (2010)
Liang, C., Lu, Z.: Chinese Latent Relational Search Based on Relational Similarity. In: Xiang, Y., Pathan, M., Tao, X., Wang, H. (eds.) ICDKE 2012. LNCS, vol. 7696, pp. 115–127. Springer, Heidelberg (2012)
Turney, P.: Measuring semantic similarity by latent relational analysis. In: Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, pp, pp. 1136–1141 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lu, Z., Yan, Z. (2013). A Hybrid Approach for Relational Similarity Measurement. In: Meng, W., Feng, L., Bressan, S., Winiwarter, W., Song, W. (eds) Database Systems for Advanced Applications. DASFAA 2013. Lecture Notes in Computer Science, vol 7826. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37450-0_32
Download citation
DOI: https://doi.org/10.1007/978-3-642-37450-0_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37449-4
Online ISBN: 978-3-642-37450-0
eBook Packages: Computer ScienceComputer Science (R0)