A Hybrid Approach for Relational Similarity Measurement

Lu, Zhao; Yan, Zhixian

doi:10.1007/978-3-642-37450-0_32

Zhao Lu²¹ &
Zhixian Yan²²

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7826))

Included in the following conference series:

International Conference on Database Systems for Advanced Applications

1750 Accesses

Abstract

Relational similarity measurement between word-pairs is important in many natural language processing tasks such as information extraction and information retrieval. The paper proposes a hybrid approach for relational similarity measurement based on various aspects including term co-occurrence, lexicon-syntactic patterns, as well as their combinations. In this approach, we first extract two relation-term sets from sentences of Wikipedia documents in which two words coincide, and compute the semantic relatedness score of each word-pair in the two relation-term sets. Second, we model the semantic relatedness value of two words together with their frequencies as a point in the three-dimensional space. Afterward, we apply DBSCAN - the classic density-based spatial clustering algorithm to group these 3D points. We finally calculate the similarity based on the clusters. We evaluate this hybrid approach using the well-known 374 SAT analogy questions. The experimental results show that our approach can significantly reduce computational time for measuring relational similarity with a relatively higher score of 52.9% compared to the state-of-the-art.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Discovering Representative Space for Relational Similarity Measurement

Joint Distance and Information Content Word Similarity Measure

Lemon and Tea Are Not Similar: Measuring Word-to-Word Similarity by Combining Different Methods

References

Turney, P., Pantel, P.: From frequency to meaning: Vector space models of semantics. Journal of Artificial Intelligence Research 37, 141–188 (2010)
MathSciNet MATH Google Scholar
Veale, T.: WordNet sits the sat: A knowledge-based approach to lexical analogy. In: 16th European Conference on Artificial Intelligence, pp. 606–612. IOS Press (2004)
Google Scholar
Cao, Y., Lu, Z., Cai, S.: Relational Similarity Measure: An Approach Combining Wik-ipedia and WordNet. Applied Mechanics and Materials 55-57, 955–960 (2011)
Google Scholar
Bollegala, D., Matsuo, Y., Ishizuka, M.: Www sits the sat: Measuring relational similarity on the web. In: 18th European Conference on Artificial Intelligience, pp. 333–337. IOS Press (2008)
Google Scholar
Bollegala, D., Matsuo, Y., Ishizuka, M.: Measuring the similarity between implicit semantic relations from the web. In: 18th Int. World Wide Web Conference, pp. 651–660. ACM (2009)
Google Scholar
Ester, M., Kriegel, H., Sander, J., Xu, X.: A density-based algorithm for discovering clus-ters in large spatial databases with noise. In: Second International Conference on Knowledge Discovery and Data Mining, pp. 226–231. AAAI Press (1996)
Google Scholar
Bollegala, D., Matsuo, Y., Ishizuka, M.: Measuring the Degree of Synonymy between Words Using Relational Similarity between Word Pairs as a Proxy. In: IEICE Transaction on Information and System, vol. E95D, pp. 2116–2123 (2012)
Google Scholar
Budanitsky, A., Hirst, G.: Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures. In: Workshop on WordNet and Other Lexical Resources, Second Meeting of the North American Chapter of the Association for Computational Linguistics, pp. 29–24 (2001)
Google Scholar
Patwardhan, S., Pedersen, T.: Using WordNet-based context vectors to estimate the se-mantic relatedness of concepts. In: EACL 2006 Workshop, Making Sense of Sense: Bringing Computational Linguistics and Psycholinguistics Together, pp. 1–8. IOS Press (2006)
Google Scholar
Kato, M., Ohshima, H., Oyama, S., Tanaka, K.: Query by analogical example: relational search using web search engine indices. In: 18th ACM Conference on Information and Knowledge Management, pp. 27–36. ACM Press (2009)
Google Scholar
Bollegala, D., et al.: Improving relational similarity measurement using symmetries in proportional word analogies. In: Information Processing and Management (2012)
Google Scholar
Yan, D., Lu, Z.: Relational Similarity Measurement Between Word-pairs using Multi-Task Lasso. In: International Conference on Cloud and Service Computing (2012)
Google Scholar
Baroni, M., Bisi, S.: Using cooccurrence statistics and the web to discover synonyms in a technical language. In: Fourth International Conference on Language Resources and Evaluation, pp. 1725–1728 (2004)
Google Scholar
http://nlp.stanford.edu/software/tagger.shtml (2012)
Nakov, P., Hearst, M.: Solving relational similarity problems using the web as a corpus. In: ACL 2008-HLT, pp. 452–460 (2008)
Google Scholar
Duc, N., et al.: Using relational similarity between word pairs for latent relational search on the web. In: Intl. Conf. on Web Intelligence, pp. 196–199 (2010)
Google Scholar
Liang, C., Lu, Z.: Chinese Latent Relational Search Based on Relational Similarity. In: Xiang, Y., Pathan, M., Tao, X., Wang, H. (eds.) ICDKE 2012. LNCS, vol. 7696, pp. 115–127. Springer, Heidelberg (2012)
Chapter Google Scholar
Turney, P.: Measuring semantic similarity by latent relational analysis. In: Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, pp, pp. 1136–1141 (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Technology, East China Normal University, 200241, Shanghai, China
Zhao Lu
Samsung Research America, Silicon Valley, USA
Zhixian Yan

Authors

Zhao Lu
View author publications
You can also search for this author in PubMed Google Scholar
Zhixian Yan
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, Binghamton University, 13902, Binghamton, NY, USA
Weiyi Meng
Department of Computer Science and Technology, Tsinghua University, 100084, Beijing, China
Ling Feng
Department of Computer Science, National University of Singapore, 117417, Singapore
Stéphane Bressan
Research Group Data Analystics and Computing, University of Vienna, 1090, Vienna, Austria
Werner Winiwarter
School of Computer, Wuhan University, 430072, Wuhan, China
Wei Song

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lu, Z., Yan, Z. (2013). A Hybrid Approach for Relational Similarity Measurement. In: Meng, W., Feng, L., Bressan, S., Winiwarter, W., Song, W. (eds) Database Systems for Advanced Applications. DASFAA 2013. Lecture Notes in Computer Science, vol 7826. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37450-0_32

Download citation

DOI: https://doi.org/10.1007/978-3-642-37450-0_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37449-4
Online ISBN: 978-3-642-37450-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

A Hybrid Approach for Relational Similarity Measurement

Abstract

Access this chapter

Preview

Similar content being viewed by others

Discovering Representative Space for Relational Similarity Measurement

Joint Distance and Information Content Word Similarity Measure

Lemon and Tea Are Not Similar: Measuring Word-to-Word Similarity by Combining Different Methods

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

A Hybrid Approach for Relational Similarity Measurement

Abstract

Access this chapter

Preview

Similar content being viewed by others

Discovering Representative Space for Relational Similarity Measurement

Joint Distance and Information Content Word Similarity Measure

Lemon and Tea Are Not Similar: Measuring Word-to-Word Similarity by Combining Different Methods

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation