Automatic Synonym Acquisition Based on Matching of Definition Sentences in Multiple Dictionaries

Murata, Masaki; Kanamaru, Toshiyuki; Isahara, Hitoshi

doi:10.1007/978-3-540-30586-6_32

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 3406)

Abstract

Studies on paraphrasing are important with respect to various research topics such as sentence generation, summarization, and question-answering. We consider the automatic extraction of synonyms (which are a kind of paraphrase) through the matching of word definitions from two dictionaries, and describe a new method for extracting paraphrases. Higher precision was obtained than with a conventional frequency-based method. The new method provided a precision rate of 0.764 for the top 500 data pairs and 0.220 for 500 randomly extracted data pairs when only synonyms were considered a correct answer. It provided a precision rate of 0.974 for the top 500 data pairs and 0.722 for 500 randomly extracted data pairs when hypernyms and similar expressions were also considered correct answers. Our method should be useful for other studies on paraphrase extraction.
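The idea of matching two definitions of the same headword can be illustrated with a minimal sketch. It is not the authors' implementation: the toy English dictionaries, the function names `candidate_pairs`, `extract_candidates`, and `precision`, and the whitespace tokenization are all assumptions made for illustration. Pairs are ranked here by simple recurrence counts, which is closer to a frequency-based baseline; the abstract does not specify the scoring used by the new method.

```python
from collections import Counter
from difflib import SequenceMatcher


def candidate_pairs(def_a, def_b):
    """Align two definitions word by word and return the segments that differ.

    Each differing segment pair is only a *candidate* paraphrase.
    Whitespace tokenization is an assumption of this sketch.
    """
    a, b = def_a.split(), def_b.split()
    matcher = SequenceMatcher(a=a, b=b, autojunk=False)
    pairs = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "replace":  # words in A replaced by different words in B
            pairs.append((" ".join(a[i1:i2]), " ".join(b[j1:j2])))
    return pairs


def extract_candidates(dict_a, dict_b):
    """Count candidate pairs over all headwords present in both dictionaries."""
    counts = Counter()
    for headword in dict_a.keys() & dict_b.keys():
        counts.update(candidate_pairs(dict_a[headword], dict_b[headword]))
    return counts


def precision(judgements):
    """Fraction of evaluated pairs that a human judged correct."""
    return sum(judgements) / len(judgements)


# Hypothetical toy dictionaries (not data from the paper).
dict_a = {"sprint": "to run very fast", "huge": "very large in size"}
dict_b = {"sprint": "to run extremely fast", "huge": "extremely large in size"}

counts = extract_candidates(dict_a, dict_b)
ranked = counts.most_common()
```

With this toy data, the pair ("very", "extremely") is found under both headwords. Precision is then the share of evaluated pairs judged correct, so a precision of 0.764 over 500 pairs corresponds to 382 pairs judged correct.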

Author information

Authors and Affiliations

National Institute of Information and Communications Technology, 3-5 Hikaridai, Seika-cho, Soraku-gun, Kyoto, 619-0289, Japan
Masaki Murata & Hitoshi Isahara
Kyoto University, Yoshida-nihonmatsu-cho, Sakyo-ku, Kyoto, 606-8501, Japan
Toshiyuki Kanamaru

Editor information

Editors and Affiliations

National Polytechnic Institute, Center for Computing Research, 07738, Mexico City, México
Alexander Gelbukh

About this paper

Cite this paper

Murata, M., Kanamaru, T., Isahara, H. (2005). Automatic Synonym Acquisition Based on Matching of Definition Sentences in Multiple Dictionaries. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2005. Lecture Notes in Computer Science, vol 3406. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30586-6_32

DOI: https://doi.org/10.1007/978-3-540-30586-6_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24523-0
Online ISBN: 978-3-540-30586-6
eBook Packages: Computer Science, Computer Science (R0)