Abstract
Research on paraphrasing is important for a range of topics, including sentence generation, summarization, and question answering. We consider the automatic extraction of synonyms (one kind of paraphrase) by matching word definitions from two dictionaries, and describe a new method for extracting paraphrases. The method achieved higher precision than a conventional frequency-based method. When only synonyms were counted as correct answers, it achieved a precision of 0.764 for the top 500 data pairs and 0.220 for 500 randomly extracted data pairs. When hypernyms and similar expressions were also counted as correct answers, it achieved a precision of 0.974 for the top 500 data pairs and 0.722 for 500 randomly extracted data pairs. Our method should be useful for other studies on paraphrase extraction.
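The precision figures above can be read as the fraction of judged pairs that fall into an accepted category. The following is a minimal sketch of that calculation under the two evaluation criteria, not the authors' evaluation code. The label names (`synonym`, `hypernym`, `similar`, `other`) and both function names are assumptions introduced here for illustration.

```python
from typing import Iterable, Set

# Hypothetical judgment labels; the paper's actual annotation scheme may differ.
STRICT: Set[str] = {"synonym"}
LENIENT: Set[str] = {"synonym", "hypernym", "similar"}


def precision(labels: Iterable[str], accepted: Set[str]) -> float:
    """Return the fraction of judged pairs whose label is in the accepted set."""
    judged = list(labels)
    if not judged:
        raise ValueError("no judged pairs")
    return sum(label in accepted for label in judged) / len(judged)


def implied_correct(precision_rate: float, n_pairs: int = 500) -> int:
    """Return the number of correct pairs implied by a reported precision rate."""
    return round(precision_rate * n_pairs)


# Toy example with four judged pairs (illustrative data, not from the paper).
toy_labels = ["synonym", "hypernym", "other", "similar"]
strict_precision = precision(toy_labels, STRICT)
lenient_precision = precision(toy_labels, LENIENT)
```

Applied to the reported rates, `implied_correct(0.764)` gives 382 of the top 500 pairs under the synonym-only criterion, and `implied_correct(0.974)` gives 487 under the broader criterion.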
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Murata, M., Kanamaru, T., Isahara, H. (2005). Automatic Synonym Acquisition Based on Matching of Definition Sentences in Multiple Dictionaries. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2005. Lecture Notes in Computer Science, vol 3406. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30586-6_32
DOI: https://doi.org/10.1007/978-3-540-30586-6_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24523-0
Online ISBN: 978-3-540-30586-6
eBook Packages: Computer Science (R0)