Automatic Synonym Acquisition Based on Matching of Definition Sentences in Multiple Dictionaries

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 3406)

Abstract

Studies on paraphrasing are important for various research topics such as sentence generation, summarization, and question answering. We consider the automatic extraction of synonyms (which are a kind of paraphrase) by matching the definitions of words in two dictionaries, and describe a new method for extracting paraphrases. The new method obtained higher precision than a conventional frequency-based method. It provided a precision rate of 0.764 for the top 500 data pairs and 0.220 for 500 randomly extracted data pairs when only synonyms were counted as correct answers. It provided a precision rate of 0.974 for the top 500 data pairs and 0.722 for 500 randomly extracted data pairs when hypernyms and similar expressions were also counted as correct answers. Our method should be useful for other studies on paraphrase extraction.
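
The abstract does not spell out the matching procedure, so the following is only a rough sketch of the general idea, not the authors' implementation. It assumes that each definition is already tokenized, aligns two definitions of the same headword with Python's standard difflib module, and treats the segments that differ as candidate paraphrase pairs. The function names candidate_pairs and precision and the toy definitions are hypothetical and exist only for illustration.

```python
from difflib import SequenceMatcher


def candidate_pairs(def_a, def_b):
    """Align two tokenized definitions and return the segments that differ.

    Assumption: a segment replaced between two definitions of the same
    headword is a candidate paraphrase pair. This is an illustrative
    stand-in, not the procedure used in the paper.
    """
    matcher = SequenceMatcher(a=def_a, b=def_b, autojunk=False)
    pairs = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        # "replace" marks a span of def_a that corresponds to a
        # different span of def_b between two matching contexts.
        if tag == "replace":
            pairs.append((" ".join(def_a[i1:i2]), " ".join(def_b[j1:j2])))
    return pairs


def precision(judgements):
    """Fraction of extracted pairs that were judged correct."""
    return sum(judgements) / len(judgements)


# Toy definitions of one headword from two hypothetical dictionaries.
definition_a = "to move quickly".split()
definition_b = "to move fast".split()

pairs = candidate_pairs(definition_a, definition_b)
print(pairs)

# Toy manual judgements (True = correct pair) for four extracted pairs.
print(precision([True, True, False, True]))
```

The precision rates reported in the abstract are ratios of this kind: the number of extracted pairs judged correct divided by the number of pairs evaluated (500 in each setting).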

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Murata, M., Kanamaru, T., Isahara, H. (2005). Automatic Synonym Acquisition Based on Matching of Definition Sentences in Multiple Dictionaries. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2005. Lecture Notes in Computer Science, vol 3406. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30586-6_32

  • DOI: https://doi.org/10.1007/978-3-540-30586-6_32

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-24523-0

  • Online ISBN: 978-3-540-30586-6

  • eBook Packages: Computer Science, Computer Science (R0)
