
Automatic Construction of a Japanese Onomatopoeic Dictionary Using Text Data on the WWW

  • Conference paper
Natural Language Processing and Information Systems (NLDB 2006)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 3999)

Abstract

Because new onomatopoeic words are often coined quickly, existing dictionaries tend to lack many of them. Furthermore, onomatopoeic words seldom appear in the collections of newspaper articles that have been used as corpora in natural language processing. In this work, we present a method for automatically acquiring lexical knowledge about Japanese onomatopoeic words from the WWW. Using this method, we automatically constructed an onomatopoeic dictionary containing 5,130 entries. By manually evaluating 487 newly acquired words that were not in the existing dictionary, we found that 266 of them were genuine new onomatopoeic words; when words already in the existing dictionary were regarded as correct, the precision of our automatic acquisition was 83.6%.
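The evaluation figures in the abstract can be related with a short calculation. The following Python sketch is illustrative only: the function names `precision` and `combined_precision` are hypothetical and do not come from the paper. It computes precision over the 487 manually judged new words only. The abstract does not state how many existing-dictionary words entered the 83.6% figure, so `combined_precision` shows just one plausible reading of "regarded as being correct" and is deliberately not called with invented counts.

```python
def precision(correct: int, evaluated: int) -> float:
    """Fraction of evaluated items that were judged correct."""
    if evaluated <= 0:
        raise ValueError("evaluated must be positive")
    return correct / evaluated


def combined_precision(known: int, new_correct: int, new_total: int) -> float:
    """Precision when `known` words (already in an existing dictionary)
    are all counted as correct. This formula is an assumption about how
    the abstract's 83.6% was obtained, not the authors' stated method."""
    return precision(known + new_correct, known + new_total)


# Figures taken from the abstract: 266 of 487 newly acquired words were correct.
new_word_precision = precision(266, 487)
print(f"Precision on newly acquired words: {new_word_precision:.1%}")  # 54.6%
```

Read this way, the 83.6% figure is higher than the precision on new words alone because every word already attested in the existing dictionary is counted as a correct acquisition.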



Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Okumura, M., Okumura, A., Saito, S. (2006). Automatic Construction of a Japanese Onomatopoeic Dictionary Using Text Data on the WWW. In: Kop, C., Fliedl, G., Mayr, H.C., Métais, E. (eds) Natural Language Processing and Information Systems. NLDB 2006. Lecture Notes in Computer Science, vol 3999. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11765448_20

  • DOI: https://doi.org/10.1007/11765448_20

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-34616-6

  • Online ISBN: 978-3-540-34617-3

  • eBook Packages: Computer Science, Computer Science (R0)
