Automatic Construction of a Japanese Onomatopoeic Dictionary Using Text Data on the WWW

  • Manabu Okumura
  • Atsushi Okumura
  • Suguru Saito
Conference paper

DOI: 10.1007/11765448_20

Part of the Lecture Notes in Computer Science book series (LNCS, volume 3999)
Cite this paper as:
Okumura M., Okumura A., Saito S. (2006) Automatic Construction of a Japanese Onomatopoeic Dictionary Using Text Data on the WWW. In: Kop C., Fliedl G., Mayr H.C., Métais E. (eds) Natural Language Processing and Information Systems. NLDB 2006. Lecture Notes in Computer Science, vol 3999. Springer, Berlin, Heidelberg

Abstract

As new onomatopoeic words are often created at short notice, existing dictionaries tend to have an insufficient number of their entries. Furthermore, onomatopoeic words seldom appear in collections of newspaper articles, that have been used as corpora in natural language processing. In this work, we present a method of automatically acquiring lexical knowledge for Japanese onomatopoeic words from the WWW. As a result, we could automatically construct a onomatopoeic dictionary that contained 5,130 entries. By manually evaluating 487 newly acquired words that were not in the existing dictionary, we found that we could acquire 266 new onomatopoeic words, and if words in the existing dictionary were regarded as being correct, precision of our automatic acquisition was 83.6%.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Manabu Okumura
    • 1
  • Atsushi Okumura
    • 2
  • Suguru Saito
    • 1
  1. 1.Tokyo Institute of TechnologyYokohamaJapan
  2. 2.Sony CorporationJapan

Personalised recommendations