Advertisement

Compilation of a Dictionary of Japanese Functional Expressions with Hierarchical Organization

  • Suguru Matsuyoshi
  • Satoshi Sato
  • Takehito Utsuro
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4285)

Abstract

The Japanese language has a lot of functional expressions, which consist of more than one word and behave like a single functional word. A remarkable characteristic of Japanese functional expressions is that each functional expression has many different surface forms. This paper proposes a methodology for compilation of a dictionary of Japanese functional expressions with hierarchical organization. We use a hierarchy with nine abstraction levels: the root node is a dummy node that governs all entries; a node in the first level is a headword in the dictionary; a leaf node corresponds to a surface form of a functional expression. Two or more lists of functional expressions can be integrated into this hierarchy. This hierarchy also provides a way of systematic generation of all different surface forms. We have compiled the dictionary with 292 headwords and 13,958 surface forms, which covers almost all of major functional expressions.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Hyodo, Y., Murakami, Y., Ikeda, T.: A dictionary of long-unit functional words for analyzing bunsetsu. In: Proceedings of the 6th Annual Meeting of the Association for Natural Language Processing, pp. 407–410 (2000) (in Japanese)Google Scholar
  2. 2.
    Jamashii, G. (ed.): Nihongo Bunkei Jiten (Dictionary of Sentence Patterns in Japanese. Kurosio Publisher (1998) (in Japanese)Google Scholar
  3. 3.
    Morita, Y., Matsuki, M.: Nihongo Hyougen Bunkei. NAFL Sensho (Expression Patterns in Japanese), vol. 5. ALC Press Inc. (1989) (in Japanese) Google Scholar
  4. 4.
    Numata, Y.: Toritateshi (Focus Particles). In: Okutsu, K., Numata, Y., Sugimoto, T. (eds.) Iwayuru Nihongo Joshi no Kenkyu (Studies on So-called Particles in Japanese), ch.2, BONJINSHA (1986) (in Japanese)Google Scholar
  5. 5.
    Shudo, K., Narahara, T., Yoshida, S.: Morphological aspect of Japanese language processing. In: Proceedings of the 8th COLING, pp. 1–8 (1980)Google Scholar
  6. 6.
    Shudo, K., Tanabe, T., Takahashi, M., Yoshimura, K.: MWEs as non-propositional content indicators. In: Proceedings of the 2nd ACL Workshop on Multiword Expressions: Integrating Processing (MWE 2004), pp. 32–39 (2004)Google Scholar
  7. 7.
    Tsuchiya, M., Shime, T., Takagi, T., Utsuro, T., Uchimoto, K., Matsuyoshi, S., Sato, S., Nakagawa, S.: Chunking Japanese compound functional expressions by machine learning. In: Proceedings of the EACL 2006 Workshop on Multi-Word-Expressions in a Multilingual Context, pp. 25–32 (2006)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Suguru Matsuyoshi
    • 1
  • Satoshi Sato
    • 1
  • Takehito Utsuro
    • 2
  1. 1.Graduate School of EngineeringNagoya UniversityChikusa, NagoyaJapan
  2. 2.Graduate School of Systems and Information EngineeringUniversity of TsukubaTennodai, TsukubaJapan

Personalised recommendations