Design and Compilation of Syntactically Tagged Corpus of Japanese Statutory Sentences

  • Yasuhiro Ogawa
  • Masayuki Yamada
  • Ryuta Kato
  • Katsuhiko Toyama
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6797)


This paper describes how to analyze Japanese statutory sentences. Statutory sentences contain many technical terms and characteristic structures, but the tag sets designed for general Japanese corpora have no tags for such structures, and ordinary Japanese parsers cannot analyze them correctly. We therefore propose a new design of syntactic tags for statutory sentences and develop a support tool for correcting parser output. We focus on the dependency structure of sentences and present an overview of the design and compilation of a corpus of Japanese statutory sentences.


Keywords: legal corpus, statutory text processing, dependency structure, statute document





Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Yasuhiro Ogawa (1)
  • Masayuki Yamada (1)
  • Ryuta Kato (1)
  • Katsuhiko Toyama (1)
  1. Graduate School of Information Science, Nagoya University, Nagoya, Japan
