Language Resources and Evaluation, Volume 42, Issue 2, pp 117–126

Semi-automatic documentation of an implemented linguistic grammar augmented with a treebank

  • Chikara Hashimoto
  • Francis Bond
  • Takaaki Tanaka
  • Melanie Siegel

Abstract

We have constructed a large-scale, detailed database of Japanese lexical types from a treebank that includes detailed linguistic information. The database helps treebank annotators and grammar developers share precise knowledge about the grammatical status of the words that constitute the treebank, allowing for consistent large-scale treebanking and grammar development. In addition, it clarifies, on the basis of the treebank, which lexical types are needed for precise Japanese NLP. In this paper, we report on the motivation for and methodology of the database construction.

Keywords

Documentation, Lexical types, Linguistic grammar, Treebank

Copyright information

© Springer Science+Business Media B.V. 2008

Authors and Affiliations

  • Chikara Hashimoto (1)
  • Francis Bond (2)
  • Takaaki Tanaka (3)
  • Melanie Siegel (4)

  1. Graduate School of Science and Engineering, Yamagata University, Yamagata, Japan
  2. Computational Linguistics Group, National Institute of Information and Communications Technology, Kyoto, Japan
  3. Machine Translation Research Group, NTT Communication Science Laboratories, Soraku-gun, Japan
  4. Acrolinx GmbH, Berlin, Germany
