Abstract
We have constructed a web-based framework for collaborative multilingual grammar and treebank development in which developers are distributed around the world. It is important for developers of the world-wide collaboration to i) grasp and share the big picture of the grammar and treebank of each language and ii) understand commonalities of languages. Our framework, the Lextype DB, describes lexical types of the grammar and treebank. Lexical types can be seen as detailed parts-of-speech and are the essence for the two important points just mentioned. Information about a lexical type that the Lextype DB provides includes its linguistic characteristics; examples of usage from a treebank; the way it is implemented in a grammar; and correspondences to major computational dictionaries. It consists of a database management system and a web-based interface, and is constructed semi-automatically. Currently, we have applied the Lextype DB to grammars and treebanks of Japanese and English.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bond, F., Nichols, E., Fujita, S., Tanaka, T.: Acquiring an Ontology for a Fundamental Vocabulary. In: 20th International Conference on Computational Linguistics (COLING-2004), Geneva, pp. 1319–1325 (2004)
Baldridge, J., Osborne, M.: Active learning for HPSG parse selection. In: Proceedings of the 7th Conference on Natural Language Learning (2003)
Toutanova, K., Manning, C.D., Flickinger, D., Oepen, S.: Stochastic hpsg parse disambiguation using the redwoods corpus. Research on Language and Computation 3(1), 83–105 (2005)
Bond, F., Copestake, A., Flickinger, D., Oepen, S., Siegel, M.: Open source machine translation with delph-in. In: Proceedings of The 10th Machine Translation Summit (2005)
Flickinger, D.: On building a more efficient grammar by exploiting types. Natural Language Engineering 6(1), 15–28 (2000)
Kim, J.B., Yang, J.: Parsing korean case phenomena in a type-feature structure grammar. In: Gelbukh, A. (ed.) CICLing 2005. LNCS, vol. 3406, pp. 60–72. Springer, Heidelberg (2005)
Crysmann, B.: Relative clause extraposition in German: An efficient and portable implementation. Research on Language and Computation 3(1), 61–82 (2005)
Siegel, M.: HPSG analysis of Japanese. In: Wahlster, W. (ed.) Verbmobil: Foundations of Speech-to-Speech Translation, pp. 265–280. Springer, Heidelberg (2000)
Pollard, C.J., Sag, I.A.: Head-Driven Phrase Structure Grammar. University of Chicago Press, Chicago (1994)
Sag, I.A., Wasow, T., Bender, E.M.: Syntactic Theory: A Formal Introduction, 2nd edn. CSLI Publications, Stanford (2003)
Copestake, A., Flickinger, D., Pollard, C., Sag, I.A.: Minimal Recursion Semantics. An introduction. Research on Language and Computation 3(4), 281–332 (2005)
Bender, E.M., Flickinger, D., Oepen, S.: The grammar matrix: An open-source starter-kit for the rapid development of cross-linguistically consistent broad-coverage precision grammars. In: Proceedings of the Workshop on Grammar Engineering and Evaluation at the 19th International Conference on Computational Linguistics, Taipei, Taiwan, pp. 8–14 (2002)
Sadakane, K., Koizumi, M.: On the nature of the “dative” particle ni in Japanese. Linguistics 33, 5–33 (1995)
Copestake, A.: Implementing Typed Feature Structure Grammars. CSLI Publications, Stanford (2002)
Bond, F., Fujita, S., Hashimoto, C., Nariyama, S., Nichols, E., Ohtani, A., Tanaka, T., Amano, S.: The Hinoki Treebank — A Treebank for Text Understanding. In: Proceedings of the First International Joint Conference of Natural Language Processing, pp. 554–559 (2004)
Siegel, M., Bender, E.M.: Efficient Deep Processing of Japanese. In: Proceedings of the 3rd Workshop on Asian Language Resources and International Standardization, Taipei, Taiwan (2002)
Matsumoto, Y., Kitauchi, A., Yamashita, T., Hirano, Y., Matsuda, H., Takaoka, K., Asahara, M.: Morphological Analysis System ChaSen version 2.2.1 Manual. Nara Institute of Science and Technology (2000)
Kurohashi, S., Nagao, M.: Building a Japanese parsed corpus — while improving the parsing system. In: Abeillé, A. (ed.) Treebanks: Building and Using Parsed Corpora, pp. 249–260. Kluwer Academic Publishers, Dordrecht (2003)
Ikehara, S., Shirai, S., Yokoo, A., Nakaiwa, H.: Toward an MT system without pre-editing – effects of new methods in ALT-J/E–. In: Third Machine Translation Summit: MT Summit III, Washington DC, pp. 101–106 (1991), http://xxx.lanl.gov/abs/cmp-lg/9510008
Breen, J.: Jmdict: a japanese-multilingual dictionary. In: Coling 2004 Workshop on Multilingual Linguistic Resources (2004)
Miyazaki, M., Shirai, S., Ikehara, S.: gengo katēsetsu-ni motozuku nihongo hinshi-no taikēka-to sono kōyō [a Japanesesyntactic category system based on the constructive process theory and its use] (in Japanese). Journal of Natural Language Processing 2(3), 3–25 (1995)
Grishman, R., Macleod, C., Myers, A.: COMLEX syntax: Building a computational lexicon. In: 15th International Conference on Computational Linguistics: COLING 1994, Kyoto, Japan, vol. 1, pp. 268–272 (1994)
Krieger, H.U., Schafer, U.: \(\mathcal{TDL}\) — a type description language for constraint-based grammars. In: Proceedings of the 15th International Conference on Computational Linguistics (1994)
Copestake, A., Lambeau, F., Waldron, B., Bond, F., Flickinger, D., Oepen, S.: A lexicon module for a grammar development environment. In: 4th International Conference on Language Resources and Evaluation (LREC 2004), Lisbon, vol. IV, pp. 1111–1114 (2004)
Oepen, S., Flickinger, D., Toutanova, K., Manning, C.D.: LinGO Redwoods: A Rich and Dynamic Treebank for HPSG. In: Proceedings of The First Workshop on Treebanks and Linguistic Theories, Sozopol, Bulgaria, pp. 139–149 (2002)
Siegel, M.: JACY, A Grammar for Annotating Syntax, Semantics and Pragmatics of Written and Spoken Japanese for NLP Application Purposes. ms (2006)
Butt, M., King, T.H., no, M.E.N., Segond, F.: A Grammar Writer’s Cookbook. CSLI publications, Stanford (1999)
Oepen, S., Bender, E.M., Callmeier, U., Flickinger, D., Siegel, M.: Parallel distributed grammar engineering for practical applications. In: Proceedings of COLING 2002 Workshop on Grammar Engineering and Evaluation, Taipei, Taiwan, pp. 15–21 (2002)
Oepen, S., Flickinger, D., Bond, F.: Towards Holistic Grammar Engineering and Testing. Grafting Treebank Maintenance into the Grammar Revision Cycle. In: Su, K.-Y., Tsujii, J., Lee, J.-H., Kwong, O.Y. (eds.) IJCNLP 2004. LNCS (LNAI), vol. 3248, Springer, Heidelberg (2005)
Dini, L., Mazzini, G.: Hypertextual Grammar Development. In: Computational Environments for Grammar Development and Linguistic Engineering, Madrid, ACL, pp. 24–29 (1997)
Knuth, D.E.: Literate Programming. CSLI Publications, Stanford, CA (1992)
Bender, E.M., Flickinger, D., Good, J., Sag, I.A.: Montage: Leveraging Advances in Grammar Engineering, Linguistic Ontologies, and Mark-up for the Documentation of Underdescribed Languages. In: Proceedings of the Workshop on First Steps for the Documentation of Minority Languages: Computational Linguistic Tools for Morphology, Lexicon and Corpus Compilation, LREC 2004, Lisbon, Portugal (2004)
Ishida, T.: Language grid: An infrastructure for intercultural collaboration. In: IEEE/IPSJ Symposium on Applications and the Internet (SAINT-06), pp. 96–100 (2006) (keynote address)
Fellbaum, C., Vossen, P.T.: Connecting the universal to the specific: Towards the global grid. In: Ishida, T., Fussell, S., Vossen, P.T. (eds.) IWIC 2007. LNCS, vol. 4568, Springer, Heidelberg (2007)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hashimoto, C., Bond, F., Flickinger, D. (2007). The Lextype DB: A Web-Based Framework for Supporting Collaborative Multilingual Grammar and Treebank Development. In: Ishida, T., Fussell, S.R., Vossen, P.T.J.M. (eds) Intercultural Collaboration. IWIC 2007. Lecture Notes in Computer Science, vol 4568. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74000-1_6
Download citation
DOI: https://doi.org/10.1007/978-3-540-74000-1_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73999-9
Online ISBN: 978-3-540-74000-1
eBook Packages: Computer ScienceComputer Science (R0)