Abstract
The paper describes an annotation scheme for English based on Panini’s concept of karakas. We describe how the scheme handles certain constructions in English. By extending the karaka scheme for a fixed word order language, we hope to bring out its advantages as a concept that incorporates some ‘local semantics’. Our comparison with PTB-II and PropBank brings out its intermediary status between a morpho-syntactic and semantic level. Further work can show how this could benefit tasks like semantic role labeling and automatic conversion of existing English treebanks into this scheme.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Begum, R., Husain, S., Dhwaj, A., Sharma, D.M., Bai, L., Sangal, R.: Dependency annotation scheme for Indian languages. In: Proceedings of IJCNLP 2008 (2008)
Bharati, A., Sangal, R., Sharma, D.M., Bai, L.: AnnCorra: Annotating Corpora Guidelines For POS And Chunk Annotation For Indian Languages. Technical Report, Language Technologies Research Centre IIIT, Hyderabad (2006)
Bharati, A., Sangal, R., Sharma, D.M.: Shakti Analyser: SSF Representation (2005), http://shiva.iiit.ac.in/SPSAL2007/ssf-analysis-representation.pdf
Bharati, A., Chaitanya, V., Sangal, R.: Natural Language Processing: A Paninian Perspective. Prentice-Hall of India, New Delhi (1995), http://ltrc.iiit.ac.in/downloads/nlpbook/nlp-panini.pdf
Babko-Malaya, O.: PropBank Annotation Guidelindes (2005), http://verbs.colorado.edu/~mpalmer/projects/ace/PBguidelines.pdf
Ekeklint, S., Nivre, J.: A Dependency-Based Conversion of PropBank. In: Proceedings of FRAME 2007: Building Frame Semantics Resources for Scandinavian and Baltic Languages, pp. 19–25 (2007)
Gildea, D., Palmer, M.: The Necessity of Parsing for Predicate Argument Recognition. In: Proceedings of ACL 2002 (2002)
Hajicova, E.: Prague Dependency Treebank: From Analytic to Tectogrammatical Annotation. In: Proc. TSD 1998 (1998)
Herbst, T.: English Valency Structures - A first sketch. Technical report EESE 2/99 (1999)
Kahane, S.: The Meaning-Text Theory. In: Dependency and Valency. An International Handbook on Contemporary Research. De Gruyter, Berlin (2003)
Kingsbury, P., Palmer, M.: From Treebank to PropBank. In: Proceedings of the 3rd LREC, Las Palmas, Canary Islands, Spain (2002)
Kiparsky., P.: On the Architecture of Panini’s grammar. In: Three lectures delivered at the Hyderabad Conference on the Architecture of Grammar (2002), http://www.stanford.edu/~kiparsky/Papers/hyderabad.pdf
Kroeger., P.: Analyzing Syntax: A lexical functional approach. Cambridge University Press, Cambridge (2004)
Marcus, M., Santorini, B., Marcinkiewicz, M.A.: Building a large annotated corpus of English: The Penn Treebank. In: Computational Linguistics (1993)
Marcus, M., Kim, G., Marcinkiewicz, M., MacIntyre, R., Bies, A., Ferguson, M., Katz, K., Schasberger, B.: The Penn treebank: Annotating predicate argument structure. In: Proceedings of the ARPA Human Language Technology Workshop (1994)
Prasad, R., Dinesh, N., Lee, A., Miltsakaki, E., Robaldo, L., Joshi, A., Webber, B.: The Penn Discourse Treebank 2.0. In: Proceedings of the 6th LREC (2008)
Nivre, J., Hall, J., Nilsson, J., Chanev, A., Eryigit, G., Kübler, S., Marinov, S., Marsi, E.: MaltParser: A language-independent system for data-driven dependency parsing. Natural Language Engineering 13(2), 95–135 (2007)
Rambow, O., Creswell, C., Szekely, R., Taber, H., Walker, M.: A dependency treebank for English. In: Proceedings of the 3rd LREC, Las Palmas, Gran Canaria, Spain (2002)
Rambow, O., Dorr, B., Kucerova, I., Palmer, M.: Automatically Deriving Tectogrammatical Labels from other resources- A comparison of Semantic labels across frameworks. The Prague Bulletin of Mathematical Linguistics 79-80, 23–35 (2003)
Sgall, P., Hajicova, E., Panevova, J.: The meaning of the sentence and its semantic and pragmatic aspects. Reidel, Dordrecht (1986)
Subrahmanyam, P.S.: Pa: ninian Linguistics, Tokyo, Japan: Inst. for the Study of Languages and Cultures of Asia and Africa, Tokyo University of Foreign Studies (1999)
Tesnière, L.: Eléments de Syntaxe Structurale. Klincksiek, Paris (1959)
Yamada, H., Matsumoto, Y.: Statistical dependency analysis with support vector machines. In: Proceedings of IWPT (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Vaidya, A., Husain, S., Mannem, P., Sharma, D.M. (2009). A Karaka Based Annotation Scheme for English. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2009. Lecture Notes in Computer Science, vol 5449. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00382-0_4
Download citation
DOI: https://doi.org/10.1007/978-3-642-00382-0_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00381-3
Online ISBN: 978-3-642-00382-0
eBook Packages: Computer ScienceComputer Science (R0)