A Karaka Based Annotation Scheme for English

Vaidya, Ashwini; Husain, Samar; Mannem, Prashanth; Sharma, Dipti Misra

doi:10.1007/978-3-642-00382-0_4

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 5449)

Included in the following conference series:

International Conference on Intelligent Text Processing and Computational Linguistics

Abstract

The paper describes an annotation scheme for English based on Panini’s concept of karakas. We describe how the scheme handles certain constructions in English. By extending the karaka scheme to a fixed-word-order language, we hope to bring out its advantages as a concept that incorporates some ‘local semantics’. Our comparison with PTB-II and PropBank brings out its intermediate status between the morpho-syntactic and semantic levels. Further work can show how the scheme could benefit tasks such as semantic role labeling and the automatic conversion of existing English treebanks into this scheme.

Author information

Authors and Affiliations

Language Technologies Research Centre, International Institute of Information Technology, Hyderabad, India
Ashwini Vaidya, Samar Husain, Prashanth Mannem & Dipti Misra Sharma

Editor information

Editors and Affiliations

National Polytechnic Institute, Center for Computing Research, 07738, Mexico City, Mexico
Alexander Gelbukh

About this paper

Cite this paper

Vaidya, A., Husain, S., Mannem, P., Sharma, D.M. (2009). A Karaka Based Annotation Scheme for English. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2009. Lecture Notes in Computer Science, vol 5449. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00382-0_4

DOI: https://doi.org/10.1007/978-3-642-00382-0_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00381-3
Online ISBN: 978-3-642-00382-0
eBook Packages: Computer Science, Computer Science (R0)
