Abstract
In this paper we investigate the use of standard natural language processing (NLP) tools and annotation methods for processing linguistic data from ritual science, a field concerned with the structure and variance of rituals. The work is embedded in an interdisciplinary project that addresses this question by applying empirical and quantitative computational linguistic analysis techniques to descriptions of Indian rituals. We present the motivation for and prospects of such a computational approach to ritual structure research and sketch the overall project research plan. In particular, we motivate the choice of frame semantics as a theoretical framework for the semantic analysis of rituals. We discuss the special characteristics of the textual data and examine several domain adaptation strategies for applying standard NLP resources and tools to the ritual domain. We also report on our workflows and methods for semi-automatic semantic annotation, which serves as a basis for the extraction of event chains. We close with some preliminary investigations into how to uncover regularities and differences among rituals.
Acknowledgements
This research has been funded by the German Research Foundation (DFG) through the collaborative research center on ritual dynamics (Sonderforschungsbereich SFB-619, Ritualdynamik).
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Reiter, N. et al. (2011). Adapting NLP Tools and Frame-Semantic Resources for the Semantic Analysis of Ritual Descriptions. In: Sporleder, C., van den Bosch, A., Zervanou, K. (eds) Language Technology for Cultural Heritage. Theory and Applications of Natural Language Processing. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20227-8_10
DOI: https://doi.org/10.1007/978-3-642-20227-8_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20226-1
Online ISBN: 978-3-642-20227-8
eBook Packages: Computer Science (R0)