Abstract
In this paper we describe a system that performs morphological generation and analysis for Pali. We discuss the morphological aspects of the tasks our system performs with emphasis on Pali specific characteristics and difficulties and present insights into how this system is integrated into a technical infrastracture used in research about Pali.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Note: The data structure is specified in JSON format, not XML. This is because our dictionary data is maintained in a NoSQL data base which uses JSON as communication data format.
- 2.
For reasons of easy interoperability with all system components we primarily focus on JSON as output format. As mentioned above the programming interface allows retrieval of XML output as well, which then contains the same data as the JSON data structure above.
References
Aikhenvald, A.Y.: Typological distinctions in word-formation. Lang. Typology Syntactic Description 3, 1–65 (2007)
Alfter, D.: Morphological Analyzer and Generator for Pali. Bachelor’s thesis, Universität Trier (2014)
Bloch, J.: The Formation of the Marathi Language. Motilal Banarsidass, Delhi (1970)
Burrow, T.: The Sanskrit Language. Motilal Banarsidass, Delhi (2001)
Davids, T.R., Stede, W.: Pali-English dictionary. Motilal Banarsidass, Delhi (1993)
Duroiselle, C.: A Practical grammar of the Pali language. 4th edn. (2007). http://www.pratyeka.org/duroiselle/
Hellwig, O.: SanskritTagger: a stochastic lexical and POS tagger for sanskrit. In: Huet, G., Kulkarni, A., Scharf, P. (eds.) Sanskrit Computational Linguistics. LNCS, vol. 5402, pp. 266–277. Springer, Heidelberg (2009)
Huet, G.: Formal structure of sanskrit text: requirements analysis for a mechanical sanskrit processor. In: Huet, G., Kulkarni, A., Scharf, P. (eds.) Sanskrit Computational Linguistics. LNCS, vol. 5402, pp. 162–199. Springer, Heidelberg (2009)
Huet, G.: Sanskrit segmentation. In: XXVIIIth South Asian Languages Analysis Roundtable, University of Denton, Texas (2009). http://yquem.inria.fr/huet/PUBLIC/SALA.pdf
Knauth, J., Alfter, D.: A dictionary data processing environment and its application in algorithmic processing of Pali dictionary data for future NLP tasks. In: Proceedings of the 5th Workshop on South and Southeast Asian NLP, 25th International Conference on Computational Linguistics, pp. 65–73 (2014). http://www.aclweb.org/anthology/W14-5509
Kulkarni, A., Paul, S., Kulkarni, M., Kumar, A., Surtani, N.: Semantic processing of compounds in Indian languages. In: Proceedings of COLING 2012: Technical Papers, pp. 1489–1502 (2012). http://www.aclweb.org/anthology/C12-1091
Kulkarni, A., Shukl, D.: Sanskrit morphological analyser: some issues. Indian Linguist. 70(1–4), 169–177 (2009)
Kumar, A., Mittal, V., Kulkarni, A.: Sanskrit compound processor. In: Jha, G.N. (ed.) Sanskrit Computational Linguistics. LNCS, vol. 6465, pp. 57–69. Springer, Heidelberg (2010)
Vipassana Research Institute: The Pali Tipitaka (2012). www.tipitaka.org
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Alfter, D., Knauth, J. (2015). Morphological Analysis and Generation for Pali. In: Mahlow, C., Piotrowski, M. (eds) Systems and Frameworks for Computational Morphology. SFCM 2015. Communications in Computer and Information Science, vol 537. Springer, Cham. https://doi.org/10.1007/978-3-319-23980-4_4
Download citation
DOI: https://doi.org/10.1007/978-3-319-23980-4_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23978-1
Online ISBN: 978-3-319-23980-4
eBook Packages: Computer ScienceComputer Science (R0)