Abstract
Natural Language Processing, one of the contemporary research area has adopted parsing technologies for various languages across the world for different objectives. In the present task, a new approach has been introduced for classifying the dependency parsed relations for a morphologically rich and free-phrase-ordered Indian language like Bengali. The pair of dependency parsed relations (also referred as kaarakas ‘cases’) are classified based on different features like vibhaktis (inflections), Part-of-Speech (POS), punctuation, gender, number and post-position. It is observed that the consecutive and non-consecutive occurrences of such relations play a vital role in the classification. We employed three different machine-learning classifiers, namely NaiveBayes, Sequential Minimal Optimization (SMO) and Conditional Random Field (CRF) which obtained the average F-Scores of 0.895, 0.869 and 0.697, respectively for classifying relation pairs of three primary kaarakas and one primary vibhakti relation. We have also conducted the error analysis for such primary relations using confusion matrices.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
References
Dhar, A., Chatterji, S., Sarkar, S., Basu, S.: A hybrid dependency parser for Bangla. In: Proceedings of the 10th Workshop on Asian Language Resources, COLING Mumbai, pp. 55–64, India (2012)
Ghosh, A., Bhaskar, P., Das, A., Bandyopadhyay, S.: Dependency parser for Bengali. In: JU System at ICON (2009)
Chatterji, S., Sonare, P., Sarkar, S., Roy, D.: Grammar driven rules for hybrid Bengali dependency parsing. In: Proceedings of ICON 2009 NLP Tools Contest: Indian Language Dependency Parsing, Hyderabad, India (2009)
Das, A., Shee, A., Garain, U.: Evaluation of two Bengali dependency parsers. In: Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages (MTPIL), COLING, pp. 133–142 (2012)
Garain, U., De. S.: Dependency Parsing in Bangla. IGI Global (2013)
Haque, M.N., Khan, M.: Parsing Bangla using LFG. In: Proceedings of Association for Computational Linguistic (1997)
Kosaraju, P., Kesidi, S.R., Ainavolu, V.B.R., Kukkadapu, P.: Experiments on Indian language dependency parsing. In: Proceedings of ICON (2010)
Bharati, A., Sangal, R., Sharma, D.M.: SSF: Shakti Standard Format Guide (2007)
Das, D., Choudhury, M.: Chunker and shallow parser for free word order languages: an approach based on valency theory and feature structures. In: Proceedings of ICON (2004)
Begum, R., Husain, S., Sharma, D.M., Bai, L.: Developing verb frames in Hindi. In: Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC), Marrakech, Morocco (2008)
Chatterji, S., Sarkar, T.M., Sarkar, S., Chakrabory, J.: Kaaraka relations in Bengali. In: Proceedings of 31st All-India Conference of Linguists (AICL), Hyderabad, pp. 33–36, India (2009)
Bharati, R., Sangal, D.M., Bai, L.: AnnCorra: annotating corpora guidelines for POS and chunk annotation for Indian languages. Technical report (TR-LTRC-31), LTRC, IIIT Hyderabad, India (2006)
Ghosh, A., Das, A., Bhaskar, P., Bandyopadhyay, S.: Bengali parsing system. In: ICON NLP Tool Contest (2010)
Rao, P.R.K., Vijay, S.R.R., Vijaykrishna, R., Sobha, L.: A text chunker and hybrid POS tagger for Indian languages. In: Proceedings of IJCAI Workshop on Shallow Parsing for South Asian Languages (2007)
De, S., Dhar, A., Garain, U.: Structure simplification and demand satisfaction approach to dependency parsing in Bangla. In: Proceedings of ICON 2009 NLP Tools Contest: Indian Language Dependency Parsing, Hyderabad, India (2009)
Bandyopadhyay, S., Ekbal, A., Halder, D.: HMM based POS tagger and rule-based chunker for Bengali. In: Proceedings of NLPAI Machine Learning Workshop on Part of Speech and Chunking for Indian Languages (2006)
Das, D., Ekbal, A., Bandyopadhyay, S.: Acquiring verb subcategorization frames in Bengali from corpora. In: Li, W., Mollá-Aliod, D. (eds.) ICCPOL 2009. LNCS, vol. 5459, pp. 386–393. Springer, Heidelberg (2009)
Begum, R., Husain, S., Dhwaj, A., Sharma, D.M., Bai, L., Sangal, R.: Dependency annotation scheme for Indian Languages. In: Proceedings of the Third International Joint Conference on Natural Language Processing (IJCNLP), Hyderabad, India (2008)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Mondal, A., Das, D. (2015). A Supervised Framework for Classifying Dependency Relations from Bengali Shallow Parsed Sentences. In: Prasath, R., Vuppala, A., Kathirvalavakumar, T. (eds) Mining Intelligence and Knowledge Exploration. MIKE 2015. Lecture Notes in Computer Science(), vol 9468. Springer, Cham. https://doi.org/10.1007/978-3-319-26832-3_56
Download citation
DOI: https://doi.org/10.1007/978-3-319-26832-3_56
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26831-6
Online ISBN: 978-3-319-26832-3
eBook Packages: Computer ScienceComputer Science (R0)