Skip to main content

A Supervised Framework for Classifying Dependency Relations from Bengali Shallow Parsed Sentences

  • Conference paper
  • First Online:
Mining Intelligence and Knowledge Exploration (MIKE 2015)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9468))

  • 1755 Accesses

Abstract

Natural Language Processing, one of the contemporary research area has adopted parsing technologies for various languages across the world for different objectives. In the present task, a new approach has been introduced for classifying the dependency parsed relations for a morphologically rich and free-phrase-ordered Indian language like Bengali. The pair of dependency parsed relations (also referred as kaarakas ‘cases’) are classified based on different features like vibhaktis (inflections), Part-of-Speech (POS), punctuation, gender, number and post-position. It is observed that the consecutive and non-consecutive occurrences of such relations play a vital role in the classification. We employed three different machine-learning classifiers, namely NaiveBayes, Sequential Minimal Optimization (SMO) and Conditional Random Field (CRF) which obtained the average F-Scores of 0.895, 0.869 and 0.697, respectively for classifying relation pairs of three primary kaarakas and one primary vibhakti relation. We have also conducted the error analysis for such primary relations using confusion matrices.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://listverse.com/2008/06/26/top-10-most-spoken-languages-in-the-world/.

  2. 2.

    https://en.wikipedia.org/wiki/List_of_languages_by_number_of_native_speakers.

  3. 3.

    http://ltrc.iiit.ac.in/mtpil2012/.

  4. 4.

    www.maltparser.org.

  5. 5.

    http://shiva.iiit.ac.in/SPSAL2007/.

  6. 6.

    www.cs.waikato.ac.nz/ml/weka.

  7. 7.

    nlp.stanford.edu/software/CRF-NER.shtml.

References

  1. Dhar, A., Chatterji, S., Sarkar, S., Basu, S.: A hybrid dependency parser for Bangla. In: Proceedings of the 10th Workshop on Asian Language Resources, COLING Mumbai, pp. 55–64, India (2012)

    Google Scholar 

  2. Ghosh, A., Bhaskar, P., Das, A., Bandyopadhyay, S.: Dependency parser for Bengali. In: JU System at ICON (2009)

    Google Scholar 

  3. Chatterji, S., Sonare, P., Sarkar, S., Roy, D.: Grammar driven rules for hybrid Bengali dependency parsing. In: Proceedings of ICON 2009 NLP Tools Contest: Indian Language Dependency Parsing, Hyderabad, India (2009)

    Google Scholar 

  4. Das, A., Shee, A., Garain, U.: Evaluation of two Bengali dependency parsers. In: Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages (MTPIL), COLING, pp. 133–142 (2012)

    Google Scholar 

  5. Garain, U., De. S.: Dependency Parsing in Bangla. IGI Global (2013)

    Google Scholar 

  6. Haque, M.N., Khan, M.: Parsing Bangla using LFG. In: Proceedings of Association for Computational Linguistic (1997)

    Google Scholar 

  7. Kosaraju, P., Kesidi, S.R., Ainavolu, V.B.R., Kukkadapu, P.: Experiments on Indian language dependency parsing. In: Proceedings of ICON (2010)

    Google Scholar 

  8. Bharati, A., Sangal, R., Sharma, D.M.: SSF: Shakti Standard Format Guide (2007)

    Google Scholar 

  9. Das, D., Choudhury, M.: Chunker and shallow parser for free word order languages: an approach based on valency theory and feature structures. In: Proceedings of ICON (2004)

    Google Scholar 

  10. Begum, R., Husain, S., Sharma, D.M., Bai, L.: Developing verb frames in Hindi. In: Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC), Marrakech, Morocco (2008)

    Google Scholar 

  11. Chatterji, S., Sarkar, T.M., Sarkar, S., Chakrabory, J.: Kaaraka relations in Bengali. In: Proceedings of 31st All-India Conference of Linguists (AICL), Hyderabad, pp. 33–36, India (2009)

    Google Scholar 

  12. Bharati, R., Sangal, D.M., Bai, L.: AnnCorra: annotating corpora guidelines for POS and chunk annotation for Indian languages. Technical report (TR-LTRC-31), LTRC, IIIT Hyderabad, India (2006)

    Google Scholar 

  13. Ghosh, A., Das, A., Bhaskar, P., Bandyopadhyay, S.: Bengali parsing system. In: ICON NLP Tool Contest (2010)

    Google Scholar 

  14. Rao, P.R.K., Vijay, S.R.R., Vijaykrishna, R., Sobha, L.: A text chunker and hybrid POS tagger for Indian languages. In: Proceedings of IJCAI Workshop on Shallow Parsing for South Asian Languages (2007)

    Google Scholar 

  15. De, S., Dhar, A., Garain, U.: Structure simplification and demand satisfaction approach to dependency parsing in Bangla. In: Proceedings of ICON 2009 NLP Tools Contest: Indian Language Dependency Parsing, Hyderabad, India (2009)

    Google Scholar 

  16. Bandyopadhyay, S., Ekbal, A., Halder, D.: HMM based POS tagger and rule-based chunker for Bengali. In: Proceedings of NLPAI Machine Learning Workshop on Part of Speech and Chunking for Indian Languages (2006)

    Google Scholar 

  17. Das, D., Ekbal, A., Bandyopadhyay, S.: Acquiring verb subcategorization frames in Bengali from corpora. In: Li, W., Mollá-Aliod, D. (eds.) ICCPOL 2009. LNCS, vol. 5459, pp. 386–393. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  18. Begum, R., Husain, S., Dhwaj, A., Sharma, D.M., Bai, L., Sangal, R.: Dependency annotation scheme for Indian Languages. In: Proceedings of the Third International Joint Conference on Natural Language Processing (IJCNLP), Hyderabad, India (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dipankar Das .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Mondal, A., Das, D. (2015). A Supervised Framework for Classifying Dependency Relations from Bengali Shallow Parsed Sentences. In: Prasath, R., Vuppala, A., Kathirvalavakumar, T. (eds) Mining Intelligence and Knowledge Exploration. MIKE 2015. Lecture Notes in Computer Science(), vol 9468. Springer, Cham. https://doi.org/10.1007/978-3-319-26832-3_56

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-26832-3_56

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-26831-6

  • Online ISBN: 978-3-319-26832-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics