Advertisement

Impact of Named Entity Recognition on Kannada Documents Classification

  • R. Jayashree
  • Basavaraj S. Anami
  • S. Teju
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 801)

Abstract

Natural language processing in Kannada language is promising research field due to the unavailability of tools and challenges in various aspects such as lack of annotated Kannada corpus. The important aim objective of this paper is to study and understand the impact of named entity recognition (NER) on Kannada text documents classification. Rule based Kannada named entity recognition system is implemented and integrated with Naïve Bayes classifier using a tool for this purpose. Rule based approach is considered for the purpose of experimentation. Another important aspect of this work is the attempt made to improving the classifier performance for Kannada Documents through NER. Comprehensive study is conducted to investigate the impact of Kannada named entity recognition on Kannada text document classification using Naïve Bayes classifier. Experimental results shows classification algorithm produces better results for Kannada text documents with previously recognized named entities.

Keywords

Natural language processing Kannada named entity recognition Rule based approach Text classification 

References

  1. 1.
    Hiremath, P., Shambhavi, B.R.: Approaches to named entity recognition in Indian languages: a study. Int. J. Eng. Adv. Technol. (IJEAT) 3(6), 191–194 (2014)Google Scholar
  2. 2.
    Bhuvaneshwari, C.M.: Rule based methodology for recognition of Kannada named entities. IJLTET 3, 50–59 (2014)Google Scholar
  3. 3.
    Amarappa, S., Sathyanarayana, S.V.: Named entity recognition and classification in Kannada language. Int. J. Electron. Comput. Sci. Eng. Trans. Mach. Learn. Artif. Intell. 2, 281–289 (2012)Google Scholar
  4. 4.
    Alfred, R., Leong, L.C., On, C.K., Anthony, P.: Malay named entity recognition based on rule-based approach. Int. J. Mach. Learn. Comput. 4(3), 300–306 (2014)CrossRefGoogle Scholar
  5. 5.
    Riaz, K.: Rule-based named entity recognition in Urdu. In: Proceedings of the Named Entities Workshop, pp. 126–135 (2010)Google Scholar
  6. 6.
    Srikanth, P., Murthy, K.N.: Named entity recognition for Telugu. In: Proceedings of the IJCNLP-2008 Workshop on NER for South and South East Asian Languages Hyderabad (2008)Google Scholar
  7. 7.
    Mansouri, A., Affendey, L.S., Mamat, A.: Named entity recognition approaches. IJCSNS Int. J. Comput. Sci. Netw. Secur. 8(2), 339–344 (2008)Google Scholar
  8. 8.
    Kaur, K., Gupta, V.: Named entity recognition for Punjabi language. Int. J. Comput. Sci. Inf. Technol. Secur. (IJCSITS) 2(3) (2012)Google Scholar
  9. 9.
    Jayashree, R., Srikanta, M.K., Anami, B.S.: An analysis of sentence level text classification in the Kannada language. In: International Conference of Soft Computing and Pattern Recognition (SoCPaR), pp. 147–151. IEEE (2011)Google Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2018

Authors and Affiliations

  1. 1.PES Institute of TechologyBangaloreIndia

Personalised recommendations