Abstract
Named Entity Recognition (NER) is the process of automatic extraction of Named Entities (NEs) by means of recognizing (finding the entities in a given text) and their classification (assigning a type). A lot of work has been done in English and other languages for NER but very little research has been done for Nepali NER. Nepali language has its own issues and challenges which need to be addressed. In this paper, we present the NE Tagset and features that can be potentially used for Nepali NER. To train the model classifiers like Support Vector Machine (SVM), Multinomial Naïve Bayes and Logistic Regression are used and their precision, recall and F1-score have been calculated. 5 fold cross validation is used to evaluate and validate our model.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bam, S.B., Shahi, T.B.: Named entity recognition for Nepali text using support vector machines. Int. Inf. Manag. 6, 21–29 (2014)
Dey, A.D., Paul, A., Syam Purkayastha, B.: Named entity recognition for Nepali language: a semi hybrid approach. Int. J. Eng. Innov. Technol. (IJEIT) 3(8), 21–25 (2014)
Gayen, V., Sarkar, K.: An HMM based named entity recognition system for Indian languages: The JU System at ICON 2013. Research Gate (2013)
Kaur, D., Gupta, V.: A survey of named entity recognition in English and other Indian languages. Int. J. Comput. Sci. 7(6), 239–245 (2010)
Kaur, Y., Kaur, R.: Named entity recognition (NER) system for Hindi Language using combination of rule based approach and list look up approach. Int. J. Sci. Res. Manag. (IJSRM) 3, 2300–2306 (2015)
Kumar, N., Bhattacharyya, P.: Named entity recognition in Hindi using MEMM. Technical report, IIT Bombay India (2006)
Li, W., Andrew, M.: rapid development of Hindi named entity recognition using conditional random fields and feature induction. ACM TALIP 2, 290–294 (2004)
Pillai, A.S., Sobha, L.:. Named entity recognition for Indian languages: a survey. Int. J. Adv. Res. Comput. Sci. Softw. Eng. (IJARCSSE) 3(11) (2013)
Saha, S.K., Sarkar, S., Mitra, P.: A hybrid feature set maximum entropy Hindi named entity recognition (n.d.)
Srivastava, S., Sanglikar, M., Kothari, D.C.: Named entity recognition for Hindi language: a hybrid approach. Int. J. Comput. Linguist. 2(1), 10–23 (2011)
Vinayak, A., Shreenivas, B., Monik, P., Prabhu Ameya, S.M.: Towards deep learning in Hindi NER: an approach to tackle the labelled data scarcity. In: International Conference on Natural Language Processing (ICON), Varanasi, India, pp. 154–160 (2016)
Gupta, D., Ekbal, A., Bhattacharyya, P.: A deep neural network based approach for entity extraction in code-mixed Indian social media text. In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation, pp. 1762–1767. European Language Resource Association, Miyazaki (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Maharjan, G., Bal, B.K., Regmi, S. (2019). Named Entity Recognition (NER) for Nepali. In: Kravets, A., Groumpos, P., Shcherbakov, M., Kultsova, M. (eds) Creativity in Intelligent Technologies and Data Science. CIT&DS 2019. Communications in Computer and Information Science, vol 1084. Springer, Cham. https://doi.org/10.1007/978-3-030-29750-3_6
Download citation
DOI: https://doi.org/10.1007/978-3-030-29750-3_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-29749-7
Online ISBN: 978-3-030-29750-3
eBook Packages: Computer ScienceComputer Science (R0)