Abstract
A Named Entity Recognition (NER) system extracts information from text and classifies it into categories such as person name, organization, location, date and time, term, designation, and short form. NER is now considered an important component of many natural language processing (NLP) tasks, such as information retrieval, machine translation, information extraction, and question answering. Even at a surface level, identifying the named entities in a document provides a richer analytical framework and supports cross-referencing. NER has been applied to several Arabic script-based languages, such as Arabic, Persian, and Urdu, but no such system has yet been developed for Sindhi. This paper describes the NER problem in the context of the Sindhi language and presents a solution. The system is designed to tag ten different named entity types and uses a rule-based approach. A set of 936 words was used for training and testing, and the system achieved an accuracy of 98.71%.
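To make the rule-based idea concrete, the following is a minimal sketch of how a token-level tagger of this kind might be organized. It is not the authors' implementation: the gazetteer entries, the date pattern, the label names, and the function names are all hypothetical, and Latin-script tokens stand in for Sindhi text purely for readability.

```python
import re

# Hypothetical gazetteers: illustrative entries only, not taken from the paper.
GAZETTEERS = {
    "LOCATION": {"karachi", "hyderabad", "sukkur"},
    "DESIGNATION": {"professor", "minister"},
}

# Hypothetical pattern rule: a d/m/yyyy date or a bare four-digit year.
DATE_PATTERN = re.compile(r"^(\d{1,2}/\d{1,2}/\d{4}|\d{4})$")


def tag_token(token):
    """Return an entity label for one token, or "O" if no rule fires."""
    lowered = token.lower()
    # Rule 1: gazetteer lookup, checked in dictionary order.
    for label, entries in GAZETTEERS.items():
        if lowered in entries:
            return label
    # Rule 2: surface pattern for dates.
    if DATE_PATTERN.match(token):
        return "DATE"
    return "O"


def tag_tokens(tokens):
    """Tag a list of tokens, returning (token, label) pairs."""
    return [(token, tag_token(token)) for token in tokens]


def accuracy(predicted, gold):
    """Percentage of positions where the predicted label equals the gold label."""
    if len(predicted) != len(gold):
        raise ValueError("predicted and gold must have the same length")
    if not gold:
        return 0.0
    correct = sum(p == g for p, g in zip(predicted, gold))
    return 100.0 * correct / len(gold)
```

A word-level accuracy figure like the one reported above would correspond to calling `accuracy` on the system's labels and a hand-annotated gold list of the same length. A real system for Sindhi would additionally need rules that account for the script's lack of capitalization and for multi-word entities, which this sketch does not attempt.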
Copyright information
© 2018 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Jumani, A.K., Memon, M.A., Khoso, F.H., Sanjrani, A.A., Soomro, S. (2018). Named Entity Recognition System for Sindhi Language. In: Miraz, M., Excell, P., Ware, A., Soomro, S., Ali, M. (eds) Emerging Technologies in Computing. iCETiC 2018. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 200. Springer, Cham. https://doi.org/10.1007/978-3-319-95450-9_20
DOI: https://doi.org/10.1007/978-3-319-95450-9_20
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-95449-3
Online ISBN: 978-3-319-95450-9
eBook Packages: Computer Science, Computer Science (R0)