Abstract
With huge improvement of digital connectivity (Wifi, 3G, 4G) and digital devices access to internet has reached in the remotest corners now a days. Rural people can easily access web or apps from PDAs, laptops, smart phones etc. This is an opportunity of the Government to reach to the citizen in large number, get their feedback, associate them in policy decision with e governance without deploying huge man, material or resources. But the Government of multilingual countries face a lot of problem in successful implementation of Government to Citizen (G2C) and Citizen to Government (C2G) governance as the rural people tend and prefer to interact in their native languages. Presenting equal experience over web or app to different language group of speakers is a real challenge. In this research we have sorted out the problems faced by Indo Aryan speaking netizens which is in general also applicable to any language family groups or subgroups. Then we have tried to give probable solutions using Etymology. Etymology is used to correlate the words using their ROOT forms. In 5th century BC Panini wrote Astadhyayi where he depicted sutras or rules- how a word is changed according to person, tense, gender, number etc. Later this book was followed in Western countries also to derive their grammar of comparatively new languages. We have trained our system for automatic root extraction from the surface level or morphed form of words using Panian Grammatical rules. We have tested our system over 10000 bengali Verbs and extracted the root form with 98% accuracy. We are now working to extend the program to successfully lemmatize any words of any language and correlate them by applying those rule sets in Artificial Neural Network.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Panini: Astadhyayi, 6th to 5th centuri BCE
Das, A., Halder, T., Saha, D.: Automatic extraction of Bengali root verbs using Paninian grammar. In: 2nd IEEE International Conference On Recent Trends in Electronics, Information & Communication Technology, Bangaluru, pp. 944–947 (2017)
Mridha, M.F., Saha, A.K., Das, J.K.: New approach of solving semantic ambiguity problem of bangla root words using universal networking language (UNL). In: 3rd International Conference on Informatics, Electronics and Vision, Dhaka, Bangladesh
Choudhury, M., Jalan, V., Sarkar, S., Basu, A.: Evolution, optimization and language change: the case of Bengali verb inflections. In: Proceedings of Ninth Meeting of the ACL Special Interest Group in Computational Morphology and Phonology, Prague, pp. 65–74
Islam, M.S.: Research on Bangla language processing in Bangladesh: progress and challenges. In: 8th International Language & Development Conference, Bangladesh, pp. 23–25
Pal, A.R., Saha, D., Naskar, S., Das, N.S.: Word sense disambiguation in Bengali: a lemmatized system increases the accuracy of the result. In: 2nd International Conference on Recent Trends in Information System, Kolkata, India
Ekbal, A., Haque, R., Bandyopadhyay, S.: Maximum entropy based Bengali part of speech tagging. In: Advances in Natural Language Processing and Applications Research in Computing Science, pp. 67–78
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Das, A., Saha, D. (2018). Improvement of Electronic Governance and Mobile Governance in Multilingual Countries with Digital Etymology Using Sanskrit Grammar. In: Mandal, J., Sinha, D. (eds) Social Transformation – Digital Way. CSI 2018. Communications in Computer and Information Science, vol 836. Springer, Singapore. https://doi.org/10.1007/978-981-13-1343-1_44
Download citation
DOI: https://doi.org/10.1007/978-981-13-1343-1_44
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1342-4
Online ISBN: 978-981-13-1343-1
eBook Packages: Computer ScienceComputer Science (R0)