Abstract
Language is a system of communication used by humans to express and convey thoughts, ideas, emotions, and information. It serves several functions, including facilitating communication, preserving culture and identity, transmitting knowledge, and expressing creativity. The endangerment of a language occurs when it is no longer being passed down from one generation to the next, typically due to factors such as language shift, cultural assimilation, or the dominance of another more widely spoken language. Ge’ez is an ancient Semitic language that traces its roots back to what is now northern Ethiopia and Eritrea. Presently, Ge’ez serves as the primary liturgical language for the Ethiopian and Eritrean Orthodox Tewahedo Church, Ethiopian and Eritrean Catholic Church and the Beta Israel Jewish community of Ethiopia. Ge’ez holds significant importance as it is the precursor of two major Semitic languages in Ethiopia: Amharic, which functions as the country's official working language, with approximately 33% of the population using it as their first language, primarily in the northwest and central regions, and Tigrinya/Tigrigna, predominantly spoken in northern and northeastern Ethiopia. Additionally, the Tigré language, spoken by the Eritrean people, also has its origins in Ge’ez. The fact that the language is not used as a means of everyday communication as it is the liturgical language, furthermore not being easily accessible through the Internet are the reasons for Ge’ez being vulnerable to extinction. The purpose of this paper is to study the role of Natural Language Processing (NLP) to ensure the language does not become extinct and is accessible to a wide range of people throughout the world.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Allen, J. F. (1987). Natural language understanding. https://www.semanticscholar.org/paper/Natural-language-understanding-Allen/ba6797b59c38702725ea0f3e6c141d471164662e
Fgaim. (n.d.). GitHub - fgaim/GeezSwitch-data: A dataset for language identification in five low-resource Ge’ez based languages. GitHub. https://github.com/fgaim/geezswitch-data
Liddy, E. D. (2001). Natural Language Processing. In Encyclopedia of Library and Information Science (2nd ed.). NY. Marcel Decker, Inc. (syr.edu).
Trivedi, K. S. (2023). Fundamentals of Natural Language Processing. In Apress eBooks (pp. 119–180). https://doi.org/10.1007/978-1-4842-9221-1_5
Dey, A. (2022). Natural Language Processing (NLP) : Beginner’s Guide. ResearchGate. https://www.researchgate.net/publication/358210736_Natural_Language_Processing_NLP_Beginner%27s_Guide
Abel, B. (2018). Geez to Amharic Machine Translation. Addis Ababa. http://etd.aau.edu.et/bitstream/handle/123456789/18800/Biruk%20Abel%20%202018.pdf?sequence=1
Maybury, M. T. (2004). New Directions in Question Answering. AAAI Press.
Teshome, W. O. N. D. W. O. S. S. E. N. (2013). Designing Amharic Definitive Question Answering. Designing Amharic Definitive Question Answering.
L. Altunyurt, T. Güngör and Z. Orhan.(2006). “A Composite Approach for Part of Speech Tagging in Turkish.“ In Proceeding of International Scientific Conference on Computer Science, Istanbul, Turkey, 2006.
P. Smit.(2010). Unsupervised, supervised and semi-supervised learning.
GETINET, Y. (2015). Unsupervised Part Of Speech Tagging For Amharic (Doctoral dissertation).
Agizew, T., & Maryam, S. (2015). AMHARIC NAMED ENTITY RECOGNITION [ANER]: USING SUPERVISED APPROACH (Doctoral dissertation).
Borthwick, A. 1999. ‘A Maximum Entropy Approach to Named Entity Recognition’. New York University.
Abate, S. T., Tachbelie, M. Y., & Menzel, W. (2023). Amharic Speech Recognition: Past, present and future. ResearchGate.
Lu, X., Li, S., & Fujimoto, M. (2019). Automatic speech recognition. In SpringerBriefs in computer science (pp. 21–38). Springer Nature. https://doi.org/10.1007/978-981-15-0595-9_2
Automatic text summarization and it’s methods - a review. (2016, January 1). IEEE Conference Publication | IEEE Xplore. https://ieeexplore.ieee.org/abstract/document/7508049
Mekuria, G. T., & Jagtap, A. S. (2017). Automatic amharic text summarization using NLP parser. International Journal of Engineering Trends and Technology (IJETT), 53(1), 52–58.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Mandefro, E.G., Belayhun, E.Z., Assabie, Y., Dejene, A.H., Dash, S.R. (2024). The Role of NLP to Facilitate the Growth of Ge’ez Language. In: Mohanty, S.S., Dash, S.R., Parida, S. (eds) Applying AI-Based Tools and Technologies Towards Revitalization of Indigenous and Endangered Languages. Studies in Computational Intelligence, vol 1148. Springer, Singapore. https://doi.org/10.1007/978-981-97-1987-7_8
Download citation
DOI: https://doi.org/10.1007/978-981-97-1987-7_8
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-1986-0
Online ISBN: 978-981-97-1987-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)