Abstract
Natural Language Processing is a field of computer science, AI and linguistics concerned with the interactions between computers and human (natural) languages. Specifically, computer extracts meaningful information from natural language input and/or producing natural language output. The major task in NLP is machine translation, which automatically translates text from one human language to another by preserving its meaning. This paper proposes new model for Machine-Translation system in which Rule-Based, Dictionary-Based approaches are applied for English-to-Kannada/Telugu Language-Identification and Machine Translation. The proposed method has four steps: first, Analyze and tokenize an English sentence into a string of grammatical nodes second, Map the input pattern with a table of English–Kannada/Telugu sentence patterns, third, Look-up the bilingual-dictionary for the equivalent Kannada/Telugu words, reorder and then generate output sentences and fourth step is to Display the output sentences. The future work will focus on sentence translation by using semantic features to make a more precise translation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Konchady M (2008) Text mining application programming, 3rd edn. Charles River Media, Boston
Homiedan AH (2010) Machine translation.http://faculty.ksu.edu.sa/homiedan/Publications/Machine%20Translation.pdf. Accessed 4 Sep 2011
The Karnataka Official Language Act. Official website of Department of Parliamentary Affairs and Legislation. Government of Karnataka. Retrieved 29 July 2012
http://en.wikipedia.org/wiki/Telugu_language Retrieved 29 July 2012
http://www.ssec.wisc.edu/~tomw/java/unicode.html#x0C80. Retrieved 29 July 2012
Botha G, Zimu V, Barnard E (2007) Text-based language identification for South African languages. Published in South African Institute Of Electrical Engineers vol 98 (4)
Sibun P, Reynar JC (1996) Language identification: examining the issues.http://www.cis.upenn.edu/~nenkova/Courses/cis430/languageIdentification.pdf. Accessed on 10 Jan 2012
Vatanen T, Väyrynen JJ, Virpioja S (2010) Language identification of short text segments with N-gram models
Ahmed B, Cha SH, Tappert C (2004) Language identification from text using N-gram based cumulative frequency addition. In: Proceedings of Student/Faculty Research Day, CSIS, Pace University
Pingali P, Varma V (2006) Hindi and Telugu to English cross language information retrieval at CLEF 2006, 20–22 Sep, Alicante, Spain
Kereto S, Wongchaisuwat C, Poovarawan Y (1993) Machine translation research and development. In: Proceedings of the Symposium on Natural Language processing in Thailand, pp. 167–195
Knowles F (1982) The pivotal role of the dictionaries in a machine translation system. In: Lawson V (ed) Practical experience of machine translation. North-Holland, Amsterdam
Ritchie G, Whitelock (eds) (1985) The lexicon. pp. 225–256
Ballesteros L, Croft WB (1997) Phrasal translation and query expansion techniques for cross-language information retrieval. In: Proceedings of ACM SIGIR Conference 20: 84–91
Reddy MV, Hanumanthappa M (2009) CLIR Project (English to Kannada and Telugu). Available athttp://bangaloreuniversitydictionary//menu.asp
Acknowledgments
I owe my sincere feelings of gratitude to Dr. M. Hanumanthappa, for his valuable guidance and suggestions which helped me a lot to write this paper. This is the major research project entitled Cross-Language Information Retrieval sanctioned to Dr. M. Hanumanthappa, PI-UGC-MH, Department of computer science and applications by the University grant commission. I thank to the UGC for financial assistance. This paper is in continuation of the project carried out at the Bangalore University, Bangalore, India.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer India
About this paper
Cite this paper
Reddy, M.V., Hanumanthappa, M. (2013). Indic Language Machine Translation Tool: English to Kannada/Telugu. In: Swamy, P., Guru, D. (eds) Multimedia Processing, Communication and Computing Applications. Lecture Notes in Electrical Engineering, vol 213. Springer, New Delhi. https://doi.org/10.1007/978-81-322-1143-3_4
Download citation
DOI: https://doi.org/10.1007/978-81-322-1143-3_4
Published:
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-1142-6
Online ISBN: 978-81-322-1143-3
eBook Packages: EngineeringEngineering (R0)