Abstract
The development of spoken language systems for the tribal languages of India is very beneficial to society. The details of the implementation of automatic speech recognition for Galo language, spoken in the northeast Indian state of Arunachal Pradesh, are presented here. A multi-speaker speech database of continuously spoken Galo sentences was specifically created for this purpose. The speech recognition system was implemented using Kaldi, a public domain software toolkit. The automatic speech recognition system recognizes Galo sentences spoken continuously by new speakers with an accuracy of about 80%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
List of notified scheduled tribes, census of India. Online: http://censusindia.gov.in/Tables_Published/SCST/ST%20Lists.pdf
Office of the Registrar General and Census Commissioner India (2011) Statement-1 part-b languages not specified in the eighth schedule (non-scheduled languages). Online: http://censusindia.gov.in/2011Census/Language-2011/Statement-1.pdf
Moseley C (ed.) (2010) Atlas of the world’s languages in danger, 3rd edn. Paris, UNESCO Publishing. Online version: http://www.unesco.org/languages-atlas/en/atlasmap/language-id-988.html
Office of the Registrar General and Census Commissioner, India, State Maps. http://censusindia.gov.in/maps/State_Maps/StateMaps_links/arunachal01.html
Mark William Post (2007) A grammar of Galo. PhD Thesis, La Trobe University
Bhattacharjee U, Sarmah K (2012) A multilingual speech database for speaker recognition. In: 2012 IEEE international conference on signal processing, computing and control. https://doi.org/10.1109/ispcc.2012.6224374
Bhattacharjee U, Sarmah K (2013) Language identification system using MFCC and prosodic features. In: 2013 international conference on intelligent systems and signal processing (ISSP). https://doi.org/10.1109/issp.2013.6526901
Das Gupta SK (1963) An introduction to the Gallong language. Shillong, Northeast Frontier Agency
Sora M, Talukdar J, Talukdar PH (January, 2013) Formant frequency and Cepstral method estimation of Galo phonemes using acoustical cues. Int J Inf Electron Eng 3(1)
Rwbaa I, Post MW, Rwbaa I, Xodu M, Bagra K, Rwbaa B, Rwbaa T, Ado N, Keenaa D (2009) Galo-English dictionary with English-Galo index
Povey D, Ghoshal A, Boulianne G, Burget L, Glembek O, Goel N, Hannemann M, Motlicek P, Qian Y, Schwarz P, Silovsky J, Stemmer G, Vesely K (December, 2011) The Kaldi speech recognition toolkit. In: IEEE 2011 workshop on automatic speech recognition and understanding. IEEE signal processing society. Online: https://publications.idiap.ch/downloads/papers/2012/PoveyASRU20-112011.pdf
Indian Language speech sound label set (ILSL12). Online: https://www.iitm.ac.in/donlab/tts/downloads/cls/clsv2.1.6.pdf
Federico M, Bertoldi N, Cettolo M (2008) IRSTLM: an open source toolkit for handling large scale language models. Proceedings of Interspeech, pp 1618–1621
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Nyodu, K., Vijaya, S. (2020). Automatic Speech Recognition of Galo. In: Mallick, P.K., Meher, P., Majumder, A., Das, S.K. (eds) Electronic Systems and Intelligent Computing. Lecture Notes in Electrical Engineering, vol 686. Springer, Singapore. https://doi.org/10.1007/978-981-15-7031-5_63
Download citation
DOI: https://doi.org/10.1007/978-981-15-7031-5_63
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-7030-8
Online ISBN: 978-981-15-7031-5
eBook Packages: Computer ScienceComputer Science (R0)