Abstract
This paper presents “Google” Lithuanian speech recognition efficiency evaluation research. For the experiment it was chosen method that consists of three parts: (1) to process all voice records without adding any noise; (2) process all voice records with several different types of noise, modified so as to get some predefined signal-to-noise ratio (SNR); (3) after one month reprocess all voice records without any additional noise and to assess improvements in the quality of the speech recognition. It was chosen WER metrics for speech recognition quality assessment. Analyzing the results of the experiment it was observed that the greatest impact on the quality of speech recognition has a SNR and speech type (most recognizable is isolated words, the worst - spontaneous speech). Meanwhile, characteristics such as the gender of the speaker, smooth speech, speech speed, speech volume does not make any significant influence on speech recognition quality.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
The standard deviation is a numerical value used to indicate how widely individuals in a group vary. If individual observations vary greatly from the group mean, the standard deviation is big; and vice versa.
References
Telksnys, A.L., Navickas, G.: Žmonių ir kompiuterių sąveika šnekant. In: Kompiuterininkų dienos - 2015, ISBN: 9789986343134, pp. 185–193. Žara. Vilnius (2015)
Google says its speech recognition technology now has only an 8 % word error rate. http://venturebeat.com/2015/05/28/google-says-its-speech-recognition-technology-now-has-only-an-8-word-error-rate/, 25 Apr. 2016
Maskeliunas, R., Ratkevicius, K., Rudzionis, V.: Some aspects of voice user interfaces development for internet and computer control applications. Elektronika ir elektrotechnika 19(2), 53–56 (2013). ISSN 1392-1215
Rudzionis, V., Ratkevicius, K., Rudzionis, A., Maskeliunas, R., Raskinis, G.: Voice controlled interface for the medical-pharmaceutical information system. In: Skersys, T., Butleris, R., Butkiene, R. (eds.) ICIST 2012. CCIS, vol. 319, pp. 288–296. Springer, Heidelberg (2012). ISBN: 9783642333071
Rudzionis, V., Raskinis, G., Maskeliunas, R., Rudzionis, A., Ratkevicius, K.: Comparative analysis of adapted foreign language and native lithuanian speech recognizers for voice user interface. Elektronika ir elektrotechnika 19(7), 90–93 (2013). ISSN 1392-1215
Rudžionis, V., Ratkevičius, K., Rudžionis, A., Raškinis, G., Maskeliunas, R.: Recognition of voice commands using hybrid approach. In: Skersys, T., Butleris, R., Butkiene, R. (eds.) ICIST 2013. CCIS, vol. 403, pp. 249–260. Springer, Heidelberg (2013)
Rudzionis, V., Raskinis, G., Maskeliunas, R., Rudzionis, A., Ratkevicius, K., Bartisiute, G.: Web services based hybrid recognizer of lithuanian voice commands. Elektronika ir elektrotechnika 20(9), 50–53 (2014). ISSN 1392-1215
Rudžionis, V., Raškinis, G., Ratkevičius, K., Rudžionis, A., Bartišiūtė, G.: Medical – pharmaceutical information system with recognition of Lithuanian voice commands. In: Human language technologies. In: The Baltic Perspective: Proceedings of the 6th International Conference. ISBN: 978161499441, pp. 40–45. IOS Press. Amsterdam (2014)
Bartišiūtė, G., Ratkevičius, K., Paškauskaitė, G.: Hybrid recognition technology for isolated voice commands. In: Information Systems Architecture and Technology: Proceedings of 36th International Conference on Information Systems Architecture and Technology – ISAT 2015 – Part IV, ISBN 978-3-319-28565-8, pp. 207–216 (2016)
Bartišiūtė, G., Paškauskaitė, G., Ratkevičius, K.: Investigation of disease codes recognition accuracy. In: Proceedings of the 9th International Conference on Electrical and Control Technologies, ECT 2014, pp. 60–63 (2014)
Rasymas, T., Rudžionis, V.: Evaluation of methods to combine different speech recognizers. In: Computer Science and Information Systems (FedCSIS), pp. 1043–1047 (2015)
Rasymas, T., Rudžionis, V.: Lithuanian digits recognition by using hybrid approach by combining lithuanian google recognizer and some foreign language recognizers. In: Information and Software Technologies, ISBN 978-3-319-24769-4, pp 449–459 (2015)
Lileikytė, R., Telksnys, A.L.: Metrics based quality estimation of speech recognition features. Informatica Vilnius, Matematikos ir informatikos institutas 24(3), 435–446 (2013). ISSN: 0868-4952
Schalkwyk, J., Beeferman, D., Beaufays, F., Byrne, B., Chelba, C., Cohen, M., Garret, M., Strope, B.: Google Search by Voice: A case study
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Sipavičius, D., Maskeliūnas, R. (2016). “Google” Lithuanian Speech Recognition Efficiency Evaluation Research. In: Dregvaite, G., Damasevicius, R. (eds) Information and Software Technologies. ICIST 2016. Communications in Computer and Information Science, vol 639. Springer, Cham. https://doi.org/10.1007/978-3-319-46254-7_49
Download citation
DOI: https://doi.org/10.1007/978-3-319-46254-7_49
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46253-0
Online ISBN: 978-3-319-46254-7
eBook Packages: Computer ScienceComputer Science (R0)