Skip to main content

“Google” Lithuanian Speech Recognition Efficiency Evaluation Research

  • Conference paper
  • First Online:
Information and Software Technologies (ICIST 2016)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 639))

Included in the following conference series:

  • 1282 Accesses

Abstract

This paper presents “Google” Lithuanian speech recognition efficiency evaluation research. For the experiment it was chosen method that consists of three parts: (1) to process all voice records without adding any noise; (2) process all voice records with several different types of noise, modified so as to get some predefined signal-to-noise ratio (SNR); (3) after one month reprocess all voice records without any additional noise and to assess improvements in the quality of the speech recognition. It was chosen WER metrics for speech recognition quality assessment. Analyzing the results of the experiment it was observed that the greatest impact on the quality of speech recognition has a SNR and speech type (most recognizable is isolated words, the worst - spontaneous speech). Meanwhile, characteristics such as the gender of the speaker, smooth speech, speech speed, speech volume does not make any significant influence on speech recognition quality.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    The standard deviation is a numerical value used to indicate how widely individuals in a group vary. If individual observations vary greatly from the group mean, the standard deviation is big; and vice versa.

References

  1. Telksnys, A.L., Navickas, G.: Žmonių ir kompiuterių sąveika šnekant. In: Kompiuterininkų dienos - 2015, ISBN: 9789986343134, pp. 185–193. Žara. Vilnius (2015)

    Google Scholar 

  2. Google says its speech recognition technology now has only an 8 % word error rate. http://venturebeat.com/2015/05/28/google-says-its-speech-recognition-technology-now-has-only-an-8-word-error-rate/, 25 Apr. 2016

  3. Maskeliunas, R., Ratkevicius, K., Rudzionis, V.: Some aspects of voice user interfaces development for internet and computer control applications. Elektronika ir elektrotechnika 19(2), 53–56 (2013). ISSN 1392-1215

    Article  Google Scholar 

  4. Rudzionis, V., Ratkevicius, K., Rudzionis, A., Maskeliunas, R., Raskinis, G.: Voice controlled interface for the medical-pharmaceutical information system. In: Skersys, T., Butleris, R., Butkiene, R. (eds.) ICIST 2012. CCIS, vol. 319, pp. 288–296. Springer, Heidelberg (2012). ISBN: 9783642333071

    Chapter  Google Scholar 

  5. Rudzionis, V., Raskinis, G., Maskeliunas, R., Rudzionis, A., Ratkevicius, K.: Comparative analysis of adapted foreign language and native lithuanian speech recognizers for voice user interface. Elektronika ir elektrotechnika 19(7), 90–93 (2013). ISSN 1392-1215

    Article  Google Scholar 

  6. Rudžionis, V., Ratkevičius, K., Rudžionis, A., Raškinis, G., Maskeliunas, R.: Recognition of voice commands using hybrid approach. In: Skersys, T., Butleris, R., Butkiene, R. (eds.) ICIST 2013. CCIS, vol. 403, pp. 249–260. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  7. Rudzionis, V., Raskinis, G., Maskeliunas, R., Rudzionis, A., Ratkevicius, K., Bartisiute, G.: Web services based hybrid recognizer of lithuanian voice commands. Elektronika ir elektrotechnika 20(9), 50–53 (2014). ISSN 1392-1215

    Article  Google Scholar 

  8. Rudžionis, V., Raškinis, G., Ratkevičius, K., Rudžionis, A., Bartišiūtė, G.: Medical – pharmaceutical information system with recognition of Lithuanian voice commands. In: Human language technologies. In: The Baltic Perspective: Proceedings of the 6th International Conference. ISBN: 978161499441, pp. 40–45. IOS Press. Amsterdam (2014)

    Google Scholar 

  9. Bartišiūtė, G., Ratkevičius, K., Paškauskaitė, G.: Hybrid recognition technology for isolated voice commands. In: Information Systems Architecture and Technology: Proceedings of 36th International Conference on Information Systems Architecture and Technology – ISAT 2015 – Part IV, ISBN 978-3-319-28565-8, pp. 207–216 (2016)

    Google Scholar 

  10. Bartišiūtė, G., Paškauskaitė, G., Ratkevičius, K.: Investigation of disease codes recognition accuracy. In: Proceedings of the 9th International Conference on Electrical and Control Technologies, ECT 2014, pp. 60–63 (2014)

    Google Scholar 

  11. Rasymas, T., Rudžionis, V.: Evaluation of methods to combine different speech recognizers. In: Computer Science and Information Systems (FedCSIS), pp. 1043–1047 (2015)

    Google Scholar 

  12. Rasymas, T., Rudžionis, V.: Lithuanian digits recognition by using hybrid approach by combining lithuanian google recognizer and some foreign language recognizers. In: Information and Software Technologies, ISBN 978-3-319-24769-4, pp 449–459 (2015)

    Google Scholar 

  13. Lileikytė, R., Telksnys, A.L.: Metrics based quality estimation of speech recognition features. Informatica Vilnius, Matematikos ir informatikos institutas 24(3), 435–446 (2013). ISSN: 0868-4952

    MathSciNet  Google Scholar 

  14. Schalkwyk, J., Beeferman, D., Beaufays, F., Byrne, B., Chelba, C., Cohen, M., Garret, M., Strope, B.: Google Search by Voice: A case study

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Donatas Sipavičius .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Sipavičius, D., Maskeliūnas, R. (2016). “Google” Lithuanian Speech Recognition Efficiency Evaluation Research. In: Dregvaite, G., Damasevicius, R. (eds) Information and Software Technologies. ICIST 2016. Communications in Computer and Information Science, vol 639. Springer, Cham. https://doi.org/10.1007/978-3-319-46254-7_49

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-46254-7_49

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-46253-0

  • Online ISBN: 978-3-319-46254-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics