Skip to main content

Abstract

The paper deals with two elements of the artificial intelligence methods—the natural language processing and machine learning. Hybrid recognition technology for isolated Lithuanian voice commands is described. By the hybrid approach we assume the combination of two different recognition methods to achieve higher recognition accuracy. The method which is based on the machine learning algorithm to combine the recognition results provided by two different recognizers is described. The first recognizer was HTK-based Lithuanian recognizer, the second one—the Spanish language recognizer adapted to the Lithuanian language. The experimental results show that a hybrid decision-making rule learned by “random forest” classifier works with 99.46 % accuracy and exceeds the accuracy of the “blind” decision-making rule (96.12 %). The average hybrid operation accuracy reaches 99.24 %, when the recognizer recognizes voice commands out of 12 known speakers, and is equal to 99.18 %, when it is applied to the unknown speaker.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Suendermann, D., Pieraccini, R.: SLU in commercial and research spoken dialogue systems. In: Tur, G., De Mori, R. (eds.) Spoken Language Understanding, pp. 171–194. Wiley, New York (2011)

    Google Scholar 

  2. Saon, G., Chien, J.-T.: Large-vocabulary continuous speech recognition systems: a look at some recent advances. IEEE Sig. Process. Mag. 29(6), 18–33 (2012)

    Article  Google Scholar 

  3. Kumar, N., Andreou, A.: Heteroscedastic discriminant analysis and reduced rank HMMs for improved speech recognition. Speech Commun. 25(4), 283–297 (1998)

    Article  Google Scholar 

  4. Rabiner, L.: In: Proceedings of IEEEA Tutorial on Hidden Markov Models on Selected Applications in Speech Recognition, vol. 77, no. 2, pp. 257–286, February (1989)

    Google Scholar 

  5. Young, S., Kershaw, D., Odell, J., Ollason, D., Valtchev, V., Woodland, P.: The HTK Book (2000). http://htk.eng.cam.ac.uk/docs/docs.shtml

  6. Rudžionis, V., Raškinis, G., Maskeliūnas, R., Rudžionis, A., Ratkevičius, K., Bartišiūtė, G.: Web services based hybrid recognizer of Lithuanian voice commands. In: Electronics and Electrical Engineering, vol. 20, no. 9, pp. 50–53, Kaunas (2014)

    Google Scholar 

  7. Maskeliūnas, R., Rudžionis, A., Ratkevičius, K., Rudžionis, V.: Investigation of foreign languages models for Lithuanian speech recognition. In: Electronics and Electrical Engineering, no. 3(91), pp. 37–42, Kaunas (2009)

    Google Scholar 

  8. Wang, Y., Wang, H., Gu, Z.-G.: A survey of data mining softwares used for real projects. In: International Workshop on Open-Source Software for Scientific Computation (OSSC), pp. 94–97, Beijing (2011)

    Google Scholar 

  9. Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: an update. SIGKDD Explor. 11(1), 10–18 (2009)

    Article  Google Scholar 

  10. Jovic, A., Brkic, K., Bogunovic, N.: An Overview of free software tools for general data mining. In: 37th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO), pp. 1112–1117, Opatija, Croatia (2014)

    Google Scholar 

  11. Chen, X., Williams, G., Xu, X.: A survey of open source data mining systems. In: Emerging Technologies in Knowledge Discovery and Data Mining, vol. 4819, pp. 3–14. Springer, Berlin (2007)

    Google Scholar 

  12. Wahben, A.H., Al-Radaideh, Q.A., Alkabi, M.N., Shawakfa, E.M.: A comparison study between data mining tools over some classification methods. In: International Journal of Advanced Computer Science and Applications, Special Issue on Artificial Intelligence, vol. 0(3), pp. 18–25 (2011)

    Google Scholar 

  13. Wu, X., Kumar, V., Quinlan, J.R., et al.: Top 10 algorithms in data mining. In: Knowledge and Information Systems, vol. 14, issue 1, pp. 1–37. Springer, Berlin (2007)

    Google Scholar 

  14. Jovic, A., Bogunovic, N.: Feature set extension for heart rate variability analysis by using non-linear, statistical and geometric measures. In: Proceedings of the 31st International Conference on ITI, pp. 35–40 (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gintarė Bartišiūtė .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Bartišiūtė, G., Ratkevičius, K., Paškauskaitė, G. (2016). Hybrid Recognition Technology for Isolated Voice Commands. In: Wilimowska, Z., Borzemski, L., Grzech, A., Świątek, J. (eds) Information Systems Architecture and Technology: Proceedings of 36th International Conference on Information Systems Architecture and Technology – ISAT 2015 – Part IV. Advances in Intelligent Systems and Computing, vol 432. Springer, Cham. https://doi.org/10.1007/978-3-319-28567-2_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-28567-2_18

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-28565-8

  • Online ISBN: 978-3-319-28567-2

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics