Abstract
The paper deals with two elements of the artificial intelligence methods—the natural language processing and machine learning. Hybrid recognition technology for isolated Lithuanian voice commands is described. By the hybrid approach we assume the combination of two different recognition methods to achieve higher recognition accuracy. The method which is based on the machine learning algorithm to combine the recognition results provided by two different recognizers is described. The first recognizer was HTK-based Lithuanian recognizer, the second one—the Spanish language recognizer adapted to the Lithuanian language. The experimental results show that a hybrid decision-making rule learned by “random forest” classifier works with 99.46 % accuracy and exceeds the accuracy of the “blind” decision-making rule (96.12 %). The average hybrid operation accuracy reaches 99.24 %, when the recognizer recognizes voice commands out of 12 known speakers, and is equal to 99.18 %, when it is applied to the unknown speaker.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Suendermann, D., Pieraccini, R.: SLU in commercial and research spoken dialogue systems. In: Tur, G., De Mori, R. (eds.) Spoken Language Understanding, pp. 171–194. Wiley, New York (2011)
Saon, G., Chien, J.-T.: Large-vocabulary continuous speech recognition systems: a look at some recent advances. IEEE Sig. Process. Mag. 29(6), 18–33 (2012)
Kumar, N., Andreou, A.: Heteroscedastic discriminant analysis and reduced rank HMMs for improved speech recognition. Speech Commun. 25(4), 283–297 (1998)
Rabiner, L.: In: Proceedings of IEEEA Tutorial on Hidden Markov Models on Selected Applications in Speech Recognition, vol. 77, no. 2, pp. 257–286, February (1989)
Young, S., Kershaw, D., Odell, J., Ollason, D., Valtchev, V., Woodland, P.: The HTK Book (2000). http://htk.eng.cam.ac.uk/docs/docs.shtml
Rudžionis, V., Raškinis, G., Maskeliūnas, R., Rudžionis, A., Ratkevičius, K., Bartišiūtė, G.: Web services based hybrid recognizer of Lithuanian voice commands. In: Electronics and Electrical Engineering, vol. 20, no. 9, pp. 50–53, Kaunas (2014)
Maskeliūnas, R., Rudžionis, A., Ratkevičius, K., Rudžionis, V.: Investigation of foreign languages models for Lithuanian speech recognition. In: Electronics and Electrical Engineering, no. 3(91), pp. 37–42, Kaunas (2009)
Wang, Y., Wang, H., Gu, Z.-G.: A survey of data mining softwares used for real projects. In: International Workshop on Open-Source Software for Scientific Computation (OSSC), pp. 94–97, Beijing (2011)
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: an update. SIGKDD Explor. 11(1), 10–18 (2009)
Jovic, A., Brkic, K., Bogunovic, N.: An Overview of free software tools for general data mining. In: 37th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO), pp. 1112–1117, Opatija, Croatia (2014)
Chen, X., Williams, G., Xu, X.: A survey of open source data mining systems. In: Emerging Technologies in Knowledge Discovery and Data Mining, vol. 4819, pp. 3–14. Springer, Berlin (2007)
Wahben, A.H., Al-Radaideh, Q.A., Alkabi, M.N., Shawakfa, E.M.: A comparison study between data mining tools over some classification methods. In: International Journal of Advanced Computer Science and Applications, Special Issue on Artificial Intelligence, vol. 0(3), pp. 18–25 (2011)
Wu, X., Kumar, V., Quinlan, J.R., et al.: Top 10 algorithms in data mining. In: Knowledge and Information Systems, vol. 14, issue 1, pp. 1–37. Springer, Berlin (2007)
Jovic, A., Bogunovic, N.: Feature set extension for heart rate variability analysis by using non-linear, statistical and geometric measures. In: Proceedings of the 31st International Conference on ITI, pp. 35–40 (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Bartišiūtė, G., Ratkevičius, K., Paškauskaitė, G. (2016). Hybrid Recognition Technology for Isolated Voice Commands. In: Wilimowska, Z., Borzemski, L., Grzech, A., Świątek, J. (eds) Information Systems Architecture and Technology: Proceedings of 36th International Conference on Information Systems Architecture and Technology – ISAT 2015 – Part IV. Advances in Intelligent Systems and Computing, vol 432. Springer, Cham. https://doi.org/10.1007/978-3-319-28567-2_18
Download citation
DOI: https://doi.org/10.1007/978-3-319-28567-2_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-28565-8
Online ISBN: 978-3-319-28567-2
eBook Packages: EngineeringEngineering (R0)