Speech Keyword Spotting with Rule Based Segmentation

Greibus, Mindaugas; Telksnys, Laimutis

doi:10.1007/978-3-642-41947-8_17

Mindaugas Greibus⁴ &
Laimutis Telksnys⁴

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 403))

Included in the following conference series:

International Conference on Information and Software Technologies

1766 Accesses
2 Citations

Abstract

Speech keyword spotting is a retrieval of all instances of a given keyword in utterances. This paper presents improved template based keyword spotting algorithm. It solves speaker dependent speech segment detection in continuous speech with small vocabulary. The rules based segmentation algorithm allows to extract quasi-syllables. We evaluated the algorithm by experimental with synthetic signals. The algorithm results outperform classical keyword spotting algorithm with experimental data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Maskeliunas, R., Ratkevicius, K., Rudzionis, V.: Some aspects of voice user interfaces development for internet and computer control applications. Electronics and Electrical Engineering 19(2), 53–56 (2013)
Article Google Scholar
Veveo: Conversational interfaces whitepaper. Technical report, Veveo (2012)
Google Scholar
Frinken, V., Fischer, A., Manmatha, R., Bunke, H.: A novel word spotting method based on recurrent neural networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 211–224 (2011)
Google Scholar
Von Zeddelmann, D., Kurth, F., Muller, M.: Perceptual audio features for unsupervised key-phrase detection. In: 2010 IEEE International Conference on Acoustics Speech and Signal Processing, ICASSP, pp. 257–260 (2010)
Google Scholar
Keshet, J., Grangier, D., Bengio, S.: Discriminative keyword spotting. Speech Communication, 317–329 (2009)
Google Scholar
Jansen, A., Niyogi, P.: Point process models for spotting keywords in continuous speech. IEEE Transactions on Audio, Speech, and Language Processing 17(8), 1457–1470 (2009)
Article Google Scholar
Shao, J., Zhao, Q., Zhang, P., Liu, Z., Yan, Y.: A fast fuzzy keyword spotting algorithm based on syllable confusion network. In: Eighth Annual Conference of the International Speech Communication Association, pp. 2405–2408 (2007)
Google Scholar
Ramachandran, R.P., Mammone, R.J.: Modern methods of speech processing, vol. 327. Springer (1995)
Google Scholar
Christiansen, R., n, C.: Detecting and locating key words in continuous speech using linear predictive coding. IEEE Transactions on Acoustics, Speech and Signal Processing 25(5), 361–367 (1977)
Google Scholar
Myers, C., Rabiner, L., Rosenberg, A.: An investigation of the use of dynamic time warping for word spotting and connected speech recognition. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1980, pp. 173–177. IEEE (1980)
Google Scholar
Rohlicek, J.R., Russell, W., Roukos, S., Gish, H.: Continuous hidden markov modeling for speaker-independent word spotting. In: Acoustics, Speech, and Signal Processing 1989, pp. 627–630. IEEE (1989)
Google Scholar
Wilpon, J., Rabiner, L., Lee, C., Goldman, E.: Automatic recognition of keywords in unconstrained speech using hidden markov models. IEEE Transactions on Acoustics, Speech and Signal Processing 38(11), 1870–1878 (1990)
Article Google Scholar
Weintraub, M.: Lvcsr log-likelihood ratio scoring for keyword spotting. In: 1995 International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1995, pp. 297–300 (1995)
Google Scholar
Szoke, I., Schwarz, P., Matejka, P., Burget, L., Karafiát, M., Fapso, M., Cernocky, J.: Comparison of keyword spotting approaches for informal continuous speech. In: Proc. of Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (2005)
Google Scholar
Szöke, I., Schwarz, P., Matějka, P., Burget, L., Karafiát, M., Černocký, J.‘.: Phoneme based acoustics keyword spotting in informal continuous speech. In: Matoušek, V., Mautner, P., Pavelka, T. (eds.) TSD 2005. LNCS (LNAI), vol. 3658, pp. 302–309. Springer, Heidelberg (2005)
Chapter Google Scholar
Laurinciukaite, S., Lipeika, A.: Syllable–phoneme based continuous speech recognition. Electronics and Electrical Engineering 6, 70 (2006)
Google Scholar
Zhang, Y., Glass, J.R.: Unsupervised spoken keyword spotting via segmental dtw on gaussian posteriorgrams. In: Automatic Speech Recognition & Understanding, pp. 398–403. IEEE (2009)
Google Scholar
Aradilla, G., Vepa, J., Bourlard, H.: Using posterior-based features in template matching for speech recognition. In: Int. Conf. on Spoken Language Processing (2006)
Google Scholar
Park, A.S., Glass, J.R.: Unsupervised pattern discovery in speech, vol. 16, pp. 186–197. IEEE (2008)
Google Scholar
Zhang, S., Shuang, Z., Shi, Q., Qin, Y.: Improved mandarin keyword spotting using confusion garbage model. In: 2010 20th International Conference on Pattern Recognition (ICPR), pp. 3700–3703. IEEE (2010)
Google Scholar
Greibus, M., Telksnys, L.: Speech segmentation analysis using synthetic signals. In: Electronics and Electrical Engineering (2012)
Google Scholar
Skripkauskas, M.: Lietuviu snekos signalu segmentavimas kvazifonemomis. In: Informacins Technologijos, pp. 76–81 (2006)
Google Scholar
Horák, P.: Automatic speech segmentation based on alignment with a text-to-speech system. Improvements in Speech Synthesis, pp. 328–338. Wiley Online Library (2002)
Google Scholar
Greibus, M., Telksnys, L.: Rule based speech signal segmentation. Journal of Telecommunications and Information Technology (JTIT) 1, 37–44 (2011)
Google Scholar

Download references

Author information

Authors and Affiliations

Vilnius University Institute of Mathematics and Informatics, Akademijos str., 4, LT-08663, Vilnius, Lithuania
Mindaugas Greibus & Laimutis Telksnys

Authors

Mindaugas Greibus
View author publications
You can also search for this author in PubMed Google Scholar
Laimutis Telksnys
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Kaunas University of Technology, Studentu g. 50-313a, 51368, Kaunas, Lithuania
Tomas Skersys
Centre of Information Systems Design Technologies, Kaunas University of Technology, Studentu st. 50-313a, 51368, Kaunas, Lithuania
Rimantas Butleris
Kaunas University of Technology, Studentu g. 50-309a, 51368, Kaunas, Lithuania
Rita Butkiene

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Greibus, M., Telksnys, L. (2013). Speech Keyword Spotting with Rule Based Segmentation. In: Skersys, T., Butleris, R., Butkiene, R. (eds) Information and Software Technologies. ICIST 2013. Communications in Computer and Information Science, vol 403. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41947-8_17

Download citation

DOI: https://doi.org/10.1007/978-3-642-41947-8_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41946-1
Online ISBN: 978-3-642-41947-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics