Skip to main content

Keyword Spotting Techniques

  • Chapter
  • First Online:
Searching Speech Databases

Part of the book series: SpringerBriefs in Speech Technology ((BRIEFSSPEECHTECH))

Abstract

Keyword spotting (KWS) refers to the spotting and retrieval of predefined keywords from audio database. Different supervised as well as unsupervised approaches have been implemented to do keyword spotting. Keyword spotting is considered to be the first among speech searching. Later, keyword spotting paved the way to Spoken Term Detection (STD) and Query by Example STD (QbE-STD). In the early days, researchers have used HMM for KWS, where the speech data is converted into corresponding text data for text-level matching. But, the latest techniques make use of MLP and DNN for doing search, so that the speech to text conversion is not necessary. All such techniques for keyword spotting are discussed briefly in this chapter.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Salehinejad H, Barfett J, Aarabi P, Valaee S, Colak E, Gray B, Dowdell T (2017) A convolutional neural network for search term detection. In: Proceedings of IEEE 28th annual international symposium on personal, indoor, and mobile radio communications (PIMRC), Montreal, QC, pp 1–6

    Google Scholar 

  2. https://catalog.ldc.upenn.edu/ldc94s13a

  3. Alumäe T, Karakos D, Hartmann W, Hsiao R, Zhang L, Nguyen L, Tsakalidis S, Schwartz, R (2017) The 2016 BBN Georgian telephone speech keyword spotting system. In: ICASSP, pp 5755–5759

    Google Scholar 

  4. Gupta V, Ajmera J, Kumar A, Verma A, (2011) A language independent approach to audio search. In: Proceedings of INTERSPEECH, pp 1125–01128

    Google Scholar 

  5. Barnwal S, Sahni K, Singh R, Raj B (2012) Spectrographic seam patterns for discriminative word spotting. In: Proceedings of ICASSP, pp 4725–4728

    Google Scholar 

  6. Ezzat T, Poggio T (2008) Discriminative word-spotting using ordered spectro-temporal patch features. In: Proceedings of SAPA workshop, in INTERSPEECH, pp 35–40

    Google Scholar 

  7. James DA, Young SJ (1994) A fast lattice-based approach to vocabulary independent word spotting. In: Proceedings of ICASSP, pp 337–380

    Google Scholar 

  8. Bridle JS (1973) An efficient elastic-template method for detecting given words in running speech. In: British acoustical society spring meeting, pp 1–4

    Google Scholar 

  9. Higgins AL, Wohlford RE, Bahler LG (1986) Keyword recognition system using template-concatenation model, European Patent Application, Publication No. 0 177 854

    Google Scholar 

  10. Rose RC, Paul DB (1990) A hidden Markov model based keyword recognition system. In: Proceedings of ICASSP, vol 1, pp 129–132

    Google Scholar 

  11. Weintraub M (1993) Keyword-spotting using SRI’s DECIPHER large-vocabulary speech-recognition system. In: Proceedings of ICASSP, vol 2, pp 463–466

    Google Scholar 

  12. Weintraub M (1995) LVSCR log-likelihood ratio scoring for keyword spotting. In: Proceedings of ICASSP, vol 1, pp 297–300

    Google Scholar 

  13. Iso K, Watanabe T (1990) Speaker-independent word recognition using a neural prediction model. In: Proceedings of ICASSP, pp 441–444

    Chapter  Google Scholar 

  14. Suhardi S, Felbaum K (1997) Wordspotting using a predictive neural model for the telephone speech corpus. In: Proceedings of ICASSP, vol 2, pp 915–918

    Google Scholar 

  15. Thambiratnam K, Sridharan S (2005) Dynamic match phone-lattice searches for very fast and accurate unrestricted vocabulary keyword spotting. In: Proceedings of ICASSP, vol 1, pp 465–468

    Google Scholar 

  16. Garcia A, Gish H (2006) Keyword spotting of arbitrary words using minimal speech resources. In: Proceedings of ICASSP, vol 1, pp 949–952

    Google Scholar 

  17. Audhkhasi K, Verma A (2007) Keyword spotting using modified minimum edit distance measure. In: Proceedings of ICASSP, vol 4, pp 929–932

    Google Scholar 

  18. Xin L, Wang BX (2001) Utterance verification for spontaneous Mandarin speech keyword spotting. In: IEEE international conference on Info-tech and Info-net (ICII), vol 3, pp 397–401

    Google Scholar 

  19. Ou J, Chen K, Wang X, Li Z (2001) Utterance verification of short keywords using hybrid neural-network/HMM approach. In: IEEE international conference on Info-tech and Info-net (ICII), vol 2, pp 671–676

    Google Scholar 

  20. Thambiratnam K, Sridharan S (2003) Isolated word verification using cohort word-level verification. In: EUROSPEECH, pp 905–908

    Google Scholar 

  21. Chen G, Parada C, Heigold G (2014) Small-footprint keyword spotting using deep neural networks. In: Proceedings of ICASSP, pp 4087–4091

    Google Scholar 

  22. Sainath TN, Parada C (2015) Convolutional neural networks for small-footprint keyword spotting. In: INTERSPEECH, pp 1478–1482

    Google Scholar 

  23. Lim H, Kim Y, Kim Y, Kim H (2017) CNN-based bottleneck feature for noise robust query-by-example spoken term detection. In: Proceedings of Asia-Pacific signal and information processing association annual summit and conference (APSIPA ASC), Kuala Lumpur, pp 1278–1281

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2019 The Author(s), under exclusive licence to Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Mary, L., G, D. (2019). Keyword Spotting Techniques. In: Searching Speech Databases. SpringerBriefs in Speech Technology. Springer, Cham. https://doi.org/10.1007/978-3-319-97761-4_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-97761-4_4

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-97760-7

  • Online ISBN: 978-3-319-97761-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics