Abstract
Traditionally, information retrieval techniques create an index of words or terms found in a textual database that can later be rapidly searched by simply entering a query for a desired word or term. Naturally, straightforward application of this technique to non-textual materials is impossible. When it comes to speech, using text-based techniques requires a preprocessing stage of transforming the digital speech signal into some form of text. However, since classical speech recognition engines are not totally accurate, the indexing will necessarily include errors.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Barras C, Allauzen A et al (2002) Transcribing audio-video archives. In: 2002 I.E. international conference on acoustics, speech, and signal processing (ICASSP), IEEE
Burget L, Černocký J et al (2006) Indexing and search methods for spoken document. In: Text, speech and dialogue 4188/2006 of Lecture notes in computer science. pp 351–358
Cardillo PS, Clements M et al (2002) Phonetic searching vs. LVCSR: how to find what you really want in audio archives. Int J Speech Technol 5(1):9–22
Gusfield D (1997) Algorithms on strings, trees and sequences: computer science and computational biology. Cambridge University Press, Cambridge
Har-Lev B, Aharonson V et al (2010) An efficient phoneme distance measure using a lexical tree. In: IEEE 26th convention of electrical and electronics engineering in Israel, IEEE, Eilat
Hermelin D, Landau GM et al (2009) A unified algorithm for accelerating edit-distance computation via text compression. In: 26th international symposium on theoretical aspects of computer science, Feiburg
Pucher M, Türk A et al (2007) Phonetic distance measures for speech recognition vocabulary and grammar optimization. In: 3rd congress of the Alps Adria Acoustics Association, Graz
Thambiratnam K, Sridharan S (2007) Rapid yet accurate speech indexing using dynamic match lattice spotting. IEEE Trans Audio Speech Lang Process 15(1):346–357
Vergyri D, Shafran I et al (2007) The SRI/OGI 2006 spoken term detection system. In: 8th annual conference of the international speech communication association (INTERSPEECH 2007), ISCA, Antwerp
Wallace R, Vogt R et al (2007) A phonetic search approach to the to the 2006 NIST spoken term detection evaluation. In: 8th annual conference of the international speech communication association (INTERSPEECH 2007), ISCA, Antwerp
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2013 The Author(s)
About this chapter
Cite this chapter
Moyal, A., Aharonson, V., Tetariy, E., Gishri, M. (2013). Phonetic Search. In: Phonetic Search Methods for Large Speech Databases. SpringerBriefs in Electrical and Computer Engineering(). Springer, New York, NY. https://doi.org/10.1007/978-1-4614-6489-1_3
Download citation
DOI: https://doi.org/10.1007/978-1-4614-6489-1_3
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-6488-4
Online ISBN: 978-1-4614-6489-1
eBook Packages: EngineeringEngineering (R0)