Application of Zero-Frequency Filtering for Vowel Onset Point Detection
Vowel onset points in speech signals, are the instances where the voicing of the vowels begin. These points serve as important landmarks for the analysis as well as synthesis of speech signals. These landmarks help to identify the information about the behaviour of transition of several different sounds into and out of the vowel regions. In this paper, we propose a new method to identify vowel onset points for a speech signal using the zero frequency filtered (ZFF) speech signal and its frequency spectrum. The ZFF signal is obtained by passing the speech signal through a resonator with central frequency as 0 Hz. Therefore, ZFF signal essentially contains the low pass components of a given speech signal. Vowels are mostly characterized by the significant energy content in the relatively low frequency bands. Significant improvement in VOP detection performance is observed using proposed method compared to existing methods.
KeywordsVowel onset point (VOP) Vowels zero frequency filtering frequency spectrum
Unable to display preview. Download preview PDF.
- 3.Prasanna, S.R.M., Gangashetty, S.V., Yegnanarayana, B.: Significance of vowel onset point for speech analysis. In: Proc. of Int. Conf. Signal Processing and Communications, Bangalore, India, pp. 81–88 (2001)Google Scholar
- 5.Gangashetty, S.V., Sekhar, C.C., Yegnanarayana, B.: Detection of vowel onset points in continuous speech using autoassociative neural network models. In: Proc. Int. Conf. Spoken Language Processing, Jeju Island, Korea, pp. 401–410 (2004)Google Scholar
- 12.Wang, J.-H., Chen, S.-H.: A C/V segmentation algorithm for Mandarin speech using wavelet transforms. In: Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, Phoenix, Arizona, pp. 1261–1264 (1999)Google Scholar
- 13.Gangashetty, S.V., Sekhar, C.C., Yegnanarayana, B.: Extraction of fixed dimension patterns from varying duration segments of consonant-vowel utterances. In: Proc. of IEEE ICISIP, pp. 159–164 (2004)Google Scholar
- 14.Prasanna, S.R.M., Yegnanarayana, B.: Detection of vowel onset point events using excitation source information. In: Proc. of Interspeech, Lisbon, Portugal, pp. 1133–1136 (2005)Google Scholar
- 16.Garofolo, J.S., Lamel, L.F., Fisher, W.M., Fiscus, J.G., Pallett, D.S., Dahlgren, N.L., Zue, V.: TIMIT acoustic-phonetic continuous speech corpus linguistic data consortium. In: Proc. of IEEE ICISIP, Philadelphia, PA (1993)Google Scholar