Zero-Crossing-Based Feature Extraction for Voice Command Systems Using Neck-Microphones

Park, Sang Kyoon; Kil, Rhee Man; Jung, Young-Giu; Han, Mun-Sung

doi:10.1007/978-3-540-72383-7_154


Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 4491)

Included in the following conference series:

International Symposium on Neural Networks


Abstract

This paper presents zero-crossing-based feature extraction for speech recognition using neck-microphones. One approach to noise-robust speech recognition is to use neck-microphones, which are not affected by environmental noise. However, neck-microphones distort the original voice signal significantly, since they capture only the vibrations of the vocal tract. In this context, we consider a new method of enhancing the speech features of neck-microphone signals using zero-crossings. Furthermore, to improve the zero-crossing features, we use the statistics of two adjacent zero-crossing intervals, that is, the statistics of two samples, referred to as second-order statistics. In speech recognition simulations with a neck-microphone voice command system, the suggested method performed better than other approaches using conventional speech features.
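To make the idea of two-sample statistics over adjacent zero-crossing intervals concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not specify how crossings are detected, how intervals are binned, or how the statistics become recognizer features. The function names, the choice of upward crossings only, and the synthetic test tone are all hypothetical.

```python
import math
from collections import Counter


def zero_crossing_intervals(signal):
    """Return sample counts between successive upward zero-crossings.

    Assumption: only negative-to-non-negative transitions are counted.
    """
    crossings = [
        i for i in range(1, len(signal))
        if signal[i - 1] < 0 <= signal[i]
    ]
    return [b - a for a, b in zip(crossings, crossings[1:])]


def first_order_histogram(intervals):
    """Count how often each interval length occurs (one-sample statistics)."""
    return Counter(intervals)


def second_order_histogram(intervals):
    """Count pairs of adjacent interval lengths (two-sample statistics)."""
    return Counter(zip(intervals, intervals[1:]))


# Hypothetical test signal: a 100 Hz sine sampled at 8 kHz for 0.1 s.
# The phase offset keeps the zero-crossings away from exact sample points.
sample_rate = 8000
tone = [
    math.sin(2 * math.pi * 100 * n / sample_rate + 0.5)
    for n in range(800)
]

intervals = zero_crossing_intervals(tone)
pairs = second_order_histogram(intervals)
print(pairs)
```

For a pure tone every interval equals the period in samples, so the pair histogram has a single populated cell. A signal whose interval lengths vary over time would spread counts across several cells, which is the extra information the pairwise statistics carry beyond a plain interval histogram.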



Author information

Authors and Affiliations

Division of Applied Mathematics, Korea Advanced Institute of Science and Technology, 373-1 Guseong-dong, Yuseong-gu, Daejeon 305-701, Korea
Sang Kyoon Park & Rhee Man Kil
Smart Interface Research Team, Electronics and Telecommunications Research Institute, 161 Gajeong-dong, Yuseong-gu, Daejeon 305-700, Korea
Young-Giu Jung & Mun-Sung Han


Editor information

Editors and Affiliations

Department of Electrical and Computer Engineering (M/C 154), University of Illinois at Chicago, 851 S. Morgan Street, 60607-7053, Chicago, IL, USA
Derong Liu
School of Automation, Southeast University, 210096, Nanjing, China
Shumin Fei
Laboratory of Complex Systems, Institute of Automation, Chinese Academy of Sciences, 100080, Beijing, P. R. China
Zeng-Guang Hou
School of Information Science and Engineering, Northeast University, Shenyang, 110004, China
Huaguang Zhang
School of Electrical Engineering, Hohai University, Nanjing, 210098, China
Changyin Sun


About this paper

Cite this paper

Park, S.K., Kil, R.M., Jung, YG., Han, MS. (2007). Zero-Crossing-Based Feature Extraction for Voice Command Systems Using Neck-Microphones. In: Liu, D., Fei, S., Hou, ZG., Zhang, H., Sun, C. (eds) Advances in Neural Networks – ISNN 2007. ISNN 2007. Lecture Notes in Computer Science, vol 4491. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72383-7_154


DOI: https://doi.org/10.1007/978-3-540-72383-7_154
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72382-0
Online ISBN: 978-3-540-72383-7
eBook Packages: Computer Science, Computer Science (R0)
