Voice Activity Detection under Rayleigh distribution

Li, Yu; Chen, Jianming; Tan, Hongzhou

doi:10.1007/s11767-008-0133-5

Voice Activity Detection under Rayleigh distribution

Published: 19 September 2009

Volume 26, pages 552–556, (2009)
Cite this article

Journal of Electronics (China)

Yu Li¹,
Jianming Chen¹ &
Hongzhou Tan^1,2

42 Accesses
1 Citation
3 Altmetric
Explore all metrics

Abstract

This paper presents an improved Voice Activity Detection (VAD) algorithm which uses the Signal-to-Noise Ratio (SNR) measure. We assume that noise Power Spectral Density (PSD) in each spectral bin follows a Rayleigh distribution. Rayleigh distributions with its asymmetric tail characteristics give a better description of the noise PSD distribution than Gaussian distribution. Under this assumption, a new threshold updating expression is derived. Since the analytical integral of the false alarm probability, the threshold updating expression can be represented without the inverse complementary error function and low computational complexity is achieved in our system. Experimental results show that the proposed VAD outperforms or at least is comparable with the VAD scheme presented by Davis under several noise environments and has a lower computational complexity.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

ITU-T Recommendation G.729, Annex B, 1996.
F. Beritelli, S. Casale, and A. Cavallaro. A robust voice activity detector for wireless communications using soft omputing. IEEE Journal on Selected Areas in Communications, 16(1998)9, 1818–1829.
Article Google Scholar
S. Gazor and W. Zhang. A soft voice activity detector based on a Laplacian-Gasussian model. IEEE Transactions on Speech and Audio Processing, 11 (2003)5, 498–505.
Article Google Scholar
J. H. Chang, J. W. Shin, and N. S. Kim. Likelihood ratio test with complex Laplacian model for voice activity detection. Proceedings of Eurospeech, Geneva, Switzerland, 2003, 1065–1068.
J. Sohn, N. S. Kim, and W. Sung. A statistical model-based voice activity detection. IEEE Signal Procesing Letters, 6(1999)1, 1–3.
Article Google Scholar
A. Davis, S. Nordholm, and R. Togneri. Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold. IEEE Transactions on Audio, Speech, and Language Processing, 14(2006)2, 412–424.
Article Google Scholar
C. Breithaupt and R. Martin. Voice activity detection in the DFT domain based on a parametric noise model. Procceeding of the International Workshop of Acoustic Echo and Noise Control (IWAENC), Paris, Sep. 2006.
A. Varga and H. J. M. Steeneken. Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems. Speech Communication, 12(1993)3, 247–251.
Article Google Scholar
J.-H. Chang, N. S. Kim, and S. K. Mitra. Voice activity detection based on multiple statistical models. IEEE Transactions on Signal Processing, 54(2006)6, 1965–1976.
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Information Science and Technology, Sun Yat-Sen University, Guangzhou, 510275, China
Yu Li, Jianming Chen & Hongzhou Tan
Dept of Electronics, Sun Yat-Sen University, Guangzhou, 510275, China
Hongzhou Tan

Authors

Yu Li
View author publications
You can also search for this author in PubMed Google Scholar
Jianming Chen
View author publications
You can also search for this author in PubMed Google Scholar
Hongzhou Tan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hongzhou Tan.

Additional information

Supported by the National Natural Science Foundation of China (No. 60874060).

Communication author: Tan Hongzhou, born in 1965, male, Ph.D., Professor.

About this article

Cite this article

Li, Y., Chen, J. & Tan, H. Voice Activity Detection under Rayleigh distribution. J. Electron.(China) 26, 552–556 (2009). https://doi.org/10.1007/s11767-008-0133-5

Download citation

Received: 31 October 2008
Revised: 24 March 2009
Published: 19 September 2009
Issue Date: July 2009
DOI: https://doi.org/10.1007/s11767-008-0133-5

Key words

CLC index

TN912.3

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Voice Activity Detection under Rayleigh distribution

Abstract

Access this article

Similar content being viewed by others

Robust Voice Activity Detection Using the Combination of Short-Term and Long-Term Spectral Patterns

Improvements on self-adaptive voice activity detector for telephone data

A Novel and Efficient Voice Activity Detector Using Shape Features of Speech Wave

References

Author information

Authors and Affiliations

Corresponding author

Additional information

About this article

Cite this article

Key words

CLC index

Navigation

Voice Activity Detection under Rayleigh distribution

Abstract

Access this article

Similar content being viewed by others

Robust Voice Activity Detection Using the Combination of Short-Term and Long-Term Spectral Patterns

Improvements on self-adaptive voice activity detector for telephone data

A Novel and Efficient Voice Activity Detector Using Shape Features of Speech Wave

References

Author information

Authors and Affiliations

Corresponding author

Additional information

About this article

Cite this article

Share this article

Key words

CLC index

Search

Navigation