Cognitive Computation

, Volume 2, Issue 3, pp 191–198

A Non-Linear VAD for Noisy Environments



This paper deals with non-linear transformations for improving the performance of an entropy-based voice activity detector (VAD). The idea to use a non-linear transformation has already been applied in the field of speech linear prediction, or linear predictive coding, based on source separation techniques, where a score function is added to classical equations in order to take into account the true distribution of the signal. We explore the possibility of estimating the entropy of frames after calculating its score function, instead of using original frames. We observe that if the signal is clean, the estimated entropy is essentially the same; if the signal is noisy, however, the frames transformed using the score function may give entropy that is different in voiced frames as compared to unvoiced ones. Experimental evidence is given to show that this fact enables voice activity detection under high noise, where the simple entropy method fails.


VAD Score function Entropy Speech 

Copyright information

© Springer Science+Business Media, LLC 2010

Authors and Affiliations

  1. 1.Digital Technologies GroupUniversity of VicVicSpain

Personalised recommendations