Monaural voiced speech segregation based on elaborate harmonic grouping strategies
- 50 Downloads
In this paper, an enhanced algorithm based on several elaborate harmonic grouping strategies for monaural voiced speech segregation is proposed. Main achievements of the proposed algorithm lie in three aspects. Firstly, the algorithm classifies the time-frequency (T-F) units into resolved and unresolved ones by carrier-to-envelope energy ratio, which leads to more accurate classification results than by cross-channel correlation. Secondly, resolved T-F units are grouped together according to minimum amplitude principle, which has been verified to exist in human perception, as well as the harmonic principle. Finally, “enhanced” envelope autocorrelation function is employed to detect amplitude modulation rates, which helps a lot in reducing half-frequency error in grouping of unresolved units. Systematic evaluation and comparison show that performance of separation is greatly improved by the proposed algorithm. Specifically, signal-to-noise ratio (SNR) is improved by 0.96 dB compared with that of previous method. Besides, our algorithm is also effective in improving the PESQ score and subjective perception score.
Keywordscomputational auditory scene analysis voiced speech separation harmonistic principle minimum amplitude principle elaborate harmonic grouping strategies
Unable to display preview. Download preview PDF.
Supplementary material, approximately 2.75 MB.
- 3.Benesty J, Makino S, Chen J. Speech Enhancement. New York: Springer, 2005Google Scholar
- 6.Wang D L, Brown G J. Computational auditory scene analysis: principles, algorithms and applications. New Jersey: Wiley-IEEE Press, 2006Google Scholar
- 7.Bregman S. Auditory Scene Analysis. MA: MIT Press, 1990Google Scholar
- 8.Weintraub M. A theory and computational model of monaural auditory sound separation. Dissertation for Doctoral Degree. Palo Alto: Stanford University, 1985Google Scholar
- 9.Cooke M P. Modeling auditory processing and organization. Dissertation for Doctoral Degree. Sheffield: University of Sheffield, 1991Google Scholar