Skip to main content
Log in

A noise cross PSD estimator for dual-microphone speech enhancement based on minimum statistics

  • Published:
Journal of Zhejiang University-SCIENCE A Aims and scope Submit manuscript

Abstract

Some two-microphone noise reduction techniques that work in the frequency domain exploit coherence function between two noisy signals. They have shown good results when noise signals on two sensors are uncorrelated, but their performance decreases with correlated noises. Coherence based methods can be improved when the cross power spectral density (CPSD) of correlated noise signals is available. In this paper, we propose a new method for estimation of the CPSD of the noise, which is based on the minimum tracking technique. Despite the fact that the proposed estimator does not need to implement a voice activity detector (VAD), its performance is comparable to a CPSD estimator that uses an ideal VAD.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Cohen, I., 2002. Noise estimation by minima controlled recursive averaging for robust speech enhancement. IEEE Signal Processing Lett., 9(1):12–15. [doi:10.1109/97.988717]

    Article  Google Scholar 

  • Deller, J.R., Hansen, J.H.L., Proakis, J.G., 2000. Discrete-time Processing of Speech Signals (2nd Ed.). IEEE Press, New York, USA.

    Google Scholar 

  • Derakhshan, N., Ayatollahi, A., Akbari, A., Rahmani, M., 2007. Noise Power Spectrum Estimation Using Time-variant Spectral Smoothing and Low-delay Minima Tracking. SPECOM, p.542–548.

  • Guerin, A., La Bouquin-Jeannes, R., Faucon, G., 2003. A two-sensor noise reduction system: applications for hands-free car kit. EURASIP J. Appl. Signal Processing, 2003(11):1125–1134. [doi:10.1155/S1110865703305098]

    Article  MATH  Google Scholar 

  • ITU-T P.862, 2001. Perceptual Evaluation of Speech Quality (PESQ): An Objective Method for End-to-end Speech Quality Assessment of Narrow-band Telephone Networks and Speech Codecs. International Telecommunication Union, Geneva.

    Google Scholar 

  • La Bouquin-Jeannes, R., Azirani, A.A., Faucon, G., 1997. Enhancement of speech degraded by coherent and incoherent noise using a cross-spectral estimator. IEEE Trans. Speech Audio Processing, 5(5):484–487. [doi:10.1109/89.622576]

    Article  Google Scholar 

  • Martin, R., 1994. Spectral Subtraction Based on Minimum Statistics. 7th European Signal Processing Conf., p.1182–1185.

  • Martin, R., 2001. Noise power spectral density estimation based on optimal smoothing and minimum statistics. IEEE Trans. Speech Audio Processing, 9(5):504–512. [doi:10.1109/89.928915]

    Article  Google Scholar 

  • Martin, R., 2006. Bias compensation methods for minimum statistics noise power spectral density estimation. Signal Processing, 86(6):1215–1229. [doi:10.1016/j.sigpro.2005.07.037]

    Article  MATH  Google Scholar 

  • Zhang, X., Jia, Y., 2005. A Soft Decision Based Noise Cross Power Spectral Density Estimation for Two-microphone Speech Enhancement Systems. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, p.813–816. [doi:10.1109/ICASSP.2005.1415238]

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mohsen Rahmani.

Additional information

Project supported by the Iran Telecommunications Research Center (ITRC)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Rahmani, M., Akbari, A., Ayad, B. et al. A noise cross PSD estimator for dual-microphone speech enhancement based on minimum statistics. J. Zhejiang Univ. Sci. A 10, 805–809 (2009). https://doi.org/10.1631/jzus.A0820390

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1631/jzus.A0820390

Key words

CLC number

Navigation