Speech enhancement with adaptive spectral estimators

  • Y. Sandoval-Ibarra
  • V. H. Diaz-Ramirez
  • V. I. Kober
  • V. N. Karnaukhov
Mathematical Models and Computational Methods


Common statistical estimators for speech enhancement rely on several assumptions about statistical properties of speech and noise processes. In real applications, these assumptions may not be always satisfied due to the effects of a nonstationary environment. In this work, we propose new robust spectral estimators for speech enhancement by incorporation of calculation of rank-order statistics to existing speech enhancement estimators. The proposed estimators are better adapted to nonstationary characteristics of speech signals and noise processes in real environments. By means of computer simulations, we show that the proposed estimators outperform the known estimators in terms of objective criteria of quality.


speech filtering speech enhancement robust estimators 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    P. Loizou, Speech Enhancement: Theory and Practice, 2nd Ed. (Taylor & Francis, Boca Raton, 2013).Google Scholar
  2. 2.
    R. McAulay and M. Malpass, “Speech enhancement using a softdecision noise suppression filter,” IEEE Trans. Acoust., Speech, Signal Process. 28, 137–145 (1980).CrossRefGoogle Scholar
  3. 3.
    N. S. Kim and J.-H. Chang, “Spectral enhancement based on global soft decision,” IEEE Signal Process. Lett. 7 (5), 108–110 (2000).CrossRefGoogle Scholar
  4. 4.
    Y. Ephraim and D. Malah, “Speech enhancement using a minimum mean square error short-time spectral amplitude estimator,” IEEE Trans. Acoust., Speech, Signal Process. 32, 1109–1121 (1984).CrossRefGoogle Scholar
  5. 5.
    G.-H. Ding, T. Huang, and B. Xu, “Suppression of additive noise using a power spectral density MMSE estimator,” IEEE Signal Process. Lett. 11, 585–588 (2005).CrossRefGoogle Scholar
  6. 6.
    R. Martin, “Speech enhancement based on minimum mean-square error estimation and supergaussian priors,” IEEE Trans. Speech Audio Process. 13, 845–856 (2005).CrossRefGoogle Scholar
  7. 7.
    Y. Ephraim and D. Malah, “Speech enhancement using a minimum mean-square error log-spectral amplitude estimator,” IEEE Trans. Acoust., Speech, Signal Process. 33, 443–445 (1985).CrossRefGoogle Scholar
  8. 8.
    T. Lotter and P. Vary, “Speech enhancement by map spectral amplitude estimation using a super-Gaussian speech model,” EURASIP J. Appl. Signal Process., No. 7, 1110–1126 (2005).CrossRefzbMATHGoogle Scholar
  9. 9.
    L. Yaroslavsky and M. Eden, Fundamentals of Digital Optics (Birkhaeuser, Boston, 1996).CrossRefzbMATHGoogle Scholar
  10. 10.
    V. Kober, M. Mozerov, and J. Alvarez-Borrego, “Nonlinear filters with spatially connected neighborhoods,” Opt. Eng. 40, 971–983 (2001).CrossRefGoogle Scholar
  11. 11.
    P. J. Huber, P. C. Pop, and E. M. Ronchetti, Robust Statistics, 2nd Ed., (Wiley, New York, 2009).CrossRefzbMATHGoogle Scholar
  12. 12.
    V. M. Diaz-Ramirez and V. Kober, “Robust speech processing using local adaptive nonlinear filtering,” IET Signal Process. 7, 345–359 (2013).MathSciNetCrossRefGoogle Scholar
  13. 13.
    “IEEE Subcommittee (1969), IEEE recommended practice for speech quality measurements,” IEEE Trans. Audio Electroacoust. 17, 225–246 (1969).Google Scholar
  14. 14.
    “ITU, Perceptual evaluation of speech quality (PESQ). An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs,” ITU-T Recommendation, 862 (2001).Google Scholar
  15. 15.
    C. H. Taal, R. C. Hendriks, R. Heusdens, and J. Jensen, “An algorithm for intelligibility prediction of time-frequency weighted noisy speech,” IEEE Trans. Audio, Speech, Language Process. 19, 2125–2136 (2011).CrossRefGoogle Scholar
  16. 16.
    E. Vincent, R. Gribonval, and C. F‘evotte, “Performance measurement in blind audio source separation,” IEEE Trans. Audio, Speech, Language Process. 14, 1462–1469 (2006).CrossRefGoogle Scholar

Copyright information

© Pleiades Publishing, Inc. 2016

Authors and Affiliations

  1. 1.Instituto Politécnico Nacional-CITEDIMesa de Otay, TijuanaMéxico
  2. 2.Departament of Computer ScienceCICESEZona Playitas, EnsenadaMéxico
  3. 3.Institute for Information Transmission ProblemsRussian Academy of SciencesMoscowRussia

Personalised recommendations