Optimal smoothing for microphone array post-filtering under a combined deterministic-stochastic hybrid model

Hu, Xiaohu; Zheng, Chengshi; Li, Xiaodong

doi:10.1007/s11767-012-0778-y

Published: 08 March 2012

Volume 28, pages 524–530 (2011)

Journal of Electronics (China)

Xiaohu Hu¹, Chengshi Zheng¹,² & Xiaodong Li¹

Abstract

This paper shows the importance of the optimal smoothing scheme in Microphone Array Post-Filtering (MAPF) under a combined Deterministic-Stochastic Hybrid Model (DSHM). We reveal that, without the optimal smoothing scheme, some well-known MAPF algorithms may cause serious speech distortion, which results from oversmoothing the raw periodogram over time. Using a minimum conditional mean square error criterion, we derive the optimal smoothing factor under the DSHM, whose value is determined by the Deterministic-to-Stochastic Ratio (DSR) and the stationarity of the signal. The optimal smoothing scheme is applied to the Transient-Beam-to-Reference-Ratio (TBRR)-based MAPF algorithm, and experimental results show that it performs better in terms of both the Log-Spectral Distance (LSD) and the Perceptual Evaluation of Speech Quality (PESQ).
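The oversmoothing effect mentioned in the abstract can be illustrated with a minimal sketch. It assumes the common first-order recursive smoother P(l) = αP(l−1) + (1−α)|Y(l)|² with a fixed smoothing factor α; it does not reproduce the optimal, DSR-dependent factor derived in the paper, which is not available in this preview. The function name, the toy signal, and the two values of α are hypothetical choices made only for illustration.

```python
def smooth_periodogram(periodogram, alpha):
    """First-order recursive smoothing over time for one frequency bin.

    P[l] = alpha * P[l-1] + (1 - alpha) * raw[l], initialised with raw[0].
    """
    smoothed = []
    prev = periodogram[0]
    for raw in periodogram:
        prev = alpha * prev + (1.0 - alpha) * raw
        smoothed.append(prev)
    return smoothed


# Hypothetical toy data: 50 frames at power 1 (noise floor),
# then a sudden onset held at power 100 for 10 frames.
raw = [1.0] * 50 + [100.0] * 10

heavy = smooth_periodogram(raw, alpha=0.95)  # strong smoothing (assumed value)
light = smooth_periodogram(raw, alpha=0.5)   # weak smoothing (assumed value)

# Compare both estimates five frames after the onset (frame index 54).
print(f"true power: {raw[54]:.1f}")
print(f"alpha=0.95: {heavy[54]:.1f}")
print(f"alpha=0.50: {light[54]:.1f}")
```

With the larger factor the estimate is still far below the true power several frames after the onset, which is the kind of lag that can distort speech onsets; the smaller factor tracks the change quickly but would leave more variance in stationary segments. A signal-dependent factor is one way to balance these two effects.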

Author information

Authors and Affiliations

Communication Acoustics, Institute of Acoustics, Chinese Academy of Sciences, Beijing, 100190, China
Xiaohu Hu, Chengshi Zheng & Xiaodong Li
Institute of Acoustics, Chinese Academy of Sciences, Beijing, 100190, China
Chengshi Zheng

Corresponding author

Correspondence to Chengshi Zheng.

Additional information

Supported by the National Natural Science Foundation of China (No. 61072123).

Corresponding author: Zheng Chengshi, born in 1980, male, Assistant Professor.

About this article

Cite this article

Hu, X., Zheng, C. & Li, X. Optimal smoothing for microphone array post-filtering under a combined deterministic-stochastic hybrid model. J. Electron.(China) 28, 524–530 (2011). https://doi.org/10.1007/s11767-012-0778-y

Received: 26 August 2011
Revised: 11 September 2011
Published: 08 March 2012
Issue Date: November 2011
DOI: https://doi.org/10.1007/s11767-012-0778-y

Key words

CLC index

TN912.16
