Abstract
The concept of informed array processing is introduced in this chapter. The conceptual aim of informed array processing is to incorporate relevant spatial information about the problem to be solved into the design of spatial filters and into the estimation of the second-order statistics that are required to implement the beamformers of Chap. 7. Informed array processing techniques are developed for two important signal enhancement problems: noise reduction and dereverberation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
For brevity, the dependency of all quantities on the discrete time and frequency indices \(\ell \) and \(\nu \) is omitted where possible in the rest of Sect. 9.1.
- 3.
When the sphere is a 2-sphere (i.e., an ordinary sphere), as it is here, the von Mises–Fisher distribution is sometimes referred to simply as a Fisher distribution.
- 4.
A number of audio examples are also available at http://www.ee.ic.ac.uk/sap/sphdoa/.
- 5.
The dependency on time is omitted for brevity. In practice, the signals acquired using a spherical microphone array are usually processed in the short-time Fourier transform domain, as explained in Sect. 3.1, where the discrete frequency index is denoted by \(\nu \).
- 6.
If the real SHT is applied instead of the complex SHT, the complex spherical harmonics \(Y_{lm}\) used throughout this chapter should be replaced with the real spherical harmonics \(R_{lm}\), as defined in Sect. 3.3.
- 7.
It should be noted that this simplified expression is only valid if the filter is applied to mode strength compensated eigenbeams. As a result, it is different to the expression given in Chap. 6.
- 8.
A number of audio examples can be accessed from https://www.audiolabs-erlangen.de/resources/2013-ICASSP-RR.
References
Berge, S., Barrett, N.: High angular resolution planewave expansion. In: Proceedings of the 2nd International Symposium on Ambisonics and Spherical Acoustics (2010)
Bradley, J.S., Sato, H., Picard, M.: On the importance of early reflections for speech in rooms. J. Acoust. Soc. Am. 113(6), 3233–3244 (2003)
Braun, S., Jarrett, D.P., Fischer, J., Habets, E.A.P.: An informed spatial filter for dereverberation in the spherical harmonic domain. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 669–673. Vancouver, Canada (2013)
Cohen, I.: Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging. IEEE Trans. Speech Audio Process. 11(5), 466–475 (2003). doi:10.1109/TSA.2003.811544
Cohen, I.: Multichannel post-filtering in nonstationary noise environments. IEEE Trans. Signal Process. 52(5), 1149–1160 (2004)
Cohen, I., Gannot, S., Berdugo, B.: An integrated real-time beamforming and post filtering system for nonstationary noise environments. EURASIP J. Appl. Signal Process. 11, 1064–1073 (2003)
Cox, H., Zeskind, R.M., Owen, M.M.: Robust adaptive beamforming. IEEE Trans. Acoust., Speech, Signal Process. 35(10), 1365–1376 (1987)
Ephraim, Y., Malah, D.: Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator. IEEE Trans. Acoust., Speech, Signal Process. 32(6), 1109–1121 (1984)
European Broadcasting Union: Sound quality assessment material recordings for subjective tests. http://tech.ebu.ch/publications/sqamcd (1988)
Falk, T., Zheng, C., Chan, W.Y.: A non-intrusive quality and intelligibility measure of reverberant and dereverberated speech. IEEE Trans. Audio, Speech, Lang. Process. 18(7), 1766–1774 (2010)
Fisher, R.: Dispersion on a sphere. Proc. R. Soc. Lond. Ser. A 217(1130), 295–305 (1953). doi:10.1098/rspa.1953.0064
Habets, E.A.P.: Single- and multi-microphone speech dereverberation using spectral enhancement. Ph.D. thesis, Technische Universiteit Eindhoven. http://alexandria.tue.nl/extra2/200710970.pdf (2007)
Habets, E.A.P.: A distortionless subband beamformer for noise reduction in reverberant environments. In: Proceedings of the International Workshop on Acoustic Signal Enhancement (IWAENC), pp. 1–4 (2010)
Habets, E.A.P., Benesty, J., Cohen, I., Gannot, S., Dmochowski, J.: New insights into the MVDR beamformer in room acoustics. IEEE Trans. Audio, Speech, Lang. Process. 18, 158–170 (2010)
Hendriks, R., Gerkmann, T.: Noise correlation matrix estimation for multi-microphone speech enhancement. IEEE Trans. Audio, Speech, Lang. Process. 20(1), 223–233 (2012)
ITU-T: Objective measurement of active speech level (1993)
Jarrett, D.P.: Spherical microphone array impulse response (SMIR) generator. http://www.ee.ic.ac.uk/sap/smirgen/
Jarrett, D.P., Habets, E.A.P.: On the noise reduction performance of a spherical harmonic domain tradeoff beamformer. IEEE Signal Process. Lett. 19(11), 773–776 (2012)
Jarrett, D.P., Habets, E.A.P., Naylor, P.A.: 3D source localization in the spherical harmonic domain using a pseudointensity vector. In: Proceedings of the European Signal Processing Conference (EUSIPCO), pp. 442–446. Aalborg, Denmark (2010)
Jarrett, D.P., Habets, E.A.P., Thomas, M.R.P., Gaubitch, N.D., Naylor, P.A.: Dereverberation performance of rigid and open spherical microphone arrays: theory & simulation. In: Proceedings of the Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA), pp. 145–150. Edinburgh, UK (2011)
Jarrett, D.P., Habets, E.A.P., Benesty, J., Naylor, P.A.: A tradeoff beamformer for noise reduction in the spherical harmonic domain. In: Proceedings of the International Workshop on Acoustic Signal Enhancement (IWAENC). Aachen, Germany (2012)
Jarrett, D.P., Thiergart, O., Habets, E.A.P., Naylor, P.A.: Coherence-based diffuseness estimation in the spherical harmonic domain. In: Proceedings of the IEEE Convention of Electrical & Electronics Engineers in Israel (IEEEI). Eilat, Israel (2012)
Jarrett, D.P., Habets, E.A.P., Naylor, P.A.: Spherical harmonic domain noise reduction using an MVDR beamformer and DOA-based second-order statistics estimation. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 654–658. Vancouver, Canada (2013)
Jarrett, D.P., Taseska, M., Habets, E.A.P., Naylor, P.A.: Noise reduction in the spherical harmonic domain using a tradeoff beamformer and narrowband DOA estimates. IEEE/ACM Trans. Audio, Speech, Lang. Process. 22(5), 965–976 (2014)
Kuttruff, H.: Room Acoustics, 4th edn. Taylor & Francis, London (2000)
Mardia, K.V., Jupp, P.E.: Directional Statistics. Wiley-Blackwell, New York (1999)
McCowan, I., Bourlard, H.: Microphone array post-filter based on noise field coherence. IEEE Trans. Speech Audio Process. 11(6), 709–716 (2003)
Meyer, J., Elko, G.: A highly scalable spherical microphone array based on an orthonormal decomposition of the soundfield. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), vol. 2, pp. 1781–1784 (2002)
Nábělek, A.K., Mason, D.: Effect of noise and reverberation on binaural and monaural word identification by subjects with various audiograms. J Speech Hear. Res. 24, 375–383 (1981)
Naylor, P.A., Gaubitch, N.D. (eds.): Speech Dereverberation. Springer, Heidelberg (2010)
Ngo, K., Spriet, A., Moonen, M., Wouters, J., Jensen, S.: Incorporating the conditional speech presence probability in multi-channel Wiener filter based noise reduction in hearing aids. EURASIP J. Adv. Signal Process. (1) (2009). doi:10.1155/2009/930625 (Special Issue on Digital Signal Processing for Hearing Instruments)
Peled, Y., Rafaely, B.: Study of speech intelligibility in noisy enclosures using spherical microphones arrays. In: Proceedings of the Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA), pp. 160–163 (2008). doi:10.1109/HSCMA.2008.4538711
Peled, Y., Rafaely, B.: Method for dereverberation and noise reduction using spherical microphone arrays. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 113–116 (2010). doi:10.1109/ICASSP.2010.5496154
Peled, Y., Rafaely, B.: Linearly constrained minimum variance method for spherical microphone arrays in a coherent environment. In: Proceedings of the Hands-Free Speech Communication and Microphone Arrays (HSCMA), pp. 86–91 (2011). doi:10.1109/HSCMA.2011.5942416
Pulkki, V.: Spatial sound reproduction with directional audio coding. J. Audio Eng. Soc. 55(6), 503–516 (2007)
Rafaely, B.: Plane-wave decomposition of the pressure on a sphere by spherical convolution. J. Acoust. Soc. Am. 116(4), 2149–2157 (2004)
Rickard, S., Yilmaz, Z.: On the approximate W-disjoint orthogonality of speech. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), vol. 1, pp. 529–532 (2002)
Silzle, A., Geyersberger, S., Brohasga, G., Weninger, D., Leistner, M.: Vision and technique behind the new studios and listening rooms of the Fraunhofer IIS audio laboratory. In: Proceedings of the Audio Engineering Society Convention (2009)
Souden, M., Chen, J., Benesty, J., Affes, S.: Gaussian model-based multichannel speech presence probability. IEEE Trans. Audio, Speech, Lang. Process. 18(5), 1072–1077 (2010)
Souden, M., Chen, J., Benesty, J., Affes, S.: An integrated solution for online multichannel noise tracking and reduction. IEEE Trans. Audio, Speech, Lang. Process. 19(7), 2159–2169 (2011). doi:10.1109/TASL.2011.2118205
Sra, S.: A short note on parameter approximation for von Mises-Fisher distributions: and a fast implementation of \(I_{s}(x)\). Comput. Stat. 27(1), 177–190 (2012). doi:10.1007/s00180-011-0232-x
Steinberg, J.C.: Effects of distortion upon the recognition of speech sounds. J. Acoust. Soc. Am. 1, 35–35 (1929)
Taseska, M., Habets, E.A.P.: MMSE-based blind source extraction in diffuse noise fields using a complex coherence-based a priori SAP estimator. In: Proceedings of the International Workshop on Acoustic Signal Enhancement (IWAENC) (2012)
Teutsch, H.: Wavefield decomposition using microphone arrays and its application to acoustic scene analysis. Ph.D. thesis, Friedrich-Alexander Universität Erlangen-Nürnberg (2005)
Viola, P., Jones, M.J.: Rapid object detection using a boosted cascade of simple features. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 511–518 (2001). doi:10.1109/CVPR.2001.990517
Wu, P.K.T., Epain, N., Jin, C.: A dereverberation algorithm for spherical microphone arrays using compressed sensing techniques. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4053–4056 (2012)
Yan, S., Sun, H., Svensson, U.P., Ma, X., Hovem, J.M.: Optimal modal beamforming for spherical microphone arrays. IEEE Trans. Audio, Speech, Lang. Process. 19(2), 361–371 (2011). doi:10.1109/TASL.2010.2047815
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2017 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Jarrett, D.P., Habets, E.A.P., Naylor, P.A. (2017). Informed Array Processing. In: Theory and Applications of Spherical Microphone Array Processing. Springer Topics in Signal Processing, vol 9. Springer, Cham. https://doi.org/10.1007/978-3-319-42211-4_9
Download citation
DOI: https://doi.org/10.1007/978-3-319-42211-4_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42209-1
Online ISBN: 978-3-319-42211-4
eBook Packages: EngineeringEngineering (R0)