Informed Array Processing

Jarrett, Daniel P.; Habets, Emanuël A. P.; Naylor, Patrick A.

doi:10.1007/978-3-319-42211-4_9

Daniel P. Jarrett⁶,
Emanuël A. P. Habets⁷ &
Patrick A. Naylor⁸

Part of the book series: Springer Topics in Signal Processing ((STSP,volume 9))

1699 Accesses

Abstract

The concept of informed array processing is introduced in this chapter. The conceptual aim of informed array processing is to incorporate relevant spatial information about the problem to be solved into the design of spatial filters and into the estimation of the second-order statistics that are required to implement the beamformers of Chap. 7. Informed array processing techniques are developed for two important signal enhancement problems: noise reduction and dereverberation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 109.00; Price excludes VAT (USA)

Softcover Book: USD 139.99; Price excludes VAT (USA)

Hardcover Book: USD 139.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
For brevity, the dependency of all quantities on \(\ell \) is omitted throughout Sects. 9.1.1 and 9.1.2.
2.
For brevity, the dependency of all quantities on the discrete time and frequency indices \(\ell \) and \(\nu \) is omitted where possible in the rest of Sect. 9.1.
3.
When the sphere is a 2-sphere (i.e., an ordinary sphere), as it is here, the von Mises–Fisher distribution is sometimes referred to simply as a Fisher distribution.
4.
A number of audio examples are also available at http://www.ee.ic.ac.uk/sap/sphdoa/.
5.
The dependency on time is omitted for brevity. In practice, the signals acquired using a spherical microphone array are usually processed in the short-time Fourier transform domain, as explained in Sect. 3.1, where the discrete frequency index is denoted by \(\nu \).
6.
If the real SHT is applied instead of the complex SHT, the complex spherical harmonics \(Y_{lm}\) used throughout this chapter should be replaced with the real spherical harmonics \(R_{lm}\), as defined in Sect. 3.3.
7.
It should be noted that this simplified expression is only valid if the filter is applied to mode strength compensated eigenbeams. As a result, it is different to the expression given in Chap. 6.
8.
A number of audio examples can be accessed from https://www.audiolabs-erlangen.de/resources/2013-ICASSP-RR.

References

Berge, S., Barrett, N.: High angular resolution planewave expansion. In: Proceedings of the 2nd International Symposium on Ambisonics and Spherical Acoustics (2010)
Google Scholar
Bradley, J.S., Sato, H., Picard, M.: On the importance of early reflections for speech in rooms. J. Acoust. Soc. Am. 113(6), 3233–3244 (2003)
Article Google Scholar
Braun, S., Jarrett, D.P., Fischer, J., Habets, E.A.P.: An informed spatial filter for dereverberation in the spherical harmonic domain. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 669–673. Vancouver, Canada (2013)
Google Scholar
Cohen, I.: Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging. IEEE Trans. Speech Audio Process. 11(5), 466–475 (2003). doi:10.1109/TSA.2003.811544
Article Google Scholar
Cohen, I.: Multichannel post-filtering in nonstationary noise environments. IEEE Trans. Signal Process. 52(5), 1149–1160 (2004)
Article MathSciNet Google Scholar
Cohen, I., Gannot, S., Berdugo, B.: An integrated real-time beamforming and post filtering system for nonstationary noise environments. EURASIP J. Appl. Signal Process. 11, 1064–1073 (2003)
Article MATH Google Scholar
Cox, H., Zeskind, R.M., Owen, M.M.: Robust adaptive beamforming. IEEE Trans. Acoust., Speech, Signal Process. 35(10), 1365–1376 (1987)
Article Google Scholar
Ephraim, Y., Malah, D.: Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator. IEEE Trans. Acoust., Speech, Signal Process. 32(6), 1109–1121 (1984)
Article Google Scholar
European Broadcasting Union: Sound quality assessment material recordings for subjective tests. http://tech.ebu.ch/publications/sqamcd (1988)
Falk, T., Zheng, C., Chan, W.Y.: A non-intrusive quality and intelligibility measure of reverberant and dereverberated speech. IEEE Trans. Audio, Speech, Lang. Process. 18(7), 1766–1774 (2010)
Article Google Scholar
Fisher, R.: Dispersion on a sphere. Proc. R. Soc. Lond. Ser. A 217(1130), 295–305 (1953). doi:10.1098/rspa.1953.0064
Article MathSciNet MATH Google Scholar
Habets, E.A.P.: Single- and multi-microphone speech dereverberation using spectral enhancement. Ph.D. thesis, Technische Universiteit Eindhoven. http://alexandria.tue.nl/extra2/200710970.pdf (2007)
Habets, E.A.P.: A distortionless subband beamformer for noise reduction in reverberant environments. In: Proceedings of the International Workshop on Acoustic Signal Enhancement (IWAENC), pp. 1–4 (2010)
Google Scholar
Habets, E.A.P., Benesty, J., Cohen, I., Gannot, S., Dmochowski, J.: New insights into the MVDR beamformer in room acoustics. IEEE Trans. Audio, Speech, Lang. Process. 18, 158–170 (2010)
Google Scholar
Hendriks, R., Gerkmann, T.: Noise correlation matrix estimation for multi-microphone speech enhancement. IEEE Trans. Audio, Speech, Lang. Process. 20(1), 223–233 (2012)
Google Scholar
ITU-T: Objective measurement of active speech level (1993)
Google Scholar
Jarrett, D.P.: Spherical microphone array impulse response (SMIR) generator. http://www.ee.ic.ac.uk/sap/smirgen/
Jarrett, D.P., Habets, E.A.P.: On the noise reduction performance of a spherical harmonic domain tradeoff beamformer. IEEE Signal Process. Lett. 19(11), 773–776 (2012)
Article Google Scholar
Jarrett, D.P., Habets, E.A.P., Naylor, P.A.: 3D source localization in the spherical harmonic domain using a pseudointensity vector. In: Proceedings of the European Signal Processing Conference (EUSIPCO), pp. 442–446. Aalborg, Denmark (2010)
Google Scholar
Jarrett, D.P., Habets, E.A.P., Thomas, M.R.P., Gaubitch, N.D., Naylor, P.A.: Dereverberation performance of rigid and open spherical microphone arrays: theory & simulation. In: Proceedings of the Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA), pp. 145–150. Edinburgh, UK (2011)
Google Scholar
Jarrett, D.P., Habets, E.A.P., Benesty, J., Naylor, P.A.: A tradeoff beamformer for noise reduction in the spherical harmonic domain. In: Proceedings of the International Workshop on Acoustic Signal Enhancement (IWAENC). Aachen, Germany (2012)
Google Scholar
Jarrett, D.P., Thiergart, O., Habets, E.A.P., Naylor, P.A.: Coherence-based diffuseness estimation in the spherical harmonic domain. In: Proceedings of the IEEE Convention of Electrical & Electronics Engineers in Israel (IEEEI). Eilat, Israel (2012)
Google Scholar
Jarrett, D.P., Habets, E.A.P., Naylor, P.A.: Spherical harmonic domain noise reduction using an MVDR beamformer and DOA-based second-order statistics estimation. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 654–658. Vancouver, Canada (2013)
Google Scholar
Jarrett, D.P., Taseska, M., Habets, E.A.P., Naylor, P.A.: Noise reduction in the spherical harmonic domain using a tradeoff beamformer and narrowband DOA estimates. IEEE/ACM Trans. Audio, Speech, Lang. Process. 22(5), 965–976 (2014)
Google Scholar
Kuttruff, H.: Room Acoustics, 4th edn. Taylor & Francis, London (2000)
Google Scholar
Mardia, K.V., Jupp, P.E.: Directional Statistics. Wiley-Blackwell, New York (1999)
Google Scholar
McCowan, I., Bourlard, H.: Microphone array post-filter based on noise field coherence. IEEE Trans. Speech Audio Process. 11(6), 709–716 (2003)
Article Google Scholar
Meyer, J., Elko, G.: A highly scalable spherical microphone array based on an orthonormal decomposition of the soundfield. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), vol. 2, pp. 1781–1784 (2002)
Google Scholar
Nábělek, A.K., Mason, D.: Effect of noise and reverberation on binaural and monaural word identification by subjects with various audiograms. J Speech Hear. Res. 24, 375–383 (1981)
Article Google Scholar
Naylor, P.A., Gaubitch, N.D. (eds.): Speech Dereverberation. Springer, Heidelberg (2010)
Google Scholar
Ngo, K., Spriet, A., Moonen, M., Wouters, J., Jensen, S.: Incorporating the conditional speech presence probability in multi-channel Wiener filter based noise reduction in hearing aids. EURASIP J. Adv. Signal Process. (1) (2009). doi:10.1155/2009/930625 (Special Issue on Digital Signal Processing for Hearing Instruments)
Peled, Y., Rafaely, B.: Study of speech intelligibility in noisy enclosures using spherical microphones arrays. In: Proceedings of the Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA), pp. 160–163 (2008). doi:10.1109/HSCMA.2008.4538711
Peled, Y., Rafaely, B.: Method for dereverberation and noise reduction using spherical microphone arrays. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 113–116 (2010). doi:10.1109/ICASSP.2010.5496154
Peled, Y., Rafaely, B.: Linearly constrained minimum variance method for spherical microphone arrays in a coherent environment. In: Proceedings of the Hands-Free Speech Communication and Microphone Arrays (HSCMA), pp. 86–91 (2011). doi:10.1109/HSCMA.2011.5942416
Pulkki, V.: Spatial sound reproduction with directional audio coding. J. Audio Eng. Soc. 55(6), 503–516 (2007)
Google Scholar
Rafaely, B.: Plane-wave decomposition of the pressure on a sphere by spherical convolution. J. Acoust. Soc. Am. 116(4), 2149–2157 (2004)
Article Google Scholar
Rickard, S., Yilmaz, Z.: On the approximate W-disjoint orthogonality of speech. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), vol. 1, pp. 529–532 (2002)
Google Scholar
Silzle, A., Geyersberger, S., Brohasga, G., Weninger, D., Leistner, M.: Vision and technique behind the new studios and listening rooms of the Fraunhofer IIS audio laboratory. In: Proceedings of the Audio Engineering Society Convention (2009)
Google Scholar
Souden, M., Chen, J., Benesty, J., Affes, S.: Gaussian model-based multichannel speech presence probability. IEEE Trans. Audio, Speech, Lang. Process. 18(5), 1072–1077 (2010)
Google Scholar
Souden, M., Chen, J., Benesty, J., Affes, S.: An integrated solution for online multichannel noise tracking and reduction. IEEE Trans. Audio, Speech, Lang. Process. 19(7), 2159–2169 (2011). doi:10.1109/TASL.2011.2118205
Sra, S.: A short note on parameter approximation for von Mises-Fisher distributions: and a fast implementation of \(I_{s}(x)\). Comput. Stat. 27(1), 177–190 (2012). doi:10.1007/s00180-011-0232-x
Article MathSciNet MATH Google Scholar
Steinberg, J.C.: Effects of distortion upon the recognition of speech sounds. J. Acoust. Soc. Am. 1, 35–35 (1929)
Article Google Scholar
Taseska, M., Habets, E.A.P.: MMSE-based blind source extraction in diffuse noise fields using a complex coherence-based a priori SAP estimator. In: Proceedings of the International Workshop on Acoustic Signal Enhancement (IWAENC) (2012)
Google Scholar
Teutsch, H.: Wavefield decomposition using microphone arrays and its application to acoustic scene analysis. Ph.D. thesis, Friedrich-Alexander Universität Erlangen-Nürnberg (2005)
Google Scholar
Viola, P., Jones, M.J.: Rapid object detection using a boosted cascade of simple features. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 511–518 (2001). doi:10.1109/CVPR.2001.990517
Wu, P.K.T., Epain, N., Jin, C.: A dereverberation algorithm for spherical microphone arrays using compressed sensing techniques. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4053–4056 (2012)
Google Scholar
Yan, S., Sun, H., Svensson, U.P., Ma, X., Hovem, J.M.: Optimal modal beamforming for spherical microphone arrays. IEEE Trans. Audio, Speech, Lang. Process. 19(2), 361–371 (2011). doi:10.1109/TASL.2010.2047815

Download references

Author information

Authors and Affiliations

Kilburn & Strode LLP, London, UK
Daniel P. Jarrett
International Audio Laboratories Erlangen, Erlangen, Germany
Emanuël A. P. Habets
Department of Electrical and Electronic Engineering, Imperial College London, London, UK
Patrick A. Naylor

Authors

Daniel P. Jarrett
View author publications
You can also search for this author in PubMed Google Scholar
Emanuël A. P. Habets
View author publications
You can also search for this author in PubMed Google Scholar
Patrick A. Naylor
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Daniel P. Jarrett .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Jarrett, D.P., Habets, E.A.P., Naylor, P.A. (2017). Informed Array Processing. In: Theory and Applications of Spherical Microphone Array Processing. Springer Topics in Signal Processing, vol 9. Springer, Cham. https://doi.org/10.1007/978-3-319-42211-4_9

Download citation

DOI: https://doi.org/10.1007/978-3-319-42211-4_9
Published: 27 August 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42209-1
Online ISBN: 978-3-319-42211-4
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics