Part of the book series: Springer Topics in Signal Processing (STSP, volume 9)

Abstract

The concept of informed array processing is introduced in this chapter. The conceptual aim of informed array processing is to incorporate relevant spatial information about the problem to be solved into the design of spatial filters and into the estimation of the second-order statistics that are required to implement the beamformers of Chap. 7. Informed array processing techniques are developed for two important signal enhancement problems: noise reduction and dereverberation.

Notes

  1. For brevity, the dependency of all quantities on \(\ell\) is omitted throughout Sects. 9.1.1 and 9.1.2.

  2. For brevity, the dependency of all quantities on the discrete time and frequency indices \(\ell\) and \(\nu\) is omitted where possible in the rest of Sect. 9.1.

  3. When the sphere is a 2-sphere (i.e., an ordinary sphere), as it is here, the von Mises–Fisher distribution is sometimes referred to simply as a Fisher distribution.

  4. A number of audio examples are also available at http://www.ee.ic.ac.uk/sap/sphdoa/.

  5. The dependency on time is omitted for brevity. In practice, the signals acquired using a spherical microphone array are usually processed in the short-time Fourier transform domain, as explained in Sect. 3.1, where the discrete frequency index is denoted by \(\nu\).

  6. If the real SHT is applied instead of the complex SHT, the complex spherical harmonics \(Y_{lm}\) used throughout this chapter should be replaced with the real spherical harmonics \(R_{lm}\), as defined in Sect. 3.3.

  7. It should be noted that this simplified expression is only valid if the filter is applied to mode strength compensated eigenbeams. As a result, it is different to the expression given in Chap. 6.

  8. A number of audio examples can be accessed from https://www.audiolabs-erlangen.de/resources/2013-ICASSP-RR.

Author information

Corresponding author

Correspondence to Daniel P. Jarrett.

Copyright information

© 2017 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Jarrett, D.P., Habets, E.A.P., Naylor, P.A. (2017). Informed Array Processing. In: Theory and Applications of Spherical Microphone Array Processing. Springer Topics in Signal Processing, vol 9. Springer, Cham. https://doi.org/10.1007/978-3-319-42211-4_9

  • DOI: https://doi.org/10.1007/978-3-319-42211-4_9

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-42209-1

  • Online ISBN: 978-3-319-42211-4

  • eBook Packages: Engineering, Engineering (R0)
