Skip to main content

Introduction

  • Chapter
  • First Online:

Part of the book series: Springer Topics in Signal Processing ((STSP,volume 9))

Abstract

The motivation behind this book lies in the rapidly growing interest in spherical microphone arrays over the last decade. Important applications for these arrays include human-human and human-machine speech communication systems and spatial sound recording. While human-human speech communication systems have a long history, speech also plays an ever-growing part in human-machine communication. This trend has been fuelled by advances in speech recognition technology, as well as the explosion in available computing power, particularly on mobile devices. With the widespread availability of 3D sound cinema systems and virtual reality gear with 3D binaural sound reproduction, the need to capture spatial sound is rapidly growing. Spherical microphone arrays are particularly suitable for capturing all three dimensions of the sound field, including both ambient sounds and sounds from particular directions. In this chapter, we introduce the topic of acoustic signal processing using microphone arrays, and then explore spherical microphone arrays in more detail. We provide an outline of the structure of the book, and discuss the relationships between each of the subsequent chapters.

Portions of this chapter were first published in [25], and are reproduced here with the author’s permission.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   109.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   139.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   139.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Abhayapala, T.D., Ward, D.B.: Theory and design of high order sound field microphones using spherical microphone array. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), vol. 2, pp. 1949–1952 (2002). doi:10.1109/ICASSP.2002.1006151

  2. Allen, J.B., Berkley, D.A.: Image method for efficiently simulating small-room acoustics. J. Acoust. Soc. Am. 65(4), 943–950 (1979)

    Article  Google Scholar 

  3. Assmann, P., Summerfield, Q.: The perception of speech under adverse conditions. In: Greenberg, S., Ainsworth, W.A., Popper, A.N., Fay, R.R. (eds.) Speech Processing in the Auditory System, Chap. 5, pp. 231–308. Springer, Berlin, Germany (2004)

    Google Scholar 

  4. Benesty, J., Chen, J., Habets, E.A.P.: Speech Enhancement in the STFT Domain. SpringerBriefs in Electrical and Computer Engineering. Springer, Berlin (2011)

    Google Scholar 

  5. Benesty, J., Chen, J., Huang, Y.: Microphone Array Signal Processing. Springer, Berlin, Germany (2008)

    Google Scholar 

  6. Benesty, J., Chen, J., Huang, Y., Cohen, I.: Noise Reduction in Speech Processing. Springer, Berlin (2009)

    Google Scholar 

  7. Benesty, J., Gänsler, T., Morgan, D.R., Sondhi, M.M., Gay, S.L.: Advances in Network and Acoustic Echo Cancellation. Springer, Berlin (2001)

    Google Scholar 

  8. Benesty, J., Sondhi, M.M., Huang, Y. (eds.): Springer Handbook of Speech Processing. Springer, Berlin (2008)

    Google Scholar 

  9. Berouti, M., Schwartz, R., Makhoul, J.: Enhancement of speech corrupted by acoustic noise. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), vol. 4, pp. 208–211 (1979)

    Google Scholar 

  10. Brandstein, M.S., Ward, D.B. (eds.): Microphone Arrays: Signal Processing Techniques and Applications. Springer, Berlin (2001)

    Google Scholar 

  11. Braun, S., Jarrett, D.P., Fischer, J., Habets, E.A.P.: An informed spatial filter for dereverberation in the spherical harmonic domain. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 669–673. Vancouver, Canada (2013)

    Google Scholar 

  12. Compton, Jr., R.: Adaptive Antennas, 1st edn. Prentice-Hall, Upper Saddle River (1988)

    Google Scholar 

  13. Doclo, S., Gannot, S., Moonen, M., Spriet, A.: Acoustic beamforming for hearing aid applications. In: Haykin, S., Liu, K.R. (eds.) Handbook on Array Processing and Sensor Networks, chap. 9. Wiley, New York (2008)

    Google Scholar 

  14. Eaton, J., Gaubitch, N.D., Naylor, P.A.: Noise-robust reverberation time estimation using spectral decay distributions with reduced computational cost. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Vancouver, Canada (2013)

    Google Scholar 

  15. Elko, G.W.: Future directions for microphone arrays. In: Brandstein and Ward [10], chap. 17, pp. 383–387

    Google Scholar 

  16. Elko, G.W., Meyer, J.: Spherical microphone arrays for 3D sound recordings. In: Huang, Y., Benesty, J. (eds.) Audio Signal Processing for Next-Generation Multimedia Communication Systems, chap. 3, pp. 67–89 (2004)

    Google Scholar 

  17. Elko, G.W., Meyer, J.: Microphone arrays. In: Benesty et al. [8], chap. 50

    Google Scholar 

  18. Gaubitch, N.D.: Blind identification of acoustic systems and enhancement of reverberant speech. Ph.D. thesis, Imperial College London (2006)

    Google Scholar 

  19. Gover, B.N., Ryan, J.G., Stinson, M.R.: Microphone array measurement system for analysis of directional and spatial variations of sound fields. J. Acoust. Soc. Am. 112(5), 1980–1991 (2002). doi:10.1121/1.1508782

    Article  Google Scholar 

  20. Gustafsson, T., Rao, B., Trivedi, M.: Source localization in reverberant environments: modeling and statistical analysis. IEEE Trans. Speech Audio Process. 11(6), 791–803 (2003)

    Article  Google Scholar 

  21. Habets, E.A.P.: Single- and multi-microphone speech dereverberation using spectral enhancement. Ph.D. thesis, Technische Universiteit Eindhoven (2007). http://alexandria.tue.nl/extra2/200710970.pdf

  22. Habets, E.A.P., Benesty, J.: A perspective on frequency-domain beamformers in room acoustics. IEEE Trans. Audio, Speech, Lang. Process. 20(3), 947–960 (2012)

    Google Scholar 

  23. Habets, E.A.P., Cohen, I., Gannot, S.: Generating nonstationary multisensor signals under a spatial coherence constraint. J. Acoust. Soc. Am. 124(5), 2911–2917 (2008). doi:10.1121/1.2987429

    Article  Google Scholar 

  24. Huang, Y., Benesty, J., Chen, J.: Dereverberation. In: Benesty et al. [8], chap. 5

    Google Scholar 

  25. Jarrett, D.P.: Spherical microphone array processing for acoustic parameter estimation and signal enhancement. Ph.D. thesis, Imperial College London (2013)

    Google Scholar 

  26. Jarrett, D.P., Habets, E.A.P., Benesty, J., Naylor, P.A.: A tradeoff beamformer for noise reduction in the spherical harmonic domain. In: Proceedings of the International Workshop on Acoust. Signal Enhancement (IWAENC). Aachen, Germany (2012)

    Google Scholar 

  27. Jarrett, D.P., Habets, E.A.P., Naylor, P.A.: 3D source localization in the spherical harmonic domain using a pseudointensity vector. In: Proceedings of the European Signal Processing Conference (EUSIPCO), pp. 442–446. Aalborg, Denmark (2010)

    Google Scholar 

  28. Jarrett, D.P., Habets, E.A.P., Naylor, P.A.: Spherical harmonic domain noise reduction using an MVDR beamformer and DOA-based second-order statistics estimation. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 654–658. Vancouver, Canada (2013)

    Google Scholar 

  29. Jarrett, D.P., Habets, E.A.P., Thomas, M.R.P., Gaubitch, N.D., Naylor, P.A.: Dereverberation performance of rigid and open spherical microphone arrays: Theory & simulation. In: Proceedings of the Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA), pp. 145–150. Edinburgh, UK (2011)

    Google Scholar 

  30. Jarrett, D.P., Habets, E.A.P., Thomas, M.R.P., Naylor, P.A.: Simulating room impulse responses for spherical microphone arrays. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 129–132. Prague, Czech Republic (2011)

    Google Scholar 

  31. Jarrett, D.P., Thiergart, O., Habets, E.A.P., Naylor, P.A.: Coherence-based diffuseness estimation in the spherical harmonic domain. In: Proceedings of the IEEE Convention of Electrical & Electronics Engineers in Israel (IEEEI). Eilat, Israel (2012)

    Google Scholar 

  32. Jeub, M., Nelke, C., Beaugeant, C., Vary, P.: Blind estimation of the coherent-to-diffuse energy ratio from noisy speech signals. In: Proceedings of the European Signal Processing Conf. (EUSIPCO). Barcelona, Spain (2011)

    Google Scholar 

  33. Kellermann, W.: Acoustic echo cancellation for beamforming microphone arrays. In: Brandstein, M.S., Ward, D.B. (eds.) Microphone Arrays: Signal Processing Techniques and Applications, pp. 281–306. Springer, Berlin, Germany (2001)

    Chapter  Google Scholar 

  34. Khaykin, D., Rafaely, B.: Coherent signals direction-of-arrival estimation using a spherical microphone array: Frequency smoothing approach. In: Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 221–224 (2009). doi:10.1109/ASPAA.2009.5346492

  35. Kuttruff, H.: Room Acoustics, 4th edn. Taylor & Francis, London (2000)

    Google Scholar 

  36. Li, Z., Duraiswami, R.: Flexible and optimal design of spherical microphone arrays for beamforming. IEEE Trans. Audio, Speech, Lang. Process. 15(2), 702–714 (2007). doi:10.1109/TASL.2006.876764

  37. Lim, F., Naylor, P.A.: Robust low-complexity multichannel equalization for dereverberation. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Vancouver, Canada (2013)

    Google Scholar 

  38. Lim, F., Thomas, M., Naylor, P.: Mintformer: A spatially aware channel equalizer. In: Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. New Paltz, USA (2013)

    Google Scholar 

  39. Löllmann, H., Vary, P.: Estimation of the frequency dependent reverberation time by means of warped filter-banks. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 309 –312 (2011). doi:10.1109/ICASSP.2011.5946402

  40. de M. Prego, T., de Lima, A.A., Netto, S.L., Lee, B., Said, A., Schafer, R.W., Kalker, T.: A blind algorithm for reverberation-time estimation using subband decomposition of speech signals. J. Acoust. Soc. Am. 131(4), 2811–2816 (2012)

    Google Scholar 

  41. Meyer, J., Elko, G.: A highly scalable spherical microphone array based on an orthonormal decomposition of the soundfield. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), vol. 2, pp. 1781–1784 (2002)

    Google Scholar 

  42. Naylor, P.A., Gaubitch, N.D. (eds.): Speech Dereverberation. Springer, Berlin (2010)

    Google Scholar 

  43. Pulkki, V.: Spatial sound reproduction with directional audio coding. J. Audio Eng. Soc. 55(6), 503–516 (2007)

    Google Scholar 

  44. Rafaely, B.: Analysis and design of spherical microphone arrays. IEEE Trans. Speech Audio Process. 13(1), 135–143 (2005). doi:10.1109/TSA.2004.839244

    Article  Google Scholar 

  45. Rafaely, B., Peled, Y., Agmon, M., Khaykin, D., Fisher, E.: Spherical microphone array beamforming. In: I. Cohen, J. Benesty, S. Gannot (eds.) Speech Processing in Modern Communication: Challenges and Perspectives, chap. 11. Springer (2010)

    Google Scholar 

  46. Ratnam, R., Jones, D.L., Wheeler, B.C., O’Brien Jr., W.D., Lansing, C.R., Feng, A.S.: Blind estimation of reverberation time. J. Acoust. Soc. Am. 114(5), 2877–2892 (2003)

    Article  Google Scholar 

  47. Sondhi, M.: Adaptive echo cancelation for voice signals. In: Benesty et al. [8], chap. 45. Part H

    Google Scholar 

  48. Sun, H., Yan, S., Svensson, U.P.: Robust minimum sidelobe beamforming for spherical microphone arrays. IEEE Trans. Audio, Speech, Lang. Process. 19(4), 1045–1051 (2011). doi:10.1109/TASL.2010.2076393

  49. Talmon, R., Habets, E.A.P.: Blind reverberation time estimation by intrinsic modeling of reverberant speech. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Vancouver, Canada (2013)

    Google Scholar 

  50. Teutsch, H.: Wavefield decomposition using microphone arrays and its application to acoustic scene analysis. Ph.D. thesis, Friedrich-Alexander Universität Erlangen-Nürnberg (2005)

    Google Scholar 

  51. Teutsch, H., Kellermann, W.: EB-ESPRIT: 2D localization of multiple wideband acoustic sources using eigen-beams. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), vol. 3, pp. iii/89–iii/92 (2005). doi:10.1109/ICASSP.2005.1415653

  52. Teutsch, H., Kellermann, W.: Eigen-beam processing for direction-of-arrival estimation using spherical apertures. In: Proceedings of the Joint Workshop on Hands-Free Speech Communication and Microphone Arrays. Piscataway, New Jersey, USA (2005)

    Google Scholar 

  53. Teutsch, H., Kellermann, W.: Detection and localization of multiple wideband acoustic sources based on wavefield decomposition using spherical apertures. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 5276–5279 (2008). doi:10.1109/ICASSP.2008.4518850

  54. Thiergart, O., Del Galdo, G., Habets, E.A.P.: On the spatial coherence in mixed sound fields and its application to signal-to-diffuse ratio estimation. J. Acoust. Soc. Am. 132(4), 2337–2346 (2012)

    Article  Google Scholar 

  55. Thiergart, O., Del Galdo, G., Habets, E.A.P.: Signal-to-reverberant ratio estimation based on the complex spatial coherence between omnidirectional microphones. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 309–312 (2012)

    Google Scholar 

  56. Wang, H., Kaveh, M.: Coherent signal-subspace processing for the detection and estimation of angles of arrival of multiple wide-band sources. IEEE Trans. Acoust., Speech, Signal Process. 33(4), 823–831 (1985)

    Article  Google Scholar 

  57. Wax, M.: Detection and localization of multiple sources via the stochastic signals model. IEEE Trans. Signal Process. 39(11), 2450–2456 (1991)

    Article  MATH  Google Scholar 

  58. Wen, J.Y.C., Habets, E.A.P., Naylor, P.A.: Blind estimation of reverberation time based on the distribution of signal decay rates. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Las Vegas, USA (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Daniel P. Jarrett .

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Jarrett, D.P., Habets, E.A.P., Naylor, P.A. (2017). Introduction. In: Theory and Applications of Spherical Microphone Array Processing. Springer Topics in Signal Processing, vol 9. Springer, Cham. https://doi.org/10.1007/978-3-319-42211-4_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-42211-4_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-42209-1

  • Online ISBN: 978-3-319-42211-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics