Binaural Assessment of Parametrically Coded Spatial Audio Signals

  • Chapter
The Technology of Binaural Listening

Part of the book series: Modern Acoustics and Signal Processing (MASP)

Abstract

In parametric time-frequency-domain spatial audio techniques, the sound field is encoded as a small number of audio channels accompanied by metadata. The metadata parametrizes those spatial properties of the sound field that are known to be perceptible to humans. This chapter reviews the best-known techniques and describes the spatial artifacts specific to them, such as dynamically or statically biased directions, auditory images that are spatially too narrow, and the effects of off-sweet-spot listening. Such cases are analyzed with a binaural auditory model, and the model output is shown to visualize the artifacts clearly.


Acknowledgments

The authors would like to thank S. Verhulst from Boston University for providing the cochlea model and assisting in its use, C. Faller from Illusonic GmbH for providing the samples processed with the Faller method, V. Sivonen from Cochlear Nordic for providing the head-related transfer functions, and J. Ahonen, M.-V. Laitinen, and T. Pihlajamäki from Aalto University for providing the DirAC-processed samples for the tests. Further, they are indebted to two anonymous reviewers for constructive comments. This work has been supported by The Academy of Finland and by the European Research Council under the European Community’s Seventh Framework Programme (FP7/2007-2013)/ERC Grant agreement No. 240453.

Corresponding author

Correspondence to V. Pulkki.

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Takanen, M., Santala, O., Pulkki, V. (2013). Binaural Assessment of Parametrically Coded Spatial Audio Signals. In: Blauert, J. (ed.) The Technology of Binaural Listening. Modern Acoustics and Signal Processing. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37762-4_13

  • DOI: https://doi.org/10.1007/978-3-642-37762-4_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-37761-7

  • Online ISBN: 978-3-642-37762-4

  • eBook Packages: Engineering, Engineering (R0)
