Abstract
In parametric time-frequency-domain spatial audio techniques, the sound field is encoded as a combination of a few audio channels and metadata. The metadata parametrizes those spatial properties of the sound field that are known to be perceivable by humans. This chapter reviews the best-known techniques and describes the spatial artifacts specific to them, such as dynamically or statically biased directions, spatially too narrow auditory images, and the effects of off-sweet-spot listening. Such cases are analyzed with a binaural auditory model, and the model output is shown to visualize the artifacts clearly.
Acknowledgments
The authors would like to thank S. Verhulst from Boston University for providing the cochlea model and assisting in its use, C. Faller from Illusonic GmbH for providing the samples processed with the Faller method, V. Sivonen from Cochlear Nordic for providing the head-related transfer functions, and J. Ahonen, M.-V. Laitinen, and T. Pihlajamäki from Aalto University for providing the DirAC-processed samples for the tests. Further, they are indebted to two anonymous reviewers for constructive comments. This work has been supported by the Academy of Finland and by the European Research Council under the European Community’s Seventh Framework Programme (FP7/2007-2013)/ERC Grant agreement No. 240453.
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Takanen, M., Santala, O., Pulkki, V. (2013). Binaural Assessment of Parametrically Coded Spatial Audio Signals. In: Blauert, J. (eds) The Technology of Binaural Listening. Modern Acoustics and Signal Processing. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37762-4_13
DOI: https://doi.org/10.1007/978-3-642-37762-4_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37761-7
Online ISBN: 978-3-642-37762-4
eBook Packages: Engineering, Engineering (R0)