Binaural Assessment of Parametrically Coded Spatial Audio Signals

Takanen, M.; Santala, O.; Pulkki, V.

doi:10.1007/978-3-642-37762-4_13

M. Takanen²,
O. Santala² &
V. Pulkki²

Part of the book series: Modern Acoustics and Signal Processing ((MASP))

4119 Accesses
2 Citations

Abstract

In parametric time-frequency-domain spatial audio techniques, the sound field is encoded as a combination of a few audio channels with metadata. The metadata parametrizes the spatial properties of the sound field that are known to be perceivable to humans. The most well-known techniques are reviewed in this chapter. The spatial artifacts specific to such techniques are described, such as dynamically or statically biased directions, spatially too narrow auditory images, and effects of off-sweet-spot listening. Such cases are analyzed with a binaural auditory model, and it is shown that the artifacts are clearly visualized thereby.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 189.00; Price excludes VAT (USA)

Softcover Book: USD 249.99; Price excludes VAT (USA)

Hardcover Book: USD 249.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Binaural Evaluation of Sound Quality and Quality of Experience

JND-based spatial parameter quantization of multichannel audio signals

Article Open access 21 May 2016

Spatial Audio Rendering

References

J. Ahonen. Microphone configurations for teleconference application of Directional Audio Coding and subjective evaluation. In Proc. 40th Intl. Conf. Audio Eng. Soc., Tokyo, Japan, Oct. 8–10 2010. Paper No. P-5.
Google Scholar
C. Avendano and J.-M. Jot. Frequency domain techniques for stereo to multichannel upmix. In Proc. 22nd Intl. Conf. Audio Eng. Soc., Espoo, Finland, Jun. 15–17 2004. Paper No. 251.
Google Scholar
G. Barry and D. Kearney. Localization quality assessment in source separation-based upmixing algorithms. In Proc. 35th Intl. Conf. Audio Eng. Soc., London, UK, Feb. 11–13 2009. Paper No. 33.
Google Scholar
G. Barry, B. Lawlor, and E. Coyle. Sound source separation: Azimuth discrimination and resynthesis. In Proc. 7th Intl. Conf. Digital Audio Effects, pages 240–244, Naples, Italy, Oct. 5–8 2004.
Google Scholar
S. Berge and N. Barrett. A new method for B-format to binaural transcoding. In Proc. 40th Intl. Conf. Audio Eng. Soc., Tokyo, Japan, Oct. 8–10 2010. Paper No. 6–5.
Google Scholar
S. Berge and N. Barrett. High angular resolution planewave expansion. In Proc. 2nd Intl. Symp. Ambisonics and Spherical Acoustics, Paris, France, May 6–7 2010.
Google Scholar
A. J. Berkhout. A holographic approach to acoustic control. J. Audio Eng. Soc., 36:977–995, 1988.
Google Scholar
J. Blauert. Spatial hearing. The psychophysics of human sound localization. MIT Press, Cambridge, MA, USA, revised edition, 1997.
Google Scholar
A. D. Blumlein. U.K. Patent 394,325, 1931. Reprinted in Stereophonic Techniques, Audio Eng. Soc., 1986.
Google Scholar
M. M. Boone, E. N. G. Verheijen, and P. F. van Tol. Spatial sound-field reproduction by wave-field synthesis. J. Audio Eng. Soc., 43:1003–1012, 1995.
Google Scholar
J. Breebaart, S. Disch, C. Faller, J. Herre, G. Hotho, K. Kjörling, F. Myburg, M. Neusinger, W. Oomen, H. Purnhagen, and J. Rödén. MPEG spatial audio coding / MPEG surround: Overview and current status. In Proc. 119th Intl. Conv. Audio Eng. Soc., New York, NY, USA, Oct. 7–10 2005. Paper No. 6599.
Google Scholar
J. Breebaart and C. Faller. Spatial audio processing: MPEG surround and other applications. John Wiley & Sons, Ltd., Chichester, UK, 2008.
Google Scholar
H. S. Colburn and N. I. Durlach. Models of binaural interaction. In E. Carrette and M. Friedman, editors, Handbook of perception, volume IV, pages 467–518. Academic Press, San Diego, CA, USA, 1978.
Google Scholar
J. Daniel, S. Moreau, and R. Nicol. Further investigations of high-order Ambisonics and Wavefield synthesis for holophonic sound imaging. In Proc. 114th Intl. Conv. Audio Eng. Soc., Amsterdam, The Netherlands, Mar. 22–25 2003. Paper No. 5788.
Google Scholar
D. de Vries. Wave field synthesis. Audio Eng. Soc. monograph, New York, NY, USA, 2009. 93 pages.
Google Scholar
C. Faller. Binaural cue coding-Part I: Psychoacoustic fundamentals and design principles. IEEE Trans. Speech and Audio Processing, 11:509–519, 2003.
Google Scholar
C. Faller. Multiple-loudspeaker playback of stereo signals. J. Audio Eng. Soc., 54:1051–1064, 2006.
Google Scholar
C. Faller. A highly directive 2-capsule based microphone system. In Proc. 123rd Intl. Conv. Audio Eng. Soc., New York, NY, USA, Oct. 5–8 2007.
Google Scholar
C. Faller. Method to generate multi-channel audio signals from stereo signals. EP Patent 1,761,110, Mar. 2007.
Google Scholar
C. Faller. Microphone front-ends for spatial audio coders. In Proc. 125th Intl. Conv. Audio Eng. Soc., San Francisco, CA, USA, Oct. 2–5 2008.
Google Scholar
C. Faller and F. Baumgarte. Efficient representation of spatial audio using perceptual parametrization. In Proc. IEEE Worksh. Appl. of Signal Processing to Audio and Acoustics, pages 199–202, New Paltz, New York, Oct. 21–24 2001.
Google Scholar
C. Faller, A. Favrot, C. Langen, C. Tournery, and H. Wittek. Digitally enhanced shotgun microphone with increased directivity. In Proc. 129th Intl. Conv. Audio Eng. Soc., San Francisco, CA, USA, Nov. 4–7 2010.
Google Scholar
C. Faller and V. Pulkki. Directional Audio Coding: Filterbank and STFT-based design. In Proc. 120th Intl. Conv. Audio Eng. Soc., Paris, France, May 20–23 2006. Paper No. 6658.
Google Scholar
M. A. Gerzon. Periphony: With-height sound reproduction. J. Audio Eng. Soc., 21:2–10, 1973.
Google Scholar
M. M. Goodwin. Enhanced microphone-array beamforming based on frequency-domain spatial analysis-synthesis. In IEEE Worksh. Appl. Signal Processing to Audio and Acoustics, pages 6–9, New Paltz, NY, USA, Oct. 21–24 2007.
Google Scholar
M. M. Goodwin and J.-M. Jot. A frequency-domain framework for spatial audio coding based on universal spatial cues. In Proc. 120th Intl. Conv. Audio Eng. Soc., Paris, France, May 20–23 2006. Paper No. 6751.
Google Scholar
M. M. Goodwin and J.-M. Jot. Spatial audio scene coding. In Proc. 125th Intl. Conv. Audio Eng. Soc., San Francisco, CA, USA, Oct. 2–5 2008. Paper No. 7507.
Google Scholar
Harpex Ltd. Online audio conversion service BETA, 2012. (Accessed: Jan. 22, 2013).
Google Scholar
M. L. Hawley, R. Y. Litovsky, and H. S. Colburn. Speech intelligibility and localization in a multi-source environment. J. Acoust. Soc. Am., 105:3436–3448, 1999.
Google Scholar
J. Herre, C. Falch, D. Mahne, G. del Galdo, M. Kallinger, and O. Thiergart. Interactive teleconferencing combining spatial audio object coding and DirAC technology. In Proc. 128th Intl. Conv. Audio Eng. Soc., London, UK, May 22–25 2010. Paper No. 8098.
Google Scholar
J. Herre, K. Kjörling, J. Breebaart, C. Faller, S. Disch, H. Purnhagen, J. Koppens, J. Hilpert, J. Rödén, W. Oomen, K. Linzmeier, and K. S. Chong. MPEG surround-the ISO/MPEG standard for efficient and compatible multichannel audio coding. J. Audio Eng. Soc., 56:932–955, 2008.
Google Scholar
G. Hotho, S. van de Par, and J. Breebaart. Multichannel coding of applause signals. EURASIP J. Adv. in Signal Process., 2008, 2008. Article No. 10.
Google Scholar
M.-V. Laitinen, F. Kuech, S. Disch, and V. Pulkki. Reproducing applause-type signals with Directional Audio Coding. J. Audio Eng. Soc., 59:29–43, 2011.
Google Scholar
M.-V. Laitinen, T. Pihlajamäki, C. Erkut, and V. Pulkki. Parametric time-frequency representation of spatial sound in virtual worlds. ACM Trans. Appl. Percept., 9:1–20, 2012.
Google Scholar
M.-V. Laitinen and V. Pulkki. Converting 5.1 audio recordings to B-format for Directional Audio Coding reproduction. In Proc. Intl. Conf. Acoustics, Speech and Signal Processing (ICASSP), pages 61–64, Prague, Czech Republic, May 22–27 2011.
Google Scholar
M.-V. Laitinen and V. Pulkki. Utilizing instantaneous direct-to-reverberant ratio in parametric spatial audio coding. In Proc. 133rd Intl. Conv. Audio Eng. Soc., San Francisco, USA, Oct. 26–29 2012. Paper No. 8804.
Google Scholar
A. Politis, T. Pihlajamäki, and V. Pulkki. Parametric spatial audio effects. In Proc. 15th Intl. Conf. Digital Audio Effects, York, UK, Sept. 17–21 2012. Paper No. 22.
Google Scholar
V. Pulkki. Virtual sound source positioning using Vector Base Amplitude Panning. J. Audio Eng. Soc., 45(6):456–466, 1997.
Google Scholar
V. Pulkki. Spatial sound reproduction with Directional Audio Coding. J. Audio Eng. Soc., 55:503–516, 2007.
Google Scholar
V. Pulkki and C. Faller. The directional effect of cross-talk in multi-channel sound reproduction. In Proc. 18th Intl. Congr. Acoust., pages 3167–3170, Kyoto, Japan, Apr. 4–9 2004.
Google Scholar
V. Pulkki and T. Hirvonen. Functional count-comparison model for binaural decoding. Acta Acust./Acustica, 95:883–900, 2009.
Google Scholar
V. Pulkki, J. Merimaa, and T. Lokki. Reproduction of reverberation with Spatial Impulse Response Rendering. In Proc. 116th Intl. Conv. Audio Eng. Soc., Berlin, Germany, May 8–11 2004. Paper No. 6057.
Google Scholar
F. Rumsey. Spatial audio. Music Technology. Focal Press, Oxford, UK, 2nd edition, 2001.
Google Scholar
E. Schuijers, J. Breebaart, H. Purnhagen, and J. Engdegard. Low complexity parametric stereo coding. In Proc. 116th Intl. Conv. Audio Eng. Soc., Berlin, Germany, May 8–11 2004. Paper No. 6073.
Google Scholar
A. Solvang. Spectral impairment of two-dimensional higher order Ambisonics. J. Audio Eng. Soc., 56:267–279, 2008.
Google Scholar
M. Takanen, O. Santala, and V. Pulkki. Visualization of functional count-comparison-based binaural auditory model output. Unpublished manuscript, 2013.
Google Scholar
O. Thiergart and E. A. P. Habets. Robust direction-of-arrival estimation of two simultaneous plane waves from a B-format signal. In IEEE 27th Conv. Electrical and Electronics Engineers, pages 1–5, Eilat, Israel, Nov. 14–17 2012.
Google Scholar
O. Thiergart and E. A. P. Habets. Sound field model violations in parametric spatial sound processing. In Proc. of IWAENC 2012 Intl. Workshop Acoustic Signal Enhancement, pages 1–4, Aachen, Germany, Sept. 4–6 2012.
Google Scholar
O. Thiergart, M. Kratschmer, M. Kallinger, and G. del Galdo. Parameter estimation in Directional Audio Coding using linear microphone arrays. In Proc. 130th Intl. Conv. Audio Eng. Soc., London, UK, May 13–16 2011. Paper No. 8434.
Google Scholar
S. Verhulst, T. Dau, and C. A. Shera. Nonlinear time-domain cochlear model for transient stimulation and human otoacoustic emission. J. Acoust. Soc. Am., 132:3842–3848, 2012.
Google Scholar
J. Vilkamo, T. Lokki, and V. Pulkki. Directional Audio Coding: Virtual microphone-based synthesis and subjective evaluation. J. Audio Eng. Soc., 57:709–724, 2009.
Google Scholar

Download references

Acknowledgments

The authors would like to thank S. Verhulst from the Boston University for providing the cochlea model and assisting in its use, C. Faller from Illusonic GmbH for providing the samples processed with the Faller method, V. Sivonen from Cochlear Nordic for providing the head-related transfer functions, and J. Ahonen, M.-V. Laitinen, and T. Pihlajamäki from Aalto University for providing the DirAC-processed samples for the tests. Further, they are indebted to two anonymous reviewers for constructive comments. This work has been supported by The Academy of Finland and by the European Research Council under the European Community’s Seventh Framework Programme (FP7/2007-2013)/ERC Grant agreement No. 240453.

Author information

Authors and Affiliations

Department of Signal Processing and Acoustics, Aalto University, Espoo, Finland
M. Takanen, O. Santala & V. Pulkki

Authors

M. Takanen
View author publications
You can also search for this author in PubMed Google Scholar
O. Santala
View author publications
You can also search for this author in PubMed Google Scholar
V. Pulkki
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to V. Pulkki .

Editor information

Editors and Affiliations

Fak. Elektrotechnik, LS Allgm.Elektrotechn.+Akustik, Univ. Bochum, Bochum, Germany
Jens Blauert

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Takanen, M., Santala, O., Pulkki, V. (2013). Binaural Assessment of Parametrically Coded Spatial Audio Signals. In: Blauert, J. (eds) The Technology of Binaural Listening. Modern Acoustics and Signal Processing. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37762-4_13

Download citation

DOI: https://doi.org/10.1007/978-3-642-37762-4_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37761-7
Online ISBN: 978-3-642-37762-4
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Binaural Assessment of Parametrically Coded Spatial Audio Signals

Abstract

Access this chapter

Similar content being viewed by others

Binaural Evaluation of Sound Quality and Quality of Experience

JND-based spatial parameter quantization of multichannel audio signals

Spatial Audio Rendering

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

Binaural Assessment of Parametrically Coded Spatial Audio Signals

Abstract

Access this chapter

Similar content being viewed by others

Binaural Evaluation of Sound Quality and Quality of Experience

JND-based spatial parameter quantization of multichannel audio signals

Spatial Audio Rendering

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation