Abstract
The directional perception of human ear for the sound at horizontal plane mainly depends on binaural cues, Interaural Level Difference (ILD), Interaural Time Difference (ITD) and Interaural Correlation (IC). And ILD plays a leading role for human to locate the position of sound with frequency above 1.5 KHz. In spatial audio applications, Inter-Channel Level Difference (ICLD) between loudspeaker signals are used to represent the location information of phantom sources generated by two loudspeakers. For headphone application, ILD and ICLD are approximate, so the perceptual characteristics of ILD can be used as a replacement for that of ICLD. But due to the attenuation influence of the transfer procedure from loudspeakers to humans ears, ICLD between loudspeakers signals are no longer the same with ILD between signals arrive at two ears. And these differences are always ignored in current spatial audio applications such as the perceptual coding of spatial parameters. So in this paper we focus on the analysis and comparison of ICLD and ILD from their formation and their values with different loudspeaker configurations. Experimental results showed that the difference of ILD and ICLD could be up to 55 dB, and the research of this paper may be an important part or reference for further research about spatial audio applications such as coding, reconstruction, etc.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Boring, E.G.: Sensation and Perception in the History of Experimental Psychology. D. Appleton Century Company, New York (1942)
Blauert, J.: Spatial Hearing: The Psychophysics of Human Sound Localization. MIT Press, Cambridge (1997)
Baumgarte, F., Faller, C.: Binaural cue coding-Part I: psychoacoustic fundamentals and design principles. IEEE Trans. Speech Audio Process. 11(LCAV–ARTICLE–2005–032), 509–519 (2003)
Pulkki, V., Karjalainen, M.: Localization of amplitude-panned virtual sources I: stereophonic panning. J. Audio Eng. Soc. 49(9), 739–752 (2001)
ISO/IEC JTC1/SC29/WG11 (MPEG) Document N7947, Text of ISO/IEC 23003–1:2006/FCD, MPEG Surround, Bangkok (2006)
Jiang, W., Wang, J., Zhao, Y., et al.: Multi-channel audio compression method based on ITU-T G. 719 Codec. In: 2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, pp. 293–297. IEEE (2013)
Breebaart, J.: Comparison of interaural intensity differences evoked by real and phantom sources. J. Audio Eng. Soc. 61(11), 850–859 (2013)
Gao, L., Hu, R., Yang, Y., Wang, X., Tu, W., Wu, T.: Azimuthal perceptual resolution model based adaptive 3D spatial parameter coding. In: He, X., Luo, S., Tao, D., Xu, C., Yang, J., Hasan, M.A. (eds.) MMM 2015, Part I. LNCS, vol. 8935, pp. 534–545. Springer, Heidelberg (2015)
Gao, L., Hu, R., Yang, Y.: A spatial priority based scalable audio coding. In: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 3670–3674. IEEE (2014)
Pierce, A.D.: Acoustics: An Introduction to Its Physical Principles and Applications. McGrawHill, New York (1981)
Fahy, F.J.: Measurement of acoustic intensity using the cross spectral density of two microphone signals. J. Acoust. Soc. Am. 62(4), 1057–1059 (1977)
Everest, F.A., Pohlmann, K.C.: The Master Handbook of Acoustics. McGraw-Hill, New York (2001)
Algazi, V.R., Duda, R.O., Thompson, D.M., Avendano, C.: The CIPIC HRTF database. In: Proceedings of the 2001 IEEE Workshop on Applications of Signal Processing to Audio and Electroacoustics, pp. 99–102. Mohonk Mountain House, New Paltz, 21–24 October 2001
Acknowledgments
The research was supported by National Nature Science Foundation of China (No. 61231015); National High Technology Research and Development Program of China (863 Program) No. 2015AA016306; National Nature Science Foundation of China (No. 61201169, 61201340); Science and Technology Plan Projects of Shenzhen (ZDSYS2014050916575763).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Wu, T., Hu, R., Gao, L., Wang, X., Ke, S. (2016). Analysis and Comparison of Inter-Channel Level Difference and Interaural Level Difference. In: Tian, Q., Sebe, N., Qi, GJ., Huet, B., Hong, R., Liu, X. (eds) MultiMedia Modeling. MMM 2016. Lecture Notes in Computer Science(), vol 9516. Springer, Cham. https://doi.org/10.1007/978-3-319-27671-7_49
Download citation
DOI: https://doi.org/10.1007/978-3-319-27671-7_49
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-27670-0
Online ISBN: 978-3-319-27671-7
eBook Packages: Computer ScienceComputer Science (R0)