Principles of Psychoacoustics

Lin, Yiqing; Abdulla, Waleed H.

doi:10.1007/978-3-319-07974-5_2

Chapter
First Online: 01 January 2014

Abstract

Psychoacoustics is the science of sound perception; it investigates the statistical relationships between acoustic stimuli and hearing sensations [51]. The field aims to build a psychoacoustic model, a quantitative model that closely matches the hearing mechanism. A good understanding of the sensory response of the human auditory system (HAS) is essential to the development of psychoacoustic models for audio watermarking, where the perceptual quality of the processed audio must be preserved to the greatest possible extent.

Notes

1.
In this sense, the auditory canal, closed by the eardrum at its proximal end, is configured as a resonator.
2.
Acoustic impedance is a constant related to the propagation of sound waves in an acoustic medium. Technically, sound waves encounter much less resistance when travelling in air than in fluid.
3.
Note that the cochlea is a cavity within the skull, not a structure by itself [58]. Hence the unraveled cochlea in Fig. 2.3b is impossible in practice and is shown only for the sake of illustration.
4.
The hair cells, comprising the outer and inner hair cells (OHC and IHC), are the auditory receptors on the organ of Corti.
5.
One fact is worth noting: any location on the BM will respond to a wide range of tones that are lower than its CF. That is why low frequencies are less selective than high frequencies.
6.
Dividing the 32 mm length of the basilar membrane by 24 critical bands gives approximately 1.3 mm per band.
7.
Here, narrowband means a bandwidth equal to or smaller than one critical band.
8.
Hereafter, this rule applies to all the graphs in Sect. 2.3.
9.
For illustration, all the curves are shifted upward to the masker’s SPL (60 dB).
10.
ISO: International Organization for Standardization; IEC: International Electrotechnical Commission; MPEG: Moving Picture Experts Group.
11.
The frequency edges are calculated based on the sampling frequency \(F_s\).
12.
Critical band boundaries vary with the Layer and sampling frequency. ISO/IEC IS 11172-3 [77] has tabulated such parameters in Table D.2a–f. In our case, Table D.2b for Layer I at a sampling frequency of 44.1 kHz is adopted.
13.
The geometric mean of a data set \(\left[a_{1},a_{2},\ldots,a_{M}\right]\) is defined as \(\left(\prod_{m=1}^{M}a_{m}\right)^{1/M}\). It is sometimes called the log-average, i.e., \(\left(\prod_{m=1}^{M}a_{m}\right)^{1/M} = 10^{\frac{1}{M}\sum_{m=1}^{M}\log_{10}\left(a_{m}\right)}\).
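
The log-average form of the geometric mean given in the last note can be checked numerically. The following is a minimal sketch, assuming Python 3.8 or later (for `math.prod`); the function names and the sample values are illustrative and are not taken from the chapter.

```python
import math


def geometric_mean(values):
    """Geometric mean from the product definition: (prod a_m)^(1/M)."""
    if not values or any(v <= 0 for v in values):
        raise ValueError("values must be a non-empty sequence of positive numbers")
    return math.prod(values) ** (1.0 / len(values))


def log_average(values):
    """Same quantity via base-10 logarithms: 10^(mean of log10(a_m))."""
    if not values or any(v <= 0 for v in values):
        raise ValueError("values must be a non-empty sequence of positive numbers")
    mean_log = sum(math.log10(v) for v in values) / len(values)
    return 10.0 ** mean_log


# Hypothetical sample data, chosen only so the result is easy to verify by hand.
sample = [1.0, 10.0, 100.0]
gm = geometric_mean(sample)
la = log_average(sample)
print(gm, la, math.isclose(gm, la))
```

The two functions agree up to floating-point rounding. For long data sets the logarithmic form is often preferred, because the running product in the direct definition can overflow or underflow.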

References

X. He, Watermarking in Audio: Key Techniques and Technologies (Cambria Press, Youngstown, 2008)
N. Cvejic, T. Seppanen, Robust audio watermarking in wavelet domain using frequency hopping and patchwork method, in Proceedings of the 3rd International Symposium on Image and Signal Processing and Analysis, 2003, pp. 251–255
E. Zwicker, H. Fastl, Psychoacoustics: Facts and Models (Springer, Berlin, 1990)
M. Arnold, M. Schmucker, S.D. Wolthusen, Techniques and Applications of Digital Watermarking and Content Protection (Artech House, Boston, 2003)
M. Bosi, R.E. Goldberg, Introduction to Digital Audio Coding and Standards (Kluwer Academic, Boston, 2003)
W.J. Vincoli (ed.), Lewis’ Dictionary of Occupational and Environmental Safety and Health (Lewis Publishers, Boca Raton, 2000)
K. Johnson, Acoustic and Auditory Phonetics (Blackwell Publisher, Malden, 2003)
P.H. Lindsay, D.A. Norman, Human Information Processing: An Introduction to Psychology (Academic, New York, 1977)
T.S. Gunawan, Audio compression and speech enhancement using temporal masking models. Ph.D. dissertation, The University of New South Wales, 2007
M.W. Levine, Levine and Shefner’s Fundamentals of Sensation and Perception (Oxford University Press, Oxford, 2000)
W.A. Yost, D.W. Nielsen, Fundamentals of Hearing: An Introduction (Holt, Rinehart and Winston, New York, 1977)
E.A.G. Shaw, Earcanal pressure generated by a free sound field. J. Acoust. Soc. Am. 39(3), 465–470 (1966)
B.C.J. Moore, An Introduction to the Psychology of Hearing (Academic, New York, 2003)
[Online]. Available: http://www.chicagoear.com/images/earworks.gif
I.J. Hirsh, The Measurement of Hearing (McGraw-Hill, New York, 1952)
H. Fletcher, W.A. Munson, Loudness, its definition, measurement and calculation. J. Acoust. Soc. Am. 5(2), 82–108 (1933)
X.M. Quan, H.B. Zhang, Statistical audio watermarking algorithm based on perceptual analysis, in Proceedings of the 5th ACM Workshop on Digital Rights Management, 2005, pp. 112–118
E. Ambikairajah, A.G. Davis, W.T.K. Wong, Auditory masking and MPEG-1 audio compression. Electron. Comm. Eng. J. 9, 165–173 (1997)
A. Spanias, T. Painter, V. Atti, Audio Signal Processing and Coding (Wiley-Interscience, Hoboken, 2007)
M.D. Swanson, B. Zhu, A.H. Tewfik, L. Boney, Robust audio watermarking using perceptual masking. Signal Process. 66(3), 337–355 (1998)
R.A. Garcia, Digital watermarking of audio signals using a psychoacoustic auditory model and spread spectrum theory. AES E-Library, 1999
S. Ratanasanya, S. Poomdaeng, S. Tachphetpiboon, T. Amornraksa, New psychoacoustic models for wavelet based audio watermarking, in IEEE International Symposium on Communications and Information Technology (ISCIT), vol. 1, pp. 602–605, 2005
ISO/IEC IS 11172-3, Information Technology - Coding of Moving Picture and Associated Audio for Digital Storage Media Up To About 1.5Mbit/s, Part 3: Audio (BSI, London, 1993)
K.C. Pohlmann, Principles of Digital Audio (McGraw-Hill, New York, 2000)
D. Pan, A tutorial on MPEG/audio compression. IEEE Multimed. 2, 60–74 (1995)
F.A.P. Petitcolas, MPEG for Matlab, v.1.2.8 ed. (2003) [Online], http://www.petitcolas.net/fabien/software/mpeg
C.-Y. Lin, An investigation into perceptual audio coding and the use of auditory gammatone filterbanks. Master’s thesis, The University of Auckland, 2007
SQAM - Sound Quality Assessment Material, European Broadcasting Union (EBU) [Online], http://sound.media.mit.edu/mpeg4/audio/sqam
A. Takahashi, R. Nishimura, Y. Suzuki, Multiple watermarks for stereo audio signals using phase-modulation techniques. IEEE Trans. Signal Process. 53(2), 806–815 (2005)
P. Liew, M. Armand, Inaudible watermarking via phase manipulation of random frequencies. Multimed. Tools Appl. 35(3), 357–377 (2007)

Author information

Authors and Affiliations

The University of Auckland, Auckland, New Zealand
Yiqing Lin & Waleed H. Abdulla

About this chapter

Cite this chapter

Lin, Y., Abdulla, W.H. (2015). Principles of Psychoacoustics. In: Audio Watermark. Springer, Cham. https://doi.org/10.1007/978-3-319-07974-5_2

DOI: https://doi.org/10.1007/978-3-319-07974-5_2
Published: 08 July 2014
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-07973-8
Online ISBN: 978-3-319-07974-5
eBook Packages: Engineering, Engineering (R0)
