Abstract
Psychoacoustics is the science of sound perception, i.e., it investigates the statistical relationships between acoustic stimuli and hearing sensations [51]. This study aims to build a psychoacoustic model, a quantitative model that closely matches the hearing mechanism. A good understanding of the sensory response of the human auditory system (HAS) is essential to developing psychoacoustic models for audio watermarking, where the perceptual quality of the processed audio must be preserved to the greatest extent possible.
Notes
- 1.
In this sense, the auditory canal, closed by the eardrum at its proximal end, is configured as a resonator.
- 2.
Acoustic impedance is a constant related to the propagation of sound waves in an acoustic medium. Technically, sound waves encounter much less resistance when travelling in air than in a liquid.
- 3.
- 4.
The hair cells, comprising the outer and inner hair cells (OHCs and IHCs), are the auditory receptors on the organ of Corti.
- 5.
One fact is worth noting: any location on the BM will respond to a wide range of tones that are lower than its CF. That is why low frequencies are less selective than high frequencies.
- 6.
Dividing the 32 mm length of the basilar membrane by 24 critical bands gives about 1.3 mm per band.
- 7.
Here, narrowband means a bandwidth equal to or smaller than one critical band.
- 8.
Hereafter, this rule applies to all the graphs in Sect. 2.3.
- 9.
For illustration, all the curves are shifted upward to the masker’s SPL (60 dB).
- 10.
ISO: International Organization for Standardization; IEC: International Electrotechnical Commission; MPEG: Moving Picture Experts Group.
- 11.
The frequency edges are calculated based on the sampling frequency \(F_{s}\).
- 12.
Critical band boundaries vary with the Layer and sampling frequency. ISO/IEC IS 11172-3 [77] has tabulated such parameters in Table D.2a–f. In our case, Table D.2b for Layer I at a sampling frequency of 44.1 kHz is adopted.
- 13.
The geometric mean of a data set \(\left[a_{1},a_{2},\ldots,a_{M}\right]\) is defined as \(\left(\prod_{m=1}^{M}a_{m}\right)^{1/M}\). It is sometimes called the log-average, i.e., \(\left(\prod_{m=1}^{M}a_{m}\right)^{1/M} = 10^{\frac{1}{M}\sum_{m=1}^{M}\log_{10}\left(a_{m}\right)}\).
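The identity in the last note can be checked numerically. The following is a minimal sketch, not code from the chapter: the function names `geometric_mean` and `log_average` and the sample values are illustrative assumptions, and `math.prod` requires Python 3.8 or later.

```python
import math


def _check_positive(values):
    # Both forms are only defined here for a non-empty set of strictly
    # positive values, because log10 is undefined for a_m <= 0.
    if not values or any(v <= 0 for v in values):
        raise ValueError("values must be non-empty and strictly positive")


def geometric_mean(values):
    """Direct definition: (prod_{m=1}^{M} a_m) ** (1/M)."""
    _check_positive(values)
    return math.prod(values) ** (1.0 / len(values))


def log_average(values):
    """Log-average form: 10 ** ((1/M) * sum_{m=1}^{M} log10(a_m))."""
    _check_positive(values)
    mean_log = sum(math.log10(v) for v in values) / len(values)
    return 10.0 ** mean_log


# Hypothetical sample values, chosen only for illustration.
sample = [1.0, 10.0, 100.0]
print(geometric_mean(sample), log_average(sample))
```

Both calls should agree to within floating-point rounding (about 10 for this sample). For long data sets the log-average form is usually the safer choice, since the running product in the direct definition can overflow or underflow.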
References
X. He, Watermarking in Audio: Key Techniques and Technologies (Cambria Press, Youngstown, 2008)
N. Cvejic, T. Seppanen, Robust audio watermarking in wavelet domain using frequency hopping and patchwork method, in Proceedings of the 3rd International Symposium on Image and Signal Processing and Analysis, 2003, pp. 251–255
E. Zwicker, H. Fastl, Psychoacoustics: Facts and Models (Springer, Berlin, 1990)
M. Arnold, M. Schmucker, S.D. Wolthusen, Techniques and Applications of Digital Watermarking and Content Protection (Artech House, Boston, 2003)
M. Bosi, R.E. Goldberg, Introduction to Digital Audio Coding and Standards (Kluwer Academic, Boston, 2003)
W.J. Vincoli (ed.), Lewis’ Dictionary of Occupational and Environmental Safety and Health (Lewis Publishers, Boca Raton, 2000)
K. Johnson, Acoustic and Auditory Phonetics (Blackwell Publisher, Malden, 2003)
P.H. Lindsay, D.A. Norman, Human Information Processing: An Introduction to Psychology (Academic, New York, 1977)
T.S. Gunawan, Audio compression and speech enhancement using temporal masking models. Ph.D. dissertation, The University of New South Wales, 2007
M.W. Levine, Levine and Shefner’s Fundamentals of Sensation and Perception (Oxford University Press, Oxford, 2000)
W.A. Yost, D.W. Nielsen, Fundamentals of Hearing: An Introduction (Holt, Rinehart and Winston, New York, 1977)
E.A.G. Shaw, Earcanal pressure generated by a free sound field. J. Acoust. Soc. Am. 39(3), 465–470 (1966)
B.C.J. Moore, An Introduction to the Psychology of Hearing (Academic, New York, 2003)
[Online]. Available: http://www.chicagoear.com/images/earworks.gif
I.J. Hirsh, The Measurement of Hearing (McGraw-Hill, New York, 1952)
H. Fletcher, W.A. Munson, Loudness, its definition, measurement and calculation. J. Acoust. Soc. Am. 5(2), 82–108 (1933)
X.M. Quan, H.B. Zhang, Statistical audio watermarking algorithm based on perceptual analysis, in Proceedings of the 5th ACM Workshop on Digital Rights Management, 2005, pp. 112–118
E. Ambikairajah, A.G. Davis, W.T.K. Wong, Auditory masking and MPEG-1 audio compression. Electron. Comm. Eng. J. 9, 165–173 (1997)
A. Spanias, T. Painter, V. Atti, Audio Signal Processing and Coding (Wiley-Interscience, Hoboken, 2007)
M.D. Swanson, B. Zhu, A.H. Tewfik, L. Boney, Robust audio watermarking using perceptual masking. Signal Process. 66(3), 337–355 (1998)
R.A. Garcia, Digital watermarking of audio signals using a psychoacoustic auditory model and spread spectrum theory. AES E-Library, 1999
S. Ratanasanya, S. Poomdaeng, S. Tachphetpiboon, T. Amornraksa, New psychoacoustic models for wavelet based audio watermarking, in IEEE International Symposium on Communications and Information Technology (ISCIT), vol. 1, 2005, pp. 602–605
ISO/IEC IS 11172-3, Information Technology - Coding of Moving Picture and Associated Audio for Digital Storage Media Up To About 1.5Mbit/s, Part 3: Audio (BSI, London, 1993)
K.C. Pohlmann, Principles of Digital Audio (McGraw-Hill, New York, 2000)
D. Pan, A tutorial on MPEG/audio compression. IEEE Multimed. 2, 60–74 (1995)
F.A.P. Petitcolas, MPEG for Matlab, v.1.2.8 ed. (2003) [Online], http://www.petitcolas.net/fabien/software/mpeg
C.-Y. Lin, An investigation into perceptual audio coding and the use of auditory gammatone filterbanks. Master’s thesis, The University of Auckland, 2007
SQAM - Sound Quality Assessment Material, European Broadcasting Union (EBU) [Online], http://sound.media.mit.edu/mpeg4/audio/sqam
A. Takahashi, R. Nishimura, Y. Suzuki, Multiple watermarks for stereo audio signals using phase-modulation techniques. IEEE Trans. Signal Process. 53(2), 806–815 (2005)
P. Liew, M. Armand, Inaudible watermarking via phase manipulation of random frequencies. Multimed. Tools Appl. 35(3), 357–377 (2007)
Copyright information
© 2015 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Lin, Y., Abdulla, W.H. (2015). Principles of Psychoacoustics. In: Audio Watermark. Springer, Cham. https://doi.org/10.1007/978-3-319-07974-5_2
DOI: https://doi.org/10.1007/978-3-319-07974-5_2
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-07973-8
Online ISBN: 978-3-319-07974-5
eBook Packages: Engineering, Engineering (R0)