Abstract
This paper introduces a method adjustable audio watermarking approach with high auditory quality by exploiting the Spread Spectrum (SS) and the psychoacoustic modeling-based Discrete Wavelet Packet Transform (DWPT). The psychoacoustic model is used to shape the amplitude of the watermark signal during the embedding phase, ensuring a high level of imperceptibility. The approach based DWPT carefully mimics the multi- resolution properties of the human ear and incorporates simultaneous and temporal auditory masking. This paper aims to increase data payload and security. The watermark bits positions in the watermarking domain are determined by the logistic chaotic map in a random manner ensuring a high security level. The watermark used is an audio signal in parametric representation using Linear Predictive Coding (LPC) to accomplish a higher data payload. The LPC technique predicts a small number of coefficients, which represent different speech parameters, that are then applied in digital filters to create a synthetic version of the original speech signal. Various host audio signals are examined under various watermarking threats to assess the performance of the watermarking system. Experimental results expose that the introduced scheme provides a good measure of imperceptibility and robustness which are evaluated by the Objective Difference Grade (ODG) and the Bit Error Rate (BER%) respectively. The ODG‘s values were between -0.74 and -2.75, while the BER was in the range of 0 to 7.54%. The comparison introduced between the standard psychoacoustic model-1 (Standard PM) and the enhanced psychoacoustic model-1 based DWPT (Enhanced PM) for audio watermarking shows that Enhanced PM enhances the status of insufficient time-frequency resolution. A comparison of the proposed method to other state-of-the-art audio watermarking algorithms shows that it is a feasible alternative.
Similar content being viewed by others
References
Zailan MKN et al (2023) Comparative analysis of LPC and MFCC for male speaker recognition in text-independent context. ESTEEM Acad J 19:101–112
Salah E, Amine K, Redouane K, Fares K (2021) A Fourier transform based audio watermarking algorithm. Appl Acoustics 172:107652
Liang X, Xiang S (2020) Robust reversible audio watermarking based on high-order difference statistics. Signal Process 173:107584
Liu X, et al. (2020) Weighted-sampling audio adversarial example attack. Proceedings of the AAAI Conference on Artificial Intelligence. Vol. 34. No. 04
Wang X-Y, Niu P-P, Yang H-Y (2009) A robust digital audio watermarking based on statistics characteristics. Pattern Recog 42(11):3057–3064
Pourhashemi SM, Mosleh M, Erfani Y (2021) A novel audio watermarking scheme using ensemble-based watermark detector and discrete wavelet transform. Neural Comput Appl 33(11):6161–6181
Panchal UH, Srivastava R (2015) A comprehensive survey on digital image watermarking techniques." 2015 Fifth International Conference on Communication Systems and Network Technologies. IEEE
Chen A, Dai X (2010) Internal combustion engine vibration analysis with short-term Fourier-transform." 2010 3rd International Congress on Image and Signal Processing. IEEE 9
Hu H-T, Hsu L-Y, Chou H-H (2014) Variable-dimensional vector modulation for perceptual-based DWT blind audio watermarking with adjustable payload capacity. Digital Signal Process 31:115–123
Lim TY, et al. (2018) Time-frequency networks for audio super-resolution. 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE
Spanias A, Thiagarajan J (2022) Analysis of the MPEG-1 Layer III (MP3) Algorithm using MATLAB. Springer Nature
Lagerstrom K (2001) Design and implementation of an MPEG-1 layer III audio decoder." Chalmers University of Technology, Department of Computer Engineering Gothenburg, Sweden
Wang C et al (2019) Ternary radial harmonic Fourier moments based robust stereo image zero-watermarking algorithm. Inform Sci 470:109–120
Wang C et al (2017) Geometric correction based color image watermarking using fuzzy least squares support vector machine and Bessel K form distribution. Signal Process 134:197–208
Wang C et al (2016) Geometrically resilient color image zero-watermarking algorithm based on quaternion exponent moments. J Vis Commun Image Represent 41:247–259
Xia Z et al (2019) Efficient copyright protection for three CT images based on quaternion polar harmonic Fourier moments. Signal Process 164:368–379
Xia Z et al (2021) Color image triple zero-watermarking using decimal-order polar harmonic transforms and chaotic system. Signal Process 180:107864
Xia Z et al (2019) Color medical image lossless watermarking using chaotic system and accurate quaternion polar harmonic transforms. Signal Process 157:108–118
Xia Z et al (2021) Novel quaternion polar complex exponential transform and its application in color image zero-watermarking. Digital Signal Process 116:103130
Chun-peng W, Xing-yuan W, Zhi-qiu X (2016) Geometrically invariant image watermarking based on fast radial harmonic Fourier moments. Signal Process: Image Commun 45:10–23
Xia Z et al (2021) Local quaternion polar harmonic Fourier moments-based multiple zero-watermarking scheme for color medical images. Knowledge-Based Syst 216:106568
Wang C et al (2018) Quaternion polar harmonic Fourier moments for color images. Inform Sci 450:141–156
Wang C et al (2019) Image description with polar harmonic Fourier moments. IEEE Trans Circuits Syst Video Technol 30(12):4440–4452
Xia Z et al (2022) A robust zero-watermarking algorithm for lossless copyright protection of medical images. Appl Intell 52(1):607–621
Jabbar ZJ, George LE (2022) A Survey of Transform Coding for High-Speed Audio Compression. J Al-Qadisiyah Comput Sci Math 14(1):55
Wang C, Zhang H, Zhou X (2018) LBP and DWT Based Fragile Watermarking for Image Authentication. J Inform Process Syst 14:3
Kim YG et al (2022) An Efficient Compression Method of Underwater Acoustic Sensor Signals for Underwater Surveillance. Sensors 22(9):3415
Zhao X, Ho ATS (2010) An introduction to robust transform based image watermarking techniques. Intell Multimed Anal Secur Appl: 337-364
Begum M, Ferdush J, Uddin MS (2022) A Hybrid robust watermarking system based on discrete cosine transform, discrete wavelet transform, and singular value decomposition. J King Saud Univ-Comput Inform Sci 34(8):5856–5867
Xiang Y et al (2015) Spread spectrum-based high embedding capacity watermarking method for audio signals. IEEE/ACM Trans audio, Speech, Lang Process 23(12):2228–2237
Xue Y, et al. (2019) Improved high capacity spread spectrum-based audio watermarking by Hadamard matrices. Digital Forensics and Watermarking: 17th International Workshop, IWDW 2018, Jeju Island, Korea, October 22-24, 2018, Proceedings 17. Springer International Publishing
Kumaraswamy E, et al. (2020) Digital Watermarking: State of The Art and Research Challenges in Health Care & Multimedia Applications. IOP Conference Series: Materials Science and Engineering. Vol. 981. No. 3. IOP Publishing
You J, et al. (2022) Estimating the Secret Key of Spread Spectrum Watermarking Based on Equivalent Keys. IEEE Trans Multimed
Hemis M et al (2018) Adjustable audio watermarking algorithm based on DWPT and psychoacoustic modeling. Multimed Tools Appl 77:11693–11725
He X, Scordilis MS (2006) An enhanced psychoacoustic model based on the discrete wavelet packet transform. J Franklin Institute 343(7):738–755
Xiang Y et al (2015) Spread spectrum-based high embedding capacity watermarking method for audio signals. IEEE/ACM Trans Audio Speech Lang Process 23(12):2228–2237
Korany NO, Elboghdadly NM, Elabdein MZ (2021) Audio Watermarking Technique Integrating Spread Spectrum and CNN-autoencoder. Audio Engineering Society Convention 151. Audio Engineering Society
Kundur D (1999) Improved digital watermarking through diversity and attack characterization. Proc ACM Workshop Multimed Secur 99
Kundur D, Hatzinakos D (2001) Diversity and attack characterization for improved robust watermarking. IEEE Trans Signal Process 49(10):2383–2396
Cox IJ et al (1997) Secure spread spectrum watermarking for multimedia. IEEE Trans Image Process 6(12):1673–1687
Salau AO, et al. (2019) Audio compression using a modified discrete cosine transform with temporal auditory masking. 2019 International Conference on Signal Processing and Communication (ICSC). IEEE
Sharma N, Anand A, Singh AK (2021) Bio-signal data sharing security through watermarking: a technical survey. Computing:1-35
Isoyama T, Kidani S, Unoki M (2022) Blind Speech Watermarking Method with Frame Self-Synchronization Based on Spread-Spectrum Using Linear Prediction Residue. Entropy 24(5):677
Xiong X-j et al (2018) Improved wavelet decomposition method and its application. Prog Geophys 33(4):1617–1621
Herre J, Dick S (2019) Psychoacoustic models for perceptual audio coding—A tutorial review. Appl Sci 9(14):2854
El-Khamy SE, Korany NO, El-Sherif MH (2018) Chaos-based image hiding scheme between silent intervals of high quality audio signals using feature extraction and image bits spreading. 2018 35th National Radio Science Conference (NRSC). IEEE
Salau AO, and Jain S (2019) Feature extraction: a survey of the types, techniques, applications." 2019 international conference on signal processing and communication (ICSC). IEEE
Zailan MKN et al (2023) Comparative analysis of LPC and MFCC for male speaker recognition in text-independent context. ESTEEM Acad J 19:101–112
Astuti Y, Hidayat R, Bejo A (2020) Comparison of Feature Extraction for Speaker Identification System." 2020 3rd International Seminar on Research of Information Technology and Intelligent Systems (ISRITI). IEEE
Ali YM et al (2022) Speech-based gender recognition using linear prediction and mel-frequency cepstral coefficients. Indonesian J Electric Eng Comput Sci 28(2):753–761
Hamid OK (2017) Speech sound coding using linear predictive coding (LPC). signal 1: 5
Birkholz P et al (2019) How the peak glottal area affects linear predictive coding-based formant estimates of vowels. J Acoustic Soc Am 146(1):223–232
Ghido F, Tabus I (2012) Sparse modeling for lossless audio compression. IEEE Trans Audio Speech Lang Process 21(1):14–28
Malik H, Ansari R, Khokhar A (2008) Robust audio watermarking using frequency-selective spread spectrum. IET Information Security 2(4):129–150
Bellaaj M, Ouni K (2020) Audio watermarking technique in frequency domain: comparative study MDCT Vs DCT. Multimed Tools Appl 79(37-38):27161–27184
Attari AA, Shirazi AAB (2018) Robust and Transparent Spread Spectrum Audio Watermarking. Int J Electron Commun Eng 12(2):145–149
Ahmad M et al (2017) A simple secure hash function scheme using multiple chaotic maps. 3DResearch 8:1–15
Mohamed AG, Korany NO, El-Khamy SE (2021) New DNA coded fuzzy based (DNAFZ) S-boxes: Application to robust image encryption using hyper chaotic maps. IEEE Access 9:14284–14305
Chen Z et al (2016) Performance analysis and improvement of logistic chaotic mapping. 电子与信息学报 38(6):1547–1551
Thiede T et al (2000) PEAQ-The ITU standard for objective measurement of perceived audio quality. J Audio Eng Soc 48(1/2):3–29
Breed G (2003) Bit error rate: Fundamental concepts and measurement issues. High Frequency Electron 2(1):46–47
Krimi S, Auni K, Ellouze N (2007) An Improved Psychoacoustic Model for Audio Coding Based on Wavelet Packet. IEEE Int Conf SETIT 2007
Zeyad T, Hanoon A (2005) Speech signal compression using wavelet and linear predictive coding. Al-Khwarizmi Eng J 1(1):52–60
Lin Y, et al. (2015) Principles of psychoacoustics." Audio Watermark: A Comprehensive Foundation Using MATLAB: 15-49
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interests
There is no financial, or non-financial assistance provided by a third party for the reported work.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Korany, N.O., Elboghdadly, N.M. & Elabdein, M.Z. High capacity, secure audio watermarking technique integrating spread spectrum and linear predictive coding. Multimed Tools Appl 83, 50645–50668 (2024). https://doi.org/10.1007/s11042-023-17630-w
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-17630-w