Skip to main content

Advertisement

Log in

High capacity, secure audio watermarking technique integrating spread spectrum and linear predictive coding

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

This paper introduces a method adjustable audio watermarking approach with high auditory quality by exploiting the Spread Spectrum (SS) and the psychoacoustic modeling-based Discrete Wavelet Packet Transform (DWPT). The psychoacoustic model is used to shape the amplitude of the watermark signal during the embedding phase, ensuring a high level of imperceptibility. The approach based DWPT carefully mimics the multi- resolution properties of the human ear and incorporates simultaneous and temporal auditory masking. This paper aims to increase data payload and security. The watermark bits positions in the watermarking domain are determined by the logistic chaotic map in a random manner ensuring a high security level. The watermark used is an audio signal in parametric representation using Linear Predictive Coding (LPC) to accomplish a higher data payload. The LPC technique predicts a small number of coefficients, which represent different speech parameters, that are then applied in digital filters to create a synthetic version of the original speech signal. Various host audio signals are examined under various watermarking threats to assess the performance of the watermarking system. Experimental results expose that the introduced scheme provides a good measure of imperceptibility and robustness which are evaluated by the Objective Difference Grade (ODG) and the Bit Error Rate (BER%) respectively. The ODG‘s values were between -0.74 and -2.75, while the BER was in the range of 0 to 7.54%. The comparison introduced between the standard psychoacoustic model-1 (Standard PM) and the enhanced psychoacoustic model-1 based DWPT (Enhanced PM) for audio watermarking shows that Enhanced PM enhances the status of insufficient time-frequency resolution. A comparison of the proposed method to other state-of-the-art audio watermarking algorithms shows that it is a feasible alternative.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

References

  1. Zailan MKN et al (2023) Comparative analysis of LPC and MFCC for male speaker recognition in text-independent context. ESTEEM Acad J 19:101–112

    Google Scholar 

  2. Salah E, Amine K, Redouane K, Fares K (2021) A Fourier transform based audio watermarking algorithm. Appl Acoustics 172:107652

    Google Scholar 

  3. Liang X, Xiang S (2020) Robust reversible audio watermarking based on high-order difference statistics. Signal Process 173:107584

    Google Scholar 

  4. Liu X, et al. (2020) Weighted-sampling audio adversarial example attack. Proceedings of the AAAI Conference on Artificial Intelligence. Vol. 34. No. 04

  5. Wang X-Y, Niu P-P, Yang H-Y (2009) A robust digital audio watermarking based on statistics characteristics. Pattern Recog 42(11):3057–3064

    Google Scholar 

  6. Pourhashemi SM, Mosleh M, Erfani Y (2021) A novel audio watermarking scheme using ensemble-based watermark detector and discrete wavelet transform. Neural Comput Appl 33(11):6161–6181

    Google Scholar 

  7. Panchal UH, Srivastava R (2015) A comprehensive survey on digital image watermarking techniques." 2015 Fifth International Conference on Communication Systems and Network Technologies. IEEE

  8. Chen A, Dai X (2010) Internal combustion engine vibration analysis with short-term Fourier-transform." 2010 3rd International Congress on Image and Signal Processing. IEEE 9

  9. Hu H-T, Hsu L-Y, Chou H-H (2014) Variable-dimensional vector modulation for perceptual-based DWT blind audio watermarking with adjustable payload capacity. Digital Signal Process 31:115–123

    Google Scholar 

  10. Lim TY, et al. (2018) Time-frequency networks for audio super-resolution. 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE

  11. Spanias A, Thiagarajan J (2022) Analysis of the MPEG-1 Layer III (MP3) Algorithm using MATLAB. Springer Nature

  12. Lagerstrom K (2001) Design and implementation of an MPEG-1 layer III audio decoder." Chalmers University of Technology, Department of Computer Engineering Gothenburg, Sweden

  13. Wang C et al (2019) Ternary radial harmonic Fourier moments based robust stereo image zero-watermarking algorithm. Inform Sci 470:109–120

    Google Scholar 

  14. Wang C et al (2017) Geometric correction based color image watermarking using fuzzy least squares support vector machine and Bessel K form distribution. Signal Process 134:197–208

    Google Scholar 

  15. Wang C et al (2016) Geometrically resilient color image zero-watermarking algorithm based on quaternion exponent moments. J Vis Commun Image Represent 41:247–259

    Google Scholar 

  16. Xia Z et al (2019) Efficient copyright protection for three CT images based on quaternion polar harmonic Fourier moments. Signal Process 164:368–379

    Google Scholar 

  17. Xia Z et al (2021) Color image triple zero-watermarking using decimal-order polar harmonic transforms and chaotic system. Signal Process 180:107864

    Google Scholar 

  18. Xia Z et al (2019) Color medical image lossless watermarking using chaotic system and accurate quaternion polar harmonic transforms. Signal Process 157:108–118

    Google Scholar 

  19. Xia Z et al (2021) Novel quaternion polar complex exponential transform and its application in color image zero-watermarking. Digital Signal Process 116:103130

    Google Scholar 

  20. Chun-peng W, Xing-yuan W, Zhi-qiu X (2016) Geometrically invariant image watermarking based on fast radial harmonic Fourier moments. Signal Process: Image Commun 45:10–23

    Google Scholar 

  21. Xia Z et al (2021) Local quaternion polar harmonic Fourier moments-based multiple zero-watermarking scheme for color medical images. Knowledge-Based Syst 216:106568

    Google Scholar 

  22. Wang C et al (2018) Quaternion polar harmonic Fourier moments for color images. Inform Sci 450:141–156

    MathSciNet  Google Scholar 

  23. Wang C et al (2019) Image description with polar harmonic Fourier moments. IEEE Trans Circuits Syst Video Technol 30(12):4440–4452

    Google Scholar 

  24. Xia Z et al (2022) A robust zero-watermarking algorithm for lossless copyright protection of medical images. Appl Intell 52(1):607–621

    Google Scholar 

  25. Jabbar ZJ, George LE (2022) A Survey of Transform Coding for High-Speed Audio Compression. J Al-Qadisiyah Comput Sci Math 14(1):55

    Google Scholar 

  26. Wang C, Zhang H, Zhou X (2018) LBP and DWT Based Fragile Watermarking for Image Authentication. J Inform Process Syst 14:3

    Google Scholar 

  27. Kim YG et al (2022) An Efficient Compression Method of Underwater Acoustic Sensor Signals for Underwater Surveillance. Sensors 22(9):3415

    Google Scholar 

  28. Zhao X, Ho ATS (2010) An introduction to robust transform based image watermarking techniques. Intell Multimed Anal Secur Appl: 337-364

  29. Begum M, Ferdush J, Uddin MS (2022) A Hybrid robust watermarking system based on discrete cosine transform, discrete wavelet transform, and singular value decomposition. J King Saud Univ-Comput Inform Sci 34(8):5856–5867

    Google Scholar 

  30. Xiang Y et al (2015) Spread spectrum-based high embedding capacity watermarking method for audio signals. IEEE/ACM Trans audio, Speech, Lang Process 23(12):2228–2237

    Google Scholar 

  31. Xue Y, et al. (2019) Improved high capacity spread spectrum-based audio watermarking by Hadamard matrices. Digital Forensics and Watermarking: 17th International Workshop, IWDW 2018, Jeju Island, Korea, October 22-24, 2018, Proceedings 17. Springer International Publishing

  32. Kumaraswamy E, et al. (2020) Digital Watermarking: State of The Art and Research Challenges in Health Care & Multimedia Applications. IOP Conference Series: Materials Science and Engineering. Vol. 981. No. 3. IOP Publishing

  33. You J, et al. (2022) Estimating the Secret Key of Spread Spectrum Watermarking Based on Equivalent Keys. IEEE Trans Multimed

  34. Hemis M et al (2018) Adjustable audio watermarking algorithm based on DWPT and psychoacoustic modeling. Multimed Tools Appl 77:11693–11725

    Google Scholar 

  35. He X, Scordilis MS (2006) An enhanced psychoacoustic model based on the discrete wavelet packet transform. J Franklin Institute 343(7):738–755

    Google Scholar 

  36. Xiang Y et al (2015) Spread spectrum-based high embedding capacity watermarking method for audio signals. IEEE/ACM Trans Audio Speech Lang Process 23(12):2228–2237

    Google Scholar 

  37. Korany NO, Elboghdadly NM, Elabdein MZ (2021) Audio Watermarking Technique Integrating Spread Spectrum and CNN-autoencoder. Audio Engineering Society Convention 151. Audio Engineering Society

  38. Kundur D (1999) Improved digital watermarking through diversity and attack characterization. Proc ACM Workshop Multimed Secur 99

  39. Kundur D, Hatzinakos D (2001) Diversity and attack characterization for improved robust watermarking. IEEE Trans Signal Process 49(10):2383–2396

    Google Scholar 

  40. Cox IJ et al (1997) Secure spread spectrum watermarking for multimedia. IEEE Trans Image Process 6(12):1673–1687

    Google Scholar 

  41. Salau AO, et al. (2019) Audio compression using a modified discrete cosine transform with temporal auditory masking. 2019 International Conference on Signal Processing and Communication (ICSC). IEEE

  42. Sharma N, Anand A, Singh AK (2021) Bio-signal data sharing security through watermarking: a technical survey. Computing:1-35

  43. Isoyama T, Kidani S, Unoki M (2022) Blind Speech Watermarking Method with Frame Self-Synchronization Based on Spread-Spectrum Using Linear Prediction Residue. Entropy 24(5):677

    MathSciNet  Google Scholar 

  44. Xiong X-j et al (2018) Improved wavelet decomposition method and its application. Prog Geophys 33(4):1617–1621

    Google Scholar 

  45. Herre J, Dick S (2019) Psychoacoustic models for perceptual audio coding—A tutorial review. Appl Sci 9(14):2854

    Google Scholar 

  46. El-Khamy SE, Korany NO, El-Sherif MH (2018) Chaos-based image hiding scheme between silent intervals of high quality audio signals using feature extraction and image bits spreading. 2018 35th National Radio Science Conference (NRSC). IEEE

  47. Salau AO, and Jain S (2019) Feature extraction: a survey of the types, techniques, applications." 2019 international conference on signal processing and communication (ICSC). IEEE

  48. Zailan MKN et al (2023) Comparative analysis of LPC and MFCC for male speaker recognition in text-independent context. ESTEEM Acad J 19:101–112

    Google Scholar 

  49. Astuti Y, Hidayat R, Bejo A (2020) Comparison of Feature Extraction for Speaker Identification System." 2020 3rd International Seminar on Research of Information Technology and Intelligent Systems (ISRITI). IEEE

  50. Ali YM et al (2022) Speech-based gender recognition using linear prediction and mel-frequency cepstral coefficients. Indonesian J Electric Eng Comput Sci 28(2):753–761

    MathSciNet  Google Scholar 

  51. Hamid OK (2017) Speech sound coding using linear predictive coding (LPC). signal 1: 5

  52. Birkholz P et al (2019) How the peak glottal area affects linear predictive coding-based formant estimates of vowels. J Acoustic Soc Am 146(1):223–232

    Google Scholar 

  53. Ghido F, Tabus I (2012) Sparse modeling for lossless audio compression. IEEE Trans Audio Speech Lang Process 21(1):14–28

    Google Scholar 

  54. Malik H, Ansari R, Khokhar A (2008) Robust audio watermarking using frequency-selective spread spectrum. IET Information Security 2(4):129–150

    Google Scholar 

  55. Bellaaj M, Ouni K (2020) Audio watermarking technique in frequency domain: comparative study MDCT Vs DCT. Multimed Tools Appl 79(37-38):27161–27184

    Google Scholar 

  56. Attari AA, Shirazi AAB (2018) Robust and Transparent Spread Spectrum Audio Watermarking. Int J Electron Commun Eng 12(2):145–149

    Google Scholar 

  57. Ahmad M et al (2017) A simple secure hash function scheme using multiple chaotic maps. 3DResearch 8:1–15

    Google Scholar 

  58. Mohamed AG, Korany NO, El-Khamy SE (2021) New DNA coded fuzzy based (DNAFZ) S-boxes: Application to robust image encryption using hyper chaotic maps. IEEE Access 9:14284–14305

    Google Scholar 

  59. Chen Z et al (2016) Performance analysis and improvement of logistic chaotic mapping. 电子与信息学报 38(6):1547–1551

    Google Scholar 

  60. Thiede T et al (2000) PEAQ-The ITU standard for objective measurement of perceived audio quality. J Audio Eng Soc 48(1/2):3–29

    Google Scholar 

  61. Breed G (2003) Bit error rate: Fundamental concepts and measurement issues. High Frequency Electron 2(1):46–47

    Google Scholar 

  62. Krimi S, Auni K, Ellouze N (2007) An Improved Psychoacoustic Model for Audio Coding Based on Wavelet Packet. IEEE Int Conf SETIT 2007

  63. Zeyad T, Hanoon A (2005) Speech signal compression using wavelet and linear predictive coding. Al-Khwarizmi Eng J 1(1):52–60

    Google Scholar 

  64. Lin Y, et al. (2015) Principles of psychoacoustics." Audio Watermark: A Comprehensive Foundation Using MATLAB: 15-49

  65. Website: Https://pixabay.com/sound-effects/search/wav/

  66. Website: https://www2.cs.uic.edu/i101/SoundFiles/

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mohamed Z. Elabdein.

Ethics declarations

Conflicts of interests

There is no financial, or non-financial assistance provided by a third party for the reported work.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Korany, N.O., Elboghdadly, N.M. & Elabdein, M.Z. High capacity, secure audio watermarking technique integrating spread spectrum and linear predictive coding. Multimed Tools Appl 83, 50645–50668 (2024). https://doi.org/10.1007/s11042-023-17630-w

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-17630-w

Keywords

Navigation