Simultaneous Enhancement and Watermarking of Speech Signals

abd-ElMordy, Eman; el-Gazar, Safaa; Abbas, Alaa M.; El-Dolil, Sami; El-Dokany, Ibrahim M.; Dessouky, Moawad I.; El-Rabaie, El-Sayed M.; El-Fishawy, Adel S.; El-Samie, Fathi E. Abd

doi:10.1007/s10772-019-09638-1

Simultaneous Enhancement and Watermarking of Speech Signals

Published: 16 January 2021

Volume 24, pages 219–234, (2021)
Cite this article

International Journal of Speech Technology Aims and scope Submit manuscript

Eman abd-ElMordy¹,
Safaa el-Gazar ORCID: orcid.org/0000-0001-8002-0543¹,
Alaa M. Abbas¹,
Sami El-Dolil¹,
Ibrahim M. El-Dokany¹,
Moawad I. Dessouky¹,
El-Sayed M. El-Rabaie¹,
Adel S. El-Fishawy¹ &
…
Fathi E. Abd El-Samie¹

221 Accesses
Explore all metrics

Abstract

The paper presents an improvement of the watermark extraction in speech signal watermarking. The noise added to the speech signal during transmission affects the efficiency of the watermark extraction. Removing this noise may aid to enhance the extraction process. There are methods of the speech signal enhancement, which aims to reduce the noise distortion in the speech signal. In this paper, the watermark process is done using the hybrid strategy of the Empirical Mode Decomposition (EMD) and the block-based Singular Value Decomposition (block-based SVD) with chaotic encrypted watermark. The watermark is embedded into the Singular Values matrix (SVs) because of its stability against any disturbance in the speech signal due to the different attacks. The encrypted watermark increases the security level of the watermark. When the watermark extracted at the receiver side, the speech signal will be enhanced first using spectral subtraction, Wiener filter or adaptive Wiener filter enhancement methods. The paper introduces a comparison study to evaluate the performance of each of them. Simulation results indicate that using the enhancement step improve the watermark extraction, especially using the adaptive Wiener filter.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Video steganography: recent advances and challenges

Article Open access 04 April 2023

A robust blind color watermarking algorithm based on the Radon-DCT transform

Article Open access 17 January 2024

Review of wavelet denoising algorithms

Article 03 April 2023

References

Abd El-Fattah, M. A., Dessouky, M. I., Abbas, A. M., Diab, S. M., El-Rabaie, E. M., Al-Nuaimy, W., et al. (2014). Speech enhancement with an adaptive Wiener filter. International Journal of Speech Technology, 17, 53–64.
Article Google Scholar
Abd El-Moneim, S., Dessouky, M. I., Abd El-Samie, F. E., Nassar, M. A., & El-Naby, M. A. (2015). Hybrid speech enhancement with empirical mode decomposition and spectral subtraction for efficient speaker identification. International Journal of Speech Technology, 18, 555–564.
Article Google Scholar
Abd El-Samie, F. E. (2009). An efficient singular value decomposition algorithm for digital audio watermarking. International Journal of Speech Technology, 12, 27–45.
Article Google Scholar
Aparna, R., and Chithra, P.l. (2016) A review on cryptographic algorithms for speech signal security. International Journal of Emerging Trends & Technology in Computer Science(IJETTCS), 5, 84–88
Google Scholar
Alvarez, G., & Li, S. (2006). Breaking an encryption scheme based on chaotic baker map. Physics Letters A, 352, 78–82.
Article Google Scholar
Bassia, P., & Pitas, I. P. (2011). Robust audio watermarking in the time domain. IEEE Transactions on Multimedia Journal, 3, 232–241.
Article Google Scholar
Bhatt, K., Vinitha, C. S., & Gupta, R. (2018). Secure speech enhancement using LPC based FEM in Wiener filter. In S. Satapathy, V. Bhateja, K. Raju, & B. Janakiramaiah (Eds.), Data engineering and intelligent computing. Advances in intelligent systems and computing (Vol. 542, pp. 657–665). Singapore: Springer.
Google Scholar
Celik, M., Sharma, G., & Tekalp, A. M. (2005) Pitch and duration modification for speech watermarking. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing.
Cheng, Q., & Sorenson, J. (2001) Spread spectrum signaling for speech watermarking. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing.
Cheung, S.-C., Chiu, D. K. W., & Ho, C. (2008). The use of digital watermarking for intelligence multimedia document distribution. Journal of Theoretical and Applied Electronic Commerce Research, 3, 103–118.
Article Google Scholar
Cox, I. J., Miller, M. L., & Bloom, J. A. (2000) Watermarking applications and their properties. In Proceedings International Conference on Information Technology: Coding and Computing.
Desai, H. V. (2012). Steganography, cryptography, watermarking: A comparative study. Journal of Global Research in Computer Science, 3(12), 33–35.
Google Scholar
Ghazy, R. A., ElFishawy, N. A., Hadhoud, M. M., Dessouky, M. I., & El-Samie, F. E. A. (2007). An efficient block-by-block SVD-based image watermarking scheme. In Proceedings of the National Radio Science Conference.
Girin, L., & Marchand, S. (2004) Watermarking of speech signals using the sinusoidal model and frequency modulation of the partials. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (Vol. 1, pp. 633–636).
Gurijala, A., & Deller Jr, J. R. (2007) On the robustness of parametric watermarking of speech. In Proccedings of International Workshop on Multimedia Content Analysis and Mining (Vol. 4577/2007, pp. 501–510). Springer, Berlin.
Hasan, M. K., Salahddin, S., & Khan, M. R. (2004). A modified a priori SNR for speech enhancement using spectral subtraction rules. IEEE Signal Processing Letters, 11, 450–453.
Article Google Scholar
Kirovski, D., & Malvar, H. S. (2003). Spread-spectrum watermarking of audio signal. IEEE Transactions on Signal Processing Journal, 51, 1020–1033.
Article MathSciNet Google Scholar
Kwok, H. S., & Tang, W. K. S. (2007). A fast image encryption system based on chaotic maps with finite precision representation. Chaos, Solitons & Fractals, 32, 1518–1529.
Article MathSciNet Google Scholar
Liu, Y.-W., & Smith, J. O. (2007). Audio watermarking through deterministic plus stochastic signal decomposition. EURASIP Journal on Information Security, 2007, 1–12.
Article Google Scholar
Mahajan, S., & Singh, A. (2012). A review of methods and approach for secure steganography. International Journal of Advanced Research in Computer Science and Software Engineering, 2, 67–70.
Google Scholar
Najafi, E., & Loukhaoukha, K. (2019). Hybrid secure and robust image watermarking scheme based on SVD and sharp frequency localized contourlet transform. Journal of Information Security and Applications, 44, 144–156.
Article Google Scholar
Nematollahi, M. A., & Al-Haddad, S. A. R. (2013). An overview of digital speech watermarking. International Journal of Speech Technology, 78, 1–18.
Google Scholar
Nematollahi, M. A., Rosales, H. G., Akhaee, M. A., & Al-Haddad, S. A. R. (2015). Robust digital speech watermarking for online speaker recognition. Mathematical Problems in Engineering. https://doi.org/10.1155/2015/372398.
Article Google Scholar
Nematollahi, M. A., Vorakulpipat, C., & Rosale, H. G. (2016) Speech watermarking. In Digital watermarking. Springer topics in signal processing (Vol. 11, pp. 39–53). Springer, Singapore.
Pattanshetti, P., Dongaonkar, S., & Karpe, S. (2015). Digital watermarking in audio using least significant bit and discrete cosine transform. International Journal of Computer Science and Information Technologies, 6(4), 3688–3692.
Google Scholar
Poddar, A., Sahidullah, M., & Saha, G. (2019). Quality measures for speaker verification with short utterances. Digital Signal Processing, 88, 66–79.
Article Google Scholar
Ramos, D., Haraksim, R., & Meuwly, D. (2017). Likelihood ratio data to report the validation of a forensic fingerprint evaluation method. Data in Brief, 10, 75–92.
Article Google Scholar
Sakaguchi, S., Arai, T., & Murahara, Y. (2000). The effect of polarity inversion of speech on human perception and data hiding as an application. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (Vol. 2, pp. 917–920).
Skopin, D. E., El-Emary, I. M. M., Rasras, R. J., & Diab, R. S. (2010) Advanced algorithms in audio steganography for hiding human speech signal. In Proceedings of the International Conference on Advanced Computer Control.
Upadhyay, N., & Karmakar, A. (2015). Speech enhancement using spectral subtraction-type algorithms: A comparison and simulation study. Procedia Computer Science, 54, 574–584.
Article Google Scholar
Upadhyay, N., & Jaiswal, R. K. (2016). Single channel speech enhancement: Using Wiener filtering with recursive noise estimation. Procedia Computer Science, 84, 22–30.
Article Google Scholar
Wang, S., Sekey, A., & Gersho, A. (1992). An objective measure for predicting subjective quality of speech coders. IEEE Journal on Selected Areas in Communications, 10, 819–829.
Article Google Scholar
Xiang, P. T., Wong, K. W., & Liao, X. (2007). A novel symmetrical cryptosystem based on discretized two-dimensional chaotic map. Physics Letters A, 364, 252–258.
Article Google Scholar
Yelwande, A., Kansal, S., & Dixit, A. (2017). Adaptive wiener filter for speech enhancement. In Proceedings of the International Conference on Information, Communication, Instrumentation and Control.
Zeiler, A., Faltermeier, R., Keck, I. R., Tomé, A. M., Puntonet, C. G. & Lang, E. W. (2010). Empirical mode decomposition—An introduction. In Proceedings of the International Joint Conference on Neural Networks.

Download references

Author information

Authors and Affiliations

Faculty of Electronic Engineering, Menoufia University, Menouf, 32952, Egypt
Eman abd-ElMordy, Safaa el-Gazar, Alaa M. Abbas, Sami El-Dolil, Ibrahim M. El-Dokany, Moawad I. Dessouky, El-Sayed M. El-Rabaie, Adel S. El-Fishawy & Fathi E. Abd El-Samie

Authors

Eman abd-ElMordy
View author publications
You can also search for this author in PubMed Google Scholar
Safaa el-Gazar
View author publications
You can also search for this author in PubMed Google Scholar
Alaa M. Abbas
View author publications
You can also search for this author in PubMed Google Scholar
Sami El-Dolil
View author publications
You can also search for this author in PubMed Google Scholar
Ibrahim M. El-Dokany
View author publications
You can also search for this author in PubMed Google Scholar
Moawad I. Dessouky
View author publications
You can also search for this author in PubMed Google Scholar
El-Sayed M. El-Rabaie
View author publications
You can also search for this author in PubMed Google Scholar
Adel S. El-Fishawy
View author publications
You can also search for this author in PubMed Google Scholar
Fathi E. Abd El-Samie
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Safaa el-Gazar.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

abd-ElMordy, E., el-Gazar, S., Abbas, A.M. et al. Simultaneous Enhancement and Watermarking of Speech Signals. Int J Speech Technol 24, 219–234 (2021). https://doi.org/10.1007/s10772-019-09638-1

Download citation

Received: 06 March 2019
Accepted: 16 September 2019
Published: 16 January 2021
Issue Date: March 2021
DOI: https://doi.org/10.1007/s10772-019-09638-1

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Simultaneous Enhancement and Watermarking of Speech Signals

Abstract

Access this article

Similar content being viewed by others

Video steganography: recent advances and challenges

A robust blind color watermarking algorithm based on the Radon-DCT transform

Review of wavelet denoising algorithms

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Simultaneous Enhancement and Watermarking of Speech Signals

Abstract

Access this article

Similar content being viewed by others

Video steganography: recent advances and challenges

A robust blind color watermarking algorithm based on the Radon-DCT transform

Review of wavelet denoising algorithms

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation