Abstract
This paper presents an improvement to watermark extraction in speech signal watermarking. Noise added to the speech signal during transmission degrades the efficiency of watermark extraction, so removing this noise can improve the extraction process. Speech enhancement methods aim to reduce the noise distortion in a speech signal. In this paper, watermarking is performed using a hybrid strategy that combines Empirical Mode Decomposition (EMD) and block-based Singular Value Decomposition (block-based SVD) with a chaotically encrypted watermark. The watermark is embedded into the matrix of singular values (SVs) because the SVs are stable under the disturbances that different attacks cause in the speech signal. Encrypting the watermark increases its security level. At the receiver side, the speech signal is first enhanced using spectral subtraction, a Wiener filter, or an adaptive Wiener filter, and the watermark is then extracted. The paper presents a comparative study that evaluates the performance of each of these enhancement methods. Simulation results indicate that the enhancement step improves watermark extraction, especially when the adaptive Wiener filter is used.
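The idea of hiding a watermark in the singular values of signal blocks can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the embedding rule, so the code assumes the common additive rule D = S + kW, and it leaves out the EMD stage, the chaotic encryption, and the enhancement step. The function names (`embed_block`, `extract_block`, `embed_signal`, `extract_signal`), the strength `k`, and the key layout are hypothetical, and the signal length is assumed to be a multiple of the block size.

```python
import numpy as np

def embed_block(block, wm, k=0.01):
    """Embed watermark matrix wm into the singular values of one square block.

    Assumed additive rule (not confirmed by the abstract): D = S + k * W.
    """
    U, s, Vt = np.linalg.svd(block)
    D = np.diag(s) + k * wm              # perturb the SV matrix with the watermark
    Uw, sw, Vwt = np.linalg.svd(D)       # second SVD of the perturbed matrix
    marked = U @ np.diag(sw) @ Vt        # rebuild the block with the new SVs
    key = (Uw, Vwt, s)                   # side information kept for extraction
    return marked, key

def extract_block(received, key, k=0.01):
    """Recover the watermark estimate from one (possibly distorted) block."""
    Uw, Vwt, s = key
    sw_rx = np.linalg.svd(received, compute_uv=False)
    D_rx = Uw @ np.diag(sw_rx) @ Vwt
    return (D_rx - np.diag(s)) / k

def embed_signal(x, wm, k=0.01):
    """Split a 1-D signal into n*n-sample segments and mark each one."""
    n = wm.shape[0]
    blocks = x.reshape(-1, n, n)         # assumes len(x) is a multiple of n*n
    marked, keys = [], []
    for b in blocks:
        m, key = embed_block(b, wm, k)
        marked.append(m)
        keys.append(key)
    return np.stack(marked).reshape(-1), keys

def extract_signal(y, keys, k=0.01):
    """Extract one watermark estimate per block of the received signal."""
    n = keys[0][0].shape[0]
    blocks = y.reshape(-1, n, n)
    return [extract_block(b, key, k) for b, key in zip(blocks, keys)]

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    wm = rng.integers(0, 2, size=(8, 8)).astype(float)   # toy binary watermark
    x = rng.standard_normal(8 * 8 * 4)                   # stand-in for speech samples
    y, keys = embed_signal(x, wm)
    recovered = extract_signal(y, keys)
    print(all(np.array_equal(r > 0.5, wm > 0.5) for r in recovered))
```

In this noise-free toy setting, each block returns the watermark after thresholding at 0.5. With a noisy received signal, an enhancement stage such as the ones the abstract compares would be applied to `y` before calling `extract_signal`.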
About this article
Cite this article
abd-ElMordy, E., el-Gazar, S., Abbas, A.M. et al. Simultaneous Enhancement and Watermarking of Speech Signals. Int J Speech Technol 24, 219–234 (2021). https://doi.org/10.1007/s10772-019-09638-1
DOI: https://doi.org/10.1007/s10772-019-09638-1