August 2012, 2012:20,
Open Access This content is freely available online to anyone, anywhere at any time.
Date: 08 Aug 2012
Speech steganography using wavelet and Fourier transforms
A new method to secure speech communication using the discrete wavelet transforms (DWT) and the fast Fourier transform is presented in this article. In the first phase of the hiding technique, we separate the speech high-frequency components from the low-frequency components using the DWT. In a second phase, we exploit the low-pass spectral proprieties of the speech spectrum to hide another secret speech signal in the low-amplitude high-frequency regions of the cover speech signal. The proposed method allows hiding a large amount of secret information while rendering the steganalysis more complex. Experimental results prove the efficiency of the proposed hiding technique since the stego signals are perceptually indistinguishable from the equivalent cover signal, while being able to recover the secret speech message with slight degradation in the quality.
Kahn D: The History of Steganography. Lecture Notes in Computer Science. 1174 edition. Springer, New York; 1996:11023.
Johnson NF, Jajodia S: Exploring steganography: seeing the unseen. IEEE Comput. 1998,31(2):26–34.CrossRef
Sridevi R, Damodaram A, Narasimham SVL: Efficient method of audio steganography by modified LSB algorithm and strong encryption key with enhanced security. J. Theor. Appl. Inf. Technol. 2009,5(6):768–771.
Bender W, Gruhl D, Morimoto N: Techniques for data hiding. IBM Syst. J. 1996,35(3):313–336.CrossRef
Huang D, Yeo T: Robust and Inaudible Multi-Echo Audio Watermarking, in Proceedings of the Third IEEE Pacific-Rim Conference on Multimedia, Advances in Multimedia Information Processing Taipei. China; 2002:615–622.
Shirali-Shahreza S, Shirali-Shahreza M: Steganography in Silence Intervals of Speech, Proceedings of the Fourth IEEE International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2008). Harbin, China; August 15–17, 2008:605–607.
Misiti M, Misiti Y, Oppenheim G, Poggi JM: Matlab Wavelet Toolbox (Version 4.0): Tutorial and Reference Guide The Mathworks. Natick, USA; janv 2007.
Lin B, Nguyen B, Olsen ET: Orthogonal Wavelets and Signal Processing. Edited by: Clarkson PM, Stark H. Academic, London; 1995:1–70. [Signal Processing Methods for Audio, Images and Telecommunications] ed. by
Mallat S: A Wavelet Tour of Signal Processing. Academic, San Diego, CA; 1998.MATH
Ooi J, Viswanathan V: Applications of Wavelets to Speech Processing. Edited by: Ramachandran RP, Mammone R. Kluwer Academic Publishers, Boston; 1995:449–464. [Modern Methods of Speech Processing] ed. by
Elliott DF, Rao KR: Fast Transforms: Algorithms. Analyses, Applications (Academic, New York; 1982.MATH
Andreas S, Ed PT, Venkatraman A: Audio Signal Processing and Coding. Wiley-Interscience Publication, USA; 2006. ISBN 978–0-471–79147–8, TK5102.92.S73
Strange W, Edman TR, Jenkins JJ: Acoustic and phonological factors in vowel identification. J. Exp. Psychol. Hum. Percept. Perform. 1979,5(4):643–656.CrossRef
CY Espy-Wilson: Acoustic measures for linguistic features distinguishing the semivowels in American English. J. Acoust. Soc. Am 1992, 92:736–757.CrossRef
Childers DG, Hahn M, Larar JN: Silent and voiced/unvoiced/mixed excitation (four-way) classification of speech. IEEE Trans. ASSP 1989,37(11):1771–1774.CrossRef
O’Shaughnessy D: Speech Communications: Human and Machine. 2nd edition. Wiley-IEEE Press, New York, NY; 1999.
Makhoul J: Linear prediction: a tutorial review. Proc. IEEE 1975,63(5):561–580.CrossRef
Itakura F: Line spectrum representation of linear predictive coefficients of speech signals. J. Acoust. Soc. Am 1975,57(1):S35.CrossRef
Oppenheim AV, Schafer WR, Buck AJ: Discrete-Time Signal Processing. Prentice Hall, Upper Saddle River, NJ; 1999:468–471. ISBN 0–13–754920–2
Hess W: Pitch Determination of Speech Signals. Springer, Berlin; 1983.CrossRef
Soong F, Juang B: Line spectrum pair (LSP) and speech data compression. Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP ’84). San Diego, Calif, USA 9; March 1984:37–40.
ITU-T: Recommendation G. 723.1. Dual rate speech coder for multimedia communications transmitting at 5.3 and 6.3 kbit/s. 1996.
Hu Y, Loizou P: Subjective evaluation and comparison of speech enhancement algorithms. Speech Commun 2007, 49:588–601.CrossRef
Hu Y, Loizou P: Evaluation of objective quality measures for speech enhancement. IEEE Trans. Speech Audio Process. 2008,16(1):229–238.CrossRef
Ma J, Hu Y, Loizou P: Objective measures for predicting speech intelligibility in noisy conditions based on new band-importance functions. J. Acoust. Soc. Am. 2009,125(5):3387–3405.CrossRef
ITU: Perceptual Evaluation of Speech Quality (PESQ), and Objective Method for End-to-End Speech Quality Assessment of Narrowband Telephone Networks and Speech Codecs, ITU-T Recommendation 862. 2000.
ITU-T Recommendation: Methods for Subjective Determination of Speech Quality International Telecommunication Union. Geneva; 2003:800.
- Speech steganography using wavelet and Fourier transforms
- Open Access
- Available under Open Access This content is freely available online to anyone, anywhere at any time.
EURASIP Journal on Audio, Speech, and Music Processing
- Online Date
- August 2012
- Online ISSN
- Springer International Publishing AG
- Additional Links
- Audio steganography
- Discrete wavelet transform
- Fast Fourier transform
- Data hiding
- Speech steganography