IH 2010: Information Hiding pp 208-220 | Cite as
STBS: A Statistical Algorithm for Steganalysis of Translation-Based Steganography
Abstract
Translation-Based Steganography is a secure text steganographic algorithm. In this paper, we present a novel statistical algorithm for steganalysis of Translation-Based Steganography (STBS). We first show that there are fewer high-frequency words in stegotexts than in normal texts. We then design a preprocessor to refine all the given texts to expand the frequency differences between normal texts and stegotexts. 12 dimensional feature vectors sensitive to frequency are derived from the refined texts. We finally use a SVM classifier to classify given texts to normal texts and stegotexts. A series of experiments is given to demonstrate the performance of STBS.
Keywords
steganalysis natural language steganography translation-based steganography text SVM STBSPreview
Unable to display preview. Download preview PDF.
References
- 1.Bennett, K.: Linguistic steganography: Survey, analysis, and robustness concerns for hiding information in text. Purdue University, CERIAS Tech. Report (2004)Google Scholar
- 2.Grothoff, C., Grothoff, K., Alkhutova, L., Stutsman, R., Atallah, M.: Translation-based steganography. In: Barni, M., Herrera-Joancomartí, J., Katzenbeisser, S., Pérez-González, F. (eds.) IH 2005. LNCS, vol. 3727, pp. 219–233. Springer, Heidelberg (2005)CrossRefGoogle Scholar
- 3.Maker, K.: TEXTO, ftp://ftp.funet.fi/pub/crypt/steganography/texto.tar.gz
- 4.Wayner, P.: Disappearing cryptography: information hiding: steganography and watermarking. Morgan Kaufmann Pub., San Francisco (2008)Google Scholar
- 5.Chapman, M., Davida, D.: Hiding the hidden: A software system for concealing ciphertext as innocuous text. LNCS, pp. 335–345. Springer, Heidelberg (1997)Google Scholar
- 6.Taskiran, C., Topkara, U., Topkara, M., Delp, E.: Attacks on lexical natural language steganography systems. In: Proceedings of SPIE, vol. 6072, pp. 97–105 (2006)Google Scholar
- 7.Zhili, C., Liusheng, H., Zhenshan, Y., Wei, Y., Lingjun, L., Xueling, Z., Xinxin, Z.: Linguistic steganography detection using statistical characteristics of correlations between words. In: Solanki, K., Sullivan, K., Madhow, U. (eds.) IH 2008. LNCS, vol. 5284, pp. 224–234. Springer, Heidelberg (2008)CrossRefGoogle Scholar
- 8.Zhili, C., Liusheng, H., Zhenshan, Y., Lingjun, L., Wei, Y.: A statistical algorithm for linguistic steganography detection based on distribution of words. In: Third International Conference on Availability, Reliability and Security, ARES 2008, pp. 558–563 (2008)Google Scholar
- 9.Zhili, C., Liusheng, H., Zhenshan, Y., Xinxin, Z.: Effective linguistic steganography detection. In: IEEE 8th International Conference on Computer and Information Technology Workshops, CIT Workshops 2008, pp. 224–229 (2008)Google Scholar
- 10.Stutsman, R., Atallah, M., Grothoff, K.: Lost in just the translation. In: Proceedings of the 2006 ACM Symposium on Applied Computing, pp. 338–345. ACM, New York (2006)CrossRefGoogle Scholar
- 11.Grothoff, C., Grothoff, K., Stutsman, R., Alkhutova, L., Atallah, M.: Translation-based steganography. Journal of Computer Security 17(3), 269–303 (2009)Google Scholar
- 12.Peng, M., Liusheng, H., Wei, Y., Zhili, C.: Attacks on translation based steganography. In: Proceedings of the 2009 IEEE Youth Conference on Information, Computing and Telecommunication, pp. 227–230 (2009)Google Scholar
- 13.Google: Google translator (2009), http://translate.google.cn
- 14.Systran: Systran translator (2009), https://www.systransoft.com
- 15.PROMT: Promt translation software (2009), http://www.promt.com
- 16.Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines (2001), http://www.csie.ntu.edu.tw/~cjlin/libsvm
- 17.Koehn, P.: Europarl: A parallel corpus for statistical machine translation. In: MT Summit, vol. 5 (2005)Google Scholar