IH 2010: Information Hiding pp 208-220 | Cite as

STBS: A Statistical Algorithm for Steganalysis of Translation-Based Steganography

  • Peng Meng
  • Liusheng Hang
  • Zhili Chen
  • Yuchong Hu
  • Wei Yang
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6387)

Abstract

Translation-Based Steganography is a secure text steganographic algorithm. In this paper, we present a novel statistical algorithm for steganalysis of Translation-Based Steganography (STBS). We first show that there are fewer high-frequency words in stegotexts than in normal texts. We then design a preprocessor to refine all the given texts to expand the frequency differences between normal texts and stegotexts. 12 dimensional feature vectors sensitive to frequency are derived from the refined texts. We finally use a SVM classifier to classify given texts to normal texts and stegotexts. A series of experiments is given to demonstrate the performance of STBS.

Keywords

steganalysis natural language steganography translation-based steganography text SVM STBS 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Bennett, K.: Linguistic steganography: Survey, analysis, and robustness concerns for hiding information in text. Purdue University, CERIAS Tech. Report (2004)Google Scholar
  2. 2.
    Grothoff, C., Grothoff, K., Alkhutova, L., Stutsman, R., Atallah, M.: Translation-based steganography. In: Barni, M., Herrera-Joancomartí, J., Katzenbeisser, S., Pérez-González, F. (eds.) IH 2005. LNCS, vol. 3727, pp. 219–233. Springer, Heidelberg (2005)CrossRefGoogle Scholar
  3. 3.
  4. 4.
    Wayner, P.: Disappearing cryptography: information hiding: steganography and watermarking. Morgan Kaufmann Pub., San Francisco (2008)Google Scholar
  5. 5.
    Chapman, M., Davida, D.: Hiding the hidden: A software system for concealing ciphertext as innocuous text. LNCS, pp. 335–345. Springer, Heidelberg (1997)Google Scholar
  6. 6.
    Taskiran, C., Topkara, U., Topkara, M., Delp, E.: Attacks on lexical natural language steganography systems. In: Proceedings of SPIE, vol. 6072, pp. 97–105 (2006)Google Scholar
  7. 7.
    Zhili, C., Liusheng, H., Zhenshan, Y., Wei, Y., Lingjun, L., Xueling, Z., Xinxin, Z.: Linguistic steganography detection using statistical characteristics of correlations between words. In: Solanki, K., Sullivan, K., Madhow, U. (eds.) IH 2008. LNCS, vol. 5284, pp. 224–234. Springer, Heidelberg (2008)CrossRefGoogle Scholar
  8. 8.
    Zhili, C., Liusheng, H., Zhenshan, Y., Lingjun, L., Wei, Y.: A statistical algorithm for linguistic steganography detection based on distribution of words. In: Third International Conference on Availability, Reliability and Security, ARES 2008, pp. 558–563 (2008)Google Scholar
  9. 9.
    Zhili, C., Liusheng, H., Zhenshan, Y., Xinxin, Z.: Effective linguistic steganography detection. In: IEEE 8th International Conference on Computer and Information Technology Workshops, CIT Workshops 2008, pp. 224–229 (2008)Google Scholar
  10. 10.
    Stutsman, R., Atallah, M., Grothoff, K.: Lost in just the translation. In: Proceedings of the 2006 ACM Symposium on Applied Computing, pp. 338–345. ACM, New York (2006)CrossRefGoogle Scholar
  11. 11.
    Grothoff, C., Grothoff, K., Stutsman, R., Alkhutova, L., Atallah, M.: Translation-based steganography. Journal of Computer Security 17(3), 269–303 (2009)Google Scholar
  12. 12.
    Peng, M., Liusheng, H., Wei, Y., Zhili, C.: Attacks on translation based steganography. In: Proceedings of the 2009 IEEE Youth Conference on Information, Computing and Telecommunication, pp. 227–230 (2009)Google Scholar
  13. 13.
    Google: Google translator (2009), http://translate.google.cn
  14. 14.
    Systran: Systran translator (2009), https://www.systransoft.com
  15. 15.
    PROMT: Promt translation software (2009), http://www.promt.com
  16. 16.
    Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines (2001), http://www.csie.ntu.edu.tw/~cjlin/libsvm
  17. 17.
    Koehn, P.: Europarl: A parallel corpus for statistical machine translation. In: MT Summit, vol. 5 (2005)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Peng Meng
    • 1
  • Liusheng Hang
    • 1
    • 2
  • Zhili Chen
    • 1
    • 2
  • Yuchong Hu
    • 1
  • Wei Yang
    • 1
    • 2
  1. 1.NHPCC, Depart. of CS. & Tech., USTCHefeiChina
  2. 2.Suzhou Institute for Advanced Study, USTCSuzhouChina

Personalised recommendations