Abstract
The paper explores a novel methodology in source code obfuscation through the application of text-based recurrent neural network (RNN) encoder-decoder models in ciphertext generation and key generation. Sequence-to-sequence models are incorporated into the model architecture to generate obfuscated code, generate the deobfuscation key, and live execution. Quantitative benchmark comparison to existing obfuscation methods indicate significant improvement in stealth and execution cost for the proposed solution, and experiments regarding the model’s properties yield positive results regarding its character variation, dissimilarity to the original codebase, and consistent length of obfuscated code.
Keywords
- Code obfuscation
- Encoder-decoder models
S. Datta—Work performed at the Hong Kong University of Science and Technology.
This is a preview of subscription content, access via your institution.
Buying options







Notes
- 1.
Code repository: https://github.com/dattasiddhartha/DeepObfusCode.
- 2.
References
Popa, M.: Techniques of program code obfuscation for secure software. J. Mob. Embed. Distrib. Syst. 3, 205–219 (2011)
Viticchie, A., et al.: Assessment of Source Code Obfuscation Techniques (2017). https://arxiv.org/pdf/1704.02307.pdf
Schneider, J., Locher, T.: Obfuscation using Encryption (2016). https://arxiv.org/pdf/1612.03345.pdf
Baluja, S.: Hiding images in plain sight: deep steganography. In: Advances in Neural Information Processing Systems (2017)
Benoit, S.: ConvCrypt (2018). https://github.com/santient/convcrypt
Ismail, A., Galal-Edeen, H., Khattab, S., Mohamed, A.E., Bahtity, M.E.: Satellite image encryption using neural networks backpropagation. In: International Conference on Computer Theory and Applications (2012)
Hesamifard, E., Takabi, H., Ghasemi, M.: CryptoDL: Deep Neural Networks over Encrypted Data (2017). https://arxiv.org/pdf/1711.05189.pdf
Cho, K., et al.: Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation (2014). https://arxiv.org/pdf/1406.1078.pdf
Ahmadi, S.: Attention-based Encoder-Decoder Networks for Spelling and Grammatical Error Correction (2018). https://arxiv.org/pdf/1810.00660.pdf
Khatri, C., Singh, G., Parikh, N.: Abstractive and extractive text summarization using document context vector and recurrent neural networks. In: KDD Deep Learning Day (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Datta, S. (2021). DeepObfusCode: Source Code Obfuscation through Sequence-to-Sequence Networks. In: Arai, K. (eds) Intelligent Computing. Lecture Notes in Networks and Systems, vol 284. Springer, Cham. https://doi.org/10.1007/978-3-030-80126-7_45
Download citation
DOI: https://doi.org/10.1007/978-3-030-80126-7_45
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-80125-0
Online ISBN: 978-3-030-80126-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)