Abstract
With the improvement of security awareness, in order to guarantee information security, more advanced and secure encryption algorithms are applied to Microsoft Office. People also set more complex encryption passwords. However, once the initial password is forgotten, the encrypted information needs to be retrieved. The conventional brute force cracking methods and password recovery programs can hardly meet the actual deciphering needs. To this end, we develop a distributed parallel password recovery program (MT-Office) for Microsoft Office on the domestic heterogeneous multi-core processor (MT-3000). MT-Office takes full advantage of the multi-core and heterogeneous features of MT-3000, and is optimized and improved in both vectorization and global computing. At the same time, MT-Office provides multiple recovery strategies in password generation to improve the recovery efficiency. Compared with other platforms (e.g., Intel platforms and FT platforms), MT-3000 heterogeneous platform can achieve 60 × –218 × speedup ratio. For Office2010, we perform a strong scalability test on the new-generation supercomputer in National Supercomputer Center in Tianjin. MT-Office not only extends to 65,536 acceleration clusters on this system, shows good scalability, but also achieves almost linear speedup ratio. For Office2007, compared with other password recovery programs, MT-Office can achieve 2.5 × –131.1 × speedup ratio. It can be seen that MT-Office can better exploit the advantages of MT-3000, which not only has good scalability and parallelism, but also has faster deciphering speed and can be applied to practical engineering application.
Similar content being viewed by others
Data availability
The raw/processed code required to reproduce these findings cannot be shared at this time as the data also forms part of an ongoing study. But all data and pictures which appear in the submitted article are available from the corresponding author by request.
References
AOPR (Advanced Office Password Recovery) [EB/OL]. https://www.passwordrecovery.cn/. Accessed Jan 2023
Chen, L., Yang, Y., Wang, J., Chen, Z., Wen, L., Yang, G.: Word 2003 document password cracking based on the china supercomputer. Proceedings of the 6th International Asia Conference on Industrial Engineering and Management Innovation. Atlantis Press, 251–263 (2016)
Dong, B., Zhao, R., Zhang, H.: Office password recovery technology based on sunway many-core processor. Comput. Technol. Dev. 31(05), 137–142 (2021)
Dworkin M., Barker E., Nechvatal J., Foti J., Bassham L., Roback E., Dray J.: Advanced Encryption Standard (AES), Federal Inf. Process. Stds. (NIST FIPS), National Institute of Standards and Technology, Gaithersburg, MD (2001)
EDPR (Elcomsoft Distributed Password Recovery) [EB/OL]. https://cn.elcomsoft.com/. Accessed Jan 2023
Fang J., Zhang P., Huang C., Tang T., Lu K., Wang R., et al.: Programming bare-metal accelerators with heterogeneous threading models: a case study of matrix-3000. arXiv preprint arXiv:2210.12230 (2022)
Gong, C., Liu, J., Chi, L., Huang, H., Fang, J., Gong, Z.: GPU accelerated simulations of 3D deterministic particle transport using discrete ordinates method. J. Comput. Phys. 230(15), 6010–6022 (2011)
Gong, C., Liu, J., Huang, H., Gong, Z.: Particle transport with unstructured grid on GPU. Comput. Phys. Commun. 183(3), 588–593 (2012)
Gong, C., Liu, J., Bao, W., Pan, D., Gan, X., Li, S., et al.: Review on ecological construction of domestic high-performance parallel application software in post moore era[J]. J. Syst. Simul. 34(10), 2107–2118 (2022)
Hashcat [EB/OL]. https://hashcat.net/hashcat/. Accessed Jan 2023
Hong J., Chen Z., Hu J.: Analysis of encryption mechanism in Office 2013. 2015 IEEE 9th International Conference on Anti-counterfeiting, Security, and Identification (ASID). IEEE, 29–32 (2015)
Hranický, R., Matoušek, P., Ryšavý, O., Veselý, V.: Experimental evaluation of password recovery in encrypted documents. In: Proceedings of ICISSP, pp. 299–306. SciTePress-Science and Technology Publications, Setúbal (2016)
Hranický R., Zobal L., Večeřa V., Matoušek P.: Distributed password cracking in a hybrid environment. Proceedings of SPI. 75–90 (2017)
Hranický, R., Zobal, L., Ryšavý, O., Kolář, D.: Distributed password cracking with BOINC and hashcat. Digit. Investig. 30, 161–172 (2019)
John the Ripper [EB/OL]. https://www.openwall.com/john/. Accessed Jan 2023
Kim K.: Distributed password cracking on GPU nodes. 2012 7th International Conference on Computing and Convergence Technology (ICCCT). IEEE, 647–650 (2012)
Kim J., Park S.: Secure password translation for document protection of SOHO companies. 2017 International Conference on program Security and Assurance (ICSSA). IEEE, 66–68 (2017)
Kim, M., Jung, Y., Song, J.: A modified exhaustive search on a password system using SHA-1. Int. J. Inf. Secur. 16(3), 263–269 (2017)
Li, L., Zhou, Q., Li, B.: Method for implementing office document password cracking on multi-core FPGA. J. Chin. Computer Syst. 40(05), 929–934 (2019)
Li, B., Zhou, Q., Cao, Y., Si, X.: Cognitively reconfigurable mimic-based heterogeneous password recovery system. Comput. Secur. 116, 102667 (2022)
Liu, P., Li, S., Ding, Q.: An energy-efficient accelerator based on hybrid CPU-FPGA devices for password recovery. IEEE Trans. Comput. 68(2), 170–181 (2018)
Lu, K., Wang, Y., Guo, Y., Huang, C., Liu, S., Wang, R., et al.: MT-3000: a heterogeneous multi-zone processor for HPC. CCF Trans. High Perform. Comput. 4(2), 150–164 (2022)
MS-CFB: Compound File Binary File Format. Protocol Revision 11.0 (2021). https://learn.microsoft.com/en-us/openspecs/windows_protcols/ms-cfb/
MS-OFFCRYPTO: Office Document Cryptography Structure. Protocol Revision 10.0 (2021). https://learn.microsoft.com/en-us/openspecs/office_file_formats/ms-offcrypto/
Tiwari A., Sharma N., Kaushik I., Tiwari R.: Privacy issues & security techniques in big data. 2019 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS). IEEE, 51–56 (2019)
Wu X., Hong J., Zhang Y.: Analysis of OpenXML-based office encryption mechanism. 2012 7th International Conference on Computer Science & Education (ICCSE). IEEE, 521–524 (2012)
Xie, Q., Zhou, Q., Si, X., Li, B.: Decision Mechanism of Password Recovery Based on Bloom Filter in Heterogeneous Platform. In: 2016 2nd Workshop on Advanced Research and Technology in Industry Applications (WARTIA-16), pp. 347–350. Atlantis Press, Amsterdam (2016)
Zhan X., Hong J.: Study on GPU-based password recovery for MS Office2003 document. 2012 7th International Conference on Computer Science & Education (ICCSE). IEEE, 517–520 (2012)
Acknowledgements
This work was supported by the National Key Research and Development Program of China (Grant No.2021YFB0300101), and the National Natural Science Foundation of China (Grant No. 62032023, 61902411, 12002382). The authors would like to thank the National Super-Computer Center in Tianjin for the resources and technical support.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
On behalf of all authors, the corresponding author states that there is no conflict of interest.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Luo, Y., Yang, B., Liu, J. et al. MT-office: parallel password recovery program for office on domestic heterogeneous multi-core processor. CCF Trans. HPC 5, 231–244 (2023). https://doi.org/10.1007/s42514-023-00146-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s42514-023-00146-y