Abstract
A method of presenting source data into it’s compact form is known as data compression. In this process, data size is minimized, redundancy is eliminated, and excess information is gotten rid of. A reduction in actual data is usually advantageous because it uses less resources overall, including bandwidth, processing, space, time, and many others. There are numerous compression algorithms for reducing the size of data of different formats. Even for compressing a particular data type, many approaches are being used. The proposed research has explored three of the lossless compression techniques which are: Run Length Encoding, Lempel Ziv Welch, and Huffman Encoding algorithms. We found out that based on compression size, compression ratio, and space saving percentage, Lempel Ziv Welch outperformed the other two. In contrast, Huffman Encoding performed better than the other two based on compression time. In the best case, LZW got a compression size of 250992 bytes, a compression ratio of 5.0106, and a space saving percentage of 80.04% while Huffman encoding got a compression time of 32.28 ms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Dobre, C., Xhafa, F.: Intelligent services for big data science. Futur. Gener. Comput. Syst. 37, 267–281 (2014)
Sivarajah, U., Kamal, M.M., Irani, Z., Weerakkody, V.: Critical analysis of big data challenges and analytical methods. J. Bus. Res. 70, 263–286 (2017)
Kodituwakku, S.R., Amarasinghe, U.S.: Comparison of lossless data compression algorithms for text data. Indian J. Comput. Sci. Eng. 1(4), 416–425 (2010)
Porwal, S., Chaudhary, Y., Joshi, J., Jain, M., et al.: Data compression methodologies for lossless data and comparison between algorithms. Int. J. Eng. Sci. Innov. Technol. (IJESIT) 2, 142–147 (2013)
Arnold, R., Bell, T.: A corpus for the evaluation of lossless compression algorithms. In: Proceedings DCC ’97. Data Compression Conference, pp. 201–210 (1997)
Brisaboa, N.R., Cánovas, R., Claude, F., Martínez-Prieto, M.A., Navarro, G.: Compressed string dictionaries. In: Pardalos, P.M., Rebennack, S. (eds.) SEA 2011. LNCS, vol. 6630, pp. 136–147. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-20662-7_12
Brisaboa, N.R., Cerdeira-Pena, A., de Bernardo, G., Fariña, A., Navarro, G.: Space/time-efficient RDF stores based on circular suffix sorting. J. Supercomput. 1–41 (2022)
Kempa, D., Prezza, N.: At the roots of dictionary compression: string attractors. In: Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing, pp. 827–840 (2018)
Nishimoto, T., Tabei, Y.: LZRR: LZ77 parsing with right reference. In: 2019 Data Compression Conference (DCC), pp. 211–220 (2019)
Platos, J., Dvorský, J.: Word-based text compression. CoRR, abs/0804.3680 (2008)
Javed, M., Nagabhushan, P.: Automatic page segmentation without decompressing the run-length compressed text documents. arXiv preprint arXiv:2007.01142 (2020)
Fiergolla, S., Wolf, P.: Improving run length encoding by preprocessing. In: 2021 Data Compression Conference (DCC), pp. 341–341 (2021)
Shanmugasundaram, S., Lourdusamy, R.: A comparative study of text compression algorithms. Int. J. Wisdom Based Comput. 1(3), 68–76 (2011)
Welch, T.A.: A technique for high-performance data compression. Computer 17(06), 8–19 (1984)
Huffman, D.A.: A method for the construction of minimum-redundancy codes. Proc. IRE 40(9), 1098–1101 (1952)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Hossain, M.B., Rahman, M.N.J. (2023). An Empirical Analysis on Lossless Compression Techniques. In: Neri, F., Du, KL., Varadarajan, V., San-Blas, AA., Jiang, Z. (eds) Computer and Communication Engineering. CCCE 2023. Communications in Computer and Information Science, vol 1823. Springer, Cham. https://doi.org/10.1007/978-3-031-35299-7_13
Download citation
DOI: https://doi.org/10.1007/978-3-031-35299-7_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-35298-0
Online ISBN: 978-3-031-35299-7
eBook Packages: Computer ScienceComputer Science (R0)