An Empirical Analysis on Lossless Compression Techniques

Hossain, Mohammad Badrul; Rahman, Md. Nowroz Junaed

doi:10.1007/978-3-031-35299-7_13


Part of the book series: Communications in Computer and Information Science (CCIS, volume 1823)

Included in the following conference series:

International Conference on Computer and Communication Engineering


Abstract

Data compression is the process of representing source data in a more compact form. It minimizes data size by eliminating redundancy and removing excess information. Reducing the amount of data is usually advantageous because it consumes fewer resources overall, including bandwidth, processing, storage space, and time. Numerous compression algorithms exist for reducing the size of data in different formats, and even for a single data type many approaches are in use. This study explores three lossless compression techniques: Run Length Encoding, Lempel Ziv Welch (LZW), and Huffman Encoding. We found that LZW outperformed the other two in terms of compressed size, compression ratio, and space saving percentage, whereas Huffman Encoding performed best in terms of compression time. In the best case, LZW achieved a compressed size of 250992 bytes, a compression ratio of 5.0106, and a space saving percentage of 80.04%, while Huffman Encoding achieved a compression time of 32.28 ms.
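The abstract does not define its metrics, so the following is only a minimal sketch. It assumes the common definitions: compression ratio as original size divided by compressed size, and space saving as the fraction of the original size removed. The function names are hypothetical and do not come from the paper. Under these assumptions, the reported ratio of 5.0106 implies a space saving of about 80.04%, which matches the reported figure.

```python
def compression_ratio(original_size: int, compressed_size: int) -> float:
    """Original size divided by compressed size (assumed definition)."""
    return original_size / compressed_size


def space_saving_percent(original_size: int, compressed_size: int) -> float:
    """Percentage of the original size removed by compression (assumed definition)."""
    return (1 - compressed_size / original_size) * 100


def space_saving_from_ratio(ratio: float) -> float:
    """Space saving implied by a compression ratio, under the definitions above."""
    return (1 - 1 / ratio) * 100


# Figures reported in the abstract for LZW's best case.
reported_compressed_size = 250992  # bytes
reported_ratio = 5.0106

# The original file size is not stated in the abstract; this value is only an
# estimate inferred from the two reported figures.
estimated_original_size = round(reported_compressed_size * reported_ratio)

print(f"estimated original size: {estimated_original_size} bytes")
print(f"implied space saving: {space_saving_from_ratio(reported_ratio):.2f}%")  # 80.04%
```

Because the two metrics are functions of the same pair of sizes, either one determines the other; reporting both is redundant but makes the comparison between algorithms easier to read.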



Author information

Authors and Affiliations

BRAC University, 66 Mohakhali, Dhaka, Bangladesh
Mohammad Badrul Hossain & Md. Nowroz Junaed Rahman


Corresponding author

Correspondence to Mohammad Badrul Hossain.

Editor information

Editors and Affiliations

University of Naples Federico II, Naples, Italy
Filippo Neri
Concordia University, Montreal, QC, Canada
Ke-Lin Du
The University of New South Wales, Sydney, NSW, Australia
Vijayakumar Varadarajan
Miguel Hernández University of Elche, Elche, Spain
Angel-Antonio San-Blas
Northwestern Polytechnical University, Xi'an, China
Zhiyu Jiang

About this paper

Cite this paper

Hossain, M.B., Rahman, M.N.J. (2023). An Empirical Analysis on Lossless Compression Techniques. In: Neri, F., Du, KL., Varadarajan, V., San-Blas, AA., Jiang, Z. (eds) Computer and Communication Engineering. CCCE 2023. Communications in Computer and Information Science, vol 1823. Springer, Cham. https://doi.org/10.1007/978-3-031-35299-7_13


DOI: https://doi.org/10.1007/978-3-031-35299-7_13
Published: 14 June 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-35298-0
Online ISBN: 978-3-031-35299-7
eBook Packages: Computer Science, Computer Science (R0)
