Skip to main content

An Empirical Analysis on Lossless Compression Techniques

  • Conference paper
  • First Online:
Computer and Communication Engineering (CCCE 2023)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1823))

Included in the following conference series:

  • 119 Accesses

Abstract

A method of presenting source data into it’s compact form is known as data compression. In this process, data size is minimized, redundancy is eliminated, and excess information is gotten rid of. A reduction in actual data is usually advantageous because it uses less resources overall, including bandwidth, processing, space, time, and many others. There are numerous compression algorithms for reducing the size of data of different formats. Even for compressing a particular data type, many approaches are being used. The proposed research has explored three of the lossless compression techniques which are: Run Length Encoding, Lempel Ziv Welch, and Huffman Encoding algorithms. We found out that based on compression size, compression ratio, and space saving percentage, Lempel Ziv Welch outperformed the other two. In contrast, Huffman Encoding performed better than the other two based on compression time. In the best case, LZW got a compression size of 250992 bytes, a compression ratio of 5.0106, and a space saving percentage of 80.04% while Huffman encoding got a compression time of 32.28 ms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 64.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 84.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Dobre, C., Xhafa, F.: Intelligent services for big data science. Futur. Gener. Comput. Syst. 37, 267–281 (2014)

    Article  Google Scholar 

  2. Sivarajah, U., Kamal, M.M., Irani, Z., Weerakkody, V.: Critical analysis of big data challenges and analytical methods. J. Bus. Res. 70, 263–286 (2017)

    Google Scholar 

  3. Kodituwakku, S.R., Amarasinghe, U.S.: Comparison of lossless data compression algorithms for text data. Indian J. Comput. Sci. Eng. 1(4), 416–425 (2010)

    Google Scholar 

  4. Porwal, S., Chaudhary, Y., Joshi, J., Jain, M., et al.: Data compression methodologies for lossless data and comparison between algorithms. Int. J. Eng. Sci. Innov. Technol. (IJESIT) 2, 142–147 (2013)

    Google Scholar 

  5. Arnold, R., Bell, T.: A corpus for the evaluation of lossless compression algorithms. In: Proceedings DCC ’97. Data Compression Conference, pp. 201–210 (1997)

    Google Scholar 

  6. Brisaboa, N.R., Cánovas, R., Claude, F., Martínez-Prieto, M.A., Navarro, G.: Compressed string dictionaries. In: Pardalos, P.M., Rebennack, S. (eds.) SEA 2011. LNCS, vol. 6630, pp. 136–147. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-20662-7_12

    Chapter  Google Scholar 

  7. Brisaboa, N.R., Cerdeira-Pena, A., de Bernardo, G., Fariña, A., Navarro, G.: Space/time-efficient RDF stores based on circular suffix sorting. J. Supercomput. 1–41 (2022)

    Google Scholar 

  8. Kempa, D., Prezza, N.: At the roots of dictionary compression: string attractors. In: Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing, pp. 827–840 (2018)

    Google Scholar 

  9. Nishimoto, T., Tabei, Y.: LZRR: LZ77 parsing with right reference. In: 2019 Data Compression Conference (DCC), pp. 211–220 (2019)

    Google Scholar 

  10. Platos, J., Dvorský, J.: Word-based text compression. CoRR, abs/0804.3680 (2008)

    Google Scholar 

  11. Javed, M., Nagabhushan, P.: Automatic page segmentation without decompressing the run-length compressed text documents. arXiv preprint arXiv:2007.01142 (2020)

  12. Fiergolla, S., Wolf, P.: Improving run length encoding by preprocessing. In: 2021 Data Compression Conference (DCC), pp. 341–341 (2021)

    Google Scholar 

  13. Shanmugasundaram, S., Lourdusamy, R.: A comparative study of text compression algorithms. Int. J. Wisdom Based Comput. 1(3), 68–76 (2011)

    Google Scholar 

  14. Welch, T.A.: A technique for high-performance data compression. Computer 17(06), 8–19 (1984)

    Article  Google Scholar 

  15. Huffman, D.A.: A method for the construction of minimum-redundancy codes. Proc. IRE 40(9), 1098–1101 (1952)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mohammad Badrul Hossain .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Hossain, M.B., Rahman, M.N.J. (2023). An Empirical Analysis on Lossless Compression Techniques. In: Neri, F., Du, KL., Varadarajan, V., San-Blas, AA., Jiang, Z. (eds) Computer and Communication Engineering. CCCE 2023. Communications in Computer and Information Science, vol 1823. Springer, Cham. https://doi.org/10.1007/978-3-031-35299-7_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-35299-7_13

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-35298-0

  • Online ISBN: 978-3-031-35299-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics