An Image Compression Approach Based on Convolutional AutoEncoder

Jannani, Oussama; Idrissi, Najlae; Chakib, Houda

doi:10.1007/978-3-031-46584-0_7

Oussama Jannani¹⁵,
Najlae Idrissi¹⁵ &
Houda Chakib¹⁵

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 806))

Included in the following conference series:

International Conference on Artificial Intelligence and Green Computing

94 Accesses

Abstract

In recent years, neural networks have demonstrated their robustness and effectiveness in many fields mainly in computer vision tasks. Hence, recently researchers in image processing domain exploit neural networks, powerful ability of representing data, to develop different compression schemes. These schemes yield images with great visual quality and high compression ratio. In this paper, we propose a compression scheme that takes advantages of convolutional Auto-Encoder (CAE) to improve image compression performance. First, an RGB image is converted to the luminance/chrominance space YCbCr then the luminance component Y is compressed using an auto-encoder based convolutional neural network (CNN) whereas the chrominance components CbCr are sub-scaled. Different parameters used to evaluate the efficiency of our proposed method are: Mean Square Error (MSE), Peak Signal to Noise Ratio (PSNR) and Multi Scale Structural Similarity Index Measure (MS-SSIM). The results obtained show the effectiveness of our proposed approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 139.00; Price excludes VAT (USA)

Softcover Book: USD 179.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Ballé, J., Laparra, V., Simoncelli, E.P.: End-to-end optimization of nonlinear transform codes for perceptual quality. In: 2016 Picture Coding Symposium (PCS), pp. 1–5. IEEE (2016)
Google Scholar
Ballé, J., Laparra, V., Simoncelli, E.P.: End-to-end optimized image compression (2016). arXiv:1611.01704
Ballé, J., Minnen, D., Singh, S., Hwang, S.J., Johnston, N.: Variational image compression with a scale hyperprior (2018). arXiv:1802.01436
Bellard, F.: Bpg image format (2020). https://bellard.org/bpg/
van den Branden Lambrecht, C.J.: Vision Models and Applications to Image and Video Processing. Springer Science & Business Media (2001)
Google Scholar
Chakib, H., Minaoui, B., Fakir, M., Salhi, A., Badi, I.: A proposed approach for image compression based on wavelet transform and neural network. Int. J. Adv. Comput. Sci. Appl. 8(9) (2017)
Google Scholar
Clevert, D.A., Unterthiner, T., Hochreiter, S.: Fast and accurate deep network learning by exponential linear units (elus) (2015). arXiv:1511.07289
Dosovitskiy, A., Brox, T.: Generating images with perceptual similarity metrics based on deep networks. In: NIPS (2016)
Google Scholar
Dumoulin, V., Visin, F.: A guide to convolution arithmetic for deep learning (2016). arXiv:1603.07285
Franzen, R.: Kodak lossless true color image suite, 4(2) (1999). http://r0k.us/graphics/kodak
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778 (2015)
Google Scholar
Hinton, G.E., Salakhutdinov, R.R.: Reducing the dimensionality of data with neural networks. Science 313(5786), 504–507 (2006)
Google Scholar
Huffman, D.A.: A method for the construction of minimum-redundancy codes. Proc. IRE 40(9), 1098–1101 (1952)
Article MATH Google Scholar
IBRAHIM, M.S.: A neural network approach for block coding image compression. In: The International Conference on Electrical Engineering, vol. 2, pp. 278–290. Military Technical College (1999)
Google Scholar
Jiang, J.: Image compression with neural networks-a survey. Signal Process. Image Commun. 14(9), 737–760 (1999)
Article Google Scholar
Kingma, D.P., Ba, J.: Adam: A method for stochastic optimization (2014). arXiv:1412.6980
Kothari, S.C., Oh, H.: Neural networks for pattern recognition. Adv. Comput. 37, 119–166 (1993)
Article Google Scholar
Kramer, M.A.: Nonlinear principal component analysis using auto associative neural networks. AIChE J. 37(2), 233–243 (1991)
Article Google Scholar
Krizhevsky, A.: Learning multiple layers of features from tiny images (2009)
Google Scholar
Krizhevsky, A., Hinton, G.E.: Using very deep autoencoders for content-based image retrieval. In: ESANN, vol. 1, p. 2. Citeseer (2011)
Google Scholar
Marcellin, M.W., Gormish, M.J., Bilgin, A., Boliek, M.P.: An overview of jpeg-2000. In: Proceedings DCC 2000. Data Compression Conference, pp. 523–541 (2000)
Google Scholar
Skodras, A., Christopoulos, C., Ebrahimi, T.: The jpeg 2000 still image compression standard. IEEE Signal Process. Mag. 18(5), 36–58 (2001)
Article MATH Google Scholar
Stevens, J.C., Stevens, S.S.: Brightness function: Effects of adaptation. JOSA 53(3), 375–385 (1963)
Article Google Scholar
Taubman, D.S., Marcellin, M.W.: Jpeg2000-image compression fundamentals, standards and practice. In: The Kluwer International Series in Engineering and Computer Science (2002)
Google Scholar
Theis, L., Shi, W., Cunningham, A., Huszár, F.: Lossy image compression with compressive autoencoders (2017). arXiv:1703.00395
Toderici, G., O’Malley, S.M., Hwang, S.J., Vincent, D., Minnen, D., Baluja, S., Covell, M., Sukthankar, R.: Variable rate image compression with recurrent neural networks (2015). arXiv:1511.06085
Toderici, G., Vincent, D., Johnston, N., Jin Hwang, S., Minnen, D., Shor, J., Covell, M.: Full resolution image compression with recurrent neural networks. In: Proceedings of the IEEE conference on Computer Vision and Pattern Recognition, pp. 5306–5314 (2017)
Google Scholar
Tschannen, M., Agustsson, E., Lucic, M.: Deep generative models for distribution-preserving lossy compression. Adv. Neural Inf. Process. Syst. 31 (2018)
Google Scholar
Union, I.T.: Recommendation itu-r bt. 601–607 (2011)
Google Scholar
Vincent, P., Larochelle, H., Bengio, Y., Manzagol, P.A.: Extracting and composing robust features with denoising autoencoders. In: Proceedings of the 25th International Conference on Machine Learning, pp. 1096–1103 (2008)
Google Scholar
Wandell, B.A.: Foundations of Vision. Sinauer Associates (1995)
Google Scholar
Wang, W., Huang, Y., Wang, Y., Wang, L.: Generalized autoencoder: A neural network framework for dimensionality reduction. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 496–503 (2014)
Google Scholar
Wang, Z., Simoncelli, E.P., Bovik, A.C.: Multiscale structural similarity for image quality assessment. In: The Thirty-Seventh Asilomar Conference on Signals, Systems & Computers, 2003, vol. 2, pp. 1398–1402. IEEE (2003)
Google Scholar
Yang, F., Herranz, L., Van De Weijer, J., Guitián, J.A.I., López, A.M., Mozerov, M.G.: Variable rate deep image compression with modulated autoencoder. IEEE Signal Process. Lett. 27, 331–335 (2020)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Data Science For Sustainable Earth Laboratory (Data4Earth), Faculty of Sciences and Technics, Sultan Moulay Slimane University, Beni Mellal, BP 325, Morocco
Oussama Jannani, Najlae Idrissi & Houda Chakib

Authors

Oussama Jannani
View author publications
You can also search for this author in PubMed Google Scholar
Najlae Idrissi
View author publications
You can also search for this author in PubMed Google Scholar
Houda Chakib
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Oussama Jannani .

Editor information

Editors and Affiliations

Data Science for Sustainable Earth Laboratory, Faculty of Sciences and Techniques, Sultan Moulay Slimane University, Beni Mellal, Morocco
Najlae Idrissi
Data Science for Sustainable Earth Laboratory, Faculty of Sciences and Techniques, Sultan Moulay Slimane University, Beni Mellal, Morocco
Abdellatif Hair
ENSIAS, Mohammed V University, Rabat, Morocco
Mohamed Lazaar
Data Science for Sustainable Earth Laboratory, Faculty of Sciences and Techniques, Sultan Moulay Slimane University, Beni Mellal, Morocco
Youssef Saadi
Data Science for Sustainable Earth Laboratory, Faculty of Sciences and Techniques, Sultan Moulay Slimane University, Beni Mellal, Morocco
Mohammed Erritali
Faculty of Sciences and Techniques, Hassan First University, Settat, Morocco
Said El Kafhali

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jannani, O., Idrissi, N., Chakib, H. (2023). An Image Compression Approach Based on Convolutional AutoEncoder. In: Idrissi, N., Hair, A., Lazaar, M., Saadi, Y., Erritali, M., El Kafhali, S. (eds) Artificial Intelligence and Green Computing. ICAIGC 2023. Lecture Notes in Networks and Systems, vol 806. Springer, Cham. https://doi.org/10.1007/978-3-031-46584-0_7

Download citation

DOI: https://doi.org/10.1007/978-3-031-46584-0_7
Published: 04 November 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-46583-3
Online ISBN: 978-3-031-46584-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics