Skip to main content

Coupled Squeeze-and-Excitation Blocks Based CNN for Image Compression

  • Conference paper
  • First Online:
Intelligence Science and Big Data Engineering. Visual Data Engineering (IScIDE 2019)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11935))

Abstract

Recent researches have shown that deep convolutional neural networks (CNN) have achieved promising results in the field of image compression. In this paper, we propose an end-to-end image compression framework based on effective attention modules. In the proposed method, two channel attention mechanisms are employed jointly. The first is the Squeeze-and-Excitation block (SEblock) in the encoder. The other is the novel inversed SEblock (ISEblock) placed in decoder. These blocks, named coupled SEblocks, are placed behind the convolutional layer in both encoder and decoder. By using SEblocks, the encoder learns the interdependencies between different channels and the feature maps can be better distributed after entropy coding. In decoder, the inversed SEblock is employed which adaptively learns the weights and divides weights between the channels to supplement information compressed from the encoder. The whole network is trained as a joint rate-distortion optimization by using a subset of the ImageNet dataset. We evaluate our method on public Kodak test set. At low bit rates, our approach outperforms the existing Ballè’s, JPEG, JPEG2000 and WebP on multi-scale structural similarity (MS-SSIM) and gets good visual qualities for all images at test set.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Wallace, G.K.: The jpeg still picture compression standard. IEEE Trans. Consum. Electron. 38(1), xviii–xxxiv (1992)

    Article  Google Scholar 

  2. Skodras, A., Christopoulos, C., Ebrahimi, T.: The jpeg 2000 still image compression standard. IEEE Signal Process. Mag. 18(5), 36–58 (2001)

    Article  Google Scholar 

  3. Google.: WebP: Compression techniques (2017). http://developers.google.com/speed/webp/docs/compression. Accessed 30 Jan 2017

  4. Foi, A., Katkovnik, V., Egiazarian, K.: Pointwise shape-adaptive DCT for high-quality denoising and deblocking of grayscale and color images. IEEE Trans. Image Process. 16(5), 1395–1441 (2007)

    Article  MathSciNet  Google Scholar 

  5. Zhang, X., Xiong, R., Ma, S., Gao, W.: Reducing blocking artifacts in compressed images via transform-domain non-local coefficients estimation. In: IEEE International Conference on Multimedia and Expo (ICME), pp. 836–841 (2012)

    Google Scholar 

  6. Zhang, X., Xiong, R., Fan, X., Ma, S., Gao, W.: Compression artifact reduction by overlapped-block transform coefficient estimation with block similarity. IEEE Trans. Image Process. 22(12), 4613–4626 (2013)

    Article  MathSciNet  Google Scholar 

  7. Zhang, X., Xiong, R., Zhao, G., Zhang, Y., Ma, S., Gao, W.: CONCOLOR: Constrained non-convex low-rank model for image deblocking. IEEE Trans. Image Process. 25(3), 1246–1259 (2016)

    Article  MathSciNet  Google Scholar 

  8. Dong, C., Loy, C.C., He, K., Tang, X.: Learning a deep convolutional network for image super-resolution. In: European Conference on Computer Vision (ECCV), pp. 184–199 (2014)

    Chapter  Google Scholar 

  9. Dong, C., Loy, C.C., He, K., Tang, X.: Compression artifacts reduction by a deep convolutional network. In: IEEE International Conference on Computer Vision (ICCV), pp. 576–584 (2015)

    Google Scholar 

  10. Toderici, G., et al.: Variable rate image compression with recurrent neural networks. In: International Conference on Learning Representations (ICLR). arXiv: 1511.06085 (2015)

    Google Scholar 

  11. Toderici, G., et al.: Full resolution image compression with recurrent neural networks. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5435–5443 (2017)

    Google Scholar 

  12. Ballé, J., Laparra, V., Simoncelli, E.P.: End-to-end optimized image compression. In: International Conference on Learning Representations (ICLR). arXiv: 1608.05148 (2016)

    Google Scholar 

  13. Theis, L., Shi, W., Cunningham, A., Huszár, F.: Lossy image compression with compressive autoencoders. In: International Conference on Learning Representations (ICLR). arXiv:1703.00395 (2017)

  14. Li, M., Zuo, W., Gu, S., Zhao, D., Zhang, D.: Learning convolutional networks for content-weighted image compression. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3214–3223 (2018)

    Google Scholar 

  15. Mentzer, F., Agustsson, E., Tschannen, M., Timofte, R., Van Gool, L.: Conditional probability models for deep image compression. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4394–4402 (2018)

    Google Scholar 

  16. Hu, J., Shen, L., Albanie, S., Sun, G., Wu, E.: Squeeze-and-excitation networks. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7132–7141 (2018)

    Google Scholar 

  17. Park, J., Woo, S., Lee, J.Y., Kweon, I.S.: BAM: Bottleneck Attention Module. In: The British Machine Vision Conference (BMVC). arXiv:1807.06514 (2018)

  18. Hinton, G. E.: Rectified linear units improve restricted boltzmann machines. In: International Conference on International Conference on Machine Learning (ICML), pp. 807–814 (2010)

    Google Scholar 

  19. Eastman Kodak.: Kodak Lossless True Color Image Suite (2012). http://r0k.us/graphics/kodak. Accessed Oct 2012

  20. Marpe, D., Schwarz, H., Wiegand, T.: Context-based adaptive binary arithmetic coding in the h.264/avc video compression standard. IEEE Trans. Circuits Syst. Video Technol. 13(7), 620–636 (2003)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yang Xu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Du, J., Xu, Y., Wei, Z. (2019). Coupled Squeeze-and-Excitation Blocks Based CNN for Image Compression. In: Cui, Z., Pan, J., Zhang, S., Xiao, L., Yang, J. (eds) Intelligence Science and Big Data Engineering. Visual Data Engineering. IScIDE 2019. Lecture Notes in Computer Science(), vol 11935. Springer, Cham. https://doi.org/10.1007/978-3-030-36189-1_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-36189-1_17

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-36188-4

  • Online ISBN: 978-3-030-36189-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics