Abstract
The proliferation of remote sensing image data in recent years has posed a pressing need for efficient compression techniques due to constrained transmission bandwidth. While lossless compression preserves image fidelity, it falls short of meeting real-time demands. Conversely, conventional lossy compression methods can attain high compression ratios for real-time applications, but often introduce issues like block artifacts, blurring, and distortions in the decompressed images. Hence, we propose the Global Feature-Assisted Segmentation Compression Network (GFSCompNet) as a solution for high compression ratio lossy compression. Initially, we design a segmentation network utilizing a dual-branch global feature-assisted segmentation approach to precisely detect small targets in remote sensing images. On the compression side of the network, we leverage an attention mechanism and code rate allocation technique to seamlessly merge the segmented small target information with the original image, thereby allocating a higher compression code rate to the small target region. Furthermore, a joint hyper-priority decoding and entropy coding estimation network is proposed to further remove the redundancy in the potential representation and improve the compression ratio. Experimental results conducted under conditions of high compression ratios and comparable bit rates demonstrate that our approach yields higher-quality reconstructed images compared to the JPEG algorithm and outperforms other deep learning-based image compression methods. Additionally, it effectively preserves small target information, thereby enhancing the interpretability of machine learning models.
Similar content being viewed by others
Data Availability
This study uses the publicly available iSAID dataset which can be accessed at https://captain-whu.github.io/iSAID/dataset.html.
References
Linde Y, Buzo A, Gray R (1980) An algorithm for vector quantizer design. IEEE Trans Commun 28(1):84–95. https://doi.org/10.1109/TCOM.1980.1094577
Ahalt SC, Krishnamurthy AK, Chen P et al (1990) Competitive learning algorithms for vector quantization. Neural Netw 3(3):277–290. https://doi.org/10.1016/0893-6080(90)90071-R
Amrani N, Serra-Sagristá J, Laparra V et al (2016) Regression wavelet analysis for lossless coding of remote-sensing data. IEEE Trans Geosci Remote Sens 54(9):5616–5627. https://doi.org/10.1109/TGRS.2016.2569485
Cheng X, Li Z (2021) Predicting the lossless compression ratio of remote sensing images with configurational entropy. IEEE J Sel Top Appl Earth Obs Remote Sens 14:11936–11953. https://doi.org/10.1109/JSTARS.2021.3123650
Selwin Mich Priyadharson A, Thilipkumar C, Reddy LMK (2023) Sentinel-2 satellite image enhancement and compression based on dwt and vector quantization. Artificial intelligence and machine learning in satellite data processing and services. Springer Nature Singapore, Singapore, pp 1–7
Zhou S, Deng C, Zhao B, et al (2015) Remote sensing image compression: A review. In: 2015 IEEE International conference on multimedia big data, pp 406–410, https://doi.org/10.1109/BigMM.2015.16
Xiang S, Liang Q (2023) Remote sensing image compression with long-range convolution and improved non-local attention model. Signal Process 209:109005
Fu C, Du B, Zhang L (2023) Sar image compression based on multi-resblock and global context. EEE Geosci Remote Sens Lett 20:1–5
Gao J, Teng Q, He X, et al (2023) Mixed entropy model enhanced residual attention network for remote sensing image compression. Neural Processing Letters pp 1–13
Fu H, Liang F (2023) Learned image compression with generalized octave convolution and cross-resolution parameter estimation. Signal Process 202:108778
Wallace GK (1992) The jpeg still picture compression standard. IEEE Trans Consum Electron 38(1):xviii–xxxiv
Ballé J, Minnen D, Singh S, et al (2018) Variational image compression with a scale hyperprior. In: International conference on learning representations
Minnen D, Ballé J, Toderici GD (2018) Joint autoregressive and hierarchical priors for learned image compression. Advances in Neural Information Processing Systems 31
Cheng Z, Sun H, Takeuchi M, et al (2020) Learned image compression with discretized gaussian mixture likelihoods and attention modules. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 7939–7948
Aggarwal K, Mijwil MM, Sonia, et al. (2022) Has the future started? the current growth of artificial intelligence, machine learning, and deep learning. Iraqi J Comput Sci Math 3(1):115–123. https://doi.org/10.52866/ijcsm.2022.01.01.013
Lijun Y, Mengbo L, Tongxin W et al (2023) Geo-information mapping improves canny edge detection method. IET Image Process 17(6):1893–1904
Majhi A, Sethy KM, Panda M (2022) Machine learning approach for change detection of chandaka wildlife sanctuary with the help of remote sensing data. Intell Syst Proc ICMIB 2021:523–535
Zhang W, Fu C, Chang X et al (2022) A more compact object detector head network with feature enhancement and relational reasoning. Neurocomputing 499:23–34
Zhang W, Fu C, Xie H et al (2021) Global context aware rcnn for object detection. Neural Comput Appl 33:11627–11639
Zhang W, Fu C, Cao L et al (2022) Codh++: Macro-semantic differences oriented instance segmentation network. Expert Syst Appl 202:117198
Zhang W, Fu C, Zheng Y et al (2022) Hsnet: A hybrid semantic network for polyp segmentation. Comput Biol Med 150:106173
Minaee S, Boykov Y, Porikli F et al (2022) Image segmentation using deep learning: A survey. IEEE Trans Pattern Anal Mach Intell 44(7):3523–3542. https://doi.org/10.1109/TPAMI.2021.3059968
Gao H, Xiao J, Yin Y, et al (2022) A mutually supervised graph attention network for few-shot segmentation: The perspective of fully utilizing limited samples. IEEE Transactions on Neural Networks and Learning Systems pp 1–13. https://doi.org/10.1109/TNNLS.2022.3155486
Hamaguchi R, Fujita A, Nemoto K, et al (2018) Effective use of dilated convolutions for segmenting small object instances in remote sensing imagery. In: 2018 IEEE winter conference on applications of computer vision (WACV), IEEE, pp 1442–1450
Kampffmeyer M, Salberg AB, Jenssen R (2016) Semantic segmentation of small objects and modeling of uncertainty in urban remote sensing images using deep convolutional neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 1–9
Wang H, Zhou L, Wang L (2019) Miss detection vs. false alarm: Adversarial learning for small object segmentation in infrared images. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 8509–8518
Guo D, Zhu L, Lu Y et al (2018) Small object sensitive segmentation of urban street scene with spatial adjacency between object classes. IEEE Trans Image Process 28(6):2643–2653
Dong R, Pan X, Li F (2019) Denseu-net-based semantic segmentation of small objects in urban remote sensing images. IEEE Access 7:65347–65356
Zheng Z, Zhong Y, Wang J, et al (2020) Foreground-aware relation network for geospatial object segmentation in high spatial resolution remote sensing imagery. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 4096–4105
Cheng S, Chen H, Yao P et al (2023) Mlae: A pretraining method for automatic identification of urban public space. IEEE Trans Geosci Remote Sens Lett 20:1–5. https://doi.org/10.1109/LGRS.2023.3315687
Rahebi J (2022) Vector quantization using whale optimization algorithm for digital image compression. Multimed Tools Appl 81(14):20077–20103. https://doi.org/10.1007/s11042-022-11952-x
Yuen M, Wu H (1998) A survey of hybrid mc/dpcm/dct video coding distortions. Signal Processing 70(3):247–278. https://doi.org/10.1016/S0165-1684(98)00128-5
Bajpai S (2023) Low complexity image coding technique for hyperspectral image sensors. Multimedia Tools and Appl 82(20):31233–31258. https://doi.org/10.1007/s11042-023-14738-x
Toderici G, O’Malley SM, Hwang SJ, et al (2015) Variable rate image compression with recurrent neural networks. arXiv preprint arXiv:1511.06085
Toderici G, Vincent D, Johnston N, et al (2017) Full resolution image compression with recurrent neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5306–5314
Agustsson E, Tschannen M, Mentzer F, et al (2019) Generative adversarial networks for extreme learned image compression. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 221–231
Zhao S, Yang S, Gu J et al (2021) Symmetrical lattice generative adversarial network for remote sensing images compression. ISPRS J Photogramm Remote Sens 176:169–181
Ballé J, Laparra V, Simoncelli EP (2017) End-to-end optimized image compression. In: Int’l conf on learning representations (ICLR), Toulon, France
Ballé J, Laparra V, Simoncelli EP (2015) Density modeling of images using a generalized normalization transformation. arXiv preprint arXiv:1511.06281
Jamil S, Piran MJ, Rahman M et al (2023) Learning-driven lossy image compression: A comprehensive survey. Eng Appl Artif Intell 123:106361
Li M, Zuo W, Gu S, et al (2018) Learning convolutional networks for content-weighted image compression. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3214–3223
Xia Q, Liu H, Ma Z (2020) Object-based image coding: A learning-driven revisit. In: 2020 IEEE international conference on multimedia and expo (ICME), IEEE, pp 1–6
Ding L, Lin D, Lin S et al (2022) Looking outside the window: Wide-context transformer for the semantic segmentation of high-resolution remote sensing images. IEEE Trans Geosci Remote Sens 60:1–13. https://doi.org/10.1109/TGRS.2022.3168697
Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: 2018 IEEE/CVF conference on computer vision and pattern recognition, pp 7132–7141, https://doi.org/10.1109/CVPR.2018.00745
He X, Zhou Y, Zhao J et al (2022) Swin transformer embedding unet for remote sensing image semantic segmentation. IEEE Trans Geosci Remote Sens 60:1–15
Zan Z, Liu C, Sun H et al (2021) Learned image compression with separate hyperprior decoders. IEEE Open J Circuits Syst 2:627–632. https://doi.org/10.1109/OJCAS.2021.3125354
Chen T, Liu H, Ma Z et al (2021) End-to-end learnt image compression via non-local attention optimization and improved context modeling. IEEE Transactions on Image Processing 30:3179–3191. https://doi.org/10.1109/TIP.2021.3058615
Waqas Zamir S, Arora A, Gupta A, et al (2019) isaid: A large-scale dataset for instance segmentation in aerial images. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops, pp 28–37
Xia GS, Bai X, Ding J, et al (2018) Dota: A large-scale dataset for object detection in aerial images. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3974–3983
Chen LC, Papandreou G, Schroff F, et al (2017) Rethinking atrous convolution for semantic image segmentation. arXiv:1706.05587
Chen LC, Zhu Y, Papandreou G, et al (2018) Encoder-decoder with atrous separable convolution for semantic image segmentation. In: Proceedings of the European conference on computer vision (ECCV), pp 801–818
Zhao H, Shi J, Qi X, et al (2017) Pyramid scene parsing network. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2881–2890
Fu J, Liu J, Tian H, et al (2019) Dual attention network for scene segmentation. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 3146–3154
Sun K, Xiao B, Liu D, et al (2019) Deep high-resolution representation learning for human pose estimation. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 5693–5703
Acknowledgements
This work was supported by the Fundamental Research Funds for the Central Universities of China (No. N2216010), the ‘Jie Bang Gua Shuai’ Science and Technology Major Project of Liaoning Province in 2022 (No.2022JH1/10400025) and the National Key Research and Development Program of China (No. 2018YFB1702000).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing of interest
The authors have no competing interests to declare that are relevant to the content of this article.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Ye, W., Lei, W., Zhang, W. et al. GFSCompNet: remote sensing image compression network based on global feature-assisted segmentation. Multimed Tools Appl (2024). https://doi.org/10.1007/s11042-024-18260-6
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11042-024-18260-6