Multi-view Deep Representations with Cross-Dataset Transfer for Remote Sensing Image Retrieval and Classification

Liu, Na; Wan, Lihong; Huang, Qiao; Ji, Yunfeng

doi:10.1007/s11042-020-08712-0

Multi-view Deep Representations with Cross-Dataset Transfer for Remote Sensing Image Retrieval and Classification

Published: 03 March 2020

Volume 80, pages 22891–22905, (2021)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Na Liu¹,
Lihong Wan²,
Qiao Huang³ &
…
Yunfeng Ji¹

379 Accesses
4 Citations
Explore all metrics

Abstract

Transfer learning is a challenging task in computer vision, due to the differences of data distribution. Although convolutional neural network (CNN) could learn different levels of image abstraction, the single-view features extracted from the final layer of a pre-trained or fine-tuned CNN may result in insufficient image description over different datasets. To address this issue, we focus on deep representations with multi-view analysis for remote sensing image (RSI) retrieval and classification tasks using five recently released large-scale datasets. First, cross-dataset transfer learning is presented by fine-tuning a pre-trained CNN on one dataset and testing the fine-tuned network on another one. Second, multi-view image representations are explored in terms of different activation vectors as well as CNNs. Finally, a multi-view fusion and a random projection (RP) strategy are proposed to improve the accuracies and computational cost of both RSI tasks, respectively. Extensive experiments are conducted to demonstrate the effectiveness of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multi-view feature learning for VHR remote sensing image classification

Article 23 April 2020

Incorporating Human Views into Unsupervised Deep Transfer Learning for Remote Sensing Image Retrieval

Cross-View Image Retrieval - Ground to Aerial Image Retrieval Through Deep Learning

References

Bingham E, Mannila H (2001) Random projection in dimensionality reduction: applications to image and text data[C]. In: Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, pp 245–250
Cao G, Iosifidis A, Gabbouj M et al (2019) Deep multi-view learning to rank[J]. IEEE Trans Knowl Data Eng
Cheng G, Han J, Lu X (2017) Remote sensing image scene classification: Benchmark and state of the art[J]. Proc IEEE 105(10):1865–1883
Article Google Scholar
Davis J, Goadrich M (2006) The relationship between Precision-Recall and ROC curves[C]. In: Proceedings of the 23rd international conference on Machine learning. ACM, pp 233–240
Jia Y, Shelhamer E, Donahue J et al (2014) Caffe: Convolutional architecture for fast feature embedding[C]. In: Proceedings of the 22nd ACM international conference on Multimedia. ACM, pp 675–678
Johnson WB, Lindenstrauss J (1984) Extensions of Lipschitz mappings into a Hilbert space[J]. Contemp Math 26(189-206):1
MathSciNet MATH Google Scholar
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks[C]. Advances in neural information processing systems, pp 1097–1105
Li H, Tao C, Wu Z et al (2017) Rsi-cb: A large scale remote sensing image classification benchmark via crowdsource data[J]. arXiv preprint arXiv 1705.10450
Li J, Zhang B, Lu G et al (2019) Generative multi-view and multi-feature learning for classification[J]. Information Fusion 45:215–226
Article Google Scholar
Liu N, Wan L, Zhang Y et al (2018) Exploiting convolutional neural networks with deeply local description for remote sensing image classification [J]. IEEE access 6:11215–11228
Article Google Scholar
Lu X, Ma C, Ni B et al (2018) Deep regression tracking with shrinkage loss[C]. Proceedings of the European Conference on Computer Vision (ECCV):353–369
Lu X, Guo Y, Liu N et al (2018) Non-convex joint bilateral guided depth upsampling[J]. Multimed Tools Appl 77(12):15521–15544
Article Google Scholar
Lu X, Ni B, Ma C et al (2019) Learning transform-aware attentive network for object tracking [J]. Neurocomputing 349:133–144
Article Google Scholar
Lu X, Wang W, Ma C et al (2019) See More, Know More: Unsupervised Video Object Segmentation with Co-Attention Siamese Networks[C]. Proc IEEE Conf Comput Vis Pattern Recognit:3623–3632
Martnez AM, Kak AC (2001) Pca versus lda[J]. IEEE Trans Pattern Anal Mach Intell 23(2):228–233
Article Google Scholar
Nogueira K, Penatti OAB, dos Santos JA (2017) Towards better exploiting convolutional neural networks for remote sensing scene classification [J]. Pattern Recogn 61:539–556
Article Google Scholar
Penatti OAB, Nogueira K, Dos Santos JA (2015) Do deep features generalize from everyday objects to remote sensing and aerial scenes domains?[C]. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 44–51
Google Scholar
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition[J]. arXiv preprint arXiv:1409.1556
Su H, Maji S, Kalogerakis E et al (2015) Multi-view convolutional neural networks for 3d shape recognition[C]. In: Proceedings of the IEEE international conference on computer vision, pp 945–953
Google Scholar
Sun S (2013) A survey of multi-view machine learning[J]. Neural Comput & Applic 23(7-8):2031–2038
Article Google Scholar
Vedaldi A, Lenc K (2015) Matconvnet: Convolutional neural networks for matlab[C]. In: Proceedings of the 23rd ACM international conference on Multimedia. ACM, pp 689–692
Xia GS, Hu J, Hu F et al (2017) AID: A benchmark data set for performance evaluation of aerial scene classification[J]. IEEE Trans Geosci Remote Sens 55(7):3965–3981
Article Google Scholar
Yang Y, Newsam S (2010) Bag-of-visual-words and spatial extensions for land-use classification[C]. In: Proceedings of the 18th SIGSPATIAL international conference on advances in geographic information systems. ACM, pp 270–279
Zhou W, Newsam S, Li C et al (2017) Learning low dimensional convolutional neural networks for high-resolution remote sensing image retrieval[J]. Remote Sens 9(5):489
Article Google Scholar
Zhou W, Newsam S, Li C et al (2018) PatternNet: A benchmark dataset for performance evaluation of remote sensing image retrieval [J]. ISPRS J Photogramm Remote Sens 145:197–209
Article Google Scholar
Zhou T, Zhang C, Gong C et al (2018) Multiview latent space learning with feature redundancy minimization[J]. IEEE transactions on cybernetics
Zhou T, Zhang C, Peng X et al (2019) Dual shared-specific multiview subspace clustering[J]. IEEE transactions on cybernetics
Zhou T, Liu M, Thung KH et al (2019) Latent representation learning for Alzheimer’s disease diagnosis with incomplete multi-modality neuroimaging and genetic data[J]. IEEE transactions on medical imaging 38(10):2411–2422
Article Google Scholar

Download references

Author information

Authors and Affiliations

Institute of machine intelligence, University of Shanghai for Science and Technology, Shanghai, China
Na Liu & Yunfeng Ji
Netease Inc., Hangzhou, China
Lihong Wan
Radiology Informatics Laboratory, Department of Radiology, Mayo Clinic, Rochester, MN, USA
Qiao Huang

Authors

Na Liu
View author publications
You can also search for this author in PubMed Google Scholar
Lihong Wan
View author publications
You can also search for this author in PubMed Google Scholar
Qiao Huang
View author publications
You can also search for this author in PubMed Google Scholar
Yunfeng Ji
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yunfeng Ji.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Liu, N., Wan, L., Huang, Q. et al. Multi-view Deep Representations with Cross-Dataset Transfer for Remote Sensing Image Retrieval and Classification. Multimed Tools Appl 80, 22891–22905 (2021). https://doi.org/10.1007/s11042-020-08712-0

Download citation

Received: 24 July 2019
Revised: 18 November 2019
Accepted: 28 January 2020
Published: 03 March 2020
Issue Date: June 2021
DOI: https://doi.org/10.1007/s11042-020-08712-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multi-view Deep Representations with Cross-Dataset Transfer for Remote Sensing Image Retrieval and Classification

Abstract

Access this article

Similar content being viewed by others

Multi-view feature learning for VHR remote sensing image classification

Incorporating Human Views into Unsupervised Deep Transfer Learning for Remote Sensing Image Retrieval

Cross-View Image Retrieval - Ground to Aerial Image Retrieval Through Deep Learning

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Multi-view Deep Representations with Cross-Dataset Transfer for Remote Sensing Image Retrieval and Classification

Abstract

Access this article

Similar content being viewed by others

Multi-view feature learning for VHR remote sensing image classification

Incorporating Human Views into Unsupervised Deep Transfer Learning for Remote Sensing Image Retrieval

Cross-View Image Retrieval - Ground to Aerial Image Retrieval Through Deep Learning

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation