Cross-view Geo-localization Based on Cross-domain Matching

Wu, Xiaokang; Ma, Qianguang; Li, Qi; Yu, Yuanlong; Liu, Wenxi

doi:10.1007/978-3-031-20738-9_81

Part of the book series: Lecture Notes on Data Engineering and Communications Technologies (LNDECT, volume 153)

Included in the following conference series:

The International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery

Abstract

As a recently emerging problem, cross-view geo-localization aims to find image pairs that are captured from different views (e.g., drone and satellite views) or domains but depict the same location, and it can be employed in a wide range of applications. However, unlike the traditional scene classification problem, it faces several challenges, including large intra-class distance and small inter-class distance caused by the domain gap, as well as redundant contextual information and visual distractors across views. To address these challenges, we propose a novel cross-domain matching framework for this task, which measures the similarity between query and candidate images from two different domains. Compared with prior classification-based frameworks, our matching-based framework is better suited to the task because it forces the model to learn discriminative features for scenes. Moreover, to aid cross-domain matching, we propose a matching-oriented feature modulation scheme, in which we not only apply a large-view attention module to enhance spatial features but also employ channel shuffling to loosen the correlation between key feature semantics and distractors in the respective domains. Finally, we conduct experiments showing that our model achieves state-of-the-art performance and surpasses competing methods by a large margin on public benchmarks.
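As a rough illustration of two ideas named in the abstract, the sketch below shows a ShuffleNet-style channel shuffle and a similarity-based ranking of candidates against a query. It is not the authors' implementation: the function names are hypothetical, features are plain Python lists rather than network activations, and cosine similarity is an assumption, since the abstract does not say which similarity measure the framework uses.

```python
import math


def channel_shuffle(channels, groups):
    """ShuffleNet-style shuffle of a list of per-channel values (illustrative)."""
    if len(channels) % groups != 0:
        raise ValueError("channel count must be divisible by groups")
    per_group = len(channels) // groups
    # View the channels as a (groups, per_group) grid, transpose it, and
    # flatten, so that neighbouring outputs come from different groups.
    return [channels[g * per_group + i]
            for i in range(per_group)
            for g in range(groups)]


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (assumed measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def rank_candidates(query, candidates):
    """Return candidate indices ordered from most to least similar to the query."""
    scores = [cosine_similarity(query, c) for c in candidates]
    return sorted(range(len(candidates)), key=lambda i: scores[i], reverse=True)
```

In a matching setup of this kind, the query might be a drone-view feature vector and the candidates satellite-view feature vectors, with the top-ranked candidate taken as the predicted location.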

Acknowledgements

This work was supported by the National Natural Science Foundation of China (NSFC) under grant 61873067 and by the University-Industry Cooperation Project of the Fujian Provincial Department of Science and Technology under grant 2020H6101.

Author information

Authors and Affiliations

College of Computer and Data Science, Fuzhou University, Fuzhou, China
Xiaokang Wu, Qianguang Ma, Qi Li, Yuanlong Yu & Wenxi Liu

Corresponding authors

Correspondence to Xiaokang Wu or Yuanlong Yu.

Editor information

Editors and Affiliations

Division of Intelligent Future Technologies, Mälardalen University, Västerås, Västmanlands Län, Sweden
Ning Xiong
Department of Electronic and Computer Engineering, Brunel University London, Uxbridge, Middlesex, UK
Maozhen Li
School of Information Science and Technology, Hunan University, Changsha, Hunan, China
Kenli Li
School of Information Science and Technology, Hunan University, Changsha, Hunan, China
Zheng Xiao
College of Computer and Data Science, Fuzhou University, Fuzhou, Fujian, China
Longlong Liao
School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore, Singapore
Lipo Wang

Cite this paper

Wu, X., Ma, Q., Li, Q., Yu, Y., Liu, W. (2023). Cross-view Geo-localization Based on Cross-domain Matching. In: Xiong, N., Li, M., Li, K., Xiao, Z., Liao, L., Wang, L. (eds) Advances in Natural Computation, Fuzzy Systems and Knowledge Discovery. ICNC-FSKD 2022. Lecture Notes on Data Engineering and Communications Technologies, vol 153. Springer, Cham. https://doi.org/10.1007/978-3-031-20738-9_81

DOI: https://doi.org/10.1007/978-3-031-20738-9_81
Published: 30 January 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-20737-2
Online ISBN: 978-3-031-20738-9
eBook Packages: Intelligent Technologies and Robotics (R0)
