Image Classification via Multi-branch Position Attention Network

  • Conference paper
Pattern Recognition and Artificial Intelligence (ICPRAI 2022)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13363))

Abstract

Image classification is an active research topic in pattern recognition and artificial intelligence. When inter-class similarity and intra-class diversity are pronounced, as in remote sensing, image classification becomes very challenging. The continuous development of convolutional neural networks has brought a major breakthrough in image classification. Although good performance has been achieved, there is still room for improvement. First, in addition to global information, local features are crucial to image classification. Second, minimizing the distance between samples of the same class and maximizing the distance between samples of different classes allows the key points in image classification to be given full attention. In this paper, we propose an image classification method named multi-branch position attention network (MBPANet). We design a channel attention module containing position information, called the Position Channel Attention Module (PCAM), and combine it with a spatial attention module, the Local Spatial Attention Module (LSAM), to form a new attention module, the Position Spatial Attention Module (PSAM). The attention-weighted features capture both local neighborhood semantic information and global semantic information. Extensive experiments on three benchmark datasets show that our approach outperforms state-of-the-art methods.
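
To make the idea of attention weighting over channels concrete, the following is a minimal sketch of a generic squeeze-and-excitation style channel attention in plain Python. It is not the PCAM described above: it contains no position information, and the function name `channel_attention`, the weight matrices `w_reduce` and `w_expand`, and the toy input are all assumptions chosen for illustration.

```python
import math


def sigmoid(x):
    """Logistic function mapping a score to a weight in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def matvec(matrix, vector):
    """Multiply a nested-list matrix by a list vector."""
    return [sum(w * v for w, v in zip(row, vector)) for row in matrix]


def channel_attention(features, w_reduce, w_expand):
    """Generic channel attention (illustrative, not the paper's PCAM).

    features: list of C channels, each an H x W nested list of floats.
    w_reduce: hypothetical (C/r x C) weights of the bottleneck layer.
    w_expand: hypothetical (C x C/r) weights restoring C channel scores.
    """
    # Squeeze: global average pooling yields one descriptor per channel.
    squeezed = [
        sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in features
    ]
    # Excitation: bottleneck with ReLU, then a sigmoid gate per channel.
    hidden = [max(0.0, h) for h in matvec(w_reduce, squeezed)]
    weights = [sigmoid(s) for s in matvec(w_expand, hidden)]
    # Reweighting: scale every value in a channel by that channel's weight.
    reweighted = [
        [[w * v for v in row] for row in ch] for ch, w in zip(features, weights)
    ]
    return reweighted, weights


# Toy input: two 2 x 2 channels, one active and one silent.
features = [
    [[1.0, 1.0], [1.0, 1.0]],
    [[0.0, 0.0], [0.0, 0.0]],
]
w_reduce = [[1.0, 0.0]]      # assumed weights, 2 channels -> 1 hidden unit
w_expand = [[1.0], [-1.0]]   # assumed weights, 1 hidden unit -> 2 channels

reweighted, weights = channel_attention(features, w_reduce, w_expand)
print(weights)
```

With these assumed weights the active channel receives a gate above 0.5 and the silent channel a gate below 0.5, which is the sense in which attention weighting emphasizes informative channels. In practice such a module would be written in a deep learning framework and its weights learned end to end.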

Supported by Science and Technology Project of SGCC (No. 5500-202140127A).

Author information

Corresponding author

Correspondence to Si-Bao Chen.

Copyright information

© 2022 Springer Nature Switzerland AG

About this paper

Cite this paper

Zhang, K., Yang, J., Yuan, K., Wei, QS., Chen, SB. (2022). Image Classification via Multi-branch Position Attention Network. In: El Yacoubi, M., Granger, E., Yuen, P.C., Pal, U., Vincent, N. (eds) Pattern Recognition and Artificial Intelligence. ICPRAI 2022. Lecture Notes in Computer Science, vol 13363. Springer, Cham. https://doi.org/10.1007/978-3-031-09037-0_9

  • DOI: https://doi.org/10.1007/978-3-031-09037-0_9

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-09036-3

  • Online ISBN: 978-3-031-09037-0

  • eBook Packages: Computer Science, Computer Science (R0)
