Skip to main content

Target Detection Algorithm of Forward-Looking Sonar Based on Swin Transformer

  • Conference paper
  • First Online:
International Conference on Cloud Computing and Computer Networks (CCCN 2023)

Abstract

In recent years, with the deepening of the exploitation of marine resources, the demand for marine environment exploration is also increasing. However, due to the complex and changeable marine environment, the exploration risk of underwater manned spacecraft is relatively high. Autonomous underwater vehicle (AUV) has become an essential tool to explore and exploit marine resources. During the execution of underwater tasks, AUVs often encounter collision and entanglement of various obstacles, which will pose a fatal threat to AUVs. Forward-looking sonar (FLS) is one of the main sensors used by AUVs to detect underwater targets. By continuously transmitting sound waves and receiving echo signals, AUVs equipped with multi-beam forward-looking sonar can generate real-time visual field acoustic images, and then detect the images with target detection algorithms to solve the above problems. However, the visual acoustic image has the characteristics of more texture feature information and less semantic feature information, and it requires real-time reasoning. Some existing classical target detection algorithms are mostly designed for some data scenes with strong semantic meaning, and cannot be well adapted to the visual acoustic image. Therefore, a new target detection algorithm, Swin_FLS, is proposed in this chapter. The improved Swin_FLS can more fully extract the texture features in the forward-looking sonar image and adapt to the characteristics of the forward-looking sonar image data set. Finally, 85.2 mAP and 14.1 FPS are obtained in the test set. It surpasses the accuracy of some classical algorithms directly applied to the forward-looking sonar image data set.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Clark D, Ruiz I T, Petillot Y, et al. Particle PHD filter multiple target tracking in sonar image[J]. IEEE Transactions on Aerospace and Electronic Systems, 2007, 43(1):409–416.

    Article  Google Scholar 

  2. Galceran E., Djapic V., Carreras M., Williams D.P. A real-time underwater object detection algorithm for multi-beam forward looking sonar. IFAC Proceedings Volumes, 2012, 45 (5): 306–311.

    Google Scholar 

  3. Valdenegro-Toro M. Submerged Marine Debris Detection with Autonomous Underwater Vehi-cles[C]. IEEE International Conference on Robotics and Automation for Humanitarian Applications, Kollam, India, 2016.

    Google Scholar 

  4. Kim B, Yu S C. Imaging sonar based real-time underwater object detection utilizing AdaBoost method[C]. IEEE Underwater Technology, Busan, South Korea, 2017.

    Google Scholar 

  5. Matheus D S, Pedro R, Pedro N, et al. Object Classification in Semi Structured Enviroment Using Forward-Looking Sonar[J]. Sensors, 2017, 17(10).

    Google Scholar 

  6. Ye X, Sun Y, Li C. FCN and Siamese Network for Small Target Tracking in Forward-looking Sonar Images[C]. MTS/IEEE OCEANS, Charleston, SC, USA, 2018

    Book  Google Scholar 

  7. Jin L, Liang H, Yang C. Accurate Underwater ATR in Forward-Looking Sonar Imagery Using Deep Convolutional Neural Networks[J]. IEEE Access, 2019, 7:125522–125531

    Article  Google Scholar 

  8. Kvasic I, Miskovic N, Vukic Z. Convolutional Neural Network Architectures for Sonar-Based Diver Detection and Tracking[C]. IEEE OCEANS, Marseille, France, 2019.

    Book  Google Scholar 

  9. Zacchini L, Franchi M, Manzari V, Pagliai M, Ridolfi A. Forward-Looking Sonar CNN-based Automatic Target Recognition: an experimental campaign with FeelHippo AUV[C]. IEEE Conference on Autonomous Underwater Vehicles Symposium (AUV), St. Johns, NL, Canada, 2020.

    Google Scholar 

  10. A. Yueyuan and W. Hong, “Swin Transformer Combined with Convolutional Encoder For Cephalometric Landmarks Detection,” 2021 18th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP), Chengdu, China, 2021, pp. 184–187, doi: https://doi.org/10.1109/ICCWAMTIP53232.2021.9674147.

  11. Russell B C, Torralba A, Murphy K P, et al. LabelMe: a database and web-based tool for image annotation[J]. International journal of computer vision, 2008, 77(1–3): 157–173

    Article  Google Scholar 

  12. H. Dong, L. Zhang and B. Zou, “Densely Connected Convolutional Neural Network Based Polarimetric SAR Image Classification,” IGARSS 2019–2019 IEEE International Geoscience and Remote Sensing Symposium, Yokohama, Japan, 2019, pp. 3764–3767, doi: https://doi.org/10.1109/IGARSS.2019.8900292.

  13. C. -Y. Wang, H. -Y. Mark Liao, Y. -H. Wu, P. -Y. Chen, J. -W. Hsieh and I. -H. Yeh, “CSPNet: A New Backbone that can Enhance Learning Capability of CNN,” 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, USA, 2020, pp. 1571–1580, doi: https://doi.org/10.1109/CVPRW50498.2020.00203.

  14. Lin T.Y, Dollar P, Girshick R, He K, Hariharan B, Belongie S. Feature Pyramid Networks for Object Detection[C]. IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, Hawaii, 2017.

    Google Scholar 

  15. Ren S, He, K, Girshick R, Sun J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence 2015, 39(6), 1137–1149.

    Article  Google Scholar 

  16. Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu C, Berg A. C. SSD: Single Shot MultiBox Detector[C]. European Conference on Computer Vision, Amsterdam, The Netherlands, 2016.

    Google Scholar 

  17. T. -Y. Lin, P. Goyal, R. Girshick, K. He and P. Dollár, “Focal Loss for Dense Object Detection,” 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 2017, pp. 2999–3007, doi: https://doi.org/10.1109/ICCV.2017.324.

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wang, L., Zhang, X., Li, S., Gao, G., Wang, J., Wang, Q. (2024). Target Detection Algorithm of Forward-Looking Sonar Based on Swin Transformer. In: Meng, L. (eds) International Conference on Cloud Computing and Computer Networks. CCCN 2023. Signals and Communication Technology. Springer, Cham. https://doi.org/10.1007/978-3-031-47100-1_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-47100-1_2

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-47099-8

  • Online ISBN: 978-3-031-47100-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics