Skip to main content
Log in

Double-branch fusion network with a parallel attention selection mechanism for camouflaged object detection

  • Research Paper
  • Published:
Science China Information Sciences Aims and scope Submit manuscript

Abstract

To meet the challenge of camouflaged object detection (COD), which has a high degree of intrinsic similarity between the object and background, this paper proposes a double-branch fusion network (DBFN) with a parallel attention selection mechanism (PASM). In detail, a schismatic receptive field block (SRF) combined with an attention mechanism for low-level information is performed to learn texture features in one branch, and an integration of the SRF, a hybrid attention mechanism (HAM), and a depth feature polymerization module (DFPM) is employed for high-level information to extract detection features in the other branch. Then, both texture features and detection features are input into the PASM to acquire selective expression matrices. Eventually, the final result is obtained after further selective matrix optimization with atrous spatial pyramid pooling (ASPP) and a residual channel attention block (RCAB) being applied serially. Experimental results on three public datasets verify that our method outperforms the state-of-the-art methods in terms of four evaluation metrics, i.e., mean absolute error (MAE), weighted measure (F ωβ ), structural measure (Sα), and E-measure (Eφ)

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

References

  1. Fan D P, Ji G P, Sun G, et al. Camouflaged object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. 2777–2787

  2. Fan D P, Ji G P, Cheng M M, et al. Concealed object detection. IEEE Trans Pattern Anal Mach Intell, 2021. doi: https://doi.org/10.1109/TPAMI.2021.3085766

  3. Zheng Y, Zhang X, Wang F, et al. Detection of people with camouflage pattern via dense deconvolution network. IEEE Signal Process Lett, 2018, 26: 29–33

    Article  Google Scholar 

  4. Chu H K, Hsu W H, Mitra N J, et al. Camouflage images. ACM Trans Graph, 2010, 29: 1–8

    Article  Google Scholar 

  5. Ge S, Jin X, Ye Q, et al. Image editing by object-aware optimal boundary searching and mixed-domain composition. Comp Visual Media, 2018, 4: 71–82

    Article  Google Scholar 

  6. Fan D P, Ji G P, Zhou T, et al. PraNet: parallel reverse attention network for polyp segmentation. In: Proceedings of International Conference on Medical Image Computing and Computer-Assisted Intervention, 2020. 263–273

  7. Ren J, Hu X, Zhu L, et al. Deep texture-aware features for camouflaged object detection. IEEE Trans Circuits Syst Video Technol, 2021. doi: https://doi.org/10.1109/TCSVT.2021.3126591

  8. Zhai Q, Li X, Yang F, et al. Mutual graph learning for camouflaged object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021. 12997–13007

  9. Mei H, Ji G P, Wei Z, et al. Camouflaged object segmentation with distraction mining. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021. 8772–8781

  10. Li A, Zhang J, Lv Y, et al. Uncertainty-aware joint salient object and camouflaged object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021. 10071–10081

  11. Lv Y, Zhang J, Dai Y, et al. Simultaneously localize, segment and rank the camouflaged objects. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021. 11591–11601

  12. Feng M, Lu H, Ding E. Attentive feedback network for boundary-aware salient object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. 1623–1632

  13. Qin X, Zhang Z, Huang C, et al. BASNet: boundary-aware salient object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. 7479–7489

  14. Chen Z, Xu Q, Cong R, et al. Global context-aware progressive aggregation network for salient object detection. In: Proceedings of the AAAI Conference on Artificial Intelligence, 2020. 34: 10599–10606

  15. Liu S, Huang D, Wang Y. Learning spatial fusion for single-shot object detection. 2019. ArXiv:1911.09516

  16. Woo S, Park J, Lee J Y, et al. CBAM: convolutional block attention module. In: Proceedings of the European Conference on Computer Vision (ECCV), 2018. 3–19

  17. Chen L, Zhang H, Xiao J, et al. SCA-CNN: spatial and channel-wise attention in convolutional networks for image captioning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017. 5659–5667

  18. Wang Q, Wu B, Zhu P, et al. ECA-Net: efficient channel attention for deep convolutional neural networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. 11531–11539

  19. Zhao T, Wu X. Pyramid feature attention network for saliency detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. 3085–3094

  20. Gao S H, Cheng M M, Zhao K, et al. Res2Net: a new multi-scale backbone architecture. IEEE Trans Pattern Anal Mach Intell, 2019, 43: 652–662

    Article  Google Scholar 

  21. Yang M, Yu K, Zhang C, et al. DenseASPP for semantic segmentation in street scenes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018. 3684–3692

  22. Basak H, Kundu R, Agarwal A, et al. Single image super-resolution using residual channel attention network. In: Proceedings of 2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS), 2020. 219–224

  23. Liu S, Huang D. Receptive field block net for accurate and fast object detection. In: Proceedings of the European Conference on Computer Vision (ECCV), 2018. 385–400

  24. Skurowski P, Abdulameer H, Błaszczyk J, et al. Animal camouflage analysis: CHAMELEON database. http://kgwisc.aei.polsl.pl/datasets/CamouflageBase/animals.7z

  25. Le T N, Nguyen T V, Nie Z, et al. Anabranch network for camouflaged object segmentation. Comput Vision Image Understanding, 2019, 184: 45–56

    Article  Google Scholar 

  26. Margolin R, Zelnik-Manor L, Tal A. How to evaluate foreground maps? In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014. 248–255

  27. Fan D P, Cheng M M, Liu Y, et al. Structure-measure: a new way to evaluate foreground maps. In: Proceedings of the IEEE International Conference on Computer Vision, 2017. 4548–4557

  28. Fan D P, Gong C, Cao Y, et al. Enhanced-alignment measure for binary foreground map evaluation. In: Proceedings of International Joint Conference on Artificial Intelligence, 2018. 698–704

  29. Lin T Y, Dollar P, Girshick R, et al. Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017. 2117–2125

  30. Zhou Z, Siddiquee M M R, Tajbakhsh N, et al. UNet++: a nested U-Net architecture for medical image segmentation. In: Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support. Cham: Springer, 2018. 3–11

    Chapter  Google Scholar 

  31. Liu J J, Hou Q, Cheng M M, et al. A simple pooling-based design for real-time salient object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. 3917–3926

  32. Wu Z, Su L, Huang Q. Cascaded partial decoder for fast and accurate salient object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. 3907–3916

  33. Zhao J X, Liu J J, Fan D P, et al. EGNet: edge guidance network for salient object detection. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019. 8779–8788

  34. Dong B, Zhuge M, Wang Y, et al. Towards accurate camouflaged object detection with mixture convolution and interactive fusion. 2021. ArXiv:2101.05687

  35. Zhao H, Shi J, Qi X, et al. Pyramid scene parsing network. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2017. 6230–6239

  36. Liu N, Han J, Yang M-H. PiCANet: learning pixel-wise contextual attention for saliency detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018. 3089–3098

  37. Huang Z, Huang L, Gong Y, et al. Mask scoring R-CNN. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. 6409–6418

  38. He K, Gkioxari G, Dollar P, et al. Mask R-CNN, In: Proceedings of the IEEE International Conference on Computer Vision, 2017. 2961–2969

  39. Mao Y, Zhang J, Wan Z, et al. Generative transformer for accurate and reliable salient object detection. 2022. ArXiv:2104.10127

  40. Hou F Y, Sun J, Yang Q L, et al. Deep reinforcement learning for optimal denial-of-service attacks scheduling. Sci China Inf Sci, 2022, 65: 162201

    Article  MathSciNet  Google Scholar 

Download references

Acknowledgements

This work was partially supported by National Key R&D Program of China (Grant No. 2018YFB18011001), National Natural Science Foundation of China (Grant No. 62025502), Guangdong Introducing Innovative and Entrepreneurial Teams of ‘The Pearl River Talent Recruitment Program’ (Grant No. 2019ZT08X340), and Guangdong Guangxi Joint Science Key Foundation (Grant No. 2021GXNSFDA076001).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Qing Pan.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xiang, J., Pan, Q., Zhang, Z. et al. Double-branch fusion network with a parallel attention selection mechanism for camouflaged object detection. Sci. China Inf. Sci. 66, 162403 (2023). https://doi.org/10.1007/s11432-022-3592-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11432-022-3592-8

Keywords

Navigation