Abstract
To meet the challenge of camouflaged object detection (COD), which has a high degree of intrinsic similarity between the object and background, this paper proposes a double-branch fusion network (DBFN) with a parallel attention selection mechanism (PASM). In detail, a schismatic receptive field block (SRF) combined with an attention mechanism for low-level information is performed to learn texture features in one branch, and an integration of the SRF, a hybrid attention mechanism (HAM), and a depth feature polymerization module (DFPM) is employed for high-level information to extract detection features in the other branch. Then, both texture features and detection features are input into the PASM to acquire selective expression matrices. Eventually, the final result is obtained after further selective matrix optimization with atrous spatial pyramid pooling (ASPP) and a residual channel attention block (RCAB) being applied serially. Experimental results on three public datasets verify that our method outperforms the state-of-the-art methods in terms of four evaluation metrics, i.e., mean absolute error (MAE), weighted Fβ measure (F ωβ ), structural measure (Sα), and E-measure (Eφ)
References
Fan D P, Ji G P, Sun G, et al. Camouflaged object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. 2777–2787
Fan D P, Ji G P, Cheng M M, et al. Concealed object detection. IEEE Trans Pattern Anal Mach Intell, 2021. doi: https://doi.org/10.1109/TPAMI.2021.3085766
Zheng Y, Zhang X, Wang F, et al. Detection of people with camouflage pattern via dense deconvolution network. IEEE Signal Process Lett, 2018, 26: 29–33
Chu H K, Hsu W H, Mitra N J, et al. Camouflage images. ACM Trans Graph, 2010, 29: 1–8
Ge S, Jin X, Ye Q, et al. Image editing by object-aware optimal boundary searching and mixed-domain composition. Comp Visual Media, 2018, 4: 71–82
Fan D P, Ji G P, Zhou T, et al. PraNet: parallel reverse attention network for polyp segmentation. In: Proceedings of International Conference on Medical Image Computing and Computer-Assisted Intervention, 2020. 263–273
Ren J, Hu X, Zhu L, et al. Deep texture-aware features for camouflaged object detection. IEEE Trans Circuits Syst Video Technol, 2021. doi: https://doi.org/10.1109/TCSVT.2021.3126591
Zhai Q, Li X, Yang F, et al. Mutual graph learning for camouflaged object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021. 12997–13007
Mei H, Ji G P, Wei Z, et al. Camouflaged object segmentation with distraction mining. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021. 8772–8781
Li A, Zhang J, Lv Y, et al. Uncertainty-aware joint salient object and camouflaged object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021. 10071–10081
Lv Y, Zhang J, Dai Y, et al. Simultaneously localize, segment and rank the camouflaged objects. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021. 11591–11601
Feng M, Lu H, Ding E. Attentive feedback network for boundary-aware salient object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. 1623–1632
Qin X, Zhang Z, Huang C, et al. BASNet: boundary-aware salient object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. 7479–7489
Chen Z, Xu Q, Cong R, et al. Global context-aware progressive aggregation network for salient object detection. In: Proceedings of the AAAI Conference on Artificial Intelligence, 2020. 34: 10599–10606
Liu S, Huang D, Wang Y. Learning spatial fusion for single-shot object detection. 2019. ArXiv:1911.09516
Woo S, Park J, Lee J Y, et al. CBAM: convolutional block attention module. In: Proceedings of the European Conference on Computer Vision (ECCV), 2018. 3–19
Chen L, Zhang H, Xiao J, et al. SCA-CNN: spatial and channel-wise attention in convolutional networks for image captioning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017. 5659–5667
Wang Q, Wu B, Zhu P, et al. ECA-Net: efficient channel attention for deep convolutional neural networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. 11531–11539
Zhao T, Wu X. Pyramid feature attention network for saliency detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. 3085–3094
Gao S H, Cheng M M, Zhao K, et al. Res2Net: a new multi-scale backbone architecture. IEEE Trans Pattern Anal Mach Intell, 2019, 43: 652–662
Yang M, Yu K, Zhang C, et al. DenseASPP for semantic segmentation in street scenes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018. 3684–3692
Basak H, Kundu R, Agarwal A, et al. Single image super-resolution using residual channel attention network. In: Proceedings of 2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS), 2020. 219–224
Liu S, Huang D. Receptive field block net for accurate and fast object detection. In: Proceedings of the European Conference on Computer Vision (ECCV), 2018. 385–400
Skurowski P, Abdulameer H, Błaszczyk J, et al. Animal camouflage analysis: CHAMELEON database. http://kgwisc.aei.polsl.pl/datasets/CamouflageBase/animals.7z
Le T N, Nguyen T V, Nie Z, et al. Anabranch network for camouflaged object segmentation. Comput Vision Image Understanding, 2019, 184: 45–56
Margolin R, Zelnik-Manor L, Tal A. How to evaluate foreground maps? In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014. 248–255
Fan D P, Cheng M M, Liu Y, et al. Structure-measure: a new way to evaluate foreground maps. In: Proceedings of the IEEE International Conference on Computer Vision, 2017. 4548–4557
Fan D P, Gong C, Cao Y, et al. Enhanced-alignment measure for binary foreground map evaluation. In: Proceedings of International Joint Conference on Artificial Intelligence, 2018. 698–704
Lin T Y, Dollar P, Girshick R, et al. Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017. 2117–2125
Zhou Z, Siddiquee M M R, Tajbakhsh N, et al. UNet++: a nested U-Net architecture for medical image segmentation. In: Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support. Cham: Springer, 2018. 3–11
Liu J J, Hou Q, Cheng M M, et al. A simple pooling-based design for real-time salient object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. 3917–3926
Wu Z, Su L, Huang Q. Cascaded partial decoder for fast and accurate salient object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. 3907–3916
Zhao J X, Liu J J, Fan D P, et al. EGNet: edge guidance network for salient object detection. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019. 8779–8788
Dong B, Zhuge M, Wang Y, et al. Towards accurate camouflaged object detection with mixture convolution and interactive fusion. 2021. ArXiv:2101.05687
Zhao H, Shi J, Qi X, et al. Pyramid scene parsing network. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2017. 6230–6239
Liu N, Han J, Yang M-H. PiCANet: learning pixel-wise contextual attention for saliency detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018. 3089–3098
Huang Z, Huang L, Gong Y, et al. Mask scoring R-CNN. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. 6409–6418
He K, Gkioxari G, Dollar P, et al. Mask R-CNN, In: Proceedings of the IEEE International Conference on Computer Vision, 2017. 2961–2969
Mao Y, Zhang J, Wan Z, et al. Generative transformer for accurate and reliable salient object detection. 2022. ArXiv:2104.10127
Hou F Y, Sun J, Yang Q L, et al. Deep reinforcement learning for optimal denial-of-service attacks scheduling. Sci China Inf Sci, 2022, 65: 162201
Acknowledgements
This work was partially supported by National Key R&D Program of China (Grant No. 2018YFB18011001), National Natural Science Foundation of China (Grant No. 62025502), Guangdong Introducing Innovative and Entrepreneurial Teams of ‘The Pearl River Talent Recruitment Program’ (Grant No. 2019ZT08X340), and Guangdong Guangxi Joint Science Key Foundation (Grant No. 2021GXNSFDA076001).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Xiang, J., Pan, Q., Zhang, Z. et al. Double-branch fusion network with a parallel attention selection mechanism for camouflaged object detection. Sci. China Inf. Sci. 66, 162403 (2023). https://doi.org/10.1007/s11432-022-3592-8
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11432-022-3592-8