AHC-Net: a road crack segmentation network based on dual attention mechanism and multi-feature fusion

Shi, Lin; Zhang, Ruijun; Wu, Yafeng; Cui, Dongyan; Yuan, Na; Liu, Jinyun; Ji, Zhanlin

doi:10.1007/s11760-024-03234-w

AHC-Net: a road crack segmentation network based on dual attention mechanism and multi-feature fusion

Original Paper
Published: 14 May 2024

(2024)
Cite this article

Signal, Image and Video Processing Aims and scope Submit manuscript

Lin Shi¹,
Ruijun Zhang¹,
Yafeng Wu¹,
Dongyan Cui¹,
Na Yuan²,
Jinyun Liu¹ &
…
Zhanlin Ji^1,3

94 Accesses
Explore all metrics

Abstract

To solve the problem of incomplete and inaccurate pavement crack detection, an improved U-Net model based on dual attention mechanism and multi-feature fusion is proposed. Firstly, a new encoding module ACI is designed, which has the feature of multi-scale feature extraction, significantly improves the sensing ability of the damaged area, reduces the background interference, and realizes more accurate segmentation. Secondly, a new decoding module HAD is designed, which avoids the network degradation problem caused by gradient vanishing and the growth of network layers and can retain the most subtle feature information during the decoding process. Finally, convolutional block attention module (CBAM) is introduced in the encoding part to effectively extract global and local detail information, and the criss-cross attention mechanism is also introduced in the decoding part to prevent the loss of marginalized information. The model proposed in this article was tested on the public datasets DeepCrack, CrackSeg478, and AsphaltCrack300, and compared with other advanced methods. The experimental results indicate that this method can detect road cracks more accurately and possesses considerable robustness.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation

SCA-YOLO: a new small object detection model for UAV images

Article 25 May 2023

Learning a Deep Convolutional Network for Image Super-Resolution

Data availability

Not applicable.

References

LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature. 521(7553), 436–444 (2015)
Article Google Scholar
Huang, H., Li, Q., Zhang, D.: Deep learning based image recognition for crack and leakage defects of metro shield tunnel. Tunn. Undergr. Space. Technol. 77, 166–176 (2018)
Article Google Scholar
Zou, Q., Zhang, Z., Li, Q., et al.: Deepcrack: learning hierarchical convolutional features for crack detection. IEEE. Trans. Image Process. 28(3), 1498–1512 (2018)
Article MathSciNet Google Scholar
Fan, Z., Wu, Y., Lu, J., et al.: Automatic pavement crack detection based on structured prediction with the convolutional neural network. arXiv preprint arXiv:1802.02208 (2018)
Zhang, L., Shen, J., Zhu, B.: A research on an improved Unet-based concrete crack detection algorithm. Struct. Health Monit. 20(4), 1864–1879 (2021)
Article Google Scholar
Sun, X., Xie, Y., Jiang, L., et al.: DMA-net: DEEPLab with multi-scale attention for pavement crack segmentation. IEEE. Trans. Intell. Transp. Syst. 23(10), 18392–18403 (2022)
Article Google Scholar
Yu, G., Dong, J., Wang, Y., et al.: RUC-Net: a residual-Unet-based convolutional neural network for pixel-level pavement crack segmentation. Sensors. 23(1), 53 (2022)
Article Google Scholar
Roy, A. G., Navab, N., Wachinger, C.: Concurrent spatial and channel ‘squeeze and excitation’in fully convolutional networks. In: Medical Image Computing and Computer Assisted Intervention (MICCAI), pp. 421–429 (2018)
Sheng, S., Yin, H., Yang, Y., et al.: DUNet: dense U-blocks network for fine-grained crack detection. SIVP. 18(2), 1929–1938 (2024)
Google Scholar
Zhou, Q., Qu, Z., Wang, S.Y., et al.: A method of potentially promising network for crack detection with enhanced convolution and dynamic feature fusion. IEEE. Trans. Intell. Transp. Syst. 23(10), 18736–18745 (2022)
Article Google Scholar
Xu, C., Zhang, Q., Mei, L., et al.: Cross-attention-guided feature alignment network for road crack detection. ISPRS. Int. J. Geo Inf. 12(9), 382 (2023)
Article Google Scholar
Di, Benedetto. A., Fiani, M., Gujski, L. M.: U-Net-Based CNN Architecture for road crack segmentatio. Infrastructures. 8(5): 90 (2023)
Gao, X., Tong, B.: MRA-UNet: balancing speed and accuracy in road crack segmentation network. SIViP. 17(5), 2093–2100 (2023)
Article Google Scholar
Zhou, Q., Qu, Z., Ju, F.: A lightweight network for crack detection with split exchange convolution and multi-scale features fusion. IEEE Transactions on Intelligent Vehicles. (2022)
Woo, S., Park, J., Lee, J. Y., et al.: Cbam: Convolutional block attention module. Proceedings of the European conference on computer vision (ECCV). 3–19 (2018)
Huang, Z., Wang, X., Huang, L., et al.: Ccnet: Criss-cross attention for semantic segmentation. Proceedings of the IEEE/CVF international conference on computer vision. 603–612 (2019)
Liu, Y., Yao, J., Lu, X., et al.: DeepCrack: a deep hierarchical feature learning architecture for crack segmentation. Neurocomputing. 338, 139–153 (2019)
Article Google Scholar
Phan, H. H.: STUCNET–Swin transformer-V2 Unet for crack segmentation network. Journal of Science and Technique-Section on Information and Communication Technology. 12(01) (2023)
Ronneberger, O., Fischer, P., Brox, T.: U-net: Convolutional networks for biomedical image segmentation. In: medical image computing and computer-assisted intervention–MICCAI 2015: 18th International Conference, pp. 234–241. Springer, Munich (2015)
Liu, J., Li, C., Liang, F., et al.: Inception convolution withcient dilation search. In Proceedings IEEE/CVF Conf. Comput. Vis. Pattern Recognit. pp. 6–11495 (2020)
Li, R., Duan, C., Zheng, S., et al.: MACU-net for semantic segmentation of fine-resolution remotely sensed images. IEEE. Geosci. Remote Sens. Lett. 19, 1–5 (2022)
Google Scholar
Ibtehaz, N., Kihara, D.: Acc-unet: A completely convolutional unet model for the 2020s. International conference on medical image computing and computer-assisted intervention. Cham: Springer Nature Switzerland. 692–702 (2023)
Szegedy, C., Liu, W., Jia, Y., et al.: Going deeper with convolutions. Proceedings of the IEEE conference on computer vision and pattern recognition. 1–9 (2015)
Lin, M., Chen, Q., Yan, S.: Network in network. arXiv preprint arXiv:1312.4400. (2013)
Punn, N.S., Agarwal, S.: Inception u-net architecture for semantic segmentation to identify nuclei in microscopy cell images. ACM. Trans. Multimed. Comput. Commun. Appl. (TOMM). 16(1), 1–5 (2020)
Article Google Scholar
Tolstikhin, I.O., Houlsby, N., Kolesnikov, A., et al.: Mlp-mixer: an all-mlp architecture for vision. Adv. Neural. Inf. Process. Syst. 34, 24261–24272 (2021)
Google Scholar
Chen, J., Lu, Y., Yu, Q., et al.: Transunet: Transformers make strong encoders for medical image segmentation. arXiv preprint arXiv:2102.04306. (2021)
Yang, J., Li, C., Dai, X., et al.: Focal modulation networks. Adv. Neural. Inf. Process. Syst. 35, 4203–4217 (2022)
Google Scholar
Ruby, U., Yendapalli, V.: Binary cross entropy with deep learning technique for image classification. Int. J. Adv. Trends. Comput. Sci. Eng. 9(10), 5393–5397 (2020)
Google Scholar
Milletari, F., Navab, N., Ahmadi, S. A.: V-net: Fully convolutional neural networks for volumetric medical image segmentation. 2016 fourth international conference on 3D vision (3DV). IEEE. 565–571 (2016)
Montazerolghaem, M., Sun, Y., Sasso, G., et al.: U-Net architecture for prostate segmentation: the Impact of loss function on system performance. Bioengineering. 10(4), 412 (2023)
Article Google Scholar
Trebing, K., Staǹczyk, T., Mehrkanoon, S.: SmaAt-UNet: precipitation nowcasting using a small attention-UNet architecture. Pattern. Recogn. Lett. 145, 178–186 (2021)
Article Google Scholar
Jha, D., Ali, S., Tomar, N.K., et al.: Real-time polyp detection, localization and segmentation in colonoscopy using deep learning. IEEE. Access. 9, 40496–40510 (2021)
Article Google Scholar
Tomar, N. K., Shergill, A., Rieders, B., et al.: TransResU-Net: Transformer based ResU-Net for real-time colonoscopy polyp segmentation. arXiv preprint arXiv:2206.08985 (2022)
Valanarasu, J. M. J., Patel, V. M.: Unext: Mlp-based rapid medical image segmentation network. International Conference on Medical Image Computing and Computer-Assisted Intervention. Cham: Springer Nature Switzerland. 23–33 (2022)
Sun, Y., Bi, F., Gao, Y., et al.: A multi-attention UNet for semantic segmentation in remote sensing images. Symmetry. 14(5), 906 (2022)
Article Google Scholar
Tang, H., He, S., Yang, M., et al.: CSC-Unet: a novel convolutional sparse coding strategy based neural network for semantic segmentation. IEEE Access. (2024)

Download references

Acknowledgements

I would like to thank to reviewers for their detailed comments on the article.

Funding

This publication has emanated from research conducted with the financial support of the National Key Research and Development Program of China under the Grant No. 2017YFE0135700.

Author information

Authors and Affiliations

Hebei Key Laboratory of Industrial Intelligent Perception, North China University of Science and Technology, Tangshan, 063210, China
Lin Shi, Ruijun Zhang, Yafeng Wu, Dongyan Cui, Jinyun Liu & Zhanlin Ji
Intelligence and Information Engineering College, Tangshan University, Tangshan, 063000, China
Na Yuan
College of Mathematics and Computer Science, Zhejiang A&F University, Hangzhou, 311300, China
Zhanlin Ji

Authors

Lin Shi
View author publications
You can also search for this author in PubMed Google Scholar
Ruijun Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yafeng Wu
View author publications
You can also search for this author in PubMed Google Scholar
Dongyan Cui
View author publications
You can also search for this author in PubMed Google Scholar
Na Yuan
View author publications
You can also search for this author in PubMed Google Scholar
Jinyun Liu
View author publications
You can also search for this author in PubMed Google Scholar
Zhanlin Ji
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

LS and ZLJ mainly proposed the structure of the article, RJZ and NY mainly completed the improvement in the model and the ablation experiment and comparison experiment, YFW, DYC and JYL mainly completed the comparison of experimental data and the conclusion of the paper.

Corresponding authors

Correspondence to Jinyun Liu or Zhanlin Ji.

Ethics declarations

Conflict of interest

Authors declare no conflict of interest regarding the publication of this paper.

Ethics approval

Approval.

Consent to participate

Approval.

Consent for publication

Approval.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Shi, L., Zhang, R., Wu, Y. et al. AHC-Net: a road crack segmentation network based on dual attention mechanism and multi-feature fusion. SIViP (2024). https://doi.org/10.1007/s11760-024-03234-w

Download citation

Received: 22 March 2024
Revised: 12 April 2024
Accepted: 18 April 2024
Published: 14 May 2024
DOI: https://doi.org/10.1007/s11760-024-03234-w

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

AHC-Net: a road crack segmentation network based on dual attention mechanism and multi-feature fusion

Abstract

Access this article

Similar content being viewed by others

Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation

SCA-YOLO: a new small object detection model for UAV images

Learning a Deep Convolutional Network for Image Super-Resolution

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Conflict of interest

Ethics approval

Consent to participate

Consent for publication

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

AHC-Net: a road crack segmentation network based on dual attention mechanism and multi-feature fusion

Abstract

Access this article

Similar content being viewed by others

Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation

SCA-YOLO: a new small object detection model for UAV images

Learning a Deep Convolutional Network for Image Super-Resolution

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Conflict of interest

Ethics approval

Consent to participate

Consent for publication

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation