Abstract
To address incomplete and inaccurate pavement crack detection, an improved U-Net model based on a dual attention mechanism and multi-feature fusion is proposed. First, a new encoding module, ACI, is designed; it extracts multi-scale features, significantly improves the network's sensitivity to damaged areas, reduces background interference, and yields more accurate segmentation. Second, a new decoding module, HAD, is designed; it avoids the network degradation caused by vanishing gradients as the number of layers grows, and it retains the most subtle feature information during decoding. Finally, a convolutional block attention module (CBAM) is introduced in the encoder to extract global and local detail information effectively, and a criss-cross attention mechanism is introduced in the decoder to prevent the loss of edge information. The proposed model was tested on the public datasets DeepCrack, CrackSeg478, and AsphaltCrack300 and compared with other advanced methods. The experimental results indicate that the method detects road cracks more accurately and with considerable robustness.
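As a rough illustration of the CBAM idea mentioned in the abstract, the sketch below applies channel attention followed by spatial attention to a single feature map, following the generally known CBAM formulation. It is not the AHC-Net implementation: the function names, the reduction ratio `r = 4`, the 7x7 kernel size, and the random weights are all assumptions chosen for illustration, and NumPy is used instead of a deep learning framework to keep the example small.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def channel_attention(feat, w1, w2):
    """Return one weight per channel for a (C, H, W) feature map.

    w1 (C//r, C) and w2 (C, C//r) are the weights of a shared two-layer
    MLP applied to both the average-pooled and max-pooled descriptors.
    """
    avg = feat.mean(axis=(1, 2))  # (C,) global average pooling
    mx = feat.max(axis=(1, 2))    # (C,) global max pooling

    def mlp(v):
        return w2 @ np.maximum(w1 @ v, 0.0)  # linear -> ReLU -> linear

    return sigmoid(mlp(avg) + mlp(mx))


def spatial_attention(feat, kernel):
    """Return one weight per pixel for a (C, H, W) feature map.

    kernel has shape (2, k, k) with k odd; it is convolved over the
    channel-wise average and max maps using zero padding.
    """
    pooled = np.stack([feat.mean(axis=0), feat.max(axis=0)])  # (2, H, W)
    k = kernel.shape[-1]
    pad = k // 2
    padded = np.pad(pooled, ((0, 0), (pad, pad), (pad, pad)))
    h, w = feat.shape[1:]
    out = np.zeros((h, w))
    for i in range(h):
        for j in range(w):
            out[i, j] = np.sum(padded[:, i:i + k, j:j + k] * kernel)
    return sigmoid(out)


def cbam(feat, w1, w2, kernel):
    """Reweight channels first, then reweight spatial positions."""
    feat = feat * channel_attention(feat, w1, w2)[:, None, None]
    return feat * spatial_attention(feat, kernel)[None, :, :]


# Illustrative sizes and random weights (assumptions, not from the paper).
rng = np.random.default_rng(0)
C, H, W, r = 8, 16, 16, 4
feat = rng.standard_normal((C, H, W))
w1 = rng.standard_normal((C // r, C)) * 0.1
w2 = rng.standard_normal((C, C // r)) * 0.1
kernel = rng.standard_normal((2, 7, 7)) * 0.1
out = cbam(feat, w1, w2, kernel)
```

Because both attention maps pass through a sigmoid, every weight lies strictly between 0 and 1, so the block only rescales the input features and leaves the tensor shape unchanged. In a trained network the weights would be learned rather than sampled at random.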
Data availability
Not applicable.
Acknowledgements
I would like to thank the reviewers for their detailed comments on the article.
Funding
This publication has emanated from research conducted with the financial support of the National Key Research and Development Program of China under Grant No. 2017YFE0135700.
Author information
Contributions
LS and ZLJ were mainly responsible for proposing the structure of the article; RJZ and NY for the model improvements, the ablation experiments, and the comparison experiments; and YFW, DYC, and JYL for the comparison of the experimental data and the conclusion of the paper.
Ethics declarations
Conflict of interest
The authors declare no conflict of interest regarding the publication of this paper.
Ethics approval
Approval.
Consent to participate
Approval.
Consent for publication
Approval.
About this article
Cite this article
Shi, L., Zhang, R., Wu, Y. et al. AHC-Net: a road crack segmentation network based on dual attention mechanism and multi-feature fusion. SIViP (2024). https://doi.org/10.1007/s11760-024-03234-w
DOI: https://doi.org/10.1007/s11760-024-03234-w