Skip to main content
Log in

AHC-Net: a road crack segmentation network based on dual attention mechanism and multi-feature fusion

  • Original Paper
  • Published:
Signal, Image and Video Processing Aims and scope Submit manuscript

Abstract

To solve the problem of incomplete and inaccurate pavement crack detection, an improved U-Net model based on dual attention mechanism and multi-feature fusion is proposed. Firstly, a new encoding module ACI is designed, which has the feature of multi-scale feature extraction, significantly improves the sensing ability of the damaged area, reduces the background interference, and realizes more accurate segmentation. Secondly, a new decoding module HAD is designed, which avoids the network degradation problem caused by gradient vanishing and the growth of network layers and can retain the most subtle feature information during the decoding process. Finally, convolutional block attention module (CBAM) is introduced in the encoding part to effectively extract global and local detail information, and the criss-cross attention mechanism is also introduced in the decoding part to prevent the loss of marginalized information. The model proposed in this article was tested on the public datasets DeepCrack, CrackSeg478, and AsphaltCrack300, and compared with other advanced methods. The experimental results indicate that this method can detect road cracks more accurately and possesses considerable robustness.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig.1
Fig.2
Fig.3
Fig.4
Fig.5
Fig.6
Fig.7
Fig.8
Fig.9
Fig.10
Fig.11

Similar content being viewed by others

Data availability

Not applicable.

References

  1. LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature. 521(7553), 436–444 (2015)

    Article  Google Scholar 

  2. Huang, H., Li, Q., Zhang, D.: Deep learning based image recognition for crack and leakage defects of metro shield tunnel. Tunn. Undergr. Space. Technol. 77, 166–176 (2018)

    Article  Google Scholar 

  3. Zou, Q., Zhang, Z., Li, Q., et al.: Deepcrack: learning hierarchical convolutional features for crack detection. IEEE. Trans. Image Process. 28(3), 1498–1512 (2018)

    Article  MathSciNet  Google Scholar 

  4. Fan, Z., Wu, Y., Lu, J., et al.: Automatic pavement crack detection based on structured prediction with the convolutional neural network. arXiv preprint arXiv:1802.02208 (2018)

  5. Zhang, L., Shen, J., Zhu, B.: A research on an improved Unet-based concrete crack detection algorithm. Struct. Health Monit. 20(4), 1864–1879 (2021)

    Article  Google Scholar 

  6. Sun, X., Xie, Y., Jiang, L., et al.: DMA-net: DEEPLab with multi-scale attention for pavement crack segmentation. IEEE. Trans. Intell. Transp. Syst. 23(10), 18392–18403 (2022)

    Article  Google Scholar 

  7. Yu, G., Dong, J., Wang, Y., et al.: RUC-Net: a residual-Unet-based convolutional neural network for pixel-level pavement crack segmentation. Sensors. 23(1), 53 (2022)

    Article  Google Scholar 

  8. Roy, A. G., Navab, N., Wachinger, C.: Concurrent spatial and channel ‘squeeze and excitation’in fully convolutional networks. In: Medical Image Computing and Computer Assisted Intervention (MICCAI), pp. 421–429 (2018)

  9. Sheng, S., Yin, H., Yang, Y., et al.: DUNet: dense U-blocks network for fine-grained crack detection. SIVP. 18(2), 1929–1938 (2024)

    Google Scholar 

  10. Zhou, Q., Qu, Z., Wang, S.Y., et al.: A method of potentially promising network for crack detection with enhanced convolution and dynamic feature fusion. IEEE. Trans. Intell. Transp. Syst. 23(10), 18736–18745 (2022)

    Article  Google Scholar 

  11. Xu, C., Zhang, Q., Mei, L., et al.: Cross-attention-guided feature alignment network for road crack detection. ISPRS. Int. J. Geo Inf. 12(9), 382 (2023)

    Article  Google Scholar 

  12. Di, Benedetto. A., Fiani, M., Gujski, L. M.: U-Net-Based CNN Architecture for road crack segmentatio. Infrastructures. 8(5): 90 (2023)

  13. Gao, X., Tong, B.: MRA-UNet: balancing speed and accuracy in road crack segmentation network. SIViP. 17(5), 2093–2100 (2023)

    Article  Google Scholar 

  14. Zhou, Q., Qu, Z., Ju, F.: A lightweight network for crack detection with split exchange convolution and multi-scale features fusion. IEEE Transactions on Intelligent Vehicles. (2022)

  15. Woo, S., Park, J., Lee, J. Y., et al.: Cbam: Convolutional block attention module. Proceedings of the European conference on computer vision (ECCV). 3–19 (2018)

  16. Huang, Z., Wang, X., Huang, L., et al.: Ccnet: Criss-cross attention for semantic segmentation. Proceedings of the IEEE/CVF international conference on computer vision. 603–612 (2019)

  17. Liu, Y., Yao, J., Lu, X., et al.: DeepCrack: a deep hierarchical feature learning architecture for crack segmentation. Neurocomputing. 338, 139–153 (2019)

    Article  Google Scholar 

  18. Phan, H. H.: STUCNET–Swin transformer-V2 Unet for crack segmentation network. Journal of Science and Technique-Section on Information and Communication Technology. 12(01) (2023)

  19. Ronneberger, O., Fischer, P., Brox, T.: U-net: Convolutional networks for biomedical image segmentation. In: medical image computing and computer-assisted intervention–MICCAI 2015: 18th International Conference, pp. 234–241. Springer, Munich (2015)

  20. Liu, J., Li, C., Liang, F., et al.: Inception convolution withcient dilation search. In Proceedings IEEE/CVF Conf. Comput. Vis. Pattern Recognit. pp. 6–11495 (2020)

  21. Li, R., Duan, C., Zheng, S., et al.: MACU-net for semantic segmentation of fine-resolution remotely sensed images. IEEE. Geosci. Remote Sens. Lett. 19, 1–5 (2022)

    Google Scholar 

  22. Ibtehaz, N., Kihara, D.: Acc-unet: A completely convolutional unet model for the 2020s. International conference on medical image computing and computer-assisted intervention. Cham: Springer Nature Switzerland. 692–702 (2023)

  23. Szegedy, C., Liu, W., Jia, Y., et al.: Going deeper with convolutions. Proceedings of the IEEE conference on computer vision and pattern recognition. 1–9 (2015)

  24. Lin, M., Chen, Q., Yan, S.: Network in network. arXiv preprint arXiv:1312.4400. (2013)

  25. Punn, N.S., Agarwal, S.: Inception u-net architecture for semantic segmentation to identify nuclei in microscopy cell images. ACM. Trans. Multimed. Comput. Commun. Appl. (TOMM). 16(1), 1–5 (2020)

    Article  Google Scholar 

  26. Tolstikhin, I.O., Houlsby, N., Kolesnikov, A., et al.: Mlp-mixer: an all-mlp architecture for vision. Adv. Neural. Inf. Process. Syst. 34, 24261–24272 (2021)

    Google Scholar 

  27. Chen, J., Lu, Y., Yu, Q., et al.: Transunet: Transformers make strong encoders for medical image segmentation. arXiv preprint arXiv:2102.04306. (2021)

  28. Yang, J., Li, C., Dai, X., et al.: Focal modulation networks. Adv. Neural. Inf. Process. Syst. 35, 4203–4217 (2022)

    Google Scholar 

  29. Ruby, U., Yendapalli, V.: Binary cross entropy with deep learning technique for image classification. Int. J. Adv. Trends. Comput. Sci. Eng. 9(10), 5393–5397 (2020)

    Google Scholar 

  30. Milletari, F., Navab, N., Ahmadi, S. A.: V-net: Fully convolutional neural networks for volumetric medical image segmentation. 2016 fourth international conference on 3D vision (3DV). IEEE. 565–571 (2016)

  31. Montazerolghaem, M., Sun, Y., Sasso, G., et al.: U-Net architecture for prostate segmentation: the Impact of loss function on system performance. Bioengineering. 10(4), 412 (2023)

    Article  Google Scholar 

  32. Trebing, K., Staǹczyk, T., Mehrkanoon, S.: SmaAt-UNet: precipitation nowcasting using a small attention-UNet architecture. Pattern. Recogn. Lett. 145, 178–186 (2021)

    Article  Google Scholar 

  33. Jha, D., Ali, S., Tomar, N.K., et al.: Real-time polyp detection, localization and segmentation in colonoscopy using deep learning. IEEE. Access. 9, 40496–40510 (2021)

    Article  Google Scholar 

  34. Tomar, N. K., Shergill, A., Rieders, B., et al.: TransResU-Net: Transformer based ResU-Net for real-time colonoscopy polyp segmentation. arXiv preprint arXiv:2206.08985 (2022)

  35. Valanarasu, J. M. J., Patel, V. M.: Unext: Mlp-based rapid medical image segmentation network. International Conference on Medical Image Computing and Computer-Assisted Intervention. Cham: Springer Nature Switzerland. 23–33 (2022)

  36. Sun, Y., Bi, F., Gao, Y., et al.: A multi-attention UNet for semantic segmentation in remote sensing images. Symmetry. 14(5), 906 (2022)

    Article  Google Scholar 

  37. Tang, H., He, S., Yang, M., et al.: CSC-Unet: a novel convolutional sparse coding strategy based neural network for semantic segmentation. IEEE Access. (2024)

Download references

Acknowledgements

I would like to thank to reviewers for their detailed comments on the article.

Funding

This publication has emanated from research conducted with the financial support of the National Key Research and Development Program of China under the Grant No. 2017YFE0135700.

Author information

Authors and Affiliations

Authors

Contributions

LS and ZLJ mainly proposed the structure of the article, RJZ and NY mainly completed the improvement in the model and the ablation experiment and comparison experiment, YFW, DYC and JYL mainly completed the comparison of experimental data and the conclusion of the paper.

Corresponding authors

Correspondence to Jinyun Liu or Zhanlin Ji.

Ethics declarations

Conflict of interest

Authors declare no conflict of interest regarding the publication of this paper.

Ethics approval

Approval.

Consent to participate

Approval.

Consent for publication

Approval.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Shi, L., Zhang, R., Wu, Y. et al. AHC-Net: a road crack segmentation network based on dual attention mechanism and multi-feature fusion. SIViP (2024). https://doi.org/10.1007/s11760-024-03234-w

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11760-024-03234-w

Keywords

Navigation