Crack-Att Net: crack detection based on improved U-Net with parallel attention

Xu, Na; He, Lizhi; Li, Qing

doi:10.1007/s11042-023-15201-7

Crack-Att Net: crack detection based on improved U-Net with parallel attention

Published: 13 April 2023

Volume 82, pages 42465–42484, (2023)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

498 Accesses
2 Citations
1 Altmetric
Explore all metrics

Abstract

In recent years, the demand for automatic crack detection has increased rapidly. Due to the particularity of crack images, that is, the proportion of cracks in the entire images is very small, and some cracks in the image are particularly slender and light, it brings challenge for automatic crack detection. In this paper, we propose an end-to-end pixel-level crack segmentation network, named as “Crack-Att Net”. In our approach, firstly, an encoder network is used to extract the crack features; then, crack features generated by the encoder and decoder networks at the same scale are pairwisely fused through a parallel attention mechanism added for accurately locating the cracks; finally, the fused crack feature maps at all scales are further fused into a multi-scale feature-fusion map for crack detection. Experiments results on three existing datasets and an augmented dataset show that our proposed Crack-Att Net outperforms the current state-of-the-art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

CrackViT: a unified CNN-transformer model for pixel-level crack extraction

Article 31 January 2023

AHC-Net: a road crack segmentation network based on dual attention mechanism and multi-feature fusion

Article 14 May 2024

Gating attention convolutional networks with dense connection for pixel-level crack detection

Article 15 October 2022

Data availability

Based on existing available datasets from the CFD datasets [35], CrackTree 200 [46], DeepCrack [20] and crack images we have collected, we have composed the Crack Segmentation Dataset 1121(or called CSD1121).

References

Abdel-Qader I, Abudayyeh O, Kelly ME (2003) Analysis of edge-detection techniques for crack identification in bridges. J Comput Civ Eng 17:255–263. https://doi.org/10.1061/(ASCE)0887-3801(2003)17:4(255)
Article Google Scholar
Badrinarayanan V, Kendall A, Cipolla R (2017) Segnet: a deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans Pattern Anal Mach Intell 39:2481–2495. https://doi.org/10.1109/TPAMI.2016.2644615
Article Google Scholar
Benz C, Debus P, Ha HK, Rodehorst V (2019) Crack segmentation on UAS-based imagery using transfer learning. 2019 International Conference on Image and Vision Computing New Zealand (IVCNZ), Paris, France, pp 1–6. https://doi.org/10.1109/IVCNZ48456.2019.8960998
Cho H, Yoon H-J, Jung J-Y (2018) Image-based crack detection using crack width transform (CWT) algorithm. IEEE Access 6:60100–60114. https://doi.org/10.1109/ACCESS.2018.2875889
Article Google Scholar
Fan R, Bocus MJ, Zhu Y et al (2019) Road crack detection using deep convolutional neural network and adaptive thresholding. 2019 IEEE Intelligent Vehicles Symposium (IV) 474–479. https://doi.org/10.1109/IVS.2019.8814000
Garcia-Garcia A, Orts-Escolano S, Oprea S et al (2017) A survey on deep learning techniques applied to semantic segmentation. Appl Soft Comput 70:41–65. https://doi.org/10.1016/j.asoc.2018.05.018
Article Google Scholar
Girshick R (2015) Fast r-cnn: towards real-time object detection with region proposal networks. Adv Neural Inf Proces Syst 1440–1448. https://doi.org/10.1109/TPAMI.2016.2577031
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, pp 770–778. https://doi.org/10.1109/CVPR.2016.90
Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, pp 7132–7141. https://doi.org/10.1109/CVPR.2018.00745
Ioffe S, Szegedy C (2015) Batch normalization: accelerating deep network training by reducing internal covariate shift. Proceedings of the 32nd International Conference on Machine Learning 37:448–456. https://dl.acm.org/doi/10.5555/3045118.3045167
Kaseko MS, Ritchie SG (1993) A neural network-based methodology for pavement crack detection and classification. Transportation Research Part C: Emerging Technologies 1:275–291
Kingma DP, Ba J (2014) Adam: a method for stochastic optimization. arXiv preprint arXiv:14126980. https://arxiv.org/abs/1412.6980
Krizhevsky A, Sutskever I, Hinton GE (2017) Imagenet classification with deep convolutional neural networks. Commun ACM 60:84–90. https://doi.org/10.1145/3065386
Article Google Scholar
Li Q, Liu X (2008) Novel approach to pavement image segmentation based on neighboring difference histogram method. 2008 Congress on Image and Signal Processing, Sanya, pp 792–796. https://doi.org/10.1109/CISP.2008.13
Lim RS, La HM, Sheng W (2014) A robotic crack inspection and mapping system for bridge deck maintenance. IEEE Trans Autom Sci Eng 11:367–378. https://doi.org/10.1109/TASE.2013.2294687
Article Google Scholar
Lin T-Y, Goyal P, Girshick R et al (2020) Focal loss for dense object detection. Focal loss for dense object detection. IEEE Trans Pattern Anal Mach Intell 42:318–327. https://doi.org/10.1109/TPAMI.2018.2858826
Article Google Scholar
Liu W, Anguelov D, Erhan D et al (2016) Ssd: single shot multibox detector. European Conference on Computer Vision 21–37. https://doi.org/10.1007/978-3-319-46448-0_2
Liu Y, Cheng M-M, Hu X et al (2019) Richer convolutional features for edge detection. IEEE Trans Pattern Anal Mach Intell 41:939–1946. https://doi.org/10.1109/TPAMI.2018.2878849
Article Google Scholar
Liu K, Han X, Chen BM (2019) Deep learning based automatic crack detection and segmentation for unmanned aerial vehicle inspections. 2019 IEEE International Conference on Robotics and Biomimetics (ROBIO), Dali, pp 381–387. https://doi.org/10.1109/ROBIO49542.2019.8961534
Liu Y, Yao J, Lu X et al (2019) DeepCrack: a deep hierarchical feature learning architecture for crack segmentation. Neurocomputing 338:139–153. https://doi.org/10.1016/j.neucom.2019.01.036
Article Google Scholar
Liu Y, Li Q, Yuan Y et al (2021) ABNet: adaptive balanced network for multiscale object detection in remote sensing imagery. IEEE Trans Geosci Remote Sens 60:1–14. https://doi.org/10.1109/TGRS.2021.3133956
Article Google Scholar
Liu H, Miao X, Mertz C, et al (2021) CrackFormer: transformer network for fine-grained crack detection. 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, pp 3763–3772. https://doi.org/10.1109/ICCV48922.2021.00376
Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3431–3440
Medina R, Llamas J, Gómez-García-Bermejo J et al (2017) Crack detection in concrete tunnels using a gabor filter invariant to rotation. Sensors 17:1670. https://doi.org/10.3390/s17071670
Article Google Scholar
Mohan A, Poobal S (2018) Crack detection using image processing: a critical review and analysis. Alex Eng J 57:787–798. https://doi.org/10.1016/j.aej.2017.01.020
Article Google Scholar
Nair V, Hinton GE (2010) Rectified linear units improve restricted boltzmann machines. Proceedings of the 27th International Conference on International Conference on Machine Learning, pp 807–814. https://dl.acm.org/doi/10.5555/3104322.3104425
Oliveira H, Correia PL (2009) Supervised crack detection and classification in images of road pavement flexible surfaces. Recent Advances in Signal Processing, pp 159–184. https://doi.org/10.5772/7448
Park S, Bang S, Kim H, Kim H (2019) Patch-based crack detection in black box images using convolutional neural networks. J Comput Civ Eng 33:04019017. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000831
Article Google Scholar
Qian N (1999) On the momentum term in gradient descent learning algorithms. Neural Netw 12:145–151. https://doi.org/10.1016/S0893-6080(98)00116-6
Article Google Scholar
Qu Z, Bai L, An S-Q et al (2016) Lining seam elimination algorithm and surface crack detection in concrete tunnel lining. Journal of Electronic Imaging 25:063004. https://doi.org/10.1117/1.JEI.25.6.063004
Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, real-time object detection. Proceedings of the IEEE conference on computer vision and pattern recognition, Las Vegas, NV, pp 779–788. https://doi.org/10.1109/CVPR.2016.91
Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39:1137–1149. https://doi.org/10.1109/TPAMI.2016.2577031
Article Google Scholar
Ronneberger O, Fischer P, Brox T (2015) U-net: convolutional networks for biomedical image segmentation. Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015. MICCAI 2015. Lecture Notes in Computer Science, vol 9351. Springer, Cham. https://doi.org/10.1007/978-3-319-24574-4_28
Ruder S (2016) An overview of gradient descent optimization algorithms. arXiv preprint arXiv:160904747. http://arxiv.org/abs/1609.04747
Shi Y, Cui L, Qi Z et al (2016) Automatic road crack detection using random structured forests. IEEE Trans Intell Transp Syst 17:3434–3445. https://doi.org/10.1109/TITS.2016.2552248
Article Google Scholar
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. 3rd International Conference on Learning Representations (ICLR 2015), Computational and Biological Learning Society, pp 1–14
Subirats P, Dumoulin J, Legeay V, Barba D (2006) Automation of pavement surface crack detection using the continuous wavelet transform. 2006 International Conference on Image Processing, Atlanta, GA, pp 3037–3040. https://doi.org/10.1109/ICIP.2006.313007
Sudre CH, Li W, Vercauteren T, et al (2017) Generalised dice overlap as a deep learning loss function for highly unbalanced segmentations. Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support. DLMIA ML-CDS 2017. Lecture Notes in Computer Science, vol 10553. Springer, Cham. https://doi.org/10.1007/978-3-319-67558-9_28
Szegedy C, Liu W, Jia Y et al (2015) Going deeper with convolutions. Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–9. https://doi.org/10.1109/CVPR.2015.7298594
Wang G, Zhuang Y, Chen H et al (2022) FSoD-Net: full-scale object detection from optical remote sensing imagery. IEEE Trans Geosci Remote Sens 60:1–18. https://doi.org/10.1109/TGRS.2021.3064599
Woo S, Park J, Lee J-Y, Kweon IS (2018) Cbam: convolutional block attention module. Proceedings of the European conference on computer vision (ECCV), pp 3–19. https://doi.org/10.1007/978-3-030-01234-2_1
Xie S, Tu Z (2015) Holistically-nested edge detection. Proceedings of the IEEE international conference on computer vision, pp 1395–1403. https://doi.org/10.1109/ICCV.2015.164
Zeiler MD (2012) Adadelta: an adaptive learning rate method. arXiv preprint arXiv:12125701. https://arxiv.org/abs/1212.5701
Zhang L, Yang F, Zhang YD, Zhu YJ (2016) Road crack detection using deep convolutional neural network. 2016 IEEE International Conference on Image Processing (ICIP), pp 3708–3712. https://doi.org/10.1109/ICIP.2016.7533052
Zou Q, Zhang Z, Li Q et al (2018) Deepcrack: learning hierarchical convolutional features for crack detection. IEEE Trans Image Process 28:1498–1512. https://doi.org/10.1109/TIP.2018.2878966
Zou Q, Cao Y, Li Q et al (2012) CrackTree: automatic crack detection from pavement images. Pattern Recogn Lett 33:227–238. https://doi.org/10.1016/j.patrec.2011.11.004

Download references

Acknowledgements

This research was funded by the National Key Research and Development Program of China (No. 2021YFC2902000). Thanks are given to the anonymous reviewers for their careful reviews and detailed comments.

Author information

Authors and Affiliations

College of Geoscience and Survey Engineering, China University of Mining and Technology (Beijing), Beijing, 100083, China
Na Xu & Lizhi He
Department of Computing, Hong Kong Polytechnic University, Hung Hom, Kowloon, HKSAR, Hong Kong, China
Qing Li

Authors

Na Xu
View author publications
You can also search for this author in PubMed Google Scholar
Lizhi He
View author publications
You can also search for this author in PubMed Google Scholar
Qing Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Na Xu.

Ethics declarations

Conflict of interest

The authors declare that they have no conflicts of interest.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Xu, N., He, L. & Li, Q. Crack-Att Net: crack detection based on improved U-Net with parallel attention. Multimed Tools Appl 82, 42465–42484 (2023). https://doi.org/10.1007/s11042-023-15201-7

Download citation

Received: 10 December 2021
Revised: 13 February 2023
Accepted: 30 March 2023
Published: 13 April 2023
Issue Date: November 2023
DOI: https://doi.org/10.1007/s11042-023-15201-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Crack-Att Net: crack detection based on improved U-Net with parallel attention

Abstract

Access this article

Similar content being viewed by others

CrackViT: a unified CNN-transformer model for pixel-level crack extraction

AHC-Net: a road crack segmentation network based on dual attention mechanism and multi-feature fusion

Gating attention convolutional networks with dense connection for pixel-level crack detection

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Crack-Att Net: crack detection based on improved U-Net with parallel attention

Abstract

Access this article

Similar content being viewed by others

CrackViT: a unified CNN-transformer model for pixel-level crack extraction

AHC-Net: a road crack segmentation network based on dual attention mechanism and multi-feature fusion

Gating attention convolutional networks with dense connection for pixel-level crack detection

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation