Abstract
Road extraction from aerial imagery is not a trivial task. It plays a pivotal role in urban planning, navigation, disaster assessment and various other fields. It poses challenges due to complex scenarios and factors, including occlusion. Hence conventional methods prove to be inefficient for the purpose. Image segmentation and deep learning models are extensively employed in recent times to extract objects from images. In this paper, the performance of Unet architecture-based model has been improved by Resnet50, VGG16, DenseNet169, Xception and Efficientnet-b4. Further, to investigate the performance of Unet model, three other models FPN, PSPNet and PAN were implemented and evaluated on Massachusetts road dataset. The work presents the comparative analyses of the performance of models.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Sujatha C, Selvathi D (2015) Connected component-based technique for automatic extraction of road centreline in high resolution satellite images. EURASIP J Image Video Process 2015(1):8
Alshehhi R, Marpu PR (2017) Hierarchical graph-based segmentation for extracting road networks from high-resolution satellite images. ISPRS J Photogramm Remote Sens 126:245–260
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv 2014, arXiv:1409.1556
Yu F, Koltun V (2015) Multi-scale context aggregation by dilated convolutions. arXiv 2015, arXiv:1511.07122
Zhou L, Zhang C, Wu M (1997) D-linknet: LinkNet with pretrained encoder and dilated convolution for high resolution satellite imagery road extraction. In: Proceedings of the IEEE conference on computer vision and pattern recognition work-shops, San Juan, PR, USA, 17–19 June 1997, pp 182–186
Chaurasia A, Culurciello E (2017) LinkNet: exploiting encoder representations for efficient semantic segmentation. arXiv 2017, arXiv:1707.03718v1
Zhou M, Sui H, Chen S, Wang J, Chen X (2020) BT-RoadNet: a boundary and topologically-aware neural network for road extraction from high-resolution remote sensing imagery. ISPRS J Photogramm Remote Sens 168:288–306
Chen Z, Wang C, Li J, Xie N, Han Y, Du J (2021) Reconstruction bias U-Net for road extraction from optical remote sensing images. IEEE J Sel Top Appl Earth Obs Remote Sens 14:2284–2294
Dey MS, Chaudhuri U, Banerjee B, Bhattacharya A (2021) Dual-path morph-UNet for road and building segmentation from satellite images. IEEE Geosci Remote Sens Lett 19:1–5
Zheng W, Tian X, Yang B, Liu S, Ding Y, Tian J, Yin L (2022) A few shot classification methods based on multiscale relational networks. Appl Sci 12:4059
Geng Q, Zhang H, Qi X, Huang G, Yang R, Zhou Z (2021) Gated path selection network for semantic segmentation. IEEE Trans Image Process 30:2436–2449
Yuan Q, Shen H, Li T et al (2020) Deep learning in environmental remote sensing: achievements and challenges. Remote Sens Environ 241, Article ID 111716
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436–444; Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. In: Proceedings of the international conference on learning representations, San Diego, CA, USA, 7–9 May 2015
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), Los Alamitos, CA, USA, 27–30 June 2016, pp 770–778
Huang G, Liu Z, Van Der Maaten L, Weinberger KQ (2017) Densely connected convolutional networks. In: Proceedings of the 30th IEEE conference on computer vision and pattern recognition, Honolulu, HI, USA, 21–26 July 2017, pp 2261–2269
Mnih V, Hinton GE (2010) Learning to detect roads in high-resolution aerial images. In: Proceedings of the European conference on computer vision, Heraklion, Crete, Greece, 5–11 Sept 2010, pp 210–223
Mnih V (2013) Machine learning for aerial image labeling. Ph.D. thesis, University of Toronto, Toronto, ON, Canada
Wang J, Song J, Chen M, Yang Z (2015) Road network extraction: a neural-dynamic framework based on deep learning and a finite state machine. Int J Remote Sens 36:3144–3169
Alshehhi R, Marpu PR, Woon WL, Mura MD (2017) Simultaneous extraction of roads and buildings in remote sensing imagery with convolutional neural networks. ISPRS J Photogramm Remote Sens 130:139–149
Rezaee M, Zhang Y (2017) Road detection using deep neural network in high spatial resolution images. In: Proceedings of the joint urban remote sensing event (JURSE 2017), Dubai, United Arab Emirates, 6–8 Mar 2017, pp 1–4
Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, Boston, MA, USA, 7–12 June 2015, pp 3431–3440
Ronneberger O, Fischer P, Brox T (2015) U-Net: convolutional networks for biomedical image segmentation. In: Proceedings of the medical image computing and computer-assisted intervention, Munich, Germany, 5–9 Oct 2015, pp 234–241
Badrinarayanan V, Kendall A, Cipolla R (2017) SegNet: a deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans Pattern Anal Mach Intell 39:2481–2495
Chen LC, Papandreou G, Schroff F, Adam H (2017) Rethinking atrous convolution for semantic image segmentation. arXiv 2017, arXiv:1706.05587
Chen LC, Zhu Y, Papandreou G, Schroff F, Adam H (2018) Encoder-decoder with atrous separable convolution for semantic image segmentation. arXiv 2018, arXiv:1802.02611
Gao L, Song W, Dai J, Chen Y (2019) Road extraction from high resolution remote sensing imagery using refined deep residual convolutional neural network. Remote Sens. (ii):1–16
Mnih V (2013) Machine learning for aerial image labeling. Toronto
https://towardsdatascience.com/unet-line-by-line-explanation-9b191c76baf5
Li T, Comer M, Zerubia J (2019) Feature extraction and tracking of CNN segmentations for improved road detection from satellite imagery. In: ICIP 2019—IEEE international conference on image processing, Sept 2019, Taipei, Taiwan. ffhal-01813781v2f
Ye L, Wang L, Zhang W, Li Y, Wang Z (2019) Deep metric learning method for high resolution remote sensing image scene classification 48(6):698
Liu Y, Minh Nguyen D, Deligiannis N, Ding W, Munteanu AJRS (2017) Hourglass-ShapeNetwork based semantic segmentation for high resolution aerial imagery. Remote Sens 9(6):522
Hamaguchi R, Fujita A, Nemoto K, Imaizumi T, Hikosaka S (2018) Effective use of dilated convolutions for segmenting small object instances in remote sensing imagery. In: Proceedings of the 2018 IEEE winter conference on applications of computer vision (WACV). IEEE, Lake Tahoe, Nevada, USA, Mar 2018, pp 1442–1450
Wang H, Wang Y, Zhang Q, Xiang S, Pan CJRS (2017) Gated convolutional neural network for semantic segmentation in high-resolution images. Remote Sens 9(5):446
Shang R, Zhang J, Jiao L, Li Y, Marturi N, Stolkin RJRS (2020) Multi-scale adaptive feature fusion network for semantic segmentation in remote sensing images. Remote Sens 12(5):872
Zhao H, Shi J, Qi X, Wang X, Jia J (2017) Pyramid scene parsing network. In: Proceedings of the IEEE conference on computer vision and pattern recognition, Honolulu, HI, USA, July 2017, pp 1442–1450
Chaurasia A, Culurciello E (2017) LinkNet: exploiting encoder representations for efficient semantic segmentation. In: Proceedings of the 2017 IEEE visual communications and image processing (VCIP). IEEE, Petersburg, FL, USA, Dec 2017, pp 1–4
Lin T-Y, Dollár P, Girshick R, He K, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, Honolulu, HI, USA, July 2017, pp 2117–2125
https://www.kaggle.com/datasets/balraj98/massachusetts-roads-dataset
https://medium.com/@dhanush.patel/imagesegmentation-6950eb534d05
Russakovsky O, Deng J, Su H et al (2015) Imagenet large scale visual recognition challenge. Int J Comput Vision 115(3):211–252
https://segmentation-modelspytorch.readthedocs.io/en/latest/
Abdollahi A, Pradhan B, Shukla N, Chakraborty S, Alamri AM (2020) Deep learning approaches applied to remote sensing datasets for road extraction: a state-of-the-art review. Remote Sens 12:1444
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Kumar, A., Izharul Hasan Ansari, M., Garg, A. (2024). Deep Convolutional Encoder–Decoder Models for Road Extraction from Aerial Imagery. In: Joshi, A., Mahmud, M., Ragel, R.G., Karthik, S. (eds) ICT: Innovation and Computing. ICTCS 2023. Lecture Notes in Networks and Systems, vol 879. Springer, Singapore. https://doi.org/10.1007/978-981-99-9486-1_1
Download citation
DOI: https://doi.org/10.1007/978-981-99-9486-1_1
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-9485-4
Online ISBN: 978-981-99-9486-1
eBook Packages: EngineeringEngineering (R0)