Skip to main content

Advertisement

Log in

Optimized building extraction from high-resolution satellite imagery using deep learning

  • 1220: Visual and Sensory Data Processing for Real Time Intelligent Surveillance System
  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Building extraction is very essential in various urban dynamics like disaster management and change detection, finding the estimated population, and so on. Building extraction from satellite data is a challenging task as the images may be subjected to different illumination or structure due to very large variations of the appearance of buildings which may correspond to the different area/terrain. Although satellite imagery is readily available from various sources, translating the imagery includes intensive effort. Many computer-vision tasks have been carried out successfully but understanding the impact of them on building extraction with remote sensing imagery is a growing need.To overcome this kind of problem, an algorithm is proposed which extends the convolutional neural network for pixel-wise classification of images. Furthermore, to resolve the problem of extraction and masking of images, Mask-RCNN (i.e., Mask Region-based Convolutional Neural Network) algorithm is used which makes this process easier and more efficient.The model is trained on a complex dataset that is significantly larger. Also, to make this algorithm more scalable, an advanced image augmentation technique is used in the pre-processing step.The results show that the algorithm achieves better performance in terms of accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Data availability

The data samples have been taken using crowdAI.

Code availability

The relevant code with the manuscript is also avail- able and would be available, if will be asked to do so later.

References

  1. Azimi SM et al (2019) Towards multi-class object detection in unconstrained remote sensing imagery. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 11363 LNCS, pp 150–165. https://doi.org/10.1007/978-3-030-20893-6_10

  2. Ardila JP et al (2012)Context-sensitive extraction of tree crown objects in urban areas using VHR satellite images. Int J Appl Earth Obs Geoinf. Elsevier B.V. 15(1):57–69. https://doi.org/10.1016/j.jag.2011.06.005

  3. Akter S, Wamba SF (2016) ‘Big data analytics in E-commerce: a systematic review and agenda for future research. Electron Markets Electron Markets 26(2):173–194. https://doi.org/10.1007/s12525-016-0219-0

    Article  Google Scholar 

  4. Chen K, Fu K, Gao X, Yan M, Sun X, Zhang H (2017) Building extraction from remote sensing images with deep learning in a supervised manner. In 2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS). IEEE, pp 1672–1675

  5. Cook K, Wright J (1975) Transmission Systems. 20(January 15, 1975), pp 219–223. https://doi.org/10.1049/pbte071e_ch4

  6. Duan Y, Sun L (2019) Buildings extraction from remote sensing data using deep learning method based on improved U-Net network. In: IGARSS 2019–2019 IEEE International Geoscience and Remote Sensing Symposium. IEEE, pp 3959–3961

  7. Gupta A, Anand R, Pandey D, Sindhwani N, Wairya S, Pandey BK, Sharma M (2021) Prediction of breast cancer using Extremely Randomized Clustering Forests (ERCF) Technique: Prediction of breast cancer. Int J Distrib Syst Technol (IJDST) 12(4):1–15

    Article  Google Scholar 

  8. Ghiasi G, Fowlkes CC (2016) Laplacian pyramid reconstruction and refinement for semantic segmentation. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 9907 LNCS, pp 519–534. https://doi.org/10.1007/978-3-319-46487-9_32

  9. Güler RA, Neverova N, Kokkinos I (2018) DensePose: Dense human pose estimation in the wild. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp 7297–7306. https://doi.org/10.1109/CVPR.2018.00762

  10. Girshick R (2015) Fast R-CNN. Proceedings of the IEEE International Conference on Computer Vision. 2015 Inter, pp 1440–1448. https://doi.org/10.1109/ICCV.2015.169

  11. Huang Z, Cheng G, Wang H, Li H, Shi L, Pan C (2016) Building extraction from multi-source remote sensing images via deep deconvolution neural networks. In: 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS). IEEE, pp 1835–1838

  12. Huang X, Zhang L (2012) Morphological building/shadow index for building extraction from high-resolution imagery over urban areas. IEEE J Sel Top Appl Earth Obs Remote Sens 5(1):161–172. https://doi.org/10.1109/JSTARS.2011.2168195

    Article  Google Scholar 

  13. He K et al (2017) Mask R-CNN. Proceedings of the IEEE International Conference on Computer Vision, 2017-Octob, pp 2980–2988. https://doi.org/10.1109/ICCV.2017.322

  14. Hui J, Du M, Ye X, Qin Q, Sui J (2018) Effective building extraction from high-resolution remote sensing images with multitask driven deep neural network. IEEE Geosci Remote Sens Lett 16(5):786–790

    Article  Google Scholar 

  15. Jin X, Davis CH (2005) Automated building extraction from high-resolution satellite imagery in Urban areas using structural, contextual, and spectral information. EURASIP J Appl Sig Process 2005(14):2196–2206. https://doi.org/10.1155/ASP.2005.2196

    Article  MATH  Google Scholar 

  16. Liu Z et al (2015) ‘Deep learning face attributes in the wild. Proceedings of the IEEE International Conference on Computer Vision, 2015 Inter, pp 3730–3738. https://doi.org/10.1109/ICCV.2015.425

  17. Liu Y, Zhou J, Qi W, Li X, Gross L, Shao Q, ... Li Z (2020) ARC-Net: an efficient network for building extraction from high-resolution aerial images. IEEE Access 8:154997–155010

  18. Li W et al (2006) A novel framework for urban change detection using VHR satellite images. Proceedings - International Conference on Pattern Recognition 2, pp 312–315. https://doi.org/10.1109/ICPR.2006.138

  19. Li X, Yao X, Fang Y (2018) Building-a-nets: Robust building extraction from high-resolution remote sensing images with adversarial networks. IEEE J Sel Top Appl Earth Obs Remote Sens 11(10):3680–3687

    Article  Google Scholar 

  20. Liu Y, Gross L, Li Z, Li X, Fan X, Qi W (2019) Automatic building extraction on high-resolution remote sensing imagery using deep convolutional encoder-decoder with spatial pyramid pooling. IEEE Access 7:128774–128786

    Article  Google Scholar 

  21. Marmanis D, Adam F, Datcu M, Esch T, Stilla U (2015) Deep neural networks for above-ground detection in very high spatial resolution digital elevation models. ISPRS Ann Photogramm Remote Sens Spat Inf Sci 2(3):103

    Article  Google Scholar 

  22. Majd RD, Momeni M, Moallem P (2019) Transferable object-based framework based on deep convolutional neural networks for building extraction. IEEE J Sel Top Appl Earth Observations Remote Sens 12(8):2627–2635

    Article  Google Scholar 

  23. Meivel S, Sindhwani N, Anand R, Pandey D, Alnuaim AA, Altheneyan AS, ... Lelisho ME (2022) Mask detection and social distance identification using internet of things and faster R-CNN algorithm. Comput Intell Neurosci 2022:2103975. https://doi.org/10.1155/2022/2103975

  24. Madhumathy P, Pandey D (2022) Deep learning based photo acoustic imaging for non-invasive imaging. Multimed Tools Appl. https://doi.org/10.1007/s11042-022-11903-6

  25. Pandey D, Pandey BK (2022) An efficient deep neural network with adaptive galactic swarm optimization for complex image text extraction. Process Mining Techniques for Pattern Recognition. CRC Press, pp 121–137

  26. Pandey BK, Pandey D, Wariya S, Aggarwal G, Rastogi R (2021) Deep learning and particle swarm optimisation-based techniques for visually impaired humans’ text recognition and identification. Augmented Hum Res 6(1):1–14

    Article  Google Scholar 

  27. Pal SK, Mitra S (1992) Multilayer perceptron, fuzzy sets, and classification. IEEE Trans Neural Netw 3(5):683–697. https://doi.org/10.1109/72.159058

    Article  Google Scholar 

  28. Pandey BK, Mane D, Nassa VKK, Pandey D, Dutta S, Ventayen RJM, ... Rastogi R (2021) Secure text extraction from complex degraded images by applying steganography and deep learning. In: Multidisciplinary Approach to Modern Digital Steganography. IGI Global, pp 146-163

  29. Pandey BK, Pandey D, Wariya S, Agarwal G (2021) A deep neural network-based approach for extracting textual images from deteriorate images. EAI Endorsed Trans Ind Netw Intell Syst 8(28):e3

  30. Papadomanolaki M et al (2016) Benchmarking deep learning frameworks for the classification of very high resolution satellite multispectral data. ISPRS Ann Photogramm Remote Sens Spat Inf Sci III–7(July):83–88. https://doi.org/10.5194/isprsannals-iii-7-83-2016

  31. Pandey D, Pandey B, Wairya S (2021) Hybrid deep neural network with adaptive galactic swarm optimization for text extraction from scene images. Soft Comput 25:1563–1580. https://doi.org/10.1007/s00500-020-05245-4

    Article  Google Scholar 

  32. Sindhwani N, Verma S, Bajaj T, Anand R (2021) Comparative analysis of intelligent driving and safety assistance systems using YOLO and SSD model of deep learning. Int J Inform Syst Model Des (IJISMD) 12(1):131–146

    Article  Google Scholar 

  33. Vakalopoulou M et al (2015) Building detection in very high resolution multispectral data with deep learning features. International Geoscience and Remote Sensing Symposium (IGARSS), 2015-Novem, pp 1873–1876. https://doi.org/10.1109/IGARSS.2015.7326158

  34. Wang Y, Gu L, Li X, Ren R (2020) Building extraction in multitemporal high-resolution remote sensing imagery using a multifeature LSTM network.IEEE Geosci Remote Sensing Lett. https://doi.org/10.1109/LGRS.2020.3005018

  35. Wang M, Yuan S, Pan J (2013) Building detection in high resolution satellite urban image using segmentation, corner detection combined with adaptive windowed Hough Transform. State Key Laboratory of Information Engineering in Surveying, Mapping and Remote. Geoscience and Remote Sensing Symposium (IGARSS), 2013 IEEE International, pp 508–511. https://doi.org/10.1109/IGARSS.2013.6721204

  36. Wang X, Shrivastava A, Gupta A (2017) A-Fast-RCNN: Hard positive generation via adversary for object detection. Proceedings – 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, 2017-Janua, pp 3039–3048. https://doi.org/10.1109/CVPR.2017.324

  37. Xu Y et al (2018) Building extraction in very high resolution remote sensing imagery using deep learning and guided filters. Remote Sens 10(1). https://doi.org/10.3390/rs10010144

  38. Yuan J (2016) Automatic building extraction in aerial scenes using convolutional networks. arXiv preprint arXiv:1602.06564

  39. Singh SK, Thakur RK, Kumar S, Anand R (2022, March) Deep Learning and Machine Learning based Facial Emotion Detection using CNN. In 2022 9th International Conference on Computing for Sustainable Global Development (INDIACom) (pp 530–535). IEEE

  40. Saini P, Anand MR (2014) Identification of Defects in Plastic Gears Using Image Processing and Computer Vision: A Review. Int J Eng Res 3(2):94–99

Download references

Acknowledgements

The authors would like to express gratitude to Department of Technical Education and Panipat Institute of Engineering & Technology, Panipat, India. The authors would also like to thank to Vice Chancellor, Dr. A.P.J. Abdul Kalam Technical University, and Uttar Pradesh, India.

Author information

Authors and Affiliations

Authors

Contributions

All authors approve the final manuscript.

Corresponding author

Correspondence to Digvijay Pandey.

Ethics declarations

Ethics approval

Not Applicable (as the results of studies does not involve any human or animal).

Consent to participate

Not Applicable (as the results of studies does not involve any human or animal).

Consent for publication

Not Applicable (as the results of studies does not involve any human or animal).

Conflict of interest / Competing interests

The authors declare that they have ‘no known conflict of interests or personal relationships’ that could have appeared to influence the work reported in this paper.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Raghavan, R., Verma, D.C., Pandey, D. et al. Optimized building extraction from high-resolution satellite imagery using deep learning. Multimed Tools Appl 81, 42309–42323 (2022). https://doi.org/10.1007/s11042-022-13493-9

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-022-13493-9

Keywords

Navigation