Auxiliary Bounding Box Regression for Object Detection in Optical Remote Sensing Imagery

Karim, Shahid; Zhang, Ye; Yin, Shoulin; Bibi, Irfana

doi:10.1007/s11220-020-00319-x

Original Paper
Published: 11 January 2021

Volume 22, article number 5, (2021)

Sensing and Imaging

Shahid Karim¹,²,
Ye Zhang²,
Shoulin Yin² &
…
Irfana Bibi³

535 Accesses
4 Citations

Abstract

Object detection in optical remote sensing imagery must cope with arbitrary object orientations and complex appearance, which remains a major open problem. To address it, this paper evaluates and discusses the post-processing of bounding boxes (BBs) for object detection. The proposed method is divided into two stages: the first stage thresholds BBs by their confidence values, and the second stage applies area-based BB regression (BBR). In BBR, the area of each BB is estimated, and oversized and undersized BBs are then removed relative to the size of the objects being detected. The widely known region-based approaches RCNN, Fast-RCNN and Faster-RCNN are used for evaluation, and a comparative analysis validates the proposed framework. The results show that the proposed post-processing is effective for each of the region-based detectors evaluated.
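
The two stages described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `Box` type, the function name `filter_boxes`, and the parameters `score_thresh`, `min_area` and `max_area` are hypothetical, and the abstract does not state which threshold values or area bounds the paper uses.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Box:
    """Axis-aligned detection (hypothetical type): corner coordinates plus confidence."""
    x1: float
    y1: float
    x2: float
    y2: float
    score: float

    @property
    def area(self) -> float:
        # Clamp to zero so degenerate boxes do not yield a negative area.
        return max(0.0, self.x2 - self.x1) * max(0.0, self.y2 - self.y1)


def filter_boxes(boxes: List[Box], score_thresh: float,
                 min_area: float, max_area: float) -> List[Box]:
    """Two-stage post-processing sketch.

    Stage 1: keep boxes whose confidence is at least score_thresh.
    Stage 2: drop boxes whose area falls outside [min_area, max_area],
    i.e. boxes that are undersized or oversized for the target objects.
    All three parameters are assumed values chosen by the user.
    """
    confident = [b for b in boxes if b.score >= score_thresh]
    return [b for b in confident if min_area <= b.area <= max_area]


# Illustrative detections (made-up numbers, not data from the paper).
detections = [
    Box(0, 0, 10, 10, 0.90),    # plausible size, high confidence
    Box(0, 0, 100, 100, 0.95),  # oversized
    Box(0, 0, 2, 2, 0.80),      # undersized
    Box(5, 5, 15, 17, 0.30),    # plausible size, low confidence
]
kept = filter_boxes(detections, score_thresh=0.5, min_area=50.0, max_area=500.0)
print(kept)
```

In this sketch the area bounds would be set from the expected pixel size of the object class at the imagery's resolution; how the paper derives them is not stated in the abstract.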

Acknowledgments

This work was supported by the National Natural Science Foundation of China under Grant 61471148.

Author information

Authors and Affiliations

Department of Computer Science, ILMA University, Karachi, Pakistan
Shahid Karim
School of Electronics and Information Engineering, Harbin Institute of Technology, Harbin, 150001, China
Shahid Karim, Ye Zhang & Shoulin Yin
School of Computer Science and Technology, Xidian University, Xi’an, 710071, China
Irfana Bibi

Corresponding author

Correspondence to Shahid Karim.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Karim, S., Zhang, Y., Yin, S. et al. Auxiliary Bounding Box Regression for Object Detection in Optical Remote Sensing Imagery. Sens Imaging 22, 5 (2021). https://doi.org/10.1007/s11220-020-00319-x

Received: 28 March 2020
Revised: 16 August 2020
Accepted: 10 October 2020
Published: 11 January 2021
DOI: https://doi.org/10.1007/s11220-020-00319-x
