Skip to main content
Log in

Auxiliary Bounding Box Regression for Object Detection in Optical Remote Sensing Imagery

  • Original Paper
  • Published:
Sensing and Imaging Aims and scope Submit manuscript

Abstract

Object detection in optical remote sensing imagery is being explored to deal with arbitrary orientations and complex appearance which is still a major issue in recent years. To perceive a better solution to the addressed problem, the post-processing of bounding boxes (BBs) has been evaluated and discussed for the applications of object detection. In this paper, the proposed method has divided into two stages; the first stage is based on thresholding of BBs with respect to the confidence values and the second stage is based on the area-based BB regression (BBR). In BBR, the area of each BB was estimated then the oversized and undersized BBs were removed with respect to the size of objects which are being detected. The widely known region-based approaches RCNN, Fast-RCNN and Faster-RCNN are used for evaluation and comparative analysis validates the proposed framework. The results show that the proposed post-processing is very effective for each kind of region-based detector.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

References

  1. Tayara, H., Soo, K. G., & Chong, K. T. (2018). Vehicle detection and counting in high-resolution aerial images using convolutional regression neural network. IEEE Access, 6, 2220–2230.

    Article  Google Scholar 

  2. Li, K., Cheng, G., Bu, S., & You, X. (2018). Rotation-insensitive and context-augmented object detection in remote sensing images. IEEE Transactions on Geoscience and Remote Sensing, 56(4), 2337–2348.

    Article  Google Scholar 

  3. Koga, Y., Miyazaki, H., & Shibasaki, R. (2018). A CNN-based method of vehicle detection from aerial images using hard example mining. Remote Sensing, 10(1), 124.

    Article  Google Scholar 

  4. Bazi, Y., & Melgani, F. (2018). Convolutional SVM networks for object detection in UAV imagery. IEEE Transactions on Geoscience and Remote Sensing, 56(6), 3107–3118.

    Article  Google Scholar 

  5. ElMikaty, M., & Stathaki, T. (2018). Car detection in aerial images of dense urban areas. IEEE Transactions on Aerospace and Electronic Systems, 54(1), 51–63.

    Article  Google Scholar 

  6. Qiu, S., Wen, G., Deng, Z., Liu, J., & Fan, Y. (2018). Accurate non-maximum suppression for object detection in high-resolution remote sensing images. Remote Sensing Letters, 9(3), 237–246.

    Article  Google Scholar 

  7. Ren, S., He, K., Girshick, R., & Sun, J. (2015). Faster R-CNN: Towards real-time object detection with region proposal networks. In Advances in neural information processing systems (pp. 91–99).

  8. Girshick, R., Donahue, J., Darrell, T., Berkeley, U. C., & Malik, J. (2014). Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of IEEE conference on computer vision pattern recognition (Columbus, Ohio) (pp. 2–9). https://doi.org/10.1109/CVPR.2014.81.

  9. Hoiem, D., Chodpathumwan, Y., & Dai, Q. (2012). Diagnosing error in object detectors. In European conference on computer vision (pp. 340–353).

  10. Karim, S., Zhang, Y., Asif, M. R., & Ali, S. (2017). Comparative analysis of feature extraction methods in satellite imagery. Journal of Applied Remote Sensing, 11(4), 42618.

    Article  Google Scholar 

  11. Karim, S., Zhang, Y., Ali, S., & Asif, M. R. (2018). An improvement of vehicle detection under shadow regions in satellite imagery. In Proceedings of SPIE—the international society for optical engineering (Vol. 10615). https://doi.org/10.1117/12.2303518.

  12. Liu, W., et al. (2016). SSD: Single shot multibox detector. In European conference on computer vision (pp. 21–37).

  13. He, K., Zhang, X., Ren, S., & Sun, J. (2014). Spatial pyramid pooling in deep convolutional networks for visual recognition. In European conference on computer vision (pp. 346–361).

  14. Dai, J., Li, Y., He, K., & Sun, J. (2016). R-fcn: Object detection via region-based fully convolutional networks. In Advances in neural information processing systems (pp. 379–387).

  15. Redmon, J., Divvala, S., Girshick, R., & Farhadi, A. (2016). You only look once: Unified, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition (Las Vegas, NV, USA) (pp. 779–788).

  16. Najibi, M., Rastegari, M., & Davis, L. S. (2016). G-cnn: An iterative grid based object detector. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2369–2377).

  17. Uijlings, J. R. R., Van De Sande, K. E. A., Gevers, T., & Smeulders, A. W. M. (2013). Selective search for object recognition. International Journal on Computer Vision, 104(2), 154–171.

    Article  Google Scholar 

  18. Gidaris, S., & Komodakis, N. (2016). Locnet: Improving localization accuracy for object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 789–798).

  19. Dickerson, N. L. (2017). Refining bounding-box regression for object localization. Dissertations and Theses, Paper 3940. https://doi.org/10.15760/etd.5824.

  20. Rezatofighi, H., Tsoi, N., Gwak, J., Sadeghian, A., Reid, I., & Savarese, S. (2019). Generalized intersection over union: A metric and a loss for bounding box regression. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 658–666).

  21. He, Y., Zhu, C., Wang, J., Savvides, M., & Zhang, X. (2019). Bounding box regression with uncertainty for accurate object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2888–2897).

  22. Qian, X., Lin, S., Cheng, G., Yao, X., Ren, H., & Wang, W. (2020). Object detection in remote sensing images based on improved bounding box regression and multi-level features fusion. Remote Sensing, 12(1), 143.

    Article  Google Scholar 

  23. Zheng, Z., Wang, P., Liu, W., Li, J., Ye, R., & Ren, D. (2020). Distance-IoU loss: Faster and better learning for bounding box regression. In AAAI (pp. 12993–13000).

  24. Yuan, D., Chang, X., & He, Z. (2020). Accurate bounding-box regression with distance-IoU loss for visual tracking. arXiv:2007.01864

  25. Girshick, R. (2015). Fast r-cnn. In Proceedings of the IEEE international conference on computer vision (Boston, Massachusetts) (pp. 1440–1448).

  26. Xia, G.-S., et al. (2018). DOTA: A large-scale dataset for object detection in aerial images. In Proceedings of CVPR.

  27. Hariharan, B., Arbeláez, P., Girshick, R., & Malik, J. (2015). Hypercolumns for object segmentation and fine-grained localization. In Proceedings of the IEEE conference on computer vision and pattern recognition, 2015 (pp. 447–456).

Download references

Acknowledgments

This work was supported by the National Natural Science Foundation of China under Grants 61471148.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shahid Karim.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Karim, S., Zhang, Y., Yin, S. et al. Auxiliary Bounding Box Regression for Object Detection in Optical Remote Sensing Imagery. Sens Imaging 22, 5 (2021). https://doi.org/10.1007/s11220-020-00319-x

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11220-020-00319-x

Keywords

Navigation