Abstract
Object detection in optical remote sensing imagery is being explored to deal with arbitrary orientations and complex appearance which is still a major issue in recent years. To perceive a better solution to the addressed problem, the post-processing of bounding boxes (BBs) has been evaluated and discussed for the applications of object detection. In this paper, the proposed method has divided into two stages; the first stage is based on thresholding of BBs with respect to the confidence values and the second stage is based on the area-based BB regression (BBR). In BBR, the area of each BB was estimated then the oversized and undersized BBs were removed with respect to the size of objects which are being detected. The widely known region-based approaches RCNN, Fast-RCNN and Faster-RCNN are used for evaluation and comparative analysis validates the proposed framework. The results show that the proposed post-processing is very effective for each kind of region-based detector.
Similar content being viewed by others
References
Tayara, H., Soo, K. G., & Chong, K. T. (2018). Vehicle detection and counting in high-resolution aerial images using convolutional regression neural network. IEEE Access, 6, 2220–2230.
Li, K., Cheng, G., Bu, S., & You, X. (2018). Rotation-insensitive and context-augmented object detection in remote sensing images. IEEE Transactions on Geoscience and Remote Sensing, 56(4), 2337–2348.
Koga, Y., Miyazaki, H., & Shibasaki, R. (2018). A CNN-based method of vehicle detection from aerial images using hard example mining. Remote Sensing, 10(1), 124.
Bazi, Y., & Melgani, F. (2018). Convolutional SVM networks for object detection in UAV imagery. IEEE Transactions on Geoscience and Remote Sensing, 56(6), 3107–3118.
ElMikaty, M., & Stathaki, T. (2018). Car detection in aerial images of dense urban areas. IEEE Transactions on Aerospace and Electronic Systems, 54(1), 51–63.
Qiu, S., Wen, G., Deng, Z., Liu, J., & Fan, Y. (2018). Accurate non-maximum suppression for object detection in high-resolution remote sensing images. Remote Sensing Letters, 9(3), 237–246.
Ren, S., He, K., Girshick, R., & Sun, J. (2015). Faster R-CNN: Towards real-time object detection with region proposal networks. In Advances in neural information processing systems (pp. 91–99).
Girshick, R., Donahue, J., Darrell, T., Berkeley, U. C., & Malik, J. (2014). Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of IEEE conference on computer vision pattern recognition (Columbus, Ohio) (pp. 2–9). https://doi.org/10.1109/CVPR.2014.81.
Hoiem, D., Chodpathumwan, Y., & Dai, Q. (2012). Diagnosing error in object detectors. In European conference on computer vision (pp. 340–353).
Karim, S., Zhang, Y., Asif, M. R., & Ali, S. (2017). Comparative analysis of feature extraction methods in satellite imagery. Journal of Applied Remote Sensing, 11(4), 42618.
Karim, S., Zhang, Y., Ali, S., & Asif, M. R. (2018). An improvement of vehicle detection under shadow regions in satellite imagery. In Proceedings of SPIE—the international society for optical engineering (Vol. 10615). https://doi.org/10.1117/12.2303518.
Liu, W., et al. (2016). SSD: Single shot multibox detector. In European conference on computer vision (pp. 21–37).
He, K., Zhang, X., Ren, S., & Sun, J. (2014). Spatial pyramid pooling in deep convolutional networks for visual recognition. In European conference on computer vision (pp. 346–361).
Dai, J., Li, Y., He, K., & Sun, J. (2016). R-fcn: Object detection via region-based fully convolutional networks. In Advances in neural information processing systems (pp. 379–387).
Redmon, J., Divvala, S., Girshick, R., & Farhadi, A. (2016). You only look once: Unified, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition (Las Vegas, NV, USA) (pp. 779–788).
Najibi, M., Rastegari, M., & Davis, L. S. (2016). G-cnn: An iterative grid based object detector. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2369–2377).
Uijlings, J. R. R., Van De Sande, K. E. A., Gevers, T., & Smeulders, A. W. M. (2013). Selective search for object recognition. International Journal on Computer Vision, 104(2), 154–171.
Gidaris, S., & Komodakis, N. (2016). Locnet: Improving localization accuracy for object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 789–798).
Dickerson, N. L. (2017). Refining bounding-box regression for object localization. Dissertations and Theses, Paper 3940. https://doi.org/10.15760/etd.5824.
Rezatofighi, H., Tsoi, N., Gwak, J., Sadeghian, A., Reid, I., & Savarese, S. (2019). Generalized intersection over union: A metric and a loss for bounding box regression. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 658–666).
He, Y., Zhu, C., Wang, J., Savvides, M., & Zhang, X. (2019). Bounding box regression with uncertainty for accurate object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2888–2897).
Qian, X., Lin, S., Cheng, G., Yao, X., Ren, H., & Wang, W. (2020). Object detection in remote sensing images based on improved bounding box regression and multi-level features fusion. Remote Sensing, 12(1), 143.
Zheng, Z., Wang, P., Liu, W., Li, J., Ye, R., & Ren, D. (2020). Distance-IoU loss: Faster and better learning for bounding box regression. In AAAI (pp. 12993–13000).
Yuan, D., Chang, X., & He, Z. (2020). Accurate bounding-box regression with distance-IoU loss for visual tracking. arXiv:2007.01864
Girshick, R. (2015). Fast r-cnn. In Proceedings of the IEEE international conference on computer vision (Boston, Massachusetts) (pp. 1440–1448).
Xia, G.-S., et al. (2018). DOTA: A large-scale dataset for object detection in aerial images. In Proceedings of CVPR.
Hariharan, B., Arbeláez, P., Girshick, R., & Malik, J. (2015). Hypercolumns for object segmentation and fine-grained localization. In Proceedings of the IEEE conference on computer vision and pattern recognition, 2015 (pp. 447–456).
Acknowledgments
This work was supported by the National Natural Science Foundation of China under Grants 61471148.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Karim, S., Zhang, Y., Yin, S. et al. Auxiliary Bounding Box Regression for Object Detection in Optical Remote Sensing Imagery. Sens Imaging 22, 5 (2021). https://doi.org/10.1007/s11220-020-00319-x
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11220-020-00319-x