Noise Resistant Focal Loss for Object Detection

Hu, Zibo; Gao, Kun; Zhang, Xiaodian; Dou, Zeyang

doi:10.1007/978-3-030-60639-8_10

Zibo Hu¹⁶,
Kun Gao¹⁶,
Xiaodian Zhang¹⁶ &
…
Zeyang Dou¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 12306))

Included in the following conference series:

Chinese Conference on Pattern Recognition and Computer Vision (PRCV)

1654 Accesses
5 Citations

Abstract

Noise robustness and hard example mining are two important aspects in object detection. A common view is that the two techniques are contradictory and they cannot be combined. In this paper, we show that there is a possibility to combine the best of two techniques. We find that, even using the hard example mining technique, recent deep neural network-based object detectors themselves have abilities to distinguish correct annotations and wrong annotations during the early stage of training. Based on this observation, we design a simple strategy to separate the wrong annotations from training data, reducing their loss weights and correcting their labels during training. The proposed method is simple, it doesn’t add any computational overhead during model inference. Moreover, the proposed method combines the hard example mining and noise resistance property in one model. Experiments on PASCAL VOC and DOTA datasets show that the proposed method not only archieves competitive performances on clean dataset, but also outperforms the baseline by a large margin when data contain severe noise.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Gradient optimization for object detection in learning with noisy labels

Article 23 March 2024

Enabling Deep Residual Networks for Weakly Supervised Object Detection

Cascade Attentive Dropout for Weakly Supervised Object Detection

Article 22 March 2023

References

Arazo, E., Ortego, D., Albert, P., O’Connor, N., McGuinness, K.: Unsupervised label noise modeling and loss correction. In: ICML 2019: Thirty-Sixth International Conference on Machine Learning, pp. 312–321 (2019)
Google Scholar
Chadwick, S., Newman, P.: Training object detectors with noisy data. In: 2019 IEEE Intelligent Vehicles Symposium (IV) (2019)
Google Scholar
Cheng, G., Han, J., Zhou, P., Guo, L.: Multi-class geospatial object detection and geographic image classification based on collection of part detectors. Isprs J. Photogr. Remote Sens. 98(98), 119–132 (2014)
Article Google Scholar
Dietterich, T.G., Lathrop, R.H., Lozano-Prez, T.: Solving the multiple instance problem with axis-parallel rectangles. Artif. Intell. 89(1), 31–71 (1997)
Article Google Scholar
Dollár, P.: Focal loss for dense object detection. IEEE Trans. Pattern Anal. Mach. Intell. PP(99), 2999–3007 (2017)
Google Scholar
Everingham, M., Gool, L., Williams, C.K., Winn, J., Zisserman, A.: The pascal visual object classes (VOC) challenge. Int. J. Comput. Vision 88(2), 303–338 (2010)
Article Google Scholar
Gao, J., Wang, J., Dai, S., Li, L.J., Nevatia, R.: Note-RCNN: noise tolerant ensemble RCNN for semi-supervised object detection. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 9507–9516 (2019)
Google Scholar
Han, B., et al.: Co-teaching: robust training of deep neural networks with extremely noisy labels. arXiv preprint arXiv:1804.06872 (2018)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778 (2016)
Google Scholar
Hoffman, J., et al.: LSDA: large scale detection through adaptation. In: Advances in Neural Information Processing Systems, vol. 27, pp. 3536–3544 (2014)
Google Scholar
Jiang, L., Zhou, Z., Leung, T., Li, L.J., Fei-Fei, L.: Learning data-driven curriculum for very deep neural networks on corrupted labels. In: ICML 2018: Thirty-fifth International Conference on Machine Learning (2018)
Google Scholar
Lin, T.-Y., et al.: Microsoft COCO: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8693, pp. 740–755. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1_48
Chapter Google Scholar
Shrivastava, A., Gupta, A., Girshick, R.: [IEEE 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) - Las Vegas, NV, USA (2016.6.27-2016.6.30)] 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) - training region-based object detectors with online hard example. In: IEEE Conference on Computer Vision & Pattern Recognition (2016)
Google Scholar
Tang, Y., Wang, J., Gao, B., Dellandrea, E., Gaizauskas, R., Chen, L.: Large scale semi-supervised object detection using visual and semantic knowledge transfer. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2119–2128 (2016)
Google Scholar
Uijlings, J.R.R., Popov, S., Ferrari, V.: Revisiting knowledge transfer for training object class detectors. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1101–1110 (2018)
Google Scholar
Wang, X., Shrivastava, A., Gupta, A.: A-fast-RCNN: hard positive generation via adversary for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2606–2615 (2017)
Google Scholar
Xia, G.S., et al.: DOTA: a large-scale dataset for object detection in aerial images. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3974–3983 (2018)
Google Scholar
Zhang, X., Yang, Y., Feng, J.: Learning to localize objects with noisy labeled instances. AAAI 2019 : Thirty-Third AAAI Conference on Artificial Intelligence, vol. 33, no. 1, pp. 9219–9226 (2019)
Google Scholar

Download references

Author information

Authors and Affiliations

Beijing Institute of Technology, Beijing, 100081, China
Zibo Hu, Kun Gao, Xiaodian Zhang & Zeyang Dou

Authors

Zibo Hu
View author publications
You can also search for this author in PubMed Google Scholar
Kun Gao
View author publications
You can also search for this author in PubMed Google Scholar
Xiaodian Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Zeyang Dou
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kun Gao .

Editor information

Editors and Affiliations

Peking University, Beijing, China
Yuxin Peng
Nanjing University of Information Science and Technology, Nanjing, China
Qingshan Liu
Dalian University of Technology, Dalian, China
Huchuan Lu
Chinese Academy of Sciences, Beijing, China
Zhenan Sun
Chinese Academy of Sciences, Beijing, China
Chenglin Liu
Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
Xilin Chen
Peking University, Beijing, China
Hongbin Zha
Nanjing University of Science and Technology, Nanjing, China
Jian Yang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hu, Z., Gao, K., Zhang, X., Dou, Z. (2020). Noise Resistant Focal Loss for Object Detection. In: Peng, Y., et al. Pattern Recognition and Computer Vision. PRCV 2020. Lecture Notes in Computer Science(), vol 12306. Springer, Cham. https://doi.org/10.1007/978-3-030-60639-8_10

Download citation

DOI: https://doi.org/10.1007/978-3-030-60639-8_10
Published: 15 October 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-60638-1
Online ISBN: 978-3-030-60639-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Noise Resistant Focal Loss for Object Detection

Abstract

Access this chapter

Similar content being viewed by others

Gradient optimization for object detection in learning with noisy labels

Enabling Deep Residual Networks for Weakly Supervised Object Detection

Cascade Attentive Dropout for Weakly Supervised Object Detection

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Noise Resistant Focal Loss for Object Detection

Abstract

Access this chapter

Similar content being viewed by others

Gradient optimization for object detection in learning with noisy labels

Enabling Deep Residual Networks for Weakly Supervised Object Detection

Cascade Attentive Dropout for Weakly Supervised Object Detection

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation