Skip to main content

COG: COnsistent Data AuGmentation for Object Perception

  • Conference paper
  • First Online:
  • 718 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 12624))

Abstract

Recently, data augmentation techniques for training conv-nets emerge one after another, especially focusing on image classification. They’re always applied to object detection without further careful design. In this paper we propose COG, a general domain migration scheme for augmentation. Specifically, based on a particular augmentation, we first analyze its inherent inconsistency, and then adopt an adaptive strategy to rectify ground-truths of the augmented input images. Next, deep detection networks are trained on the rectified data to achieve better performance. Our extensive experiments show that our method COG’s performance is superior to its competitor on detection and instance segmentation tasks. In addition, the results manifest the robustness of COG when faced with hyper-parameter variations, etc.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of CVPR, pp. 770–778 (2016)

    Google Scholar 

  2. Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Proceedings of NIPS, pp. 91–99 (2015)

    Google Scholar 

  3. He, K., Gkioxari, G., Dollár, P., Girshick, R.: Mask R-CNN. In: Proceedings of IEEE ICCV, pp. 2980–2988 (2017)

    Google Scholar 

  4. Lin, T.-Y., et al.: Microsoft COCO: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8693, pp. 740–755. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1_48

    Chapter  Google Scholar 

  5. Shao, S., et al.: CrowdHuman: a benchmark for detecting human in a crowd. arXiv preprint arXiv:1805.00123 (2018)

  6. Chen, P., Liu, s., Hengshuang, Z., Jiaya, J.: Gridmask data augmentation. arXiv preprint arXiv:2001.04086 (2020)

  7. Cubuk, E.D., Zoph, B., Mané, D., Vasudevan, V., Le, Q.V.: AutoAugment: learning augmentation strategies from data. In: Proceedings of IEEE CVPR, pp. 113–123 (2019)

    Google Scholar 

  8. Lim, S., Kim, I., Kim, T., Kim, C., Kim, S.: Fast AutoAugment. arXiv preprint arXiv:1905.00397 (2019)

  9. Hataya, R., Zdenek, J., Yoshizoe, K., Nakayama, H.: Faster AutoAugment: learning augmentation strategies using backpropagation. arXiv preprint arXiv:1911.06987 (2019)

  10. Zhong, Z., Zheng, L., Kang, G., Li, S., Yang, Y.: Random erasing data augmentation. In: AAAI (2020)

    Google Scholar 

  11. DeVries, T., Taylor, G.W.: Improved regularization of convolutional neural networks with cutout. arXiv preprint arXiv:1708.04552 (2017)

  12. Singh, K.K., Hao, Y., Sarmasi, A., Pradeep, G., Yongjae, L.: Hide-and-Seek: a data augmentation technique for weakly-supervised localization and beyond (2018)

    Google Scholar 

  13. Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of IEEE CVPR, pp. 580–587 (2014)

    Google Scholar 

  14. Liu, W., et al.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2

    Chapter  Google Scholar 

  15. Redmon, J., Farhadi, A.: YOLOv3: an incremental improvement. arXiv preprint arXiv:1804.02767 (2018)

  16. Li, Y., Chen, Y., Wang, N., Zhang, Z.: Scale-aware trident networks for object detection. arXiv preprint arXiv:1901.01892 (2019)

  17. Cai, Z., Vasconcelos, N.: Cascade R-CNN: delving into high quality object detection. In: Proceedings of CVPR, pp. 6154–6162 (2018)

    Google Scholar 

  18. Lin, T.Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: Proceedings of IEEE ICCV, pp. 2999–3007 (2017)

    Google Scholar 

  19. Bochkovskiy, A., Wang, C.Y., Liao, H.Y.M.: YOLOv4: optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934 (2020)

  20. Zoph, B., Cubuk, E.D., Ghiasi, G., Lin, T.Y., Shlens, J., Le, Q.V.: Learning data augmentation strategies for object detection. arXiv preprint arXiv:1906.11172 (2019)

  21. Fang, H.s., Sun, J., Wang, R., Gou, M., Li, Y., Lu, C.: InstaBoost: boosting instance segmentation via probability map guided copy-pasting. In: Proceedings of IEEE ICCV, pp. 682–691 (2019)

    Google Scholar 

  22. Chen, K., et al.: MMDetection: open MMLab detection toolbox and benchmark. arXiv preprint arXiv:1906.07155 (2019)

  23. Bodla, N., Singh, B., Chellappa, R., Davis, L.S.: Soft-NMS – improving object detection with one line of code. In: Proceedings of IEEE ICCV, pp. 5562–5570 (2017)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zewen He .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

He, Z., Wu, R., Zhang, D. (2021). COG: COnsistent Data AuGmentation for Object Perception. In: Ishikawa, H., Liu, CL., Pajdla, T., Shi, J. (eds) Computer Vision – ACCV 2020. ACCV 2020. Lecture Notes in Computer Science(), vol 12624. Springer, Cham. https://doi.org/10.1007/978-3-030-69535-4_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-69535-4_9

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-69534-7

  • Online ISBN: 978-3-030-69535-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics