Skip to main content

From TrashCan to UNO: Deriving an Underwater Image Dataset to Get a More Consistent and Balanced Version

  • Conference paper
  • First Online:
Pattern Recognition, Computer Vision, and Image Processing. ICPR 2022 International Workshops and Challenges (ICPR 2022)

Abstract

The multiplication of publicly available datasets makes it possible to develop Deep Learning models for many real-world applications. However, some domains are still poorly explored, and their related datasets are often small or inconsistent. In addition, some biases linked to the dataset construction or labeling may give the impression that a model is particularly efficient. Therefore, evaluating a model requires a clear understanding of the database. Moreover, a model often reflects a given dataset’s performance and may deteriorate if a shift exists between the training dataset and real-world data.

In this paper, we derive a more consistent and balanced version of the TrashCan [6] image dataset, called UNO, to evaluate models for detecting non-natural objects in the underwater environment. We propose a method to balance the number of annotations and images for cross-evaluation. We then compare the performance of a SOTA object detection model when using TrashCAN and UNO datasets. Additionally, we assess covariate shift by testing the model on an image dataset for real-world application. Experimental results show significantly better and more consistent performance using the UNO dataset.

The UNO database and the code are publicly available at:

https://www.lirmm.fr/uno and

https://github.com/CBarrelet/balanced_kfold.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.kaggle.com/henryhaefliger/deepseawaste .

  2. 2.

    https://conservancy.umn.edu/handle/11299/214865.

  3. 3.

    https://github.com/CBarrelet/balanced_kfold.

  4. 4.

    https://www.nvidia.com/.

References

  1. JAMSTEC: Japan Agency for Marine Earth Science and Technology. https://www.jamstec.go.jp/e/

  2. Canals, M., et al.: The quest for seafloor Macrolitter: a critical review of background knowledge, current methods and future prospects. Environ. Res. Lett., November 2020. https://doi.org/10.1088/1748-9326/abc6d4. publisher: IOP Publishing

  3. Ferrera, M., Creuze, V., Moras, J., Trouvé-Peloux, P.: AQUALOC: an underwater dataset for visual-inertial-pressure localization. Int. J. Rob. Res. 38(14), 1549–1559 (2019). https://doi.org/10.1177/0278364919883346

  4. Fulton, M., Hong, J., Islam, M.J., Sattar, J.: Robotic detection of marine litter using deep visual detection models. In: 2019 International Conference on Robotics and Automation (ICRA) (2019). https://doi.org/10.1109/ICRA.2019.8793975

  5. Haefliger, H.: Deepseawaste (2019). https://www.kaggle.com/henryhaefliger/deepseawaste

  6. Hong, J., Fulton, M., Sattar, J.: A generative approach towards improved robotic detection of marine litter. In: 2020 IEEE International Conference on Robotics and Automation (ICRA) (2020)

    Google Scholar 

  7. Hong, J., Michael, F., Sattar, J.: TrashCan: a semantically-segmented dataset towards visual detection of marine debris. arXiv (2020). https://conservancy.umn.edu/handle/11299/214865

  8. Hu, Y., Pateux, S., Gripon, V.: Squeezing backbone feature distributions to the max for efficient few-shot learning. Algorithms (2022). https://doi.org/10.3390/a15050147, https://hal.archives-ouvertes.fr/hal-03675145

  9. Jocher, G., et al.: ultralytics/yolov5: v6.0 - yolov5 (2021). https://doi.org/10.5281/zenodo.5563715

  10. Madricardo, F., et al.: How to deal with seafloor marine litter: an overview of the state-of-the-art and future perspectives. Front. Mar. Sci. (2020). https://doi.org/10.3389/fmars.2020.505134, https://www.frontiersin.org/article/10.3389/fmars.2020.505134

  11. Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. (2015). https://doi.org/10.1109/TPAMI.2016.2577031

  12. Schwerin, P., Wäscher, G.: The bin-packing problem: A problem generator and some numerical experiments with FFD packing and MTP. Int. Trans. Oper. Res. (2006). https://doi.org/10.1111/j.1475-3995.1997.tb00093.x

  13. Tan, M., Le, Q.: EfficientNet: Rethinking model scaling for convolutional neural networks. In: Chaudhuri, K., Salakhutdinov, R. (eds.) Proceedings of the 36th International Conference on Machine Learning. Proceedings of Machine Learning Research, vol. 97. PMLR (2019). https://proceedings.mlr.press/v97/tan19a.html

  14. Tzeng, E., Hoffman, J., Saenko, K., Darrell, T.: Adversarial discriminative domain adaptation. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE Computer Society, Los Alamitos, CA, USA (2017). https://doi.org/10.1109/CVPR.2017.316, https://doi.ieeecomputersociety.org/10.1109/CVPR.2017.316

  15. Veit, A., Matera, T., Neumann, L., Matas, J., Belongie, S.: Coco-text: dataset and benchmark for text detection and recognition in natural images (2016), https://arxiv.org/abs/1601.07140

  16. Wu, H., Xin, M., Fang, W., Hu, H.M., Hu, Z.: Multi-level feature network with multi-loss for person re-identification. IEEE Access (2019). https://doi.org/10.1109/ACCESS.2019.2927052

  17. Zhang, H., Cao, Z., Yan, Z., Zhang, C.: Sill-net: feature augmentation with separated illumination representation (2021)

    Google Scholar 

Download references

Acknowledgements

This research has received funding from the European Union’s Horizon 2020 research and innovation program under grant agreement No 101000832 (Maelstrom project).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Cyril Barrelet .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Barrelet, C., Chaumont, M., Subsol, G., Creuze, V., Gouttefarde, M. (2023). From TrashCan to UNO: Deriving an Underwater Image Dataset to Get a More Consistent and Balanced Version. In: Rousseau, JJ., Kapralos, B. (eds) Pattern Recognition, Computer Vision, and Image Processing. ICPR 2022 International Workshops and Challenges. ICPR 2022. Lecture Notes in Computer Science, vol 13645. Springer, Cham. https://doi.org/10.1007/978-3-031-37731-0_30

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-37731-0_30

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-37730-3

  • Online ISBN: 978-3-031-37731-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics