Skip to main content

Deep Learning Case Study on Imbalanced Training Data for Automatic Bird Identification

  • Chapter
  • First Online:

Part of the book series: Studies in Computational Intelligence ((SCI,volume 865))

Abstract

Collisions between birds and wind turbines can be significant problem in wind farms. Practical deterrent methods are required to prevent these collisions. However, it is improbable that a single deterrent method would work for all bird species in a given area. An automatic bird identification system is needed in order to develop bird species level deterrent methods. This system is the first and necessary part of the entirety that is eventually able to, monitor bird movements, identify bird species, and launch deterrent measures. The system consists of a radar system for detection of the birds, a digital single-lens reflex camera with telephoto lens for capturing images, a motorized video head for steering the camera, and convolutional neural networks trained on the images with a deep learning algorithm for image classification . We utilized imbalanced data because the distribution of the captured images is naturally imbalanced. We applied distribution of the training data set to estimate the actual distribution of the bird species in the test area. Species identification is based on the image classifier that is a hybrid of hierarchical and cascade models. The main idea is to train classifiers on bird species groups, in which the species resembles more each other than any other species outside the group in terms of morphology (coloration and shape). The results of this study show that the developed image classifier model has sufficient performance to identify bird species in a test area. The proposed system produced very good results, when the hybrid hierarchical model was applied to the imbalanced data sets.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Li, F., Li, S., Zhu, C., Lan, X., Chang, H.: Class-imbalance aware CNN extension for high resolution aerial image based vehicle localization and categorization. In: 2017 2nd International Conference on Image, Vision and Computing (ICIVC), Chengdu (2017)

    Google Scholar 

  2. Krizhevsky, A., Sutskever, I., Hinton, G. E.: Imagenet classification with deep convolutional neural networks. Commun. ACM, 84–90 (2017)

    Google Scholar 

  3. Mao, R., Lin, Q., Allebach, J.: Robust convolutional neural network cascade for facial landmark localization exploiting training data augmentation. In: Imaging and Multimedia Analytics in a Web and Mobile World 2018, pp. 374-1-374-5(5) (2018)

    Google Scholar 

  4. Jia, S., Wang, P., Jia, P., Hu, S.: Research on data augmentation for image classification based on convolution neural networks. In: 2017 Chinese Automation Congress (CAC), Jinan (2017)

    Google Scholar 

  5. Li, H., Lin, Z., Shen, X., Brandt, J., Hua, G.: A convolutional neural network cascade for face detection. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston (2015)

    Google Scholar 

  6. Rachmadi, R., Uchimura, K., Koutagi, G., Komokata, Y.: Japan road sign classification using cascade convolutional neural network. In: ITS (Intelligent Transport System) World Gongress, Tokyo, pp. 1–12 (2016)

    Google Scholar 

  7. Robin radar models. In: Robin Radar Systems B.V. (Accessed 2019). https://www.robinradar.com/

  8. pelco-D protocol. In: Bruxy REGNET. (Accessed 2019). http://bruxy.regnet.cz/programming/rs485/pelco-d.pdf

  9. Niemi, J., Tanttu, J.: Automatic bird identification for offshore wind farms. In: Bispo, R., Bernardino, J.C., Costa, J.L.,( Eds.), Wind Energy and Wildlife Impacts, Cham, pp. 135–151 (2019)

    Google Scholar 

  10. Mirzaei, G., Jamali, M., Ross, J., Gorsevski, P., Bingman, V.: Data fusion of acoustics, infrared, and marine radar for avian study. IEEE Sensors J. 15(11) (2016)

    Google Scholar 

  11. Batista, G., Prati, R., Monard, M.: A study of the behavior of several methods for balancing machine learning training data. ACM SIGKDD Explor. Newsl., 20–29 (2004)

    Google Scholar 

  12. Freund, Y., Schapire, R.: A decision-theoretic generalization of on-line learning and an application to boosting. Comput. Learn. Theory 904, 23–37 (1995)

    Article  Google Scholar 

  13. Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, Kauai, pp. 511–518 (2001)

    Google Scholar 

  14. Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), San Diego, pp. 886–893 (2005)

    Google Scholar 

  15. Yoshihashi, R., Kawakami, R., Iida, M., Naemura, T.: Evaluation of bird detection using time-lapse images around a wind farm. Wind Energy 20(12), 1983–1995 (2017)

    Article  Google Scholar 

  16. Malling Olsen, K., Larsson, H.: Terns of Europe and North America. Helm, London (1995)

    Google Scholar 

  17. Madge, S., Burn, H.: Wildfowl, an Identification Guide to the Ducks, Geese and Swans of the World. Helm, London (1988)

    Google Scholar 

  18. Jarrett, K., Kavukcuoglu, K., Ranzato, M., LeCun, Y.: What is the best multi-stage architecture for object recognition. In: International Conference on Computer Vision, Kyoto, pp. 2146–2153 (2009)

    Google Scholar 

  19. Silla, C., Freitas, A.: A survey of hierarchical classification across different application domains. Data Min Knowl Disc 22(31) (2011)

    Google Scholar 

  20. Sun, Y., Wang, X., Tang, X.: Deep convolutional network cascade for facial point detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3476–3483 (2013)

    Google Scholar 

  21. Bishop, C.M.: Pattern Recognition and Machine Learning. Springer, New York (2006)

    MATH  Google Scholar 

  22. Y.L., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. In: Proceedings of the IEEE 86, 11, New York, pp. 2278–2324 (1998)

    Google Scholar 

  23. Li, M., Zhang, T., Chen, Y., Smola, A. J.: Efficient mini-batch training for stochastic optimization. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge, New York, pp. 661–670 (2014)

    Google Scholar 

  24. Murphy, K.P.: Machine Learning: A Probabilistic Perspective. The MIT Press, Cambridge, USA (2012)

    MATH  Google Scholar 

  25. Haykin, S.: Neural Networks: A Comprehensive Foundation, 2nd edn. Prentice Hall/Pearson, New York (1994)

    MATH  Google Scholar 

  26. Malling Olsen, K., Larsson, H.: Gulls of Europe. Asia and North America. Helm, London (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Juha Niemi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Niemi, J., Tanttu, J.T. (2020). Deep Learning Case Study on Imbalanced Training Data for Automatic Bird Identification. In: Pedrycz, W., Chen, SM. (eds) Deep Learning: Algorithms and Applications. Studies in Computational Intelligence, vol 865. Springer, Cham. https://doi.org/10.1007/978-3-030-31760-7_8

Download citation

Publish with us

Policies and ethics