Deep Learning Case Study on Imbalanced Training Data for Automatic Bird Identification

  • Juha NiemiEmail author
  • Juha T. Tanttu
Part of the Studies in Computational Intelligence book series (SCI, volume 865)


Collisions between birds and wind turbines can be significant problem in wind farms. Practical deterrent methods are required to prevent these collisions. However, it is improbable that a single deterrent method would work for all bird species in a given area. An automatic bird identification system is needed in order to develop bird species level deterrent methods. This system is the first and necessary part of the entirety that is eventually able to, monitor bird movements, identify bird species, and launch deterrent measures. The system consists of a radar system for detection of the birds, a digital single-lens reflex camera with telephoto lens for capturing images, a motorized video head for steering the camera, and convolutional neural networks trained on the images with a deep learning algorithm for image classification. We utilized imbalanced data because the distribution of the captured images is naturally imbalanced. We applied distribution of the training data set to estimate the actual distribution of the bird species in the test area. Species identification is based on the image classifier that is a hybrid of hierarchical and cascade models. The main idea is to train classifiers on bird species groups, in which the species resembles more each other than any other species outside the group in terms of morphology (coloration and shape). The results of this study show that the developed image classifier model has sufficient performance to identify bird species in a test area. The proposed system produced very good results, when the hybrid hierarchical model was applied to the imbalanced data sets.


Machine learning Deep learning Convolutional neural networks Classification Data augmentation Intelligent surveillance systems 


  1. 1.
    Li, F., Li, S., Zhu, C., Lan, X., Chang, H.: Class-imbalance aware CNN extension for high resolution aerial image based vehicle localization and categorization. In: 2017 2nd International Conference on Image, Vision and Computing (ICIVC), Chengdu (2017)Google Scholar
  2. 2.
    Krizhevsky, A., Sutskever, I., Hinton, G. E.: Imagenet classification with deep convolutional neural networks. Commun. ACM, 84–90 (2017)Google Scholar
  3. 3.
    Mao, R., Lin, Q., Allebach, J.: Robust convolutional neural network cascade for facial landmark localization exploiting training data augmentation. In: Imaging and Multimedia Analytics in a Web and Mobile World 2018, pp. 374-1-374-5(5) (2018)Google Scholar
  4. 4.
    Jia, S., Wang, P., Jia, P., Hu, S.: Research on data augmentation for image classification based on convolution neural networks. In: 2017 Chinese Automation Congress (CAC), Jinan (2017)Google Scholar
  5. 5.
    Li, H., Lin, Z., Shen, X., Brandt, J., Hua, G.: A convolutional neural network cascade for face detection. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston (2015)Google Scholar
  6. 6.
    Rachmadi, R., Uchimura, K., Koutagi, G., Komokata, Y.: Japan road sign classification using cascade convolutional neural network. In: ITS (Intelligent Transport System) World Gongress, Tokyo, pp. 1–12 (2016)Google Scholar
  7. 7.
    Robin radar models. In: Robin Radar Systems B.V. (Accessed 2019).
  8. 8.
    pelco-D protocol. In: Bruxy REGNET. (Accessed 2019).
  9. 9.
    Niemi, J., Tanttu, J.: Automatic bird identification for offshore wind farms. In: Bispo, R., Bernardino, J.C., Costa, J.L.,( Eds.), Wind Energy and Wildlife Impacts, Cham, pp. 135–151 (2019)Google Scholar
  10. 10.
    Mirzaei, G., Jamali, M., Ross, J., Gorsevski, P., Bingman, V.: Data fusion of acoustics, infrared, and marine radar for avian study. IEEE Sensors J. 15(11) (2016)Google Scholar
  11. 11.
    Batista, G., Prati, R., Monard, M.: A study of the behavior of several methods for balancing machine learning training data. ACM SIGKDD Explor. Newsl., 20–29 (2004)Google Scholar
  12. 12.
    Freund, Y., Schapire, R.: A decision-theoretic generalization of on-line learning and an application to boosting. Comput. Learn. Theory 904, 23–37 (1995)CrossRefGoogle Scholar
  13. 13.
    Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, Kauai, pp. 511–518 (2001)Google Scholar
  14. 14.
    Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), San Diego, pp. 886–893 (2005)Google Scholar
  15. 15.
    Yoshihashi, R., Kawakami, R., Iida, M., Naemura, T.: Evaluation of bird detection using time-lapse images around a wind farm. Wind Energy 20(12), 1983–1995 (2017)CrossRefGoogle Scholar
  16. 16.
    Malling Olsen, K., Larsson, H.: Terns of Europe and North America. Helm, London (1995)Google Scholar
  17. 17.
    Madge, S., Burn, H.: Wildfowl, an Identification Guide to the Ducks, Geese and Swans of the World. Helm, London (1988)Google Scholar
  18. 18.
    Jarrett, K., Kavukcuoglu, K., Ranzato, M., LeCun, Y.: What is the best multi-stage architecture for object recognition. In: International Conference on Computer Vision, Kyoto, pp. 2146–2153 (2009)Google Scholar
  19. 19.
    Silla, C., Freitas, A.: A survey of hierarchical classification across different application domains. Data Min Knowl Disc 22(31) (2011)Google Scholar
  20. 20.
    Sun, Y., Wang, X., Tang, X.: Deep convolutional network cascade for facial point detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3476–3483 (2013)Google Scholar
  21. 21.
    Bishop, C.M.: Pattern Recognition and Machine Learning. Springer, New York (2006)zbMATHGoogle Scholar
  22. 22.
    Y.L., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. In: Proceedings of the IEEE 86, 11, New York, pp. 2278–2324 (1998)Google Scholar
  23. 23.
    Li, M., Zhang, T., Chen, Y., Smola, A. J.: Efficient mini-batch training for stochastic optimization. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge, New York, pp. 661–670 (2014)Google Scholar
  24. 24.
    Murphy, K.P.: Machine Learning: A Probabilistic Perspective. The MIT Press, Cambridge, USA (2012)zbMATHGoogle Scholar
  25. 25.
    Haykin, S.: Neural Networks: A Comprehensive Foundation, 2nd edn. Prentice Hall/Pearson, New York (1994)zbMATHGoogle Scholar
  26. 26.
    Malling Olsen, K., Larsson, H.: Gulls of Europe. Asia and North America. Helm, London (2003)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  1. 1.Faculty of Information Technology and Communication SciencesTampere University Pori UnitPoriFinland

Personalised recommendations