Skip to main content

Application of Convolutional Neural Networks for Image Detection and Recognition Based on a Self-written Generator

  • Conference paper
  • First Online:
Distributed Computer and Communication Networks (DCCN 2022)

Abstract

Object recognition is a branch of artificial vision and one of the pillars of machine vision. It consists in identifying the forms described in advance in a digital image and, in general, in a digital video stream. Although, as a rule, it is possible to perform recognition from video clips, the learning process is usually performed on images. In this paper, an algorithm for classifying and recognizing objects using convolutional neural networks is considered. The purpose of the work is to implement an algorithm for detecting and classifying various graphic objects fed from a webcam. The task is to first classify and recognize an object with high accuracy according to a given data set, and then demonstrate a way to generate images to increase the volume of the training data set by using a self-written generator. The classification and recognition algorithm used is invariant to transfer, shift and rotation. A significant novelty of this work is the creation of a self-written generator that allows using various types of augmentation (artificial increase in the volume of the training sample by modifying the training data) to form new groups of modified images each time.

This paper has been supported by the RUDN University Strategic Academic Leadership Program.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Mathematical methods of pattern recognition. In: Abstracts of the 19th All-Russian Conference with International Participation (Moscow 2019), 420 p. Russian Academy of Sciences, Moscow (2019)

    Google Scholar 

  2. Redmon, J.: YOLO9000: better, faster, stronger. In: Redmon, J., Farhadi, A. (eds.) IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6517–6525 (2017)

    Google Scholar 

  3. Panetto, H., Cecil, J.: Information systems for enterprise integration, interoperability and networking: theory and applications. Enterp. Inf. Syst. 7(1), 1–6 (2013)

    Article  Google Scholar 

  4. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)

  5. Mouale, M.N.B., Kozyrev, D.V., Houankpo, H.G.K., Nibasumba, E.: Development of a neural network method in the problem of classification and image recognition. Sovremennye informacionnye tehnologii i IT-obrazovanie (Modern Inf. Technol. IT-Educ.) 17(3), 507–518 (2021). https://doi.org/10.25559/SITITO.17.202103.507-518

  6. Canny, J.: A computational approach to edge detection. IEEE Trans. Pattern Anal. Mach. Intell. 8, 679–714 (1986)

    Article  Google Scholar 

  7. Duy Thanh, N.: Face image recognition methods based on invariants to affine and brightness transformations. Diss. to the competition uch. degree cand. physical mat. Sciences, Moscow (2018)

    Google Scholar 

  8. Sikorsky, O.S.: Overview of convolutional neural networks for the problem of image classification. In: Sikorsky, O.S. (ed.) New Information Technologies in Automated Systems, vol. 20, pp. 37–42. Moscow (2017)

    Google Scholar 

  9. Erhan, D., Szegedy, C., Toshev, A., Anguelov, D.: Scalable object detection using deep neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2147–2154 (2014)

    Google Scholar 

  10. He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. In: ECCV (2014)

    Google Scholar 

  11. Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: NIPS (2012)

    Google Scholar 

  12. Hizem, W.: Capteur Intelligent pour la Reconnaissance de Visage, Télécommunications et l’Université Pierre et Marie Curie - Paris 6 (2009)

    Google Scholar 

  13. Guerfi Ababsa, S.: Authentification d’individus par reconnaissance de caractéristiques biométriques liées aux visages 2D/3D, Université Evry Val d’Essonne (2008)

    Google Scholar 

  14. Cécile Fiche, M.: Repousser les limites de l’identification faciale en contexte de vidéo surveillance, Université de Grenoble (2012)

    Google Scholar 

  15. Van Wambeke, M.: Reconnaissance et suivi de visages et implémentation en robotique temps-réel, Université Catholique de Louvain (2009–2010)

    Google Scholar 

  16. Gafarov, F.M.: G12 artificial neural networks and applications: textbook. allowance, 121 p. Gafarov, F.M., Galimyanov, A.F. (eds.) Kazan Publishing House, Kazan (2018)

    Google Scholar 

  17. Poniszewska-Maranda, A.: Management of access control in information system based on role concept. Scalable Comput. Pract. Experience 12(1), 35–49 (2011)

    Google Scholar 

  18. Arulogun, O.T., Omidiora, E.O., Olaniyi, O.M., Ipadeola, A.A.: Development of security system using facial recognition. Pac. J. Sci. Technol. 9(3), 377–384 (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dmitry Kozyrev .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Bienvenue, M.M.N., Kozyrev, D. (2023). Application of Convolutional Neural Networks for Image Detection and Recognition Based on a Self-written Generator. In: Vishnevskiy, V.M., Samouylov, K.E., Kozyrev, D.V. (eds) Distributed Computer and Communication Networks. DCCN 2022. Communications in Computer and Information Science, vol 1748. Springer, Cham. https://doi.org/10.1007/978-3-031-30648-8_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-30648-8_3

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-30647-1

  • Online ISBN: 978-3-031-30648-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics