Abstract
Object recognition is a branch of artificial vision and one of the pillars of machine vision. It consists in identifying the forms described in advance in a digital image and, in general, in a digital video stream. Although, as a rule, it is possible to perform recognition from video clips, the learning process is usually performed on images. In this paper, an algorithm for classifying and recognizing objects using convolutional neural networks is considered. The purpose of the work is to implement an algorithm for detecting and classifying various graphic objects fed from a webcam. The task is to first classify and recognize an object with high accuracy according to a given data set, and then demonstrate a way to generate images to increase the volume of the training data set by using a self-written generator. The classification and recognition algorithm used is invariant to transfer, shift and rotation. A significant novelty of this work is the creation of a self-written generator that allows using various types of augmentation (artificial increase in the volume of the training sample by modifying the training data) to form new groups of modified images each time.
This paper has been supported by the RUDN University Strategic Academic Leadership Program.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Mathematical methods of pattern recognition. In: Abstracts of the 19th All-Russian Conference with International Participation (Moscow 2019), 420 p. Russian Academy of Sciences, Moscow (2019)
Redmon, J.: YOLO9000: better, faster, stronger. In: Redmon, J., Farhadi, A. (eds.) IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6517–6525 (2017)
Panetto, H., Cecil, J.: Information systems for enterprise integration, interoperability and networking: theory and applications. Enterp. Inf. Syst. 7(1), 1–6 (2013)
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)
Mouale, M.N.B., Kozyrev, D.V., Houankpo, H.G.K., Nibasumba, E.: Development of a neural network method in the problem of classification and image recognition. Sovremennye informacionnye tehnologii i IT-obrazovanie (Modern Inf. Technol. IT-Educ.) 17(3), 507–518 (2021). https://doi.org/10.25559/SITITO.17.202103.507-518
Canny, J.: A computational approach to edge detection. IEEE Trans. Pattern Anal. Mach. Intell. 8, 679–714 (1986)
Duy Thanh, N.: Face image recognition methods based on invariants to affine and brightness transformations. Diss. to the competition uch. degree cand. physical mat. Sciences, Moscow (2018)
Sikorsky, O.S.: Overview of convolutional neural networks for the problem of image classification. In: Sikorsky, O.S. (ed.) New Information Technologies in Automated Systems, vol. 20, pp. 37–42. Moscow (2017)
Erhan, D., Szegedy, C., Toshev, A., Anguelov, D.: Scalable object detection using deep neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2147–2154 (2014)
He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. In: ECCV (2014)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: NIPS (2012)
Hizem, W.: Capteur Intelligent pour la Reconnaissance de Visage, Télécommunications et l’Université Pierre et Marie Curie - Paris 6 (2009)
Guerfi Ababsa, S.: Authentification d’individus par reconnaissance de caractéristiques biométriques liées aux visages 2D/3D, Université Evry Val d’Essonne (2008)
Cécile Fiche, M.: Repousser les limites de l’identification faciale en contexte de vidéo surveillance, Université de Grenoble (2012)
Van Wambeke, M.: Reconnaissance et suivi de visages et implémentation en robotique temps-réel, Université Catholique de Louvain (2009–2010)
Gafarov, F.M.: G12 artificial neural networks and applications: textbook. allowance, 121 p. Gafarov, F.M., Galimyanov, A.F. (eds.) Kazan Publishing House, Kazan (2018)
Poniszewska-Maranda, A.: Management of access control in information system based on role concept. Scalable Comput. Pract. Experience 12(1), 35–49 (2011)
Arulogun, O.T., Omidiora, E.O., Olaniyi, O.M., Ipadeola, A.A.: Development of security system using facial recognition. Pac. J. Sci. Technol. 9(3), 377–384 (2008)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Bienvenue, M.M.N., Kozyrev, D. (2023). Application of Convolutional Neural Networks for Image Detection and Recognition Based on a Self-written Generator. In: Vishnevskiy, V.M., Samouylov, K.E., Kozyrev, D.V. (eds) Distributed Computer and Communication Networks. DCCN 2022. Communications in Computer and Information Science, vol 1748. Springer, Cham. https://doi.org/10.1007/978-3-031-30648-8_3
Download citation
DOI: https://doi.org/10.1007/978-3-031-30648-8_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-30647-1
Online ISBN: 978-3-031-30648-8
eBook Packages: Computer ScienceComputer Science (R0)