Multi-class object detection system using hybrid convolutional neural network architecture

Borade, Jay Laxman; Lakshmi, Muddana A

doi:10.1007/s11042-022-13007-7

Multi-class object detection system using hybrid convolutional neural network architecture

Published: 11 April 2022

Volume 81, pages 31727–31751, (2022)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Jay Laxman Borade¹ &
Muddana A Lakshmi²

343 Accesses
1 Altmetric
Explore all metrics

Abstract

Object detection in computer vision has been a significant research area for the past decade. Identifying objects with multiple classes from an image has attracted great attention because it can effectively classify and detect the image. A multi-class object detection system from a video or image is quite challenging because of the errors obtained by the location classification process. Our proposed system generalized a hybrid convolutional neural network (H-CNN) model is used to realize the user object from an image. The proposed work integrates pre-processing, object localization, feature extraction and classification. First, the input image is pre-processed with Gaussian filtering to remove noise and improve the image quality. After completing the pre-processing procedure, it is subjected to object localization. Here the object in the image is localized using Grid Guided Localization (GGL). In the feature extraction phase, the model would be pre-trained with AlexNet. Here the AlexNet are generalized as fully connected (FC) layers. Finally, the Softmax layer in the AlexNet architecture is replaced by SVR (Support Vector Regression), which acts as a classifier for identifying the object class. The classification loss is minimized using the Improved Grey Wolf (IGW) optimization algorithm. Thus, the H-CNN model can quickly classify and label the objects from images. It also offers improved classification performance in managing effective training time. The proposed work will be implemented in PYTHON. Therefore, the model would be built using various datasets such as MIT-67, PASCAL VOC2010, MS (Microsoft)-COCO, and MSRC to effectively train and classify the object. The proposed H-CNN achieved improved results with MIT-67 (96.02%), PASCAL VOC2010 (95.04%), MSRC (97.37%), and MS COCO (94.53%). The results obtained by H-CNN proved that the excluded result of Mean Average Precision (mAP), Precision, Accuracy, Recall values and F1-Score achieved better results than with recently developed works such as YOLO-fine, EfficientDet, YOLOv4, RetinaNet, GCNet and HRNet architectures.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An Optimal Approach for Multi-class Object Detection

A Review of Object Detection Models Based on Convolutional Neural Network

Survey on Convolutional Neural Networks-Based Object Detection Methods

Data availability

Data sharing not applicable to this article as no datasets were generated or analysed during the current study.

References

Ahmad T, Chen X, Saqlain AS, Ma Y (2021) FPN-GAN: multi-class small object detection in remote sensing images. In2021 IEEE 6th international conference on cloud computing and big data analytics (ICCCBDA), IEEE, 478-482
Alexey B, Wang CY, Mark Liao HY (2020) Optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934
Ali B, Cheng MM, Jiang H, Li J (2015) Salient object detection: a benchmark. IEEE Trans Image Process 24(12):5706–5722
Article MathSciNet Google Scholar
Ariadna Q, Torralba A (2009) Recognizing indoor scenes. In 2009 IEEE conference on computer vision and pattern recognition, IEEE. 413-420
Ashwani K, Srivastava S (2020) Object detection system based on convolution neural networks using single shot multi-box detector. Procedia Computer Science 171:2610–2617
Article Google Scholar
Aziz L, MS FC, Ayub S (2021) Multi-level refinement enriched feature pyramid network for object detection. Image and Vision Computing 115:104287
Article Google Scholar
Cao D, Chen Z, Gao L (2020) An improved object detection algorithm based on multi-scaled and deformable convolutional neural networks. Human-centric Computing and Information Sciences 10(1):1–22
Article Google Scholar
Dawei D, Qi Y, Yu H, Yang Y, Duan K, Li ZW, Huang Q, Tian Q (2018) The unmanned aerial vehicle benchmark: Object detection and tracking. In Proceedings of the European Conference on Computer Vision (ECCV) 370–386
Deng-Ping F, Lin Z, Zhang Z, Zhu M, Cheng MM (2020) Rethinking RGB-D salient object detection: models, data sets, and large-scale benchmarks. IEEE Transactions on Neural Networks and Learning Systems
Duygu S, Corso JJ, Guru KA (2017) Detection and localization of robotic tools in robot-assisted surgery videos using deep neural networks for region proposal and detection. IEEE Trans Med Imaging 36(7):1542–1549
Article Google Scholar
Jalled F, Voronkov I (2016) Object detection using image processing.arXiv preprint arXiv:1611.07791
Jawadul BH, Roy-Chowdhury AK (2016) CNN based region proposals for efficient object detection. In 2016 IEEE international conference on image processing (ICIP), IEEE 3658–3662
Jifeng D, Li Y, He K, Sun J (2016) Object detection via region-based fully convolutional networks. In advances in neural information processing systems. 379–387
Wang J, Sun K, Cheng T, Jiang B, Deng C, Zhao Y, Liu D, Mu Y, Tan M, Wang X, Liu W (2020) Deep high-resolution representation learning for visual recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 43(10):3349–3364
Article Google Scholar
Joseph R, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, real-time object detection. In proceedings of the IEEE conference on computer vision and pattern recognition, 779–788
Junwei H, Zhang D, Cheng G, Liu N, Xu D (2018) Advanced deep-learning techniques for salient and category-specific object detection: a survey. IEEE Signal Process Mag 35(1):84–100
Article Google Scholar
Manisha V, Kumar B (2020) A survey paper on object detection methods in image processing. In 2020 international conference on computer science, engineering and applications (ICCSEA), IEEE 1–4
Mark E, Gool LV, Williams CKI, Winn J, Zisserman A (2010) The pascal visual object classes (voc) challenge. Int J Comput Vis 88(2):303–338
Article Google Scholar
Mathivanan G (2021) Survey on object detection framework: evolution of algorithms. In2021 5th international conference on electronics, communication and aerospace technology (ICECA) 1–5
Mingxing T, Pang R and Le QV (2020) Efficientdet: scalable and efficient object detection. In proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 10781–10790
Minh-Tan P, Courtrai L, Friguet C, Lefèvre S, Baussard A (2020) One-stage detector of small objects under various backgrounds in remote sensing images. Remote Sens 12(15):2501
Article Google Scholar
Ning W, Gao Y, Chen H, Wang P, Tian Z, Shen C, Zhang Y (2020) Fast neural architecture search for object detection. In proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 11943–11951
Peng Z, Ni B, Geng C, Hu J, Xu Y (2018) Scale-transferrable object detection. In proceedings of the IEEE conference on computer vision and pattern recognition 528–537
Piotr D, Appel R, Belongie S, Perona P (2014) Fast feature pyramids for object detection. IEEE Trans Pattern Anal Mach Intell 36(8):1532–1545
Article Google Scholar
Pramanik A, Pal SK, Maiti J, Mitra P (2021) Granulated RCNN and multi-class deep sort for multi-object detection and tracking. IEEE Transact Emerg Top Comput Intell,1–11
Prerna S, Gupta A, Aggarwal A, Gupta D, Khanna A, Hassanien AE, de Albuquerque VHC (2020) The health of things for classification of protein structure using improved grey wolf optimization. J Supercomput 76(2):1226–1241
Article Google Scholar
Ren Y, Zhu C, Xiao S (2018) Deformable faster r-cnn with aggregating multi-layer features for partially occluded object detection in optical remote sensing images. Remote Sens 10(9):1470
Article Google Scholar
Sachchidanand S, Singh N (2017) Object classification to analyze medical imaging data using deep learning. In 2017 international conference on innovations in information, embedded and communication systems (ICIIECS). IEEE 1–4
Shipra O, Sakhare S (2015) Image processing techniques for object tracking in video surveillance-a survey. In 2015 international conference on pervasive computing (ICPC). IEEE 1–6
Sindhia L, Kumar D (2020) An efficient moving object detection and tracking system based on fractional derivative. Multimed Tools Appl 79(13):8519–8537
Google Scholar
Tian S, Kang L, Xing X, Tian J, Fan C, Zhang Y (2021) A relation-augmented embedded graph attention network for remote sensing object detection. IEEE Trans Geosci Remote Sens 60:1–17
Google Scholar
Tomasz M, Alexei A (2007) Improving spatial support for objects via multiple segmentations
Tsung-Yi L, Maire M, Belongie S, Hays J, Perona P, Ramanan D, Dollár P, Zitnick CL (2014) Common objects in context. In: European conference on computer vision. Springer, Cham, pp 740–755
Google Scholar
Videira G (2019) Image processing and object detection for advanced driver assistance systems PhD diss
Wang Y, Fathi A, Kundu A, Ross DA, Pantofaru C, Funkhouser T, Solomon J. Pillar-based object detection for autonomous driving. InEuropean Conference on Computer Vision, pp 18–34, Springer, Cham
Wu Z, He S (2021) Improvement of the AlexNet networks for large-scale recognition applications. Iranian Journal of Science and Technology, Transactions of Electrical Engineering 45(2):493–503
Article Google Scholar
Xinyi Z, Gong W, Fu W, Du F (2017) Application of deep learning in object detection. In 2017 IEEE/ACIS 16th international conference on computer and information science (ICIS), IEEE, 631–634
Lei Y, Yao X, Chen W, Zhang J, Mehnen J, Yang E (2020) Multiple object detection of workpieces based on fusion of deep learning and image processing. In 2020 International Joint Conference on Neural Networks (IJCNN). IEEE 1–7

Download references

Funding

No funding.

Author information

Authors and Affiliations

Research Scholar at GITAM (Deemed to be University), Hyderabad, India
Jay Laxman Borade
CSE department, GITAM (Deemed to be University), Hyderabad, India
Muddana A Lakshmi

Authors

Jay Laxman Borade
View author publications
You can also search for this author in PubMed Google Scholar
Muddana A Lakshmi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jay Laxman Borade.

Ethics declarations

Conflict of interest

No conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Borade, J.L., Lakshmi, M.A. Multi-class object detection system using hybrid convolutional neural network architecture. Multimed Tools Appl 81, 31727–31751 (2022). https://doi.org/10.1007/s11042-022-13007-7

Download citation

Received: 04 August 2021
Revised: 29 January 2022
Accepted: 28 March 2022
Published: 11 April 2022
Issue Date: September 2022
DOI: https://doi.org/10.1007/s11042-022-13007-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multi-class object detection system using hybrid convolutional neural network architecture

Abstract

Access this article

Similar content being viewed by others

An Optimal Approach for Multi-class Object Detection

A Review of Object Detection Models Based on Convolutional Neural Network

Survey on Convolutional Neural Networks-Based Object Detection Methods

Data availability

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Multi-class object detection system using hybrid convolutional neural network architecture

Abstract

Access this article

Similar content being viewed by others

An Optimal Approach for Multi-class Object Detection

A Review of Object Detection Models Based on Convolutional Neural Network

Survey on Convolutional Neural Networks-Based Object Detection Methods

Data availability

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation