Abstract
Object detection in computer vision has been a significant research area for the past decade. Identifying objects with multiple classes from an image has attracted great attention because it can effectively classify and detect the image. A multi-class object detection system from a video or image is quite challenging because of the errors obtained by the location classification process. Our proposed system generalized a hybrid convolutional neural network (H-CNN) model is used to realize the user object from an image. The proposed work integrates pre-processing, object localization, feature extraction and classification. First, the input image is pre-processed with Gaussian filtering to remove noise and improve the image quality. After completing the pre-processing procedure, it is subjected to object localization. Here the object in the image is localized using Grid Guided Localization (GGL). In the feature extraction phase, the model would be pre-trained with AlexNet. Here the AlexNet are generalized as fully connected (FC) layers. Finally, the Softmax layer in the AlexNet architecture is replaced by SVR (Support Vector Regression), which acts as a classifier for identifying the object class. The classification loss is minimized using the Improved Grey Wolf (IGW) optimization algorithm. Thus, the H-CNN model can quickly classify and label the objects from images. It also offers improved classification performance in managing effective training time. The proposed work will be implemented in PYTHON. Therefore, the model would be built using various datasets such as MIT-67, PASCAL VOC2010, MS (Microsoft)-COCO, and MSRC to effectively train and classify the object. The proposed H-CNN achieved improved results with MIT-67 (96.02%), PASCAL VOC2010 (95.04%), MSRC (97.37%), and MS COCO (94.53%). The results obtained by H-CNN proved that the excluded result of Mean Average Precision (mAP), Precision, Accuracy, Recall values and F1-Score achieved better results than with recently developed works such as YOLO-fine, EfficientDet, YOLOv4, RetinaNet, GCNet and HRNet architectures.
Similar content being viewed by others
Data availability
Data sharing not applicable to this article as no datasets were generated or analysed during the current study.
References
Ahmad T, Chen X, Saqlain AS, Ma Y (2021) FPN-GAN: multi-class small object detection in remote sensing images. In2021 IEEE 6th international conference on cloud computing and big data analytics (ICCCBDA), IEEE, 478-482
Alexey B, Wang CY, Mark Liao HY (2020) Optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934
Ali B, Cheng MM, Jiang H, Li J (2015) Salient object detection: a benchmark. IEEE Trans Image Process 24(12):5706–5722
Ariadna Q, Torralba A (2009) Recognizing indoor scenes. In 2009 IEEE conference on computer vision and pattern recognition, IEEE. 413-420
Ashwani K, Srivastava S (2020) Object detection system based on convolution neural networks using single shot multi-box detector. Procedia Computer Science 171:2610–2617
Aziz L, MS FC, Ayub S (2021) Multi-level refinement enriched feature pyramid network for object detection. Image and Vision Computing 115:104287
Cao D, Chen Z, Gao L (2020) An improved object detection algorithm based on multi-scaled and deformable convolutional neural networks. Human-centric Computing and Information Sciences 10(1):1–22
Dawei D, Qi Y, Yu H, Yang Y, Duan K, Li ZW, Huang Q, Tian Q (2018) The unmanned aerial vehicle benchmark: Object detection and tracking. In Proceedings of the European Conference on Computer Vision (ECCV) 370–386
Deng-Ping F, Lin Z, Zhang Z, Zhu M, Cheng MM (2020) Rethinking RGB-D salient object detection: models, data sets, and large-scale benchmarks. IEEE Transactions on Neural Networks and Learning Systems
Duygu S, Corso JJ, Guru KA (2017) Detection and localization of robotic tools in robot-assisted surgery videos using deep neural networks for region proposal and detection. IEEE Trans Med Imaging 36(7):1542–1549
Jalled F, Voronkov I (2016) Object detection using image processing.arXiv preprint arXiv:1611.07791
Jawadul BH, Roy-Chowdhury AK (2016) CNN based region proposals for efficient object detection. In 2016 IEEE international conference on image processing (ICIP), IEEE 3658–3662
Jifeng D, Li Y, He K, Sun J (2016) Object detection via region-based fully convolutional networks. In advances in neural information processing systems. 379–387
Wang J, Sun K, Cheng T, Jiang B, Deng C, Zhao Y, Liu D, Mu Y, Tan M, Wang X, Liu W (2020) Deep high-resolution representation learning for visual recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 43(10):3349–3364
Joseph R, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, real-time object detection. In proceedings of the IEEE conference on computer vision and pattern recognition, 779–788
Junwei H, Zhang D, Cheng G, Liu N, Xu D (2018) Advanced deep-learning techniques for salient and category-specific object detection: a survey. IEEE Signal Process Mag 35(1):84–100
Manisha V, Kumar B (2020) A survey paper on object detection methods in image processing. In 2020 international conference on computer science, engineering and applications (ICCSEA), IEEE 1–4
Mark E, Gool LV, Williams CKI, Winn J, Zisserman A (2010) The pascal visual object classes (voc) challenge. Int J Comput Vis 88(2):303–338
Mathivanan G (2021) Survey on object detection framework: evolution of algorithms. In2021 5th international conference on electronics, communication and aerospace technology (ICECA) 1–5
Mingxing T, Pang R and Le QV (2020) Efficientdet: scalable and efficient object detection. In proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 10781–10790
Minh-Tan P, Courtrai L, Friguet C, Lefèvre S, Baussard A (2020) One-stage detector of small objects under various backgrounds in remote sensing images. Remote Sens 12(15):2501
Ning W, Gao Y, Chen H, Wang P, Tian Z, Shen C, Zhang Y (2020) Fast neural architecture search for object detection. In proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 11943–11951
Peng Z, Ni B, Geng C, Hu J, Xu Y (2018) Scale-transferrable object detection. In proceedings of the IEEE conference on computer vision and pattern recognition 528–537
Piotr D, Appel R, Belongie S, Perona P (2014) Fast feature pyramids for object detection. IEEE Trans Pattern Anal Mach Intell 36(8):1532–1545
Pramanik A, Pal SK, Maiti J, Mitra P (2021) Granulated RCNN and multi-class deep sort for multi-object detection and tracking. IEEE Transact Emerg Top Comput Intell,1–11
Prerna S, Gupta A, Aggarwal A, Gupta D, Khanna A, Hassanien AE, de Albuquerque VHC (2020) The health of things for classification of protein structure using improved grey wolf optimization. J Supercomput 76(2):1226–1241
Ren Y, Zhu C, Xiao S (2018) Deformable faster r-cnn with aggregating multi-layer features for partially occluded object detection in optical remote sensing images. Remote Sens 10(9):1470
Sachchidanand S, Singh N (2017) Object classification to analyze medical imaging data using deep learning. In 2017 international conference on innovations in information, embedded and communication systems (ICIIECS). IEEE 1–4
Shipra O, Sakhare S (2015) Image processing techniques for object tracking in video surveillance-a survey. In 2015 international conference on pervasive computing (ICPC). IEEE 1–6
Sindhia L, Kumar D (2020) An efficient moving object detection and tracking system based on fractional derivative. Multimed Tools Appl 79(13):8519–8537
Tian S, Kang L, Xing X, Tian J, Fan C, Zhang Y (2021) A relation-augmented embedded graph attention network for remote sensing object detection. IEEE Trans Geosci Remote Sens 60:1–17
Tomasz M, Alexei A (2007) Improving spatial support for objects via multiple segmentations
Tsung-Yi L, Maire M, Belongie S, Hays J, Perona P, Ramanan D, Dollár P, Zitnick CL (2014) Common objects in context. In: European conference on computer vision. Springer, Cham, pp 740–755
Videira G (2019) Image processing and object detection for advanced driver assistance systems PhD diss
Wang Y, Fathi A, Kundu A, Ross DA, Pantofaru C, Funkhouser T, Solomon J. Pillar-based object detection for autonomous driving. InEuropean Conference on Computer Vision, pp 18–34, Springer, Cham
Wu Z, He S (2021) Improvement of the AlexNet networks for large-scale recognition applications. Iranian Journal of Science and Technology, Transactions of Electrical Engineering 45(2):493–503
Xinyi Z, Gong W, Fu W, Du F (2017) Application of deep learning in object detection. In 2017 IEEE/ACIS 16th international conference on computer and information science (ICIS), IEEE, 631–634
Lei Y, Yao X, Chen W, Zhang J, Mehnen J, Yang E (2020) Multiple object detection of workpieces based on fusion of deep learning and image processing. In 2020 International Joint Conference on Neural Networks (IJCNN). IEEE 1–7
Funding
No funding.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
No conflict of interest.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Borade, J.L., Lakshmi, M.A. Multi-class object detection system using hybrid convolutional neural network architecture. Multimed Tools Appl 81, 31727–31751 (2022). https://doi.org/10.1007/s11042-022-13007-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-13007-7