Skip to main content
Log in

Multi-class object detection system using hybrid convolutional neural network architecture

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Object detection in computer vision has been a significant research area for the past decade. Identifying objects with multiple classes from an image has attracted great attention because it can effectively classify and detect the image. A multi-class object detection system from a video or image is quite challenging because of the errors obtained by the location classification process. Our proposed system generalized a hybrid convolutional neural network (H-CNN) model is used to realize the user object from an image. The proposed work integrates pre-processing, object localization, feature extraction and classification. First, the input image is pre-processed with Gaussian filtering to remove noise and improve the image quality. After completing the pre-processing procedure, it is subjected to object localization. Here the object in the image is localized using Grid Guided Localization (GGL). In the feature extraction phase, the model would be pre-trained with AlexNet. Here the AlexNet are generalized as fully connected (FC) layers. Finally, the Softmax layer in the AlexNet architecture is replaced by SVR (Support Vector Regression), which acts as a classifier for identifying the object class. The classification loss is minimized using the Improved Grey Wolf (IGW) optimization algorithm. Thus, the H-CNN model can quickly classify and label the objects from images. It also offers improved classification performance in managing effective training time. The proposed work will be implemented in PYTHON. Therefore, the model would be built using various datasets such as MIT-67, PASCAL VOC2010, MS (Microsoft)-COCO, and MSRC to effectively train and classify the object. The proposed H-CNN achieved improved results with MIT-67 (96.02%), PASCAL VOC2010 (95.04%), MSRC (97.37%), and MS COCO (94.53%). The results obtained by H-CNN proved that the excluded result of Mean Average Precision (mAP), Precision, Accuracy, Recall values ​​and F1-Score achieved better results than with recently developed works such as YOLO-fine, EfficientDet, YOLOv4, RetinaNet, GCNet and HRNet architectures.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Data availability

Data sharing not applicable to this article as no datasets were generated or analysed during the current study.

References

  1. Ahmad T, Chen X, Saqlain AS, Ma Y (2021) FPN-GAN: multi-class small object detection in remote sensing images. In2021 IEEE 6th international conference on cloud computing and big data analytics (ICCCBDA), IEEE, 478-482

  2. Alexey B, Wang CY, Mark Liao HY (2020) Optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934

  3. Ali B, Cheng MM, Jiang H, Li J (2015) Salient object detection: a benchmark. IEEE Trans Image Process 24(12):5706–5722

    Article  MathSciNet  Google Scholar 

  4. Ariadna Q, Torralba A (2009) Recognizing indoor scenes. In 2009 IEEE conference on computer vision and pattern recognition, IEEE. 413-420

  5. Ashwani K, Srivastava S (2020) Object detection system based on convolution neural networks using single shot multi-box detector. Procedia Computer Science 171:2610–2617

    Article  Google Scholar 

  6. Aziz L, MS FC, Ayub S (2021) Multi-level refinement enriched feature pyramid network for object detection. Image and Vision Computing 115:104287

    Article  Google Scholar 

  7. Cao D, Chen Z, Gao L (2020) An improved object detection algorithm based on multi-scaled and deformable convolutional neural networks. Human-centric Computing and Information Sciences 10(1):1–22

    Article  Google Scholar 

  8. Dawei D, Qi Y, Yu H, Yang Y, Duan K, Li ZW, Huang Q, Tian Q (2018) The unmanned aerial vehicle benchmark: Object detection and tracking. In Proceedings of the European Conference on Computer Vision (ECCV) 370–386

  9. Deng-Ping F, Lin Z, Zhang Z, Zhu M, Cheng MM (2020) Rethinking RGB-D salient object detection: models, data sets, and large-scale benchmarks. IEEE Transactions on Neural Networks and Learning Systems

  10. Duygu S, Corso JJ, Guru KA (2017) Detection and localization of robotic tools in robot-assisted surgery videos using deep neural networks for region proposal and detection. IEEE Trans Med Imaging 36(7):1542–1549

    Article  Google Scholar 

  11. Jalled F, Voronkov I (2016) Object detection using image processing.arXiv preprint arXiv:1611.07791

  12. Jawadul BH, Roy-Chowdhury AK (2016) CNN based region proposals for efficient object detection. In 2016 IEEE international conference on image processing (ICIP), IEEE 3658–3662

  13. Jifeng D, Li Y, He K, Sun J (2016) Object detection via region-based fully convolutional networks. In advances in neural information processing systems. 379–387

  14. Wang J, Sun K, Cheng T, Jiang B, Deng C, Zhao Y, Liu D, Mu Y, Tan M, Wang X, Liu W (2020) Deep high-resolution representation learning for visual recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 43(10):3349–3364

    Article  Google Scholar 

  15. Joseph R, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, real-time object detection. In proceedings of the IEEE conference on computer vision and pattern recognition, 779–788

  16. Junwei H, Zhang D, Cheng G, Liu N, Xu D (2018) Advanced deep-learning techniques for salient and category-specific object detection: a survey. IEEE Signal Process Mag 35(1):84–100

    Article  Google Scholar 

  17. Manisha V, Kumar B (2020) A survey paper on object detection methods in image processing. In 2020 international conference on computer science, engineering and applications (ICCSEA), IEEE 1–4

  18. Mark E, Gool LV, Williams CKI, Winn J, Zisserman A (2010) The pascal visual object classes (voc) challenge. Int J Comput Vis 88(2):303–338

    Article  Google Scholar 

  19. Mathivanan G (2021) Survey on object detection framework: evolution of algorithms. In2021 5th international conference on electronics, communication and aerospace technology (ICECA) 1–5

  20. Mingxing T, Pang R and Le QV (2020) Efficientdet: scalable and efficient object detection. In proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 10781–10790

  21. Minh-Tan P, Courtrai L, Friguet C, Lefèvre S, Baussard A (2020) One-stage detector of small objects under various backgrounds in remote sensing images. Remote Sens 12(15):2501

    Article  Google Scholar 

  22. Ning W, Gao Y, Chen H, Wang P, Tian Z, Shen C, Zhang Y (2020) Fast neural architecture search for object detection. In proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 11943–11951

  23. Peng Z, Ni B, Geng C, Hu J, Xu Y (2018) Scale-transferrable object detection. In proceedings of the IEEE conference on computer vision and pattern recognition 528–537

  24. Piotr D, Appel R, Belongie S, Perona P (2014) Fast feature pyramids for object detection. IEEE Trans Pattern Anal Mach Intell 36(8):1532–1545

    Article  Google Scholar 

  25. Pramanik A, Pal SK, Maiti J, Mitra P (2021) Granulated RCNN and multi-class deep sort for multi-object detection and tracking. IEEE Transact Emerg Top Comput Intell,1–11

  26. Prerna S, Gupta A, Aggarwal A, Gupta D, Khanna A, Hassanien AE, de Albuquerque VHC (2020) The health of things for classification of protein structure using improved grey wolf optimization. J Supercomput 76(2):1226–1241

    Article  Google Scholar 

  27. Ren Y, Zhu C, Xiao S (2018) Deformable faster r-cnn with aggregating multi-layer features for partially occluded object detection in optical remote sensing images. Remote Sens 10(9):1470

    Article  Google Scholar 

  28. Sachchidanand S, Singh N (2017) Object classification to analyze medical imaging data using deep learning. In 2017 international conference on innovations in information, embedded and communication systems (ICIIECS). IEEE 1–4

  29. Shipra O, Sakhare S (2015) Image processing techniques for object tracking in video surveillance-a survey. In 2015 international conference on pervasive computing (ICPC). IEEE 1–6

  30. Sindhia L, Kumar D (2020) An efficient moving object detection and tracking system based on fractional derivative. Multimed Tools Appl 79(13):8519–8537

    Google Scholar 

  31. Tian S, Kang L, Xing X, Tian J, Fan C, Zhang Y (2021) A relation-augmented embedded graph attention network for remote sensing object detection. IEEE Trans Geosci Remote Sens 60:1–17

    Google Scholar 

  32. Tomasz M, Alexei A (2007) Improving spatial support for objects via multiple segmentations

  33. Tsung-Yi L, Maire M, Belongie S, Hays J, Perona P, Ramanan D, Dollár P, Zitnick CL (2014) Common objects in context. In: European conference on computer vision. Springer, Cham, pp 740–755

    Google Scholar 

  34. Videira G (2019) Image processing and object detection for advanced driver assistance systems PhD diss

  35. Wang Y, Fathi A, Kundu A, Ross DA, Pantofaru C, Funkhouser T, Solomon J. Pillar-based object detection for autonomous driving. InEuropean Conference on Computer Vision, pp 18–34, Springer, Cham

  36. Wu Z, He S (2021) Improvement of the AlexNet networks for large-scale recognition applications. Iranian Journal of Science and Technology, Transactions of Electrical Engineering 45(2):493–503

    Article  Google Scholar 

  37. Xinyi Z, Gong W, Fu W, Du F (2017) Application of deep learning in object detection. In 2017 IEEE/ACIS 16th international conference on computer and information science (ICIS), IEEE, 631–634

  38. Lei Y, Yao X, Chen W, Zhang J, Mehnen J, Yang E (2020) Multiple object detection of workpieces based on fusion of deep learning and image processing. In 2020 International Joint Conference on Neural Networks (IJCNN). IEEE  1–7

Download references

Funding

No funding.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jay Laxman Borade.

Ethics declarations

Conflict of interest

No conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Borade, J.L., Lakshmi, M.A. Multi-class object detection system using hybrid convolutional neural network architecture. Multimed Tools Appl 81, 31727–31751 (2022). https://doi.org/10.1007/s11042-022-13007-7

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-022-13007-7

Keywords

Navigation