Abstract
Convolutional neural networks have become one of the important research directions in the field of computer vision based on deep learning, and they are widely used in object detection with excellent performance. Based on the research results of many scholars, the paper reviews the application research of convolutional neural networks in object detection, introduces its application research from three aspects: object detection based on candidate region, object detection based on regression and video object detection, and finally summarizes the development of convolutional neural networks in the field of object detection.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014). https://doi.org/10.1109/CVPR.2014.81
He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 37(9), 1904–1916 (2015). https://doi.org/10.1109/TPAMI.2015.2389824
Zhou, H., Wang, Y., Ye, M.: A method of CNN traffic classification based on Sppnet. In: 2018 14th International Conference on Computational Intelligence and Security (CIS), pp. 390–394 (2018). https://doi.org/10.1109/CIS2018.2018.00093
Girshick, R.: Fast R-CNN. In: 2015 IEEE International Conference on Computer Vision (ICCV), pp. 1440–1448 (2015). https://doi.org/10.1109/ICCV.2015.169
Jiang, L., et al.: Application of a fast RCNN based on upper and lower layers in face recognition. Comput. Intell. Neurosci. 2021 (2021)
Ren, S.Q., He, K.M., Girshick, R., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Proceedings of the Annual Conference on Neural Information Processing Systems, pp. 91–99. NIPS Foundation Press, Montreal (2015)
Zhu, B., Wu, X., Yang, L., Shen, Y., Wu, L.: Automatic detection of books based on faster R-CNN. In: 2016 Third International Conference on Digital Information Processing, Data Mining, and Wireless Communications (DIPDMWC), pp. 8–12 (2016). https://doi.org/10.1109/DIPDMWC.2016.7529355
Shengbin, G., Shaojie, W., Liang, H., Ronghui, Z., Lin Xiaohan, W., Binyun.: Lightweight improvement based on faster-RCNN algorithm and its application in beach waste detection. J. Xiamen Univ. (Nat. Sci. Ed.) 61(02), 253–261 (2022)
Xu, D., Wang, L., Li, F., Guo, Y., Xing, K.: Application research of improved faster RCNN in grain insect object detection. Chin. J. Cereals Oils 37(04), 178–186 (2022)
Dai, J., Li, Y., He, K., et al.: R-FCN: object detection via region-based fully convolutional networks. In: Conference on Neural Information Processing Systems, pp. 379–387 (2016)
Zhigang, Z., Huan, L., Pengcheng, D., Guangbing, Z., Nan, W., Wei-Kun, Z.: Vehicle target detection based on R-FCN. In: 2018 Chinese Control and Decision Conference (CCDC), pp. 5739–5743 (2018). https://doi.org/10.1109/CCDC.2018.8408133
Liu, W., Dong, L., Qu, H.: Small-scale pedestrian detection with improved R-FCN model. Chin. J. Image Graph. 26(10), 2400–2410 (2021)
He, K., Gkioxari, G., Dollár, P., Girshick, R.: Mask R-CNN. In: 2017 IEEE International Conference on Computer Vision (ICCV), pp. 2980–2988 (2017). https://doi.org/10.1109/ICCV.2017.322
Zhibo, W., Shuangming, Z.: Road traffic sign recognition based on mask R-CNN. Surv. Mapp. Geogr. Inf. 47(03), 119–122 (2022). https://doi.org/10.14188/j.2095-6045,2019444
Safonova, A., et al.: Olive tree biovolume from UAV multi-resolution image segmentation with mask R-CNN. Sensors 21 (2021)
Shi, J., Zhou, Y., Zhang, W.X.Q.: Object detection based on improved mask RCNN in service robot. In: 2019 Chinese Control Conference (CCC), pp. 8519–8524 (2019). https://doi.org/10.23919/ChiCC.2019.8866278
Yin, S., Chen, X., Bei, X.: Improved mask RCNN algorithm and its application in pedestrian instance segmentation. Comput. Eng. 47(06): 271–276+283 (2021). https://doi.org/10.19678/j.issn.1000-3428.0058058
Cai, Z., Vasconcelos, N.: Cascade R-CNN: delving into high quality object detection. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6154–6162 (2018). https://doi.org/10.1109/CVPR.2018.00644
Wu, J., et al.: Defect detection method for transmission line small object based on Cascade R-CNN algorithm. Power Grid Clean Energy 38(04), 19–27+36 (2022)
Wenxia, B., Yangxun, R., Dong, L., Xianjun, Y., Qiuju, X.: Defect detection algorithm of anti-vibration hammer based on improved cascade R-CNN. In: 2020 International Conference on Intelligent Computing and Human-Computer Interaction (ICHCI), pp. 294–297 (2020). https://doi.org/10.1109/ICHCI51889.2020.00070
Li, Y., Chen, Y., Wang, N., Zhang, Z.-X.: Scale-aware trident networks for object detection. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 6053–6062 (2019). https://doi.org/10.1109/ICCV.2019.00615
Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 779–788 (2016). https://doi.org/10.1109/CVPR.2016.91
Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6517–6525 (2017). https://doi.org/10.1109/CVPR.2017.690
Redmon, J., Farhadi, A.: YOLOv3: an incremental improvement. arXiv:1804.02767 (2018)
Bochkovskiy, A., Wang, C.Y., Liao, H.: Yolov4: optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934 (2020)
Liu, D., Li, T., Du, Y., Cong, M.: Lightweight infrared real-time object detection algorithm based on MCA-YOLO. J. Huazhong Univ. Sci. Technol. (Nat. Sci. Ed.) 1–7 (2022). https://doi.org/10.13245/j.hust.239405
Yu, J., Wei, Z.: Face mask wearing detection algorithm based on improved YOLO-v4. Sensors 21 (2021)
Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y., Berg, A.C.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2
Jeong, J., Park, H., Kwak, N.: Enhancement of SSD by concatenating feature maps for object detection. In: Proceedings of the British Machine Vision Conference. BMVA Press, London (2017)
Fu, C.Y., Liu, W., Ranga, A., et al.: DSSD: deconvolutional single shot detector (2017). https://arxiv.org/pdf/1701.06659.pdf
Li, Z., Zhou, F.: FSSD: feature fusion single shot multibox detector. arXiv:1712.00960 (2017)
Zlu, X., Xiong, Y., Dai, J., et al.: Deep feature flow for video recognition. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 511–521 (2017)
Zlu, X.Z., Wang, Y.J., Dai, J.F., et al.: Flow-guided feature aggregation for video object detection. In: International Conference on Computer Visior (ICCV), Venice, Italy, pp. 408–417 (2017)
Bertasius, G., Torresani, L., Shi, J.: Object detection in video with spatiotemporal sampling networks. In: ECCV, Munich, Germany, pp. 342–357 (2018)
Jiang, Z., Liu, Y., Yang, C., et al.: Learning where to focus for efficient video object detection. In: ECCV, Glasgow, United Kingdom (2020)
Gao, H.: Research on video object detection algorithm based on deep learning. University of Electronic Science and Technology of China (2021). https://doi.org/10.27005/d.cnki.gdzku.2021.003628
Yuhu, S., Qigui, Z.: A fast video object detection method based on local attention. Comput. Eng. 48(05), 314–320 (2022). https://doi.org/10.19678/j.issn.1000-3428.0061362
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Kong, S., Zhou, C., Sun, J. (2023). A Review of the Application of Convolutional Neural Networks in Object Detection. In: Liang, Q., Wang, W., Mu, J., Liu, X., Na, Z. (eds) Artificial Intelligence in China. AIC 2022. Lecture Notes in Electrical Engineering, vol 871. Springer, Singapore. https://doi.org/10.1007/978-981-99-1256-8_21
Download citation
DOI: https://doi.org/10.1007/978-981-99-1256-8_21
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-1255-1
Online ISBN: 978-981-99-1256-8
eBook Packages: Computer ScienceComputer Science (R0)