Abstract
Over the past few years, as environmental problems have been progressively deteriorating, waste classification has also become a research hotspot. As an algorithm with better detection accuracy and speed, SSD (Single Shot MultiBox Detector) has made great progress in many aspects. However, it can’t achieve a good detection effect for small objects because it does not make full use of high-level semantic information. In this paper, we propose a MobileNet-SSD model with FPN to solve the problem of waste detection, which can reduce parameters, narrow internal space and improve performance for small objects compared with SSD model. Besides, Focal Loss is adopted to reduce the imbalance between foreground and background samples to enhance detector effect. To verify the effectiveness of the approach proposed here, a dataset for waste detection is collected and annotated. As revealed from the experimental results, the mAP of our method is 93.63% and the speed is 102 FPS, which can beat other methods.
Similar content being viewed by others
References
Girshick R, Donahue J, Darrell T et al (2015) Region-based convolutional networks for accurate object detection and segmentation. IEEE Trans Pattern Anal Mach Intell 38(1):142–158
He K, Zhang X, Ren S et al (2015) Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans Pattern Anal Mach Intell 37(9):1904–1916
Girshick R (2015) Fast R-CNN. In: Proceedings of the IEEE international conference on computer vision, pp 1440–1448
Ren S, He KM, Girshick R et al (2015) Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39(6):1137–1149
He KM, Gkioxari G, Dollár P, et al (2017) Mask R-CNN. In: Proceedings of 2017 IEEE international conference on computer vision, pp 2980–2988
Cai ZW, Vasconcelos N (2018) Cascade R-CNN: Delving into high quality object detection. In: Proceedings of IEEE conference on computer vision and pattern recognition, pp 6154–6162
Dai JF, Li Y, He KM, and Sun J (2016) R-FCN: object detection via region-based fully convolutional networks. In: Proceedings of the 30th international conference on neural information processing systems, pp 379–387
Redmon J, Divvala S, Girshick R, et al (2016) You only look once: unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 779–788
Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY and Berg AC (2016) SSD: single shot multibox detector. In: In European conference on computer vision. Springer, pp 21–37
Fu CY, Liu W, Ranga A, et al. DSSD: Deconvolutional single shotdetector[EB/OL].(2017–01–23)[2019–08–28]. https://arxiv.org/abs/1701.06659
Lin TY, Goyal P, Girshick R et al (2017) Focal loss for dense object detection. IEEE Trans Pattern Anal Mach Intell 99:2999–3007
Li ZX, Zhou FQ. FSSD: feature fusion single shot multibox detector[EB/QL] (2018-05-17) [2019-08-28]. https://arxiv.org/abs/1712.00960
Yi JR, Wu PX, Metaxas, Dimitris N (2019) ASSD: attentive single shot multibox detector. Comput Vis Image Underst 189:102827.1–102827.7
Chua SND, Lim SF, Lai SN et al (2019) Development of a child detection system with artificial intelligence using object detection method. J Electr Eng Technol 14:2523–2529
Huang WZ, Zhang SW (2017) A novel face recognition algorithm based on the deep convolution neural network and key points detection jointed local binary pattern methodology. J Electr Eng Technol 12(1):363–372
Ren J, Han J (2021) A new multi-scale pedestrian detection algorithm in traffic environment. J Electr Eng Technol 16:1151–1161
Bayrakdar ME (2019) A smart insect pest detection technique with qualified underground wireless sensor nodes for precision agriculture. IEEE Sens J 19(22):10892–10897
Bayrakdar ME (2020) Cooperative communication based access technique for sensor networks. Int J Electron 107(2):212–225
Bayrakdar ME (2020) Exploiting cognitive wireless nodes for priority-based data communication in terrestrial sensor networks. ETRI J 42(1):36–45
Simonyan K, Zisserman (2015) A Very deep convolutional networks for large-scale image recognition. In: Proceedings of the 3rd international conference on learning representations
Howard AG, Zhu M, Chen B, et al (2017) MobileNets: Efficient convolutional neural networks for mobile vision applications. https://arxiv.org/abs/1704.04861
Lin T Y, Dollár P, Girshick R, et al (2017) Feature pyramid networks for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2117–2125
Pan SJ, Yang Q (2009) A survey on transfer learning. IEEE Trans Knowl Data Eng 22(10):1345–1359
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Meng, J., Jiang, P., Wang, J. et al. A MobileNet-SSD Model with FPN for Waste Detection. J. Electr. Eng. Technol. 17, 1425–1431 (2022). https://doi.org/10.1007/s42835-021-00960-w
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s42835-021-00960-w