Abstract
In the information age, massive Internet data brings convenience to us. But there is some inappropriate visual content (pornography, violence, politics, terrorism, etc.), among which the dissemination of pornographic content has an adverse influence, especially for children and minors. Therefore, we present an inappropriate visual content detection method based on the joint training strategy in an end-to-end manner, which realizes the identification and location of inappropriate visual content while retaining the base class (80 categories in the COCO dataset) detection. To solve the difficulty of sample labeling, in this paper we propose a combined training strategy of detection and classification. And the Focal loss is used to improve the sample imbalance in the network sharing training. The algorithm can achieve multi-label output and has good recognition accuracy. Finally, a more challenging dataset INVC of inappropriate visual content is proposed, which includes three types of sample data in complex backgrounds at different scales, such as indoor, beach, street, etc.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Short, M., Black, L., Smith, A., Wetterneck, C., Wells, D.: A review of internet pornography use research: methodology and content from the past 10 years. Cyberpsychol. Behav. Soc. Netw. 15(1), 13–23 (2012)
Nuraisha, S., Pratama, F.I., Budianita, A., Soeleman, M.A.: Implementation of K-NN based on histogram at image recognition for pornography detection. In: Proceedings of the 2017 International Seminar on Application for Technology of Information and Communication (iSemantic), Semarang, Indonesia, pp. 5–10 (2017)
Garcia, M.B., Revano, T.F., Habal, B.G.M., Contreras, J.O., Enriquez, J.B.R.: A pornographic image and video filtering application using optimized nudity recognition and detection algorithm. In: Proceedings of the 2018 IEEE 10th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM), Baguio City, Philippines, pp. 1–5 (2018)
Santos, C., Dos Santos, E.M., Souto, E.: Nudity detection based on image zoning. In: Proceedings of the 2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA), Montreal, QC, Canada, pp. 1098–1103 (2012)
Moreira, D.C., Fechine, J.M.: A machine learning-based forensic discriminator of pornographic and bikini images. In: Proceedings of the 2018 International Joint Conference on Neural Networks (IJCNN), Rio de Janeiro, Brazil, pp. 1–8 (2018)
Wehrmann, J., Simões, G.S., Barros, R.C., Cavalcante, V.F.: Adult content detection in videos with convolutional and recurrent neural networks. Neurocomputing 272, 432–438 (2018)
da Silva, M.V., Marana, A.N.: Spatiotemporal CNNs for pornography detection in videos. In: Vera-Rodriguez, R., Fierrez, J., Morales, A. (eds.) CIARP 2018. LNCS, vol. 11401, pp. 547–555. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-13469-3_64
More, M.D., Souza, D.M., Wehrmann, J., Barros, R.C.: Seamless nudity censorship: an image-to-image translation approach based on adversarial training. In: International Joint Conference on Neural Networks (IJCNN), pp. 1–8. IEEE (2018)
Lambert, J., Liu, Z., Sener, O., Hays, J., Koltun, V.: MSeg: a composite dataset for multi-domain semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2020)
He, T., Zhang, Z., Zhang, H., Zhang, Z., Xie, J., Li, M.: Bag of tricks for image classification with convolutional neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 558–567 (2019)
AlDahoul, N., et al.: Transfer detection of YOLO to focus CNN’s attention on nude regions for adult content detection. Symmetry 13(1), 26 (2020). https://doi.org/10.3390/sym13010026
Lin, T., Goyal, P., Girshick, R., He, K., Dollar, P.: Focal loss for dense object detection. IEEE Trans. Pattern Anal. Mach. Intell. 42(2), 318–327 (2020)
Redmon, J., Farhadi, A.: YOLOv3: an incremental improvement. arXiv preprint arXiv:1804.02767 (2018)
Lin, T., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2117–2125 (2017)
Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6517–6525 (2017)
Acknowledgement
This work was supported by the Key Research and Development Program of China under Grant 2018YFC0831000.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Wang, X., Liu, J., Liu, X., Li, Y., Yu, L. (2023). Inappropriate Visual Content Detection Based on the Joint Training Strategy. In: Sun, J., Wang, Y., Huo, M., Xu, L. (eds) Signal and Information Processing, Networking and Computers. Lecture Notes in Electrical Engineering, vol 917. Springer, Singapore. https://doi.org/10.1007/978-981-19-3387-5_131
Download citation
DOI: https://doi.org/10.1007/978-981-19-3387-5_131
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-3386-8
Online ISBN: 978-981-19-3387-5
eBook Packages: EngineeringEngineering (R0)