Skip to main content
Log in

DVRGNet: an efficient network for extracting obscenity from multimedia content

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

The availability of adult content on the internet through images or videos is easily accessible to minors and adults as well. In addition, this type of content may lead to poor mental health, sexism and objectification, and sexual violence. Therefore, It is extremely important to detect and classify pornographic content. In this paper, DVRGNet, a hierarchical CNN framework for the detection and classification of obscene content from videos is proposed. The proposed framework incorporates motion data and the capture of motion movement to deal with the problem of mapping skin exposure to pornographic content. DVRGNet is a network that leverages DenseNet, VGGNet, ResNet, and GoogLeNet for feature extraction. This network includes different fusions of various sub-networks, which can be seen as diverse tiers of neurons in human brains. The framework also incorporates a 5-layer Bi-LSTM-based classification of obscenity from videos. The proposed framework makes better use of automated pornography detection through computational intelligence architectures. Furthermore, the fusion of these four networks strengthens feature propagation by reducing the vanishing gradient problem. Extensive experiments are conducted on Pornography-2K and Pornography-800 datasets to validate the effectiveness of the proposed framework. The proposed framework achieves an accuracy of 99.42% on the Pronography-2K and 99.04% on the Pornography-800 datasets. An ablation study is also conducted to demonstrate the performance of proposed framework.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Algorithm 1.
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Data availability

Data sharing not applicable to this article as no datasets were generated during the current study.

References

  1. Shojae Chaeikar S, Zamani M, Abdul Manaf AB, Zeki AM (2018) PSW statistical LSB image steganalysis. Multimed Tools Appl 77(1):805–835

    Article  Google Scholar 

  2. Karamizadeh S, Shojae Chaeikar S, Jolfaei A (2023) Adult content image recognition by Boltzmann machine limited and deep learning. Evol Intel 16:1185–1194. https://doi.org/10.1007/s12065-022-00729-8

    Article  Google Scholar 

  3. Dines G (2017) Growing up with porn: the developmental and societal impact of pornography on children. Dignity 2(3):3

    Article  Google Scholar 

  4. Perez M, Avila S, Moreira D, Moraes D, Testoni V, Valle E, ... Rocha A (2017) Video pornography detection through deep learning techniques and motion information. Neurocomputing 230:279–293

  5. Jones MJ, Rehg JM (2002) Statistical color models with application to skin detection. Int J Comput Vision 46(1):81–96

    Article  Google Scholar 

  6. Rowley HA, Jing Y, Baluja S (2006) Large-scale image-based adult-content filtering. 1st International Conference on Computer Vision Theory

  7. Lee S, Shim W, Kim S (2009) Hierarchical system for objectionable video detection. IEEE Trans Consum Electron 55(2):677–684

    Article  Google Scholar 

  8. Bouirouga H, Elfkihi S, Jilbab A, Aboutajdine D, El Fkihi S (2012) Skin detection in pornographic videos using threshold technique. J Theor Appl Inf Technol 35(1):7–19

    Google Scholar 

  9. Wehrmann J, Simões GS, Barros RC, Cavalcante VF (2018) Adult content detection in videos with convolutional and recurrent neural networks. Neurocomputing 272:432–438

    Article  Google Scholar 

  10. Jiang YG, Wu Z, Wang J, Xue X, Chang SF (2017) Exploiting feature and class relationships in video categorization with regularized deep neural networks. IEEE Trans Pattern Anal Mach Intell 40(2):352–364

    Article  PubMed  Google Scholar 

  11. Avila S, Thome N, Cord M, Valle E, de A. Araújo A (2011) BOSSA: Extended bow formalism for image classification. In: 2011 18th IEEE International Conference on Image Processing, Brussels, pp 2909–2912. https://doi.org/10.1109/ICIP.2011.6116268

  12. Avila S, Thome N, Cord M, Valle E, AraúJo ADA (2013) Pooling in image representation: the visual codeword point of view. Comput Vis Image Underst 117(5):453–465

    Article  Google Scholar 

  13. Wang L, Zhang J, Wang M, Tian J, Zhuo L (2020) Multilevel fusion of multimodal deep features for porn streamer recognition in live video. Pattern Recogn Lett 140:150–157

    Article  ADS  Google Scholar 

  14. Cheng F, Wang SL, Wang XZ, Liew AWC, Liu GS (2019) A global and local context integration DCNN for adult image classification. Pattern Recogn 96

    Article  Google Scholar 

  15. da Silva MV, Marana AN (2018) Spatiotemporal CNNs for pornography detection in videos. In: Iberoamerican Congress on Pattern Recognition, pp 547–555. Springer International Publishing, Cham

  16. Gautam N, Vishwakarma DK (2022) Obscenity detection in videos through a sequential convnet pipeline classifier. In: IEEE Transactions on Cognitive and Developmental Systems 15(1):310–318

  17. Yousaf K, Nawaz T (2022) A deep learning-based approach for inappropriate content detection and classification of youtube videos. IEEE Access 10:16283–16298

    Article  Google Scholar 

  18. Huang G, Liu Z, Van Der Maaten L, Weinberger KQ (2017) Densely connected convolutional networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4700–4708

  19. Khan SI, Shahrior A, Karim R, Hasan M, Rahman A (2022) MultiNet: a deep neural network approach for detecting breast cancer through multi-scale feature fusion. Journal of King Saud University-Computer and Information Sciences 34(8):6217–6228

    Article  Google Scholar 

  20. Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556

  21. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition pp 770–778

  22. Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–9

  23. Guo MH, Xu TX, Liu JJ, Liu ZN, Jiang PT, Mu TJ, Zhang S-H, Martin RR, Cheng M-M, Hu S-M (2022) Attention mechanisms in computer vision: a survey. Comput Vis Media 8:1–38

    Google Scholar 

  24. Gangwar A, González-Castro V, Alegre E, Fidalgo E (2021) AttM-CNN: attention and metric learning based CNN for pornography, age and child sexual abuse (CSA) detection in images. Neurocomputing 445:81–104

    Article  Google Scholar 

  25. Chen J, Liang G, He W, Xu C, Yang J, Liu R (2020) A pornographic images recognition model based on deep one-class classification with visual attention mechanism. IEEE Access 8:122709–122721

    Article  Google Scholar 

  26. Samal S, Nayak R, Jena S et al (2023) Obscene image detection using transfer learning and feature fusion. Multimed Tools Appl. https://doi.org/10.1007/s11042-023-14437-7

    Article  PubMed  PubMed Central  Google Scholar 

  27. Samal S, Zhang Y‐D, Gadekallu TR, Nayak R, Balabantaray BK (2023) SBMYv3: improved MobYOLOv3 a BAM attention‐based approach for obscene image and video detection. Expert Systems e13230

  28. Huang C, Yuan C, Zhang J (2020) Violation detection of live video based on deep. Learning. https://doi.org/10.1155/2020/1895341

    Article  Google Scholar 

  29. Aggarwal K, Mijwil MM, Al-Mistarehi AH, Alomari S, Gök M, Alaabdin AMZ, Abdulrhman SH (2022) Has the future started? The current growth of artificial intelligence, machine learning, and deep learning. Iraqi J Comput Sci Math 3(1):115–123

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Vijay Kumar.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Competing interest

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Rautela, K., Sharma, D., Kumar, V. et al. DVRGNet: an efficient network for extracting obscenity from multimedia content. Multimed Tools Appl 83, 28807–28825 (2024). https://doi.org/10.1007/s11042-023-16619-9

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-16619-9

Keywords

Navigation