Content-Based Classification Approach for Video-Spam Identification

  • Conference paper

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 736)

Abstract

This paper addresses comment spam on YouTube. The work was carried out on a large, labeled dataset of text comments. The data were filtered and pre-processed to speed up detection, eliminate redundancies, and increase accuracy. The spam flag attached to each comment was used to check the accuracy of the implemented classification techniques. An improved algorithm based on term frequencies is also proposed. The results were compared on accuracy score and F-score, measured against the spam flag of each comment. Further, the accuracy of the SVM model was compared with respect to dataset size and pre-processing of the data, as well as against XGBoost.
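The abstract does not spell out the proposed algorithm, so the following is only a minimal sketch of the general idea, under stated assumptions: term frequencies are counted separately for spam and non-spam comments, a comment is flagged when its terms are relatively more frequent in spam, and predictions are scored against the spam flags with accuracy and F-score. The function names, the scoring rule, and the toy comments are all hypothetical and are not taken from the paper.

```python
import re
from collections import Counter


def tokenize(comment):
    """Lowercase a comment and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", comment.lower())


def train_term_frequencies(comments, flags):
    """Count term occurrences separately for spam (flag 1) and non-spam (flag 0)."""
    spam_tf, ham_tf = Counter(), Counter()
    for comment, flag in zip(comments, flags):
        (spam_tf if flag == 1 else ham_tf).update(tokenize(comment))
    return spam_tf, ham_tf


def predict(comment, spam_tf, ham_tf):
    """Return 1 (spam) if the comment's terms are relatively more frequent in spam.

    Assumed scoring rule: sum over terms of the difference between the
    term's relative frequency in spam and in non-spam comments.
    """
    spam_total = sum(spam_tf.values()) or 1
    ham_total = sum(ham_tf.values()) or 1
    score = sum(
        spam_tf[term] / spam_total - ham_tf[term] / ham_total
        for term in tokenize(comment)
    )
    return 1 if score > 0 else 0


def accuracy_and_f_score(true_flags, predicted_flags):
    """Compute accuracy and F-score (F1), treating spam (1) as the positive class."""
    pairs = list(zip(true_flags, predicted_flags))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    accuracy = sum(1 for t, p in pairs if t == p) / len(pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return accuracy, 0.0
    return accuracy, 2 * precision * recall / (precision + recall)


# Hypothetical toy data; the paper's dataset is not reproduced here.
train_comments = [
    "check out my channel and subscribe",
    "subscribe to my channel for free gifts",
    "great song love it",
    "this video made my day",
]
train_flags = [1, 1, 0, 0]

spam_tf, ham_tf = train_term_frequencies(train_comments, train_flags)
test_comments = ["please subscribe to my channel", "love this song"]
test_flags = [1, 0]
predictions = [predict(c, spam_tf, ham_tf) for c in test_comments]
accuracy, f_score = accuracy_and_f_score(test_flags, predictions)
```

A frequency-difference rule like this is only a stand-in baseline; the SVM and XGBoost models named in the abstract would instead be trained on term-frequency features with a machine learning library.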

Author information

Corresponding author

Correspondence to Palak Agarwal.

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this paper

Cite this paper

Agarwal, P., Sharma, M., Kaur, G. (2018). Content-Based Classification Approach for Video-Spam Identification. In: Abraham, A., Muhuri, P., Muda, A., Gandhi, N. (eds) Intelligent Systems Design and Applications. ISDA 2017. Advances in Intelligent Systems and Computing, vol 736. Springer, Cham. https://doi.org/10.1007/978-3-319-76348-4_23

  • DOI: https://doi.org/10.1007/978-3-319-76348-4_23

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-76347-7

  • Online ISBN: 978-3-319-76348-4

  • eBook Packages: Engineering, Engineering (R0)
