Content-Based Classification Approach for Video-Spam Identification

Agarwal, Palak; Sharma, Mahak; Kaur, Gagandeep

doi:10.1007/978-3-319-76348-4_23

Content-Based Classification Approach for Video-Spam Identification

Palak Agarwal¹⁸,
Mahak Sharma¹⁸ &
Gagandeep Kaur¹⁸

Conference paper
First Online: 22 March 2018

1804 Accesses

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 736))

Abstract

In this paper the authors have worked on YouTube comment spamming. The work has been carried out on a large and labeled dataset of text-comments. Filtration and pre-processing was done to speed up the detection, elimination of redundancies as well as to increase the accuracy. Spam flags on each set of text-comments were used to check the accuracy in implementation of classification techniques. An improved algorithm has also been proposed based on term frequencies. The results were compared based on accuracy-score and F-score considering the spam flag corresponding to each comment. Further, the accuracy of SVM model was compared with respect to size of dataset, pre-processing of data as well as with XGBoost.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Balakrishnan, A.: Google claims YouTube is 10x as Popular as Netflix or Facebook Video, and Approaching TV (2017). https://www.cnbc.com/2017/02/27/youtube-viewers-reportedly-watch-1-billion-hours-of-videos-a-day–us-tv-viewers-watch-125-billion-and-dropping.html
Google’s Bad Week: YouTube Loses Millions as Advertising Row Reaches US. https://www.theguardian.com/technology/2017/mar/25/google-youtube-advertising-extremist-content-att-verizon
Wattenhofer, M., Wattenhofer, R., Zhu, Z.: The YouTube social network. In: Proceedings of the Sixth International AAAI Conference on Weblogs and Social Media (2012)
Google Scholar
Chaudhary, V., Sureka, A.: Contextual feature based one-class classifier approach for detecting video response spam on Youtube. In: Eleventh Annual International Conference on Privacy, Security and Trust (PST), pp. 195–204, July 2013
Google Scholar
Jin, X., Lin, C.X., Luo, J., Han, J.: Social spam guard: a data mining-based spam detection system for social media networks. In: Proceedings of the Very Large Data Bases, pp. 1458–1461 (2011)
Google Scholar
O’Callaghan, D., Harrigan, M., Carthy, J., Cunningham, P.: Network analysis of recurring youtube spam campaigns. In: Proceedings of the 6th International Conference on Weblogs and Social Media (ICWSM 2012), Dublin, Ireland, pp. 531–534
Google Scholar
Abdulhamid, S.M., Latiff, M.S.A., Chiroma, H., Osho, O., Abdul-Salaam, G., Abubakar, A.I., Herawan, T.: A review on mobile SMS spam filtering techniques. IEEE Access 5, 15650–15666 (2017)
Article Google Scholar
Spirin, N., Han, J.: Survey on web spam detection: principles and algorithms. ACM SIGKDD Explor. Newsl. 13(2), 50–64 (2012)
Article Google Scholar
Ghiam, S., Pour, A.N.: A survey on web spam detection methods: taxonomy. Int. J. Netw. Secur. Appl. (IJNSA) 4(5), 119–134 (2012)
Google Scholar
Rădulescu, C., Dinsoreanu, M., Potolea, R.: Identification of spam comments using natural language processing techniques. In: IEEE International Conference on Intelligent Computer Communication and Processing (ICCP) (2014)
Google Scholar
Lesmeister, C.: Mastering Machine Learning with R. Packt Publishing Ltd., Birmingham (2017)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of CSE & IT, JIIT, Noida, Uttar Pradesh, India
Palak Agarwal, Mahak Sharma & Gagandeep Kaur

Authors

Palak Agarwal
View author publications
You can also search for this author in PubMed Google Scholar
Mahak Sharma
View author publications
You can also search for this author in PubMed Google Scholar
Gagandeep Kaur
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Palak Agarwal .

Editor information

Editors and Affiliations

Machine Intelligence Research Labs , Auburn, Washington, USA
Ajith Abraham
Department of Computer Science, South Asian University, Chanakyapuri, Delhi, India
Pranab Kr. Muhuri
Faculty of Information and Communication Technology, Universiti Teknikal Malaysia Melaka , Durian Tunggal, Melaka, Malaysia
Azah Kamilah Muda
Machine Intelligence Research Labs , Auburn, Washington, USA
Niketa Gandhi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Agarwal, P., Sharma, M., Kaur, G. (2018). Content-Based Classification Approach for Video-Spam Identification. In: Abraham, A., Muhuri, P., Muda, A., Gandhi, N. (eds) Intelligent Systems Design and Applications. ISDA 2017. Advances in Intelligent Systems and Computing, vol 736. Springer, Cham. https://doi.org/10.1007/978-3-319-76348-4_23

Download citation

DOI: https://doi.org/10.1007/978-3-319-76348-4_23
Published: 22 March 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-76347-7
Online ISBN: 978-3-319-76348-4
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics