Abstract
In this paper the authors have worked on YouTube comment spamming. The work has been carried out on a large and labeled dataset of text-comments. Filtration and pre-processing was done to speed up the detection, elimination of redundancies as well as to increase the accuracy. Spam flags on each set of text-comments were used to check the accuracy in implementation of classification techniques. An improved algorithm has also been proposed based on term frequencies. The results were compared based on accuracy-score and F-score considering the spam flag corresponding to each comment. Further, the accuracy of SVM model was compared with respect to size of dataset, pre-processing of data as well as with XGBoost.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Balakrishnan, A.: Google claims YouTube is 10x as Popular as Netflix or Facebook Video, and Approaching TV (2017). https://www.cnbc.com/2017/02/27/youtube-viewers-reportedly-watch-1-billion-hours-of-videos-a-day–us-tv-viewers-watch-125-billion-and-dropping.html
Google’s Bad Week: YouTube Loses Millions as Advertising Row Reaches US. https://www.theguardian.com/technology/2017/mar/25/google-youtube-advertising-extremist-content-att-verizon
Wattenhofer, M., Wattenhofer, R., Zhu, Z.: The YouTube social network. In: Proceedings of the Sixth International AAAI Conference on Weblogs and Social Media (2012)
Chaudhary, V., Sureka, A.: Contextual feature based one-class classifier approach for detecting video response spam on Youtube. In: Eleventh Annual International Conference on Privacy, Security and Trust (PST), pp. 195–204, July 2013
Jin, X., Lin, C.X., Luo, J., Han, J.: Social spam guard: a data mining-based spam detection system for social media networks. In: Proceedings of the Very Large Data Bases, pp. 1458–1461 (2011)
O’Callaghan, D., Harrigan, M., Carthy, J., Cunningham, P.: Network analysis of recurring youtube spam campaigns. In: Proceedings of the 6th International Conference on Weblogs and Social Media (ICWSM 2012), Dublin, Ireland, pp. 531–534
Abdulhamid, S.M., Latiff, M.S.A., Chiroma, H., Osho, O., Abdul-Salaam, G., Abubakar, A.I., Herawan, T.: A review on mobile SMS spam filtering techniques. IEEE Access 5, 15650–15666 (2017)
Spirin, N., Han, J.: Survey on web spam detection: principles and algorithms. ACM SIGKDD Explor. Newsl. 13(2), 50–64 (2012)
Ghiam, S., Pour, A.N.: A survey on web spam detection methods: taxonomy. Int. J. Netw. Secur. Appl. (IJNSA) 4(5), 119–134 (2012)
Rădulescu, C., Dinsoreanu, M., Potolea, R.: Identification of spam comments using natural language processing techniques. In: IEEE International Conference on Intelligent Computer Communication and Processing (ICCP) (2014)
Lesmeister, C.: Mastering Machine Learning with R. Packt Publishing Ltd., Birmingham (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Agarwal, P., Sharma, M., Kaur, G. (2018). Content-Based Classification Approach for Video-Spam Identification. In: Abraham, A., Muhuri, P., Muda, A., Gandhi, N. (eds) Intelligent Systems Design and Applications. ISDA 2017. Advances in Intelligent Systems and Computing, vol 736. Springer, Cham. https://doi.org/10.1007/978-3-319-76348-4_23
Download citation
DOI: https://doi.org/10.1007/978-3-319-76348-4_23
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-76347-7
Online ISBN: 978-3-319-76348-4
eBook Packages: EngineeringEngineering (R0)