Malicious Webpage Classification Using Deep Learning Technique

Conference paper
First Online: 20 July 2021

pp 465–475
Cite this conference paper

Artificial Intelligence and Sustainable Computing

Kushagra Krishna⁸,
Jaytrilok Choudhary⁸ &
Dhirendra Pratap Singh ORCID: orcid.org/0000-0001-5519-3928⁸

Part of the book series: Algorithms for Intelligent Systems ((AIS))

405 Accesses

Abstract

There has been a sudden rapid increase in the number of malicious webpages in the recent years, so the process of determining the authenticity of a web link has become very difficult. Malicious webpage developers have been able to evade the signature-based detection techniques very easily and comfortably. The malicious webpage can be detected through basically static and dynamic analysis. In this paper, dynamic deep learning techniques have been explored that convert the links of the webpages to an image file and classifies the web pages as malicious or benign. The deep learning techniques (VGG 16 and VGG 19) are utilized for the classification of web pages. These models provide an accuracy of 99.8% in classifying malware webpage image file samples.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Malicious Website Detection Through Deep Learning Algorithms

Chapter © 2022

Malicious Webpage Classification

Chapter © 2021

Malicious Web Page Detection: A Machine Learning Approach

Chapter © 2014

References

Sanou B (2016) ICT: facts and Figures. http://www.itu.int/en/ITU-D/Statistics/Documents/facts/ICTFactsFigures2016.pdf
Ollmann G (2008) The Evolution of commercial malware development kits and colour-by-numbers custom malware. Comput Fraud Secur 2008(9):4–7
Article Google Scholar
Kaspersky (2014) Cybercrime, Inc.: how profitable is the business? URL: https://blog.kaspersky.com/cybercrime-inc-how-profitable-is-the-business/15034/
Choi H, Zhu BB, Lee F (2012) Detecting malicious web links and identifying their attack types. In: 2nd proceedings of USENIX conference on Web application development. Berkeley: USENIX Association, CA, USA, pp 11–11
Google Scholar
Canali D, Cova M, Prophiler GV (2011) A fast filter for the large-scale detection of malicious web pages. In: 20th proceedings of the international conference on World Wide Web. ACM, New York, USA, pp 197–206
Google Scholar
Kumar R, Zhang X, Ahmad Tariq H, Khan RU (2017) Malicious URL detection using multi-layer filtering model. In: International computer conference on wavelet active media technology and information processing
Google Scholar
Chipman HA, George EI, McCulloch RE (1998) Bayesian CART Model Search. J Am Stat Assoc 93(443):935–948
Article Google Scholar
Steinberg D, Colla P (2009) CART: classification and regression trees. The top ten algorithms in data mining, pp 179–201
Google Scholar
Yue T, Sun J, Chen H (2013) Fine-grained mining and classification of malicious web pages. In: 4th International conference on digital manufacturing & automation
Google Scholar
Kazemian HB, Ahmed S (2011) Comparisons of machine learning techniques for detecting malicious webpages. Expert Syst Appl 42(3):1166–1177
Article Google Scholar
Kabanga EK, Kim CH (2018) Malware images classification using convolution neural network. J Comput Commun 6:153–158
Article Google Scholar
Nguyen A, Yosinki J, Clune J (2015) Deep neural networks are easily fooled: high confidence predictions for unrecognizable images. IEEE Comput Vision Pattern Recog (CVPR)
Google Scholar
Wang X, Liu J, Chen X (2015) first place team: say no to over fitting. Winner of microsoft malware classification challenge
Google Scholar
Abdi FD, Wenjuan L (2017) Malicious URL detection using convolution neural network. Int J Comput Sci Eng Inf Technol (IJCSEIT) 7(6)
Google Scholar
Wang Y, Cai WD, Wei PC (2016) A deep learning approach for detecting malicious javascript code. Secur Commun Netw 9(11)
Google Scholar
Xu L, Zhan Z, Xu S, Ye K (2013) Cross-layer detection of malicious websites. In: 3rd proceedings of ACM conference on Data and application security and privacy, pp 141–152
Google Scholar
Shibahara T, Yamanishi K, Takata Y, Chiba D, Akiyama M, Yagi T, Ohsita Y, Murata M (2017) Malicious URL sequence detection using event de-noising convolutional neural network. In: ICC 2017 IEEE international conference on communications, pp 1–7
Google Scholar
Antonakakis M, Perdisci R, Dagon D, Lee W, Feamster N (2010) Building a dynamic reputation system for DNS. In: 19th proceedings of USENIX security symposium, pp 273–290
Google Scholar
Tanner MA, Wong WH (1987) The calculation of posterior distributions by data augmentation. J Am Stat Assoc 82(398):528–540
Article MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Maulana Azad National Institute of Technology, Bhopal, India
Kushagra Krishna, Jaytrilok Choudhary & Dhirendra Pratap Singh

Authors

Kushagra Krishna
View author publications
You can also search for this author in PubMed Google Scholar
Jaytrilok Choudhary
View author publications
You can also search for this author in PubMed Google Scholar
Dhirendra Pratap Singh
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Electrical Engineering, Madhav Institute of Technology and Science, Gwalior, India
Hari Mohan Dubey
Department of Electrical Engineering, Madhav Institute of Technology and Science, Gwalior, India
Manjaree Pandit
Department of Electrical Engineering, Madhav Institute of Technology and Science, Gwalior, India
Laxmi Srivastava
Department of Electrical Engineering, Indian Institute of Technology Delhi, Delhi, India
Bijaya Ketan Panigrahi

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Krishna, K., Choudhary, J., Singh, D.P. (2022). Malicious Webpage Classification Using Deep Learning Technique. In: Dubey, H.M., Pandit, M., Srivastava, L., Panigrahi, B.K. (eds) Artificial Intelligence and Sustainable Computing. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-16-1220-6_39

Download citation

DOI: https://doi.org/10.1007/978-981-16-1220-6_39
Published: 20 July 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-1219-0
Online ISBN: 978-981-16-1220-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions