Skip to main content

Phishing Website Prediction: A Machine Learning Approach

  • Conference paper
  • First Online:
Progress in Advanced Computing and Intelligent Engineering

Abstract

Phishing is an act of stealing our precious, personal, sensitive data such as credentials that which we use for accessing the resources and services, available across the cyberspace. Seeing these rapidly growing phishing attacks and their adverse effect on the businesses including individual users, it has now become a need for the organizations and individuals worldwide to be able to effectively predict the phishing website and differentiate them from legitimate ones. The aim of this research paper is to efficiently predict the phishing websites so that users may be benefitted from this study and prevent them from getting trapped. In this paper, machine learning techniques are used for prediction. Data mining is used worldwide by almost every face of the society viz. business organizations, govt. organizations, and other kind of data collectors to extract knowledge from the collected data. On the other hand, machine learning is a data mining technique that is used to analyze, classify the data, and efficiently predict the results for the estimation and planning by all of the organizations all around the globe. Classification algorithms, namely logistic regression, decision tree, and random forest classification, are used to predict the fake websites and presented their comparison of their predictions achieved. The results have been presented in numeric format as well as graphically with the help of chart. The data used is taken from UCI machine learning online repository. The seed value is changed and analyzed, and results achieved are least accuracy of 95.93, 97.96, and 98.78% of accuracy as the highest as well. Some future study and applying some good practices may help in designing a better and more accurate solution for the prediction of the phishing website, just by examining the URL and its features.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. APWG | Phishing Activity Trends Reports (2020). https://apwg.org/trendsreports/. Last Accessed 05 May 2020

  2. Kiruthiga, R., Akila, D.: Phishing websites detection using machine learning. 2, 111–114 (2019). https://doi.org/10.35940/ijrte.B1018.0982S1119

  3. Koray, O., et al.: Machine learning based phishing detection from URLs. Expert Syst. Appl. 117, 345–357 (2019). https://doi.org/10.1016/j.eswa.2018.09.029

    Article  Google Scholar 

  4. Mao, J.: Phishing page detection via learning classifiers from page layout feature (2019)

    Google Scholar 

  5. Leng, K., et al.: A new hybrid ensemble feature selection framework for machine learning-based phishing detection system. Inf. Sci. (Ny) 484, 153–166 (2019). https://doi.org/10.1016/j.ins.2019.01.064

    Article  Google Scholar 

  6. Wang, W. et al.: PDRCNN : precise phishing detection with recurrent convolutional neural networks, (2019)

    Google Scholar 

  7. Surbhi Gupta, A.S.: Dynamic classification mining techniques for predicting phishing. URL. Presented at the (2018)

    Google Scholar 

  8. Kumar, A., Gupta, J.B.B.: A machine learning based approach for phishing detection using hyperlinks information number of unique phishing sites detected. J. Ambient Intell. Humaniz. Comput. (2018). https://doi.org/10.1007/s12652-018-0798-z

  9. Comparison of Classification Algorithms to Detect Phishing Web Pages Using Feature Selection and. 4, (2016). https://doi.org/10.5281/zenodo.61181

  10. Chiew, K.L., et al.: Utilisation of website logo for phishing detection. Comput. Secur. (2015). https://doi.org/10.1016/j.cose.2015.07.006

    Article  Google Scholar 

  11. Abdelhamid, N., Abdel-jaber, H.: Learning comparison based on models content and features. 72–77 (2017)

    Google Scholar 

  12. Ali, W.: Phishing website detection based on supervised machine learning with wrapper features selection. 8(9), 72–78 (2017)

    Google Scholar 

  13. Ho, T.K.: Random decision forests. Proc. Int. Conf. Doc. Anal. Recognition, ICDAR. 1, 278–282 (1995). https://doi.org/10.1109/ICDAR.1995.598994

  14. UCI Machine Learning Repository: Phishing Websites Data Set (2019). https://archive.ics.uci.edu/ml/datasets/Phishing+Websites, Last Accessed 19 Apr 2020

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Anjaneya Awasthi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Awasthi, A., Goel, N. (2021). Phishing Website Prediction: A Machine Learning Approach. In: Panigrahi, C.R., Pati, B., Pattanayak, B.K., Amic, S., Li, KC. (eds) Progress in Advanced Computing and Intelligent Engineering. Advances in Intelligent Systems and Computing, vol 1299. Springer, Singapore. https://doi.org/10.1007/978-981-33-4299-6_12

Download citation

  • DOI: https://doi.org/10.1007/978-981-33-4299-6_12

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-33-4298-9

  • Online ISBN: 978-981-33-4299-6

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics