Skip to main content

Detecting Phishing Websites Using Machine Learning

  • Conference paper
  • First Online:
Intelligent and Cloud Computing

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 286))

  • 381 Accesses

Abstract

The entire world is digitizing at a rapid pace. However, the ever-evolving transformation comes with its fair share of vulnerabilities, opening the doors wider for cybercriminals. One of the common types of attacks criminals indulge in these days is phishing. It involves creation of websites to dupe unsuspecting users into thinking that they are on a legitimate site, making them disclose confidential information like bank account details, usernames, and passwords. This paper aims to find out accurately if a URL is reliable or not, in other words, if it is a phished one or otherwise. Machine Learning (ML) based models provide an efficient way to detect these phishing attacks. This research paper focuses on using three different ML algorithms—Logistic Regression, Support Vector Machine (SVM), and Random Forest Classifier in order to find the most accurate model to predict whether a given URL is safe or not. To achieve this, the respective models are trained using a pre-existing data set and then tested as to whether they can accurately classify the websites or not. The algorithms are also compared based on performance measures like Precision, Accuracy, F1 Score, and Recall to deduce which one of the three is most efficient and reliable for classification and prediction.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 249.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Basnet, R.B., Sung, A.H., Quingzhong L.: Rule-based phishing attack detection. In: Proceedings of the International Conference on Security and Management Steering Committee of World Congress in Computer Science (World Comp) (2011)

    Google Scholar 

  2. Jain, A.K., Gupta, B.B.: A machine learning based approach for phishing detection using hyperlinks. J AIHC 10, 2015–2028 (2019)

    Google Scholar 

  3. Alam, M.N., Sarma, D., Lima, F.F., Saha, I., Ulfath, R.-E., Hossain, S.: Phishing attacks detection using machine learning approach. In: 2020 (ICSSIT), pp. 1173–1179 (2020). doi: https://doi.org/10.1109/ICSSIT48917.2020.9214225

  4. Mahdieh, D.D., Zabihimayvan.: Fuzzy rough set feature selection enhance detection of phishing attack. In: 2019 Systems (FUZZ-IEEE). IEEE (2019)

    Google Scholar 

  5. Zamir, A., Khan, H.U., Iqbal, T., Yousaf, N., Aslam, F., Anjum, A., Hamdani, M.: Phishing web site detection using diverse machine learning algorithms. Electron. Libr. 38(1), 65–80 (2019). https://doi.org/10.1108/EL-05-2019-0118

    Article  Google Scholar 

  6. PurviPujara, et al.: Int J S Res CSE & IT.: Phishing website detection machine learning: a review. In: 2018 Sept-Oct-2018 3(7), 395–399

    Google Scholar 

  7. Wardman, B., Stallings, T., Warner, G., Skjellum, A.: High-performance content-based phishing attack detection.2011 eCrime Researchers Summit (2011). doi:https://doi.org/10.1109/ecrime.2011.6151977

  8. Akinyelu, A.A., Adewumi, A.O.: Phishing email classification using random forest ML technique. J. Appl. Math. 2014, 1–6 (2014). https://doi.org/10.1155/2014/425731

    Article  Google Scholar 

  9. Joshi, A.N., Pattanshetti, T.R.: Phishing attack detection using feature selection techniques. In: International Conference on Communication and Information Processing (ICCIP-2019), College of Engineering, Pune, Wellesley Road, Shivaji Nagar, Pune, India

    Google Scholar 

  10. Buber1, E., Diri1, B., Sahingoz2, O.: NLP based phishing attack detection from URLs, 1 Computer Engineering Department, YildizTechical University, Istanbul, Turkey, 2 Computer Engineering Department, Istanbul Kultur University, 34158 Istanbul, Turkey, Chapter, March 2018

    Google Scholar 

  11. Ahmad1, S.W., Ismail2, M., Sutoyo3, E., Kasim4, S., Mohamad5, M.: Comparative performance of machine learning methods for classification on phishing attack detection. Int. J. Adv. Trends Comput. Sci. Eng. 9(1.5) 2020

    Google Scholar 

  12. Kulkarni1, A., Brown, L.L.: Phishing websites detection using machine learning III2 Department of Computer Science, The University of Texas at Tyler Tyler, (IJACSA). Int. J. Adv. Comput. Sci. Appl. 10(7) (2019)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to N. Subhashini .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sreenidhi, A., Shruti, B., Divya, A., Subhashini, N. (2022). Detecting Phishing Websites Using Machine Learning. In: Mishra, D., Buyya, R., Mohapatra, P., Patnaik, S. (eds) Intelligent and Cloud Computing. Smart Innovation, Systems and Technologies, vol 286. Springer, Singapore. https://doi.org/10.1007/978-981-16-9873-6_32

Download citation

Publish with us

Policies and ethics