Skip to main content

Detecting Phishing Websites Using Rule-Based Classification Algorithm: A Comparison

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 9))

Abstract

In today’s time, phishy website detection is one of the important challenges in the field of information security due to the large numbers of online transactions going through over the websites. Website phishing means stealing one’s personal information over the Internet such as system backup data, user login credentials, bank account details or other security information. Phishing means creation of phishy or fake websites which look like legitimate ones. In this research paper, we use the associative classification data mining approach that is also named as rule-based classification technique by which we can detect a phishy website and thereby identifying the better detection algorithm which has a higher accuracy detection rate. The algorithms used are Naïve Bayes and PART algorithms of associative classification data mining approach. Moreover, we classify the websites into a legitimate website or a phishy website from the collected datasets of websites. The implementation will be done on the datasets of 1,353 websites which contain phishy sites as well as legitimate sites. At the end, results will show us the higher accuracy detection rate algorithm, which will more correctly identify phishing or legitimate websites.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Abdelhamid N, Ayesh A, Thabtah F (2014) Phishing detection based associative classification data mining. Expert Syst Appl 41. Elsevier

    Google Scholar 

  2. Mahajan A, Ganpati A (2014) Performance evaluation of rule based classification algorithms. Int J Ad Res Comput Eng Technol (IJARCET)

    Google Scholar 

  3. Taalohi M, Langari N, Tabatabaee H (2015) Identifying phishing websites by techniques hyper heuristic and machine learning. ISSN Sci Int. Lahore

    Google Scholar 

  4. Datasets of phishing and legitimate websites from the sites as Phishtank, Millersmiles, and UCI machine learning repository site. https://archive.ics.uci.edu/ml/datasets.html

  5. Naïve Bayes algorithm. https://en.wikipedia.org/wiki/Naive_Bayes_classifier

  6. Wu X, Kumar V, Quinlan JR, Ghosh J, Yang Q, Motoda H, McLachlan GJ, Ng A, Liu B, Yu SY, Zhou ZH, Steinbach M, Hand DJ, Steinberg D (2007) Top 10 Algorithm in data mining. Springer Verlag London Limited published

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sudhanshu Gautam .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Gautam, S., Rani, K., Joshi, B. (2018). Detecting Phishing Websites Using Rule-Based Classification Algorithm: A Comparison. In: Mishra, D., Nayak, M., Joshi, A. (eds) Information and Communication Technology for Sustainable Development. Lecture Notes in Networks and Systems, vol 9. Springer, Singapore. https://doi.org/10.1007/978-981-10-3932-4_3

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-3932-4_3

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-3931-7

  • Online ISBN: 978-981-10-3932-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics