Skip to main content
Log in

Comparison and analysis of logistic regression, Naïve Bayes and KNN machine learning algorithms for credit card fraud detection

  • Original Research
  • Published:
International Journal of Information Technology Aims and scope Submit manuscript

Abstract

Financial fraud is a threat which is increasing on a greater pace and has a very bad impact over the economy, collaborative institutions and administration. Credit card transactions are increasing faster because of the advancement in internet technology which leads to high dependence over internet. With the up-gradation of technology and increase in usage of credit cards, fraud rates become challenge for economy. With inclusion of new security features in credit card transactions the fraudsters are also developing new patterns or loopholes to chase the transactions. As a result of which behavior of frauds and normal transactions change constantly. Also the problem with the credit card data is that it is highly skewed which leads to inefficient prediction of fraudulent transactions. In order to achieve the better result, imbalanced or skewed data is pre-processed with the re-sampling (over-sampling or under sampling) technique for better results. The three different proportions of datasets were used in this study and random under-sampling technique was used for skewed dataset. This work uses the three machine learning algorithms namely: logistic regression, Naïve Bayes and K-nearest neighbour. The performance of these algorithms is recorded with their comparative analysis. The work is implemented in python and the performance of the algorithms is measured based on accuracy, sensitivity, specificity, precision, F-measure and area under curve. On the basis these measurements logistic regression based model for prediction of fraudulent was found to be a better in comparison to other prediction models developed from Naïve Bayes and K-nearest neighbour. Better results are also seen by applying under sampling techniques over the data before developing the prediction model.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  1. Kundu A, Panigrahi S, Sural S, Majumdar AK (2009) BLAST-SSAHA hybridization for credit card fraud detection. IEEE Trans Dependable Secure Comput 6(4):309–315

    Article  Google Scholar 

  2. Guo T, Li G-Y (2008) Neural data mining for credit card fraud detection. In: International conference on machine learning and cybernetics

  3. Ghobadi F, Rohani M (2016) Cost sensitive modeling of credit card fraud using neural network strategy. In: International conference of signal processing and intelligent systems (ICSPIS)

  4. Mishra A, Ghorpade C (2018) Credit card fraud detection on the skewed data using various classification and ensemble techniques. In: 2018 IEEE International students’ conference on electrical, electronics and computer science, SCEECS 2018

  5. Raj SE, Portia AA (2011) Analysis on credit card fraud detection methods. In: International conference on computer, communication and electrical technology

  6. Wang H, Zhu P, Zou X, Qin S (2018) An ensemble learning framework for credit card fraud detection based on training set partitioning and clustering. In: 2018 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computing, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI), Guangzhou, pp 94–98. https://doi.org/10.1109/SmartWorld.2018.00051

  7. Malini N, Pushpa M (2017) Analysis on credit card fraud identification techniques based on KNN and outlier detection. In: Proceedings of the 3rd IEEE international conference on advances in electrical and electronics, information, communication and bio-informatics, AEEICB 2017, pp 255–258

  8. Padvekar SA, Kangane PM, Jadhav KV (2016) Credit card fraud detection system. Int J Eng Comput Sci 5(4):16183–16186

    Google Scholar 

  9. Khare N, Sait SY (2018) Credit card fraud detection using machine learning models and collating machine learning models. Int J Pure Appl Math 118(20):825–838

    Google Scholar 

  10. Banerjee R, Bourla G, Chen S, Kashyap M, Purohit S, Battipaglia J (2018) Comparative analysis of machine learning algorithms through credit card fraud detection. New Jersey’s Governor’s School of Engineering and Technology, Piscataway, pp 1–10

    Google Scholar 

  11. Xuan S, Liu G, Li Z, Zheng L, Wang S, Jiang C (2018) Random forest for credit card fraud detection. In: ICNSC 2018—15th IEEE International conference on networking, sensing and control, pp 1–6

  12. Hordri NF, Yuhaniz SS, Firdaus N, Azmi M, Shamsuddin SM (2018) Handling class imbalance in credit card fraud using resampling methods. Int J Adv Comput Sci Appl 9(11):390–396

    Google Scholar 

  13. Awoyemi JO, Adetunmbi AO, Oluwadare SA (2017) Credit card fraud detection using machine learning techniques: a comparative analysis. In: Proceedings of the IEEE international conference on computing, networking and informatics, ICCNI 2017, vol 2017–Jan, pp 1–9

  14. Hordri NF, Yuhaniz SS, Azmi NFM, Shamsuddin SM (2018) Handling class imbalance in credit card fraud using resampling methods. Int J Adv Comput Sci Appl 9(11):390–396

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Satwinder Singh.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Itoo, F., Meenakshi & Singh, S. Comparison and analysis of logistic regression, Naïve Bayes and KNN machine learning algorithms for credit card fraud detection. Int. j. inf. tecnol. 13, 1503–1511 (2021). https://doi.org/10.1007/s41870-020-00430-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s41870-020-00430-y

Keywords

Navigation