Skip to main content

Advertisement

Log in

LGBM: a machine learning approach for Ethereum fraud detection

International Journal of Information Technology Aims and scope Submit manuscript

Abstract

Ethereum is a software platform that uses the concept of blockchain and decentralizes data by distributing copies of smart contracts to thousands of individuals worldwide. Ethereum, as a currency, is utilized to exchange value worldwide in the absence of a third party to monitor or intervene. However, as online commerce grows, a slew of fraudulent activities, such as money laundering, bribery, and phishing, emerge as the primary threat to trade security. This paper proposes Light Gradient Boosting Machine (LGBM) approach for accurately detecting fraudulent transactions. It also examines different models such as Random Forest (RF), Multi-Layer Perceptron (MLP), etc., based on machine learning and soft computing algorithm for classifying Ethereum fraud detection dataset with limited attributes and compares their metrics with the LGBM approach. A comparative study of scores of bagging models is presented to know the applicability of the proposed approach. The light gradient boosting machine (LGBM) algorithms and Extreme Gradient Boosting (XGBoost) demonstrate the highest accuracies, while LGBM shows slightly better performance with 98.60% for the stated dataset scenarios. Further optimizing the LGBM with hyper-parameter tuning, an accuracy of 99.03% is achieved.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price includes VAT (France)

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15

References

  1. Arnold L, Brennecke M, Camus P, Fridgen G, Guggenberger T, Radszuwill S, Rieger A, Schweizer A, Urbach N (2019) Blockchain and initial coin offerings: blockchain’s implications for crowdfunding. In: Business transformation through Blockchain. Palgrave Macmillan, Cham, pp 233–272

  2. Li X, Whinston AB (2020) Analyzing cryptocurrencies. Inf Syst Front 22(1):17–22

    Article  Google Scholar 

  3. Liu J, Serletis A (2019) Volatility in the cryptocurrency market. Open Econ Rev 30(4):779–811

    Article  MATH  Google Scholar 

  4. Mock W, Corps N (2019) CSCI49379: Intro to Blockchain (Syllabus)

  5. Malik H, Manzoor A, Ylianttila M, Liyanage M (2019) Performance Analysis of Blockchain based Smart Grids with Ethereum and Hyperledger Implementations. In: 2019 IEEE International Conference on Advanced Networks and Telecommunications Systems (ANTS). IEEE, pp 1–5

  6. Bartoletti M, Carta S, Cimoli T, Saia RJFGCS (2020) Dissecting Ponzi schemes on Ethereum: identification, analysis, and impact. Futur Gener Comput Syst 102:259–277

    Article  Google Scholar 

  7. Chen W, Zheng Z, Ngai EC-H, Zheng P, Zhou YJIA (2019) Exploiting blockchain data to detect smart ponzi schemes on ethereum. IEEE Access 7:37575–37586

    Article  Google Scholar 

  8. Aziz R, Verma C, Srivastava N (2015) A weighted-SNR feature selection from independent component subspace for nb classification of microarray data. Int J Adv Biotec Res 6:245–255

    Google Scholar 

  9. Chen W, Zheng Z, Cui J, Ngai E, Zheng P, Zhou Y (2018) Detecting ponzi schemes on ethereum: Towards healthier blockchain technology. In: Proceedings of the 2018 World Wide Web Conference, pp 1409–1418

  10. Chow SS, Choo KKR, Han J (2021) Editorial for accountability and privacy issues in blockchain and cryptocurrency. Futur Gener Comput Syst 114:647–648

    Article  Google Scholar 

  11. Zhang Y, Wenqiang Y, Ziyu L, Salman R, Huaihu C (2021) Detecting Ethereum Ponzi Schemes Based on Improved LightGBM Algorithm. IEEE Transactions on Computational Social Systems

  12. Liu L, Tsai W-T, Bhuiyan MZA, Peng H, Liu MJFGCS (2021) Blockchain-enabled fraud discovery through abnormal smart contract detection on Ethereum. Futur Gener Comput Syst 6:133–137

    Google Scholar 

  13. Hu T, Liu X, Chen T, Zhang X, Huang X, Niu W, Lu J, Zhou K, Liu Y (2021) Transaction-based classification and detection approach for Ethereum smart contract. Inf Process Manag 58(2):102462

    Article  Google Scholar 

  14. Farrugia S, Ellul J, Azzopardi G (2020) Detection of illicit accounts over the Ethereum blockchain. Exp Syst Appl 150:113318

    Article  Google Scholar 

  15. Yuan Q, Huang B, Zhang J, Wu J, Zhang H, Zhang X (2020) Detecting Phishing Scams on Ethereum Based on Transaction Records. In: 2020 IEEE International Symposium on Circuits and Systems (ISCAS). IEEE, pp 1–5

  16. Singh A (2019) Anomaly detection in the Ethereum network. A thesis for the degree of Master of Technology/Indian Institute of Technology Kanpur

  17. Hilbe JM (2016) Practical guide to logistic regression. crc Press; 2016 Apr 5.

  18. Koçak B, Durmaz EŞ, Ateş E, Kılıçkesmez Ö (2019) Radiomics with artificial intelligence: a practical guide for beginners. Diagn Interv Radiol 25(6):485

    Article  Google Scholar 

  19. Sreejith S, Rahul S, Jisha R (2016) A real time patient monitoring system for heart disease prediction using random forest algorithm. In: Advances in signal processing and intelligent recognition systems: Springer, pp 485–500

  20. Aziz R, Verma C, Srivastava N (2016) A fuzzy based feature selection from independent component subspace for machine learning classification of microarray data. Genomics Data 8:4–15

    Article  Google Scholar 

  21. Musheer RA, Verma C, Srivastava NJSC (2019) Novel machine learning approach for classification of high-dimensional microarray data. Soft Comput 23(24):13409–13421

    Article  Google Scholar 

  22. Ahamed BS, Arya S (2021) LGBM classifier based technique for predicting Type-2 Diabetes. Eur J Mol Clin Med 8(3):454–467

    Google Scholar 

  23. Aziz R, Verma CK, Srivastava N (2018) Artificial neural network classification of high dimensional data with novel optimization approach of dimension reduction. Ann Data Sci 5(4):615–635

    Article  Google Scholar 

  24. Ahamed BS (2021) Prediction of Type-2 Diabetes using the LGBM classifier methods and techniques. Turk J Comput Math Educ (TURCOMAT) 12(12):223–231

    Google Scholar 

  25. Chen B, Fushan W, Chunxiang G (2021) Bitcoin Theft Detection Based on Supervised Machine Learning Algorithms. Security and Communication Networks 2021

  26. Poongodi M, Ashutosh S, Varadarajan V, Vaibhav B, Abhinav PS, Razi I, Rajiv K (2020) Prediction of the price of Ethereum blockchain cryptocurrency in an industrial finance system. Comput Electr Eng 81:106527

    Article  Google Scholar 

  27. Elbaghdadi A, Mezroui S, El Oualkadi A (2021) K-Nearest Neighbors Algorithm (KNN): An Approach to Detect Illicit Transaction in the Bitcoin Network. In: Integration Challenges for Analytics, Business Intelligence, and Data Mining: IGI Global, 2021, pp 161–178

  28. Fan S, Shuhui SF, Haoran X, Xiaochun C (2021) Al-SPSD: anti-leakage smart Ponzi schemes detection in blockchain. Inf Process Manag 58(4):102587

    Article  Google Scholar 

  29. Chen W, Cui J, Guo X, Chen Z, Lu Y (2021) Misbehavior Detection on Blockchain Data. In: Blockchain Intelligence: Springer, New York, 95–133

  30. Wang L, Cheng H, Zheng Z, Yang A, Zhu XJK-BS (2021) Ponzi scheme detection via oversampling-based Long Short-Term Memory for smart contracts. Knowl-Based Syst 228:107312

    Article  Google Scholar 

  31. Wang Y et al (2021) Survey of security supervision on blockchain from the perspective of technology. J Inf Secur Appl 60:102859

    Google Scholar 

  32. https://www.kaggle.com/bigquery/crypto-ethereum-classic

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rabia Musheer Aziz.

Ethics declarations

Conflict of interest

Authors declare no conflict of interest.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Aziz, R.M., Baluch, M.F., Patel, S. et al. LGBM: a machine learning approach for Ethereum fraud detection. Int. j. inf. tecnol. 14, 3321–3331 (2022). https://doi.org/10.1007/s41870-022-00864-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s41870-022-00864-6

Keywords

Navigation