Abstract
Ethereum is a software platform that uses the concept of blockchain and decentralizes data by distributing copies of smart contracts to thousands of individuals worldwide. Ethereum, as a currency, is utilized to exchange value worldwide in the absence of a third party to monitor or intervene. However, as online commerce grows, a slew of fraudulent activities, such as money laundering, bribery, and phishing, emerge as the primary threat to trade security. This paper proposes Light Gradient Boosting Machine (LGBM) approach for accurately detecting fraudulent transactions. It also examines different models such as Random Forest (RF), Multi-Layer Perceptron (MLP), etc., based on machine learning and soft computing algorithm for classifying Ethereum fraud detection dataset with limited attributes and compares their metrics with the LGBM approach. A comparative study of scores of bagging models is presented to know the applicability of the proposed approach. The light gradient boosting machine (LGBM) algorithms and Extreme Gradient Boosting (XGBoost) demonstrate the highest accuracies, while LGBM shows slightly better performance with 98.60% for the stated dataset scenarios. Further optimizing the LGBM with hyper-parameter tuning, an accuracy of 99.03% is achieved.















References
Arnold L, Brennecke M, Camus P, Fridgen G, Guggenberger T, Radszuwill S, Rieger A, Schweizer A, Urbach N (2019) Blockchain and initial coin offerings: blockchain’s implications for crowdfunding. In: Business transformation through Blockchain. Palgrave Macmillan, Cham, pp 233–272
Li X, Whinston AB (2020) Analyzing cryptocurrencies. Inf Syst Front 22(1):17–22
Liu J, Serletis A (2019) Volatility in the cryptocurrency market. Open Econ Rev 30(4):779–811
Mock W, Corps N (2019) CSCI49379: Intro to Blockchain (Syllabus)
Malik H, Manzoor A, Ylianttila M, Liyanage M (2019) Performance Analysis of Blockchain based Smart Grids with Ethereum and Hyperledger Implementations. In: 2019 IEEE International Conference on Advanced Networks and Telecommunications Systems (ANTS). IEEE, pp 1–5
Bartoletti M, Carta S, Cimoli T, Saia RJFGCS (2020) Dissecting Ponzi schemes on Ethereum: identification, analysis, and impact. Futur Gener Comput Syst 102:259–277
Chen W, Zheng Z, Ngai EC-H, Zheng P, Zhou YJIA (2019) Exploiting blockchain data to detect smart ponzi schemes on ethereum. IEEE Access 7:37575–37586
Aziz R, Verma C, Srivastava N (2015) A weighted-SNR feature selection from independent component subspace for nb classification of microarray data. Int J Adv Biotec Res 6:245–255
Chen W, Zheng Z, Cui J, Ngai E, Zheng P, Zhou Y (2018) Detecting ponzi schemes on ethereum: Towards healthier blockchain technology. In: Proceedings of the 2018 World Wide Web Conference, pp 1409–1418
Chow SS, Choo KKR, Han J (2021) Editorial for accountability and privacy issues in blockchain and cryptocurrency. Futur Gener Comput Syst 114:647–648
Zhang Y, Wenqiang Y, Ziyu L, Salman R, Huaihu C (2021) Detecting Ethereum Ponzi Schemes Based on Improved LightGBM Algorithm. IEEE Transactions on Computational Social Systems
Liu L, Tsai W-T, Bhuiyan MZA, Peng H, Liu MJFGCS (2021) Blockchain-enabled fraud discovery through abnormal smart contract detection on Ethereum. Futur Gener Comput Syst 6:133–137
Hu T, Liu X, Chen T, Zhang X, Huang X, Niu W, Lu J, Zhou K, Liu Y (2021) Transaction-based classification and detection approach for Ethereum smart contract. Inf Process Manag 58(2):102462
Farrugia S, Ellul J, Azzopardi G (2020) Detection of illicit accounts over the Ethereum blockchain. Exp Syst Appl 150:113318
Yuan Q, Huang B, Zhang J, Wu J, Zhang H, Zhang X (2020) Detecting Phishing Scams on Ethereum Based on Transaction Records. In: 2020 IEEE International Symposium on Circuits and Systems (ISCAS). IEEE, pp 1–5
Singh A (2019) Anomaly detection in the Ethereum network. A thesis for the degree of Master of Technology/Indian Institute of Technology Kanpur
Hilbe JM (2016) Practical guide to logistic regression. crc Press; 2016 Apr 5.
Koçak B, Durmaz EŞ, Ateş E, Kılıçkesmez Ö (2019) Radiomics with artificial intelligence: a practical guide for beginners. Diagn Interv Radiol 25(6):485
Sreejith S, Rahul S, Jisha R (2016) A real time patient monitoring system for heart disease prediction using random forest algorithm. In: Advances in signal processing and intelligent recognition systems: Springer, pp 485–500
Aziz R, Verma C, Srivastava N (2016) A fuzzy based feature selection from independent component subspace for machine learning classification of microarray data. Genomics Data 8:4–15
Musheer RA, Verma C, Srivastava NJSC (2019) Novel machine learning approach for classification of high-dimensional microarray data. Soft Comput 23(24):13409–13421
Ahamed BS, Arya S (2021) LGBM classifier based technique for predicting Type-2 Diabetes. Eur J Mol Clin Med 8(3):454–467
Aziz R, Verma CK, Srivastava N (2018) Artificial neural network classification of high dimensional data with novel optimization approach of dimension reduction. Ann Data Sci 5(4):615–635
Ahamed BS (2021) Prediction of Type-2 Diabetes using the LGBM classifier methods and techniques. Turk J Comput Math Educ (TURCOMAT) 12(12):223–231
Chen B, Fushan W, Chunxiang G (2021) Bitcoin Theft Detection Based on Supervised Machine Learning Algorithms. Security and Communication Networks 2021
Poongodi M, Ashutosh S, Varadarajan V, Vaibhav B, Abhinav PS, Razi I, Rajiv K (2020) Prediction of the price of Ethereum blockchain cryptocurrency in an industrial finance system. Comput Electr Eng 81:106527
Elbaghdadi A, Mezroui S, El Oualkadi A (2021) K-Nearest Neighbors Algorithm (KNN): An Approach to Detect Illicit Transaction in the Bitcoin Network. In: Integration Challenges for Analytics, Business Intelligence, and Data Mining: IGI Global, 2021, pp 161–178
Fan S, Shuhui SF, Haoran X, Xiaochun C (2021) Al-SPSD: anti-leakage smart Ponzi schemes detection in blockchain. Inf Process Manag 58(4):102587
Chen W, Cui J, Guo X, Chen Z, Lu Y (2021) Misbehavior Detection on Blockchain Data. In: Blockchain Intelligence: Springer, New York, 95–133
Wang L, Cheng H, Zheng Z, Yang A, Zhu XJK-BS (2021) Ponzi scheme detection via oversampling-based Long Short-Term Memory for smart contracts. Knowl-Based Syst 228:107312
Wang Y et al (2021) Survey of security supervision on blockchain from the perspective of technology. J Inf Secur Appl 60:102859
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
Authors declare no conflict of interest.
Rights and permissions
About this article
Cite this article
Aziz, R.M., Baluch, M.F., Patel, S. et al. LGBM: a machine learning approach for Ethereum fraud detection. Int. j. inf. tecnol. 14, 3321–3331 (2022). https://doi.org/10.1007/s41870-022-00864-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41870-022-00864-6