Skip to main content
Log in

Illegal activity detection on bitcoin transaction using deep learning

  • Data analytics and machine learning
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

Forensic investigations increasingly leverage artificial intelligence (AI/ML) to identify illegal activities on bitcoin. bitcoin transactions have an original graph (network) structure, which is sophisticated and yet informative. However, machine learning applications on bitcoin have given limited attention to developing end-to-end deep learning frameworks that are modeled to exploit the bitcoin graph structure. To identify illegal transactions on bitcoin, the current paper extracts nineteen features from the bitcoin network and proposes a deep learning-based graph neural network model using spectral graph convolutions and transaction features. The proposed model is compared with two state-of-the-art techniques, viz., a graph attention network (GAT2) and an extreme gradient boosted decision tree (XGBOOST) trained on convoluted features for classification of illegal transactions on bitcoin. To understand the efficacy of the proposed model, a dataset is collected consisting of 13310125 transactions of 2059 entities having 3152202 bitcoin account addresses and belonging to 28 categories of users. Two sets of experiments are performed on the datasets: labeling transactions as legal or illegal (binary classification) and identifying the originator of the transaction to one of the twenty-eight types of entities (multi-class classification). For fast and accurate decisions, binary classification is appropriate, and for pinpointing the category of bitcoin users, a multi-class classifier is suitable. On both the tasks, the proposed models achieved a maximum of 92% accuracy, validating the methodology and suitability of the model for real-world deployment.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Data Availability

Enquiries about data availability should be directed to the authors.

Notes

  1. Bitcoin refers to the system, and bitcoin or BTC refers to the digital currency.

  2. https://github.com/blockchain-etl.

  3. https://github.com/blockchain-etl/bitcoin-etl.

  4. https://github.com/pranavn91/blockchain/blob/master/walletexplorer-api.

  5. https://github.com/pranavn91/blockchain.

References

  • Aiolli F, Conti M, Gangwal A, Polato M (2019) Mind your wallet’s privacy: Identifying bitcoin wallet apps and user’s actions through network traffic analysis. 04

  • Ajay K, Kumar A, Muhammad RG, Pranav N, Sunil B, Waleed A, Ananda KS, Pushpita C, Uttam G (2021) Securing logistics system and supply chain using blockchain. Appl Stochastic Models in Bus Ind 37(3):413–428

    Article  Google Scholar 

  • Akcora CG, Li Y, Yulia RG, Kantarcioglu M (2019) Bitcoinheist: topological data analysis for ransomware detection on the bitcoin blockchain

  • Alqassem I, Rahwan I, Svetinovic D (2018) The anti-social system properties: Bitcoin network data analysis. IEEE Transactions on Systems, Man, and Cybernetics: Systems

  • Anceaume E, Lajoie-Mazenc T, Ludinard R, Sericola B (2016) Safety analysis of bitcoin improvement proposals. In: 2016 IEEE 15th International Symposium on Network Computing and Applications (NCA), pages 318–325. IEEE

  • Bartoletti M, Pes B, Serusi S (2018) Data mining for detecting bitcoin ponzi schemes. In: 2018 Crypto Valley Conference on Blockchain Technology (CVCBT), pages 75–84

  • Bistarelli S, Mercanti I, Santini F (2018) A suite of tools for the forensic analysis of bitcoin transactions: Preliminary report. In: European Conference on Parallel Processing, pages 329–341. Springer

  • Bogner A (2017) Seeing is understanding: anomaly detection in blockchains with visualized features. In: Proceedings of the 2017 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2017 ACM International Symposium on Wearable Computers, pages 5–8

  • Conti M, Kumar G, Nerurkar P, Saha R, Vigneri L (2022) A survey on security challenges and solutions in the iota. J Netw Comput Appl, page 103383

  • Di Francesco Maesa D, Marino A, Ricci L (2018) The graph structure of bitcoin. In: International Conference on Complex Networks and their Applications, pages 547–558. Springer

  • Di Francesco D, Maesa AM, Ricci L (2018) Data-driven analysis of bitcoin properties: exploiting the users graph. Int J Data Sci Anal 6(1):63–80

    Article  Google Scholar 

  • Di Francesco D, Maesa AM, Ricci L (2019) The bow tie structure of the bitcoin users graph. Appl Netw Sci 4(1):56

    Article  Google Scholar 

  • Ermilov D, Panov M, Yanovich Y (2017) Automatic bitcoin address clustering. In: 2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA), pages 461–466. IEEE

  • Fang F, Ventre C, Basios M, Kanthan L, Martinez-Rego D, Fan W, Li L (2022) Cryptocurrency trading: a comprehensive survey. Financial Innovation 8(1):1–59

    Article  Google Scholar 

  • Gaihre A, Luo Y, Liu H (2018) Do bitcoin users really care about anonymity? an analysis of the bitcoin transaction graph. In: 2018 IEEE International Conference on Big Data (Big Data), pages 1198–1207. IEEE

  • Gang K, ChaoXiangrui PY, Alsaadi Fawaz E, Enrique H-V (2019) Machine learning methods for systemic risk analysis in financial sectors. Technol Econ Dev Econ 25(5):716–742

    Article  Google Scholar 

  • Greaves A, Au B (2015) Using the bitcoin transaction graph to predict the price of bitcoin. No Data

  • Hamilton WL, Ying R, Leskovec J (2017) Representation learning on graphs: methods and applications. arXiv preprint arXiv:1709.05584

  • Harlev MA, Sun YH, Langenheldt KC, Mukkamala R, Vatrapu R (2018) Breaking bad: De-anonymising entity types on the bitcoin blockchain using supervised machine learning. In: Proceedings of the 51st Hawaii International Conference on System Sciences

  • Herrera-Joancomartí J (2014) Research and challenges on bitcoin anonymity. In:Data Privacy Management, Autonomous Spontaneous Security, and Security Assurance, pages 3–16. Springer

  • Hua SYH, Klaus L, Mikkel H, Rao MR, Ravi V (2019) Regulating cryptocurrencies: a supervised machine learning approach to de-anonymizing the bitcoin blockchain. J Manag Inf Syst 36(1):37–73

    Article  Google Scholar 

  • Hu Y, Seneviratne S, Thilakarathna K, Fukuda K, Seneviratne A (2019) Characterizing and detecting money laundering activities on the bitcoin network. arXiv preprint arXiv:1912.12060

  • Irwin AS, Turner AB (2018) Illicit bitcoin transactions: challenges in getting to the who, what, when and where. J Money Laundering control

  • Janda A (2016) Walletexplorer. com: Smart bicoin block explorer

  • Jourdan M, Blandin S, Wynter L, Deshpande P (2018) Characterizing entities in the bitcoin blockchain. In: 2018 IEEE International Conference on Data Mining Workshops (ICDMW), pages 55–62. IEEE

  • Kanemura K, Toyoda K, Ohtsuki T (2019) Identification of darknet markets’ bitcoin addresses by voting per-address classification results. In: 2019 IEEE International Conference on Blockchain and Cryptocurrency (ICBC), pages 154–158. IEEE

  • Kou G, Peng Y, Wang G (2014) Evaluation of clustering algorithms for financial risk analysis using mcdm methods. Inf Sci 275:1–12

    Article  Google Scholar 

  • Kumar Ajay, Kumar A, Nerurkar P, Ghalib MR, Shankar A, Wen Z, Qi X (2020) Empirical analysis of bitcoin network (2016-2020). In: 2020 IEEE/CIC International Conference on Communications in China (ICCC Workshops), pages 96–101. IEEE

  • Kumar A, Abhishek K, Nerurkar P, Khosravi MR, Ghalib MR, Shankar A (2021) Big data analytics to identify illegal activities on bitcoin blockchain for iomt. Personal and Ubiquitous Comput, pages 1–12

  • Lee C, Maharjan S, Ko K, Hong JW (2020) Toward detecting illegal transactions on bitcoin using machine-learning methods. In: Zheng Zibin, Dai Hong-Ning, Tang Mingdong, Chen Xiangping (eds) Blockchain and Trustworthy Systems. Singapore. Springer Singapore, pp 520–533

    Chapter  Google Scholar 

  • Liang J, Li L, Luan S, Gan L, Zeng D (2019) Bitcoin exchange addresses identification and its application in online drug trading regulation

  • Li T, Kou G, Peng Y, Philip SY (2021) An integrated cluster detection, optimization, and interpretation approach for financial data. IEEE Transactions on Cybernetics

  • Liu F, Li Y, Li B, Li J, Xie H (2021) Bitcoin transaction strategy construction based on deep reinforcement learning. Appl Soft Comput 113:107–952

    Article  Google Scholar 

  • LLiu T, Ge J, Wu Y, Dai B, Li L, Yao Z, Wen J, Shi H, (2020) A new bitcoin address association method using a two-level learner model. In: Wen Sheng, Zomaya Albert, Yang Laurence T (eds) Algorithms and Architectures for Parallel Processing. Cham. Springer International Publishing, pp 349–364

  • Maesa DD, Marino A, Ricci L (2016) Uncovering the bitcoin blockchain: an analysis of the full users graph. In: 2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA), pages 537–546. IEEE

  • Mauro C, Sandeep KE, Chhagan L, Sushmita R (2018) A survey on security and privacy issues of bitcoin. IEEE Commun Surv Tutorials 20(4):3416–3452

    Article  Google Scholar 

  • Min X, Chen X, Kou G (2019) A systematic review of blockchain. Financial Innovation 5(1):1–14

    Google Scholar 

  • Monamo PM, Marivate V, Twala B (2016) A multifaceted approach to bitcoin fraud detection: Global and local outliers. In: 2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA), pages 188–194. IEEE

  • Nakamoto S (2019) Bitcoin: a peer-to-peer electronic cash system. Technical report, Manubot

    Google Scholar 

  • Nan L, Tao D (2018) Bitcoin mixing detection using deep autoencoder. In: 2018 IEEE Third International Conference on Data Science in Cyberspace (DSC), pages 280–287

  • Nerurkar P, Bhirud S, Patel D, Ludinard R, Busnel Y, Kumari S (2021) Supervised learning model for identifying illegal activities in bitcoin. Appl Intell 51:3824–3843

    Article  Google Scholar 

  • Nerurkar P, Busnel Y, Ludinard R, Shah K, Bhirud S, Patel D (2020) Detecting illicit entities in bitcoin using supervised learning of ensemble decision trees. In: Proceedings of the 2020 10th international conference on information communication and management, pages 25–30

  • Nerurkar P, Patel D, Busnel Y, Ludinard R, Kumari S, Khan MK (2021) Dissecting bitcoin blockchain: empirical analysis of bitcoin network (2009–2020). volume 177, page 102940. Elsevier

  • Paquet-Clouston M, Romiti M, Haslhofer B, Charvat T (2019) Spams meet cryptocurrencies: sextortion in the bitcoin ecosystem. In: Proceedings of the 1st ACM Conference on Advances in Financial Technologies, pages 76–88

  • Pham T, Lee S (2016) Anomaly detection in bitcoin network using unsupervised learning methods. arXiv preprint arXiv:1611.03941

  • Phetsouvanh S, Oggier F, Datta A (2018) Egret: extortion graph exploration techniques in the bitcoin network. In: 2018 IEEE International Conference on Data Mining Workshops (ICDMW), pages 244–251

  • Pinna A, Tonelli R, Orrú M, Marchesi M (2018) A petri nets model for blockchain analysis. Comput J 61(9):1374–1388

    Article  MathSciNet  Google Scholar 

  • Portnoff RS, Huang DY, Doerfler P, Afroz S, McCoy D (2017) Backpage and bitcoin: uncovering human traffickers. In: KDD ’17

  • Sean F, Karlsen Jonathan R, Putniņš Tālis J (2019) Sex, drugs, and bitcoin: how much illegal activity is financed through cryptocurrencies? Rev Financial Stud 32(5):1798–1853

    Article  Google Scholar 

  • Sebastião H, Godinho P (2021) Forecasting and trading cryptocurrencies with machine learning under changing market conditions. Financial Innovation 7(1):1–30

    Article  Google Scholar 

  • Shao W, Li H, Chen M, Jia C, Liu C, Wang Z (2018) Identifying bitcoin users using deep neural network. In: Jaideep Vaidya, Jin Li (eds) Algorithms and Architectures for Parallel Processing. Springer International Publishing, Cham, pp 178–192

    Chapter  Google Scholar 

  • Turner A, Irwin AS(2018) Bitcoin transactions: a digital discovery of illicit activity on the blockchain. J Financial Crime

  • Vasek M, Moore T (2015) There’s no free lunch, even using bitcoin: Tracking the popularity and profits of virtual currency scams. In Rainer Böhme and Tatsuaki Okamoto, editors, Financial Cryptography and Data Security, pages 44–61, Berlin, Heidelberg. Springer Berlin Heidelberg

  • Weber M, Domeniconi G, Chen J, Weidele DK, Bellei C, Robinson T, Leiserson CE (2019) Anti-money laundering in bitcoin: experimenting with graph convolutional networks for financial forensics. arXiv preprint arXiv:1908.02591

  • Wu Y, Luo A, Xu D (2019) Identifying suspicious addresses in bitcoin thefts. Digital Investigation 31(200895):12

    Google Scholar 

  • Wu Z, Pan S, Chen F, Long G, Zhang C, Yu PS (2019) A comprehensive survey on graph neural networks. arXiv preprint arXiv:1901.00596

  • Wu Y, Tao F, Liu L, Gu J, Panneerselvam J, Zhu R, Shahzad MN (2020) A bitcoin transaction network analytic method for future blockchain forensic investigation. IEEE Transactions on Network Science and Engineering, pages 1–1

  • Yang L, Dong X, Xing S, Zheng J, Gu X, Song X (2019) An abnormal transaction detection mechanim on bitcoin. In: 2019 International Conference on Networking and Network Applications (NaNA), pages 452–457. IEEE

  • Yin HS, Vatrapu R(2017) A first estimation of the proportion of cybercriminal entities in the bitcoin ecosystem using supervised machine learning. In: 2017 IEEE International Conference on Big Data (Big Data), pages 3690–3699. IEEE

  • Zarpelão BB, Miani RS, Rajarajan M (2019) Detection of bitcoin-based botnets using a one-class classifier. In: Olivier Blazy and Chan Yeob Yeun, editors, Information Security Theory and Practice, pages 174–189, Cham. Springer International Publishing

  • Zayuelas Muñoz J (2019) Detection of bitcoin miners from network measurements. B.S. thesis, Universitat Politècnica de Catalunya

  • Zhang Z, Zhou T, Xie Z (2018) Bitscope: Scaling bitcoin address de-anonymization using multi-resolution clustering

Download references

Funding

This work was supported in part by the Raman Charpak Fellowship of the Indo-French Centre for the Promotion of Advanced Research Grant no: IFC/4132/RCF 2019/716. The authors thank Prof. Dhiren Patel and Prof. Sunil Bhirud VJTI Mumbai, NMIMS University Mumbai and Prof. Yann Busnel and Prof. Romaric Ludinard IMT Atlantique, France for providing the lab resources.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pranav Nerurkar.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have influenced the work reported in this paper.

Ethical Standards

The authors declare that they have complied with ethical standards of the journal during their research.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Nerurkar, P. Illegal activity detection on bitcoin transaction using deep learning. Soft Comput 27, 5503–5520 (2023). https://doi.org/10.1007/s00500-022-07779-1

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-022-07779-1

Keywords

Navigation