Abstract
The purpose of the paper is to predict Bitcoin prices using various machine learning techniques. Due to its high volatility attribute, accurate price prediction is the need of the hour for sound investment decision-making. At the offset, this study categorizes Bitcoin price by daily and high-frequency price (5-min interval price). For its daily and 5-min interval price prediction, a set of high-dimensional features and fundamental trading features are employed, respectively. Thereafter, we find that statistical methods like Logistic Regression predict daily price with 64.84% accuracy while complex machine learning algorithms like XGBoost predict 5-min interval price with an accuracy level of 59.4%. This work on Bitcoin price prediction recognizes the significance of sample dimensions in machine learning algorithms.
Similar content being viewed by others
References
Barro, R. J. (1979). Money and the price level under the gold standard. The Economic Journal, 89(353), 13–33.
Bukovina, J., & Martiček, M. (2016). Sentiment and bitcoin volatility. MENDELU Working Papers in Business and Economics, 58, 1–5.
Chen, Z., Li, C., & Sun, W. (2020). Bitcoin price prediction using machine learning: An approach to sample dimension engineering. Journal of Computational and Applied Mathematics, 365, 112395.
Ciaian, P., Rajcaniova, M., & Kancs, D. A. (2016). The economics of BitCoin price formation. Applied Economics, 48(19), 1799–1815.
Dyhrberg, A. H. (2016). Bitcoin, gold and the dollar—a GARCH volatility analysis. Finance Research Letters, 16, 85–92.
Gamberger, D., & Lavrač, N. (1997). Conditions for Occam’s razor applicability and noise elimination. In European conference on machine learning (pp. 108–123). Springer.
Georgoula, I., Pournarakis, D., Bilanakos, C., Sotiropoulos, D., & Giaglis, G. M. (2015). Using time-series and sentiment analysis to detect the determinants of bitcoin prices. Available at SSRN 2607167.
Greaves, A., & Au, B. (2015). Using the bitcoin transaction graph to predict the price of bitcoin. Retrieved April 21, 2022 from http://snap.stanford.edu/.
Hayes, A. (2015). What factors give cryptocurrencies their value: An empirical analysis. SSRN 2579445.
Kohavi, R., & John, G. H. (1997). Wrappers for feature subset selection. Artificial Intelligence, 97(1–2), 273–324.
Kristoufek, L. (2013). BitCoin meets Google Trends and Wikipedia: Quantifying the relationship between phenomena of the Internet era. Scientific Reports, 3, 3415.
Madan, I., Saluja, S., & Zhao, A. (2015). Automated bitcoin trading via machine learning algorithms. http://cs229.stanford.edu/proj2014/Isaac%20Madan20
Mai, F., Shan, Z., Bai, Q., Wang, X., & Chiang, R. H. (2018). How does social media impact Bitcoin value? A test of the silent majority hypothesis. Journal of Management Information Systems, 35(1), 19–52.
McNally, S., Roche, J., & Caton, S. (2018, March). Predicting the price of bitcoin using machine learning. In 2018 26th Euromicro international conference on parallel, distributed and network-based processing (PDP) (pp. 339–343). IEEE.
Rechenthin, M. D. (2014). Machine-learning classification techniques for the analysis and prediction of high-frequency stock direction. https://ir.uiowa.edu/cgi/viewcontent.cgi?article=5248&context=etd
Shah, D., & Zhang, K. (2014). Bayesian regression and Bitcoin. In 2014 52nd annual Allerton conference on communication, control, and computing (Allerton) (pp. 409–414). IEEE.
Wind, D. K. (2014). Concepts in predictive machine learning (pp. 1–129). Technical University of Denmark. http://www.davidwind.dk/wp-content/uploads/2014/07/main.pdf
Yermack, D. (2015). Is Bitcoin a real currency? An economic appraisal. In Handbook of digital currency (pp. 31–43). Academic Press.
Zahálka, J., & Železný, F. (2011). An experimental test of Occam’s razor in classification. Machine Learning, 82(3), 475–481.
Funding
The authors have not disclosed any funding.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors have not disclosed any competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix 1: Machine Learning Models on Bitcoin Daily Data
See Tables 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, and 21.
Model: Logistic Regression
Accuracy Score: 0.6484018264840182
Model: Linear Discriminant Analysis
Accuracy Score: 0.5981735159817352
Model: Random Forest
Accuracy Score: 0.5114155251141552
Model: XGBoost
Accuracy Score: 0.4748858447488584
Model: Quadratic Discriminant Analysis
Accuracy Score: 0.4474885844748858
Model: K-Nearest Neighbors
Accuracy Score: 0.4703196347031963
Model: Decision Tree
Accuracy Score: 0.5616438356164384
Model: Support Vector Machine
Accuracy Score: 0.4611872146118721
Appendix 2: Machine Learning Models On Bitcoin 5-Minutes Interval Data
See Tables 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, and 37.
Model: Logistic Regression
Accuracy Score: 0.5939269171384457
Model: Linear Discriminant Analysis
Accuracy Score: 0.593798250128667
Model: Random Forest
Accuracy Score: 0.5432321152856407
Model: XGBoost
Accuracy Score: 0.5941842511580031
Model: Quadratic Discriminant Analysis
Accuracy Score: 0.5110653628409676
Model: K-Nearest Neighbors
Accuracy Score: 0.5400154400411734
Model: Decision Tree
Accuracy Score: 0.5317807514153371
Model: Support Vector Machine
Accuracy Score: 0.5654915079773546
Rights and permissions
About this article
Cite this article
Ranjan, S., Kayal, P. & Saraf, M. Bitcoin Price Prediction: A Machine Learning Sample Dimension Approach. Comput Econ 61, 1617–1636 (2023). https://doi.org/10.1007/s10614-022-10262-6
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10614-022-10262-6