Skip to main content
Log in

Bitcoin Price Prediction: A Machine Learning Sample Dimension Approach

  • Published:
Computational Economics Aims and scope Submit manuscript

Abstract

The purpose of the paper is to predict Bitcoin prices using various machine learning techniques. Due to its high volatility attribute, accurate price prediction is the need of the hour for sound investment decision-making. At the offset, this study categorizes Bitcoin price by daily and high-frequency price (5-min interval price). For its daily and 5-min interval price prediction, a set of high-dimensional features and fundamental trading features are employed, respectively. Thereafter, we find that statistical methods like Logistic Regression predict daily price with 64.84% accuracy while complex machine learning algorithms like XGBoost predict 5-min interval price with an accuracy level of 59.4%. This work on Bitcoin price prediction recognizes the significance of sample dimensions in machine learning algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

References

  • Barro, R. J. (1979). Money and the price level under the gold standard. The Economic Journal, 89(353), 13–33.

    Article  Google Scholar 

  • Bukovina, J., & Martiček, M. (2016). Sentiment and bitcoin volatility. MENDELU Working Papers in Business and Economics, 58, 1–5.

  • Chen, Z., Li, C., & Sun, W. (2020). Bitcoin price prediction using machine learning: An approach to sample dimension engineering. Journal of Computational and Applied Mathematics, 365, 112395.

    Article  Google Scholar 

  • Ciaian, P., Rajcaniova, M., & Kancs, D. A. (2016). The economics of BitCoin price formation. Applied Economics, 48(19), 1799–1815.

    Article  Google Scholar 

  • Dyhrberg, A. H. (2016). Bitcoin, gold and the dollar—a GARCH volatility analysis. Finance Research Letters, 16, 85–92.

    Article  Google Scholar 

  • Gamberger, D., & Lavrač, N. (1997). Conditions for Occam’s razor applicability and noise elimination. In European conference on machine learning (pp. 108–123). Springer.

  • Georgoula, I., Pournarakis, D., Bilanakos, C., Sotiropoulos, D., & Giaglis, G. M. (2015). Using time-series and sentiment analysis to detect the determinants of bitcoin prices. Available at SSRN 2607167.

  • Greaves, A., & Au, B. (2015). Using the bitcoin transaction graph to predict the price of bitcoin. Retrieved April 21, 2022 from http://snap.stanford.edu/.

  • Hayes, A. (2015). What factors give cryptocurrencies their value: An empirical analysis. SSRN 2579445.

  • Kohavi, R., & John, G. H. (1997). Wrappers for feature subset selection. Artificial Intelligence, 97(1–2), 273–324.

    Article  Google Scholar 

  • Kristoufek, L. (2013). BitCoin meets Google Trends and Wikipedia: Quantifying the relationship between phenomena of the Internet era. Scientific Reports, 3, 3415.

    Article  Google Scholar 

  • Madan, I., Saluja, S., & Zhao, A. (2015). Automated bitcoin trading via machine learning algorithms. http://cs229.stanford.edu/proj2014/Isaac%20Madan20

  • Mai, F., Shan, Z., Bai, Q., Wang, X., & Chiang, R. H. (2018). How does social media impact Bitcoin value? A test of the silent majority hypothesis. Journal of Management Information Systems, 35(1), 19–52.

    Article  Google Scholar 

  • McNally, S., Roche, J., & Caton, S. (2018, March). Predicting the price of bitcoin using machine learning. In 2018 26th Euromicro international conference on parallel, distributed and network-based processing (PDP) (pp. 339–343). IEEE.

  • Rechenthin, M. D. (2014). Machine-learning classification techniques for the analysis and prediction of high-frequency stock direction. https://ir.uiowa.edu/cgi/viewcontent.cgi?article=5248&context=etd

  • Shah, D., & Zhang, K. (2014). Bayesian regression and Bitcoin. In 2014 52nd annual Allerton conference on communication, control, and computing (Allerton) (pp. 409–414). IEEE.

  • Wind, D. K. (2014). Concepts in predictive machine learning (pp. 1–129). Technical University of Denmark. http://www.davidwind.dk/wp-content/uploads/2014/07/main.pdf

  • Yermack, D. (2015). Is Bitcoin a real currency? An economic appraisal. In Handbook of digital currency (pp. 31–43). Academic Press.

  • Zahálka, J., & Železný, F. (2011). An experimental test of Occam’s razor in classification. Machine Learning, 82(3), 475–481.

    Article  Google Scholar 

Download references

Funding

The authors have not disclosed any funding.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Parthajit Kayal.

Ethics declarations

Conflict of interest

The authors have not disclosed any competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix 1: Machine Learning Models on Bitcoin Daily Data

See Tables 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, and 21.

Model: Logistic Regression

Accuracy Score: 0.6484018264840182

Table 6 Confusion matrix on Bitcoin daily price using Logistic Regression
Table 7 Classification report on Bitcoin daily price using Logistic Regression

Model: Linear Discriminant Analysis

Accuracy Score: 0.5981735159817352

Table 8 Confusion matrix on Bitcoin daily price using linear Discriminant Analysis
Table 9 Classification report on Bitcoin daily price using Linear Discriminant Analysis

Model: Random Forest

Accuracy Score: 0.5114155251141552

Table 10 Confusion matrix on Bitcoin daily price using Random Forest
Table 11 Classification report on Bitcoin daily price Using Random Forest

Model: XGBoost

Accuracy Score: 0.4748858447488584

Table 12 Confusion matrix on Bitcoin daily price using XGBoost

Model: Quadratic Discriminant Analysis

Accuracy Score: 0.4474885844748858

Table 13 Classification report on Bitcoin daily price using XGBoost
Table 14 Confusion matrix on Bitcoin daily price using Quadratic Discriminant Analysis
Table 15 Classification report on Bitcoin daily price using Quadratic Discriminant Analysis

Model: K-Nearest Neighbors

Accuracy Score: 0.4703196347031963

Table 16 Confusion matrix on Bitcoin daily price using K-Nearest Neighbors
Table 17 Classification report on Bitcoin daily price using K-Nearest Neighbors

Model: Decision Tree

Accuracy Score: 0.5616438356164384

Table 18 Confusion matrix on Bitcoin daily price using Decision Tree

Model: Support Vector Machine

Accuracy Score: 0.4611872146118721

Table 19 Classification report on Bitcoin daily price using Decision Tree
Table 20 Confusion matrix on Bitcoin daily price using Support Vector Machine
Table 21 Classification report on Bitcoin daily price using Support Vector Machine

Appendix 2: Machine Learning Models On Bitcoin 5-Minutes Interval Data

See Tables 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, and 37.

Model: Logistic Regression

Accuracy Score: 0.5939269171384457

Table 22 Confusion matrix on Bitcoin 5-min interval price using Logistic Regression
Table 23 Classification report on Bitcoin 5-min interval price using Logistic Regression

Model: Linear Discriminant Analysis

Accuracy Score: 0.593798250128667

Table 24 Confusion matrix on Bitcoin 5-min interval price using Linear Discriminant Analysis
Table 25 Classification report on Bitcoin 5-min interval price using Linear Discriminant Analysis

Model: Random Forest

Accuracy Score: 0.5432321152856407

Table 26 Confusion matrix on Bitcoin 5-min interval price using Random Forest
Table 27 Classification report on Bitcoin 5-min interval price using Random Forest

Model: XGBoost

Accuracy Score: 0.5941842511580031

Table 28 Confusion matrix on Bitcoin 5-min interval price using XGBoost
Table 29 Classification report on Bitcoin 5-min interval price using XGBoost

Model: Quadratic Discriminant Analysis

Accuracy Score: 0.5110653628409676

Table 30 Confusion matrix on Bitcoin 5-min interval price using Quadratic Discriminant Analysis
Table 31 Classification report on Bitcoin 5-min interval price using Quadratic Discriminant Analysis

Model: K-Nearest Neighbors

Accuracy Score: 0.5400154400411734

Table 32 Confusion matrix on Bitcoin 5-min interval price using K-Nearest Neighbors
Table 33 Classification report on Bitcoin 5-min interval price using K-Nearest Neighbors

Model: Decision Tree

Accuracy Score: 0.5317807514153371

Table 34 Confusion matrix on Bitcoin 5-min interval price using Decision Tree
Table 35 Classification report on Bitcoin 5-min interval price using Decision Tree

Model: Support Vector Machine

Accuracy Score: 0.5654915079773546

Table 36 Confusion matrix on Bitcoin 5-min interval price using Support Vector Machine
Table 37 Classification report on Bitcoin 5-min interval price using Support Vector Machine

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ranjan, S., Kayal, P. & Saraf, M. Bitcoin Price Prediction: A Machine Learning Sample Dimension Approach. Comput Econ 61, 1617–1636 (2023). https://doi.org/10.1007/s10614-022-10262-6

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10614-022-10262-6

Keywords

Navigation