Skip to main content

TraderNet-CR: Cryptocurrency Trading with Deep Reinforcement Learning

  • Conference paper
  • First Online:
Artificial Intelligence Applications and Innovations (AIAI 2022)

Abstract

The predominant method of developing trading strategies is technical analysis on historical market data. Other financial analysts monitor the public activity towards cryptocurrencies, in order to forecast upcoming trends in the market. Until now, the best cryptocurrency trading models rely solely on one of the two methodologies and attempt to maximize their profits, while disregarding the trading risk. In this paper, we present a new machine learning approach, named TraderNet-CR, which is based on deep reinforcement learning. TraderNet-CR combines both methodologies in order to detect profitable round trips in the cryptocurrency market and maximize a trader’s profits. Additionally, we have added an extension method, named N-Consecutive Actions, which examines the model’s previous actions, before suggesting a new action. This method is complementary to the model’s training and can be fruitfully combined, in order to further decrease the trading risk. Our experiments show that our model can properly forecast profitable round trips, despite high market commission fees.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 119.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 159.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 159.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    A DRL agent utilizes a deep learning model in order to learn to behave optimally in its environment.

  2. 2.

    A round trip is a pair of two opposite orders placed one after the other (buy-sell or sell-buy), that aims to take advantage of price differences in order to produce profit.

  3. 3.

    The sparse reward problem happens when an environment rarely produces a reward. This usually slows down the training process of a DRL agent [15].

  4. 4.

    A signal is called bullish when the close price begins to rise. On the other hand, a signal is called bearish when the close price starts to drop.

  5. 5.

    https://anonymous.4open.science/r/Finance-AI-08C2.

  6. 6.

    OHLCV datasets consist of five columns: Open, High, Low, Close, Volume of a market at a specific time.

  7. 7.

    https://www.cryptodatadownload.com/data/.

References

  1. Low, R.K.Y., Tan, E.: The Role of Analyst Forecasts in the Momentum Effect. Wiley Trading (2006)

    Google Scholar 

  2. Baiynd, A.M.: The Trading Book: A Complete Solution to Mastering Technical Systems and Trading Psychology. McGraw-Hill (2011)

    Google Scholar 

  3. Brown, R.G.: Smoothing, Forecasting and Prediciton of Time Series. Dover Publications (1963)

    Google Scholar 

  4. Brown, R.G.: New Concepts in Technical Trading Systems. Trend Research (1978)

    Google Scholar 

  5. Brown, R.G.: Technical Analysis Power Tools for Active Investors. Financial Times Prentice Hall (2005)

    Google Scholar 

  6. Gerstein, M.: Evaluation of the chaikin power gauge stock rating system. Chaikin Analytics (2013)

    Google Scholar 

  7. Granville, J.E.: Granville’s New Key to Stock Market Profits. Papamoa Press (2018)

    Google Scholar 

  8. Jiang, Z., Xu, D., Liang, J.: A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem, pp. 1–31 (2017). http://arxiv.org/abs/1706.10059

  9. Livieris, I.E., Stavroyiannis, S., Iliadis, L., Pintelas, P.: Smoothing and stationarity enforcement framework for deep learning time-series forecasting. Neural Comput. Appl. 33(20), 14021–14035 (2021). https://doi.org/10.1007/s00521-021-06043-1

    Article  Google Scholar 

  10. Low, R.K.Y., Tan, E.: The role of analyst forecasts in the momentum effect. Int. Rev. Finan. Anal. 9 (2016)

    Google Scholar 

  11. Lucarelli, G., Borrotti, M.: A deep q-learning portfolio management framework for the cryptocurrency market. Neural Comput. Appl. 32(23), 17229–17244 (2020)

    Article  Google Scholar 

  12. Mudassir, M., Bennbaia, S. Unal, D.H.M.: Time-series forecasting of bitcoin prices using high-dimensional features: a machine learning approach. Neural Computing and Applications (2020)

    Google Scholar 

  13. Mulloy, P.: Technical Analysis of Stocks and Commodities 40(1) (1982)

    Google Scholar 

  14. Murphy, J.J.: Technical analysis of the financial markets: a comprehensive guide to trading methods and applications. Penguin (1999)

    Google Scholar 

  15. Noel, A.D., van Hoof, C., Millidge, B.: Online reinforcement learning with sparse rewards through an active inference capsule (2021)

    Google Scholar 

  16. Sattarov, O., Muminov, A., Lee, C.W., Kang, H.K., Oh, R., Ahn, J., Oh, H.J., Jeon, H.S.: Recommending cryptocurrency trading points with deep reinforcement learning approach. Appl. Sci. 10(4), 1506 (2020)

    Article  Google Scholar 

  17. Schulman, J., Wolski, F., Dhariwal, P., Radford, A., Klimov, O.: Proximal policy optimization algorithms (2017)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Vasilis Kochliaridis .

Editor information

Editors and Affiliations

Appendix: A

Appendix: A

Fig. 4.
figure 4

A typical PPO Agent architecture

Fig. 5.
figure 5

The TraderNet-CR actor-critic network architecture

Rights and permissions

Reprints and permissions

Copyright information

© 2022 IFIP International Federation for Information Processing

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Kochliaridis, V., Kouloumpris, E., Vlahavas, I. (2022). TraderNet-CR: Cryptocurrency Trading with Deep Reinforcement Learning. In: Maglogiannis, I., Iliadis, L., Macintyre, J., Cortez, P. (eds) Artificial Intelligence Applications and Innovations. AIAI 2022. IFIP Advances in Information and Communication Technology, vol 646. Springer, Cham. https://doi.org/10.1007/978-3-031-08333-4_25

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-08333-4_25

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-08332-7

  • Online ISBN: 978-3-031-08333-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics