Deep Reinforcement Learning pp 65-74
Q-Learning in Code
Coding the Off-Policy Q-Learning Agent and Behavior Policy
Chapter
Abstract
In this chapter, we put what we learned about Q-Learning in the previous chapter into code. We implement a Q-table-based, off-policy Q-Learning agent class. To complement it, we implement a second class for the behavior policy, which uses the epsilon-greedy algorithm.
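To give a feel for the two classes the abstract describes, here is a minimal sketch in Python. It is not the chapter's own code: the class names `EpsilonGreedyPolicy` and `QLearningAgent`, the method names, and the default values for `alpha`, `gamma`, and `epsilon` are all assumptions made for illustration. It assumes discrete, hashable states and a fixed number of integer-indexed actions, and it uses the standard tabular Q-Learning update.

```python
import random
from collections import defaultdict


class EpsilonGreedyPolicy:
    """Hypothetical behavior policy: explore with probability epsilon, else act greedily."""

    def __init__(self, n_actions, epsilon=0.1, seed=None):
        self.n_actions = n_actions
        self.epsilon = epsilon
        self.rng = random.Random(seed)

    def select_action(self, q_values):
        # Explore: pick a uniformly random action.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n_actions)
        # Exploit: pick an action with the highest Q-value, breaking ties at random.
        best = max(q_values)
        return self.rng.choice([a for a, q in enumerate(q_values) if q == best])


class QLearningAgent:
    """Hypothetical Q-table agent; rows are created lazily for unseen states."""

    def __init__(self, n_actions, alpha=0.5, gamma=0.9, policy=None):
        self.n_actions = n_actions
        self.alpha = alpha  # learning rate (assumed default)
        self.gamma = gamma  # discount factor (assumed default)
        self.policy = policy or EpsilonGreedyPolicy(n_actions)
        self.q_table = defaultdict(lambda: [0.0] * n_actions)

    def act(self, state):
        # The behavior policy decides which action is actually taken.
        return self.policy.select_action(self.q_table[state])

    def update(self, state, action, reward, next_state, done):
        # Off-policy target: bootstrap from the greedy (max) value of the next
        # state, regardless of which action the behavior policy takes next.
        if done:
            target = reward
        else:
            target = reward + self.gamma * max(self.q_table[next_state])
        td_error = target - self.q_table[state][action]
        self.q_table[state][action] += self.alpha * td_error
```

Keeping the behavior policy in its own class is what makes the off-policy split visible: `act` follows epsilon-greedy, while `update` always learns toward the greedy target. In a training loop, one would call `agent.act(state)`, step the environment with the returned action, and then call `agent.update(...)` with the observed transition.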
Copyright information
© Springer Nature Singapore Pte Ltd. 2019