Lifelong Reinforcement Learning

  • Chapter
Lifelong Machine Learning

Abstract

This chapter discusses lifelong reinforcement learning. Reinforcement learning (RL) is the problem in which an agent learns how to act through trial-and-error interactions with a dynamic environment [Kaelbling et al., 1996, Sutton and Barto, 1998]. In each interaction step, the agent receives an observation of the current state of the environment and chooses an action from a set of possible actions. The action changes the state of the environment, and the agent then receives the value of this state transition, which can be a reward or a penalty. This process repeats as the agent learns a trajectory of actions that optimizes its objective, e.g., maximizing the long-run sum of rewards. The goal of RL is to learn an optimal policy that maps states to actions (possibly stochastically). Research in RL has surged recently, driven by its successful use in the computer program AlphaGo [Silver et al., 2016], which won 4–1 against Lee Sedol, one of the legendary professional Go players, in March 2016. More recently, AlphaGo Zero [Silver et al., 2017] was designed to learn to master the game of Go from scratch, without human knowledge, and it has achieved superhuman performance.
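
The interaction loop described above (observe a state, choose an action, receive a reward, repeat) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the chapter: the five-state chain environment, the reward of 1 for reaching the rightmost state, the choice of tabular Q-learning with epsilon-greedy exploration, and all names and hyperparameters (`step`, `choose_action`, `train`, `greedy_policy`, `alpha`, `gamma`, `epsilon`).

```python
import random

N_STATES = 5      # hypothetical toy chain: states 0..4, state 4 is terminal
ACTIONS = (0, 1)  # 0 = move left, 1 = move right


def step(state, action):
    """Environment: apply an action and return (next_state, reward, done)."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0  # reward only on reaching the goal state
    return next_state, reward, done


def choose_action(q, state, epsilon, rng):
    """Epsilon-greedy choice; ties between equal Q-values are broken at random."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    best = max(q[state])
    return rng.choice([a for a in ACTIONS if q[state][a] == best])


def train(episodes=300, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0, max_steps=200):
    """Tabular Q-learning over repeated episodes that all start in state 0."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        for _ in range(max_steps):
            action = choose_action(q, state, epsilon, rng)
            next_state, reward, done = step(state, action)
            # Target: immediate reward plus discounted value of the next state.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


def greedy_policy(q):
    """Map each non-terminal state to the action with the highest Q-value."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]
```

Calling `greedy_policy(train())` yields a deterministic policy, i.e., a mapping from each non-terminal state to an action; in this toy chain the learned policy moves right in every state, which maximizes the discounted sum of rewards.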


Copyright information

© 2018 Springer Nature Switzerland AG

About this chapter

Cite this chapter

Chen, Z., Liu, B. (2018). Lifelong Reinforcement Learning. In: Lifelong Machine Learning. Synthesis Lectures on Artificial Intelligence and Machine Learning. Springer, Cham. https://doi.org/10.1007/978-3-031-01581-6_9
