Skip to main content
Log in

A teaching method using a self-organizing map for reinforcement learning

  • Original Article
  • Published:
Artificial Life and Robotics Aims and scope Submit manuscript

Abstract

We described a new preteaching method for re-inforcement learning using a self-organizing map (SOM). The purpose is to increase the learning rate using a small amount of teaching data generated by a human expert. In our proposed method, the SOM is used to generate the initial teaching data for the reinforcement learning agent from a small amount of teaching data. The reinforcement learning function of the agent is initialized by using the teaching data generated by the SOM in order to increase the probability of selecting the optimal actions it estimates. Because the agent can get high rewards from the start of reinforcement learning, it is expected that the learning rate will increase. The results of a mobile robot simulation showed that the learning rate had increased even though the human expert had showed only a small amount of teaching data.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Sutton RS, Barto A (1998) Reinforcement learning: an introduction. A Bradford Book, MIT Press, Cambridge

    Google Scholar 

  2. Kohonen T (1996) Self-organizing maps (in Japanese) Springer, Berlin Tokyo

    Google Scholar 

  3. Clouse JA, Utgoff PE (1992) A teaching method for reinforcement learning. Proceedings of the 9th International Conference on Machine Learning, p 92–101

  4. Lin L-J (1991) Programming robots using reinforcement learning and teaching. Proceedings of the Ninth National Conference on Artificial Intelligence, p 781–786

  5. Watkins CJCH, Dayan P (1992) Technical note: Q-learning. Mach Learn 8:55–68

    Google Scholar 

  6. Sutton RS (1996) Generalization in reinforcement learning: successful examples using space coarse coding. Adv Neural Inf Process Syst 8:1038–1044

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Takeshi Tateyama.

Additional information

This work was presented in part at the 7th International Symposium on Artificial Life and Robotics, Oita, Japan, January 16–18, 2002

About this article

Cite this article

Tateyama, T., Kawata, S. & Oguchi, T. A teaching method using a self-organizing map for reinforcement learning. Artificial Life and Robotics 7, 193–197 (2004). https://doi.org/10.1007/BF02471206

Download citation

  • Received:

  • Accepted:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02471206

Key words

Navigation