Skip to main content
Log in

Adaptive and economic data representation in control architectures of autonomous real-world robots

  • Original Article
  • Published:
Artificial Life and Robotics Aims and scope Submit manuscript

Abstract

Learning algorithms for autonomous robots in complex, real-world environments usually have to deal with many degrees of freedom and continuous state spaces. Reinforcement learning is a promising concept for unsupervised learning, but common algorithms suffer from huge storage and calculation requirements if they are used to construct an internal model by estimating a value-function for every action in every possible state. In our attempt to approximate this function at the lowest cost, we introduce a flexible method that focuses on the states of greatest interest, and interpolates between them with a fast and easy-to-implement algorithm. In order to provide the highest accuracy to any predefined limit, we enhanced this algorithm by a fast converging multilayer error approximator.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Sutton RS, Barto AG (1998) Reinforcement learning: an introduction. MIT Press, Cambridge, MA, London, UK.

    Google Scholar 

  2. Fischer J (1999) Strategiebildung mit neuronalen Netzen (in German). Diploma thesis, Department of Applied Physics, WWU-Muenster

  3. Breithaupt R (1999) Adaptive und kooperative Automaten in statischen und dynamischen Umgebungen (in German). Diploma thesis, department of Applied Physics, WWU-Muenster

  4. Sutton RS, Santamaria CJ, Ram A (1996) Experiments with reinforcement learning problems with continuous state and action spaces. University of Massachusetts Amherst (UM-CS-1996-088)

    Google Scholar 

  5. Kroese BJA, van Dam JW (1992) Adaptive space quantisation for reinforcement learning of collision-free navigation. Faculty of Mathematics and Computer Science University of Amsterdam

  6. Samejima K, Omori T (1999) Adaptive internal state-space construction method for reinforcement learning of a real-world agent. Neural Networks 12:1143–1156

    Article  Google Scholar 

  7. Okabe A, Boots B, Sugihara K, et al. (2000) Concepts and applications of Voronoi diagrams, 2nd edn. Wiley, Chichester

    MATH  Google Scholar 

  8. Kohonen T (1989) Self-organization and associative memory. Springer series in information sciences, 3rd edn. Springer, Berlin

    Google Scholar 

  9. Herz J, Krogh A, Palmer RG (1991) Introduction to the theory of neural computation. Addison-Wesley, Reading

    Google Scholar 

  10. Hertzberg J, Kirchner F (1997) A prototype study of an autonomous robot platform for sewerage system maintainance. Auton Robots J 4:319–331

    Article  Google Scholar 

  11. Tesauro G (1992) Temporal difference learning of backgammon strategy. In: Sleeman D, Edwards P (eds) Machine learning. Morgan Kaufmann, San Mateo, p 451–457

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Joern Fischer.

About this article

Cite this article

Fischer, J., Breithaupt, R. & Bode, M. Adaptive and economic data representation in control architectures of autonomous real-world robots. Artif Life Robotics 6, 200–204 (2002). https://doi.org/10.1007/BF02481268

Download citation

  • Received:

  • Accepted:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02481268

Keywords

Navigation