Evolving Systems

, Volume 5, Issue 1, pp 21–32 | Cite as

Fault-tolerant gait learning and morphology optimization of a polymorphic walking robot

  • David Johan Christensen
  • Jørgen Christian Larsen
  • Kasper Stoy
Original Paper

Abstract

This paper presents experiments with a morphology-independent, life-long strategy for online learning of locomotion gaits. The experimental platform is a quadruped robot assembled from the LocoKit modular robotic construction kit. The learning strategy applies a stochastic optimization algorithm to optimize eight open parameters of a central pattern generator based gait implementation. We observe that the strategy converges in roughly ten minutes to gaits of similar or higher velocity than a manually designed gait and that the strategy readapts in the event of failed actuators. We also optimize offline the reachable space of a foot based on a reference design but finds that the reality gap hardens the successfully transference to the physical robot. To address this limitation, in future work we plan to study co-learning of morphological and control parameters directly on physical robots.

Keywords

Online learning  Locomotion  Modular robots  Reconfigurable robots  Fault-tolerance  Central pattern generators  Morphology optimization 

Notes

Acknowledgments

This work was performed as part of the “Locomorph” project funded by the EU’s Seventh Framework Programme (Future Emerging Technologies, Embodied Intelligence) and as part of the “Assemble and Animate” project funded by the Danish Council for Independent Research.

References

  1. Bongard J (2002) Evolving modular genetic regulatory networks. In: Proceedings of the congress on evolutionary computation (CEC), IEEE Computer Society, Washington, DC, CEC ’02, pp 1872–1877Google Scholar
  2. Bongard J, Zykov V, Lipson H (2006) Resilient machines through continuous self-modeling. Science 314(5802):1118–1121CrossRefGoogle Scholar
  3. Braitenberg V (1986) Vehicles: Experiments in synthetic psychology. MIT press, CambridgeGoogle Scholar
  4. Brooks RA (1992) Artificial life and real robots. In: Toward a practice of autonomous systems: proceedings of the first european conference on artificial life. Cambridge, pp 3–10Google Scholar
  5. Christensen DJ, Bordignon M, Schultz UP, Shaikh D, Stoy K (2008a) Morphology independent learning in modular robots. In: Proceedings of international symposium on distributed autonomous robotic systems 8 (DARS 2008), pp 379–391Google Scholar
  6. Christensen DJ, Schultz UP, Brandt D, Stoy K (2008b) A unified simulator for self-reconfigurable robots. In: Proceedings of IEEE/RSJ international conference on intelligent robots and systemsGoogle Scholar
  7. Christensen DJ, Schultz UP, Stoy K (2010a) A distributed strategy for gait adaptation in modular robots. In: Proceedings of the IEEE international conference on robotics and automation (ICRA), pp 2765–2770Google Scholar
  8. Christensen DJ, Sproewitz A, Ijspeert AJ (2010b) Distributed online learning of central pattern generators in modular robots. In: Proceedings of the 11th international conference on simulation of adaptive behavior (SAB2010). Paris, pp 402–412Google Scholar
  9. Christensen DJ, Schultz UP, Moghadam M (2011) The assemble and animate control framework for modular reconfigurable robots. In: Proceedings of the IROS workshop on reconfigurable modular robotics: challenges of mechatronic and Bio-Chemo-Hybrid SystemsGoogle Scholar
  10. Christensen DJ, Larsen JC, Stoy K (2012) Adaptive strategy for online gait learning evaluated on the polymorphic robotic locokit. In: Proceedings of the IEEE conference on evolving and adaptive intelligent systems (EAIS)Google Scholar
  11. Crespi A, Ijspeert AJ (2006) AmphiBot II: An Amphibious Snake Robot that Crawls and Swims using a Central Pattern Generator. In: Proceedings of the 9th international conference on climbing and walking robots (CLAWAR 2006), pp 19–27Google Scholar
  12. Duff D, Yim M, Roufas K (2001) Evolution of polybot: A modular reconfigurable robot. In: Proceedings of harmonic drive international symposium, NaganoGoogle Scholar
  13. Fitch R, Rus D, Vona M (2000) A basis for self-repair using crystalline modules. In: Proceedings, intelligent autonomous systems (IAS-6), VeniceGoogle Scholar
  14. Fukuda T, Nakagawa S (1988) Dynamically reconfigurable robotic system. In: Proceedings of the IEEE international conference on robotics & automation (ICRA’88), pp 1581–1586Google Scholar
  15. Groß R, Bonani M, Mondada F, Dorigo M (2006) Autonomous self-assembly in a swarm-bot. In: Proceedings of the 3rd international symposium on autonomous minirobots for research and edutainment (AMiRE 2005), Springer, Berlin, pp 314–322Google Scholar
  16. Ijspeert AJ (2008) Central pattern generators for locomotion control in animals and robots: a review. Neural Netw 21(4):642–653CrossRefGoogle Scholar
  17. Jakobi N (1998) Running across the reality gap: Octopod locomotion evolved in a minimal simulation. In: Evolutionary Robotics, Springer, Berlin, pp 39–58Google Scholar
  18. Jansen T (2007) Theo Jansen: The Great Pretender. OIO Publishers, New ZealandGoogle Scholar
  19. Kamimura A, Kurokawa H, Yoshida E, Murata S, Tomita K, Kokaji S (2005) Automatic locomotion design and experiments for a modular robotic system. IEEE/ASME Trans Mech 10(3):314–325CrossRefGoogle Scholar
  20. Kennedy J, Eberhart R (1995) Particle swarm optimization. In: Proceedings. IEEE international conference on neural networks, 1995. IEEE 4:1942–1948Google Scholar
  21. van den Kieboom J (2009) Biped locomotion and stability a practical approach. Master’s thesis, University of Groningen, The NetherlandsGoogle Scholar
  22. Kimura H, Akiyama S, Sakurama K (1999) Realization of dynamic walking and running of the quadruped using neural oscillator. Auton Robot 7(3):247–258CrossRefGoogle Scholar
  23. Kohl N, Stone P (2004) Machine learning for fast quadrupedal locomotion. In: The Nineteenth National Conference on Artificial Intelligence, pp 611–616Google Scholar
  24. Lappe M, Frenz H (2009) Visual estimation of travel distance during walking. Exp Brain Res 199(3):369–375CrossRefGoogle Scholar
  25. Larsen J, Garcia R, Stoy K (2010) Increased versatility of modular robots through layered heterogeneity. In: Proceedings of the ICRA Workshop on Modular Robots, State of the Art, Anchorage, Alaska, pp 24–29Google Scholar
  26. Larsen JC, Brandt D, Stoy K (2012) Locokit: A robot construction kit for studying and developing functional morphologies. In: Proceedings of 12th international conference on simulation of adaptive behavior (SAB 2012), Odense, Denmark, Lecture Notes in Computer Science, vol 7426, pp 12–22Google Scholar
  27. Lipson H, Pollack JB (2000) Automatic design and manufacture of robotic lifeforms. Nature 406:974–978CrossRefGoogle Scholar
  28. Maes P, Brooks RA (1990) Learning to coordinate behaviors. In: National conference on artificial intelligence, pp 796–802Google Scholar
  29. Mahdavi SH, Bentley PJ (2003) An evolutionary approach to damage recovery of robot motion with muscles. In: Seventh european conference on artificial life (ECAL03). Springer, Berlin, pp 248–255Google Scholar
  30. Marbach D, Ijspeert AJ (2004) Co-evolution of configuration and control for homogenous modular robots. In: Proc., 8th international conference on intelligent autonomous systems, Amsterdam, pp 712–719Google Scholar
  31. Marbach D, Ijspeert AJ (2005) Online Optimization of Modular Robot Locomotion. In: Proceedings of the IEEE international conference on mechatronics and automation (ICMA 2005), pp 248–253Google Scholar
  32. Mataric M, Cliff D (1996) Challenges in evolving controllers for physical robots. Robot Auton Sys 19(1):67 – 83CrossRefGoogle Scholar
  33. Morimoto J, Doya K (2001) Acquisition of stand-up behavior by a real robot using hierarchical reinforcement learning. Robot Auton Sys 36(1):37–51CrossRefMATHGoogle Scholar
  34. Murata S, Kurokawa H (2012) Self-organizing robots. Springer, BerlinGoogle Scholar
  35. Pfeifer R, Lungarella M, Iida F (2007) Self-organization, embodiment, and biologically inspired robotics. Science 318(5853):1088–1093CrossRefGoogle Scholar
  36. Pouya S, van den Kieboom J, Spröwitz A, Ijspeert AJ (2010) Automatic gait generation in modular robots: "to oscillate or to rotate; that is the question". In: Proceedings of the IEEE/RSJ international conference on intelligent robots and systems, Taipei, pp 514–520Google Scholar
  37. Righetti L, Ijspeert A (2006) Programmable central pattern generators: an application to biped locomotion control. In: Proceedings 2006 IEEE international conference on robotics and automation (ICRA), pp 1585–1590. doi: 10.1109/ROBOT.2006.1641933
  38. Silva MF, Machado JT (2012) A literature review on the optimization of legged robots. J Vib Contr 18(12):1753–1767CrossRefGoogle Scholar
  39. Sims K (1994) Evolving 3d morphology and behavior by competition. In: Brooks R, Maes P (eds) Proceedings of the artificial life IV. MIT Press, Cambridge, pp 28–39Google Scholar
  40. Spall JC (1992) Multivariate stochastic approximation using a simultaneous perturbation gradient approximation. IEEE Trans Automat Contr 37(3):332–341CrossRefMATHMathSciNetGoogle Scholar
  41. Sproewitz A, Moeckel R, Maye J, Ijspeert AJ (2008) Learning to move in modular robots using central pattern generators and online optimization. Int J Rob Res 27(3-4):423–443CrossRefGoogle Scholar
  42. Stoy K, Nagpal R (2004) Self-repair through scale independent self-reconfiguration. In: Proceedings of IEEE/RSJ international conference on robots and systems (IROS). Sendai, pp 2062–2067Google Scholar
  43. Stoy K, Lyder A, Garcia R, Christensen DJ (2007) Hierarchical robots. In: Proceedings of the of the IROS workshop on self-reconfigurable modular robot, San DiegoGoogle Scholar
  44. Stoy K, Brandt D, Christensen DJ (2010) Self-Reconfigurable Robots: An Introduction. Intelligent Robotics and Autonomous Agents series. MIT Press, CambridgeGoogle Scholar
  45. Taga G, Yamaguchi Y, Shimizu H (1991) Self-organized control of bipedal locomotion by neural oscillators in unpredictable environment. Biol Cybern 65:147–159. doi: 10.1007/BF00198086 Google Scholar
  46. Warren W, Kay B, Zosh W, Duchon A, Sahuc S (2001) Optic flow is used to control human walking. Nat Neurosci 4(2):213Google Scholar
  47. Wolpert DH, Macready WG (1997) No free lunch theorems for optimization. IEEE Trans Evolut Comput 1(1):67–82CrossRefGoogle Scholar
  48. Yim M, Shen WM, Salemi B, Rus D, Moll M, Lipson H, Klavins E (2007a) Modular self-reconfigurable robot systems: Challenges and opportunities for the future. IEEE Robot Automat Mag 14(1):43–52CrossRefGoogle Scholar
  49. Yim M, Shirmohammadi B, Sastra J, Park M, Dugan M, Taylor CJ (2007b) Towards robotic self-reassembly after explosion. In: Video Proceedings of the IEEE/RSJ international conference on intelligent robots and systems (IROS), San DiegoGoogle Scholar
  50. Yoshida E, Murata S, Tomita K, Kurokawa H, Kokaji S (1999) An experimental study on a self-repairing modular machine. Robot Auton Sys 29:79–89CrossRefGoogle Scholar
  51. Zahadat P, Christensen DJ, Schultz UP, Katebi SD, Stoy K (2010) Fractal gene regulatory networks for robust locomotion control of modular robots. In: Proceedings of the 11th international conference on simulation of adaptive behavior (SAB2010). ParisGoogle Scholar
  52. Zahadat P, Schmickl T, Crailsheim K (2012) Evolving reactive controller for a modular robot: Benefits of the property of state-switching in fractal gene regulatory networks. From Animals to Animats, vol 12, pp 209–218Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • David Johan Christensen
    • 1
  • Jørgen Christian Larsen
    • 2
  • Kasper Stoy
    • 2
  1. 1.Department of Electrical EngineeringTechnical University of Denmark Kgs. LyngbyDenmark
  2. 2.The Maersk Mc-Kinney Moller InstituteUniversity of Southern DenmarkOdenseDenmark

Personalised recommendations