Adaptive Optimal Control for Redundantly Actuated Arms

Mitrovic, Djordje; Klanke, Stefan; Vijayakumar, Sethu

doi:10.1007/978-3-540-69134-1_10

Adaptive Optimal Control for Redundantly Actuated Arms

Djordje Mitrovic¹,
Stefan Klanke¹ &
Sethu Vijayakumar¹

Conference paper

1144 Accesses
8 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5040))

Abstract

Optimal feedback control has been proposed as an attractive movement generation strategy in goal reaching tasks for anthropomorphic manipulator systems. Recent developments, such as the iterative Linear Quadratic Gaussian (iLQG) algorithm, have focused on the case of non-linear, but still analytically available, dynamics. For realistic control systems, however, the dynamics may often be unknown, difficult to estimate, or subject to frequent systematic changes. In this paper, we combine the iLQG framework with learning the forward dynamics for a simulated arm with two limbs and six antagonistic muscles, and we demonstrate how our approach can compensate for complex dynamic perturbations in an online fashion.

Download to read the full chapter text

Chapter PDF

References

Stengel, R.F.: Optimal control and estimation. Dover Publications, New York (1994)
MATH Google Scholar
Flash, T., Hogan, N.: The coordination of arm movements: an experimentally confirmed mathematical model. Journal of Neuroscience 5, 1688–1703 (1985)
Google Scholar
Todorov, E., Jordan, M.: A minimal intervention principle for coordinated movement. In: Advances in Neural Information Processing Systems, vol. 15, pp. 27–34. MIT Press, Cambridge (2003)
Google Scholar
Shadmehr, R., Wise, S.P.: The Computational Neurobiology of Reaching and Ponting. MIT Press, Cambridge (2005)
Google Scholar
Li, W.: Optimal Control for Biological Movement Systems. PhD dissertation, University of California, San Diego (2006)
Google Scholar
Scott, S.H.: Optimal feedback control and the neural basis of volitional motor control. Nature Reviews Neuroscience 5, 532–546 (2004)
Article Google Scholar
Dyer, P., McReynolds, S.: The Computational Theory of Optimal Control. Academic Press, New York (1970)
Google Scholar
Jacobson, D.H., Mayne, D.Q.: Differential Dynamic Programming. Elsevier, New York (1970)
MATH Google Scholar
Li, W., Todorov, E.: Iterative linear-quadratic regulator design for nonlinear biological movement systems. In: Proc. 1st Int. Conf. Informatics in Control, Automation and Robotics (2004)
Google Scholar
Todorov, E., Li, W.: A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems. In: Proc. of the American Control Conference (2005)
Google Scholar
Atkeson, C.G., Schaal, S.: Learning tasks from a single demonstration. In: Proc. Int. Conf. on Robotics and Automation (ICRA), Albuquerque, New Mexico, vol. 2, pp. 1706–1712 (1997)
Google Scholar
Abbeel, P., Quigley, M., Ng, A.Y.: Using inaccurate models in reinforcement learning. In: Proc. Int. Conf. on Machine Learning, pp. 1–8 (2006)
Google Scholar
Katayama, M., Kawato, M.: Virtual trajectory and stiffness ellipse during multijoint arm movement predicted by neural inverse model. Biol. Cybern. 69, 353–362 (1993)
MATH Google Scholar
Corke, P.I.: A robotics toolbox for MATLAB. IEEE Robotics and Automation Magazine 3(1), 24–32 (1996)
Article Google Scholar
Özkaya, N., Nordin, M.: Fundamentals of biomechanics: equilibrium, motion, and deformation. Van Nostrand Reinhold, New York (1991)
Google Scholar
Bertsekas, D.P.: Dynamic programming and optimal control. Athena Scientific, Belmont, Mass (1995)
MATH Google Scholar
Thrun, S.: Monte carlo POMDPs. In: Advances in Neural Information Processing Systems 12, pp. 1064–1070. MIT Press, Cambridge (2000)
Google Scholar
Atkeson, C.G.: Randomly sampling actions in dynamic programming. In: Proc. Int. Symp. on Approximate Dynamic Programming and Reinforcement Learning, pp. 185–192 (2007)
Google Scholar
Vijayakumar, S., D’Souza, A., Schaal, S.: Incremental online learning in high dimensions. Neural Computation 17, 2602–2634 (2005)
Article MathSciNet Google Scholar
Shadmehr, R., Mussa-Ivaldi, F.A.: Adaptive representation of dynamics during learning of a motor task. The Journal of Neurosciene 14(5), 3208–3224 (1994)
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Perception, Action & Behavior, University of Edinburgh, The King’s Buildings, Edinburgh, EH9 3JZ, United Kingdom
Djordje Mitrovic, Stefan Klanke & Sethu Vijayakumar

Authors

Djordje Mitrovic
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Klanke
View author publications
You can also search for this author in PubMed Google Scholar
Sethu Vijayakumar
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Minoru Asada John C. T. Hallam Jean-Arcady Meyer Jun Tani

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mitrovic, D., Klanke, S., Vijayakumar, S. (2008). Adaptive Optimal Control for Redundantly Actuated Arms. In: Asada, M., Hallam, J.C.T., Meyer, JA., Tani, J. (eds) From Animals to Animats 10. SAB 2008. Lecture Notes in Computer Science(), vol 5040. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69134-1_10

Download citation

DOI: https://doi.org/10.1007/978-3-540-69134-1_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69133-4
Online ISBN: 978-3-540-69134-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics