Robust trajectory optimization under frictional contact with iterative learning
- 343 Downloads
Optimization is often difficult to apply to robots due to the presence of errors in model parameters, which can cause constraints to be violated during execution on the robot. This paper presents a method to optimize trajectories with large modeling errors using a combination of robust optimization and parameter learning. In particular it considers the context of contact modeling, which is highly susceptible to errors due to uncertain friction estimates, contact point estimates, and sensitivity to noise in actuator effort. A robust time-scaling method is presented that computes a dynamically-feasible, minimum-cost trajectory along a fixed path under frictional contact. The robust optimization model accepts confidence intervals on uncertain parameters, and uses a convex parameterization that computes dynamically-feasible motions in seconds. Optimization is combined with an iterative learning method that uses feedback from execution to learn confidence bounds on modeling parameters. It is applicable to general problems with multiple uncertain parameters that satisfy a monotonicity condition that requires parameters to have conservative and optimistic settings. The method is applied to manipulator performing a “waiter” task, on which an object is moved on a carried tray as quickly as possible, and to a simulated humanoid locomotion task. Experiments demonstrate this method can compensate for large modeling errors within a handful of iterations.
KeywordsRobotics Trajectory optimization Robust optimization Model uncertainty Contact modeling Manipulation Humanoid robots
This work is partially supported under NSF Grants IIS # 1218534 and CAREER # 3332066.
- Bertsimas, D., & Thiele, A. (2006). Robust and data-driven optimization: Modern decision-making under uncertainty. In INFORMS tutorials in operations research: Models, methods, and applications for innovative decision making (pp. 1–39).Google Scholar
- Cobb, G. W., Witmer, J. A., & Cryer, J. D. (1997). An electronic companion to statistics. New York: Cogito Learning Media.Google Scholar
- Dahl, O., & Nielsen, L. (1989). Torque limited path following by on-line trajectory time scaling. In IEEE international conference on robotics and automation (ICRA) (Vol. 2, pp. 1122–1128). doi:10.1109/ROBOT.1989.100131.
- Escande, A., Kheddar, A., Miossec, S., & Garsault, S. (2009) Planning support contact-points for acyclic motions and experiments on HRP-2. In: O. Khatib, V. Kumar, G. J. Pappas (Eds.), Experimental Robotics. Springer Tracts in Advanced Robotics, Vol. 54. Springer, Berlin, Heidelberg.Google Scholar
- Gill, P. E., Murray, W., & Saunders, M. A. (1997). An SQP algorithm for large-scale constrained optimization: Snopt.Google Scholar
- GNU. (2015). Gnu linear programming kit (glpk). http://www.gnu.org/software/glpk/glpk.html. Accessed 16 April 2015.
- Harada, K., Hauser, K., Bretl, T., & Latombe, J.-C. (2006). Natural motion generation for humanoid robots. In IEEE/RSJ international conference on intelligent robots and systems (IROS).Google Scholar
- Hauser, K. (2013a). Fast interpolation and time-optimization on implicit contact submanifolds. In Robotics: Science and systems.Google Scholar
- Hauser, K. (2013b). Robust contact generation for robot simulation with unstructured meshes. In International symposium on robotics research, Singapore.Google Scholar
- Kunz, T., & Stilman, M. (2012). Time-optimal trajectory generation for path following with bounded acceleration and velocity. In Robotics: Science and systems.Google Scholar
- Lertkultanon, P., & Pham, Q.-C. (2014). Dynamic non-prehensile object transportation. In International conference on control automation robotics vision (ICARCV) (pp. 1392–1397).Google Scholar
- Liu, C. K. (2009). Dextrous manipulation from a grasping pose. ACM Transactions on Graphics (TOG), 28(3), 59.Google Scholar
- Luo, J., & Hauser, K. (2012). Interactive generation of dynamically feasible robot trajectories from sketches using temporal mimicking. In IEEE international conference on robotics and automation (ICRA) (pp. 3665–3670).Google Scholar
- Luo, J., & Hauser, K. (2015). Robust trajectory optimization under frictional contact with iterative learning. In Robotics: Science and systems.Google Scholar
- Lynch, K. M., & Mason, M. T. (1996). Dynamic underactuated nonprehensile manipulation. In IEEE/RSJ international conference on intelligent robots and systems (IROS) (Vol. 2, pp. 889–896). IEEE.Google Scholar
- Mordatch, I., Popović, Z., & Todorov, E. (2012). Contact-invariant optimization for hand manipulation. In Proceedings of the ACM SIGGRAPH/eurographics symposium on computer animation (pp. 137–144). Eurographics Association.Google Scholar
- Pham, Q.-C., Caron, S., Lertkultanon, P., & Nakamura, Y. (2014). Planning truly dynamic motions: Path-velocity decomposition revisited. arXiv preprint arXiv:1411.4045.
- Posa, M., & Tedrake, R. (2012). Direct trajectory optimization of rigid body dynamical systems through contact. In Workshop on the algorithmic foundations of robotics.Google Scholar