Probabilistic movement primitives for coordination of multiple human–robot collaborative tasks

Maeda, Guilherme J.; Neumann, Gerhard; Ewerton, Marco; Lioutikov, Rudolf; Kroemer, Oliver; Peters, Jan

doi:10.1007/s10514-016-9556-2

Published: 10 March 2016

Autonomous Robots, Volume 41, pages 593–612 (2017)

Abstract

This paper proposes an interaction learning method for collaborative and assistive robots based on movement primitives. The method allows for both action recognition and human–robot movement coordination. It uses imitation learning to construct a mixture model of human–robot interaction primitives. This probabilistic model allows the assistive trajectory of the robot to be inferred from human observations. The method is scalable in relation to the number of tasks and can learn nonlinear correlations between the trajectories that describe the human–robot interaction. We evaluated the method experimentally with a lightweight robot arm in a variety of assistive scenarios, including the coordinated handover of a bottle to a human, and the collaborative assembly of a toolbox. Potential applications of the method are personal caregiver robots, control of intelligent prosthetic devices, and robot coworkers in factories.

Notes

For example, a rate of 1.25 acts as a surrogate for a human who moves 25% slower than the time-aligned interaction model.

References

Ben Amor, H., Neumann, G., Kamthe, S., Kroemer, O., & Peters, J. (2014). Interaction primitives for human–robot cooperation tasks. In: Proceedings of the IEEE international conference on robotics and automation (ICRA).
Ben Amor, H., Vogt, D., Ewerton, M., Berger, E., Jung, B., & Peters, J. (2013). Learning responsive robot behavior by imitation. In: Proceedings of the IEEE/RSJ international conference on intelligent robots and systems (IROS) (pp. 3257–3264).
Bishop, C. (2006). Pattern recognition and machine learning (Vol. 4(4)). New York: Springer.
Bonilla, B. L., & Asada, H. H. (2014). A robot on the shoulder: Coordinated human-wearable robot control using coloured petri nets and partial least squares predictions. In: Proceedings of the IEEE international conference on robotics and automation (ICRA) (pp. 119–125).
Cakmak, M., Srinivasa, S. S., Lee, M. K., Forlizzi, J., & Kiesler, S. (2011). Human preferences for robot-human hand-over configurations. In: Proceedings of the IEEE/RSJ international conference on intelligent robots and systems (IROS) (pp. 1986–1993).
Calinon, S., & Billard, A. (2009). Statistical learning by imitation of competing constraints in joint space and task space. Advanced Robotics, 23(15), 2059–2076.
Calinon, S., D’halluin, F., Sauser, E. L., Caldwell, D. G., & Billard, A. G. (2010). Learning and reproduction of gestures by imitation. IEEE Robotics & Automation Magazine, 17(2), 44–54.
Calinon, S., Li, Z., Alizadeh, T., Tsagarakis, N. G., & Caldwell, D. G. (2012). Statistical dynamical systems for skills acquisition in humanoids. In: Proceedings of the international conference on humanoid robots (HUMANOIDS) (pp. 323–329).
Englert, P., & Toussaint, M. (2014). Reactive phase and task space adaptation for robust motion execution. In: Proceedings of the IEEE/RSJ international conference on intelligent robots and systems (IROS) (pp. 109–116).
Ewerton, M., Maeda, G., Peters, J., & Neumann, G. (2015). Learning motor skills from partially observed movements executed at different speeds. In: Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (pp. 456–463).
Ewerton, M., Neumann, G., Lioutikov, R., Ben Amor, H., Peters, J., & Maeda, G. (2015). Learning multiple collaborative tasks with a mixture of interaction primitives. In: Accepted: Proceedings of the IEEE international conference on robotics and automation (ICRA).
Ijspeert, A. J., Nakanishi, J., Hoffmann, H., Pastor, P., & Schaal, S. (2013). Dynamical movement primitives: Learning attractor models for motor behaviors. Neural Computation, 25(2), 328–373.
Kalakrishnan, M., Chitta, S., Theodorou, E., Pastor, P., & Schaal, S. (2011). STOMP: Stochastic trajectory optimization for motion planning. In: Proceedings of the IEEE international conference on robotics and automation (ICRA) (pp. 4569–4574).
Kim, S., Gribovskaya, E., & Billard, A. (2010). Learning motion dynamics to catch a moving object. In: Proceedings of the IEEE-RAS international conference on humanoid robots (HUMANOIDS) (pp. 106–111).
Koppula, H. S., & Saxena, A. (2013). Anticipating human activities using object affordances for reactive robotic response. In: Robotics: Science and systems.
Kulvicius, T., Biehl, M., Aein, M. J., Tamosiunaite, M., & Wörgötter, F. (2013). Interaction learning for dynamic movement primitives used in cooperative robotic tasks. Robotics and Autonomous Systems, 61(12), 1450–1459.
Kupcsik, A., Hsu, D., & Lee, S. (2015). Learning dynamic robot-to-human object handover from human feedback. In: International symposium on robotics research (ISRR).
Lawitzky, M., Medina, J., Lee, D., & Hirche, S. (2012, Oct). Feedback motion planning and learning from demonstration in physical robotic assistance: differences and synergies. In: Proceedings of the IEEE/RSJ international conference on intelligent robots and systems (IROS) (pp. 3646–3652).
Lee, D., Ott, C., & Nakamura, Y. (2010). Mimetic communication model with compliant physical contact in human–humanoid interaction. The International Journal of Robotics Research, 29(13), 1684–1704.
Maeda, G., Ewerton, M., Lioutikov, R., Ben Amor, H., Peters, J., & Neumann, G. (2014). Learning interaction for collaborative tasks with probabilistic movement primitives. In: Proceedings of the international conference on humanoid robots (HUMANOIDS) (pp. 527–534).
Maeda, G., Neumann, G., Ewerton, M., Lioutikov, R., & Peters, J. (2015). A probabilistic framework for semi-autonomous robots based on interaction primitives with phase estimation. In: Proceedings of the international symposium of robotics research (ISRR).
Mainprice, J., & Berenson, D. (2013). Human–robot collaborative manipulation planning using early prediction of human motion. In: Proceedings of the IEEE/RSJ international conference on intelligent robots and systems (IROS) (pp. 299–306).
Oliver, N., Rosario, B., & Pentland, A. (2000). A Bayesian computer vision system for modeling human interactions. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(8), 831–843. doi:10.1109/34.868684.
Paraschos, A., Daniel, C., Peters, J., & Neumann, G. (2013). Probabilistic movement primitives. In: Advances in neural information processing systems (NIPS) (pp. 2616–2624).
Ratliff, N., Zucker, M., Bagnell, J. A., & Srinivasa, S. (2009). CHOMP: Gradient optimization techniques for efficient motion planning. In: Proceedings of the IEEE international conference on robotics and automation (ICRA) (pp. 489–494).
Rohmer, E., Freese, M., & Singh, S. P. (2013). V-REP: A versatile and scalable robot simulation framework. In: Proceedings of the IEEE/RSJ international conference on intelligent robots and systems (IROS).
Rozo, L., Calinon, S., Caldwell, D. G., Jimenez, P., & Torras, C. (2013). Learning collaborative impedance-based robot behaviors. In: AAAI conference on artificial intelligence. Bellevue, Washington, USA.
Sakoe, H., & Chiba, S. (1978). Dynamic programming algorithm optimization for spoken word recognition. IEEE Transactions on Acoustics, Speech and Signal Processing, 26(1), 43–49.
Schaal, S. (1999). Is imitation learning the route to humanoid robots? Trends in Cognitive Sciences, 3(6), 233–242.
Sisbot, E. A., & Alami, R. (2012). A human-aware manipulation planner. IEEE Transactions on Robotics, 28(5), 1045–1057.
Strabala, K. W., Lee, M. K., Dragan, A. D., Forlizzi, J. L., Srinivasa, S., Cakmak, M., et al. (2013). Towards seamless human–robot handovers. Journal of Human–Robot Interaction, 2(1), 112–132.
Tanaka, Y., Kinugawa, J., Sugahara, Y., & Kosuge, K. (2012). Motion planning with worker’s trajectory prediction for assembly task partner robot. In: Proceedings of the IEEE/RSJ international conference on intelligent robots and systems (IROS) (pp. 1525–1532).
Theodorou, E., Buchli, J., & Schaal, S. (2010). Reinforcement learning of motor skills in high dimensions: A path integral approach. In: Proceedings of the IEEE international conference on robotics and automation (ICRA) (pp. 2397–2403).
Wang, Z., Muelling, K., Deisenroth, M. P., Ben Amor, H., Vogt, D., Schoelkopf, B., et al. (2013). Probabilistic movement modeling for intention inference in human–robot interaction. The International Journal of Robotics Research, 32(7), 841–858.
Yamane, K., Revfi, M., & Asfour, T. (2013). Synthesizing object receiving motions of humanoid robots with human motion database. In: Proceedings of the IEEE international conference on robotics and automation (ICRA) (pp. 1629–1636).
Yamane, K., Yamaguchi, Y., & Nakamura, Y. (2011). Human motion database with a binary tree and node transition graphs. Autonomous Robots, 30(1), 87–98.

Acknowledgments

The research leading to these results has received funding from the European Community’s Seventh Framework Programmes (FP7-ICT-2013-10) under Grant agreement #610878 (3rdHand), from the European Union’s Horizon 2020 research and innovation programme under grant agreement #645582 (RoMaNS), and from the Project BIMROB of the Forum für interdisziplinäre Forschung (FiF) of the TU Darmstadt. The authors would like to acknowledge Heni Ben Amor for the invaluable ideas and discussions that contributed to this paper.

Author information

Authors and Affiliations

Technische Universitaet Darmstadt, Darmstadt, Germany
Guilherme J. Maeda, Gerhard Neumann, Marco Ewerton, Rudolf Lioutikov & Jan Peters
University of Southern California, California, USA
Oliver Kroemer
Max Planck Institute for Intelligent Systems, Tuebingen, Germany
Jan Peters

Corresponding author

Correspondence to Guilherme J. Maeda.

Additional information

This is one of several papers published in Autonomous Robots comprising the “Special Issue on Assistive and Rehabilitation Robotics”.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (mp4 124848 KB)

Appendix: Time-alignment of multiple demonstrations

One issue in imitation learning is that multiple demonstration trajectories provided by humans are usually warped in time, sometimes severely. To compute the distribution of ProMP weights, the demonstrated trajectories must first be aligned to a common “clock”. In the context of movement primitives, this clock is often referred to as the phase variable. In this paper, all human and robot trajectories collected during the experiments presented in Sect. 4 were aligned using the method briefly presented in (Maeda et al. 2014), which is described in detail here.

The method consists of minimizing the cost J defined as the cumulative absolute difference between the demonstrated trajectory to be time-aligned $\varvec{y}_w$ and a trajectory taken as a phase reference $\varvec{y}_{r}$,

$$\begin{aligned} J = \sum _{k=0}^{K} \left| \varvec{y}_{r}(k) - \varvec{y}_{w}\left( \varvec{t}_w^{j+1}(k) \right) \right| , \end{aligned} \tag{25}$$

where both trajectories are resampled to have the same number of steps K. The vector $\varvec{t}_w^{j+1}$ is the unwarped time, which is the solution of the iterative update

$$\begin{aligned} \varvec{t}_w^{j+1} = v^j_0 + \varvec{G}^j\varvec{t}_w^{j}, \end{aligned} \tag{26}$$

where $\varvec{G} = \text{diag}(g(1), \ldots , g(K))$ and j is the iteration number of the optimization step.
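Because $\varvec{G}^j$ is diagonal, the update (26) can be read one time step at a time. A sketch of the element-wise form, assuming that the scalar shift $v^j_0$ is added to every entry of the time vector and writing $g^j(k)$ (a notation introduced here for illustration) for the k-th diagonal entry at iteration j:

$$\begin{aligned} t_w^{j+1}(k) = v^j_0 + g^j(k)\, t_w^{j}(k), \qquad k = 1, \ldots , K. \end{aligned}$$

With $v^j_0 = 0$ and $g^j(k) = 1$ for all k, the update leaves the time vector unchanged.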

We propose g(k) as a smooth and continuous warping function parameterized by N weights

$$\begin{aligned} g(k) = {\psi }_k^T \varvec{v}_{1:N}, \end{aligned} \tag{27}$$

where ${\psi }_k$ is the vector of N Gaussian basis functions evaluated at time step k. The vector of parameters $\varvec{v}^j = [v_0, \ v_1, ..., \ v_N ]$ is optimized by gradient descent to decrease the cost J defined in (25). The extra parameter $v_0$ shifts the time, which is useful when the reference and warped trajectories are, in fact, identical but start at different instants. At the first iteration, the optimization is usually initialized with $v^j_0=0$ and $\varvec{t}_w^{j} = \varvec{t}_r$, the time of the reference trajectory.
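The procedure in Eqs. (25) to (27) can be illustrated with a minimal Python sketch. It is not the authors' implementation and makes several simplifying assumptions: the warp is applied once to the reference time rather than composed across iterations, the basis functions are normalized and evenly spaced, the gradient is approximated by central finite differences, and a backtracking step size is used. The names `gaussian_basis`, `warp_time`, `cost` and `align`, and all default parameter values, are hypothetical.

```python
import numpy as np


def gaussian_basis(K, N, width=0.02):
    """Return a K x N matrix whose row k holds normalized Gaussian basis values."""
    z = np.linspace(0.0, 1.0, K)[:, None]        # normalized time steps
    centers = np.linspace(0.0, 1.0, N)[None, :]  # evenly spaced centers (assumption)
    psi = np.exp(-0.5 * (z - centers) ** 2 / width)
    return psi / psi.sum(axis=1, keepdims=True)  # rows sum to 1, so v_{1:N} = 1 gives g(k) = 1


def warp_time(v, psi, t):
    """Eqs. (26)-(27) applied once: t_new = v_0 + diag(g) t, with g = psi @ v_{1:N}."""
    return v[0] + (psi @ v[1:]) * t


def cost(v, psi, t_ref, y_ref, y_w):
    """Eq. (25): cumulative absolute difference after warping the time of y_w."""
    t_w = np.clip(warp_time(v, psi, t_ref), t_ref[0], t_ref[-1])
    return float(np.sum(np.abs(y_ref - np.interp(t_w, t_ref, y_w))))


def align(y_ref, y_w, N=6, iters=100, step=1e-3, eps=1e-4):
    """Gradient descent on J; y_ref and y_w must already have the same length K."""
    K = len(y_ref)
    t_ref = np.linspace(0.0, 1.0, K)
    psi = gaussian_basis(K, N)
    v = np.concatenate(([0.0], np.ones(N)))  # v_0 = 0 and g(k) = 1: start from t_w = t_r
    J = cost(v, psi, t_ref, y_ref, y_w)
    history = [J]
    for _ in range(iters):
        # Finite-difference gradient (an assumption; the source does not specify it).
        grad = np.zeros_like(v)
        for n in range(len(v)):
            dv = np.zeros_like(v)
            dv[n] = eps
            grad[n] = (cost(v + dv, psi, t_ref, y_ref, y_w)
                       - cost(v - dv, psi, t_ref, y_ref, y_w)) / (2.0 * eps)
        # Backtracking: halve the step until the cost decreases, else keep v.
        s = step
        while s > 1e-10:
            candidate = v - s * grad
            J_candidate = cost(candidate, psi, t_ref, y_ref, y_w)
            if J_candidate < J:
                v, J = candidate, J_candidate
                break
            s *= 0.5
        history.append(J)
    t_w = np.clip(warp_time(v, psi, t_ref), t_ref[0], t_ref[-1])
    return v, t_w, history
```

Calling `align(y_ref, y_w)` on two equally sampled trajectories returns the optimized parameters, the warped time vector, and the cost after each iteration. Because the warp is a weighted sum of smooth basis functions, the resulting time vector varies smoothly with k.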

Dynamic Time Warping (DTW) (Sakoe and Chiba 1978) is a method widely used for solving time-alignment problems. An extension of DTW for the case where the time-alignment must be made on-line given only partial observations of $\varvec{y}_w$ was presented in (Ben Amor et al. 2014). An issue intrinsic to DTW-based algorithms, however, is that several adjacent time steps of the trajectory to be aligned may be attributed to a single time step of the reference trajectory, and vice-versa. For trajectories provided by a dynamical system, this issue leads to discontinuities in the solution and unnatural movements. An extreme example of this problem is shown in Fig. 21(a) where it is observed that parts of the warped trajectory were lost after the DTW alignment.
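The many-to-one assignment described above can be reproduced with a textbook DTW implementation. The following sketch is a generic dynamic-programming version without a slope constraint, not the on-line extension of (Ben Amor et al. 2014); the function names `dtw_path` and `max_repeats` are hypothetical.

```python
from collections import Counter

import numpy as np


def dtw_path(y_ref, y_w):
    """Classic DTW with absolute-difference local cost; returns (path, total cost)."""
    n, m = len(y_ref), len(y_w)
    D = np.full((n + 1, m + 1), np.inf)  # accumulated cost with an infinite border
    D[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(y_ref[i - 1] - y_w[j - 1])
            D[i, j] = d + min(D[i - 1, j - 1], D[i - 1, j], D[i, j - 1])
    # Backtrack from the end to the start along the cheapest predecessors.
    i, j = n, m
    path = [(i - 1, j - 1)]
    while (i, j) != (1, 1):
        candidates = [
            (D[i - 1, j - 1], i - 1, j - 1),
            (D[i - 1, j], i - 1, j),
            (D[i, j - 1], i, j - 1),
        ]
        _, i, j = min(candidates)
        path.append((i - 1, j - 1))
    return path[::-1], float(D[n, m])


def max_repeats(path):
    """Largest number of y_w time steps assigned to a single reference time step."""
    return Counter(i for i, _ in path).most_common(1)[0][1]
```

When `y_w` has more samples than `y_ref`, the path must visit every sample of `y_w`, so at least one reference index is necessarily repeated. Each repetition holds the reference time fixed while the demonstrated trajectory advances, which is the source of the discontinuities mentioned in the text.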

A heuristic referred to as the slope constraint was proposed in (Sakoe and Chiba 1978) to alleviate this problem by preventing the same index from being repeated more than a given number of times. The slope constraint, however, does not completely solve the discontinuity problem, and its tuning is task dependent. By construction, our proposed method enforces that the warping function $\varvec{g}$ is both continuous and smooth. The use of a smooth function not only avoids the tuning of the slope constraint but also preserves the overall shape of the trajectory. Figure 21(b) shows the solution of our method for the same input data used in Fig. 21(a).

About this article

Cite this article

Maeda, G.J., Neumann, G., Ewerton, M. et al. Probabilistic movement primitives for coordination of multiple human–robot collaborative tasks. Auton Robot 41, 593–612 (2017). https://doi.org/10.1007/s10514-016-9556-2

Received: 29 March 2015
Accepted: 10 February 2016
Published: 10 March 2016
Issue Date: March 2017
DOI: https://doi.org/10.1007/s10514-016-9556-2
