Skip to main content
Log in

Probabilistic movement primitives for coordination of multiple human–robot collaborative tasks

  • Published:
Autonomous Robots Aims and scope Submit manuscript

Abstract

This paper proposes an interaction learning method for collaborative and assistive robots based on movement primitives. The method allows for both action recognition and human–robot movement coordination. It uses imitation learning to construct a mixture model of human–robot interaction primitives. This probabilistic model allows the assistive trajectory of the robot to be inferred from human observations. The method is scalable in relation to the number of tasks and can learn nonlinear correlations between the trajectories that describe the human–robot interaction. We evaluated the method experimentally with a lightweight robot arm in a variety of assistive scenarios, including the coordinated handover of a bottle to a human, and the collaborative assembly of a toolbox. Potential applications of the method are personal caregiver robots, control of intelligent prosthetic devices, and robot coworkers in factories.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20

Similar content being viewed by others

Notes

  1. For example, a rate of 1.25 acts as a surrogate for a human that moves 25 % slower than the time-aligned interaction model.

References

  • Ben Amor, H., Neumann, G., Kamthe, S., Kroemer, O., & Peters, J. (2014). Interaction primitives for human–robot cooperation tasks. In: Proceedings of the IEEE international conference on robotics and automation (ICRA).

  • Ben Amor, H., Vogt, D., Ewerton, M., Berger, E., Jung, B., & Peters, J. (2013). Learning responsive robot behavior by imitation. In: Proceedings of the IEEE/RSJ international conference on intelligent robots and systems (IROS) (pp. 3257–3264).

  • Bishop, C. (2006). Pattern recognition and machine learning (Vol. 4(4)). New York: Springer.

    MATH  Google Scholar 

  • Bonilla, B. L., & Asada, H. H. (2014). A robot on the shoulder: Coordinated human-wearable robot control using coloured petri nets and partial least squares predictions. In: Proceedings of the IEEE international conference on robotics and automation (ICRA) (pp. 119–125).

  • Cakmak, M., Srinivasa, S. S., Lee, M. K., Forlizzi, J., & Kiesler, S. (2011). Human preferences for robothuman hand-over configurations. In: Proceedings of the IEEE/RSJ international conference on intelligent robots and systems (IROS) (pp. 1986–1993).

  • Calinon, S., & Billard, A. (2009). Statistical learning by imitation of competing constraints in joint space and task space. Advanced Robotics, 23(15), 2059–2076.

    Article  Google Scholar 

  • Calinon, S., D’halluin, F., Sauser, E. L., Caldwell, D. G., & Billard, A. G. (2010). Learning and reproduction of gestures by imitation. IEEE Robotics & Automation Magazine, 17(2), 44–54.

    Article  Google Scholar 

  • Calinon, S., Li, Z., Alizadeh, T., Tsagarakis, N. G., & Caldwell, D. G. (2012). Statistical dynamical systems for skills acquisition in humanoids. In: Proceedings of the international conference on humanoid robots (HUMANOIDS) (pp. 323–329).

  • Englert, P., & Toussaint, M. (2014). Reactive phase and task space adaptation for robust motion execution. In: Proceedings of the IEEE/RSJ international conference on intelligent robots and systems (IROS) (pp. 109–116).

  • Ewerton, M., Maeda, G., Peters, J., & Neumann, G. (2015). Learning motor skills from partially observed movements executed at different speeds. In: Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (pp. 456–463).

  • Ewerton, M., Neumann, G., Lioutikov, R., Ben Amor, H., Peters, J., & Maeda, G. (2015). Learning multiple collaborative tasks with a mixture of interaction primitives. In: Accepted: Proceedings of the IEEE international conference on robotics and automation (ICRA).

  • Ijspeert, A. J., Nakanishi, J., Hoffmann, H., Pastor, P., & Schaal, S. (2013). Dynamical movement primitives: Learning attractor models for motor behaviors. Neural Computation, 25(2), 328–373.

    Article  MathSciNet  MATH  Google Scholar 

  • Kalakrishnan, M., Chitta, S., Theodorou, E., Pastor, P., & Schaal, S. (2011). STOMP: Stochastic trajectory optimization for motion planning. In: Proceedings of the IEEE international conference on robotics and automation (ICRA) (pp. 4569–4574).

  • Kim, S., Gribovskaya, E., & Billard, A. (2010). Learning motion dynamics to catch a moving object. In: Proceedings of the IEEE-RAS international conference on humanoid robots (HUMANOIDS) (pp. 106–111).

  • Koppula, H.S., & Saxena, A., (2013). Anticipating human activities using object affordances for reactive robotic response. In: Robotics: Science and systems.

  • Kulvicius, T., Biehl, M., Aein, M. J., Tamosiunaite, M., & Wörgötter, F. (2013). Interaction learning for dynamic movement primitives used in cooperative robotic tasks. Robotics and Autonomous Systems, 61(12), 1450–1459.

    Article  Google Scholar 

  • Kupcsik, A., Hsu, D., & Lee, S. (2015). Learning dynamic robot-to-human object handover from human feedback. In: International symposium on robotics research (ISRR).

  • Lawitzky, M., Medina, J., Lee, D., & Hirche, S. (2012, Oct). Feedback motion planning and learning from demonstration in physical robotic assistance: differences and synergies. In: Proceedings of the IEEE/RSJ international conference on intelligent robots and systems (IROS) (pp. 3646–3652).

  • Lee, D., Ott, C., & Nakamura, Y. (2010). Mimetic communication model with compliant physical contact in human–humanoid interaction. The International Journal of Robotics Research, 29(13), 1684–1704.

    Article  Google Scholar 

  • Maeda, G., Ewerton, M., Lioutikov, R., Ben Amor, H., Peters, J., & Neumann, G. (2014). Learning interaction for collaborative tasks with probabilistic movement primitives. In: Proceedings of the international conference on humanoid robots (HU- MANOIDS) (pp. 527–534).

  • Maeda, G., Neumann, G., Ewerton, L. R., M., & Peters, J. (2015). A probabilistic framework for semiautonomous robots based on interaction primitives with phase estimation. In: Proceedings of the international symposium of robotics research (ISRR).

  • Mainprice, J., & Berenson, D. (2013). Human–robot collaborative manipulation planning using early prediction of human motion. In: Proceedings of the IEEE/RSJ international conference on intelligent robots and systems (IROS) (pp. 299–306).

  • Oliver, N., Rosario, B., & Pentland, A. (2000). A Bayesian computer vision system for modeling human interactions. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(8), 831–843. doi:10.1109/34.868684.

    Article  Google Scholar 

  • Paraschos, A., Daniel, C., Peters, J., & Neumann, G. (2013). Probabilistic movement primitives. In: Advances in neural information processing systems (NIPS) (pp. 2616–2624).

  • Ratliff, N., Zucker, M., Bagnell, J. A., & Srinivasa, S. (2009). Chomp: Gradient optimization techniques for efficient motion planning. In: Proceedings of the IEEE international conference on robotics and automation (ICRA) (pp. 489–494).

  • Rohmer, E., Freese, M., Singh, S. P. (2013). V-rep: A versatile and scalable robot simulation framework. In: Proceedings of the IEEE/RSJ international conference on intelligent robots and systems (IROS).

  • Rozo, L., Calinon, S., Caldwell, D. G., Jimenez, P., & Torras, C. (2013). Learning collaborative impedance-based robot behaviors. In: AAAI conference on artificial intelligence. Bellevue, Washington, USA

  • Sakoe, H., & Chiba, S. (1978). Dynamic programming algorithm optimization for spoken word recognition. IEEE Transactions on Acoustics, Speech and Signal Processing, 26(1), 43–49.

    Article  MATH  Google Scholar 

  • Schaal, S. (1999). Is imitation learning the route to humanoid robots? Trends in Cognitive Sciences, 3(6), 233–242.

    Article  Google Scholar 

  • Sisbot, E. A., & Alami, R. (2012). A human-aware manipulation planner. IEEE Transactions on Robotics, 28(5), 1045–1057.

    Article  Google Scholar 

  • Strabala, K. W., Lee, M. K., Dragan, A. D., Forlizzi, J. L., Srinivasa, S., Cakmak, M., et al. (2013). Towards seamless human–robot handovers. Journal of Human–Robot Interaction, 2(1), 112–132.

    Google Scholar 

  • Tanaka, Y., Kinugawa, J., Sugahara, Y., & Kosuge, K. (2012). Motion planning with worker’s trajectory prediction for assembly task partner robot. In: Proceedings of the IEEE/RSJ international conference on intelligent robots and systems (IROS) (pp. 1525–1532).

  • Theodorou, E., Buchli, J., & Schaal, S. (2010). Reinforcement learning of motor skills in high dimensions: A path integral approach. In: Proceedings of the IEEE international conference on robotics and automation (ICRA) (pp. 2397–2403).

  • Wang, Z., Muelling, K., Deisenroth, M. P., Ben Amor, H., Vogt, D., Schoelkopf, B., et al. (2013). Probabilistic movement modeling for intention inference in human–robot interaction. The International Journal of Robotics Research, 7, 841–858.

    Article  Google Scholar 

  • Yamane, K., Revfi, M., & Asfour, T. (2013). Synthesizing object receiving motions of humanoid robots with human motion database. In: Proceedings of the IEEE international conference on robotics and automation (ICRA) (pp. 1629–1636).

  • Yamane, K., Yamaguchi, Y., & Nakamura, Y. (2011). Human motion database with a binary tree and node transition graphs. Autonomous Robots, 30(1), 87–98.

    Article  Google Scholar 

Download references

Acknowledgments

The research leading to these results has received funding from the European Community’s Seventh Framework Programmes (FP7-ICT-2013-10) under Grant agreement #610878 (3rdHand) and from the European Union’s Horizon 2020 research and innovation programme under grant agreement #645582 (RoMaNS) and from the Project BIMROB of the Forum fr interdisziplinre Forschung (FiF) of the TU Darmstadt. The authors would like to acknowledge Heni Ben Amor for the invaluable ideas and discussions that contributed to this paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Guilherme J. Maeda.

Additional information

This is one of several papers published in Autonomous Robots comprising the “Special Issue on Assistive and Rehabilitation Robotics”.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (mp4 124848 KB)

Appendix: Time-alignment of multiple demonstrations

Appendix: Time-alignment of multiple demonstrations

One issue of imitation learning is that multiple demonstration trajectories provided by humans are usually, sometimes severely, warped in time. To compute the distribution of ProMP weights the demonstrated trajectories must be first aligned in relation to a common “clock”. In the context of movement primitives, this clock is often referred to as the phase variable. In this paper, all human and robot trajectories collected during the experiments presented in Sect. 4 were aligned by using the method briefly presented in (Maeda et al. 2014) and will be described in detail here.

The method consists of minimizing the cost J defined as the cumulative absolute difference between the demonstrated trajectory to be time-aligned \(\varvec{y}_w\) and a trajectory taken as a phase reference \(\varvec{y}_{r}\),

$$\begin{aligned} J= \sum _{k=0}^K | \varvec{y}_{r}( k ) - \varvec{y}_{w}( \varvec{t}_w^{j+1}(k) ) |, \end{aligned}$$
(25)

where both trajectories are resampled to have the same number of K steps. The vector \(\varvec{t}_w^{j+1}\) is the unwarped time, which is the solution of the iterative update

$$\begin{aligned} \varvec{t}_w^{j+1} = v^j_0 + \varvec{G}^j\varvec{t}_w^{j}, \end{aligned}$$
(26)

where \(\varvec{G} = \text {diag}(g(1), ..., g(K))\) and j is the iteration number of the optimization step.

We propose g(k) as a smooth and continuous warping function parameterized by N weights

$$\begin{aligned} {g}(k) = {\psi }_k^T \varvec{v}_{1:N}, \end{aligned}$$
(27)

where \({\psi }_k\) is the Gaussian basis function at time step k. The vector of parameters \(\varvec{v}^j = [v_0, \ v_1, ..., \ v_N ]\) is optimized by gradient descent to decrease the cost J defined in (25). The extra parameter \(v_0\) is used to shift the time which is useful when the reference and warped trajectories are, in fact, identical but start at different instants. The optimization is usually initialized with \(v^j_0=0\) and \(\varvec{t}_w^{j} = \varvec{t}_r\).

Dynamic Time Warping (DTW) (Sakoe and Chiba 1978) is a method widely used for solving time-alignment problems. An extension of DTW for the case where the time-alignment must be made on-line given only partial observations of \(\varvec{y}_w\) was presented in (Ben Amor et al. 2014). An issue intrinsic to DTW-based algorithms, however, is that several adjacent time steps of the trajectory to be aligned may be attributed to a single time step of the reference trajectory, and vice-versa. For trajectories provided by a dynamical system, this issue leads to discontinuities in the solution and unnatural movements. An extreme example of this problem is shown in Fig. 21(a) where it is observed that parts of the warped trajectory were lost after the DTW alignment.

A heuristic referred to as the slope constraint was proposed in (Sakoe and Chiba 1978) to alleviate this problem by forcing the same index to not be repeated more than a certain threshold. The slope constraint, however, does not completely solve the discontinuity problem and the tuning of the slope constraint is task dependent. By construction, our proposed method enforces that the warping function \(\varvec{g}\) is both continuous and smooth. The use of a smooth function not only avoids the tunning of slope constraint but also preserves the overall shape of the trajectory. Figure 21(b) shows the solution of our method for the same input data used in Fig. 21(a).

Fig. 21
figure 21

Comparison of time-alignment given by classical Dynamic Time Warping without slope constraint (a) and the proposed method using gradient descent (b). In the latter, the thin gray curves represent intermediate solutions of the local optimization while converging

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Maeda, G.J., Neumann, G., Ewerton, M. et al. Probabilistic movement primitives for coordination of multiple human–robot collaborative tasks. Auton Robot 41, 593–612 (2017). https://doi.org/10.1007/s10514-016-9556-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10514-016-9556-2

Keywords

Navigation