Abstract
The notion of synergy enables one to provide simplified descriptions of hand actions. It has been used in a number of different meanings ranging from kinematic and dynamic synergies to postural and temporal postural synergies. However, relatively little is known about how representing an action by synergies might take into account the possibility to have a hierarchical and multiple action representation. This is a key aspect for action representation as it has been characterized by action theorists and cognitive neuroscientists. Thus, the aim of the present paper is to investigate whether and to what extent a hierarchical and multiple action representation can be obtained by a synergy approach. To this purpose, we took advantage of representing hand action as a linear combination of temporal postural synergies (TPSs), but on the assumption that TPSs have a tree-structured organization. In a tree-structured organization, a hand action representation can involve a TPS only if the ancestors of the synergy in the tree are themselves involved in the action representation. The results showed that this organization is enough to force a multiple representation of hand actions in terms of synergies which are hierarchically organized.
Similar content being viewed by others
Notes
We consider high sparsity a mean sparsity value roughly equal to 70 % of the total number of TPSs of each tree. More specifically, the mean sparsity of the tree-based action representations belonging to each training set was computed as \(1-\frac{1}{rn}\sum\nolimits_{j=1}^{n}\|{\bf U}_{j}\|_{0}\), where U j are the coefficients of the tree-based action representation for jth action of the training set. For the corresponding λ, we found that the reconstruction error \(\frac{1}{2np}\|{\bf X}-{\bf U}{\bf V}^{T}\|_{F}^{2}\) was always lower than 4 × 10−3.
Note that all w j are fixed to 1 in the experiments.
References
Aharon M, Elad M, Bruckstein A (2006) K-svd: an algorithm for designing overcomplete dictionaries for sparse representation. IEEE Trans Signal Process 54(11):4311–4322
Arbib MA, Iberall T, Lyons D (1985) Coordinated control programs for control of the hands. In: Goodwin AW, Darian-Smith I (eds) Hand function and the neocortex. Experimental Brain Research Supplemental 10, Springer, Berlin, pp 111–129
Arimoto S, Tahara K, Yamaguchi M, Nguyen PTA, Han MY (2001) Principles of superposition for controlling pinch motions by means of robot fingers with soft tips. Robotica 19:21–28
Basso C, Santoro M, Verri A, Villa S (2011) Paddle: proximal algorithm for dual dictionaries learning. In: ICANN 2011, Lecture notes in computer science, vol 6791. Springer, Berlin, pp 379–386
Bernstein NA (1967) The co-ordination and regulation of movements, first english edition edn. Pergamon Press Ltd, Oxford
Combettes P, Wajs V (2006) Signal recovery by proximal forward-backward splitting. Multiscale Model Simul 4(4):1168–1200
d’Avella A, Portone A, Fernandez L, Lacquaniti F (2006) Control of fast-reaching movements by muscle synergy combinations. J Neurosci 26(30):7791–7810
Engan K, Aase S, Hakon Husoy J (1999) Method of optimal directions for frame design. In: Acoustics, speech, and signal processing, 1999. Proceedings of the 1999 IEEE international conference on, IEEE computer society, vol 5, pp 2443–2446
Feix T, Pawlik R, Schmiedmayer H, Romero J, Kragic D (2009) A comprehensive grasp taxonomy. In: Robotics, Science and Systems: workshop on understanding the human hand for advancing robotic manipulation, poster presentation, Seattle, USA
Fogassi L, Ferrari PF, Gesierich B, Rozzi S, Chersi F, Rizzolatti G (2005) Parietal lobe: from action organization to intention understanding. Science 308(5722):662–667
Gallese V (2000) The inner sense of action. Agency and motor representations. J Conscious Stud 7(10):23–40
Goldman A (1970) A theory of human action, vol 2. Prentice-Hall Englewood Cliffs, NJ
Goodman SR, Latash ML (2006) Feed-forward control of a redundant motor system. Biol Cybern 95:271–280
Gorniak S, Zatsiorsky VM, Latash ML (2009) Hierarchical control of static prehension: II. Multi-digit synergies. Exp Brain Res 194(1):1–15
Grafton ST, Hamilton AF (2007) Evidence for a distributed hierarchy of action representation in the brain. Hum Mov Sci 26(4):590–616
Grinyagin IV, Biryukova EV, Maier MA (2005) Kinematic and dynamic synergies of human precision-grip movements. J Neurophysiol 94(4):2284–2294
Hastie T, Tibshirani R, Friedman J (2003) The elements of statistical learning: data mining, inference, and prediction, corrected edn. Springer, Berlin
Iberall T, Fagg AH (1996) Neural network models for selecting hand shapes. In: Wing AM, Haggard P, Flanagan JR (eds) Hand and brain: the neurophysiology and psychology of hand movements. Academic Press, London, pp 243–264
Iberall T, Bingham G, Arbib MA (1986) Opposition space as a structuring concept for the analysis of skilled hand movements. Exp Brain Res Ser 15:158–173
Jeannerod M (1988) The neural and behavioural organization of goal-directed movements. Clarendon Press, Oxford
Jeannerod M, Arbib MA, Rizzolatti G, Sakata H (1995) Grasping objects: the cortical mechanisms of visuomotor transformation. Trends Neurosci 18(7):314–320
Jenatton R, Mairal J, Obozinski G, Bach F (2010) Proximal methods for sparse hierarchical dictionary learning. In: International conference on machine learning 2010, pp 487–494
Jenatton R, Audibert JY, Bach F (2011) Structured variable selection with sparsity-inducing norms. J Mach Learn Res 12:2777–2824
Jerde T, Soechting J, Flanders M (2003) Biological constraints simplify the recognition of hand shapes. IEEE Trans Biomed Eng 50(2):265–269
Latash ML (2008) Synergy. Oxford University Press, New York
Latash ML, Scholz JP, Schöner G (2007) Toward a new theory of motor synergies. Motor control 11:276–308
Lee H, Battle A, Raina R, Ng AY (2007) Efficient sparse coding algorithms. In: Advances in neural information processing systems, vol 19, pp 801–808
Mason CR, Gomez JE, Ebner TJ (2001) Hand synergies during reach-to-grasp. J Neurophysiol 86(6):2896–2910
Mele A (1997) The philosophy of action. Oxford university press, New York
Murata A, Gallese V, Luppino G, Kaseda M, Sakata H (2000) Selectivity for the shape, size, and orientation of objects for grasping in neurons of monkey parietal area aip. J Neurophysiol 83(5):2580–2601
Nelissen K, Luppino G, Vanduffel W, Rizzolatti G, Orban G (2005) Observing others: multiple action representation in the frontal lobe. Science 310(5746):332
Raina R, Battle A, Lee H, Packer B, Ng AY (2007) Self-taught learning: transfer learning from unlabeled data. In: Proceedings of the 24th international conference on Machine learning. ACM, New York, ICML ’07, pp 759–766
Rizzolatti G, Sinigaglia C (2008) Mirrors in the brain: how our minds share actions, emotions. Oxford University Press, USA
Rizzolatti G, Sinigaglia C (2010) The functional role of the parieto-frontal mirror circuit: interpretations and misinterpretations. Nat Rev Neurosci 11(4):264–274
Rizzolatti G, Camarda R, Fogassi L, Gentilucci M, Luppino G, Matelli M (1988) Functional organization of inferior area 6 in the macaque monkey. Exp Brain Res 71(3):491–507
Rizzolatti G, Fogassi L, Gallese V (2001) Neurophysiological mechanisms underlying the understanding and imitation of action. Nat Rev Neurosci 2(9):661–670
Rizzolatti G, Fogassi L, Gallese V (2004) Cortical mechanisms subserving object grasping, action understanding, and imitation. In: Gazzaniga MS (ed) The new cognitive neurosciences, 3rd edn, a bradford book. MIT Press, Cambridge, pp 427–440
Santello M, Soechting J (2000) Force synergies for multifingered grasping. Exp Brain Res 133(4):457–467
Santello M, Flanders M, Soechting JF (1998) Postural hand synergies for tool use. J Neurosci 18(23):10105–10115
Santello M, Flanders M, Soechting JF (2002) Patterns of hand motion during grasping and the influence of sensory guidance. J Neurosci 22(4):1426–1235
Shen H, Huang JZ (2008) Sparse principal component analysis via regularized low rank matrix approximation. J Multivar Anal 99:1015–1034
Shim JK, Latash ML, Zatsiorsky VM (2003) Prehension synergies: trial-to-trial variability and hierarchical organization of stable performance. Exp Brain Res 152(2):173–184
Shim JK, Latash ML, Zatsiorsky VM (2005) Prehension synergies in three dimensions. J Neurophysiol 93(2):766–776
Shim JK, Latash ML, Zatsiorsky VM (2005b) Prehension synergies: trial-to-trial variability and principle of superposition during static prehension in three dimensions. J Neurophysiol 93:3649–3658
Subasi A, Ismail Gursoy M (2010) Eeg signal classification using pca, ica, lda and support vector machines. Expert Syst Appl 37:8659–8666
Tessitore G, Prevete R, Catanzariti E, Tamburrini G (2010) From motor to sensory processing in mirror neuron computational modelling. Biol Cybern 103(6):471–485
Thakur PH, Bastian AJ, Hsiao SS (2008) Multidigit movement synergies of the human hand in an unconstrained haptic exploration task. J Neurosci 28(6):1271–1281
Todorov E, Ghahramani Z (2004) Analysis of the synergies underlying complex hand manipulation. In: 26th Annual international conference of the IEEE engineering in medicine and biology society, IEEE, vol 4, pp 4637–4640
Todorov E, Jordan MI (2002) Optimal feedback control as a theory of motor coordination. Nat Neurosci 5(11):1226–1235
Tresch MC, Saltiel P, Bizzi E (1999) The construction of movement by the spinal cord. Nat Neurosci 2(2):162–167
Vinjamuri R, Lee H, Mao Z (2010a) Dimensionality reduction in control and coordination of the human hand. IEEE Trans Biomed Eng 57(2):284–295
Vinjamuri R, Sun M, Chang CC, Lee HN, Sclabassi RJ, Mao ZH (2010b) Temporal postural synergies of the hand in rapid grasping tasks. IEEE Trans Inf Technol Biomed 14(4):986–994
Wright J, Ma Y, Mairal J, Sapiro G, Huang TS, Yan S (2010) Sparse representation for computer vision and pattern recognition. Proc IEEE 98(6):1031–1044
Zatsiorsky VM, Gao F, Latash ML (2003) Finger force vectors in multi-finger prehension. J Biomech 36(11):1745–1749
Zatsiorsky VM, Latash ML, Gao F, Shim JK (2004) The principle of superposition in human prehension. Robotica 22:231–234
Zou H, Hastie T, Tibshirani R (2006) Sparse principal component analysis. J Comput Graph Stat 15(2):265–286
Acknowledgments
This work was supported by a grant by Compagnia San Paolo. The authors wish to thank Giacomo Rizzolatti, Guglielmo Tamburrini and Giuseppe Trautteur for their comments and suggestions on a previous version of this paper. Support from the European Commission, VII FP (grant no. ICT-216293 DEXMART) is gratefully acknowledged by Prevete and Tessitore. We are also grateful to Chiara Brozzo for her help in revising the final version of the paper.
Author information
Authors and Affiliations
Corresponding authors
Appendix: Tree-structured synergy method (TSSM)
Appendix: Tree-structured synergy method (TSSM)
We will use the following notations. Bold uppercase letters refer to matrices, for example, X, V, and bold lowercase letters designate vectors, for example, x, v. We denote by X i and X j the ith row and the jth column of a matrix X, respectively. We use the notation x i and v ij to refer to the ith element of the vector x and the element in the ith row and the jth column of the matrix V, respectively. Given \({{\bf x}\in\mathbb{R}^{p}}\) we use the notation \(\|x\|\) to refer to \(l_{\infty}\) norm. Given two vectors x and y in \({\mathbb{R}^{p}}\), we denote by \({\bf x}\circ{\bf y}=(x_{1}y_{1},x_{2}y_{2},\ldots,x_{p}y_{p})\in R^{p}\) the element-wise product of x and y.
Tree-structured stage
The update of the U values is performed in this stage, and, more importantly, following the approach suggested by Jenatton et al. (2010), a tree-structured representation of the rows in X is found. The main difficulty is that the optimization of the \({\bf U}_{i}, i\in{1,2, \ldots,n}\), for a fixed V involves the nonsmooth regularization term \(\Upomega({\bf U}_{i})=\sum\nolimits_{j=1}^{r}w_{j}\|{\bf D}_{j}\circ{\bf U}_{i}\|\), where w j are positive weights.Footnote 2 In this case the update of the vectors U i can be performed using a proximal method (Combettes and Wajs 2006). In general, proximal approaches are used when one has to minimize a convex nonsmooth objective function which assumes the following general form:
where f(u) is the usual data-fitting term \(\frac{1}{2}\Vert{\bf x}-{\bf u}{\bf V}^{T}\Vert_{2}^{2}\) and \(\Upomega({\bf u})\) is a nondifferentiable regularization term. In a nutshell, the proximal approach consists of two consecutive updating steps: first, the vector u is updated using the standard gradient update rule w.r.t the first term of the objective function as follows:
then, starting from the value \(\bar{{\bf u}}\) the new value for u is computed by applying a proximal operator \(\Uppi_{{\bf U}}\) defined by the following minimization problem:
Thus, we obtain \({\bf u}^{new}\leftarrow\Uppi_{{\bf U}}(\bar{{\bf u}})\). For a number of regularization terms, the minimization problem expressed in (6) can lead to closed-form solutions. For example, when \(\Upomega({\bf u})\) is the ℓ1 norm of u the corresponding proximal operator \(\Uppi_{{\bf U}}\) is the well-known soft-thresholding operator. In the case of the regularization term used here, this minimization problem can be solved by a primal-dual approach which enables us to implement the proximal operator defined in (6) by the procedure presented in Algorithm (1).
Summarizing, the optimization of the U i values is performed using the gradient descent rule expressed in (5) and, then, applying the proximal operator as defined previously.
Synergy dictionary stage
This stage consists in updating the V’s values while keeping fixed the values of U. Note that the objective function in (1) is composed of two terms to be minimized, and the second term does not depend on V. Therefore, the optimization problem posed in (1) can be, in this stage, reformulated as follows:
Due the fact that the columns of V are constrained to lie inside the unit ball, the update of V is performed in two consecutive steps. First, we apply a standard gradient updating rule as follows
where η is a parameter. Then, we use the projection operator \(\Uppi({\bf v})=\frac{{\bf v}}{max\{1,\Vert{\bf v}\Vert_{2}\}}\) in order to project the columns of \({{\bar{\bf{V}}}}\) on the unit ball in \({\mathbb{R}^{p}}\). Consequently, the update of V is computed as follows:
The overall algorithm of TSSM is reported in algorithm 2. Note that a fixed step gradient descent procedure was adopted with the two learning rate σ U and σ V chosen equal to the Lipschitz constant of ∇ f(u) and ∇ f(v), respectively.
Rights and permissions
About this article
Cite this article
Tessitore, G., Sinigaglia, C. & Prevete, R. Hierarchical and multiple hand action representation using temporal postural synergies. Exp Brain Res 225, 11–36 (2013). https://doi.org/10.1007/s00221-012-3344-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00221-012-3344-9