Motion synthesis and editing for the generation of new sign language content

Building new signs with phonological recombination


Existing work on the animation of signing avatars often relies on purely procedural techniques or on the playback of Motion Capture (MoCap) data. While the first solution results in robotic and unnatural motions, the second one is very limited in the number of signs that it can produce. In this paper, we propose to implement data-driven motion synthesis techniques to increase the variety of Sign Language (SL) motions that can be made from a limited database. In order to generate new signs and inflection mechanisms based on an annotated French Sign Language MoCap corpus, we rely on phonological recombination, i.e., on the motion retrieval and modular reconstruction of SL content at a phonological level, with a particular focus on three phonological components of SL: hand placement, hand configuration and hand movement. We propose to modify the values taken by those components in different signs to create their inflected versions or completely new signs by (i) applying motion retrieval at a phonological level to exchange the value of one component without any modification, (ii) editing the retrieved data with different operators, or (iii) using conventional motion generation techniques such as interpolation or inverse kinematics, which are parameterized to comply with the kinematic properties of real motion observed in the data set. The quality of the synthesized motions is perceptually assessed through two distinct evaluations that involved 75 and 53 participants, respectively.
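
As an illustration of option (i), the sketch below exchanges one phonological component between two recorded signs by copying the joints of the corresponding channel from a donor motion into a target motion. It is a minimal sketch under stated assumptions, not the implementation used in this work: the frame representation, the joint names, the `CHANNELS` mapping, and the nearest-frame resampling used to align the two motions in time are all hypothetical choices made for the example.

```python
from typing import Dict, List, Tuple

# Hypothetical representation: a motion is a list of frames, and each frame
# maps a joint name to a rotation stored as a tuple of Euler angles.
Frame = Dict[str, Tuple[float, float, float]]
Motion = List[Frame]

# Hypothetical channels: the joint names below are illustrative only.
CHANNELS: Dict[str, List[str]] = {
    "hand_configuration": ["r_thumb", "r_index"],
    "hand_placement": ["r_shoulder", "r_elbow"],
}


def resample(motion: Motion, n_frames: int) -> Motion:
    """Nearest-frame resampling of a non-empty motion to n_frames frames."""
    if n_frames <= 1:
        return [dict(motion[0])]
    last = len(motion) - 1
    return [dict(motion[round(i * last / (n_frames - 1))])
            for i in range(n_frames)]


def swap_channel(target: Motion, donor: Motion, channel: str) -> Motion:
    """Return a copy of target whose `channel` joints come from donor."""
    aligned = resample(donor, len(target))
    result = [dict(frame) for frame in target]  # leave the target untouched
    for out_frame, donor_frame in zip(result, aligned):
        for joint in CHANNELS[channel]:
            out_frame[joint] = donor_frame[joint]
    return result
```

Calling `swap_channel(sign_a, sign_b, "hand_configuration")` would keep the placement of `sign_a` while taking the hand configuration of `sign_b`. Options (ii) and (iii) would then edit or regenerate the copied channel rather than reuse it unchanged.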



  1. We call a channel the set of joints corresponding to a phonological component.

  2. It can also be interesting to add noise to achieve a similar effect.

  3. Not to mention the overall attitude and facial expression, which are not part of this work.

  4. One of the 4 remaining ground truth videos was removed from the questionnaire beforehand and was not shown to the participants as it contained an artefact.

  5. We considered a difference significant for a p-value \(< 0.01\).

  6. A video showing our synthesis results is available at this address:


Naert, L., Larboulette, C. & Gibet, S. Motion synthesis and editing for the generation of new sign language content. Machine Translation (2021).

  • Sign language
  • Motion synthesis
  • Motion capture
  • Avatar
  • Phonological recombination