MAGEFACE: Performative Conversion of Facial Characteristics into Speech Synthesis Parameters

d’Alessandro, Nicolas; Astrinaki, Maria; Dutoit, Thierry

doi:10.1007/978-3-319-03892-6_21

Nicolas d’Alessandro¹⁶,
Maria Astrinaki¹⁶ &
Thierry Dutoit¹⁶

Part of the book series: Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering ((LNICST,volume 124))

Included in the following conference series:

International Conference on Intelligent Technologies for Interactive Entertainment

578 Accesses

Abstract

In this paper, we illustrate the use of the MAGE performative speech synthesizer through its application to the conversion of realtime-measured facial features with FaceOSC into speech synthesis features such as vocal tract shape or intonation. MAGE is a new software library for using HMM-based speech synthesis in reactive programming environments. MAGE uses a rewritten version of the HTS engine enabling the computation of speech audio samples on a two-label window instead of the whole sentence. Only this feature enables the realtime mapping of facial attributes to synthesis parameters.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 49.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

d’Alessandro, N., Dutoit, T.: HandSketch Bi-Manual Controller: Investigation on Expressive Control Issues of an Augmented Tablet. In: Proc. International Conference on New Interfaces for Musical Expression, pp. 78–81 (2007)
Google Scholar
MAGE and Face Tracking, https://vimeo.com/39567236
MAGE and HandSketch, https://vimeo.com/39558917
MAGE website, http://mage.numediart.org
Nordstorm, K., et al.: Developing Vowels Mappings for an Interactive Voice Synthesis System Controlled by Hand Motions. Journal of the Acoustical Society of America 127, 2021 (2010)
Article Google Scholar
Zen, H., Tokuda, K., Black, A.: Statistical Parametric Speech Synthesis. Speech Communications 51(11), 1039–1064 (2009)
Article Google Scholar
FaceOSC, https://vimeo.com/26098366

Download references

Author information

Authors and Affiliations

Institute for New Media Art Technology, Signal Processing Laboratory, University of Mons, 31 Boulevard Dolez, B-7000, Mons, Belgium
Nicolas d’Alessandro, Maria Astrinaki & Thierry Dutoit

Authors

Nicolas d’Alessandro
View author publications
You can also search for this author in PubMed Google Scholar
Maria Astrinaki
View author publications
You can also search for this author in PubMed Google Scholar
Thierry Dutoit
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Engineering Faculty, University of Mons (UMONS), IT Research Unit 20, Place du Parc, 7000, Mons, Belgium
Matei Mancas , Nicolas d’ Alessandro , Xavier Siebert , Bernard Gosselin , Carlos Valderrama & Thierry Dutoit , , , , &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

d’Alessandro, N., Astrinaki, M., Dutoit, T. (2013). MAGEFACE: Performative Conversion of Facial Characteristics into Speech Synthesis Parameters. In: Mancas, M., d’ Alessandro, N., Siebert, X., Gosselin, B., Valderrama, C., Dutoit, T. (eds) Intelligent Technologies for Interactive Entertainment. INTETAIN 2013. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 124. Springer, Cham. https://doi.org/10.1007/978-3-319-03892-6_21

Download citation

DOI: https://doi.org/10.1007/978-3-319-03892-6_21
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-03891-9
Online ISBN: 978-3-319-03892-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics