Voice Sample Synthesis

Schroeter, Juergen; Conkie, Alistair

doi:10.1007/978-3-642-27733-7_6-3

Voice Sample Synthesis

Juergen Schroeter³ &
Alistair Conkie³

Living reference work entry
First Online: 01 January 2014

83 Accesses

Synonyms

Speech synthesis; Synthetic voice creation; Text to speech (TTS)

Definition

Over the last decade, speech synthesis, the technology that enables machines to talk to humans, has become so natural sounding that a naïve listener might assume that he/she is listening to a recording of a live human speaker. Speech synthesis is not new; indeed, it took several decades to arrive where it is today. Originally starting from the idea of using physics-based models of the vocal tract, it took many years of research to perfect the encapsulation of the acoustic properties of the vocal tract as a “black box,” using so-called formant synthesizers. Then, with the help of ever more powerful computing technology, it became viable to use snippets of recorded speech directly and glue them together to create new sentences in the form of concatenative synthesizers. Combining this idea with now available methods for fast search, potentially millions of choices are evaluated to find the optimal...

This is a preview of subscription content, log in via an institution.

References

J. Schroeter, Basic principles of speech synthesis, in Springer Handbook of Speech Processing and Communication, chap. 19, ed. by J. Benesty (Springer, Berlin, 2008)
Google Scholar
J.L. Bader, Presidents as pitchmen, and posthumous play-by-play, commentary. New York Times, 9 Aug 2001
Google Scholar
J. van Santen, R. Sproat, J. Olive, J. Hirschberg (eds.) Progress in Speech Synthesis, section III (Springer, New York, 1997)
Google Scholar
J.N. Holmes, Research report formant synthesizers: cascade or parallel? Speech Commun. 2 (4), 251–273 (1983)
Article Google Scholar
R. Sproat, (ed.), Multilingual Text-to-Speech Synthesis. The Bell Labs Approach (Kluwer Academic, Dordrecht, 1998)
Google Scholar
A. Hunt, A.W. Black, Unit selection in a concatenative speech synthesis system using a large speech database, in Proceedings of the ICASSP-96, Atlanta, 1996, pp. 373–376
Google Scholar
G.D. Forney, The viterbi algorithm. Proc. IEEE 61 (3), 268–278 (1973)
Article MathSciNet Google Scholar
T. Dutoit, Corpus-based speech synthesis, in Springer Handbook of Speech Processing and Communication, chap. 21, ed. by J. Benesty (Springer, Berlin, 2008)
Google Scholar
J. van Santen, Prosodic processing, in Springer Handbook of Speech Processing and Communication, chap. 23, ed. by J. Benesty (Springer, Berlin, 2008)
Google Scholar
E. Cosatto, H.P. Graf, J. Ostermann, J. Schroeter, From audio-only to audio and video text-to-speech. Acta Acust. 90, 1084–1095 (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

AT&T Labs Research, Room D163, 180 Park Ave., 07932, Florham Park, NJ, USA
Juergen Schroeter & Alistair Conkie

Authors

Juergen Schroeter
View author publications
You can also search for this author in PubMed Google Scholar
Alistair Conkie
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Juergen Schroeter .

Editor information

Editors and Affiliations

Center Biometrics Research & Security Nat'l. Lab of Pattern Recognition, Chinese Academy of Sciences, Beijing, China
Stan Z. Li
Department of Computer Science & Engineering, Michigan State University, East Lansing, Michigan, USA
Anil K. Jain

Rights and permissions

Reprints and permissions

Copyright information

About this entry

Cite this entry

Schroeter, J., Conkie, A. (2014). Voice Sample Synthesis. In: Li, S., Jain, A. (eds) Encyclopedia of Biometrics. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27733-7_6-3

Download citation

DOI: https://doi.org/10.1007/978-3-642-27733-7_6-3
Received: 17 April 2014
Accepted: 17 April 2014
Published: 15 May 2014
Publisher Name: Springer, Berlin, Heidelberg
Online ISBN: 978-3-642-27733-7
eBook Packages: Springer Reference Computer SciencesReference Module Computer Science and Engineering

Publish with us

Policies and ethics