The synthetization of human voices

Bendel, Oliver

doi:10.1007/s00146-017-0748-x

The synthetization of human voices

Original Article
Published: 26 July 2017

Volume 34, pages 83–89, (2019)
Cite this article

AI & SOCIETY Aims and scope Submit manuscript

Oliver Bendel¹

1574 Accesses
14 Citations
1 Altmetric
Explore all metrics

Abstract

The synthetization of voices, or speech synthesis, has been an object of interest for centuries. It is mostly realized with a text-to-speech system, an automaton that interprets and reads aloud. This system refers to text available for instance on a website or in a book, or entered via popup menu on the website. Today, just a few minutes of samples are enough to be able to imitate a speaker convincingly in all kinds of statements. This article abstracts from actual products and actual technological realization. Rather, after a short historical outline of the synthetization of voices, exemplary applications of this kind of technology are gathered for promoting the development, and potential applications are discussed critically to be able to limit them if necessary. The ethical and legal challenges should not be underestimated, in particular with regard to informational and personal autonomy and the trustworthiness of media.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Anderson M, Anderson SL (eds) (2011) Machine ethics. Cambridge University Press, Cambridge
Google Scholar
Bendel O (2012) Maschinenethik. Gabler Wirtschaftslexikon. Springer Gabler, Wiesbaden. http://wirtschaftslexikon.gabler.de/Definition/maschinenethik.html
Bendel O (2015) Surgical, Therapeutic, nursing and sex robots in machine and information ethics. In: Rysewyk SPV, Pontier M (eds) Machine medical ethics. Series: intelligent systems, control and automation: science and engineering. Springer, Berlin, pp 17–32
Bendel O (2016a) Cloud Computing aus Sicht von Verbraucherschutz und Informationsethik. In: Reinheimer S (ed) HMD—Praxis der Wirtschaftsinformatik, 25 July 2016 (“online first” article on SpringerLink)
Bendel O (2016b) 300 Keywords Informationsethik: Grundwissen aus Computer-, Netz- und Neue-Medien-Ethik sowie Maschinenethik. Springer Gabler, Wiesbaden
Book Google Scholar
Bendel O (2017a) Towards Kant machines. In: The 2017 AAAI spring symposium series. AAAI Press, Palo Alto
Bendel O (2017b) Sex robots from the perspective of machine ethics. In: Cheok AD, Devlin K, Levy D (eds) Love and sex with robots. second international conference, LSR 2016, London, UK, December 19–20, 2016, Revised Selected Papers. Springer International Publishing, Cham, pp 1–10
Bendel O, Gerhard M (2004) Handy-Avatare—Möglichkeiten der mobilen Kommunikationsunterstützung. InfoWeek.ch, 12, pp 51–55
Bendel O, Schwegler K, Richards B (2016) The LIEBOT project. In: Machine ethics and machine law, Jagiellonian University. November 18–19, 2016, Cracow, Poland. E-Proceedings. Jagiellonian University, Cracow. http://machinelaw.philosophyinscience.com/technical-program/
Beuth P (2016) Tonaufnahmenfälschen leicht gemacht. ZEIT ONLINE, 4 November 2016. http://www.zeit.de/digital/internet/2016-11/adobe-project-voco-photoshop-audio-manipulation
Grimm J (1808) Entstehung der Verlagspoesie. Arnim LAv (ed) Zeitung für Einsiedler
Ingruber D, Prutsch U (2007) Imágenes—Bilder und Filme aus Lateinamerika. LIT, Münster
Google Scholar
Kempelen WV (1791) Mechanismus der menschlichen Sprache nebst der Beschreibung seiner sprechenden Maschine. J. B. Degen, Wien
Klatt D (1987) Review of text-to-speech conversion for English. J. Acous. Soc. Amer. 82:737–793
Article Google Scholar
Lenke M (2015) Nutzerprofile nach dem Tod: So regeln Sie Ihren digitalen Nachlass. Focus Online, 12 February 2015. http://www.focus.de/digital/internet/sterben-2-0-virtuelle-grabpflege-so-regeln-sie-ihren-digitalen-nachlass_id_4224951.html
Lüpke MV (2014) Als die Fotos lügen lernten. Spiegel Online (Eines Tages), 13 October 2014. http://www.spiegel.de/einestages/bildmanipulation-falsche-fotos-vor-der-digital-aera-a-996453.html
Nagels P (2016) Wie eine Russin ihren toten Freund zum Leben erweckt. welt.de, 7 October 2016. https://www.welt.de/kmpkt/article158616017/Wie-eine-Russin-ihren-toten-Freund-zum-Leben-erweckt.html
Plass-Fleßenkämper B (2016) Dank Adobe können wir unseren Ohren nicht mehr trauen. WIRED Germany, 8 November 2016. https://www.wired.de/collection/tech/adobes-neues-tool-kann-sprache-imitieren
Schulz TM, Whitehead H, Gero S (2011) Individual vocal production in a sperm whale (Physeter macrocephalus) social unit. MARINE MAMMAL SCIENCE, 27(1), January 2011, pp 149–166
Stark J (2016) Adobe stellt Sprach-Software Voco vor. com! professional, 9 November 2016. http://www.com-magazin.de/news/adobe-systems/adobe-stellt-sprach-software-voco-1146967.html
Steinacker L (2017) Wirtschaftwoche, 20 January 2017. Mit falscher Stimme, p 52
Thies J, Zollhöfer M, Stamminger M, Theobalt C, Nießner M (2016) Face2Face: real-time face capture and reenactment of RGB videos. In: Proceedings computer vision and pattern recognition (CVPR), IEEE. http://www.graphics.stanford.edu/~niessner/papers/2016/1facetoface/thies2016face.pdf
Vincent J (2017) Lyrebird claims it can recreate any voice using just one minute of sample audio. The Verge, 24 April 2017. http://www.theverge.com/2017/4/24/15406882/ai-voice-synthesis-copy-human-speech-lyrebird
Wallach W, Allen C (2009) Moral machines: teaching robots right from wrong. Oxford University Press, Oxford
Book Google Scholar

Download references

Author information

Authors and Affiliations

School of Business, University of Applied Sciences and Arts Northwestern Switzerland, Bahnhofstrasse 6, 5210, Windisch, Switzerland
Oliver Bendel

Authors

Oliver Bendel
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Oliver Bendel.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bendel, O. The synthetization of human voices. AI & Soc 34, 83–89 (2019). https://doi.org/10.1007/s00146-017-0748-x

Download citation

Received: 16 April 2017
Accepted: 18 July 2017
Published: 26 July 2017
Issue Date: 14 March 2019
DOI: https://doi.org/10.1007/s00146-017-0748-x

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

The synthetization of human voices

Abstract

Access this article

Similar content being viewed by others

Speech Synthesis and Uncanny Valley

Speech Synthesis: Text-To-Speech Conversion and Artificial Voices

Speech Synthesis: Text-To-Speech Conversion and Artificial Voices

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

The synthetization of human voices

Abstract

Access this article

Similar content being viewed by others

Speech Synthesis and Uncanny Valley

Speech Synthesis: Text-To-Speech Conversion and Artificial Voices

Speech Synthesis: Text-To-Speech Conversion and Artificial Voices

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation