Abstract
The synthetization of voices, or speech synthesis, has been an object of interest for centuries. It is mostly realized with a text-to-speech system, an automaton that interprets text and reads it aloud. Such a system draws on text that is available, for instance, on a website or in a book, or that is entered via a pop-up menu on a website. Today, just a few minutes of samples are enough to imitate a speaker convincingly in all kinds of statements. This article abstracts from actual products and their technological realization. Instead, after a short historical outline of the synthetization of voices, it gathers exemplary applications of this kind of technology in order to promote development, and it critically discusses potential applications so that they can be limited if necessary. The ethical and legal challenges should not be underestimated, in particular with regard to informational and personal autonomy and the trustworthiness of media.
Cite this article
Bendel, O. The synthetization of human voices. AI & Soc 34, 83–89 (2019). https://doi.org/10.1007/s00146-017-0748-x
DOI: https://doi.org/10.1007/s00146-017-0748-x