Volume 34, Issue 1, pp 83–89

The synthetization of human voices

  • Oliver Bendel
Original Article


The synthetization of voices, or speech synthesis, has been an object of interest for centuries. It is mostly realized with a text-to-speech system, an automaton that interprets text and reads it aloud. Such a system draws on text that is available, for instance, on a website or in a book, or that is entered via a pop-up menu on the website. Today, just a few minutes of samples are enough to imitate a speaker convincingly in all kinds of statements. This article abstracts from actual products and their technological realization. Instead, after a short historical outline of the synthetization of voices, exemplary applications of this technology are gathered in order to promote their development, and potential applications are discussed critically so that they can be limited if necessary. The ethical and legal challenges should not be underestimated, in particular with regard to informational and personal autonomy and the trustworthiness of media.


Speech synthesis, Text-to-speech system, Artificial intelligence, Robotics, Information ethics, Machine ethics



Copyright information

© Springer-Verlag London Ltd. 2017

Authors and Affiliations

  1. School of Business, University of Applied Sciences and Arts Northwestern Switzerland, Windisch, Switzerland
