Friendly Artificial Intelligence

  • Eliezer YudkowskyEmail author
Part of the The Frontiers Collection book series (FRONTCOLL)


By far the greatest danger of Artificial Intelligence is that people conclude too early that they understand it. Of course this problem is not limited to the field of AI. Jacques Monod wrote: “A curious aspect of the theory of evolution is that everybody thinks he understands it”. Nonetheless the problem seems to be unusually acute in Artificial Intelligence.


Technical Failure Human Female Chess Player Computer Chip Empirical Belief 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. Barrett, J. L., & Keil, F. (1996). Conceptualizing a non-natural entity: anthropomorphism in god concepts. Cognitive Psychology, 31, 219–247.CrossRefGoogle Scholar
  2. Bostrom, N. (2001). Existential risks: analyzing human extinction scenarios. Journal of Evolution and Technology, 9.Google Scholar
  3. Brown, D. E. (1991). Human universals. New York: McGraw-Hill.Google Scholar
  4. Crochat, P., & Franklin, D. (2000). Back-propagation neural network tutorial.
  5. Ekman, P., & Keltner, D. (1997). Universal facial expressions of emotion: An old controversy and new findings. In U. Segerstrale & P. Molnar (Eds.), Nonverbal communication: Where nature meets culture. Mahwah: Lawrence Erlbaum Associates.Google Scholar
  6. Hibbard, B. (2001). Super-intelligent machines. ACM SIGGRAPH Computer Graphics, 35(1)  .Google Scholar
  7. Hibbard, B. (2004). Reinforcement learning as a context for integrating AI research. Presented at the 2004 AAAI Fall Symposium on Achieving Human-Level Intelligence through Integrated Systems and Research.Google Scholar
  8. Jaynes, E. T., & Bretthorst, G. L. (2003). Probability theory: The logic of science. Cambridge: Cambridge University Press.zbMATHCrossRefGoogle Scholar
  9. Monod, J. L. (1974). On the molecular theory of evolution. Oxford: New York.Google Scholar
  10. Raymond, E. S. ed. (2003). DWIM. The on-line hacker Jargon File, version 4.4.7, 29 Dec 2003.Google Scholar
  11. Rice, H. G. (1953). Classes of recursively enumerable sets and their decision problems. Transactions of the American Mathematical Society, 74, 358–366.MathSciNetzbMATHCrossRefGoogle Scholar
  12. Schmidhuber, J. (2003). Goedel machines: self-referential universal problem solvers making provably optimal self-improvements. In B. Goertzel & C. Pennachin (Eds), Artificial general intelligence. Forthcoming. New York: Springer.Google Scholar
  13. Sober, E. (1984). The nature of selection. Cambridge: MIT Press.Google Scholar
  14. Tooby, J., & Cosmides, L. (1992). The psychological foundations of culture. In J. H. Barkow, L. Cosmides, & J. Tooby (Eds.), The adapted mind: Evolutionary psychology and the generation of culture. New York: Oxford University Press.Google Scholar
  15. Wachowski, A., & Wachowski, L. (1999). The Matrix, USA, Warner Bros, 135 min.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  1. 1.Machine Intelligence Research InstituteSan FranciscoUSA

Personalised recommendations