Abstract
The European Parliament recently proposed to grant the personhood of autonomous AI, which raises fundamental questions concerning the ethical nature of AI. Can they be moral agents? Can they be morally responsible for actions and their consequences? Here we address these questions, focusing upon, inter alia, the possibilities of moral agency and moral responsibility in artificial general intelligence; moral agency is a precondition for moral responsibility (which is, in turn, a precondition for legal punishment). In the first part of the paper we address the moral agency status of AI in light of traditional moral philosophy, especially Kant’s, Hume’s, and Strawson’s, and clarify the possibility of Moral AI (i.e., AI with moral agency) by discussing the Ethical Turing Test, the Moral Chinese Room Argument, and Weak and Strong Moral AI. In the second part we address the moral responsibility status of AI, and thereby clarify the possibility of Responsible AI (i.e., AI with moral responsibility). These issues would be crucial for AI-pervasive technosociety in the (possibly near) future, especially for post-human society after the development of artificial general intelligence.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Caliskan, A., et al.: Semantics derived automatically from language corpora contain human-like biases. Science 356, 183–186 (2017)
Cole, D.: The Chinese room argument. In: The Stanford Encyclopedia of Philosophy (2009)
Denis, L.: Kant and Hume on morality. In: Stanford Encyclopedia of Philosophy (2008)
European Parliament report with recommendations to the Commission on Civil Law Rules on Robotics (2015/2103(INL))
Floridi, L., Sanders, J.: On the morality of artificial agents. Minds Mach. 14, 349–379 (2004)
Götz, M.J., et al.: Criminality and antisocial behaviour in unselected men with sex chromosome abnormalities. Psychol. Med. 29, 953–962 (1999)
Sheppard, B.H., et al.: Organizational Justice. Macmillan, Basingstoke (1992)
Hume, D.: A Treatise of Human Nature, pp. 1739–1740 (2003)
Hume, D.: An Enquiry Concerning the Principles of Morals. A. Millar (1751)
Kant, I.: Groundwork for the Metaphysics of Morals, James W. Ellington (trans.). Hackett Publishing Company, Indianapolis (1785)
Kant, I.: Critique of Pure Reason, trans. Paul Guyer and Allen Wood. Cambridge University Press, Cambridge (1998)
Kauppinen, A.: Moral sentimentalism. In: Stanford Encyclopedia of Philosophy (2014)
Maruyama, Y.: Reasoning about fuzzy belief and common belief: with emphasis on incomparable beliefs. In: Proceedings of IJCAI 2011, pp. 1008–1013 (2011)
Maruyama, Y.: Dualities for algebras of Fitting’s many-valued modal logics. Fundamenta Informaticae 106, 273–294 (2011)
Maruyama, Y.: From operational chu duality to coalgebraic quantum symmetry. In: Heckel, R., Milius, S. (eds.) CALCO 2013. LNCS, vol. 8089, pp. 220–235. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-40206-7_17
Maruyama, Y.: Full lambek hyperdoctrine: categorical semantics for first-order substructural logics. In: Libkin, L., Kohlenbach, U., de Queiroz, R. (eds.) WoLLIC 2013. LNCS, vol. 8071, pp. 211–225. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-39992-3_19
Maruyama, Y.: Duality theory and categorical universal logic: with emphasis on quantum structures. In: Proceedings of the Tenth Quantum Physics and Logic Conference, EPTCS, vol. 171, pp. 100–112 (2014)
Maruyama, Y.: Prior’s tonk, notions of logic, and levels of inconsistency: vindicating the pluralistic unity of science in the light of categorical logical positivism. Synthese 193, 3483–3495 (2016)
Maruyama, Y.: AI, quantum information, and external semantic realism: Searle’s observer-relativity and Chinese room, revisited. Fund. Issues Artif. Intell. Synth. Libr. 376, 115–127 (2016)
Maruyama, Y.: Meaning and duality: from categorical logic to quantum physics. DPhil Thesis, University of Oxford (2017)
Maruyama, Y.: Quantum pancomputationalism and statistical data science: from symbolic to statistical AI, and to quantum AI. In: Müller, V.C. (ed.) PT-AI 2017. SAPERE, vol. 44, pp. 207–211. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-96448-5_20
Maruyama, Y.: Compositionality and contextuality: the symbolic and statistical theories of meaning. In: Bella, G., Bouquet, P. (eds.) CONTEXT 2019. LNCS (LNAI), vol. 11939, pp. 161–174. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-34974-5_14
Maruyama, Y.: The conditions of artificial general intelligence: logic, autonomy, resilience, integrity, morality, emotion, embodiment, and embeddedness. In: Goertzel, B., Panov, A.I., Potapov, A., Yampolskiy, R. (eds.) AGI 2020. LNCS (LNAI), vol. 12177, pp. 242–251. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-52152-3_25
Maruyama, Y.: The categorical integration of symbolic and statistical AI: quantum NLP and applications to cognitive and machine bias problems. In: Abraham, A., Siarry, P., Ma, K., Kaklauskas, A. (eds.) ISDA 2019. AISC, vol. 1181, pp. 466–476. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-49342-4_45
Maruyama, Y.: Post-truth AI and big data epistemology: from the genealogy of artificial intelligence to the nature of data science as a new kind of science. In: Abraham, A., Siarry, P., Ma, K., Kaklauskas, A. (eds.) ISDA 2019. AISC, vol. 1181, pp. 540–549. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-49342-4_52
Maruyama, Y.: Quantum physics and cognitive science from a Wittgensteinian perspective: Bohr’s Classicism, Chomsky’s Universalism, and Bell’s Contextualism. In: Wuppuluri, S., da Costa, N. (eds.) WITTGENSTEINIAN (adj.). TFC, pp. 375–407. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-27569-3_20
Maruyama, Y.: Symbolic and statistical theories of cognition: towards integrated artificial intelligence. In: Cleophas, L., Massink, M. (eds.) SEFM 2020. LNCS, vol. 12524, pp. 129–146. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-67220-1_11
Maruyama, Y.: Learning, development, and emergence of compositionality in natural language processing. In: Proceedings of IEEE ICDL, pp. 1–7 (2021)
Maruyama, Y.: Fibred algebraic semantics for a variety of non-classical first-order logics and topological logical translation. J. Symb. Logic 86, 1–27 (2021)
McLear, C.K: Philosophy of mind. In: Internet Encyclopedia of Philosophy
Noorman, M.: Computing and moral responsibility. In: Stanford Encyclopedia of Philosophy (2018)
Searle, J.R.: Minds, brains, and programs. Behav. Brain Sci. 3, 417–457 (1980)
Searle, J.R.: Intentionality: an essay in the philosophy of mind. In: CUP (1983)
Searle, J.R.: The Rediscovery of the Mind. MIT Press, Cambridge (1992)
Siewert, C.: Consciousness and intentionality. In: Stanford Encyclopedia of Philosophy (2016)
Smith, A.: Theory of Moral Sentiments, A. Millar (1761)
Strawson, P.F.: Freedom and resentment. Proc. Brit. Acad. 48, 1–25 (1962)
Khoury, A.C.: Moral responsibility. J. Value Inquiry 48(4), 573–575 (2014). https://doi.org/10.1007/s10790-014-9457-6
Turney, P., Pantel, P.: From frequency to meaning: vector space models of semantics. J. Artif. Intell. Res. 37, 141–188 (2010)
Williams, G.: Responsibility. In: Internet Encyclopedia of Philosophy (1995)
Acknowledgements
This work was supported by the Moonshot R&D Programme (JST; JPMJMS2033). Special thanks to Professor Seth Lazar for his mentoring and guidance and for leading the Humanising Machine Intelligence project.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Maruyama, Y. (2022). Moral Philosophy of Artificial General Intelligence: Agency and Responsibility. In: Goertzel, B., Iklé, M., Potapov, A. (eds) Artificial General Intelligence. AGI 2021. Lecture Notes in Computer Science(), vol 13154. Springer, Cham. https://doi.org/10.1007/978-3-030-93758-4_15
Download citation
DOI: https://doi.org/10.1007/978-3-030-93758-4_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-93757-7
Online ISBN: 978-3-030-93758-4
eBook Packages: Computer ScienceComputer Science (R0)