Abstract
How to evaluate Artificial General Intelligence (AGI) is a critical problem that is discussed and unsolved for a long period. In the research of narrow AI, this seems not a severe problem, since researchers in that field focus on some specific problems as well as one or some aspects of cognition, and the criteria for evaluation are explicitly defined. By contrast, an AGI agent should solve problems that are never-encountered by both agents and developers. However, once a developer tests and debugs the agent with a problem, the never-encountered problem becomes the encountered problem, as a result, the problem is solved by the developers to some extent, exploiting their experience, rather than the agents. This conflict, as we call the trap of developers’ experience, leads to that this kind of problems is probably hard to become an acknowledged criterion. In this paper, we propose an evaluation method named Artificial Open World, aiming to jump out of the trap. The intuition is that most of the experience in the actual world should not be necessary to be applied to the artificial world, and the world should be open in some sense, such that developers are unable to perceive the world and solve problems by themselves before testing, though after that they are allowed to check all the data. The world is generated in a similar way as the actual world, and a general form of problems is proposed. A metric is proposed aiming to quantify the progress of research. This paper describes the conceptual design of the Artificial Open World, though the formalization and the implementation are left to the future.
Keywords
- Evaluation
- Artificial Open World
- Artificial General Intelligence
This is a preview of subscription content, access via your institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Adams, S.S., Banavar, G., Campbell, M.: I-athlon: towards a multidimensional turing test. AI Mag. 37(1), 78–84 (2016)
Campbell, M., Hoane Jr, A.J., Hsu, F.H.: Deep blue. Artif. Intell. 134(1–2), 57–83 (2002)
Chollet, F.: On the measure of intelligence. arXiv preprint arXiv:1911.01547 (2019)
Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255. IEEE (2009)
Genesereth, M., Björnsson, Y.: The international general game playing competition. AI Mag. 34(2), 107–107 (2013)
Goertzel, B.: Artificial general intelligence: concept, state of the art, and future prospects. J. Artif. Gen. Intell. 5(1), 1 (2014)
Goertzel, B., Bugaj, S.V.: AGI preschool: a framework for evaluating early-stage human-like AGIs. In: Proceedings of AGI, vol. 9, pp. 31–36 (2009)
Goertzel, B., Pennachin, C.: Artificial General Intelligence, vol. 2. Springer, Cham (2007). https://doi.org/10.1007/978-3-540-68677-4
Hart, D., Goertzel, B.: Opencog: a software framework for integrative artificial general intelligence. In: AGI, pp. 468–472 (2008)
Hofstadter, D.R.: Gdel, Escher, Bach: An Eternal Golden Braid. Basic Books, 20th anniversary (edn.) (1999)
Legg, S., Hutter, M., et al.: A collection of definitions of intelligence. Front. Artif. Intell. Appl. 157, 17 (2007)
Leibo, J.Z., et al.: Scalable evaluation of multi-agent reinforcement learning with melting pot. In: International Conference on Machine Learning, pp. 6187–6199. PMLR (2021)
Schrenk, M.: Metaphysics of Science: A Systematic and Historical Introduction. Routledge (2016)
Silver, D., et al.: Mastering the game of go without human knowledge. Nature 550(7676), 354–359 (2017)
Team, O.E.L., et al.: Open-ended learning leads to generally capable agents. arXiv preprint arXiv:2107.12808 (2021)
Wang, P.: Non-axiomatic reasoning system: exploring the essence of intelligence. Ph.D. thesis, Indiana University (1995)
Wang, P.: The evaluation of AGI systems. In: Proceedings of the Third Conference on Artificial General Intelligence, vol. 11, pp. 164–169. Citeseer (2010)
Wang, P.: Non-axiomatic Logic: A Model of Intelligent Reasoning. World Scientific (2013)
Wang, P.: On defining artificial intelligence. J. Artif. Gen. Intell. 11(2), 73–86 (2020)
Wang, P., Goertzel, B.: Theoretical Foundations of Artificial General Intelligence, vol. 4. Springer, Cham (2012). https://doi.org/10.2991/978-94-91216-62-6
Wang, P., Hammer, P.: Issues in temporal and causal inference. In: Bieger, J., Goertzel, B., Potapov, A. (eds.) AGI 2015. LNCS (LNAI), vol. 9205, pp. 208–217. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-21365-1_22
Wray, R., Lebiere, C.: Metrics for cognitive architecture evaluation. In: Proceedings of the AAAI-07 Workshop on Evaluating Architectures for Intelligence, pp. 60–66 (2007)
Xu, B., Zhan, X., Ren, Q.: The gap between intelligence and mind. In: Goertzel, B., Iklé, M., Potapov, A. (eds.) AGI 2021. LNCS (LNAI), vol. 13154, pp. 292–305. Springer, Cham (2022). https://doi.org/10.1007/978-3-030-93758-4_31
Contributions and Acknowledgements
Bowen Xu proposes the main idea and writes this paper; Quansheng Ren, who reviews and modifies the paper, points out the key idea that the complexity of the world stems from agents’ behaviors. We thank Pei Wang for sharing some pieces of literature on evaluating AGI. We thank those who review this paper. The work was sponsored by Zhejiang Lab (No. 2021RD0AB01).
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Xu, B., Ren, Q. (2023). Artificial Open World for Evaluating AGI: A Conceptual Design. In: Goertzel, B., Iklé, M., Potapov, A., Ponomaryov, D. (eds) Artificial General Intelligence. AGI 2022. Lecture Notes in Computer Science(), vol 13539. Springer, Cham. https://doi.org/10.1007/978-3-031-19907-3_43
Download citation
DOI: https://doi.org/10.1007/978-3-031-19907-3_43
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-19906-6
Online ISBN: 978-3-031-19907-3
eBook Packages: Computer ScienceComputer Science (R0)