Cognitive hierarchies in adaptive play

Khan, Abhimanyu; Peeters, Ronald

doi:10.1007/s00182-014-0410-5


Published: 31 January 2014

Volume 43, pages 903–924 (2014)

International Journal of Game Theory

Abhimanyu Khan¹ &
Ronald Peeters¹


Abstract

Inspired by the behavior in repeated guessing game experiments, we study adaptive play by populations containing individuals that reason with different levels of cognition. Individuals play a higher order best response to samples from the empirical data on the history of play, where the order of best response is determined by their exogenously given level of cognition. As in Young’s model of adaptive play, (unperturbed) play still converges to a minimal curb set. Random perturbations of the best response dynamic identify the stochastically stable states. In Young’s model of adaptive play with simple best-responses, the set of stochastically stable states is sensitive to the sample size that individuals from a population can draw. In generic games with higher order best-responders in both populations, the sample size is rendered irrelevant in determining the stochastically stable set. Perhaps counter-intuitively, higher cognition may actually be bad for both the individual with higher cognition and his parent population.
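To make the notion of a higher order best response concrete, the following is a minimal sketch and not the authors' formal model. It assumes a two-population game in which a level-1 individual best responds to a sample of the rival population's past play, and a level-\(k\) individual iterates best responses \(k\) times, starting from a sample of the rival's history when \(k\) is odd and of his own population's history when \(k\) is even. The function names, the payoff layout `payoffs[i][a][b]`, and the uniform tie-breaking are illustrative assumptions; the text only requires each best response to be chosen with positive probability.

```python
import random
from collections import Counter


def best_responses(payoff, belief):
    """All actions maximizing expected payoff against `belief`.

    payoff[a][b]: payoff from own action a against rival action b.
    belief: dict mapping rival action -> probability.
    """
    expected = [sum(p * row[b] for b, p in belief.items()) for row in payoff]
    top = max(expected)
    return [a for a, v in enumerate(expected) if abs(v - top) < 1e-12]


def empirical_belief(sample):
    """Empirical distribution of the actions in a sample."""
    counts = Counter(sample)
    return {a: c / len(sample) for a, c in counts.items()}


def level_k_action(k, payoffs, me, histories, sample_size, rng):
    """Action of a level-k individual in population `me` (0 or 1).

    Assumption (hypothetical reading of the setup): the individual knows
    both payoff matrices and samples without replacement from the history.
    """
    # Odd k: start from the rival's history; even k: from one's own.
    start = 1 - me if k % 2 == 1 else me
    belief = empirical_belief(rng.sample(histories[start], sample_size))
    responder = 1 - start
    action = None
    for _ in range(k):
        # Ties are broken uniformly here; any full-support rule would do.
        action = rng.choice(best_responses(payoffs[responder], belief))
        belief = {action: 1.0}  # next step best responds to this action
        responder = 1 - responder
    return action


# Usage: a 2x2 coordination game, identical payoffs for both populations.
coordination = [[2, 0], [0, 1]]
payoffs = [coordination, coordination]
histories = [[0, 0, 0], [1, 1, 1]]  # recent play of populations 0 and 1

level1 = level_k_action(1, payoffs, 0, histories, 2, random.Random(0))
level2 = level_k_action(2, payoffs, 0, histories, 2, random.Random(0))
```

In this example the level-1 individual of population 0 matches the rival's past action, while the level-2 individual anticipates that a level-1 rival would match population 0's own past action and responds to that instead, so the two cognitive types choose different actions from the same history.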


Notes

Relatedly, Stahl (1993) develops a hierarchical model of “smartness” based on rationalizability and argues that while “smartness” may evolve over time, all levels of “smartness” would continue to be represented in the population. Our framework also allows for evolution of cognitive hierarchies.
The minimal curb set is, loosely speaking, a set which contains all its best responses and there is no proper subset contained in it with the same property. For a more precise definition, see Sect. 2.
Mohlin (2012) shows that it is possible for evolutionary learning processes to converge to a state where different cognitive types co-exist.
For the purpose of nomenclature, we retain the associated terminology of the level-\(k\) model, even though we step outside the boundaries of it. Our focus is the long-run behavior, whereas the level-\(k\) model is meant for the purpose of explaining initial play.
Although in the Nash bargaining game a higher sample size confers a benefit to the population, depending on the payoff structure it can be a bane as well—see Sect. 6 for more details.
This effect has also been shown by Sáez-Martí and Weibull (1999) when allowing for “clever” individuals (that is, of level-2) in only one population.
We refer here to the same generic class of games as referred to in Young (1998, p. 111). So, a property holds generically for a class of games if it holds for an open dense subset of that class (according to the Lebesgue measure on the Euclidean space specifying the payoffs while fixing the number of players and the number of actions they can choose from). Next, if we have a property that is generic for a class of games, we call the games in the subset for which this property holds generic.
Matros (2003) shows this for the situation where only one population has level-2 individuals and both populations have the same sample size, in which case, the presence of the level-2 individuals does not make a difference to the stochastically stable outcome. Our more general result allows for even higher levels of cognition, populations to have unequal sample sizes, and both populations to host individuals of level-2 or higher.
We do not require the probability of being selected to be equal for each individual in a population.
In cases of multiple best responses, we always assume each best response to be chosen with positive probability, not necessarily with equal chance.
In order to do so, it is necessary that the \(L2\) individual possesses knowledge of the utility function of the rival population. In Sect. 5 we are going to relax this assumption.
In Sect. 5 we replace this assumption with an alternative one.
The situation where one population contains a share of \(L2\) individuals and both populations have an equal sample size has been captured in Matros (2003).
Note that we do not explicitly require all these types actually to be contained in the rival population.
The \(L^{*}k\) individual, therefore, does not choose a best response to a distribution of types, but rather, after considering the best response to each lower cognitive type, places point mass belief on one such lower cognitive type. We remain agnostic about the process by which the particular lower cognitive type is chosen, but only require that the probability of each lower cognitive type being assigned point mass belief be strictly positive.
Examples outside this class for which the propositions below do not hold are easily constructed.
Possible explanations of such a systematic behavioral trait include the ‘false consensus effect’ (Ross et al. 1977) and self-projection (Buckner and Carroll 2007).
Even though we assume that an \(L^{\prime }k\) individual projects his own utility function onto the rival population, the general message of this section—that is, play converges to a minimal curb set but the set of stochastically stable states may differ—is valid even if an \(L^{\prime }k\) individual evaluates the rival’s preferences with some other cardinal utility function (under the proviso that the ordinal preferences are identical). The reason for more explicitly dealing with self-projection of utility function is that under a situation of identical ordinal preferences, it might be more reasonable to attribute one’s own preference onto another rather than to use some other arbitrary utility function to do so.

References

Basu K, Weibull J (1991) Strategy subsets closed under rational behavior. Econ Lett 36(2):141–146
Binmore K (1987) Modeling rational players I. Econ Philos 3:179–214
Binmore K (1988) Modeling rational players II. Econ Philos 4:9–55
Buckner RL, Carroll DC (2007) Self-projection and the brain. Trends Cogn Sci 11(2):49–57
Camerer CF, Ho T-H, Chong J-K (2004) A cognitive hierarchy model of games. Q J Econ 119(3):861–898
Coricelli G, Nagel R (2009) Neural correlates of depth of strategic reasoning in medial prefrontal cortex. Proc Natl Acad Sci USA 106(23):9163–9168
Crawford VP, Iriberri N (2007) Fatal attraction: salience, naivete, and sophistication in experimental “hide-and-seek” games. Am Econ Rev 97(5):1731–1750
Hurkens S (1995) Learning by forgetful players. Games Econ Behav 11(2):304–329
Matros A (2003) Clever agents in adaptive learning. J Econ Theory 111(1):110–124
Mohlin E (2012) Evolution of theories of mind. Games Econ Behav 75(1):299–312
Nagel R (1995) Unraveling in guessing games: an experimental study. Am Econ Rev 85(5):1313–1326
Ross L, Greene D, House P (1977) The ‘false consensus effect’: an egocentric bias in social perception and attribution processes. J Exp Soc Psychol 13(3):279–301
Sáez-Martí M, Weibull J (1999) Clever agents in Young’s evolutionary bargaining model. J Econ Theory 86(2):268–279
Stahl DO (1993) Evolution of \(smart_{n}\) players. Games Econ Behav 5(4):604–617
Stahl DO, Wilson PW (1994) Experimental evidence on players’ models of other players. J Econ Behav Organ 25(3):309–327
Stahl DO, Wilson PW (1995) On players’ models of other players: theory and experimental evidence. Games Econ Behav 10(1):218–254
Wang JT, Spezio M, Camerer CF (2010) Pinocchio’s pupil: using eyetracking and pupil dilation to understand truth telling and deception in sender–receiver games. Am Econ Rev 100(3):984–1007
Young HP (1993a) The evolution of conventions. Econometrica 61(1):57–84
Young HP (1993b) An evolutionary model of bargaining. J Econ Theory 59(1):145–168
Young HP (1998) Individual strategy and social structure: an evolutionary theory of institutions. Princeton University Press, Princeton


Acknowledgments

We thank Jean-Jacques Herings and David Levine for very helpful comments and suggestions. Financial support by the Netherlands Organisation for Scientific Research (NWO) is gratefully acknowledged.

Author information

Authors and Affiliations

Department of Economics, Maastricht University, P.O. Box 616, 6200 MD Maastricht, The Netherlands
Abhimanyu Khan & Ronald Peeters


Corresponding author

Correspondence to Ronald Peeters.


About this article

Cite this article

Khan, A., Peeters, R. Cognitive hierarchies in adaptive play. Int J Game Theory 43, 903–924 (2014). https://doi.org/10.1007/s00182-014-0410-5


Received: 11 July 2013
Accepted: 02 January 2014
Published: 31 January 2014
Issue Date: November 2014
DOI: https://doi.org/10.1007/s00182-014-0410-5

