Abstract
This paper explores the value of memory in decision making in dynamic environments. We examine the decision problem faced by an agent with bounded memory who receives a sequence of signals from a partially observable Markov decision process. We characterize environments in which the optimal memory consists of only two states. In addition, we show that the marginal value of additional memory states need not be positive and may even be negative in the absence of free disposal.
Notes
Unlike a decision maker with bounded recall (see, among others, Lehrer 1988; Aumann and Sorin 1989; or Alós-Ferrer and Shi 2012), who knows only a finite truncation of the history, a decision maker with bounded memory has a finite number of states that summarize all of her information. Such models have been studied extensively in repeated-game settings: Neyman (1985), Rubinstein (1986) and Kalai and Stanford (1988) are some of the early contributions to this literature, while Romero (2011), Compte and Postlewaite (2012a, b) and Monte (2012) are more recent. Closely related is the literature on “dynastic” games, as in Anderlini and Lagunoff (2005) and Anderlini et al. (2008).
Kalai and Solan (2003) also consider a model of dynamic decision making with bounded memory, but focus on the role and value of simplicity and randomization.
With discounting, the optimal bounded memory system will be somewhat present-biased, with distortions that depend on the decision maker’s initial prior. Lemma 1 of Kocer (2010) suggests, however, that discounting and the limit-of-means criterion are “close”: the payoff to the discounted-optimal memory system converges, as the discount rate goes to zero, to the payoff to the limit-of-means-optimal memory system.
Therefore, this decision problem is very different from a multi-armed bandit problem and departs from the optimal experimentation literature. See Kocer (2010) for a model of experimentation with bounded memory.
Recall that Bayesian updating in this environment is symmetric, with \(\rho _{t+1}^{H}(\rho )+\rho _{t+1}^{L}(1-\rho )=1\) for all \(\rho \in [0,1]\).
Quantifying the loss from bounded memory (relative to an unbounded Bayesian decision maker) is certainly a natural avenue for further inquiry. Such an attempt is complicated, however, by the difficulty of analytically characterizing the general solution to a partially observable Markov decision problem such as our own and is thus beyond the scope of the present work.
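The symmetry of Bayesian updating noted above, \(\rho _{t+1}^{H}(\rho )+\rho _{t+1}^{L}(1-\rho )=1\), can be checked numerically. The sketch below assumes one standard parameterization of the symmetric environment (not spelled out in this excerpt): the state switches each period with probability `alpha`, and signals match the current state with probability `gamma`.

```python
# Numerical check of the symmetry rho^H(rho) + rho^L(1 - rho) = 1 for the
# Bayesian filter of a symmetric two-state hidden Markov model.
# Assumed parameterization (not taken verbatim from the text): the state
# switches with probability alpha each period, and the signal matches the
# current state with probability gamma.

def predict(rho, alpha):
    """Prior probability of state H after one transition, given belief rho."""
    return (1 - alpha) * rho + alpha * (1 - rho)

def update_H(rho, alpha, gamma):
    """Posterior on H after observing a high signal."""
    m = predict(rho, alpha)
    return gamma * m / (gamma * m + (1 - gamma) * (1 - m))

def update_L(rho, alpha, gamma):
    """Posterior on H after observing a low signal."""
    m = predict(rho, alpha)
    return (1 - gamma) * m / ((1 - gamma) * m + gamma * (1 - m))

if __name__ == "__main__":
    alpha, gamma = 0.1, 0.8
    for rho in [0.0, 0.25, 0.5, 0.9, 1.0]:
        total = update_H(rho, alpha, gamma) + update_L(1 - rho, alpha, gamma)
        assert abs(total - 1.0) < 1e-12
```

The symmetry follows because relabeling states and signals maps a belief \(\rho \) into \(1-\rho \), so the high-signal update from \(\rho \) and the low-signal update from \(1-\rho \) are mirror images.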
References
Alós-Ferrer, C., Shi, F.: Imitation with asymmetric memory. Econ. Theory 49(1), 193–215 (2012)
Anderlini, L., Gerardi, D., Lagunoff, R.: A “super” folk theorem for dynastic repeated games. Econ. Theory 37(3), 357–394 (2008)
Anderlini, L., Lagunoff, R.: Communication in dynastic repeated games: ‘whitewashes’ and ‘coverups’. Econ. Theory 26(2), 265–299 (2005)
Aumann, R.J., Sorin, S.: Cooperation and bounded recall. Games Econ. Behav. 1(1), 5–39 (1989)
Compte, O., Postlewaite, A.: Belief formation. Unpublished manuscript, University of Pennsylvania (2012a)
Compte, O., Postlewaite, A.: Plausible cooperation. Unpublished manuscript, University of Pennsylvania (2012b)
Cover, T., Hellman, M.: On memory saved by randomization. Ann. Math. Stat. 42(3), 1075–1078 (1971)
De Grauwe, P.: Animal spirits and monetary policy. Econ. Theory 47(2–3), 423–457 (2011)
Güth, S., Ludwig, S.: How helpful is a long memory on financial markets? Econ. Theory 16(1), 107–134 (2000)
Hellman, M., Cover, T.: Learning with finite memory. Ann. Math. Stat. 41(3), 765–782 (1970)
Kalai, E., Solan, E.: Randomization and simplification in dynamic decision-making. J. Econ. Theory 111(2), 251–264 (2003)
Kalai, E., Stanford, W.: Finite rationality and interpersonal complexity in repeated games. Econometrica 56(2), 397–410 (1988)
Kaneko, M., Kline, J.J.: Partial memories, inductively derived views, and their interactions with behavior. Econ. Theory 53(1), 27–59 (2013)
Kocer, Y.: Endogenous learning with bounded memory. Unpublished manuscript, Princeton University (2010)
Lehrer, E.: Repeated games with stationary bounded recall strategies. J. Econ. Theory 46(1), 130–144 (1988)
Lipman, B.L.: Information processing and bounded rationality: a survey. Can. J. Econ. 28(1), 42–67 (1995)
Miller, D.A., Rozen, K.: Optimally empty promises and endogenous supervision. Unpublished manuscript, Yale University (2012)
Monte, D.: Learning with bounded memory in games. Unpublished manuscript, Sao Paulo School of Economics (2012)
Mullainathan, S.: A memory-based model of bounded rationality. Q. J. Econ. 117(3), 735–774 (2002)
Neyman, A.: Bounded complexity justifies cooperation in the finitely repeated prisoners’ dilemma. Econ. Lett. 19(3), 227–229 (1985)
Romero, J.: Finite automata in undiscounted repeated games with private monitoring. Unpublished manuscript, Purdue University (2011)
Rubinstein, A.: Finite automata play the repeated prisoner’s dilemma. J. Econ. Theory 39(1), 83–96 (1986)
Rubinstein, A.: Modeling Bounded Rationality. MIT Press, Cambridge (1998)
Stokey, N.L., Lucas, R.E. Jr.: Recursive Methods in Economic Dynamics. Harvard University Press, Cambridge (1989)
Wilson, A.: Bounded memory and biases in information processing. Unpublished manuscript, Harvard University (2004)
Additional information
This paper supersedes an earlier working paper circulated as “Learning in Hidden Markov Models with Bounded Memory.” We thank the editor, an anonymous referee, Dirk Bergemann, Dino Gerardi, Bernardo Guimaraes, Johannes Hörner, Abraham Neyman, and Ben Polak, as well as seminar participants at Yale University and Simon Fraser University for their helpful advice and comments.
Appendix
Proof of Theorem 2
Note that symmetry implies \(\mu _{(1,L)}=\mu _{(2,H)}\) and \(\mu _{(2,L)}=\mu _{(1,H)}\); therefore, the decision maker solves
We may write the steady-state condition in Eq. (3) for state \((2,H)\) as
where the second equality follows from symmetry. Recalling that \(\varphi _{1,1}^{H}=1-\varphi _{1,2}^{H}\), and that monotonicity implies \(\varphi _{2,1}^{H}=0\), this may be written as
Combining this expression with the observation from Eq. (2) that \(\mu _{(1,H)}=\frac{1}{2}-\mu _{(2,H)}\), we can then solve for \(\mu _{(2,H)}\). In particular, we must have
With this in hand, we may write the decision maker’s payoff as
Differentiating with respect to \(\varphi _{1,2} ^{H}\) yields
Since \(\alpha \in (0,\frac{1}{2})\) and \(\gamma \in (\frac{1}{2},1)\), this expression is strictly positive for all \(\varphi _{1,2}^{H}\in [0,1]\); therefore, the maximum is achieved when \(\varphi _{1,2}^{H}=1\), yielding a payoff of \(U_{2}(1)=1-2\gamma (1-\gamma )\). \(\square \)
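The closed-form payoff \(U_{2}(1)=1-2\gamma (1-\gamma )\) can be verified by direct computation on the joint Markov chain over (world state, memory state). The sketch below is a numerical check under an assumed timing (not stated in this excerpt): each period the world switches with probability `alpha`, a signal of accuracy `gamma` is drawn, the memory moves to the state indicated by the signal (since \(\varphi _{1,2}^{H}=1\)), and the per-period reward is \(\gamma \) when the occupied memory state matches the true state and \(1-\gamma \) otherwise.

```python
import numpy as np

# Sanity check of the two-state payoff U_2(1) = 1 - 2*gamma*(1 - gamma).
# Assumed timing (an assumption, not taken from the text): transition,
# then signal, then memory update, then reward.

def two_state_payoff(alpha, gamma):
    # Joint states (w, m) with w, m in {0, 1}; flat index = 2 * w + m.
    P = np.zeros((4, 4))
    for w in range(2):
        for m in range(2):
            for w2 in range(2):
                t = 1 - alpha if w2 == w else alpha      # world transition
                for m2 in range(2):
                    # Memory tracks the last signal, which matches the new
                    # world state with probability gamma.
                    q = gamma if m2 == w2 else 1 - gamma
                    P[2 * w + m, 2 * w2 + m2] = t * q
    # Stationary distribution by power iteration.
    pi = np.full(4, 0.25)
    for _ in range(10_000):
        pi = pi @ P
    reward = np.array([gamma if (i // 2) == (i % 2) else 1 - gamma
                       for i in range(4)])
    return float(pi @ reward)

if __name__ == "__main__":
    for alpha in (0.05, 0.2, 0.4):
        for gamma in (0.6, 0.8, 0.95):
            u = two_state_payoff(alpha, gamma)
            assert abs(u - (1 - 2 * gamma * (1 - gamma))) < 1e-9
```

Under this timing the memory matches the current state with probability \(\gamma \) each period, so the payoff \(\gamma ^{2}+(1-\gamma )^{2}=1-2\gamma (1-\gamma )\) is independent of \(\alpha \), consistent with the expression in the proof.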
Proof of Theorem 3
Notice first that symmetry implies that \(\mu _{(1,L)}=\mu _{(3,H)}, \mu _{(2,L)}=\mu _{(2,H)}\), and \(\mu _{(3,L)}=\mu _{(1,H)}\). Thus, \(\frac{\mu _{(2,H)}}{\mu _{(2,L)}+\mu _{(2,H)}}=\frac{1}{2}\), implying that the expected payoff, conditional on being in state 2, is \(\frac{1}{2}\gamma +\frac{1}{2}(1-\gamma )=\frac{1}{2}\). Therefore, the agent solves
We begin by writing the steady-state condition for states \((1,H)\) and \((3,H)\) from Eq. (3) as
Imposing symmetry and monotonicity, these may be written as
Combining the two equations above with the observation in Eq. (2) that
we can solve for \(\mu _{(1,H)}\) and \(\mu _{(3,H)}\). In particular, we have
Furthermore, note that the decision maker’s expected payoff is
where we have again substituted for \(\mu _{(2,H)}\) using Eq. (2). This implies that the decision maker maximizes
Note first that, if \(\varphi _{1,2}^{H}=0\) (and, by symmetry, \(\varphi _{3,2}^{L}=0\)), then the “middle” memory state (state 2) is effectively redundant—the memory system only makes use of the two extremal states. Applying Theorem 2, the optimal memory, conditional on \(\varphi _{1,2}^{H}=0\), must have \(\varphi _{1,3}^{H}=1\). As in the optimal two-state memory from Theorem 2, this memory yields an expected payoff of
Clearly, the value of \(\varphi _{2,3}^{H}\) is irrelevant in this case. However, in order to ensure that there is only a single recurrent communicating class, we simply set \(\varphi _{2,3}^{H}=1\) when \(\varphi _{1,2}^{H}=0\).
Suppose instead that \(\varphi _{1,2}^{H}>0\). Then differentiating the payoff in Eq. (5) with respect to \(\varphi _{2,3}^{H}\) yields
Clearly, the denominator is positive. Moreover, \(\varphi _{1,2}^{H}>0\) implies that the numerator is positive. Thus, it is without loss of generality to set \(\varphi _{2,3}^{H}=1\) whenever \(\varphi _{1,2}^{H}>0\).
With this in mind, we consider two cases. We first assume that \(\varphi _{1,2}^{H}>0\) and \(\varphi _{1,2}^{H}+\varphi _{1,3}^{H}=1\). In this case, the decision maker’s payoff is \(U_{3}(\varphi _{1,2}^{H},1-\varphi _{1,2}^{H},1)\). Note, however, that
where we define \(\kappa :=(1-2\alpha )\gamma (1-\gamma )\). Since \(\alpha \in (0,\frac{1}{2})\) and \(\gamma \in (\frac{1}{2},1)\), we must have \(\kappa \in (0,1)\). Thus, \(\varphi _{1,2}^{H}\in [0,1]\) implies that
In addition, we can write
Note that \((2\kappa \varphi _{1,2}^{H}-\alpha )(\kappa \varphi _{1,2}^{H}-\alpha )\) is negative if, and only if, \(2\kappa \varphi _{1,2}^{H}-\alpha >0>\kappa \varphi _{1,2}^{H}-\alpha \). But \(2\kappa \varphi _{1,2}^{H}-\alpha <2\kappa \) and \(\kappa \varphi _{1,2}^{H}-\alpha >-\alpha \), implying that
Thus, \(U_{3}(\varphi _{1,2}^{H},1-\varphi _{1,2}^{H},1)\) is a convex function of \(\varphi _{1,2}^{H}\), and is therefore maximized either when \(\varphi _{1,2}^{H}=0\) or \(\varphi _{1,2}^{H}=1\). The decision maker’s expected payoff in each of these cases is
Then we have
Recalling the definition of \(\kappa \), we may then conclude that \(U_{3}(1,0,1)>U_{3}(0,1,1)\) if, and only if,
Turning to our second case, suppose that \(\varphi _{1,2}^{H}>0\) and \(\varphi _{1,2}^{H}+\varphi _{1,3}^{H}<1\). In addition, assume that \(\varphi _{1,3}^{H}>0\). Therefore, the first-order conditions for both \(\varphi _{1,2}^{H}\) and \(\varphi _{1,3}^{H}\) must hold; that is, we have
Solving these two equations yields
Note, however, that these two expressions sum to more than 1, a contradiction. Thus, we must have \(\varphi _{1,3}^{H}=0\), and only the first of the FOCs above can hold. This implies that
(Of course, this is less than 1 if, and only if, \(\frac{2\alpha }{(1-2\alpha )}<\gamma (1-\gamma )\); otherwise, we are at the corner solution where \(\varphi _{1,2}^{H}=1\).) Note, however, that
if, and only if, \(\alpha <\kappa \varphi _{1,2}^{H}\). Since \(\varphi _{1,2}^{H}=\sqrt{2\alpha /\kappa }\), this implies that \(U_{3}(\sqrt{2\alpha /\kappa },0,1)>U_{3}(0,1,1)\) if, and only if, \(\frac{\alpha }{2(1-2\alpha )}<\gamma (1-\gamma )\). \(\square \)
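The two threshold conditions appearing in this proof are straightforward algebra given \(\kappa =(1-2\alpha )\gamma (1-\gamma )\) and \(\varphi _{1,2}^{H}=\sqrt{2\alpha /\kappa }\): the interior solution lies below 1 if, and only if, \(\frac{2\alpha }{1-2\alpha }<\gamma (1-\gamma )\), and \(\alpha <\kappa \varphi _{1,2}^{H}\) if, and only if, \(\frac{\alpha }{2(1-2\alpha )}<\gamma (1-\gamma )\). The sketch below checks these equivalences numerically over a parameter grid; it tests only the algebra, not the underlying payoff comparison.

```python
import math

# Numerical check of the two threshold equivalences in the proof of
# Theorem 3, with kappa = (1 - 2*alpha) * gamma * (1 - gamma) and the
# interior solution phi = sqrt(2 * alpha / kappa).

def checks(alpha, gamma):
    kappa = (1 - 2 * alpha) * gamma * (1 - gamma)
    phi = math.sqrt(2 * alpha / kappa)
    # phi < 1  iff  2*alpha / (1 - 2*alpha) < gamma * (1 - gamma).
    assert (phi < 1) == (2 * alpha / (1 - 2 * alpha) < gamma * (1 - gamma))
    # alpha < kappa * phi  iff  alpha / (2 * (1 - 2*alpha)) < gamma * (1 - gamma),
    # since kappa * phi = sqrt(2 * alpha * kappa).
    assert (alpha < kappa * phi) == \
        (alpha / (2 * (1 - 2 * alpha)) < gamma * (1 - gamma))

if __name__ == "__main__":
    for a in [0.01, 0.05, 0.1, 0.2, 0.3, 0.45]:
        for g in [0.55, 0.7, 0.85, 0.99]:
            checks(a, g)
```

Both equivalences reduce to squaring: \(\sqrt{2\alpha /\kappa }<1\iff 2\alpha <\kappa \), and \(\alpha <\sqrt{2\alpha \kappa }\iff \alpha <2\kappa \).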
Lemma 4
The expected payoff of the four-state memory system depicted in Fig. 4 is
Proof
Note that we can write the Eq. (3) steady-state condition for states \((1,H), (2,H)\), and \((3,H)\), for the case of the four-state memory in Fig. 4, as
where we have made use of the fact that the memory transition rule is given by
Symmetry also implies that \(\mu _{(1,L)}=\mu _{(4,H)}, \mu _{(2,L)}=\mu _{(3,H)}, \mu _{(3,L)}=\mu _{(2,H)}\), and \(\mu _{(4,L)}=\mu _{(1,H)}\); therefore, we may write
In addition, recall from Eq. (2) that \(\mu _{(1,H)}+\mu _{(2,H)}+\mu _{(3,H)}+\mu _{(4,H)}=\frac{1}{2}\). Solving this system of four equations in four unknowns yields the stationary distribution of this memory system, which is given by
Therefore, the expected payoff of this memory system is
\(\square \)
Cite this article
Monte, D., Said, M. The value of (bounded) memory in a changing world. Econ Theory 56, 59–82 (2014). https://doi.org/10.1007/s00199-013-0771-1