1 Introduction: proper equilibrium and its thought experiment

In this paper, we reconsider the concept of proper equilibrium (Myerson 1978) in mixed extensions of finite strategic games, from now on just abbreviated to games. To adequately state our purposes and ideas, we first recall the underlying framework and basic notation and definitions. A game is given by \(G=(N,\{\Delta _{M^i}\}_{i \in N},\{\pi ^i\}_{i \in N})\), with \(N=\{1,\ldots ,n\}\) the player set, \(\Delta _{M^i}\) the mixed strategy space of player \(i \in N\), with \(M^i=\{1,\ldots ,m^i\}\) the set of pure strategies, and \(\pi ^i:\prod _{j \in N}\Delta _{M^j}\rightarrow \mathbb {R}\) the von Neumann Morgenstern expected payoff function of player i. A pure strategy \(k \in M^i\) of player i is alternatively denoted by \(e^i_k\), a typical element of \(\Delta _{M^i}\) by \(x^i\). We denote the probability which \(x^i\) assigns to pure strategy k by \(x^i_k\). The set of all strategy profiles is given by \(\Delta =\prod _{i \in N}\Delta _{M^i}\), a typical element of \(\Delta \) by x.

The most fundamental concept in games is that of Nash equilibrium (Nash 1951). A strategy profile \(\hat{x}\) is a Nash equilibrium of G, denoted by \(\hat{x} \in \mathrm{NE}(G)\), if \(\pi ^i(\hat{x})\ge \pi ^i(x^i,\hat{x}^{-i})\) for all \(x^i \in \Delta _{M^i}\) and all \(i \in N\). Here, \((x^i,\hat{x}^{-i})\) is the frequently used shorthand notation for the strategy profile \((\hat{x}^1,\ldots ,\hat{x}^{i-1},x^i,\hat{x}^{i+1},\ldots ,\hat{x}^n)\).

The set of Nash equilibria may be very large and can contain counterintuitive outcomes. Selten (1965) introduced the concept of perfect equilibrium as a refinement of the set of Nash equilibria. The essential idea in the thought experiment underlying perfect equilibrium is that no pure strategy should ever be given zero probability, since there is always a small chance that any pure strategy might be chosen, if only by mistake. To further refine the set of (perfect) equilibria, Myerson (1978) introduced the concept of proper equilibrium.

Definition 1.1

(Myerson 1978) Let \(G=(N,\{\Delta _{M^i}\}_{i \in N},\{\pi ^i\}_{i \in N})\) be an n-player game. A strategy profile \(x \in \Delta \) is a proper equilibrium of G if there exists a sequence \(\{\varepsilon _t\}_{t \in \mathbb {N}}\) of positive real numbers converging to zero, and a sequence \(\{x_t\}_{t \in \mathbb {N}}\) of completely mixed strategy profiles converging to x such that \(x_t\) is \(\varepsilon _t\)-proper for all \(t \in \mathbb {N}\), i.e.,

$$\begin{aligned} \pi ^i\left( e^i_{\ell },x^{-i}_t\right) <\pi ^i\left( e^i_{k},x^{-i}_t\right)\Rightarrow & {} x^i_{t,\ell }\le \varepsilon _t x^i_{t,k} \end{aligned}$$

for all \(k,\ell \in M^i\) and all \(i \in N\).

The properness concept plays an important role in the game theoretic literature and is widely studied in various directions, see, e.g., van Damme (1984), García-Jurado and Prada-Sánchez (1990), Blume et al. (1991), Yamamoto (1993) and Schuhmacher (1999). In the equilibrium refinement literature, it is featured most prominently in the work on stable sets (Kohlberg and Mertens 1986; Mertens 1989; Hillas 1990 and Mertens (1991)), as each stable set contains a proper equilibrium. The attractiveness of the properness concept is mainly based on the fact that this concept selects the intuitively appealing strategy combinations in many (well-known) games (see, e.g., Myerson 1978; van Damme 1991). Moreover, properness captures the extensive form notion of sequential rationality (van Damme 1984). In that sense, we recognize the selective power of proper equilibrium. In our opinion, however, the definition and underlying thought experiment of proper equilibrium are somewhat unsatisfying.

In the thought experiment underlying properness the idea is that, just as in the thought experiment underlying perfectness, players make mistakes. Contrary to the concept of perfectness, however, these mistakes are not made randomly; the trembles are somehow sensible, meaning that innocuous mistakes are made with a significantly higher probability than mistakes that have a substantial negative impact on the payoff of the players. However, in the thought experiment players have a passive role in the sense that they do not consciously decide on (an ordering of the) alternatives to their preferred strategies. More precisely, in the thought experiment underlying properness the alternatives are exogenously ordered based on the corresponding payoffs (given the opponent’s strategies). Hence, what is missing is an appropriate justification for obtaining this specific ordering. This problem is also addressed in van Damme (1991) who shows that the use of control costs does not provide such a justification. We provide a justification for the fundamental idea underlying properness by starting out from a different, active, thought experiment.

In this alternative approach, each player in the thought experiment is conscious of the fact that both his intended strategy and the intended strategies of his opponents might not be executed. In this approach, we then explicitly model how each player actively anticipates on the occurrence of such events. More specifically, in this thought experiment all the actions of each player are blocked exogenously with a small but positive probability. Since each player wants to play a best reply, each player has to strategically decide beforehand on a back-up action in case his first choice is blocked. However, since this back-up action might be blocked as well, he also has to decide on a second back-up action in case the first back-up action turns out to be unavailable, and so forth and so on. Hence, each player must decide on a complete ordering of his actions beforehand. The probability with which a player is unable to play a certain action is assumed to be independent of the particular choice he makes. This probability may, however, vary between players. We stress that players are not given the opportunity to block each other’s actions, but rather that actions can be blocked exogenously. The common ground with proper is that some exogenous event occurs (mistake/blocking), but the difference lies in what happens after that event. In proper, players somehow accidentally coordinate their mistakes, whereas in our setting the players have to consciously make a contingency plan in case their first choice turns out to be impossible for whatever reason (which we call “blocked”).

The described thought experiment results in the concept of fall back proper equilibrium, which alternatively can be seen as a hierarchical extension to the concept of fall back equilibrium, introduced by Kleppe et al. (2012) and further discussed in Kleppe et al. (2013). In the original fall back equilibrium concept, a player’s pure strategy in the fall back game is an intended pure strategy in the original game, backup up by a single fall back choice rather than a complete hierarchy. Although fall back proper equilibrium is an extension of the idea behind fall back equilibrium, the two solutions are logically unrelated (cf. Kleppe 2010).

Another refinement concept that is also based on a thought experiment with a more active anticipation concept is the notion of informationally robust equilibrium as introduced by Robson (1994) and further elaborated upon by Reijnierse et al. (2007).

To formalize the concept of fall back proper equilibrium, we introduce some additional notation. The action set in the fall back proper game for player \(i\in N\) within the thought experiment described above equals the set of all orderings of the action set \(M^i\), and is denoted by \(\Omega ^i\). Hence, the total number of actions in the fall back proper game for player i equals \(\tilde{m}^i=m^i!\). A typical element of \(\Omega ^i\) is denoted by \(\sigma \), where the action on position s of \(\sigma \) is given by \(\sigma (s) \in M^i\). A pure strategy \(\sigma \in \Omega ^i\) will alternatively be denoted by \(e^i_{\sigma }\). By \(\Omega ^i_k \subseteq \Omega ^i\), \(k \in M^i\), we denote the set of orderings of \(M^i\) for which \(\sigma (1)=k\), hence \(\Omega ^i_k=\{\sigma \in \Omega ^i|\sigma (1)=k\}\). The mixed strategy space of player i is given by \(\Delta _{\Omega ^i}\).

We assume that each action of player i is blocked with the same probability, denoted by \(\varepsilon ^i\), but we allow for different probabilities among the players. Hence, let \(\varepsilon =(\varepsilon ^1,\ldots ,\varepsilon ^n)\) be an n-tuple of (small) non-negative probabilities. If player i plays action \(\sigma \in \Omega ^i\) in the fall back proper game, he plays with probability \((1-\varepsilon ^i)(\varepsilon ^i)^{s-1}\) action \(\sigma (s)\) of the game G for \(s \in \{1,\ldots , |m^i|\}\). With probability \((\varepsilon ^i)^{m^i}\), all actions of player i are blocked, the game is not played and the payoff to all players is defined to be zero.

The fall back proper game \(\tilde{G}(\varepsilon )=(N,\{\Delta _{\Omega ^i}\}_{i \in N},\{\pi _{\varepsilon }^i\}_{i \in N})\) is the mixed extension of the corresponding finite game with \(\tilde{m}^i\) pure strategies for each player \(i \in N\). The payoff functions \(\{\pi _{\varepsilon }^i\}_{i\in N}\) on mixed strategy combinations in \(\Pi _{i \in N}\Delta _{\Omega ^i}\) are derived in the standard way using expected payoffs from the payoff functions on pure strategy combinations in \(\Pi _{i \in N}\Omega ^i\), as described by

$$\begin{aligned} \pi ^i_{\varepsilon }((e_{\sigma }^j)_{j \in N})=\sum _{(k^1,\ldots ,k^n)\in \prod _{r \in N} M^r}\left( \prod _{j \in N}(1-\varepsilon ^j)(\varepsilon ^j)^{\sigma ^{-1}(k^j)-1})\pi ^i((e^j_{k^j})_{j \in N}\right) \end{aligned}$$

for all \(i \in N\). The residual probability in which at least one player is unable to play any of his actions is implicitly incorporated in this payoff function, as in that case the payoff to every player is zero. Note that the zero payoff is arbitrary and will not influence the equilibria of the game, because it does not depend on the players’ strategy choices.

A typical element of \(\Delta _{\Omega ^i}\) is denoted by \(\rho ^i\), and the probability which \(\rho ^i\) assigns to pure strategy \(\sigma \) is given by \(\rho ^i_{\sigma }\). The set of all strategy profiles is given by \(\tilde{\Delta }=\prod _{i \in N}\Delta _{\Omega ^i}\), an element of \(\tilde{\Delta }\) by \(\rho \).

Definition 1.2

Let \(G=(N,\{\Delta _{M^i}\}_{i \in N},\{\pi ^i\}_{i \in N})\) be an n-player game. A strategy profile \(x \in \Delta \) is a fall back proper equilibrium of G if there exists a sequence \(\{\varepsilon _t\}_{t \in \mathbb {N}}\) of n-tuples of positive real numbers converging to zero, and a sequence \(\{\rho _t\}_{t \in \mathbb {N}}\) such that \(\rho _t \in \mathrm{NE}(\tilde{G}(\varepsilon _t))\) for all \(t \in \mathbb {N}\), converging to \(\rho \in \tilde{\Delta }\), with \(x^i_{k}=\sum _{\sigma \in \Omega ^i_k} \rho ^i_{\sigma }\) for all \(k \in M^i\) and all \(i \in N\). The set of fall back proper equilibria of a game G is denoted by \(\mathrm{FBPR}(G)\).

In the thought experiment underlying fall back proper equilibrium, all the actions of each player are blocked with a small but positive probability. Therefore, players decide beforehand on a complete ordering of their actions. This is modeled by letting players play the fall back proper game in which each action consists of a full ordering of the actions of the original game such that the first action is played with a probability close to one and each following action with a smaller probability of a fixed factor. A fall back proper equilibrium of the original game is then deduced from the limit point of a sequence of Nash equilibria of the corresponding fall back proper games when the blocking probabilities converge to zero.

Fall back proper equilibrium can be seen as a hierarchical extension of fall back equilibriumFootnote 1 (Kleppe et al. 2012), in which players are only allowed to use a single backup action. As a result, one might think that the set of fall back proper equilibria refines the set of fall back equilibria. We refer to Kleppe (2010) for an example which shows that this is not the case.

This paper shows that fall back proper equilibrium is a refinement of proper equilibrium. So, interestingly, the more active thought experiment underlying fall back proper equilibrium sheds some new light on the original, more passive thought experiment underlying proper equilibrium. We show that the two concepts coincide for two-player games. In general, fall back proper constitutes a refinement of proper, where designing an example where the concepts differ requires an intricate construction.

The outline of the remainder of the paper is as follows. In Sect. 2, we provide an alternative characterization of fall back proper equilibrium based only on limitations of the strategy spaces. Using that characterization, we show in Sect. 3 that the set of fall back proper equilibria is a (possibly strict) non-empty and closed subset of the set of proper equilibria, and in Sect. 4 that for two-player games the sets of proper and fall back proper equilibria coincide. Section 5 concludes the paper.

2 A characterization of fall back proper equilibrium

In this section, we provide an alternative characterization of fall back proper equilibrium in which the perturbations of the thought experiment are fully captured by limitations of the strategy spaces. This allows for a perturbed game of the same dimensions as the original one. For a (sufficiently small) blocking vector \(\delta \in \mathbb {R}^N_{+}\), the blocking game \(G(\delta )=(N,\{\Delta _{M^i}(\delta ^i)\}_{i \in N},\{\pi ^i\}_{i \in N})\) is defined to be the game which only differs from \(G=(N,\{\Delta _{M^i}\}_{i \in N},\{\pi ^i\}_{i \in N})\) in the sense that the strategy spaces are restricted to

$$\begin{aligned} \Delta _{M^i}(\delta ^i)&= \left\{ x^i \in \Delta _{M^i} | \sum _{k \in T^i} x^i_k \le \dfrac{1-(\delta ^i)^{|T^i|}}{1-(\delta ^i)^{m^i}}\text { for all } T^i \subseteq M^i\right\} \end{aligned}$$

for all \(i \in N\), with the domains of the payoff functions restricted accordingly. We define the set of all strategy profiles of the blocking game by \(\Delta (\delta )=\Pi _{j \in N}\Delta _{M^j}(\delta ^j)\).

Note that this blocking game gives the maximum probability by which each number of actions can be played, e.g., if player i puts the maximum allowed probability on the actions in a set \(T^i\), then any other strategy \(k \notin T^i\) can be played with a probability of at most \((1-\delta ^i)(\delta ^i)^{|T^i|}\).

Lemma 2.1

Let \(G=(N,\{\Delta _{M^i}\}_{i \in N},\{\pi ^i\}_{i \in N})\) be an n-player game. Let \(\delta \in \mathbb {R}^N_+\) be a blocking vector, and let \(\tilde{G}(\delta )=(N,\{\Delta _{\Omega ^i}\}_{i \in N},\{\pi ^i_{\delta }\}_{i \in N})\) and \(G(\delta )=(N,\{\Delta _{M^i}(\delta ^i)\}_{i \in N},\{\pi ^i\}_{i \in N})\) be the corresponding fall back proper and blocking game, respectively. Then, there exists an onto map \(f_\delta :\tilde{\Delta }\rightarrow \Delta (\delta )\) such that \(\pi ^i_{\delta }(\rho )=\pi ^i(f_{\delta }(\rho ))\cdot \Pi _{j \in N}(1-(\delta ^j)^{m^j})\) for all \(\rho \in \tilde{\Delta }\) and all \(i \in N\).

Proof

We explicitly construct a map \(f_{\delta }\) satisfying the conditions of the lemma. Let \(\rho \in \tilde{\Delta }\). We define \(f_{\delta }(\rho )=x\), with

$$\begin{aligned} x^i_{k}=\dfrac{\sum _{\sigma \in \Omega ^i}(1-\delta ^i)(\delta ^i)^{\sigma ^{-1}(k)-1}\rho ^i_{\sigma }}{1-(\delta ^i)^{m^i}} \end{aligned}$$

for all \(k \in M^i\) and all \(i \in N\). By considering the most extreme case in which \(\rho ^i_{\sigma }\) is a pure strategy in the fall back proper game, it is readily checked that \(\sum _{k \in T^i} x^i_k \le \dfrac{1-(\delta ^i)^{|T^i|}}{1-(\delta ^i)^{m^i}}\) for all \(T^i \subseteq M^i\) such that \(x \in \Delta (\delta )\). Furthermore, the probabilities put by strategy profile x on all the action profiles in the game G are equal to the probabilities put by \(\rho \) on these action profiles multiplied by \(\dfrac{1}{\Pi _{j \in N}(1-(\delta ^j)^{m^j})}\). Hence, \(\pi ^i_{\delta }(\rho ) = \pi ^i(x)\cdot \Pi _{j \in N}(1-(\delta ^j)^{m^j})=\pi ^i(f_{\delta }(\rho ))\cdot \Pi _{j \in N}(1-(\delta ^j)^{m^j})\) for all \(i \in N\). Finally, it is readily checked that \(f_{\delta }\) is onto. \(\square \)

As a consequence of Lemma 2.1, a fall back proper equilibrium can also be defined in terms of a sequence of Nash equilibria of blocking games.

Theorem 2.1

Let \(G=(N,\{\Delta _{M^i}\}_{i \in N},\{\pi ^i\}_{i \in N})\) be an n-player game. Then, a strategy profile \(x \in \Delta \) is a fall back proper equilibrium of G if and only if there exists a sequence \(\{\delta _t\}_{t \in \mathbb {N}}\) of blocking vectors of positive real numbers converging to zero and a sequence \(\{x_t\}_{t \in \mathbb {N}}\) converging to x such that \(x_t \in {NE}(G(\delta _t))\) for all \(t \in \mathbb {N}\).

Proof

We just prove the “only if” part, the reverse statement can be shown analogously. Assume \(\hat{x}\in \mathrm{FBPR}(G)\). Then by definition, there exists a sequence \(\{\delta _t\}_{t \in \mathbb {N}}\) of n-tuples of positive real numbers converging to zero, and a sequence \(\{\hat{\rho }_t\}_{t \in \mathbb {N}}\) converging to \(\hat{\rho } \in \tilde{\Delta }\), with \(\hat{x}^i_k=\sum _{\sigma \in \Omega ^i_k}\hat{\rho }^i_{\sigma }\) for all \(k \in M^i\) and all \(i \in N\), such that \(\hat{\rho }_t\in \mathrm{NE}(\tilde{G}(\delta _t))\) for all \(t \in \mathbb {N}\). By Lemma 2.1, there exists a sequence \(\{\hat{x}_t\}_{t \in \mathbb {N}}\) converging to \(\hat{x} \in \Delta \), with \(\hat{x}_t \in \Delta (\delta _t)\) for all \(t \in \mathbb {N}\), such that \(\pi ^i(\hat{x}_t) = \dfrac{\pi _{\delta _t}^i(\hat{\rho }_t)}{\Pi _{j \in N}(1-(\delta ^j)^{m^j})}\) for all \(i \in N\) and all \(t \in \mathbb {N}\).

Let \(i \in N\). We show that \(\pi ^i(\hat{x}_t)\ge \pi ^i(x^i_t,\hat{x}^{-i}_t)\) for all \(x^i_t \in \Delta _{M^i}(\delta ^i_t)\) and all \(t \in \mathbb {N}\), which proves that \(\hat{x}_t \in \mathrm{NE}(G(\delta _t))\) for all \(t \in \mathbb {N}\) and, therefore, completes the proof. Let \(t \in \mathbb {N}\) and let \((x^i_t,\hat{x}^{-i}_t)\in \Delta (\delta _t)\). Then by Lemma 2.1, we can take a strategy \((\rho ^i_t,\hat{\rho }^{-i}_t) \in \tilde{\Delta }\) such that \(\pi ^i_{\delta _t}(\rho ^i_t,\hat{\rho }^{-i}_t) = \pi ^i(x^i_t,\hat{x}^{-i}_t) \cdot \Pi _{j \in N}(1-(\delta ^j)^{m^j})\).

Since \(\hat{\rho }_t \in \mathrm{NE}(\tilde{G}(\delta _t))\), we obtain

$$\begin{aligned} \pi ^i(x^i_t,\hat{x}^{-i}_t)= & {} \dfrac{\pi _{\delta _t}^i(\rho ^i_t,\hat{\rho }^{-i}_t)}{\Pi _{j \in N}(1-(\delta ^j)^{m^j})}\\\le & {} \dfrac{\pi ^i_{\delta _t}(\hat{\rho }_t)}{\Pi _{j \in N}(1-(\delta ^j)^{m^j})}\\= & {} \pi ^i(\hat{x}_t). \end{aligned}$$

Consequently, \(\pi ^i(\hat{x}_t)\ge \pi ^i(x^i_t,\hat{x}^{-i}_t)\) for all \(x^i_t \in \Delta _{M^i}(\delta ^i_t)\) and all \(t \in \mathbb {N}\). \(\square \)

Note that it immediately follows from Theorem 2.1 that each completely mixed Nash equilibrium is a fall back proper equilibrium.

3 General results

In this section, we show that the set of fall back proper equilibria is a (possibly strict) non-empty and closed subset of the set of proper equilibria.

Theorem 3.1

Let G be an n-player game. Then, each fall back proper equilibrium of G is a proper equilibrium of G.

Proof

Let \(G=(N,\{\Delta _{M^i}\}_{i \in N},\{\pi ^i\}_{i \in N})\) be an n-player game and let \(x \in \mathrm{FBPR}(G)\). Then by Theorem 2.1, there exists a sequence \(\{\delta _t\}_{t \in \mathbb {N}}\) of blocking vectors converging to zero, and a sequence \(\{x_t\}_{t \in \mathbb {N}}\) such that \(x_t \in \mathrm{NE}(G(\delta _t))\) for all \(t \in \mathbb {N}\), converging to \(x \in \Delta \).

Let the sequence \(\{\varepsilon _t\}_{t \in \mathbb {N}}\) be given by \(\varepsilon _t = \max _{i \in N} \delta ^i_t\) for all \(t \in \mathbb {N}\). Let \(i \in N\) and let \(\pi ^i(e^i_{\ell },x^{-i}_{\hat{t}}) < \pi ^i(e^i_k,x^{-i}_{\hat{t}})\) for some \(k,\ell \in M^i\) and some \(\hat{t} \in \mathbb {N}\). Since \(x_t \in \mathrm{NE}(G(\delta _t))\) for all \(t \in \mathbb {N}\), it holds that \(x^i_{\hat{t},\ell } \le \delta ^i_{\hat{t}}x^i_{\hat{t},k}\). Hence, \(x^i_{\hat{t},\ell } \le \varepsilon _{\hat{t}} x^i_{\hat{t},k}\).

Consequently, \(\{\varepsilon _t\}_{t \in \mathbb {N}}\) is a sequence of positive real numbers converging to zero and \(\{x_t\}_{t \in \mathbb {N}}\) is a sequence of completely mixed strategy profiles converging to x such that for all \(t \in \mathbb {N}\)

$$\begin{aligned} \pi ^i\left( e^i_{\ell },x^{-i}_t\right) <\pi ^i\left( e^i_{k},x^{-i}_t\right)\Rightarrow & {} x^i_{t,\ell }\le \varepsilon _t x^i_{t,k} \end{aligned}$$

for all \(k,\ell \in M^i\) and all \(i \in N\). Hence, x is a proper equilibrium. \(\square \)

Hence, the set of fall back proper equilibria is a subset of the set of proper equilibria. The following theorem states that this subset is non-empty and closed.

Theorem 3.2

Let G be an n-player game. Then, the set of fall back proper equilibria of G is non-empty and closed.

Proof

We first show non-emptiness. Let \(\{\delta _t\}_{t \in \mathbb {N}}\) be a sequence of blocking vectors converging to zero. Take a sequence \(\{x_t\}_{t \in \mathbb {N}}\) such that \(x_t \in \mathrm{NE}(G(\delta _t))\) for all \(t \in \mathbb {N}\). Since the strategy spaces are compact, there exists a subsequence of \(\{x_t\}_{t \in \mathbb {N}}\) converging to, say, \(x \in \Delta \). By Theorem 2.1, \(x \in \mathrm{FBPR}(G)\).

Secondly, we show that \(\mathrm{FBPR}(G)\) is closed. Take a converging sequence \(\{x_t\}_{t \in \mathbb {N}}\) with \(x_t \in \mathrm{FBPR}(G)\) for all \(t \in \mathbb {N}\), with limit x. For all \(t \in \mathbb {N}\) there exists a sequence \(\{\delta _{tr}\}_{r \in \mathbb {N}}\) of blocking vectors converging to zero and a sequence \(\{x_{tr}\}_{r \in \mathbb {N}}\) converging to \(x_t\) such that

$$\begin{aligned} x_{tr} \in \mathrm{NE}(G(\delta _{tr})) \end{aligned}$$

for all \(r \in \mathbb {N}\). Considering the sequences \(\{\delta _{tt}\}_{t \in \mathbb {N}}\) and \(\{x_{tt}\}_{t \in \mathbb {N}}\), one readily establishes that \(x \in \mathrm{FBPR}(G)\).

Using a quite specific construction, the following example is designed to show that the set of fall back proper equilibria can be a strict subset of the set of proper equilibria.

Example 3.1

Consider the following three-player game G in which the third player chooses one of the four matrices, \(e^3_1\), \(e^3_2\), \(e^3_3\) or \(e^3_4\), respectively.

In this example, it is possible to coordinate the probabilities on the lower-level actions in such a way that \(x=(e^1_1,e^2_1,e^3_1)\) is a proper equilibrium. This type of coordination is, however, not possible in the thought experiment underlying fall back proper equilibrium, as players are not free to make these lower-level mistakes that just happen to make things work, as their assumed active role requires them to play a (hierarchical) best reply.

Consider the sequence \(\{\varepsilon _t\}_{t \in \mathbb {N}}\), with \(\varepsilon _t=\frac{2}{t}\) for all \(t \in \mathbb {N}\), converging to zero and the sequence \(\{\bar{x}_t\}_{t \in \mathbb {N}}\) converging to \(x \in \Delta \), with \(\bar{x}_t\) for all \(t \in \mathbb {N}\) given by \(\bar{x}^1_t=(1-\frac{1}{t}-\frac{1}{t^2}-\frac{1}{t^3}-\frac{1}{t^5})e^1_1+\frac{1}{t}e^1_2+\frac{1}{t^2}e^1_3+\frac{1}{t^3}e^1_4+\frac{1}{t^5}e^1_5\), \(\bar{x}^2_t=(1-\frac{1}{t}-\frac{1}{t^2}-\frac{1}{t^3})e^2_1+\frac{1}{t}e^2_2+\frac{1}{t^2}e^2_3+\frac{1}{t^3}e^2_4\) and \(\bar{x}^3_t=(1-\frac{1}{t}-\frac{1}{t^2}-\frac{1}{t^3})e^3_1+\frac{1}{t}e^3_2+\frac{1}{t^2}e^3_3+\frac{1}{t^3}e^3_4\). Then, \(\bar{x}_t\) is \(\varepsilon _t\)-proper for all \(t \in \mathbb {N}\) and, hence, x is a proper equilibrium.

If x were a fall back proper equilibrium, there should exist a sequence \(\{\delta _t\}_{t \in \mathbb {N}}\) of blocking vectors converging to zero and a sequence \(\{\hat{x}_t\}_{t \in \mathbb {N}}\) converging to x such that \(\hat{x}_t \in \mathrm{NE}(G(\delta _t))\) for all \(t \in \mathbb {N}\). Hence, for a \(t \in \mathbb {N}\) sufficiently large it should hold that \(\pi ^1(e^1_1,\hat{x}^{-1}_t)\ge \pi ^1(e^1_2,\hat{x}^{-1}_t)\), \(\pi ^2(e^2_1,\hat{x}^{-2}_t)\ge \pi ^2(e^2_2,\hat{x}^{-2}_t)\) and \(\pi ^3(e^3_1,\hat{x}^{-3}_t)\ge \pi ^3(e^3_2,\hat{x}^{-3}_t)\). However, note that \(\pi ^1(e^1_1,\hat{x}^{-1}_t)\ge \pi ^1(e^1_2,\hat{x}^{-1}_t)\) implies that \(\delta _t^3\ge \delta _t^2\), \(\pi ^2(e^2_1,\hat{x}^{-2}_t)\ge \pi ^2(e^2_2,\hat{x}^{-2}_t)\) implies that \(\delta ^1_t\ge \delta ^3_t\) and \(\pi ^3(e^3_1,\hat{x}^{-3}_t)\ge \pi ^3(e^3_2,\hat{x}^{-3}_t)\) implies that \(\delta ^2_t\ge 4\delta ^1_t\). Combining all this results in \(\delta ^1_t\ge 4\delta ^1_t\), which is not possible for \(\delta ^1_t>0\). Consequently, x is not a fall back proper equilibrium.

To understand why x is proper but not fall back proper, focus on player 3. To check whether player 3 puts maximum probability on \(e^3_1\) rather than \(e^3_2\), we should check his payoffs for all pure strategies for players 1 and 2, since in both thought experiments each player plays a completely mixed strategy. Scanning the payoff trimatrices, it follows that the strategy profiles \(y=(e^1_4,e^2_4,e^3_1)\) and \(z=(e^1_5,e^2_3,e^3_2)\) are important. Informally, to check whether x is fall back proper, the fall back strategy profiles y and z are equally important for player 3 since twice a fourth choice \((e^1_4,e^2_4)\) has the same number of factors \(\delta \) as one fifth and one third choice \((e^1_5,e^2_3)\).Footnote 2 As a result, the 4 at z trumps the 1 at y and consequently, x is not fall back proper. Nevertheless, we can still construct a sequence of \(x_t\) supporting x as a proper equilibrium. We only have to take care that player 1’s fifth choice (his worst pure strategy) is played with a sufficiently small probability to neutralise the 4 in z. In fall back proper equilibrium, it is not allowed to coordinate small probabilities on specific deviations in the game.

Note that to make the construction in Example 3.1 work, we need the three payoffs of 10 (one per player) outside the proper equilibrium. For the remaining payoffs in the trimatrix, we actually have some slack, as can be seen from the final inequality, \(\delta _t^1\ge 4\delta _t^1\). If you replace the 4 by any constant larger than 1, the same contradiction follows. As a result, the class of games for which the concepts of proper equilibrium and fall back proper equilibrium do not coincide does not have measure zero.

4 Results for two-player games

In the previous section, we showed that in general the set of fall back proper equilibria is a (possibly strict) subset of the set of proper equilibria. Interestingly, for two-player games the sets of proper and fall back proper equilibria coincide.

Theorem 4.1

Let G be a two-player game. Then, the sets of proper and fall back proper equilibria of G coincide.

Proof

Let \(G=(\{1,2\},\{\Delta _{M^i}\}_{i \in \{1,2\}},\{\pi ^i\}_{i \in \{1,2\}})\) be a two-player game. Since \(\mathrm{FBPR}(G) \subseteq \mathrm{PR}(G)\) for all n-player games (Theorem 3.1), we only have to show that \(\mathrm{PR}(G) \subseteq \mathrm{FBPR}(G)\). Let \(x \in \mathrm{PR}(G)\). Then, there exists a sequence \(\{\varepsilon _t\}_{t \in \mathbb {N}}\) of positive real numbers converging to zero, and a sequence \(\{x_t\}_{t \in \mathbb {N}}\) of completely mixed strategy profiles converging to x such that \(x_t\) is \(\varepsilon _t\)-proper for all \(t \in \mathbb {N}\), i.e.,

$$\begin{aligned} \pi ^i\left( e^i_{\ell },x^{-i}_t\right) <\pi ^i\left( e^i_k,x^{-i}_t\right)\Rightarrow & {} x^i_{t,\ell }\le \varepsilon _t x^i_{t,k} \end{aligned}$$

for all \(k,\ell \in M^i\) and all \(i \in N\).

Let \(i \in \{1,2\}\) and \(t \in \mathbb {N}\). We divide the actions of player i recursively in a finite number \(S^i_t\) of best reply sets such that \(Q^i_t(s)=\{k \in M^i \backslash \cup _{r \in \{1,\ldots ,s-1\}}Q^i_t(r) | \pi ^i(e^i_k,x^j_t)\ge \pi ^i(e^i_{\ell },x^j_t)\text { for all } \ell \in M^i \backslash \cup _{r \in \{1,\ldots ,s-1\}}Q^i_t(r)\}\) for all \(s \in \{1,\ldots ,S^i_t\}\). Note that since \(x_t\) is \(\varepsilon _t\)-proper, \(x^i_{t,\ell }\le \varepsilon _t x^i_{t,k}\) for all \(k \in Q^i_t(s)\) and \(\ell \in Q^i_t(s')\) with \(s<s'\).

For each set \(Q^i_t(s)\), with \(s=\{1,\dots ,S^i_t\}\), we construct a strategy \(\bar{x}^i_t(s)\) such that

$$\begin{aligned} \bar{x}^i_{t,k}(s)=\left\{ \begin{array}{ll} \dfrac{x^i_{t,k}}{\sum _{k \in Q^i_t(s)}x^i_{t,k}}&{}\quad \text {if }k \in Q^i_t(s),\\ 0&{}\quad \text {otherwise}.\\ \end{array}\right. \end{aligned}$$

Hence, \(\bar{x}^i_t(s)\) is a strategy in which actions outside \(Q^i_t(s)\) are not played and the probabilities on the actions in \(Q^i_t(s)\) are relatively the same as in \(x^i_{t}\).

Let \(\delta ^i_t=\varepsilon _t\) for all \(i \in N\) and all \(t \in \mathbb {N}\). Then, we construct for each \(t \in \mathbb {N}\) the strategy \(\hat{x}_t\) such that

$$\begin{aligned} \hat{x}^i_t=\dfrac{\sum _{s=1}^{S^i_t}((1-\delta ^i_t)\sum _{a=b}^{b+|Q^i_t(s)|}(\delta ^i_t)^{a})\bar{x}^i_t(s)}{1-(\delta ^i_t)^{m^i}} \end{aligned}$$

for all \(i \in \{1,2\}\), with \(b=|\cup _{r<s}Q^i_t(r)|\).

It follows that the sequence \(\{\hat{x}_t\}_{t \in \mathbb {N}}\) converges to x and that \(\hat{x}_t \in \Delta (\delta _t)\) for all \(t \in \mathbb {N}\). It remains to be shown that for all \(i \in \{1,2\}\) and all \(t \in \mathbb {N}\), \(\pi ^i(\hat{x}_t)\ge \pi ^i(\dot{x}^i_t,\hat{x}^{-i}_t)\) for all \(\dot{x}^i_t \in \Delta _{M^i}(\delta ^i_t)\). Since each player has only one opponent, for all \(i \in \{1,2\}\) and all \(\ell \in M^i\), \(\{k \in M^i | \pi ^i(e^i_k,x^{-i}_t)\ge \pi ^i(e^i_{\ell },x^{-i}_t)\}=\{k \in M^i | \pi ^i(e^i_k,\hat{x}^{-i}_t)\ge \pi ^i(e^i_{\ell },\hat{x}^{-i}_t)\}\). Hence, let \(i \in \{1,2\}\) and \(t \in \mathbb {N}\), and let \(k \in Q^i_t(s)\) and \(\ell \in Q^i_t(s')\), with \(s < s'\). Then, there is number \(U \in \{1,\ldots ,S^{-i}_t\}\) such that

$$\begin{aligned} \pi ^i(e^i_k,\bar{x}^{-i}_t(u))= & {} \pi ^i(e^i_{\ell },\bar{x}^{-i}_t(u)) \end{aligned}$$

for all \(1\le u < U\), and

$$\begin{aligned} \pi ^i(e^i_k,\bar{x}^{-i}_t(U))> & {} \pi ^i(e^i_{\ell },\bar{x}^{-i}_t(U)). \end{aligned}$$

This implies that in \(\hat{x}_t\) player i recursively puts the maximum allowed probability on each following best reply level. Consequently, \(\pi ^i(\hat{x}_t)\ge \pi ^i(\dot{x}^i_t,\hat{x}^{-i}_t)\) for all \(\dot{x}^i_t \in \Delta _{M^i}(\delta ^i_t)\). Therefore, \(x \in \mathrm{FBPR}(G)\). \(\square \)

5 Concluding remarks

This paper provides a new thought experiment with an active fall back role for the players to put into new light the concept of proper equilibrium, which has been criticised in the literature (e.g., van Damme 1991) for the passive role of the players in its original thought experiment. Although technically, the concepts of fall back proper and proper do not always coincide, an example showing that the two concepts differ needs a rather specific, intricate design.