1 Introduction

We study perfect information games with infinitely many players where each player is active only once. Our emphasis is on the question of existence of subgame perfect \(\epsilon \)-equilibria.Footnote 1 The references below testify to the increasing attention that the concept has been receiving in the recent game-theoretic literature. In addition to the challenging existence issue, the paper reaches out to a large area of research on topics such as minority games, time-inconsistent preferences, and intergenerational games.

The contribution of the paper is twofold. The first contribution is to introduce a class of so-called frequency-based minority games. These are games where a player benefits from choosing an action that is less popular among the players. Minority games have been used to model a variety of different phenomena ranging from congestion, downloading in P2P networks, market entry, to speculative behavior in financial markets. For an extensive list of related works, we refer to Renault et al. (2007) and Renault et al. (2008).

Unlike much of the work on minority games that focuses on the setup with simultaneous moves, here we consider sequential games of perfect information. In more detail, we consider perfect information games where each player becomes active exactly once in the course of the game and has a choice between two actions. One is a risky action: It gives a high payoff if a minority of the players use it and a low payoff otherwise. The other is a safe action: Taking this action leads to an intermediate payoff for sure, irrespective of what the other players do. In minority games, payoffs depend on the set of players who take the action that the minority of the players choose. The subtlety of the construction lies in the exact meaning of the expression “a minority of the players.” We allow for several specifications, the most characteristic of which invokes the frequency of the appearance of the risky action in the population. If too many players choose the safe action or the risky action, then choosing the risky action or the safe action, respectively, provides a higher payoff.

Our main result on frequency-based minority games in Sect. 3 states that these games admit no subgame perfect \(\epsilon \)-equilibria for small positive \(\epsilon \). The crucial tension arises from the fact that each player needs to predict the behavior of the future players. If, in the course of the game, it becomes likely that the majority of the future players will take the risky action, then the active player has an incentive to take the safe action with a large probability. But then, the probability for the majority of players to play risky is small.

Closely related to our work are two known counterexamples to existence of subgame perfect \(\epsilon \)-equilibrium. One is a stopping game played by a sequence of players considered in Flesch, Kuipers, Mashiah-Yaakovi, Schoenmakers, Solan, and Vrieze (2010, page 753). The example shows that games of the type studied in this paper need not have subgame perfect \(\epsilon \)-equilibria in pure strategies.

The other example is given in Flesch et al. (2014). This being a two-player game, it falls outside of the class of games studied here. All deviations in the current paper are automatically one-shot, since each player acts only once in the course of the game. In contrast, in Flesch et al. (2014) deviations can be more complex and may involve changes by a player at infinitely many periods of time. Moreover, while the counterexample in the present paper admits an interpretation as an intergenerational game, the same cannot be said of the game in Flesch et al. (2014). Indeed, the agent form of the game in Flesch et al. (2014), that is, the game obtained by splitting each player into infinitely many “selves,” each self acting at most once, does have a subgame perfect 0-equilibrium in pure strategies.Footnote 2

There are also important differences on a purely technical level. This paper relies on coupling. Coupling is used to compare any given strategy profile of the players to one particular stationary strategy profile whereby each player takes the risky action with a fixed positive probability. The main technique in Flesch et al. (2014) is Lévy’s zero–one law. The intuition behind the nonexistence of subgame perfect \(\epsilon \)-equilibrium is that in order for player 2 to punish player 1, player 2 must, with a small but positive probability, repeatedly take a “bad” action. However, Lévy’s zero–one law then forces player 2 to take the bad action in deep subgames with probability close to one. Such behavior is incompatible with payoff maximization by player 2.

The second contribution of this paper is to provide several sufficient conditions for the existence of subgame perfect \(\epsilon \)-equilibrium. Many of these conditions have appeared previously in the literature in various contexts. We now discuss each condition in turn.

[I] Games with qualitative objectives (also called Boolean objectives): These are games where each player’s payoff function takes only two values, 0 or 1. This class is motivated by a vast literature in computer science on qualitative games. The inventory of qualitative objectives considered in the computer science literature includes reachability, safety, Büchi, parity, and Müller objectives. For the precise definitions of these objectives, we refer to the reviews by Chatterjee and Henzinger (2012) and Bruyère (2017). We show that games with qualitative objectives admit subgame perfect 0-equilibria in pure strategies. The proof is based on the iterative procedure in Flesch et al. (2010).

[II] Games with upper semicontinuous payoffs. We show that games of this class admit a subgame perfect 0-equilibrium in pure strategies. This result complements the findings of Purves and Sudderth (2011), who consider games with finitely many players. A more general result, albeit with a more complicated proof, has been recently obtained by Flesch and Predtetchinski (2017).

[III] Games continuous outside a countable set. These are games in which the payoffs are continuous when restricted to a sufficiently large part of the domain. This somewhat technical condition is related to the work of Flesch and Predtetchinski (2016b). We show that this class of games admits a subgame perfect \(\epsilon \)-equilibrium for every positive value of \(\epsilon \). The result generalizes the corresponding result of Cingiz et al. (2016).

[IV] Games played by a finite number of teams. A team is a group of players with identical payoff functions. In this class of games, the set of players can be partitioned into finitely many teams. The idea of teams consisting of individual players has attracted considerable attention (see, e.g., von Stengel and Koller 1997; Solan 2000; Gossner and Tomala 2007; von Stengel and Zamir 2010; Gossner and Hörner 2010). We show that games of this class admit a subgame perfect 0-equilibrium in pure strategies.

Games where each player acts only once arise naturally in economics. One example is decision making with time-inconsistent preferences, as modeled by a sequence of multiple selves (Strotz 1956; Pollak 1968; Peleg and Yaari 1973). A special case of this is hyperbolic discounting (Phelps and Pollak 1968; Laibson 1994, 1997; Jaśkiewicz and Nowak 2014). Another example relates to intergenerational games, where a player in the infinite sequence of players represents a generation (Phelps and Pollak 1968; Balbus et al. 2015). In a similar vein, Asheim (2010) considers a model with an infinite stream of generations to analyze intergenerational social welfare. A class of continuous games played by a sequence of players is studied, for example, in Hellwig et al. (1990). Callander and Hörner (2009) study social learning using games played by a sequence of players.

The paper is organized as follows. In Sect. 2, we provide the general model and formal definitions. In Sect. 3, we introduce frequency-based minority games and state our main result: Nonexistence of subgame perfect \(\epsilon \)-equilibrium. In Sect. 4, we give several sufficient conditions for the existence of subgame perfect \(\epsilon \)-equilibrium. Proofs are collected in “Appendix.”

2 The model

Preliminaries: Let \(\mathbb {N}\) denote the set of natural numbers \(\{1,2,\ldots \}\). Let A be a non-empty finite set. For every \(t \in \mathbb {N}, \) let \(H_{t}\) denote the set of sequences of elements of A of length \(t-1\). Thus, in particular, \(H_{1}\) is a singleton consisting of the empty sequence \(\varnothing .\) Let \(H = \cup _{t \in \mathbb {N}}H_{t}\). Elements of H are called histories. Let \(A^{\mathbb {N}}\) denote the set of infinite sequences of elements of A. Elements of \(A^{\mathbb {N}}\) are called plays. A typical play is written as \(p = (a_{1},a_{2},\ldots )\). Finite sequences \(\varnothing , (a_{1}), (a_{1},a_{2}),\ldots \) are said to be prefixes of the play \((a_{1},a_{2},\ldots )\). A play \( p \in A^{\mathbb {N}} \) is said to extend a history \(h \in H\) if h is a prefix of p. In a similar way, we define the prefix of a history and a history extending another history.

Given a history \(h \in H,\) we define the cylinder set C(h) as the set of plays \(p \in A^{\mathbb {N}}\) such that h is a prefix of p. The collection of all cylinder sets is the basis of a topology on \(A^{\mathbb {N}}\). In what follows, \(A^{\mathbb {N}}\) is always assumed to be endowed with the topology generated by the cylinder sets. This topology is equal to the product topology, where each A is given the discrete topology, and is well known to be a Polish topology.

We let \(\mathcal {B}\) denote the Borel sigma-algebra on \(A^{\mathbb {N}}, \) the smallest sigma-algebra containing all the open sets. Since A is assumed to be finite, \(\mathcal {B}\) coincides with the sigma-algebra generated by the cylinder sets.

The game: We consider games where the set of players is equal to \(\mathbb {N}\). Player \(t \in \mathbb {N}\) is active in exactly one period, period t. In period t,  player t chooses an action \(a_t\) from the set A. The chosen action is observed by all players. Depending on the infinite sequence of actions chosen \(p=(a_1,a_2,\ldots )\), player t receives a payoff equal to \(u_{t}(p)\). Throughout the paper, we assume that for every player \( t \in \mathbb {N}, \)\(u_t\) takes only finitely many values and is Borel measurable on \(A^{\mathbb {N}}\).

Let some player \( t \in \mathbb {N} \) be given. A strategy \(\sigma _t\) for player t is a mapping \(\sigma _t: H_t \rightarrow \Delta (A). \) Recall that \(H_{t}\) is defined as the set of histories of length \( t-1, \) so \(H_t\) corresponds to the set of histories where player t is active. The set \(\Delta (A)\) contains all probability measures on A. A strategy profile is denoted by \(\sigma = (\sigma _t)_{t \in \mathbb {N}}. \) The set of strategies for player t is denoted by \(\Sigma _t\) and the set of strategy profiles by \(\Sigma = \displaystyle \prod {} _{t \in \mathbb {N}} \Sigma _{t} \).

A strategy \( \sigma _t\) for player \(t \in \mathbb {N}\) is called pure if for every \(h \in H_t\), \(\sigma _t(h)\) places all the probability on a single action. The set of pure strategies for player t is denoted by \(S_t\). A pure strategy \( s_{t} \in S_{t} \) is treated as a function \( s_{t} : H_{t} \rightarrow A. \) The set of pure strategy profiles is denoted by \(S = \displaystyle \prod _{t \in \mathbb {N}} S_{t} \).

We define \(\rho : S \times H \rightarrow A^{\mathbb {N}}\) by letting \(\rho (s|h)\) denote the play induced by the pure strategy profile s conditional on reaching history h. Formally, for every \( t \in \mathbb {N}, \) for every \(h \in H_{t}\), we let \(\rho (s|h)\) be the play \((h, a_{t}, a_{t+1}, \ldots )\) where \(a_{t} = s_{t}(h)\), \(a_{t+1} = s_{t + 1}(h,a_{t})\), and so on. The payoff for a player \( t \in \mathbb {N}\) induced by a pure strategy profile \( s \in S \) at \( h \in H_{t} \) is denoted by \(v_{t}(s|h) = u_{t}(\rho (s|h))\).

Given a strategy profile \(\sigma \in \Sigma \) and a history \( h \in H, \) we let \(\mu _{\sigma ,h}\) denote the Borel probability measure on \(A^{\mathbb {N}}\) induced by \(\sigma \) in the subgame starting with history h. Thus, \(\mu _{\sigma ,h}\) places probability 1 on the cylinder set C(h). The expected payoff for a player \(t \in \mathbb {N}\), induced by \(\sigma \) at h, is equal to

$$\begin{aligned} v_{t}(\sigma |h)=\mathbb {E}_{\mu _{\sigma ,h}}(u_t)= \displaystyle \int _{A^{\mathbb {N}}}u_{t} \mathrm {d}\mu _{\sigma ,h}. \end{aligned}$$
(2.1)

We denote \(v_{t}(\sigma |\varnothing )\) by \(v_{t}(\sigma )\). This brings us to the following definitions.

\(\epsilon \)-Equilibrium: For \(\epsilon \ge 0\), a strategy profile \(\sigma \in \Sigma \) is an \(\epsilon \)-equilibrium if for every player \(t\in \mathbb {N}\), for every strategy \(\sigma _t' \in \Sigma _t\),

$$\begin{aligned} v_{t}(\sigma ) \ge v_{t}(\sigma _{-t},\sigma _{t}')-\epsilon . \end{aligned}$$

Mertens and Neyman (see Mertens 1987) have proven that a perfect information game with finitely many players and bounded Borel-measurable payoff functions admits a pure \(\epsilon \)-equilibrium. Le Roux (2013) generalized their result in several directions: to infinitely many players (thus covering the games studied in this paper) and quasi-Borel preference relations rather than Borel payoff functions. Since we additionally assume that the payoff functions have finite range, the games studied in this paper admit a pure 0-equilibrium.

A strategy profile that constitutes an \(\epsilon \)-equilibrium for a game does not necessarily induce an \(\epsilon \)-equilibrium in every subgame, even though that would be highly desirable for a solution concept. This is the main motivation for considering the following refinement of \(\epsilon \)-equilibrium.

Subgame perfect \(\epsilon \)-equilibrium: For \(\epsilon \ge 0\), a strategy profile \(\sigma \in \Sigma \) is a subgame perfect\(\epsilon \)-equilibrium if for every history \( h \in H\), for every player \(t \in \mathbb {N},\) for every strategy \(\sigma _t' \in \Sigma _t,\)

$$\begin{aligned} v_{t}(\sigma |h) \ge v_{t}(\sigma _{-t},\sigma _{t}'|h)-\epsilon . \end{aligned}$$

Thus, a strategy profile \(\sigma \in \Sigma \) is a subgame perfect \(\epsilon \)-equilibrium if and only if, for every \( h \in H, \)\( \sigma \) induces an \( \epsilon \)-equilibrium in the subgame starting with history h if and only if for every \(t \in \mathbb {N}\), for every \( h \in H_t, \) for every \(a_t \in A, \)

$$\begin{aligned} v_{t}(\sigma |h) \ge v_{t}(\sigma _{-t},a_t|h )-\epsilon . \end{aligned}$$
(2.2)

The goal of this paper is to study the existence of subgame perfect \(\epsilon \)-equilibria.

3 Frequency-based minority games

This section introduces the class of frequency-based minority games. In Sect. 3.1, we describe our leading example of a frequency-based minority game, and in Sect. 3.2, we give a definition of the entire class. Section 3.3 provides some further discussion of the definition, and Sect. 3.4 contains the main result and an overview of its proof. The formal proof is relegated to “Appendix A.”

3.1 The leading example

Example 3.1

Consider a game where the action set is given by \(A = \{0,1\}\). Define the function \(f : A^{\mathbb {N}} \rightarrow [0,1]\) by

$$\begin{aligned} f(a_{1},a_{2},\ldots ) = \displaystyle \limsup _{t \rightarrow \infty } \tfrac{1}{t}(a_{1}+\cdots +a_{t}). \end{aligned}$$
(3.1)

The function f gives the frequency (interpreted as the limit superior of the empirical average) of the use of action 1. We define the set

$$\begin{aligned} F = \left\{ p \in A^{\mathbb {N}} \mid f(p) > \tfrac{1}{2}\right\} . \end{aligned}$$
(3.2)

We interpret F as expressing the condition that “more than half” of the players use action 1. To illustrate, consider the play \(p = (1,1,0,1,1,0,\dots )\). Under this action sequence, only every third player takes the action 0, so \(f(p) = 2/3\) and p is an element of F. Notice that p is an element of F even though 0 is played by infinitely many players.

For \(t \in \mathbb {N}\), player t’s payoff function is given by

$$\begin{aligned} u_{t}(a_{1},a_{2},\ldots )= {\left\{ \begin{array}{ll} 1,&{}\text {if }a_t=1,\\ 2,&{}\text {if }a_t=0\text { and }(a_{1},a_{2},\ldots ) \in F,\\ 0,&{}\text {if }a_t=0\text { and }(a_{1},a_{2},\ldots ) \in A^{\mathbb {N}}\setminus F. \end{array}\right. } \end{aligned}$$
(3.3)

Action 1 is a safe action that leads to a payoff of 1 irrespective of the actions chosen by the other players. Action 0 is a risky action. It leads to a payoff of 2 if the play of the game belongs to the set F and to a payoff of 0 otherwise, so the risky action pays off if it is chosen by a minority of the players.

As noted earlier, the games studied in this paper have a 0-equilibrium. One particular 0-equilibrium of the game in Example 3.1 is as follows: All players take action 1 as long as no player has chosen action 0. As soon as some player takes action 0, all subsequent players take action 0 as well. This 0-equilibrium is not subgame perfect. At a history such that some player has chosen action 0,  the 0-equilibrium prescribes to take action 0 leading to a payoff of 0. Switching to action 1 gives a payoff of 1 and is therefore a profitable deviation.

Surprisingly, as follows from the main result of the paper, the game admits no subgame perfect \(\epsilon \)-equilibrium for any sufficiently small \(\epsilon > 0\). It is clear that the payoff functions of the game are not continuous; otherwise, existence would follow, for example, from the positive result of Sect. 4.2.

3.2 The definition of frequency-based minority games

Let \(A=\{0,1\}\) and \(F \subset A^{\mathbb {N}}=\{0,1\}^{\mathbb {N}}\). A game is called frequency-based minority game if for every player \(t \in \mathbb {N} \) the payoff function is given by equation (3.3) and the set F satisfies the following properties:

  1. [F1]

    F is a Borel subset of \(A^{\mathbb {N}}\).

  2. [F2]

    F is a tail set in the following sense: If \((a_{1},a_{2},\ldots ) \in F\) and \((a_{1}',a_{2}',\ldots ) \in A^{\mathbb {N}}\) is such that \(a_{n}' \ne a_{n}\) for at most finitely many \(n \in \mathbb {N}\), then \((a_{1}',a_{2}',\ldots ) \in F\).

  3. [F3]

    F is closed under \(\le \): If \((a_{1},a_{2},\ldots ) \in F\) and \((a_{1}',a_{2}',\ldots ) \in A^{\mathbb {N}}\) is such that, for every \( t \in \mathbb {N}, \)\(a_{t} \le a_{t}^{\prime }, \) then \((a_{1}',a_{2}',\ldots ) \in F\).

  4. [F4]

    There exists a number \(\nu ^{*} \in (0,1)\) with the following property: If \(\beta _{1}, \beta _{2},\ldots \) is a sequence of i.i.d. binary random variables, each taking the value 1 with probability \(\nu ^{*}\) and the value 0 with probability \(1 -\nu ^{*}\), then

    $$\begin{aligned}\mathbb {P}((\beta _{1},\beta _{2},\ldots ) \in F) = 1.\end{aligned}$$
  5. [F5]

    There exists a number \(\nu _{*} \in (0,1)\) with the following property: If \(\beta _{1}', \beta _{2}',\ldots \) is a sequence of i.i.d. binary random variables, each taking the value 1 with probability \(\nu _{*}\) and the value 0 with probability \(1 - \nu _{*}\), then

    $$\begin{aligned}\mathbb {P}((\beta _{1}',\beta _{2}',\ldots ) \in F) = 0.\end{aligned}$$

The leading example of a set satisfying [F1–F5] is the set F as defined in (3.2). It satisfies [F1] since f is a Borel-measurable function. It satisfies condition [F4] with for instance \(\nu ^{*} = 2/3\) and condition [F5] with \(\nu _{*} = 1/3\) as a consequence of the strong law of large numbers.

The set F describes the plays such that action 0 is played by a minority of the players. As we remarked previously, a minority can consist of infinitely many players.

Condition [F1] is a technical measurability condition. Condition [F2] essentially means that each individual player is negligible: whether a play belongs to F or not depends only on the tail of the play. Condition [F3] says that starting from an action profile in F, if some of the players revise their action upward, then the resulting action profile is again in F. By Condition [F4], it holds that if all players choose action 1 independently with sufficiently high probability, then the play belongs to F with probability 1. Similarly, by Condition [F5], if all players choose action 1 independently with sufficiently low probability, then the play belongs to F with probability 0.

3.3 Further discussion of properties [F1–F5]

We have already seen that the set F as defined in (3.2) satisfies [F1–F5]. In fact, the choice of 1/2 as a lower bound on the frequency of action 1 is inessential. Any number in the open interval (0, 1) yields a set F satisfying [F1–F5]. Likewise, the set

$$\begin{aligned} \left\{ (a_{1},a_{2},\ldots ) \in A^{\mathbb {N}} \mid \liminf _{t \rightarrow \infty } \tfrac{1}{t}(a_{1}+ \dots + a_{t}) > \tfrac{1}{2}\right\} \end{aligned}$$

satisfies [F1–F5]. A somewhat less obvious example of a set satisfying [F1–F5] is

$$\begin{aligned} \left\{ (a_{1},a_{2},\ldots ) \in A^{\mathbb {N}} \mid \ell (a_{1},a_{2},\ldots ) > \tfrac{1}{2}\right\} , \end{aligned}$$

where \(\ell \) is a medial limit, a special type of Banach limit, as described in Mertens, Sorin and Zamir (2015, page 29). This medial limit \(\ell \) is a positive linear functional that assigns a real number to every bounded sequence of real numbers and has various appealing properties. For example, for every \((a_{1},a_{2},\ldots )\in A^{\mathbb {N}}\), it holds that

$$\begin{aligned}& \ell (a_{1},a_{2},\ldots )=\ell (a_{1},\tfrac{1}{2}(a_{1}+a_{2}),\tfrac{1}{3}(a_{1}+a_{2}+a_{3}),\ldots ),\\& \ell (a_{1},a_{2},a_3,\ldots )=\ell (a_{2},a_{3},\ldots ), \\& \liminf _{t \rightarrow \infty } \tfrac{1}{t}(a_{1}+ \dots + a_{t}) \le \ell (a_{1},a_{2},\ldots ) \le \limsup _{t \rightarrow \infty } \tfrac{1}{t}(a_{1}+ \dots + a_{t}). \end{aligned}$$

It is easy to make use of examples of sets with [F1–F5] to create new examples, since the collection of those sets is closed under finite unions and finite intersections.

It follows from the topological zero–one law (Kechris (1995), Theorem 8.47) that a set satisfying [F1–F5] is either meager or comeager. The Hewitt–Savage zero–one law (Shiryaev 1996, page 382) implies that if F satisfies [F1–F5] and \(\beta _{1},\beta _{2},\dots \) is an i.i.d. sequence of Bernoulli random variables, then the probability \(\mathbb {P}((\beta _{1},\beta _{2},\ldots ) \in F)\) is either 0 or 1.

Next, we remark that any set satisfying [F1–F5] is sandwiched between the sets E and \(E'\) defined as follows:

$$\begin{aligned} E= & {} \{(a_{1},a_{2},\ldots ) \in A^{\mathbb {N}} \mid \{t \in \mathbb {N} : a_{t} = 0\} \text{ is } \text{ finite }\}, \nonumber \\ E^{\prime }= & {} \{(a_{1},a_{2},\ldots ) \in A^{\mathbb {N}} \mid \{t \in \mathbb {N} : a_{t} = 1\} \text{ is } \text{ infinite }\}. \end{aligned}$$
(3.4)

Claim 3.2

If F satisfies [F1–F5], then \(E \subset F \subset E^{\prime }\).

Proof

The set F is non-empty by [F4]. By [F3], it must contain the play \( (1,1,\ldots ). \) It now follows by [F2] that it contains all plays in E.

Suppose that F is not a subset of \( E^{\prime }. \) Then, F contains a play \((a_{1},a_{2},\ldots )\) such that only finitely many players choose action 1. Replacing the finitely many 1’s by 0’s and making use of [F2], we conclude that F contains the play \((0,0,\ldots )\). But then, by [F3], we have \(F = A^{\mathbb {N}}, \) contradicting [F5]. \(\square \)

The set E satisfies [F1], [F2], [F3], and [F5], but not [F4]. Indeed, if \(\beta _{1}, \beta _{2},\ldots \) is a sequence of i.i.d. distributed random variables, each taking value 1 with probability \(\nu > 0\), then with probability 1 the realization of infinitely many random variables \(\beta _{t}\) is equal to 0. The set \(E'\) satisfies [F1], [F2], [F3] [F4], but not [F5].

3.4 The main result and overview of the proof

The main result of this section is the following theorem.

Theorem 3.3

Let G be a frequency-based minority game with a set F satisfying [F1–F5]. Let \(\nu ^{*}\) be as in [F4] and \(\nu _{*}\) as in [F5]. Then, G has no subgame perfect \(\epsilon \)-equilibrium whenever \( \epsilon < \min \{\nu _{*},(1-\nu ^{*})/3\}. \)

Proof

See “Appendix A.” \(\square \)

It is easy to see that G has no subgame perfect 0-equilibrium in pure strategies. Indeed, suppose that \(\sigma \) is a pure subgame perfect 0-equilibrium of G. We argue that, after each history, the play induced by \(\sigma \) lies in F. This is because each player can guarantee a payoff of 1 by taking the safe action 1. If the induced play would not be in F, then the players taking action 0 would be receiving a payoff of 0, contradicting the fact that \(\sigma \) is subgame perfect 0-equilibrium. But then, since the continuation play after any history is an element of F, every player prefers to choose action 0, leading to a contradiction.

The difficulty of Theorem 3.3 involves the incorporation of mixed strategies. The intuitive reason why mixing does not mend the nonexistence result can be explained as follows: Even if the strategy profile \(\sigma \) is mixed, in very deep subgames the probability of reaching the set F under \(\sigma \) is either very close to zero or very close to one as a consequence of Lévy’s zero–one law. Thus, in those deep subgames, the players can make very accurate predictions. Such accuracy, however, forces them to make nearly pure choices, leading us back to the situation of a pure strategy profile.

The formal proof relies on a technique known as coupling. Coupling is a technique from probability theory, which is useful whenever the distributions of two distinct random variables need to be compared to each other. The technique consists of creating a new, larger probability space and two new random variables, having the same distributions as the two original ones, with some additional property that affords a simpler comparison of the two. Coupling is used to compare any given strategy profile of the players to one particular stationary strategy profile whereby each player takes the risky action with a fixed positive probability.

4 Sufficient conditions for existence

For games with finitely many players, a number of sufficient conditions for the existence of a subgame perfect \(\epsilon \)-equilibrium have been identified, see for instance Kuipers et al. (2016) and Flesch and Predtetchinski (2016a, b). Much less is known about games with infinitely many players.

Below, we discuss four classes of games for which subgame perfect \(\epsilon \)-equilibria exist: games with qualitative objectives, games with upper semicontinuous payoffs, games continuous outside a countable set, and games played by a finite number of teams. Many of the conditions that we discuss have previously appeared in the literature in various contexts.

As in Sect. 2, we continue to assume that for every player for every player \( t \in \mathbb {N}, \)\(u_t\) takes only finitely many values and is Borel measurable on \(A^{\mathbb {N}}\).

4.1 Games with qualitative objectives

The game G is said to have qualitative objectives if for each player \(t \in \mathbb {N}\) the payoff function \(u_{t}\) takes at most two values, 0 and 1. The literature on computer science provides many examples of qualitative objectives. Examples are the objectives of reachability and safety, and the criteria of Büchi, parity, and Müller, see the reviews by Chatterjee and Henzinger (2012) and Bruyère (2017).

Example 4.1, which is inspired by the work of Voorneveld (2010) on minority games, is a game with qualitative objectives. The payoff functions in this game are exactly those of Example 4.3 in Voorneveld (2010), an example which is used to show the nonexistence of a 0-equilibrium. What distinguishes our analysis is that we are looking at a perfect information version of the game, while Voorneveld (2010) analyzes the simultaneous-move version. Notice that the game in Example 4.1 is not a frequency-based minority game as defined in Sect. 3.

Example 4.1

The action set is equal to \( A = \{0,1\}. \) For every \( t \in \mathbb {N}, \) player t’s payoff function is given by

$$\begin{aligned}u_{t}(a_{1},a_{2},\ldots )= {\left\{ \begin{array}{ll} 1,&{}\text {if }a_t=1\text { and }f(a_{1},a_{2},\ldots )\le 1/2,\\ 1,&{}\text {if }a_t=0\text { and }f(a_{1},a_{2},\ldots ) > 1/2,\\ 0,&{}\text {otherwise}, \end{array}\right. }\end{aligned}$$

where f is as in (3.1).

Player t receives a payoff of 1 if he chooses the minority action. Otherwise, player t receives a payoff of 0. More precisely, player t receives a payoff of 1 if he chooses action 1 and the frequency of players choosing action 1 is less than or equal to 1/2. Player t also receives a payoff of 1 if he chooses action 0, and the frequency of players choosing action 1 is strictly above 1/2.

The game has a pure strategy subgame perfect 0-equilibrium: Player 1 playing action 1, and each player \(t > 1\) taking the same action as player \(t-1\), is an example of such an equilibrium. \(\Box \)

Theorem 4.2

If the game G has qualitative objectives, then it has a subgame perfect 0-equilibrium in pure strategies.

We apply the iterative algorithm developed in Flesch et al. (2010) to prove Theorem 4.2. For each ordinal number \( \xi , \) we recursively define the collection \( \{P_{\xi }(h) \mid h \in H\} \) of sets of plays and the collection \( \{\alpha _{\xi }(h) \mid h \in H \}\) of real numbers augmented with \(+\infty \). For ordinal number 0,  for every \( t \in \mathbb {N}, \) for every history \(h \in H_{t}, \) we define

$$\begin{aligned} P_{0}(h)= & {} \{p\in A^{\mathbb {N}} \mid h\text { is a prefix of }p\}, \end{aligned}$$
(4.1)
$$\begin{aligned} \alpha _{0}(h)= & {} \min _{p \in P_{0}(h)}u_{t}(p). \end{aligned}$$
(4.2)

For every successor ordinal \(\xi +1, \) for every \( t \in \mathbb {N}, \) for every \(h \in H_{t},\) we let

$$\begin{aligned} \alpha _{\xi +1}(h)&=\max _{a\in A} \min _{p \in P_{\xi }(h,a)}u_{t}(p), \end{aligned}$$
(4.3)
$$\begin{aligned} P_{\xi +1}(h)&=\{p \in \bigcup _{a\in A}P_{\xi }(h,a) \mid u_{t}(p)\ge \alpha _{\xi +1}(h)\}, \end{aligned}$$
(4.4)

with the convention that the minimum over the empty set is equal to \(+\infty \). For every limit ordinal \(\xi , \) for every \( t \in \mathbb {N}, \) for every history \(h \in H_{t},\) we let

$$\begin{aligned} P_{\xi }(h)&= \bigcap _{\lambda <\xi } P_{\lambda }(h),\\ \alpha _{\xi }(h)&= \min _{p \in P_{\xi }(h)}u_{t}(p). \end{aligned}$$

For \( t \in \mathbb {N}, \) we let \(W_{t}\) denote the set of plays \(p \in A^{\mathbb {N}}\) for which \(u_{t}(p) = 1\). We think of \(W_{t}\) as the winning set of player t.

Proof of Theorem 4.2:

One can show, exactly as in Flesch et al. (2010), that for each \(h \in H\) the sequence \(P_{\xi }(h)\) is non-increasing by inclusion and that the sequence \(\alpha _{\xi }(h)\) is non-decreasing. Moreover, the game admits a subgame perfect 0-equilibrium in pure strategies if and only if the sets \(P_{\xi }(h)\) are non-empty for every ordinal \(\xi \) and every history h. Furthermore, the set \(\cap _{\xi }P_{\xi }(\varnothing )\) is exactly the set of plays that could be induced by subgame perfect 0-equilibria of the game.

We show that \(P_{\xi }(h)\) is a non-empty set for each history h by induction on \(\xi \). The statement is clearly true for \(\xi = 0\). Suppose the statement is true for the ordinal \(\xi \). Consider a history \(h \in H_{t}\), and let a be an action that reaches the maximum in (4.3). Then, \(P_{\xi + 1}(h) \supset P_{\xi }(h,a)\); hence, \(P_{\xi + 1}(h)\) is non-empty, as desired.

We now turn to the more challenging case of a limit ordinal \(\xi \). Suppose that \(P_{\eta }(h)\) is non-empty for each \(h \in H \) and each ordinal \(\eta < \xi \). Let \(h \in H_{t}\) be given. We recursively define a play \(p = (h, a_{t}, a_{t+1},\dots )\). To define \(a_{t}, \) we distinguish two cases.

Case 1: If \(\alpha _{\eta }(h) = 0\) for each \(\eta < \xi \), let \(a_{t}\) be any element of A.

Case 2: Otherwise, let \(\eta < \xi \) be the least successor ordinal such that \(\alpha _{\eta }(h) = 1\). By (4.3), there is an action \(a_{t} \in A\) such that

$$\begin{aligned} \min _{p \in P_{\eta -1}(h,a_t)}u_{t}(p)=1. \end{aligned}$$

Notice that \(P_{\eta }(h,a_{t}) \subset W_{t}\).

To proceed with the definition of p, let \(h_{t+1} = (h, a_{t})\) and repeat the argument ad infinitum. The construction guarantees that \(p \in P_{\xi }(h)\). \(\square \)

Grädel and Ummels (2008) consider games with qualitative objectives in a setup that is complementary to ours: There are finitely many players and each player can move infinitely many times. The authors show that the games of this class admit a subgame perfect 0-equilibrium. Unlike Grädel and Ummels (2008), who rely on Borel determinacy (Martin, 1975) to obtain their result, our technique does not require that the winning sets \(W_{t}\) be Borel measurable.

4.2 Games with upper semicontinuous payoffs

The payoff function \(u_t\) of player \( t \in \mathbb {N} \) is upper semicontinuous if for every sequence \((p^{n})_{n \in \mathbb {N}}\) of plays in \(A^{\mathbb {N}}\) that converges to a limit \(p \in A^{\mathbb {N}} \) it holds that \(\limsup _{n \rightarrow \infty }u_{t}(p^n) \le u_{t}(p)\). Equivalently, \(u_{t}\) is upper semicontinuous if for each real number \(r \in \mathbb {R} \) the set \(\{p \in A^{\mathbb {N}} \mid u_{t}(p) \ge r\}\) is closed. The game G is said to have upper semicontinuous payoffs if every player \( t \in \mathbb {N} \) has an upper semicontinuous payoff function.

Theorem 4.3

If the game G has upper semicontinuous payoffs, then it has a subgame perfect 0-equilibrium in pure strategies.

Proof

As in the proof of Theorem 4.2, the result follows once we show that the set \(P_{\xi }(h)\) as defined in Sect. 4.1 is a non-empty compact set for each history h and each ordinal \(\xi \). The proof is by induction on \(\xi \). For \(\xi = 0\), the statement is clearly true. Assume that for some ordinal number \(\xi \) the statement is true. Take \( h \in H_{t} \), and let \(a \in A\) be an action that attains the maximum in (4.3). Since \(P_{\xi + 1}(h) \supset P_{\xi }(h,a), \) the set \(P_{\xi + 1}(h)\) is non-empty. Since, for every \(a \in A, \) the set \(P_{\xi }(h,a)\) is compact by assumption and since A is finite, the set \(\cup _{a \in A}P_{\xi }(h,a)\) is also compact. The set \(\{p \in A^{\mathbb {N}} \mid u_{t}(p) \ge \alpha _{\xi +1}(h)\}\) is compact because \(u_{t}\) is upper semicontinuous. We conclude that \(P_{\xi + 1}(h)\) is compact.

Let \(\xi \) be a limit ordinal, and assume the statement is true for each ordinal \( \eta < \xi . \) Then, the statement is true for \( \xi , \) because the intersection of a nested family of non-empty and compact sets is non-empty and compact. \(\square \)

This result complements the result in Purves and Sudderth (2011), which shows that perfect information games with a finite number of players and upper semicontinuous payoff functions with finite range admit a subgame perfect 0-equilibrium in pure strategies. Their method of proof relies on induction on the number of payoffs in the game and therefore seems difficult to extend to games with infinitely many players.

Theorem 4.3 is subsumed by a recent result of Flesch and Predtetchinski (2017), which, however, has a more complicated proof. Another related work is Le Roux and Pauly (2014), Theorem 22, which implies that games of our class with upper semicontinuous payoffs admit a 0-equilibrium in pure strategies.

An interesting open problem is existence of subgame perfect \(\epsilon \)-equilibria in games with lower semicontinuous payoffs. Due to an example in Flesch et al. (2010), we do know that such games do not always admit a subgame perfect 0-equilibrium.

4.3 Games continuous outside a countable set

A subset C of \(A^{\mathbb {N}}\) is said to be co-countable if the set \(A^{\mathbb {N}} \setminus C\) is countable. The game G is said to be continuous outside a countable set if there exists a co-countable subset C of \(A^{\mathbb {N}}\) such that for each player \( t \in \mathbb {N}\) the restriction of the payoff function \(u_{t}\) to the set C, denoted by \(u_{t}|C\), is a continuous function.Footnote 3 The game G is said to have uniformly bounded payoffs if there exists \(B < \infty \) such that, for every \( t \in \mathbb {N}, \) for every \( p \in A^{\mathbb {N}}, \)\(|u_{t}(p)| < B. \)

An illustration of this condition is provided by Example 4.4. The payoff functions of this example are borrowed from Peleg (1969). In Peleg (1969), the players make simultaneous moves and it is shown that the game does not admit a 0-equilibrium. Moreover, Radner (1980) has shown that Peleg’s game does not admit an \(\epsilon \)-equilibrium for arbitrarily small positive values of \( \epsilon . \) The game of Example 4.4 is, like all games studied in this paper, a game of perfect information.

Example 4.4

The action set is equal to \( A = \{0,1\}. \) For every \( t \in \mathbb {N}, \) player t’s payoff function is given byFootnote 4

$$\begin{aligned} u_{t}(a_{1},a_{2},\ldots )= {\left\{ \begin{array}{ll} 1, &{} \text {if } a_{t} = 1, \\ 2, &{} \text {if } a_{t} = 0 \text { and } (a_{1}, a_{2}, \ldots ) \in E, \\ 0, &{} \text {if }a_{t} = 0\text { and } (a_{1}, a_{2}, \ldots ) \in A^{\mathbb {N}} \setminus E. \end{array}\right. } \end{aligned}$$

Recall that \( E = \{(a_{1},a_{2},\ldots ) \in A^{\mathbb {N}} \mid \{t \in \mathbb {N} : a_{t} = 0\} \text{ is } \text{ finite }\} \) has been defined in (3.4) as the set of plays such that only finitely many players take action 0.

The game is not a frequency-based minority game, since the set E violates condition [F4]; see also the discussion in Sect. 3.3. The game is continuous outside a countable set, with \(C = A^{\mathbb {N}}\setminus E\). Indeed, E is countable. Moreover, the function \(u_{t}|C\) is a function of \(a_{t}\) only and is therefore continuous.

The game admits a subgame perfect \(\epsilon \)-equilibrium for every positive \(\epsilon \), for instance the strategy profile where each player takes action 1 with probability \(1 - \epsilon \) and action 0 with probability \(\epsilon \). This result is tight, as the game has no subgame perfect 0-equilibrium as is shown in “Appendix B.” \(\square \)

Theorem 4.5

If the game G has uniformly bounded payoffs and is continuous outside a countable set, then, for every \( \epsilon > 0, \) it has a subgame perfect \(\epsilon \)-equilibrium.

Proof

Let \(\delta > 0\) be such that \(2BM \delta < \epsilon , \) where M is the cardinality of the action set A. For each player \( t \in \mathbb {N}, \) consider a restricted strategy space \(\Sigma _{t}^{\delta }\) consisting of strategies \(\sigma _{t}\) such that for each \(h \in H_{t}\) the probability distribution \(\sigma _{t}(h)\) places a probability of at least \(\delta \) on each action in A. Let \(\Sigma ^{\delta }\) be the corresponding set of strategy profiles. It is clear that for each \(\sigma \) in \(\Sigma ^{\delta }\) the induced probability measure \(\mu _{\sigma ,h}\) places probability zero on each singleton set \(\{p\}\) and consequently also on each countable subset of \(A^{\mathbb {N}}\). In particular, \(\mu _{\sigma ,h}\) assigns probability 0 to the complement of the set C. It follows that the payoff function \(v_{i}(\sigma |h)\) is continuous on \(\Sigma ^{\delta }\). By Peleg and Yaari (1973, Theorem 6.2), there is a 0-equilibrium in the restricted strategy space. Since each finite sequence of actions is reached with positive probability, this is also a subgame perfect 0-equilibrium of the restricted game. It follows by the choice of \(\delta \) that this strategy profile is a subgame perfect \(\epsilon \)-equilibrium of the original game.\(\square \)

Theorem 4.5 generalizes a result in Cingiz et al. (2016), who consider so-called centipede games played by an infinite sequence of players.

4.4 Games played by a finite number of teams

A team is a group of players with identical payoff functions. The game G is said to be played by a finite number of teams if there exists a finite partition \(\{T_{1},\dots ,T_{n}\}\) of the player set \(\mathbb {N}\) such that, for every \( k \in \{1,\ldots ,n\}, \) for every \( t,t' \in T_{k}, \)\( u_{t} = u_{t^{\prime }}. \)

Theorem 4.6

If the game G is played by a finite number of teams, then it has a subgame perfect 0-equilibrium in pure strategies.

Proof

Consider a game where the set of players is \(\{1,\dots ,n\}\), player k’s payoff function is \(u_{t}\) for \(t \in T_{k}\), and player k makes a move at each period \(t \in T_{k}\). This results in a game with finitely many players. Flesch et al. (2010) now apply to find a strategy profile that is immune to one-shot deviations, i.e., a strategy profile with the property that no player can improve his payoff at any history by deviating only once. Such a strategy profile induces a subgame perfect 0-equilibrium of the game G. \(\square \)