Deterministic n-person shortest path and terminal games on symmetric digraphs have Nash equilibria in pure stationary strategies

Boros, Endre; Franciosa, Paolo Giulio; Gurvich, Vladimir; Vyalyi, Michael

doi:10.1007/s00182-023-00875-y

Deterministic n-person shortest path and terminal games on symmetric digraphs have Nash equilibria in pure stationary strategies

Original Paper
Open access
Published: 10 October 2023

Volume 53, pages 449–473, (2024)
Cite this article

Download PDF

You have full access to this open access article

International Journal of Game Theory Aims and scope Submit manuscript

Deterministic n-person shortest path and terminal games on symmetric digraphs have Nash equilibria in pure stationary strategies

Download PDF

Endre Boros¹,
Paolo Giulio Franciosa ORCID: orcid.org/0000-0002-5464-4069²,
Vladimir Gurvich^1,3 &
…
Michael Vyalyi^3,4,5

688 Accesses
1 Citation
Explore all metrics

Abstract

We prove that a deterministic n-person shortest path game has a Nash equlibrium in pure and stationary strategies if it is edge-symmetric (that is (u, v) is a move whenever (v, u) is, apart from moves entering terminal vertices) and the length of every move is positive for each player. Both conditions are essential, though it remains an open problem whether there exists a NE-free 2-person non-edge-symmetric game with positive lengths. We provide examples for NE-free 2-person edge-symmetric games that are not positive. We also consider the special case of terminal games (shortest path games in which only terminal moves have nonzero length, possibly negative) and prove that edge-symmetric n-person terminal games always have Nash equilibria in pure and stationary strategies. Furthermore, we prove that an edge-symmetric 2-person terminal game has a uniform (subgame perfect) Nash equilibrium, provided any infinite play is worse than any of the terminals for both players.

Topological Network-Control Games

A Dijkstra-Type Algorithm for Dynamic Games

Article 12 May 2015

The Complexity of Infinitely Repeated Alternating Move Games

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Given a finite directed graph $G = (V,E)$, we interpret a vertex $v \in V$ as a position of a game and a directed edge $e = (u,v)\in E$ as a move from u to v.

The set of players is denoted by $I = \{1, \ldots , n\}$. Each player $i\in I$ controls a subset $V_i\subseteq V$ of the positions, and we also have a nonempty subset $V_T$, the set of so-called terminal positions that are not controlled by any of the players. We assume that the sets $V_1$,..., $V_n$, and $V_T$ form a partition of V, and that terminals are the only positions with no directed edges leaving. Furthermore, we fix a position $v_0\in V{\setminus } V_T$ and call it the initial position.

A pure and stationary strategy $\sigma _i$ of player $i\in I$ is a mapping $\sigma _i:V_i\mapsto V$ such that $(v,\sigma _i(v))\in E$ for all $v \in V_i$. To simplify our notation, sometimes we also use $\sigma _i$ as the set of edges $\{(v,\sigma _i(v))\mid v\in V_i\}$. In this paper we consider only pure and stationary strategies, and call them simply strategies.

Let us denote by $\Sigma _i$ the set of strategies of player $i\in I$, and set $\Sigma =\Sigma _1\times \cdots \times \Sigma _n$. The tuple $\sigma = \left( \sigma _i \mid i \in I\right) \in \Sigma$ is called a strategy profile or situation. Every situation determines uniquely a directed walk, called the play $P(\sigma )$ and defined as follows. In a position $v \in V_i$ on this walk, player i makes the move $(v, \sigma _i(v))$, after which the walk enters $v'=\sigma _i(v)$. Starting with $v_0$, this process creates a unique play $P(\sigma )$. This play either comes to a terminal $v\in V_T$, where it stops (no player handles the terminal and no moves are from it), or it repeats a position reached earlier, in which case it follows the enclosed cycle infinitely many times. In the first case we call $P(\sigma )$ a terminal play, while in the second case it is called an infinite play.

Each player $i\in I$ has his own length function $\ell ^i:E\rightarrow \mathbb {R}$ that assigns to every move $e\in E$ a real number $\ell ^i(e)$ that we interpret as the cost of this move for this player; we call it the $\ell ^i$-cost of the move and set $\ell =(\ell ^i\mid i\in I)$. The cost of a sequence of moves is the sum of the costs of these moves. In particular, for a situation $\sigma \in \Sigma$ and player $i\in I$ the effective cost of $\sigma$ for player i is

$$\begin{aligned} \ell ^i(\sigma ) ~=~ \sum _{(u,v)\in P(\sigma )} \ell ^i(u,v). \end{aligned}$$

This kind of additive effective cost function, called total, is considered in Thuijsman and Vrieze (1987); see more analysis and variations in Boros et al. (2018, 2017); Gurvich and Oudalov (2014). Note that the cost of an infinite play may not be well defined, in particular when the cost of moves can be both positive and negative. The above cited papers provide several possible solutions for this problem. In our paper we focus on the special case, when all cost functions are positive; in this case the cost of an infinite play is $+\infty$, while every terminal play has a finite cost.

All players aim to minimize their effective costs.

We will call the obtained class of games the shortest path games, or simply games in the sequel. Such a game $\Gamma$ is defined by the triple $(G, v_0, \ell )$. Note that we always assume a given partition $V = V_1 \cup \cdots \cup V_n\cup V_T$ of the vertices of G; just we do not want to convolute our paper with extra notation.

A situation $\sigma \in \Sigma$ is called a Nash equilibrium (or NE, in short) if $\ell ^i(\sigma ) \le \ell ^i(\sigma ')$ for any player $i \in I$ and any situation $\sigma ' \in \Sigma$ that may differ from $\sigma$ only by the strategy of player i. In other words, a situation is a NE if no player $i \in I$ can reduce her effective cost by choosing another strategy, provided that all other players keep their old strategies. We call $\sigma$ a terminal NE if it is a NE and $P(\sigma )$ is terminal.

1.1 Edge-symmetric and positive shortest path games

A local cost function $\ell$ is called positive if $\ell ^{i}(e) > 0$ for all players $i \in I$ and moves $e \in E$. This condition is equivalent to the seemingly weaker condition of $\sum _{e \in C} \ell ^i(e) > 0$ for all directed cycles C of G and players $i \in I$. The equivalence is based on the concept of potential transformation introduced for directed graphs in Gallai (1958). It is applied to the length function of each player, separately. Such an equivalent transformation can in fact be computed efficiently, by linear programming. Since this transformation changes all path lengths between two vertices by the same constant, a shortest path game with positive cycle lengths can always be transformed into an equivalent game with positive edge lengths.

A digraph $G=(V,E)$ is called edge-symmetric if $(u,v)\in E$ whenever $(v,u)\in E$, except if $u\in V_T$. Finally, we call a game $\Gamma =(G, v_0, \ell )$ edge-symmetric and positive if G is edge-symmetric and $\ell$ is positive. Let us emphasize that the cost functions of the players are not assumed to be symmetric, that is $\ell ^i(u,v)\ne \ell ^i(v,u)$ may hold for some edges $(u,v)\in E$ and players $i\in I$.

Theorem 1

An edge-symmetric and positive n-person shortest path game $\Gamma =(G, v_0, \ell )$ has a NE. Furthermore, it has a terminal NE whenever $V_T$ is reachable from $v_0$.

Let us add that the above result cannot be extended to the so-called limit mean effective payoff games (see Gillette 1957 for definitions). For mean effective payoff a 2-person NE-free example was constructed in Gurvich (1988) on an edge-symmetric bipartite digraph with non-positive length function, and it is easy to make this example positive by adding a constant to the lengths of all edges.

Both conditions of edge-symmetry and positivity, are essential in Theorem 1. In Sect. 6 we provide examples of NE-free edge-symmetric (but not positive) 2-person games. For $n \ge 3$ such examples were known earlier (Boros and Gurvich 2003; Gurvich and Oudalov 2014). Furthermore, a NE-free positive but not edge-symmetric 3-person game was obtained in Gurvich and Oudalov (2014).

Another case, when the assumption of edge-symmetry helps to give criteria for the existence of a NE is the family of so-called cyclic games, in which each directed cycle is a separate outcome. For 2-person edge-symmetric cyclic games (Boros et al. 2011) provides a criterion for the existence of NE for all cost functions. Without the assumption of edge-symmetry no such criterion is known.

In Fraenkel et al. (1993), a polynomial algorithm solving game “geography” was found for symmetric digraphs, while the problem is known to be PSPACE-complete in general (Lichtenstein and Sipser 1980; Schaefer 1978).

Note also that positivity is important even if we have only one player. In the presence of negative arc lengths (with some negative cycles) a shortest path may not exist, and computing a shortest simple path is NP-hard in this case.

A shortest path game is called play-once if $\vert V_i\vert =1$ for all $i\in I$. It was shown in Boros and Gurvich (2003) that every play-once positive game has a NE.

It remains an open problem whether every 2-person non-edge-symmetric positive shortest path game has a NE or not.

1.2 Edge-symmetric terminal games

A related family of games that in fact forms a special case of shortest path games is the family of terminal games. In many popular positional games (Go, Chess, Checkers, etc.) players do not pay for moves; the effective cost for each player depends only on the terminal position. We assume, as before, that the set of positions and moves of the game form a directed graph, $G=(V,E)$, players $i\in I$ control distinct subsets $V_i\subseteq V$ of the positions, and $V_T$ is a nonempty subset of the terminal positions, such that $V=V_1\cup \cdots \cup V_n\cup V_T$ is a partition of the set of positions. In these games a situation may end up in a terminal position within $V_T$ or in an infinite play, going around the same cycle infinitely many times. We consider all infinite plays equivalent, and denote the corresponding outcome by c. Thus, to evaluate the outcome of these games we have mappings $\mathcal {L}^i: V_T \rightarrow \mathbb {R}$ for $i\in I$ that evaluate the value of terminals for players.

Then, for a situation $\sigma$ that defines a play $P(\sigma )$ the effective cost of player $i\in I$ is defined as

$$\begin{aligned} \mathcal {L}^i(\sigma ) ~=~ {\left\{ \begin{array}{ll} \mathcal {L}^i(w) &{} \text { if } P(\sigma ) \text { is a finite play terminating at } w\in V_T,\\ 0 &{} \text { if } P(\sigma ) \text { is the infinite play { c}.} \end{array}\right. } \end{aligned}$$

The infinite play c may be worse than some terminals and better than some others for a player. As in shortest path games, all players minimize their own effective costs. We call such a game $\Gamma =(G,v_0,\mathcal {L})$ a terminal game.

In the 2-person zero-sum case, such games are also called deterministic graphical; see (Washburn 1990), where the existence of a NE (saddle point) in pure and stationary strategies was shown.

In Boros and Gurvich (2003), this result was extended to the non-zero-sum 2-person case using a general criterion of Gurvich (1975, 1988). Recently these results were extended further for the so-called multi-stage deterministic graphical games; see (Gurvich 2018) and also (Gurvich and Koshevoy 2018). However, this line of arguments cannot be extended to the n-person case for $n>2$, since the criterion of Gurvich (1975, 1988) holds only for $n=2$. Examples for 3- and 4-person NE-free terminal games were constructed in Boros et al. (2018); Gurvich (2015).

Let us also mention that Everett (1997) already in 1957 considered a closely related class of terminal games and, among other results, provided some NE-free examples for concurrent zero-sum terminal games.

The existence of a NE for the above defined class of terminal games remains an open problem if we assume additionally that $\mathcal {L}^i(v)< 0$ for all terminals $v\in V_T$ and players $i\in I$. This condition describes a natural subclass of these games in which an infinite play is worse than any terminal play for each of the players. We can show that this subfamily of terminal games can be viewed as a subfamily of positive shortest path games, and thus the existence of a (terminal) NE for such a game is implied by Theorem 1. We can further generalize this by waiving the above assumption, and prove the following claim:

Theorem 2

An edge-symmetric n-person terminal game has a NE.

1.3 Uniform Nash equilibrium for edge-symmetric terminal games

We can further strengthen Theorem 2 for the case of 2 players. We call a situation $(\sigma _i\mid i\in I)$ a uniform Nash equilibrium (or a UNE, in short), if it is a NE no matter what the initial position is.

Theorem 3

Assume that $\Gamma =(G,\mathcal {L})$ is a terminal game that satisfies the following conditions:

(TWO)
$n=\vert I\vert =2$;
(SYM)
G is edge-symmetric;
(CIW)
infinite plays are worse than any of the terminal plays for all players.

Then $\Gamma$ has a UNE.

Let us add that all three conditions are necessary to guarantee the existence of a UNE, as we demonstrate it in Sect. 5.

We can also note that condition (CIW) is automatically satisfied by positive shortest path games. However, the theorem still fails even if we keep all three conditions but replace terminal games by shortest path games, see the last example in Sect. 6.

2 Edge-symmetric positive shortest path games

Assume that $\Gamma =(G,v_0,\ell )$ is an edge-symmetric and positive shortest path game. We denote by $N^+(v)=\{u\mid (v,u)\in E\}$ the out-neighborhood of a position $v\in V$ (and thus we have $N^+(v)=\emptyset$ for all terminals $v\in V_T$). Note also that since all plays reaching a terminal end there, we can merge all terminals into a single terminal position without any loss of generality. Thus, in this section we assume that $V_T=\{v_t\}$.

Let us consider the subgraphs induced by vertex sets $V_i$, $i\in I$. Each such subgraph can uniquely be decomposed into strongly connected subgraphs. We denote by $\mathcal {Q}=\{Q_1,Q_2,\ldots \}$ the family of all such components for all $i\in I$. For sake of simplicity, we consider $\{v_t\}$ also as one of those components. Note that we adopt the name “component” for these subgraphs, even though they may not be strongly connected components of our graph. They are strongly connected components of the subgraphs, induced by $V_i$, $i\in I$ and $V_T=\{v_t\}$.

We use i(v) to denote the player who controls vertex v. We extend this notation even for the terminal node for notational convenience and assume $i(v_t)\not \in I$, even though $v_t$ has no outgoing arcs, and no player controls it. We denote by $i(Q)\in I$ the player who controls the vertices in such a component $Q\in \mathcal {Q}$. Note that, due to the symmetric nature of the graph and the above definition of the components, we have the following property:

(A)
If $u\in Q$, $v\in Q'$, $Q\ne Q'$ and $(u,v)\in E$ then $i(Q)\ne i(Q')$.

Since a component $Q\in \mathcal {Q}$ is strongly connected, for any two vertices $u,v\in Q$ player i(Q) can create directed path(s) between these vertices. We denote by $d^{i(Q)}(u,v)$ the length of a shortest $u\rightarrow v$ path within Q, where the i(Q)-length is used to measure the length of the edges. Note that we do not assume the symmetry of the lengths, thus $d^{i(Q)}(u,v)$ and $d^{i(Q)}(v,u)$ may be different values.

Let us define $Q(v)\in \mathcal {Q}$ to be the component containing vertex $v\in V$.

For a $v_0\rightarrow v_t$ path P let us call a sequence $\{v_i,v_{i+1},\ldots v_{i+j}\}$ a subpath of P. The sets $P\cap Q$, $Q\in \mathcal {Q}$ partition P into a number of such subpaths. We denote by q(P) the number of these subpaths. In other words, if we denote by $Q(P)_j$, $j=1,...,q$ the components that P intersects as we follow P from $v_0$ to $v_t$, then $q=q(P)$, and $Q(P)_j\ne Q(P)_{j+1}$ for all $j=1,...,q-1$. Note that we may have $Q(P)_j=Q(P)_k$ for some j and $k\ge j+2$.

2.1 Special paths

Let us now consider a $v_0\rightarrow v_t$ path P in G, a player $i\in I$, and the subgraph G(i, P) of G consisting of the moves in P and the moves $(u,v)\in E$ with $u\in V_i$. We say that P is i-special if P is a shortest $v_0\rightarrow v_t$ path in G(i, P) for player i.

Lemma 1

There exists a $v_0\rightarrow v_t$ path P satisfying the following conditions:

(B)
q(P) is the smallest among all $v_0\rightarrow v_t$ paths, and
(C)
P is i-special for all $i\in I$.

Proof

Let us first associate a new edge length $\lambda : E\mapsto \{0,1\}$ by defining

$$\begin{aligned} \lambda (u,v) ~=~ {\left\{ \begin{array}{ll} 1 &{} \text { if } Q(u)\ne Q(v),\\ 0 &{} \text { if } Q(u)= Q(v), \end{array}\right. } \end{aligned}$$

and choose a $\lambda$-shortest $v_0\rightarrow v_t$ path P. By abusing our notations, we shall use P to denote both the set of vertices of P and also the set of edges in P. We hope that from the context it will always be unambiguous what we mean.

Note that q(P) is minimum among all $v_0\rightarrow v_t$ paths, and that P does not cross any component $Q\in \mathcal {Q}$ twice. This is because Q is strongly connected and the $\lambda$-cost of a path inside Q is zero, and outside is positive. Let us observe that the minimality of q(P) implies the following property:

(D)
For all $v\in Q(P)_j$, $1\le j\le q(P)-2$ and $j+2\le k \le q(P)$ we have $N^+(v)\cap Q(P)_k=\emptyset$.

Let us introduce $P[j]=P\cap Q(P)_j$ for $j=1,...,q(P)$ and note that the edges in P leaving vertices in P[j] form a subpath ending in $Q(P)_{j+1}$. Let us next define

$$\begin{aligned} r(P)_j~=~ \sum _{\begin{array}{c} u\in P[j]\\ (u,v)\in P \end{array}} \ell ^{i(Q(P)_j)}(u,v) \end{aligned}$$

for $j=1,...,q(P)-1$, and set $r(P)=(r(P)_1,...,r(P)_{q(P)-1})$. Note that in a way r(P) measures the length of P such that the length of every move is measured by the length function of the player who controls that move.

Assume now that P is an arbitrary path that satisfies property (B). Our arguments above show that there are such paths. Assume also that it does not satisfy property (C), that is there exists a player $i\in I$ that can improve on it. It means that there is an index $1\le j < q(P)$ with $i=i(Q(P)_j)$ and a position $v\in P[j]$ such that player i can deviate from P at v, return to P in $Q(P)_j\cup Q(P)_{j+1}$ (due to property (D)) and make the i-length of the improved path $P'$ shorter than the i-length of P. Note that if $P'$ returns to P at or before the first position of $P[j+1]$, then we must have $r(P')_j<r(P)_j$ and $r(P')_k=r(P')_k$ for all $k>j$. If $P'$ returns to P at or after the second position of $P[j+1]$ (in case $P[j+1]$ has more than one vertex), then we must have $r(P')_{j+1}<r(P)_{j+1}$ and $r(P')_k=r(P)_k$ for all $k>j+1$. Note that in the second case we may have $r(P')_j>r(P)_j$. In both cases however, the vector $r(P')$ is smaller than r(P) in the reverse lexicographic order. Note furthermore that $P'$ must also satisfy property (B).

Since we have only finitely many different paths, and thus r(P) vectors, after finitely many improvement steps we must arrive to a path that satisfies both (B) and (C), as claimed. $\square$

Let us call a path P satisfying both (B) and (C) a special path. By the above lemma such a special path exists. Let us now fix one $v_0\rightarrow v_t$ special path P.

2.2 Extending a special path to a NE

Let us introduce for $j=1,...,q(P)$ the sets

$$\begin{aligned} U(P)_j ~=~ \bigcup _{k=1}^j Q(P)_k, \end{aligned}$$

and let $u_j\in Q(P)_j$ be the first vertex on $P\cap Q(P)_j$. Recall that for a vertex $u\in Q(P)_j$ and player $i=i(Q(P)_j)$ we denote by $d^i(u_j,u)$ the i-distance from $u_j$ to u inside component $Q(P)_j$.

We are ready now to define strategies for the players.

(i)
If $v\in P$ then we choose edge $(v,u)\in P$.
(ii)
If $v\in V\setminus P$ and $N^+(v)\cap U(P)_{q(P)}\ne \emptyset$, then
1. (ii-1)
  choose the smallest index k such that $N^+(v)\cap Q(P)_k\ne \emptyset$ and set $i=i(Q(P)_k)$;
2. (ii-2)
  choose a vertex $u\in N^+(v)\cap Q(P)_k$ that minimizes $d^i(u_k,u)$;
3. (ii-3)
  choose edge (v, u).
(iii)
If $v\in V\setminus P$ and $N^+(v)\cap U(P)_{q(P)}=\emptyset$, then choose an arbitrary edge $(v,u)\in E$.

Let us denote by $\sigma (P)=\{\sigma _i(P)\mid i\in I\}$ one of the situations defined in this way.

Lemma 2

If P is a special $v_0\rightarrow v_t$ path, then $\sigma (P)$ is a NE.

Proof

Assume for contradiction that a player $i\in I$ can deviate and improve on P. This means that there is a component $Q(P)_j$ controlled by $i=i(Q(P)_j)$ and there is a vertex in $P\cap Q(P)_j$ such that player i can deviate from P starting at this vertex and create a new path $P'$ such that the i-length of $P'$ is smaller than the i-length of P. We can assume that $P'$ is the shortest path (according to $\ell ^i$) player i can create, and that $P\cap Q(P)_k=P'\cap Q(P)_k$ for all $k<j$.

In particular, $P'$ cannot return to $P\cap U(P)_{j-1}$. Note first that if $u\in U(P)_{j-1}{\setminus } P$, $i(u)\ne i$, and $(u,v)\in \sigma (P)$, then we must have $v\in U(P)_{j-1}$ by our definition of $\sigma$, since our graph is edge-symmetric. Note next that if $u\in U(P)_{j-1}$, and $i(u)=i$, then $u\in U(P)_{j-2}$. These together imply that if $P'$ enters $U(P)_{j-1}$ then it cannot reach $v_t$.

Let us now focus on the segment of $P'$ that is outside of $U(P)_{j-1}$ (note that this may be the entire path $P'$ if $j=1$.) Let us denote the vertices along this segment of $P'$ by $u_j=w_0$, $w_1$,..., $w_m=v_t$. Let k be the smallest index such that $w_k\not \in Q(P)_j$. By property (C) and the fact that $P'$ is the shortest path that player i can create, we can conclude that $w_k\not \in P$. Then we must have $i(w_k)\ne i$ by the definition of the components, thus we have $(w_k,w_{k+1})\in \sigma (P)$. By the symmetric nature of our graph, by part (ii) of our definition of $\sigma$, and by the fact that $P'$ does not enter $U(P)_{j-1}$ we must have $w_{k+1}\in Q(P)_j$. Since $N^+(w_k)\cap Q(P)_j\supseteq \{w_{k-1},w_{k+1}\}$, we get that (ii-2) of the definition of $\sigma$ implies $d^i(u_j,w_{k+1})\le d^i(u_j,w_{k-1})$. Since player i can create such a shortest $u_j\rightarrow w_{k+1}$ path inside $Q(P)_j$ he could replace $w_0,...,w_{k+1}$ with this path, and create another path $P''$ that is strictly shorter (by at least $\ell ^i(w_{k-1},w_k)+\ell ^i(w_k,w_{k+1})$) than $P'$, contradicting the fact that $P'$ is the shortest path this player can create deviating from P inside $Q(P)_j$. This contradiction proves our claim. $\square$

Proof of Theorem 1

The claim follows by Lemmas 1 and 2. $\square$

3 Terminal games and shortest path games

It is easy to see that terminal games satisfying condition (CIW) of Theorem 3 can also be viewed as positive shortest path games.

Theorem 4

Assume that $\Gamma =(G,v_0,\mathcal {L})$ is a terminal game that satisfies condition (CIW). Then there exist positive local costs functions $\ell ^i:E\mapsto \mathbb {R}$ for $i\in I$ such that $\Gamma '=(G, v_0, \ell )$ is a positive shortest path game such that any situation $\sigma$ that is a terminal NE of $\Gamma '$ is also a NE of $\Gamma$.

Proof

Let us note first that condition (CIW) implies that $\mathcal {L}^i(v)<0$ for all terminals $v\in V_T$ and players $i\in I$. Since we have only finitely many terminals and players, we can assume w.l.o.g. that $\mathcal {L}^i(v)$ are all negative integers for all terminals $v\in V_T$ and players $i\in I$, and there exists a positive integer M such that

$$\begin{aligned} -M ~<~ \mathcal {L}^i(v) <0 ~~~\text { for all } i\in I, \text { and } v\in V_T. \end{aligned}$$

Let us next define the local costs for the players $i\in I$ and moves $(u,v)\in E$ by

$$\begin{aligned} \ell ^i(u,v) ~=~ {\left\{ \begin{array}{ll} \frac{1}{2\vert E\vert } &{} \text { for all moves } (u,v)\in E, ~v \not \in V_T,\\ M+\mathcal {L}^i(v) &{} \text { for all moves } (u,v)\in E, ~v \in V_T. \end{array}\right. } \end{aligned}$$

Thus, for an arbitrary situation $\sigma$ for which $P(\sigma )$ is a finite play terminating at $v\in V_T$ and for a player $i\in I$ we have

$$\begin{aligned} \ell ^i(\sigma )~=~ \sum _{(u,v)\in P(\sigma )} \ell ^i(u,v) ~=~ \frac{\vert P(\sigma )\vert -1}{2\vert E\vert } + (M+\mathcal {L}^i(v)) \end{aligned}$$

implying that for all situations $\sigma$ with a finite play and for all players $i\in I$ we have

$$\begin{aligned} M+\mathcal {L}^i(\sigma ) ~\le ~ \ell ^i(\sigma ) ~<~ M+\mathcal {L}^i(\sigma ) ~+~ \frac{1}{2}. \end{aligned}$$

Now assume that $\sigma$ is a terminal NE in $\Gamma '$ and let $\sigma '$ be obtained from $\sigma$ by one of the players, say $i\in I$ changing his strategy. If $\sigma '$ is infinite, then $\mathcal {L}^i(\sigma ')>\mathcal {L}^i(\sigma )$. Otherwise, by the above inequalities we can write

$$\begin{aligned} M+\mathcal {L}^i(\sigma ) \le \ell ^i(\sigma ) \le \ell ^i(\sigma ') < M+\mathcal {L}^i(\sigma ')+\frac{1}{2}, \end{aligned}$$

where the second inequality follows from the fact that $\sigma$ is NE in $\Gamma '$. Since the $\mathcal {L}^i$ values are integers, $\mathcal {L}^i(\sigma )\le \mathcal {L}^i(\sigma ')$ follows, proving that $\sigma$ is also a NE in $\Gamma$. $\square$

Note, that in the above construction, $\Gamma$ may have some NE that are not NE in $\Gamma '$.

Note also that $\Gamma$ and $\Gamma '$ in the above statement use the same underlying directed graph. Thus they are simultaneously edge-symmetric or non-edge-symmetric and we can derive the following claim:

Corollary 5

Edge-symmetric terminal games satisfying condition (CIW) have NE. Furthermore, there is a terminal NE whenever $V_T$ is reachable from $v_0$.

Proof

By Theorem 4, an edge-symmetric terminal game satisfying condition (CIW) can be viewed as a positive shortest path game, and thus the claim follows by Theorem 1. $\square$

4 Edge-symmetric terminal games

Let us consider an edge-symmetric terminal game $\Gamma =(G,v_0,\mathcal {L})$, and note that unlike in shortest path games, a loop (that is a move of the form (u, u) for some position $u\in V$) may play a role in a NE, since the infinite play may not be the worse outcome for some players. Consequently, we have $u\in N^+(u)$ whenever $(u,u)\in E$.

Analogously to shortest path games, let us consider the family $\mathcal {Q}$ of strongly connected components of the subgraphs induced by the subsets $V_i$, $i\in I$.

We associate to $\Gamma$ its small version $\Gamma '$ obtained by “merging” the strongly connected components $Q\in \mathcal {Q}$ into single positions. More precisely, we introduce a new position $v_Q$ and set $i(v_Q)=i(Q)$ for all $Q\in \mathcal {Q}$, and define a directed graph $G'$ on the set of positions $V'=V_T\cup \{v_Q\mid Q\in \mathcal {Q}\}$. We define the edge set $E'$ of $G'$ by including edges $(v_Q,v_R)$ for $Q,R\in \mathcal {Q}$, $Q\ne R$ if there are positions $u\in Q$ and $v\in R$ such that $(u,v)\in E$. We also include edge $(v_Q,w)$ for $Q\in \mathcal {Q}$ and $w\in V_T$ if there is a position $u\in Q$ such that $(u,w)\in E$. Furthermore, we include a looping edge $(v_Q,v_Q)$ for all $Q\in \mathcal {Q}$ with $\vert Q\vert \ge 2$. Furthermore, if $Q=\{u\}$ for some $Q\in \mathcal {Q}$ and $(u,u)\in E$, then we also include in $E'$ the loop $(v_Q,v_Q)$. Note that with the above definitions we avoided creating parallel edges, since they would be redundant in a terminal game. Finally, we define $v_0'=v_Q$ if $v_0\in Q$. Note that this small game $\Gamma '=(G',v_0',\mathcal {L})$ is again an edge-symmetric terminal game with the same set of players and cost function as $\Gamma$.

Lemma 3

If the small game $\Gamma '$ has a NE then so does $\Gamma$.

Proof

Assume that $\sigma '\subseteq E'$ is a NE of $\Gamma '$. We associate to $\sigma '$ a situation $\sigma$ in $\Gamma$ as follows.

For every position $v_Q\in V'$, $Q\in \mathcal {Q}$ there exists a unique edge $(v_Q,x)\in \sigma '$. We associate to $(v_Q,x)$ edges of E to be included in $\sigma$ in the following way:

If $x=v_R\ne v_Q$, then by the definition of the small game we have (at least one) corresponding edge $(u,v)\in E$ such that $u\in Q$, $v\in R$. Let us consider one of these (u, v) edges, and declare position $u=u^Q$ the root of the strong component $Q\in \mathcal {Q}$. Since $Q\in \mathcal {Q}$ is a strong component, we have a directed tree $T^Q$ rooted at $u^Q\in Q$ such that from every vertex $u'\in Q$ we have a unique directed path from $u'$ to $u^Q$ via the edges of $T^Q$. Let us then include in $\sigma$ the edges of $T^Q$ and edge $(u,v)=(u^Q,v)$.

If $x=v_Q$ and $\vert Q\vert \ge 2$, then (by the definition of a component) Q is a strongly connected subgraph with at least two vertices, $u,v\in Q$, $u\ne v$. Let us declare $u=u^Q$ the root of Q and consider a directed tree $T^Q$ on the vertices of Q such that from all other vertices of Q there is a unique path to $u_Q$. Let us then include in $\sigma$ the edges of $T^Q$ and the edge $(u^Q,v)$.

Finally, if $x=v_Q$ and $Q=\{u\}$, then by the definition of the small game we must have $(u,u)\in E$, and we include this loop in $\sigma$.

Note that we have $\mathcal {L}^i(\sigma )=\mathcal {L}^i(\sigma ')$ for all $i\in I$. Furthermore, for all positions $v\in Q\in \mathcal {Q}$ player $i(v)=i(Q)$ can reach the same set of terminals (and/or the infinite play) in $\Gamma$ and $\Gamma '$, assuming all other players keep their strategies. Thus, since $\sigma '$ is a NE in $\Gamma '$, situation $\sigma$ must also be a NE in $\Gamma$. $\square$

Proof of Theorem 2

Assume that $\Gamma =(G, v_0, \mathcal {L})$ is an edge-symmetric terminal game. By Lemma 3 we can assume that $\vert Q\vert =1$ for all $Q\in \mathcal {Q}$, or in other words that for all moves $(u,v)\in E$ we have $i(u)\ne i(v)$, unless $u = v$.

Clearly, if $V_T$ is not reachable from $v_0$ then any situation is a NE. Assume for the rest of our proof that $V_T$ is reachable from $v_0$.

For all positions $v\in V$ with $N^+(v)\cap V_T\ne \emptyset$ let us denote by $t(v)\in V_T$ a terminal that is a most preferred by player i(v) in $N^+(v)\cap V_T$.

We consider the following cases:

Case 1: $N^+(v_0)\cap V_T=\emptyset$ and $\exists ~ v_1\in N^+(v_0)$ such that $N^+(v_1)\cap V_T=\emptyset$.

In this case we can construct a NE $\sigma$ resulting in an infinite play. We include in $\sigma$ the moves $(x,v_0)$ for all $x\in N^+(v_0)$, $(v_0,v_1)$, and $(y,v_1)$ for all $y\in N^+(v_1){\setminus } (N^+(v_0)\cup \{v_0\})$. For all other positions we include an arbitrary move. Here $P(\sigma )$ is the infinite play $v_0\rightarrow v_1\rightarrow v_0$, and neither $i(v_0)$ nor $i(v_1)$ can achieve a better outcome. Note that $v_0=v_1$ is possible in this case.
Case 2: $N^+(v_0)\cap V_T=\emptyset$ and $N^+(v)\cap V_T\ne \emptyset$ for all $v\in N^+(v_0)$.

Let us define $v_1\in N^+(v_0)$ as a position such that $t(v_1)$ is one of the most preferred terminals for $i(v_0)$ in the subset $\{t(v)\mid v\in N^+(v_0)\}\subseteq V_T$.
Subcase 2.1: Player $i(v_1)$ prefers $t(v_1)$ to an infinite play.

We can construct a NE $\sigma$ in this case as follows. We include the move $(v_0,v_1)$, the moves $(x,v_1)$ for all $x\in N^+(v_1) {\setminus } V_T$, the moves (v, t(v)) for all $v\in N^+(v_0){\setminus } N^+(v_1)$, and arbitrary moves for all other positions. Now the play $P(\sigma )$ is the path $v_0\rightarrow v_1\rightarrow t(v_1)$ yielding $\mathcal {L}^i(\sigma )=\mathcal {L}^i(t(v_1))$ for all players $i\in I$. Only two players are involved in the play, $i(v_0)$ and $i(v_1)$. If $i(v_1)$ deviates from $\sigma$ then he can get either another terminal in $N^+(v_1)\cap V_T$ or the infinite play, and neither one is better for him than $t(v_1)$. If $i(v_0)$ deviates then he can get one of the terminals $\{t(v)\mid v\in N^+(v_0)\}$, and by our choice, none of them is better for him than $t(v_1)$.
Subcase 2.2: For player i, the cost of $t(v_1)$ is at least that of the infinite play.

We can construct a NE $\sigma$ in this case as follows. We include the moves $(x,v_0)$ for all $x\in N^+(v_0)$, the moves $(y,v_1)$ for all $y\in N^+(v_1)\setminus V_T$, and an arbitrary move for all other positions. Now $P(\sigma )$ is the infinite play $v_0\rightarrow v_1\rightarrow v_0$. Player $i(v_0)$ can only deviate to another infinite play. Player $i(v_1)$ can deviate to either another infinite play, or to a terminal in $N^+(v_1)\cap V_T$, but those are not better for him then $t(v_1)$, which is not better in this case than an infinite play.
Case 3: $N^+(v_0)\cap V_T\ne \emptyset$ and $N^+(v_0)\not \subseteq V_T$.

Let us now create a smaller subgame $\Gamma '=(G',v_0,\mathcal {L})$ by deleting from $\Gamma$ the moves $(v_0,w)$ for $w\in N^+(v_0)\cap V_T$. Since Case 1 or 2 holds for $\Gamma '$, we have a NE $\sigma '$ in $\Gamma '$ by the previous arguments.
Subcase 3.1: $\mathcal {L}^{i(v_0)}(\sigma ')\le \mathcal {L}^{i(v_0)}(t(v_0))$.

In this case $\sigma '$ is also a NE in $\Gamma$, since only player $i(v_0)$ gained extra options, and all of those options would take him to a terminal in $N^+(v_0)\cap V_T$, which are assumed to be not better in this case for $i(v_0)$ than the outcome of $\sigma '$.
Subcase 3.2: $\mathcal {L}^{i(v_0)}(\sigma ')> \mathcal {L}^{i(v_0)}(t(v_0))$.

Let us modify $\sigma '$ by replacing the move from $v_0$ by $(v_0,t(v_0))$, and denote by $\sigma$ the obtained situation of $\Gamma$. We claim that $\sigma$ is a NE in $\Gamma$ in this case. This is because $P(\sigma )$ is the single move $v_0\rightarrow t(v_0)$, and any deviation from $\sigma$ by player $i(v_0)$ yields either another terminal in $N^+(v_0)\cap V_T$, which is not better than $t(v_0)$, or it yields $\sigma '$ or a situation obtainable by a deviation from $\sigma '$ in $\Gamma '$. Since $\sigma '$ is a NE in $\Gamma '$ none of these can give a better outcome to player $i(v_0)$ than $\sigma '$ which is not better than $t(v_0)$ by our assumptions in this case.
Case 4: $N^+(v_0)\subseteq V_T$.

In this case choosing $\sigma _{i(v_{0})}(v_0) = t(v_0)$ trivially gives a NE.

$\square$

5 Existence of UNE in edge-symmetric terminal games

In this section we prove Theorem 3 claiming that under conditions (TWO), (SYM), and (CIW) a terminal game has a UNE.

We consider an edge-symmetric terminal game $\Gamma =(G,\mathcal {L})$ in which no initial position is fixed, and assume that $I=\{1,2\}$. For a situation $\sigma$ and arbitrary position $v\in V$ we denote by $P(\sigma ,v)$ the unique walk starting from position v and following the moves in situation $\sigma$. Such a walk either terminates in a terminal position $w=w(\sigma ,v)\in V_T$, or is infinite, cycling around a directed cycle infinitely many times. Accordingly, we define the effective cost for player $i\in I$ for a position $v\in V$ by

$$\begin{aligned} \mathcal {L}^i(\sigma ,v) ~=~ {\left\{ \begin{array}{ll} \mathcal {L}^i(w(\sigma ,v)) &{}\text { if } P(\sigma ,v) \text { is terminating, and }\\ 0 &{} \text { if } P(\sigma ,v) \text { is infinite.} \end{array}\right. } \end{aligned}$$

Note that the above definition and condition (CIW) imply that we have $\mathcal {L}^i(w)<0$ for all players $i\in I$ and terminals $w\in V_T$.

Given a player $i\in I$ we use $\sigma ^{-i}$ to denote the strategy of the opponent. Thus, we can write $\sigma =(\sigma ^{-i},\sigma _i)$ for an arbitrary situation and player $i\in I$, where $\sigma _i\in \Sigma _i$ is a strategy for player $i\in I$. Note that in this section we have $\vert I\vert =2$, and thus for a situation $\sigma =(\sigma _1,\sigma _2)$ we have $\sigma ^{-1}=\sigma _2$ and $\sigma ^{-2}=\sigma _1$.

Given a player $i\in I$ and $\sigma ^{-i}$ a strategy $\sigma _i\in \Sigma _i$ is called a uniform best response of player i to $\sigma ^{-i}$ if the equality

$$\begin{aligned} \mathcal {L}^i((\sigma ^{-i},\sigma _i),v) ~=~ \min _{\sigma _i'\in \Sigma _i}\mathcal {L}^i((\sigma ^{-i},\sigma _i'),v) \end{aligned}$$

holds for all positions $v\in V$. Note that a uniform best response always exists, can be determined efficiently, and may not be unique.

We call a situation $\sigma =(\sigma _i\mid i\in I)$ a uniform Nash equilibrium (or UNE, in short), if $\sigma _i$ is a uniform best response of player i to $\sigma ^{-i}$ for all players $i\in I$.

Given a situation $\sigma =(\sigma ^{-i},\sigma _i)$ we say that strategy $\sigma _i'$ is a uniform best improvement for player i if $\sigma _i'$ is a uniform best response to $\sigma ^{-i}$, different from $\sigma _i$, and for all positions $v\in V_i$ we have either $\sigma _i(v)=\sigma _i'(v)$ or $\mathcal {L}^i((\sigma ^{-i},\sigma _i'),v) < \mathcal {L}^i((\sigma ^{-i},\sigma _i),v)$. Note that every uniform best improvement is a uniform best response, but not necessarily the other way around. Furthermore, if a situation is not a UNE, then at least one of the players have a uniform best improvement (and it can be determined efficiently). Let us also note that a uniform best improvement may not be unique in case players have ties over the set of terminals.

Our first step is to show that it is enough to prove Theorem 3 for simplified, special edge-symmetric terminal games.

Lemma 4

If $\Gamma =(G,\mathcal {L})$ is an edge-symmetric terminal game, then we can assume w.l.o.g. that edges are between different vertices controlled by different players, i.e.,

$$\begin{aligned} (u,v)\in E, ~v\not \in V_T ~~~\implies ~~~ i(u)\ne i(v). \end{aligned}$$

(1)

We can also assume that from each position we have at most one terminal move, i.e.,

$$\begin{aligned} \vert N^+(v)\cap V_T\vert ~\le ~ 1 ~~~\text { for all } v\in V. \end{aligned}$$

(2)

We can assume further that the terminals are reachable from each position $v\in V$, i.e.,

$$\begin{aligned} \text { for all }~~ v\in V ~~\text { there exist a }~~ v\rightarrow V_T ~~\text { path.} \end{aligned}$$

(3)

Proof

For condition (1) let us also note that Lemma 3 with the same proof works also for UNE (instead of NE).

For the inequalities (2) note first that if $x,y\in N^+(v)\cap V_T$, $x\ne y$, and $\mathcal {L}^{i(v)}(x)<\mathcal {L}^{i(v)}(y)$ then the move (v, y) can not belong to a UNE. Furthermore, if $\mathcal {L}^{i(v)}(x)=\mathcal {L}^{i(v)}(y)$ and $\sigma '$ is a UNE in $\Gamma '$ obtained from $\Gamma$ by deleting move (v, y), then $\sigma '$ is also a UNE in $\Gamma$. This is because only player i(v) has an option in $\Gamma$ that is not available in $\Gamma '$ (namely the move (v, y)) but that move cannot be part of a uniform best improvement of player i(v) to $\sigma '^{-i(v)}$ since the move (v, x) is available for player i(v) in $\Gamma '$, $\sigma '$ is a UNE in $\Gamma '$, and $\mathcal {L}^{i(v)}(x)=\mathcal {L}^{i(v)}(y)$.

For property (3) let us consider the set of positions $U\subseteq V$ such that no directed path connects a vertex $u\in U$ to the set of terminals $V_T$. Then, assigning arbitrary moves to positions in U does not change whether a situation is a UNE or not. Since identifying the set of positions from which the set of terminals is not reachable is a computationally easy task, and since those positions have no influence on the existence of a UNE, we can assume w.l.o.g. that we preprocess the input game, and delete all such positions, before any further analysis. $\square$

Lemma 5

Assume that $\Gamma =(G,\mathcal {L})$ is an edge-symmetric terminal game, satisfying conditions (1) and $\sigma$ is a situation such that for one of the players, $i\in I$, strategy $\sigma _i$ is a uniform best response to $\sigma ^{-i}$. Assume further that $u\in V_j$, $v\in V_i$ are distinct positions with $j\ne i$ that are connected by edges in G. Then we have

$$\begin{aligned} \mathcal {L}^i(\sigma ,v) ~\le ~ \mathcal {L}^i(\sigma ,u). \end{aligned}$$

Proof

Let us note first that if (v, u) belongs to $\sigma _i$ or $P(\sigma ,u)$ contains position v, then we must have $\mathcal {L}^i(\sigma ,v) = \mathcal {L}^i(\sigma ,u)$ by the definition of a terminal game.

Assume next that the move (v, u) does not belong to $\sigma _i$ and position v does not belong to $P(\sigma ,u)$. Let us define a new strategy $\sigma _i'$ for player i by

$$\begin{aligned} \sigma _i'(w) ~=~ {\left\{ \begin{array}{ll} \sigma _i(w) &{} \text { for all } w\in V_i\setminus \{v\},\\ u &{} \text { for } w=v. \end{array}\right. } \end{aligned}$$

With this definition we have

$$\begin{aligned} \mathcal {L}^i(\sigma ,u) ~=~ \mathcal {L}^i((\sigma ^{-i},\sigma _i'),u) ~=~ \mathcal {L}^i((\sigma ^{-i},\sigma _i'),v) ~\ge ~ \mathcal {L}^i((\sigma ^{-i},\sigma _i),v). \end{aligned}$$

Here the first two equalities follow by the definition of $\sigma _i'$ and the fact that position v does not belong to $P(\sigma ,u)$. The last inequality follows by the assumption that $\sigma _i$ is a uniform best response to $\sigma ^{-i}$. $\square$

Lemma 6

Assume that $\Gamma =(G,\mathcal {L})$ is a terminal game satisfying condition (CIW), and $\sigma$ is a situation such that the plays $P(\sigma ,v)$ are finite for all $v\in V$. Then, if $\sigma '$ is obtained from $\sigma$ by a uniform best improvement of one of the players, the plays $P(\sigma ',v)$, $v\in V$ are also all finite.

Proof

This is an immediate consequence of condition (CIW) and the definition of uniform best improvement. $\square$

Our proof of Theorem 3 is based on an iterative process, the finiteness of which depends on a strictly monotone decreasing measure of progress. The proof of that monotonicity depends critically on the following definition.

Let us assume, for the rest of this section, that by (2) of Lemma 4 we have $\vert N^+(v)\cap V_T\vert \le 1$ for all $v\in V$. Let us then denote by $t(v)\in N^+(v)\cap V_T$ the unique terminal for vertices $v\in V$ with $N^+(v)\cap V_T\ne \emptyset$.

Let us call a situation $\sigma$ i-basic for a player $i\in I$ if for all positions $v\in V$ with $N^+(v)\cap V_T\ne \emptyset$ we either have $\sigma (v)=t(v)$ or $\mathcal {L}^{i(v)}(\sigma ',v)<\mathcal {L}^{i(v)}(t(v))$ hold for all situations $\sigma '$ obtained from $\sigma$ by a uniform best improvement of player i. In other words, a situation is i-basic if a uniform best improvement by player i provides every position that it controls and has an unused terminal move with a strictly better outcome than that terminal move would provide.

Lemma 7

Assume that $\Gamma =(G,\mathcal {L})$ is an edge-symmetric terminal game that satisfies the conditions in Lemma 4. Then we can efficiently find a situation $\sigma$ that is i-basic for all players $i\in I$, and for which the plays $P(\sigma ,v)$ are finite for all $v\in V$.

Proof

We can construct a situation $\sigma$ satisfying the claimed properties by the following approach. Let us define $W=\{v\in V\mid N^+(v)\cap V_T\ne \emptyset \}$ and note that by our assumptions we have a terminal $t(v)\in V_T$ for all $v\in W$ such that $N^+(v)\cap V_T=\{t(v)\}$ holds. Let us now choose the moves (v, t(v)) for all positions $v\in W$, and color all positions in $W\cup V_T$ blue. In the sequel, while there exists an uncolored position $v\in V$ from which we can reach a blue position u in one move, we choose the move (v, u) and color v blue. In a finite number of steps this procedure will stop, and all positions will be colored blue, since we assume that there is a finite directed path from all positions to the set of terminals. In this way we define a situation $\sigma$ such that all plays $P(\sigma ,v)$ are finite.

To see that $\sigma$ is i-basic, for all players $i\in I$, let us consider one of the players i, and derive $\sigma '$ by applying a uniform best improvement of player i to $\sigma$. Now, for a position $v\in W\setminus V_i$ we have $\sigma '(v)=\sigma (v)=t(v)$ by our definitions of $\sigma$ and $\sigma '$, while for a position $v\in W\cap V_i$ we have either $\sigma '(v)=\sigma (v)=t(v)$ or $\mathcal {L}^i(\sigma ',v)<\mathcal {L}^i(\sigma ,v)=\mathcal {L}^i(t(v))$, by the definition of a uniform best improvement. $\square$

Lemma 8

Assume that $\Gamma =(G,\mathcal {L})$ is a 2-person edge-symmetric terminal game satisfying conditions (CIW), conditions in Lemma 4, and that $\sigma$ is a 1-basic situation such that all plays $P(\sigma ,v)$ are finite. Assume further that $\sigma ^1$ is obtained from $\sigma$ by a uniform best improvement of player 1, and $\sigma ^2$ is obtained from $\sigma ^1$ by a uniform best improvement of player 2. Then we have the inequalities

$$\begin{aligned} \mathcal {L}^{i(v)}(\sigma ^2,v) ~\le ~ \mathcal {L}^{i(v)}(\sigma ^1,v) \end{aligned}$$

for all positions $v\in V$.

Proof

Since $\sigma ^2$ is obtained from $\sigma ^1$ by a uniform best improvement of player 2, the claim follows for all $v\in V_2$ by the definition of a uniform best improvement.

Let us next consider positions $v\in V_1$. If $P(\sigma ^1,v)=P(\sigma ^2,v)$, then we have the equality $\mathcal {L}^{1}(\sigma ^{2},v) = \mathcal {L}^{1}(\sigma ^1,v)$.

Assume for the rest of our proof that $P(\sigma ^2,v)\ne P(\sigma ^1,v)$. Since $P(\sigma ^2,v)$ is a finite play by Lemma 6, positions controlled by player 1 and 2 alternate along this finite path, due to our assumption (1). Since $P(\sigma ^2,v)\ne P(\sigma ^1,v)$, we must have some positions $w\in V_2$ on the path $P(\sigma ^2,v)$ such that $\sigma ^2_2(w)\ne \sigma ^1_2(w)$. Let us number all such positions as $w_1,..., w_k\in V_2$ for some $k\ge 1$, in the order we pass through them along the path $P(\sigma ^2,v)$, when we start from v. Since $\sigma$ is 1-basic, we cannot have $\sigma ^2(w_k)\in V_T$, and thus we have $v_j:=\sigma ^2(w_j)\in V_1$ for all indices $j=1,...,k$. Now, let us observe that the moves $(v_j,w_j)$ exist, since this is an edge-symmetric terminal game, and thus they were considered in the uniform best improvement $\sigma ^1$, for all $j=1,...,k$. Since they were not chosen, we must have the inequalities

$$\begin{aligned} \mathcal {L}^1(\sigma ^1,v_j) ~\le ~ \mathcal {L}^1(\sigma ^1,w_j) \end{aligned}$$

for all $j=1,...,k$ by Lemma 5 applied for the moves $(v_j,w_j)$ with player $i=1$, $j=1,...,k$. We also have the equalities

$$\begin{aligned} \mathcal {L}^1(\sigma ^1,v_j)=\mathcal {L}^1(\sigma ^1,w_{j+1}) \end{aligned}$$

for $j=1,...,k-1$, due to our selection of vertices $w_j$, $j=1,...,k$. These two groups of equations and inequalities imply that

$$\begin{aligned} \mathcal {L}^1(\sigma ^1,w_1) ~\ge ~ \mathcal {L}^1(\sigma ^1,v_k). \end{aligned}$$

Since we also have $\mathcal {L}^1(\sigma ^2,v)=\mathcal {L}^1(\sigma ^2,v_k)=\mathcal {L}^1(\sigma ^1,v_k)$ and $\mathcal {L}^1(\sigma ^1,v)=\mathcal {L}^1(\sigma ^1,w_1)$, the stated inequalities follow. $\square$

Lemma 9

Assume that $\Gamma =(G,\mathcal {L})$ is a 2-person edge-symmetric terminal game satisfying conditions (CIW), conditions in Lemma 4, and that $\sigma$ is a 1-basic situation in which all plays $P(\sigma ,v)$ are finite. Assume further that $\sigma ^1$ is obtained from $\sigma$ by a uniform best improvement of player 1. Then situation $\sigma ^1$ is 2-basic.

Proof

Let us apply a uniform best improvement of player 2 to $\sigma ^1$, and denote the obtained situation by $\sigma ^2$.

Let us define $W=\{v\in V\mid N^+(v)\cap V_T\ne \emptyset \}$ and note that by Lemma 4 we have a terminal $t(v)\in V_T$ for all $v\in W$ such that $N^+(v)\cap V_T=\{t(v)\}$ holds.

Now, for a position $v\in W\cap V_2$ we have either $\sigma ^2(v)=\sigma (v)$ and thus $\mathcal {L}^2(\sigma ^2,v)=\mathcal {L}^2(\sigma ,v)$, or we have $\sigma ^2(v)\ne \sigma ^1(v)=\sigma (v)$ and thus $\mathcal {L}^2(\sigma ^2,v) <\mathcal {L}^2(\sigma ^1,v)=\mathcal {L}^2(\sigma ,v)$ by the definition of a uniform best improvement. In both cases the claimed property follows by our assumption that $\sigma$ is 1-basic.

Finally, for positions $v\in W\cap V_1$ we have $\sigma ^2(v)=\sigma ^1(v)$. Thus, we either have $\sigma ^2(v)=\sigma ^1(v)=t(v)$, or $\sigma ^1(v)\ne t(v)$, in which case we have $\mathcal {L}_1(\sigma ^1,v)<\mathcal {L}^1(t(v))$ by the definition of a uniform best improvement. In this latter case we apply Lemma 8 and conclude $\mathcal {L}^1(\sigma ^2,v)\le \mathcal {L}^1(\sigma ^1,v)<\mathcal {L}^1(t(v))$. $\square$

Proof of theorem 3

Let us first recall that the effective cost of an infinite play is defined as 0, and thus condition (CIW) is equivalent with saying that $\mathcal {L}^i(w)<0$ for all players $i\in I$ and terminals $w\in V_T$. We also assume that the simplifying conditions in Lemma 4 hold.

Our proof idea is to show that if players alternate in making their uniform best improvements, then after a finite number of such iterations we arrive to a situation on which neither of the players can improve, showing that it is a UNE. While this idea would work starting with an arbitrary situation, our proof becomes simpler if we choose a special initial situation $\sigma ^0=(\sigma ^0_1,\sigma ^0_2)$, in which all plays are finite and which is 1-basic. According to Lemma 7 such a situation exists and is easy to construct.

After this, players, starting with player 1, alternate in making their uniform best improvements, as long as there is some change. This creates a series of situations, $\sigma ^j$, $j=1,2,...$. To be very precise, situation $\sigma ^j$ for an odd $j\ge 1$ is obtained from $\sigma ^{j-1}$ by a uniform best improvement of player 1, while for an even $j\ge 2$ it is obtained from $\sigma ^{j-1}$ by a uniform best improvement of player 2.

Note that by our definitions, if a player has a uniform best improvement, it actually improves strictly his local costs in at least some of the positions it controls, and degrades in neither. If the above process terminates, the last situation is a UNE by definition.

On the other hand, a uniform best improvement by one of the players may actually increase some of the local costs for the other player, and thus the above procedure may start cycling through a number of situations, never terminating. To prove our theorem, we are going to show that such cycling cannot happen. To this end, we associate a quantity that can take only finitely many different values to every situation, and show that this decreases strictly in every step in the above process.

To a situation $\sigma$ let us associate for $i\in I$

$$\begin{aligned} \nu ^i(\sigma ) ~=~ \sum _{v\in V_i} \mathcal {L}^i(\sigma ,v), \end{aligned}$$

and define the quantity associated to situations $\sigma$ as

$$\begin{aligned} \nu (\sigma ) ~=~ \nu ^1(\sigma )+\nu ^2(\sigma ). \end{aligned}$$

We claim that for all $j\ge 1$ we have

$$\begin{aligned} \nu (\sigma ^{j+1}) ~<~ \nu (\sigma ^j). \end{aligned}$$

(4)

Note that by the definition of a uniform best improvement we have

$$\begin{aligned} \begin{array}{rl} \nu ^1(\sigma ^{j+1}) &{}<~ \nu ^1(\sigma ^j) ~~~\text { if { j} is even, and } \\ \nu ^2(\sigma ^{j+1}) &{}<~ \nu ^2(\sigma ^j) ~~~\text { if { j} is odd.} \\ \end{array} \end{aligned}$$

Thus, to prove our main claim (4), and consequently the theorem, it is enough to show that for all $j\ge 1$ and positions $v\in V$ we have

$$\begin{aligned} \mathcal {L}^{i(v)}(\sigma ^{j+1},v) ~\le ~ \mathcal {L}^{i(v)}(\sigma ^j,v). \end{aligned}$$

(5)

Note that since $\sigma ^0$ is 1-basic, this claim follows by Lemma 8 for $j=1$.

By applying Lemma 9, recursively, we can conclude that $\sigma ^j$ for an even $j\ge 2$ are all 1-basic, and for an odd $j\ge 1$ are all 2-basic.

Note also that inequality (5) holds for positions $v\in V_1$ whenever j is even, for $v\in V_2$ whenever j is odd, simply by the definition of a uniform best improvement. For the remaining cases, we can prove (5) by induction on j, starting with $j=1$, and applying Lemma 8, with interchanging the players role’s in it, alternately.

The above arguments prove (5), and thus (4) follows. Since $\nu (\sigma )$ can only take finitely many different values, this completes the proof of our theorem. $\square$

Remark 1

Though our paper is not of algorithmic nature, we would like to remark that such a UNE for an edge-symmetric terminal game $\Gamma$ can be computed efficiently.

First of all, computing a uniform best improvement is equivalent with solving a so-called reachability problem in G, and thus can be done in $O(\vert E\vert )$ time.

A starting strategy $\sigma ^0$ can be computed in $O(\vert V\vert \vert E\vert )$ time, according to the procedure in Lemma 7.

According to our assumptions, as in Lemma 4 and to condition (CIW), we can assume w.l.o.g. that

$$\begin{aligned} -\vert V_T\vert ~\le ~ \mathcal {L}^i(w) ~<~ 0 \end{aligned}$$

for all terminals $w\in V_T$ and players $i\in I$. Thus we have

$$\begin{aligned} -\vert V\vert \vert V_T\vert ~\le ~ \nu (\sigma ) <0 \end{aligned}$$

for all situations $\sigma$. Thus we need only to consider at most $\vert V\vert \vert V_T\vert$ many uniform best improvements before termination, showing that our total complexity is $O(\vert V\vert \vert V_T\vert \vert E\vert )$.

Remark 2

We also mention that this theorem is sharp in the sense that all three listed conditions, (TWO), (SYM), and (CIW) are necessary, as we can demonstrate this by the next three examples:

Consider $\Gamma ^2$ from Boros et al. (2012) (see also Fig. 1). This example satisfies (TWO) and (SYM), but not (CIW) and has no UNE.
Consider $\Gamma ^3$ from Boros et al. (2012) and symmetrize it (see Fig. 2). This example satisfies (SYM) and (CIW), but has three players and has no UNE.
Consider finally $\Gamma ^6$ from Boros et al. (2012) (see also Fig. 3). This example satisfies (TWO) and (CIW), but not (SYM) and has no UNE.

Most of these claims are very easy to check, based on the graphical description of these terminal games given in Figs. 1, 2, and 3, except the last one, where the lack of a UNE is more complicated to check. We refer the reader to Boros et al. (2012) where this claim is demonstrated by describing the $8\times 8$ normal form of $\Gamma ^6$.

6 NE-free examples

In this section we provide examples for edge-symmetric shortest path games that have no NE (and of course violate in some way the positivity condition). We also provide an example for a positive edge-symmetric shortest path game that has no UNE.

Let us recall (e.g., from Boros et al. 2017, 2018) that the effective cost of infinite plays can be defined in different ways, where differences are due to cycles in which the sum of edge lengths is zero (for some of the players). Accordingly, we provide two examples.

The first example shown in Fig. 4 is a 2-person nonzero sum shortest path game on an edge-symmetric digraph that has both positive and negative edge lengths and no cycle has length zero. The corresponding normal form is shown in Fig. 7 proving that this example has no NE.

The second example shown in Fig. 5 is a 2-person nonzero sum shortest path game on an edge-symmetric digraph that has nonnegative edge lengths but some of the cycles have zero length. In fact only cycles have zero length in which all edges have zero lengths. The corresponding normal form is shown in Fig. 8 proving that this example has no NE, either.

Let us add that all meaningful definitions of the cost of an infinite play (that may lead to a NE) agree on the following facts (see Boros et al. 2017, 2018): if for a situation $\sigma$ the corresponding play $P=P(\sigma )$ ends in a cycle C and $\sum _{e\in C} \ell ^i(e)>0$, then $\ell ^i(\sigma )=+\infty$, and if $\sum _{e\in C} \ell ^i(e)<0$, then $\ell ^i(\sigma )=-\infty$. Furthermore, if all edges $e\in C$ have $\ell ^i(e)=0$, then $\ell ^i(\sigma )=\sum _{e\in P{\setminus } C} \ell ^i(e)$, or in other words, the effective cost of the play ending in cycle C is the length of the path of P leading to cycle C. In this sense, the above two examples show that positivity of the edge lengths in edge-symmetric shortest path games is “essential” to guarantee a NE.

Finally we show an example for a 2-person positive edge-symmetric shortest path game that has no UNE. This game, $\Gamma ^{6s}$ is shown in Fig. 6 is derived from the terminal game $\Gamma ^6$ shown earlier. Note first that the counter clockwise moves have a large length for both players, larger than the length of any terminating move. This implies that in any UNE these moves will not be used. Let us also observe that from any position v and for any two terminals $a_i$ and $a_j$, $i\ne j$ the i(v)-lengths of the paths from v to these terminals (via the clockwise moves) compare exactly the same way as in the terminal game $\Gamma ^6$: the i(v)-length of the $v\rightarrow a_i$ path is shorter than the i(v)-length of the $v\rightarrow a_j$ path if and only if player i(v) prefers terminal $a_i$ to $a_j$ in $\Gamma ^6$. Thus, any UNE in this shortest path game would correspond to a UNE in $\Gamma ^6$, and Boros et al. (2012) proved that no such UNE exist.

7 Open problems

Two important questions concerning the existence of NE remain open. The first one is about terminal games.

(Q1) Does an n-person terminal game satisfying condition (CIW) have a NE?

In this paper question (Q1) is answered in the positive for games on edge-symmetric digraphs. Moreover, in this case condition (CIW) is not essential.

It was shown in Boros et al. (2018) that in general (for non-symmetric digraphs) condition (CIW) is essential when $n > 2$, while for $n=2$ (CIW) is not needed (Boros and Gurvich 2003). Interestingly, if the answer to (Q1) were negative then paradoxically there should be a terminal game satisfying (CIW) in which all NE are infinite, that is, realized by infinite plays; see (Boros et al. 2018).

The second open problem is about shortest path games.

(Q2) Does a positive 2-person shortest path game have a NE?

The answer is negative for more than two players, see (Gurvich and Oudalov 2014). In the present paper it is shown that the answer is negative for general edge-symmetric 2-person games, while it is positive for edge-symmetric positive games with any number of players.

References

Boros E, Gurvich V (2003) On Nash-solvability in pure stationary strategies of positional games with perfect information which may have cycles. Math Soc Sci 46:207–241
Article Google Scholar
Boros E, Elbassioni K, Gurvich V, Makino K (2012) On Nash equilibria and improvement cycles in pure positional strategies for chess-like and Backgammon-like $n$-person games. Discrete Math 312(4):772–788
Article Google Scholar
Boros E, Elbassioni K, Gurvich V, Makino K (2017) A nested family of $k$-total effective rewards for positional games. Int J Game Theory 46(1):263–293
Article Google Scholar
Boros E, Elbassioni K, Gurvich V, Makino K (2018) Markov decision processes and stochastic games with total effective payoff. Ann Oper Res. https://doi.org/10.1007/s10479-018-2898-8
Article Google Scholar
Boros E, Gurvich V (2009) Why Chess and Backgammon can be solved in pure positional uniformly optimal strategies, RUTCOR Research Report, RRR-21-2009, Rutgers University
Boros E, Gurvich V, Makino K, Wei S (2011) Nash-solvabile two-person symmetric cycle game forms. Discrete Appl Math 159(15):1461–1487
Article Google Scholar
Boros E, Gurvich V, Milanič M, Oudalov V, Vičič J (2018) A three-person deterministic graphical game without Nash equilibria. Discrete Appl Math 243:21–38
Article Google Scholar
Everett H (1997) Recursive games, in M. Dresher, A. W. Tucker, and P. Wolfe (eds.) Contributions to the Theory of Games, Annals of Mathematics Studies 3, Princeton University Press. (1957) 67–78. Reprinted in H.W. Kuhn, ed., Classics in Game Theory, Princeton University Press
Fraenkel AS, Scheinerman ER, Ullman D (1993) Undirected edge geography. Theor Comput Sci 112:371–381
Article Google Scholar
Gallai T (1958) Maximum-minimum Satze uber Graphen. Acta Math Acad Sci Hungar 9:395–434
Article Google Scholar
Gillette D (1957) Stochastic Games with Zero Stop Probabilities, In Contributions to the Theory of Games, Vol. III, Annals of Mathematics Studies 39:179–187
Google Scholar
Gurvich V (1975) Solution of positional games in pure strategies. USSR Comput Math Math Phys 15(2):74–87
Article Google Scholar
Gurvich V (1988) A stochastic game with complete information and without equilibrium situations in pure stationary strategies. Russian Math Surv 43(2):171–172
Article Google Scholar
Gurvich V (1989) Equilibrium in pure strategies. Soviet Math Dokl 38(3):597–602
Google Scholar
Gurvich V (2015) A four-person chess-like game without Nash equilibria in pure stationary strategies. Bus Inf 1(31):68–76
Google Scholar
Gurvich V (2018) Backward induction in presence of cycles. Oxf J Log Comput 28(7):1635–1646
Article Google Scholar
Gurvich V, Koshevoy G (2018) Monotone bargaining is Nash-solvable. Discrete Appl Math 250:1–15
Article Google Scholar
Gurvich V, Oudalov V (2014) On Nash-solvability in pure stationary strategies of the deterministic n-person games with perfect information and mean or total effective cost. Discrete Appl Math 167:131–143
Article Google Scholar
Lichtenstein D, Sipser M (1980) Go is polynomial-space hard. J ACM 27(2):393–401
Article Google Scholar
Schaefer TJ (1978) On the complexity of some two-person perfect-information games. J Comput Syst Sci 16(2):185–225
Article Google Scholar
Thuijsman F, Vrieze OJ (1987) The bad match, a total reward stochastic game. Oper Res Spektrum 9:93–99
Article Google Scholar
Thuijsman F, Vrieze OJ (1998) Total reward stochastic games and sensitive average reward strategies. J Optim Theory Appl 98:175–196
Article Google Scholar
Washburn AR (1990) Deterministic graphical games. J Math Anal Appl 153:84–96
Article Google Scholar

Download references

Acknowledgements

The third and forth authors were working within the framework of the HSE University Basic Research Program. The fourth author was supported in part by the state assignment topic no. 0063-2016-0003.

Funding

Open access funding provided by Università degli Studi di Roma La Sapienza within the CRUI-CARE Agreement.

Author information

Authors and Affiliations

MSIS & RUTCOR, Business School, Rutgers University, 100 Rockafellar Road, Piscataway, NJ, 08854, USA
Endre Boros & Vladimir Gurvich
Department of Statistics, Sapienza University, P.le Aldo Moro 5, Rome, 00185, Italy
Paolo Giulio Franciosa
Higher School of Economics (HSE), National Research University, Moscow, Russia
Vladimir Gurvich & Michael Vyalyi
Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences, Moscow, Russia
Michael Vyalyi
Department of Control and Applied Mathematics, Moscow Institute of Physics and Technology, Moscow, Russia
Michael Vyalyi

Authors

Endre Boros
View author publications
You can also search for this author in PubMed Google Scholar
Paolo Giulio Franciosa
View author publications
You can also search for this author in PubMed Google Scholar
Vladimir Gurvich
View author publications
You can also search for this author in PubMed Google Scholar
Michael Vyalyi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Paolo Giulio Franciosa.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Boros, E., Franciosa, P.G., Gurvich, V. et al. Deterministic n-person shortest path and terminal games on symmetric digraphs have Nash equilibria in pure stationary strategies. Int J Game Theory 53, 449–473 (2024). https://doi.org/10.1007/s00182-023-00875-y

Download citation

Accepted: 17 September 2023
Published: 10 October 2023
Issue Date: June 2024
DOI: https://doi.org/10.1007/s00182-023-00875-y

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Deterministic n-person shortest path and terminal games on symmetric digraphs have Nash equilibria in pure stationary strategies

Abstract

Similar content being viewed by others

Topological Network-Control Games

A Dijkstra-Type Algorithm for Dynamic Games

The Complexity of Infinitely Repeated Alternating Move Games

1 Introduction

1.1 Edge-symmetric and positive shortest path games

Theorem 1

1.2 Edge-symmetric terminal games

Theorem 2

1.3 Uniform Nash equilibrium for edge-symmetric terminal games

Theorem 3

2 Edge-symmetric positive shortest path games

2.1 Special paths

Lemma 1

Proof

2.2 Extending a special path to a NE

Lemma 2

Proof

Proof of Theorem 1

3 Terminal games and shortest path games

Theorem 4

Proof

Corollary 5

Proof

4 Edge-symmetric terminal games

Lemma 3

Proof

Proof of Theorem 2

5 Existence of UNE in edge-symmetric terminal games

Lemma 4

Proof

Lemma 5

Proof

Lemma 6

Proof

Lemma 7

Proof

Lemma 8

Proof

Lemma 9

Proof

Proof of theorem 3

Remark 1

Remark 2

6 NE-free examples

7 Open problems

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation