Now, we want to make our comparisons and merges more concrete by looking at a concrete modal logic that already existed independently, and that turns out to shed some additional light on the semantic and axiomatic aspects of STIT meeting PDL.
Choices and pair events Let us return to the STIT choice situation for two agents. There is an actual world with the choices that were actually made. It makes sense to think of the worlds here as pairs of actions chosen. Note that each world \(w\) can be mapped to a unique pair of equivalence classes containing it, one for each agent, and by the product axiom, this map to pairs of equivalence classes is surjective. What we do not know is whether the map is injective, and indeed it may not be, unless we modify the product axiom to require that different choices for all the agents have singleton intersections. The latter constraint says that all slack in choices has been explained by introducing enough agents—perhaps including the ‘environment’ to take up all remaining slack. There is some simple arithmetic involved here. Assume that our model is finite. The product axiom with the singleton clause forces all equivalence classes for agent \(1\) to have the same size \(n\), as they need room for representatives of all choices of \(2\). The total size will be \(n\times k\), with \(k\) the fixed size for \(2\) that exists similarly. But this suggests a viewpoint in terms of “matrix models” for joint actions that is well-known from logics of games in strategic form (cf. Osborne and Rubinstein 1994). We will develop this analogy here, using a logic proposed in (van Benthem 2007) that provides a particularly apt comparison for STIT, while also doing full justice to the PDL perspective.Footnote 12
4.1 Modal logic of matrix games
Games induce natural models for epistemic, doxastic and preference logics, as well as conditional logics and temporal logics of action. See van der Hoek and Pauly (2006) for an overview of many such systems. Our discussion just takes a small slice.
Recall the definition of a strategic game for a set of players \(N\): (1) a set \(A_i\) of actions for each \(i\in N\), and (2) a utility function or preference ordering on the set of outcomes. For simplicity, one often identifies the outcomes with the set \(S\)
\(=\)
\(\Pi _{i\in N} A_i\) of strategy profiles. Given a strategy profile \(\sigma \in S\) with \(\sigma =(a_1,\ldots ,a_n)\), \(\sigma _i\) is the \(i\)th projection (i.e., \(\sigma _i=a_i\)) and \(\sigma _{-i}\) lists the choices of all agents except agent \(i\): \(\sigma _{-i}=(a_1,\ldots ,a_{i-1},a_{i+1},\ldots ,a_n)\).
Now, from a logical perspective, it is natural to treat the set \(S\) of strategy profiles as a universe of “possible worlds”.Footnote 13 Following (van Benthem et al. 2011) for the rest of this subsection, two natural relations can be defined on these worlds. For each \(\sigma ,\sigma '\in S\), set for each player \(i\in N\):
-
\(\sigma \sim _i \sigma '\) iff \(\sigma _i=\sigma _i'\): this epistemic relation represents player \(i\)’s “view of the game” at the ex interim stage where \(i\)’s choice is fixed but the choices of the other players’ are unknown,
-
\(\sigma \approx _i\sigma '\) iff \(\sigma _{-i}=\sigma '_{-i}\): this relation of “action freedom” (a term taken from Seligman (2010)) gives the alternative choices for player \(i\) when the other players’ choices are fixed.
Control can be freedom Our earlier discussion of STIT was in terms of control, including the lack of it inside players’ equivalence classes. But in a multi-agent perspective, one person’s lack of control is another person’s freedom, and labels can switch easily.
This can all be packaged in a standard relational structure
$$\begin{aligned} \mathcal {M}=\langle S, \{\sim _i\}_{i\in N}, \{\approx _i\}_{i\in N} \rangle \end{aligned}$$
with \(S\) the set of strategy profiles and the relations just defined. Adding a valuation function interpreting a set \(\mathsf {At}\) of atomic propositions that represent basic facts about strategy profiles (physical, or game-internal), we get standard multi-modal models.Footnote 14
Such game models support many logical languages, from simple modal formalisms to ‘hybrid modal logics’, first-order logic, or even non-first-order fixed-point logics. Cf. van Benthem (2010) and Blackburn et al. (2002) on the balance of expressive power and computational complexity that arises in such design choices, a topic that will return below. However, the simplest system will do for us here. In particular, here are the key modalities for a modal logic of strategic games:
-
\(\sigma \models [\sim _i]\varphi \) iff for all \(\sigma '\), if \(\sigma \sim _i\sigma '\) then \(\sigma '\models \varphi \).
-
\(\sigma \models [\approx _i]\varphi \) iff for all \(\sigma '\), if \(\sigma \approx _i\sigma '\) then \(\sigma '\models \varphi \).
The first modality expresses the knowledge a player has once her choice is made, and given her uncertainty about what others will do, the second modality refers to her freedom of choice. As is well-known, combining the two modalities makes \(\varphi \) true in each world of a matrix game model: \([\sim _i][\approx _i]\varphi \) acts as a universal modality \(U\).Footnote 15 This reflects an earlier observation about STIT—and that is no coincidence, witness the observations in Sect. 4.2 below.
What is the deductive power of the basic modal logic of strategic games? As before, we restrict attention to two-player games. First, given the nature of our relations, the separate logics are standard modal \(\mathbf{{S5}}\) for epistemic outlook and action freedom. In addition, the interaction of these modalities validates further laws. In particular, the above fact about the universal modality is reflected in the following law:
the equivalence \([\sim _i][\approx _i]\varphi \leftrightarrow [\approx _i][\sim _i]\varphi \) is valid in all matrix game models.
This validity depends on, and in fact it expresses, the geometrical “grid property” of game matrices that, if one can go on a path \(x \sim _i y \approx _i z\), then there also exists a point \(u\) with \(x \approx _i u \sim _i z\). We will discuss what this feature means in some more detail in Sect. 4.3.
This concludes our brief introduction to the modal logic of matrix games. For details and further issues, the reader is referred to (van Benthem 2014).
4.2 STIT in Modal Matrix Logic
Given our discussion in Sect. 3, it will be evident how to translate the basic STIT operators into our modal language of matrix games:
$$\begin{aligned}{}[i\ \mathsf {stit}] \varphi := [\sim _i]\varphi , \quad \Box \varphi := [\sim _i][\approx _i]\varphi \end{aligned}$$
This connection gives just the right combination of what we have called freedom plus knowledge.
Fact 4.1.
Our translation embeds STIT logic faithfully into the modal logic of full matrix games.
Proof.
First consider the direction from STIT theoremhood to modal game logic. Our translation validates the earlier STIT axioms, where the action modality refers to all consequences of the choice actually made, while the freedom modality looks at all alternative histories passing through the current profile. In particular, the quantifier combination employed in the Freedom axiom now becomes derivable through the theorems that are derivable for the STIT modality plus the existential modality \(E\) defined as \(\langle \approx _1\rangle \langle \approx _2\rangle \):
Fact 4.2.
The formula \((E [\sim _1] \varphi \ \wedge \ E [\sim _2]\psi )\rightarrow E(\varphi \wedge \psi )\) is derivable in multi-\(\mathbf{{S5}}\) plus the commutation law for the two modalities.
Conversely, to prove that the embedding is faithful, we need to refute each non-valid STIT law in our matrix models. To do so, take any STIT temporal counter-model in the sense of Sect. 2, and note that it suffices to look at the current moment and the next moments only (recall, that our STIT language does not contain temporal modalities). Furthermore, without loss of generality, we assume that this model is finite. More precisely, as in Sect. 3.1, we can abstract a finite two-agent basic STIT \(\mathbf{{S5}}\)-model out of the temporal structure by letting histories be worlds, and defining agent’s equivalence relations respecting their choice partitions. Now, the historic necessity operator is the universal modality while the two STIT modalities are the modalities for the equivalence relations. The last step is to show that we can transform this model into a matrix model.
If the intersections of the equivalence classes, one from each agent, are singletons, then we are done. Otherwise, we proceed as follows. A cell is an intersection of the agents’ equivalence classes (i.e., \(C=[w]_1\,\cap \,[v]_2\) for some states \(w\) and \(v\)). Since the model is finite, there are finitely many cells and each cell has only finitely many states in them. Furthermore, by the independence assumption, each cell is non-empty. Let \(m\) be the number of elements in the largest cell. Without loss of generality, we can assume that all cells contain exactly \(m\) states (this may require adding copies of states to the model).
Organize the cells so that they form a matrix where each row contains all the cells making up a 1-equivalence class and each column contains all cells making up a 2-equivalence class. Label each cell by its position in the matrix (so, the pair \((x,y)\) corresponds to the cell in row \(x\) and column \(y\)). There may be more than one way to organize the cells so that the rows correspond to a 1-equivalence class and the columns correspond to a 2-equivalence class. Our construction does not depend on the choice of labeling. For the remainder of the proof, fix such a \(r\times c\) matrix.
Now, construct an \(m\times m\) matrix for each cell. Fix a cell \(C\) labelled with \((x,y)\) containing states \(w_1,\ldots , w_m\). Worlds in the new model with be 4-tuples \((i,j,x,y)\) where \((i,j)\) denotes the position in the matrix and \((x,y)\) denotes the cell containing the world. Formally, let \((i,j,x,y)\) be a copy of \(w_{i+{j-1} \mod m}\). So, for example, if \(m=3\), then the world \((2,3,x,y)\) is a copy of \(w_1\). Note that each row and each column contains a copy of all the worlds in \(C\).
The model is \(\mathcal {M}=\langle W', \sim _1', \sim _2', V'\rangle \) where \(W'=\{ (i,j,x,y) \, | \,i,j\le m, x\le r, y\le c \}\) (where \(r\) and \(c\) are the number of rows and columns respectively in the outer matrix). We define the uncertainty relations for the agents as follows:
-
\((i,j,x,y)\sim _1^0 (i,j',x,y)\) for all \(j,j'\le m\)
-
\((i,j,x,y)\sim _2^0 (i',j,x,y)\) for all \(i,i'\le m\)
So \(\sim _1'\) runs along the rows of each inner matrix, and \(\sim _2'\) runs along the columns. We extend this relation as follows:
-
\((i,j,x,y)\sim _1^0 (i,0,x,y+1)\), where the addition is taken modulo m
-
\((i,j,x,y)\sim _2^0 (0,j,x+1,y)\), where the addition is taken modulo m
Let \(\sim _1'\) and \(\sim _2'\) be the reflexive and transitive closure of \(\sim _1^0\) and \(\sim _2^0\), respectively. Finally, the valuation \(V'\) is copied from the original valuation in the obvious way. We note the following two facts about the construction:
-
1.
If \((i,j,x,y)\sim _1' (i',j',x',y'),\) then \(i'=i\) and \(x'=x\). If \(y'=y,\) then \(w_{i+(j-1) \mod m}\) and \(w_{i+(j'-1) \mod m}\) are both in the cell labeled by \((x,y)\), and so \(w_{i+(j-1) \mod m} \sim _1 w_{i+(j'-1) \mod m}\). If \(y'\ne y\), then \(w_{i+(j-1) \mod m}\) and \(v_{i+(j'-1) \mod m}\) are in different cells. However, we still have \(w_{i+(j-1) \mod m}\sim _1 v_{i+(j'-1) \mod m}\) since we assume that cells in the same row are in the same 1-equivalence class.
-
2.
If \((i,j,x,y)\sim _2' (i',j',x',y')\) then \(j'=j\) and \(y'=y\). If \(x'=x\), then \(w_{i+(j-1) \mod m}\) and \(w_{i+(j'-1) \mod m}\) are both in the cell labeled by \((x,y),\) and so \(w_{i+(j-1) \mod m} \sim _2 w_{i+(j'-1) \mod m}\). If \(x'\ne x\), then \(w_{i+(j-1) \mod m}\) and \(v_{i+(j'-1) \mod m}\) are in different cells. However, we still have \(w_{i+(j-1) \mod m}\sim _2 v_{i+(j'-1) \mod m}\) since we assume that cells in the same column are in the same 2-equivalence class.
These observations show immediately that the newly constructed model is bisimilar to the original STIT model. Hence, they satisfy the same formulas in our language.
The last thing we need to check is that the intersection of agents’ equivalence classes are singletons. Suppose that \((i_0,j_0,x_0,y_0)\sim _1' (i',j',x',y')\), \((i_0,j_0,x_0,y_0) \sim _1' (i'',j'',x'',y'')\), \((i_1,j_1,x_1,y_1)\sim _2' (i',j',x',y')\) and \((i_1,j_1,x_1,y_1)\sim _2' (i'',j'', x'',y'')\). Then, by construction, \(i'=i''=i_0\), \(x'=x''=x_0\) and \(j'=j''=j_1\) and \(y'=y''=y_1\). Hence, \((i',j',x',y')=(i'',j'',x'',y'')\), as desired.
We have shown that our translation is both correct and faithful.Footnote 16
QED
This proof exploits the fact that matrix game models are close to the multi-\(\mathbf{{S5}}\) models for basic STIT defined in Sect. 3.1. Still, the geometrical matrix perspective is useful, since it links up with a body of existing results. We will see a number of examples as we proceed.
4.3 Complexity and Correlation
While the preceding embedding makes sense, it does embed STIT in a system whose behavior is potentially complex. Richer modal logics of matrix games may well be unaxiomatizable and worse. The reason is the above commutation law for the two equivalence relations. While this may look like a pleasant structural feature of matrices, its logical effects are delicate. It is well-known that the general logic of bi-modal languages plus a universal modality on ‘grid models’ with two immediate successor relations is not decidable, and not even axiomatizable: indeed, it is “\(\Pi _1^1\)-complete” (cf. Halpern and Vardi 1989; Marx 2007; Gabbay et al. 2003; van Benthem and Pacuit 2006). The reason is that grid structure can be exploited to encode computations of Turing machines on successive rows, or geometrical “tiling problems” of known high complexity.
Now, it is not clear whether our most basic modal game logic falls into this trap, since our models only have two equivalence relations, one horizontal and one vertical. Indeed, its closeness to STIT may suggest that it remains decidable–even though this does not follow from our earlier embedding result, that went in the opposite direction. Still, Halpern and Vardi (1989) and Spaan (1990) show high complexity of modal logics on grid models with reflexive transitive relations, using an encoding trick with alternating proposition letters.Footnote 17
This potential high complexity, while not directly threatening to STIT, does raise an interesting issue in modeling action. A standard way of defusing high complexity results is by allowing more models. In the present setting, the resulting structures are general game models where certain strategy profiles may be absent. Then general modal game logic becomes much simpler, being just multi-agent modal \(\mathbf{{S5}}\) without any connecting axioms (van Benthem 1997).Footnote 18
Now this is not just a technical move: “profile gaps” encode something interesting, namely correlations between behavior of agents. In a general game model, if player \(i\) changes her move, then the only available profiles for this may now be ones where some other player \(j\) has changed his move as well. Game theorists have studied correlations extensively: cf. (Aumann 1987; Brandenburger and Friedenberg 2008). But the same notion has come up in logic, since correlations provide “information channels” where the behavior of one agent can carry information about that of another (Barwise and Seligman 1997). And more recently, generalized forms of such dependencies have become the focus of attention in “dependence logics” (Väänänen 2007). In other words, independence may be costly, and the Product Axiom that seemed the pride of STIT may eventually stand in the way, being just an extreme case of a more sophisticated theory of agent behavior.Footnote 19
In the rest of this chapter, we look at extensions of the current framework with features that seem essential to rational agency, and that have been the subject of study in dynamic logics.