The prediction value

We introduce the prediction value (PV) of player i as the difference between the conditional expectations of v(S) when i cooperates or not in a probabilistic TU game. The latter combines a standard TU game and a probability distribution over the set of coalitions. The PV reflects the importance of information about a given player’s behavior for predicting, e.g., committee decisions that are subject to opinion interdependencies. The PV is characterized by anonymity, linearity, a consistency requirement and two normalization conditions. Every multinomial probabilistic value, hence every binomial semivalue, coincides with the PV for a particular family of probability distributions. So the PV can be regarded as a power index in specific cases. Conversely, some semivalues—including the Banzhaf but not the Shapley value—can be interpreted in terms of informational importance.


Introduction
Concepts of power and importance in models of cooperation are central to numerous studies in sociology, political science, mathematics, and economics. Much of the literature applies values or power indices which attribute fixed roles-often perfectly symmetric-to all players in the underlying coalition formation process and then focus on their marginal contributions. Most prominent examples are the Shapley value and Banzhaf value (Shapley 1953;Banzhaf 1965); others can be found in Roth (1988), Owen (1995), Felsenthal and Machover (1998) or Laruelle and Valenciano (2008b). The values differ in how marginal contributions to distinct coalitions are weighted.
With an appropriate rescaling, weights on specific marginal contributions can always be interpreted as a probability distribution. So Shapley value, Banzhaf value, and more generally probabilistic values (Weber 1988) correspond to the expectation of a difference. This difference is between the worth of a random coalition S which is drawn from 2 N \i according to a value-specific probability distribution P i and the worth of the same coalition when i joins. That is, a probabilistic value equals for a fixed family of distributions {P i } i∈N . 1 Unless the probabilistic value in question is also a (multibinary or) multinomial probabilistic value (cf. Puente 2000;Freixas and Puente 2002;Carreras and Puente 2015a), the presence of player i in the realized coalition S can statistically depend on whether player j = i belongs to it or not. This may plausibly be the case, for example, when voting is preceded by a process of information transmission or opinion formation. 2 Unfortunately the expectation E P i [v(S ∪ i) − v(S)], interpreted as power or importance of player i, can behave in strange ways when the family of distributions {P i } i∈N implicates correlated behavior. The following example illustrates the conceptual problem. 3 Example 1 Consider the canonical simple majority decision rule with an assembly of 5 voters. Let P be the probability distribution that assigns probability 0 to the 20 coalitions containing exactly two or exactly three voters; and equal probability of 1/12 to each of the remaining 12 coalitions. Irrespective of whether we derive P i by projecting P to 2 N \i or by conditioning P on i / ∈ S, the probabilistic value is zero for all players. However, that no member of this decision body should have any voting power or importance is somewhat counterintuitive.
Here, the expectation of a difference is uninformative since non-zero marginal contributions v(S ∪ i) − v(S) > 0 do not count when the underlying probability distribution P treats S ∪ i and S as null events. More generally, the problem with the expected marginal contribution E P i [v(S ∪ i) − v(S)] is that it treats i's decision, 1 It is equivalent to consider suitable probability distributions P i on {S ∈ 2 N : i ∈ S} and then to evaluate We adopt the usual notational simplifications like writing S\i or S ∪ i j instead of S\{i} or S ∪ {i, j}. 2 See, for example, the seminal opinion formation model of DeGroot (1974): individuals start with initial opinions (beliefs) on a subject represented by an n-dimensional vector of probabilities, and repeatedly update their individual opinion based on the current opinions of their peers. Different structures of consensus formation can be captured by different network topologies. 3 We owe this example to Moshé Machover. say, to change her no vote into a yes (or vice versa) as being fully detached from the respective probabilities of observing the considered two coalitions with and without i.
This paper proposes an alternative approach: namely, to consider the difference of two expectations. These expectations will be derived from a given probabilistic description P of coalition formation. The latter plays the same role as {P i } i∈N does for probabilistic values or corresponding families {P v i } i∈N do for values that evaluate marginal contributions in game v-specific ways. 4 However, we take P as a primitive of the collective decision situation under investigation, rather than of the solution concept.
We thus depart from most of the previous literature in two respects: first, similar to Laruelle and Valenciano (2008a), we explicitly consider probabilistic games (N , v, P) where (N , v) is a standard TU game and P is a probability distribution on N 's power set 2 N . Second, we introduce a new value that reflects the difference between two conditional expectations. Specifically, we define the prediction value (PV) of any given player i ∈ N as the difference in v's expected value when the distribution P|i which conditions P on the event {i ∈ S} and the distribution P|¬i which conditions on {i / ∈ S} are applied. In other words, we suggest to evaluate The two coincide in interesting special cases, but not in general. In particular, our approach is similar in spirit but factually differs from the evaluation of conditional decisiveness as in Valenciano (2005, 2008a).
The margin between the respective conditional expectations can be interpreted as the importance of a player in the probabilistic game (N , v, P) in several ways. Most generally, it captures the informational or predictive value of knowing i's decision in advance of the process which divides N into some final coalition S and its complement. Moreover, in case i's membership of the coalition which supports a specific bill or cooperates in a joint venture is statistically independent of others, the PV provides a measure of i's influence on the outcome of collective decision making, or of i's power in (N , v, P).
The existing literature-the references above are a small selection-obviously does not suffer from a shortage of solution concepts in general, nor of ones targeted at quantifying a player's power in TU games. But as Aumann (1987) argued: "Different solution concepts are like different indicators of an economy; different methods for calculating a price index; different maps (road, topo, political, geologic, etc., not to speak of scale, projection, etc.); …They depict or illuminate the situation from different angles; each one stresses certain aspects at the expense of others." Taking up Aumann's metaphor, we do not suggest another scale or projection of a player's vector of marginal contributions here. We rather propose to look at a slightly different kind of map. For some situations-involving statistically independent cooperation decisions-the picture may look identical to that generated by, say, the Banzhaf value (giving the latter additional force/basis/clout); in others, such as Example 1, it will provide a new perspective.
Example 1 (continued) In the considered assembly, the probability of coalition S was P(S) = 0 for |S| = 2 or |S| = 3 and P(S) = 1/12 otherwise. The respective conditional probabilities follow from Bayes' rule. In particular, knowing that player i cooperates (i.e., is part of S) gives rise to Similarly, we obtain in case i is known not to cooperate. The expected value of v(S), which corresponds to the probability that the assembly passes the proposal in question, is changed by the observation that, say, voter 1 supports rather than opposes it by The information that player 1 votes in a particular way is not enough to predict an assembly decision perfectly. (That would be the case But it changes the odds significantly. Player 1 may not have voting power in the traditional sense, but his or her vote is important from an informational perspective. It may be quite valuable to investors or speculators, for instance. The same applies here to the other members of the assembly to identical degrees. Asymmetric voting weights or asymmetric roles in opinion dynamics and coalition formation would naturally give rise to different PV numbers for the respective assembly members.
A null player who has a voting weight that cannot matter for matching a required majority threshold and whose behavior is uncorrelated with the remaining players has a PV of zero. Endowing the same player with greater voting weight will at some point translate into a positive value-reflecting the difference that her vote can now make for the outcome, just like traditional indices. But leaving initial voting weights unchanged, the PV will also ascribe positive importance to the null player if interdependencies make its cooperation a predictor of whether a proposal is passed.
Plausible causes for dependencies abound and, for instance, include the possibility that the player in question is actually without vote but 'followed' by the official voters as, say, their paramount leader. The proposed change of perspective-from, traditionally, the expected difference that a player would make by an ad-hoc change of coalition membership towards the difference in expectations for the collective outcome which is associated with that player's cooperation-opens the route to studying voting and coalition formation as the result of social interaction. Final votes may be determined by whether i is initially a supporter or opponent even if i is a null player of (N , v), and this is arguably a source of power just like official voting weight. We believe that evaluating changes in conditional expectations can help to quantify this in future research.
Here, we introduce and investigate properties of the prediction value. We formally define it in Sect. 3, after collecting some preliminaries in Sect. 2. We describe a set of characteristic properties in Sect. 4 and relate the PV to traditional probabilistic values in Sect. 5. The considered distributions P could embody the a prioristic presumptions of traditional power measures, i.e., be the uniform distribution on 2 N or the space of permutations on N . (Interestingly, the latter does not make PV and Shapley value coincide). But P could equally well be based on empirical data-say, observations of past voting behavior in a decision making body like the US Congress, EU Council, etc. We briefly conduct such a posteriori analysis with the PV in an application to the Dutch Parliament in Sect. 6, and we conclude in Sect. 7. All mathematical proofs are contained in the Appendix.

Preliminaries
A TU game is an ordered pair (N , v) where N ⊂ N represents a non-empty, finite set of players and v : 2 N → R is the characteristic function which specifies the worth v(S) of any subset or coalition S ⊆ N and satisfies v(∅) = 0. The set of all TU games is denoted by G, and the set of all TU games with player set N by G N .
(N , v) ∈ G is a simple game if v is a monotone Boolean function, i.e., v(S) ∈ {0, 1} and v(S) ≤ v(S ) for all S ⊆ S ⊆ N , such that v(∅) = 0 and v(N ) = 1. A coalition S with v(S) = 1 is then called winning. Given any non-empty coalition S ⊆ N , the so-called unanimity game u S is defined by u S (T ) = 1 if S ⊆ T and u S (T ) = 0 otherwise. We will drop the player set N from our notation when it is clear from the context; so u S is shorthand for (N , u S ). Moreover, we refer to u {i} simply as u i . Player Every player i with v(i) = 0 is said to be dependent. A dummy player who is dependent is also known as a null player. If (N , v) is a simple game and v(S) = 1 if and only if i ∈ S, then player i is called a dictator.
TU games (N , v) have explicitly been combined with probability distributions P over coalitions S ⊆ N since Owen (1972). P is typically mentioned in order to provide probabilistic motivation or foundations for a particular solution concept, not as a characteristic of a collective decision making situation. We want to emphasize its role as a primitive and define a probabilistic game as an ordered triple (N , v, P), where (N , v) is a TU game and P is a probability distribution on the power set of N , 2 N . The set of all probabilistic games is denoted by PG; and PG N is the restriction to the class of probabilistic games with player set N .
A TU value is a function which assigns a real number to all elements of N for any given TU game. An extended value is a mapping ϕ that assigns to each probabilistic game (N , v, P) a vector ϕ(N , v, P) ∈ R |N | . ϕ i (N , v, P) will be interpreted as a measure of the 'difference', in an abstract sense, that player i makes for the probabilistic game (N , v, P). It might, for instance, relate to the average of marginal contributions v(S ∪ i) − v(S) that are made by i to coalitions S ∈ N \i, to the difference that i makes to a potential function (i.e., a mapping from PG to R) when i is added to the player set N such that N ∪ i = N , or to any other indicator of how important the behavior or presence of player i might be to the members of N or an outside observer.
TU values and extended values are defined on two distinct domains, G and PG. Extended values can be regarded as technically the more general concept because any given TU value can be turned into an extended value simply by ignoring the distribution P that is specified as part of (N , v, P). For instance, correspondingly 'extends' the Shapley value and the (extended) Banzhaf value can be defined by 5 Both the original Shapley TU value and the Banzhaf TU value (which was at first restricted to simple games, and later extended to general TU games by Owen 1975) are special instances of probabilistic values, as introduced by Weber (1988). 6 They are defined by such that each element Q i of the collection Q = {Q i } i∈N denotes a probability distribution on {S ⊆ 2 N : i ∈ S}, or such that Q i denotes a probability distribution on 2 N \i . 7 Several subclasses of probabilistic values have received special attention. Semivalues satisfy (6) and (7) for weights Q i (S) and Q i (S), respectively, which are identical for all i ∈ N and depend on S only via its cardinality |S| (Dubey et al. 1981). Then 5 When the considered set of players N is clear from the context, we simplify notation by writing S i instead of S⊆N : i / ∈S , or S i instead of S⊆N : i∈S . 6 Weber (1988) originally defined a "probabilistic value" for each individual player i ∈ {1, . . . , n} and referred to the corresponding n-vector as a group value. We follow the later terminology of Monderer and Samet (2002). 7 The argument (N , v, Q) in (6) and (7) is more general than a probabilistic game. The latter follows as the special case in which each Q i stems from the same probability distribution Q.
Multinomial values have been introduced by Puente (2000) and are obtained from (6) or (7) by requiring that each player j is part of the formed coalition with probabilitỹ p j independently of any other player. 8

The prediction value
For a given probabilistic game (N , v, P) define the conditional probability distributions P|i and P|¬i as follows: for all S ⊆ N and, similarly, Laruelle and Valenciano (2005) have suggested to consider the conditional decisiveness measures and as indicators of player i's importance in (N , v, P). 9 It is easy to see that if and only if P(S) ≡ 2 −|N | for all S ⊆ N , and one can similarly obtain identity with the (extended) Shapley value. Namely, with n = |N | and s = |S| (see Laruelle and Valenciano 2005, Prop. 3). 10 We propose an altogether different approach to assessing the importance of N 's members in a probabilistic game (N , v, P). It is not based on probabilistic values, nor marginal contributions in general. Weighted marginal contributions may misrepresent player i's importance in (N , v, P) in that they implicitly treat i's decision, say, to change her no vote into a yes (or vice versa) as being fully detached from the respective probabilities of observing the considered two coalitions with and without i. Example 1 already highlighted the effect that non-zero marginal contributions v(S ∪i)−v(S) > 0 do not count when the underlying probability distribution P treats both events S ∪i and S as null events. Adding up marginal contributions also leads to strange conclusions if only one of the coalitions S and S ∪ i has positive probability. One thing that outside observers, members j = i of N or i herself might still care about in the examples is the informational gain that comes with the knowledge: "i will (not) be part of the eventually formed coalition". Knowing this might imply that j cannot (or must) be amongst the members of the coalition. This may have ramifications 9 Laruelle and Valenciano (2008a) refer to + i (N , v, P) as player i's "interim expected marginal contribution" and identify its relations with different classes of probabilistic values. 10 Note that the respective distributions P which need to be assumed in order to obtain the Shapley value as a conditional expected marginal contribution in (16) and (17) differ.
for the expected surplus that is created or the passage probability of the bill being debated. In other words, it may be useful to base one's evaluation of collective decision making as described by (N , v, P) on P|i rather than P when i is known to support the decision. This suggests looking at the difference E P|i [v(S)] − E P [v(S)] as a way of quantifying i's effect on the outcome. And it is arguably of similar interest-and may yield a rather different quantification of the difference that i's decision makes-not to look at how much i's support increases the expected worth v(S) but at how much i's opposition lowers it, i.e., Combining these two evaluations of how knowledge of i's decision changes the expectation of the game by summing them, we obtain: Remark 1 In case that coalition membership is statistically independent for every i = j, i.e., if P is a product measure on 2 N , the equality P | i(S) = P | ¬i(S\i) holds whenever i ∈ S. Then equations (14), (15), and (18) all evaluate to the same number-for instance, to the Banzhaf value if P(S) ≡ 2 −|N | . That the "expectation of a difference" in (14) or (15) coincides with the "difference between two expectations" in (18), however, fails to hold in general. In particular, we will show in Corollary 2 that there is no probability distribution P which would allow the Shapley value to be interpreted as measuring informational importance.
Remark 2 The restriction of the prediction value to simple games has been identified by Häggström et al. (2006) as playing a key role in extending the Condorcet jury theorem (on asymptotically correct simple majority decisions by n statistically independent voters) to weighted majority decisions with arbitrary joint vote distributions. 11 Häggström et al. call the difference between expectations based on P|i and P|¬i the "effect" of voter i. Their observation that player i's effect can be interpreted as a normalized form of the correlation between i's vote and the random jury outcome extends straightforwardly to probabilistic TU games. Namely, writing 1 {i∈S} for the indicator function of event {i ∈ S} and p i = P({S : i ∈ S}), the covariance of v(S) and 1 {i∈S} is Neeman (2014) further extended the analysis to weighted plurality decisions.
Dividing by standard deviations σ v(S) and σ 1 {i∈S} = √ p i (1 − p i ), the PV and the correlation coefficient Corr(v(S), 1 {i∈S} ) can be seen to satisfy

Characterizing the prediction value
We will now provide an axiomatic characterization of the prediction value. We begin with two classical conditions that are part of many axiomatic systems in the literature on TU values. The first is anonymity, which requires that the indicated difference to the game that is ascribed to any player by an extended value does not depend on the labeling of the players. The second is linearity, which demands of an extended value that it is linear in the characteristic function component v of probabilistic games.

Definition 2 Consider two probabilistic games
Definition 3 An extended value ϕ is linear if for all (N , v, P), (N , v , P) ∈ PG and real constants α, β Linearity combines two properties, scale invariance and additivity. Especially the latter is far from being innocuous. 12 But linearity is frequently imposed on solution concepts for TU games; and the PV, as the difference of two expectations, embraces it rather naturally.
The third characteristic property of the PV concerns the way how the respective extended values of two games G and G compare when one can be viewed as a reduced form of the other.

Definition 4 Given
So, when one moves from a given probabilistic game G to the reduced game G −i , first, player i is removed from the set of players; second, the probabilities of all coalitions in G which only differ concerning i's presence are aggregated; and, third, the corresponding new worth v −i (S) of coalitions S ⊆ N −i is the convex combination of the associated old worths, v(S) and v(S ∪ i), weighted according to the respective probabilities under P. We will require that the extended value of any player j ∈ N −i stays unaffected by the removal of i. 14 One reason for why this consistency property could be desirable is the following. Suppose that the considered model is misspecified in the sense that a player of interest in the game is not taken into account by the rest of the players (or an outside observer). For instance, consider the situation of a voting game G = (N , v , P ), where the presence of a lobbyist i has been neglected. The more accurate model would include the lobbyist and be G = (N ∪ i, v, P). The effect of the lobbyist endorsing a proposal or opposing it would explicitly be captured by the probability distribution P: for example, voters with strong ties to i may be likely to vote the same way, while others behave oppositely. Coalitions S and S ∪ i which differ only in i's presence will consequently have very different P-probabilities depending on whether S includes i's fellow travelers or opponents. But if the probability P and value v of each coalition T ⊆ N in the 'misspecified' game without i are defined in a probabilistically correct way, i.e., if the misspecified game G equals G −i , then the assessment of any actor j = i should be unaffected by whether one considers G or G −i . Consistency can thus be seen as formalizing robustness to probabilistically correct misspecifications.

Proposition 1 The prediction value is anonymous, linear, and consistent.
Proposition 1 is not enough to fully characterize the PV. For instance, for every a, b ∈ R, the extended value ϕ (a,b) satisfies anonymity, linearity and consistency. 15 Our characterization in Theorem 1 below will use that if an extended value is linear and consistent, it is fully determined by its image for the subclass of all 2player probabilistic games. 16 The question then is how the extended values of 2-player probabilistic games should suitably be restricted.
Before giving an answer it is worth recalling two implications of i being part of the formed coalition: first, i's presence means that i contributes to the formed coalition her voting weight, productivity, etc. This reveals information about the expected worth directly. But, second, i's presence also affects the expected worth indirectly because it reveals information about the presence and contributions of other players if the behavior of N \i and of i are not statistically independent. In case of independence, i.e., if the presence of i ∈ N has no informational value according to P, and if moreover i is a null player in the TU-game (N , v), then a reasonable extended value can be expected to assign zero to i. If, in contrast, knowledge of the behavior of null player i does change the odds of a proposal being passed, then i has positive informational value.
For illustration, consider a voting game in which j is a dictator according to the rules formalized by v. Let the voting behavior of j be perfectly correlated with that of some other player i (formally a null player). Now note that it is not part of the model (N , v, P), which mathematically describes the rules of the collective decision body involving i and j and the random outcomes of coalition formation processes, why the votes of i and j always coincide. 'Null player' i might simply follow 'dictator' j in all his decisions. Alternatively, player i could be irrelevant merely from a formal perspective, i.e., have no say de jure; while it is her who imposes all her wishes on j-that is, she rules de facto. In either case the informational values of i and j are identical. They are also maximal (and could plausibly be normalized to, say, 1) in the sense that the outcome can be predicted perfectly when knowing that i or j votes yes or no.
We combine the requirement that an independent null player i should be assigned an extended value of zero with the requirement that i has a value of one in the considered perfect correlation case as follows: 17 Definition 6 An extended value ϕ satisfies the informational dummy-dictator property (IDDP) if for i ∈ N and |N | = 2 Regarding dictators themselves it makes sense to impose the following for 1-player probabilistic games: 16 Lemma 3 below allows to characterize all anonymous, linear and consistent extended values by setting ϕ i ({i, j}, u j , P) := f i j (P) and ϕ i ({i}, u i , P) := g i (P) for anonymous functions f, g. The latter can depend on P in arbitrary anonymous ways. This is why the properties in Proposition 1 do not yield an analogue to Weber's (1988)  This formalizes that if N consists of just a single player i ∈ N with v(i) = u i (i) = 1 then i's importance or the difference that i makes to this game should plausibly be evaluated as unity. Immediately from the definition of the PV we obtain.

Proposition 2 The prediction value satisfies full control and IDDP.
Remark 3 IDDP implies a positive extended value for a null player i even if i's behavior is imperfectly but still positively correlated with that of a dictator j. This is, e.g., the case when a yes-vote by i is made more likely by most other players voting yes, i.e., for the implicit probabilistic model behind the Shapley value. For a probabilistic game with a dictator where P reflects any Shapley value-like probabilistic assumptions, this means that PV and Shapley value φ will not coincide because the Shapley value of null players is zero.
We have the following characterization result: Theorem 1 There is a unique extended value ϕ which satisfies linearity, consistency, full control and IDDP. It is anonymous and ϕ ≡ ξ .
We note that the four properties in Theorem 1 are independent and non-redundant.

Lemma 1 (i) The extended value
with v = S⊆N α S · u S being the unique decomposition of v into unanimity games, satisfies linearity, consistency, full control but not IDDP. (ii) The extended value satisfies linearity, consistency, IDDP but not full control. (iii) The extended value satisfies linearity, full control, IDDP but not consistency. (iv) Let |N | ≥ 3 and v = S⊆N α S · u S . The extended value satisfies consistency, full control, IDDP but not linearity.

Relation between prediction value and probabilistic values
The example values discussed earlier (like φ, β, + , − ) all are probabilistic values, i.e., they have in common that they weight marginal contributions of a player by some probability measure. We already noted in Remark 1 that the natural extension of the Banzhaf value agrees with the prediction value if P(S) ≡ 2 −|N | . We now study the relationship between (extended) probabilistic values and the prediction value somewhat more generally. Following Weber (1988), the class of probabilistic values is characterized by (i) linearity, (ii) positivity: ϕ(N , v, P) ≥ 0 if v is monotonic and (iii) the dummy player property: The PV is not a probabilistic value. It is linear (see Proposition 1) but violates (ii) and (iii). 18 Theorem 2 Let be a probabilistic value as defined in (6). The identity (·, Q) ≡ ξ(·, P) holds for n > 1 if and only if there exist probabilities 0 <p i < 1 for each player such that holds for all S ⊆ N \i, i ∈ N and holds for all S ⊆ N .
The theorem shows that a probabilistic value is a PV if and only if it is a multinomial value gp, i.e., satisfies (11), for an interior vector of probabilities p = (p 1 , . . . ,p n ). This also clarifies the connection between semivalues and the prediction value. Namely, a semivalue f q can be interpreted as a PV if and only if f q is also a multinomial value gp. Identity f q ≡ gp and the fact that weightings Q i (S) can depend only on |S| for semivalues imply (i)p i =p j =: p for all i, j ∈ N in Theorem 2 and (ii) q satisfies (10) for p. So we obtain: Corollary 1 For a given semivalue f q and n > 1 there exists P such that f q (·) ≡ ξ(·, P) on G N if and only if f q is a binomial semivalue, i.e., q satisfies (10) for some p ∈ (0, 1). 19 Because the Shapley value φ is no binomial semivalue and differs from any such value on G N when |N | ≥ 3, we have: Corollary 2 For n ≥ 3 there exists no P such that φ(·) ≡ ξ(·, P) on G N . 18 See Remark 3 on the possibility of ξ i (N , v, P) > 0 for a null player i. Positivity is violated, e.g., for the dummy-dicator setting (26) when i and j always vote contrary such that P|i(i j) = 0 and P|¬i( j) = 1. 19 In some definitions in the literature the extreme cases p = 0 and p = 1 are allowed, too, with the convention 0 0 = 1. For p = 0 we would get the dictatorial index and for p = 1 the marginal index. See Owen (1978) for details. However, note that neither p = 0 nor p = 1 satisfy the conditions from Theorem 2.

Prediction values in the Dutch Parliament 2008-2010
As illustration of the prediction value's practical applicability and of how its informational importance indications can be very different from power ascriptions by traditional values, we consider the seat distribution and voting behavior in the Dutch Parliament between 2008 and 2010. This was the period of the left-centered Balkenende IV government, which consisted of Christian democrats from the CDA and Christen Unie parties and the social democratic PvdA.
The distribution of the 150 seats in parliament between its eleven parties is displayed in the top part of Table 1. The three government parties held a majority of 80 out of 150 seats. When voting on non-constitutional propositions, the Dutch Parliament applies simple majority rule. It is straightforward to define a voting game with this information, and to calculate the corresponding a priori Banzhaf and Shapley values β and φ.
We used the parliamentary information system Parlis 20 in order to extract information on members, meetings, votes and decisions on propositions in the 2008-2010 period. From the records of regular plenary voting rounds, where parties vote as blocks, we derived the empirical frequencies of the 2 11 conceivable divisions into yes and no-camps from 2720 observations. 21 Defining P by these empirical frequencies, we calculated the corresponding prediction values ξ i of the parties as well as their positive and negative conditional decisiveness values + i and − i defined in (14) and (15). A summary of the results is given in the bottom part of Table 1.
The PV-scores ξ i of Dutch parties tend to be higher than their respective traditional Banzhaf or Shapley power measures β i and φ i , and even the decisiveness measures + i and − i which incorporate the same empirical estimate of P. In particular, the prediction value ascribes rather substantial numbers also to small parties like D66, SGP, or Verdonk.
This reflects specificities of the political situation in the Netherlands and that the PV picks up corresponding correlations between the voting behavior of different parties. Varying majorities at calls are quite common in the Dutch Parliament. The member parties of the government do not necessarily vote the same way; some are frequently supported by smaller opposition parties. The correlation coefficients reported in Table 2 indicate, for instance, that SGP and D66 quite commonly voted the same way as CU and PvdA. Their PV numbers hence differ much less than their seat shares.
Verdonk and SP constitute an extreme case in this respect. The former is commonly considered as far right-wing, the latter as a far left-wing party; still both voted the same way at each call in the data set (presumably having different reasons). Perfect correlation of their votes implies that both have the same prediction value-despite SP having 25 seats and Verdonk but one: knowing either's vote in advance would have been equally valuable for predictive purposes. Measures based on marginal contributions, in 20 The data is available through http://data.appsvoordemocratie.nl. 21 We pooled all regular plenary votes in order to illustrate the simplest way in which data can be used to infer interdependencies in a voting body-one might want to split the data with respect to topics, or weight distinct calls by their importance, in actual political analysis. Note that the Dutch Parliament's chairperson assumes that parties vote as blocks unless some MP demands voting by call. Only then can members of the same party vote differently. We excluded such cases of 'non-coherent voting' from our analysis.  contrast, clearly favor SP over Verdonk (though less so if the a posteriori correlation between SP's and Verdonk's votes is ignored). Interestingly, the GL party has the second-highest prediction value: despite it not being in government and having only the sixth-largest seat share, support by GL was a better predictor of a bill's success than support by any except the biggest party (CDA).

Concluding remarks
Traditional semivalues like the Shapley or Banzhaf values and the prediction value provide two qualitatively distinct perspectives on the importance of the members of a collective decision body. One highlights the difference that an ad-hoc change of a given player i's membership in the coalition which eventually forms would make from an ex ante perspective; the other stresses the difference that the change of a player's presumed membership makes for one's ex ante assessment of realized worth. As the figures in Table 1 illustrate, both can differ widely in case players' behavior exhibits interdependencies. But, as formalized by Theorem 2, they coincide in case of statistical independence. The latter is presumed by the behavioral model underlying, e.g., the Banzhaf value, but incompatible with that underlying the Shapley value. For independent individual voting decisions, the conditioning on different votes of player i adds no behavioral information to the formal fact of i's weight contribution to either the yes or no camp. Then i's informational importance and i's voting power or influence-reflected by sensitivity of the collective decision to a last-minute change of i's behavior-are aligned. 22 We note that the prediction value does not distinguish between correlation and causation in cases of interdependence. For illustration, consider decisions by a weighted voting body in which some player i has zero weight but all other players' decisions are perfectly correlated with that of i. Player i's prediction value is then one irrespective of whether (i) players j = i 'follow' i and cast their weight as their supreme leader i would if he had any, (ii) i = k and all players j = k follow a specific other player k, or (iii) all players debate the merit of a proposal based on different initial inclinations and collective opinion dynamics converge to, for instance, the majority inclination. 23 Since knowing i's decision-rather than i's initial inclination-will always fully reveal the realized outcome, we regard finding ξ i = 1 in all three scenarios a feature, not a flaw.
However, this example points to an interesting extension of the described "difference of conditional expected values"-approach to measuring importance. Namely, start with a given description (N , v, P) of a decision body where P corresponds to, say, the Banzhaf uniform distribution and augment it by the formal description of a social opinion formation process which defines a mapping from players' binary initial voting inclinations to a distribution over final ones after social interaction. One can then capture a player i's combined social and formal influence in the decision body by answering the question: how much does knowing that i's initial inclination is in favor (or against) the given proposal modify the final outcome which is to be expected? We conjecture that this approach actually has advantages over extending marginal contribution-based analysis to social interaction, 24 and plan to pursue this extension in future research.
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Proof of Proposition 1
Anonymity and linearity of ξ are obvious from Definition 1. To prove consistency, consider (N , v, P) ∈ PG and let i ∈ N be dependent in v. Let j ∈ N \i and S ⊆ N \i j. Equations (12) and (24) imply that and By using the definition of v −i and invoking equality (33) one can verify that Similarly, by definition of v −i together with (34), we get One can then infer where the second equality uses (35) and (36), and the third one follows by shifting the corresponding terms from inside the square brackets to the outer summation.

Proof of Theorem 1
The proof proceeds in three steps. First, in Lemma 2 we prove for |N | = 2 that linearity and consistency imply that an extended value is determined by unanimity games. Second, we generalize this to all probabilistic games in Lemma 3. Finally, we show that the full control property and IDDP characterize the PV for 2-player probabilistic games and hence probabilistic games in general.

Lemma 2
Consider an extended value ϕ that is linear on the space of all 2-player probabilistic games and consistent. For any set N with |N | = 2, the mapping (N , v, P) → ϕ(N , v, P) is fully determined by the numbers Proof Let P be a fixed probability distribution on 2 N with N = {i, j}. The set of unanimity games {u i , u j , u i j } forms a basis for the space of all TU games on N . In particular, for any (N , v) And thus, for arbitrary P and i ∈ N , ϕ's linearity implies We need to show that x i,i j and x j,i j are fully determined by x ii and x i j . To see this, notice first that both players are dependent in (N , u i j , P). So we may consider the reduced game obtained by j's removal, which involves N − j = {i} and In case P(i) + P(i j) > 0, we have where the first equality invokes consistency, the second linearity, and the third one exploits that ({i}, u i , P − j ) is the reduction of (N , u i , P) by player j and again consistency. When P(i) = P(i j) = 0 we have ϕ i (N , u i j , P) = 0 because in this case (u i j ) − j ({i}) = 0 by Definition 4, so that (u i j ) − j is the all-zero game 0 in that case.
In summary, And in a similar fashion one obtains For any v ≡ α i u i + α j u j + α i j u i j we have and an analogous expression for ϕ j (N , v, P). This finding can be generalized from just two players to arbitrary N : Lemma 3 Let ϕ be a consistent and linear extended value. Then the mapping (N , v, P) → ϕ(N , v, P) is fully specified by the parameters in (37).
Proof Using the n-player unanimity games as a basis for PG N one can always write Letting i ∈ N be an arbitrary but fixed player, we will use induction on n in order to prove the following Claim: There exist β i j , depending on the α T and P, such that The claim is obvious for a single player and was proven for |N | = 2 in Lemma 2.
In view of linearity, it suffices to prove the statement for unanimity games u T , where nothing needs to be shown when the cardinality of T is one. So we consider |N | ≥ 3, |T | ≥ 2 and assume that the statement is true for all player sets N of cardinality n − 1. Let j ∈ N \i be a player, which must be dependent in u T because |T | ≥ 2. Now we consider the reduced game (N − j , (u T ) − j , P − j ). From consistency we conclude Applying the induction hypothesis implies the existence of β ik , which depend on P − j and hence on P, such that Since (u k ) − j = u k the reduced game of (N , u k , P) is given by We remark that the coefficients β i j referred to in the above proof get quite complicated for increasing n. In the following we will use only the fact that they are well-defined given v and P.
Proof of Theorem 1. To complete the proof we now show how the values x ii = ϕ i (N , u i , P) and x i j = ϕ i (N , u j , P) can be computed from the corresponding values for the player set N = {i, j}. Since (u i ) − j = u i for all i = j we can recursively conclude from consistency where Using equation (49) and similarly defining we conclude ϕ i ({i}, u i , P ) = ϕ i ({i, j}, u i , P ) from consistency. Thus, the full control property, in connection with consistency and linearity, implies x ii = 1 for all player sets N (containing player i). If ϕ satisfies IDDP the values of x i j are determined, and hence ϕ is determined on the class of 2-player probabilistic games. Then ϕ ≡ ξ follows from Lemma 3. Finally note that the full control property and IDDP do not depend on the labeling of the players, which implies anonymity.

Proof of Lemma 1
(i) Linearity is inherited from the expected value E P|i . To see consistency we first verify that E P − j |i ([u S ] − j ) = E P|i (u S ) for any probability measure P. If U ⊆N \ j, i∈U P − j (U ) = 0, then both sides of the equation are equal to zero. So, we may assume U ⊆N \ j, i∈U P − j (U ) > 0 in the following. For the reduced Inserting (25) provides which confirms consistency of ϕ 1 . Full control follows from ϕ 1 ({i}). This is 1 if P({i}) > 0 and otherwise 0 by the definition of P|i.
In remains to be shown that ϕ 1 does not satisfy IDDP. For the unanimity game u j note that u j = S⊆N α S · u S provides α S = 1 for S = { j} , 0 otherwise.
However, the summation in (27) is over all S with i ∈ S such that in contradiction to (26).
(ii) ϕ 2 inherits linearity and consistency from ξ and ϕ 1 . Inserting (51) into (28) provides such that ϕ 2 i inherits IDDP from ξ i . As both ξ and ϕ 1 satisfy full control we get in contradiction to Definition 7.
(iii) Linearity is obvious. For |N | ≤ 2 the extended value ϕ 3 is identical to the PV and the latter satisfies full control and IDDP. For a counterexample to consistency consider a game G − j = (N , v, P) with |N | = 3 and perfect correlation P(N ) = 1/2 = P(∅). Here, However, for the reduced game For ϕ 3 follows which contradicts (52).
(iv) Consider the reduced game G − j = (N − j , v − j , P − j ). PV is consistent and therefore We conclude

Proof of Theorem 2
The proof is based on three insights, stated in Lemmas 4-6. (1 − p j ) for all ∅ = S ⊆ N . By using S⊆N P(S) = 1 we conclude that equation (53) is also valid for the empty set and thus for all subsets of N . Lemma 4 and a short calculation gives also the first formula of the proposed statement.
To verify that the converse holds as well let 0 < p i < 1 be given for all i ∈ N and define P(S) = j∈S p j · j∈N \S 1 − p j , i.e., P is a product measure. Next set for i ∈ N \S, i.e. the Q i (S ∪ i) derive from the same product measure. We can easily verify P|i(S) = P|¬i(S\i) for all S i and all i ∈ N . Inserting this into the definition of the prediction value provides ξ(·, P) = (·, Q).