1 Introduction

Since the EPR correlations and the philosophical issues concerning them have been described by (San Pedro 2012), to which paper this note is a technical follow-up, I will be very brief here. In the setup with which we are concerned two electrons are emitted in opposite directions. Each electron from a given pair proceeds towards a detector which can be set to measure the electron’s spin along an axis chosen from a previously determined set of directions (typically from 2 or 3 options); the results of the measurement are binary: either “spin-up” or “spin-down”. Very roughly, the EPR correlations consist in the fact that while the probability of each result for any measurement direction in any of the two wings of the experiment is one-half, the probabilities of obtaining pairs of results (one from each wing) typically differ from one-fourth. This is puzzling due to the spatiotemporal features of the setup which suggest various independencies: the outcome at one wing cannot influence the outcome in the other wing, and the choice of measurement setting at one wing cannot influence the outcome in the other wing.

The famous 1964 result of Bell shows that one cannot account for the correlations by positing a hidden variable subject to certain formal conditions which are supposed to formulate the natural independencies like the ones described above. There have been many attempts to formulate alternative explanations by means of various constructs subject to various independence relations; one usual relation of this sort is that of no conspiracy, or measure independence: the posited hidden variable should be statistically independent from the measurement settings. It is to this field that San Pedro makes a contribution. Apart from philosophical points about backwards causation, which will not concern me here, he offers a conjecture; namely, that whenever measurement independence is violated, the so called parameter independence condition—which says that if we fix the value of the posited hidden factor the chance for obtaining a given result in one wing should not depend on the measurement setting in the other wing—is also violated. San Pedro offers a mathematical argument supporting this conjecture. In the following sections I will present a method of constructing counterexamples to the conjecture and point to a flaw in San Pedro’s argument.

2 The formal setup

San Pedro’s notational conventions dictate that we use:

  • L i for the event of the detector in the left wing being set to the axis i;

  • R j for the event of the detector in the right wing being set to the axis j;

  • \({L_{i}^{a}}\) for the event of the detector in the left wing, set to the axis i, displaying the result a;

  • \({R_{j}^{b}}\) for the event of the detector in the right wing, set to the axis j, displaying the result b;

where i,j∈{1,2,3} and a,b∈{+,−}.

For the hidden causal factors San Pedro proposes to use the notation \(C_{ij}^{ab}\), with i,j,a,b as before. Now, due to the difference between “common common causes” and “separate common causes” (see e.g. Hofer-Szabó et al. 2013) it is currently standard to require different causal factors screening off correlations at different measurement settings. However, San Pedro aims to go one step further, and require that at the measurement setting L i R j , the event \(C_{ij}^{ab}\) should screen off the correlation between \({L_{i}^{a}}\) and \({R_{j}^{b}}\) and none of the other three (anti-)correlations. That is, for example, \(C_{13}^{++}\) should screen off \(L_{1}^{+}\) from \(R_{3}^{+}\), but not \(L_{1}^{+}\) from \(R_{3}^{-}\).Footnote 1

This seems to be a mistake. Any event screening off one correlation at a given combination of measurement settings screens off all the other correlations at the same pair of measurement settings. Let me show an example of this (the argument is similar in all other cases): if \(C_{13}^{++}\) screens off \(L_{1}^{+}\) from \(R_{3}^{+}\), then it also screens off \(L_{1}^{+}\) from \(R_{3}^{-}\).

Assume that (*) \(P(L_{1}^{+} \wedge R_{3}^{+} | L_{1} \wedge R_{3} \wedge C_{13}^{++}) = P(L_{1}^{+} | L_{1} \wedge R_{3} \wedge C_{13}^{++}) \cdot P(R_{3}^{+} | L_{1} \wedge R_{3} \wedge C_{13}^{++})\). Now, \(P(L_{1}^{+} \wedge R_{3}^{-} | L_{1} \wedge R_{3} \wedge C_{13}^{++}) = P(L_{1}^{+} | L_{1} \wedge R_{3} \wedge C_{13}^{++}) - P(L_{1}^{+} \wedge R_{3}^{+} | L_{1} \wedge R_{3} \wedge C_{13}^{++})\). Due to (*), this is equal to \(P(L_{1}^{+} | L_{1} \wedge R_{3} \wedge C_{13}^{++}) - P(L_{1}^{+} | L_{1} \wedge R_{3} \wedge C_{13}^{++}) \cdot P(R_{3}^{+} | L_{1} \wedge R_{3} \wedge C_{13}^{++})\), which in turn equals \(P(L_{1}^{+} | L_{1} \wedge R_{3} \wedge C_{13}^{++}) \cdot \left (1 - P(R_{3}^{+} | L_{1} \wedge R_{3} \wedge C_{13}^{++})\right )\). The last expression is just \(P(L_{1}^{+} | L_{1} \wedge R_{3} \wedge C_{13}^{++}) \cdot P(R_{3}^{-} | L_{1} \wedge R_{3} \wedge C_{13}^{++})\). Putting all this together, we get the result that \(P(L_{1}^{+} \wedge R_{3}^{-} | L_{1} \wedge R_{3} \wedge C_{13}^{++}) = P(L_{1}^{+} | L_{1} \wedge R_{3} \wedge C_{13}^{++}) \cdot P(R_{3}^{-} | L_{1} \wedge R_{3} \wedge C_{13}^{++})\). And so the screening off we wanted to establish holds.

San Pedro’s notation seems to be redundant, then. There is no great harm in keeping it, though; we will simply consider the existence of various screening-off factors for the correlations at a given pair of measurement settings.

We can now formulate the well-known independence conditions. The screening off condition is also called outcome independence (OI, p. 147):

$$ P({L_{i}^{a}} \wedge {R_{j}^{b}} | L_{i} \wedge R_{j} \wedge C_{ij}^{ab}) = P({L_{i}^{a}} | L_{i} \wedge R_{j} \wedge C_{ij}^{ab}) \cdot P({R_{j}^{b}} | L_{i} \wedge R_{j} \wedge C_{ij}^{ab}) $$
(OI)

The other two requirements are parameter independence (PI, p. 153) and measurement independence (MI, p. 147; also called no conspiracy), each of which consists of a symmetric pair of conditions:

$$ P({L_{i}^{a}} | L_{i} \wedge C_{ij}^{ab}) = P({L_{i}^{a}} | L_{i} \wedge R_{j} \wedge C_{ij}^{ab}) $$
(PI)
$$ P({R_{j}^{b}} | R_{j} \wedge C_{ij}^{ab}) = P({R_{j}^{b}} | L_{i} \wedge R_{j} \wedge C_{ij}^{ab}) $$
(PI)
$$ P(C_{ij}^{ab} \wedge L_{i}) = P(C_{ij}^{ab}) \cdot P(L_{i}) $$
(MI)
$$ P(C_{ij}^{ab} \wedge R_{j}) = P(C_{ij}^{ab}) \cdot P(R_{j}) $$
(MI)

Notice that the notion of measurement independence used by San Pedro is relatively weak: it does not require the independence of \(C_{ij}^{ab}\)’s and pairs of measurement settings, and neither the independence of measurement settings and Boolean combinations of \(C_{ij}^{ab}\)’s (which are non-trivial for different pairs of measurement settings). Since I am commenting on a conjecture by San Pedro, I will keep his nomenclature.

Apart from the disjointness assumption, i.e. that with fixed i and j, for ac, bd, \(C_{ij}^{ab} \cap C_{ij}^{ad} = C_{ij}^{ab} \cap C_{ij}^{cb} = \emptyset \), on p. 146 San Pedro sets down another requirement for \(C_{ij}^{ab}\), from which it follows that

$$ C_{ij}^{ab} \subset L_{i} \cap R_{j} $$
(SC)

which I will call the subset condition.Footnote 2

I will not discuss the philosophical advantages or disadvantages of the subset condition, but will notice that, by mere mathematics, it renders San Pedro’s conjecture less interesting that it (I think) would be if the condition was removed (I consider this option in Sections 3.1 and 3.2).

First, San Pedro himself notices (p. 147) that his \(C_{ij}^{ab}\)’s violate MI. Note that it is due to the subset condition and nothing else. It is elementary that if AB and P(B)≠1, then P(A)⋅P(B)≠P(AB). The subset condition gives us \(C_{ij}^{ab} \subset L_{i}\), and since there is more than one possible measurement setting in the left wing of the experiment, P(L i )≠1. Therefore \(P(C_{ij}^{ab} \wedge L_{i}) \neq P(C_{ij}^{ab}) \cdot P(L_{i})\), that is, MI is violated.

Second, notice that due to the subset condition (and, again, nothing else) PI universally holds. This is because it follows immediately from the subset condition that \(L_{i} \cap C_{ij}^{ab} = L_{i} \cap R_{j} \cap C_{ij}^{ab}\) (as both sides are identical to \(C_{ij}^{ab}\)). Therefore \( P({L_{i}^{a}} | L_{i} \wedge C_{ij}^{ab}) = P({L_{i}^{a}} | L_{i} \wedge R_{j} \wedge C_{ij}^{ab})\); similarly for the symmetric condition. Therefore PI always holds.

3 The conjecture and the argument

What is the conjecture, then? To quote San Pedro (p. 152):

Conjecture 1

If measurement independence is violated then parameter independence is also violated.

By the light of the previous section, we have to say that the conjecture is simply false. If we use San Pedro’s definition, we cannot even contemplate a violation of PI.

At this point one may be tempted to consider an alternative formulation of parameter independence; namely, for jk and hi:

$$ P({L_{i}^{a}} | L_{i} \wedge R_{k} \wedge C_{ij}^{ab}) = P({L_{i}^{a}} | L_{i} \wedge R_{j} \wedge C_{ij}^{ab}) (PI\prime) $$
(PI’)
$$ P({R_{j}^{b}} | L_{h} \wedge R_{j} \wedge C_{ij}^{ab}) = P({R_{j}^{b}} | L_{i} \wedge R_{j} \wedge C_{ij}^{ab}) (PI\prime) $$
(PI’)

Notice, though, that due to the subset condition (and the fact that jk and hi) \(L_{i} \cap R_{k} \cap C_{ij}^{ab} = L_{h} \cap R_{j} \cap C_{ij}^{ab} = \emptyset \), so \(P(L_{i} \wedge R_{k} \wedge C_{ij}^{ab})=P(L_{h} \wedge R_{j} \wedge C_{ij}^{ab})=0\), and thus the left-hand side probabilities in the PI’ conditions are undefined. So PI’ universally fails. On this formulation of parameter independence, the conjecture is true, but not interesting, akin to an implication with a previously known consequent. And again, the subset condition is the culprit. (From now on I will stick to PI as opposed to PI’, since it is the formulation used by San Pedro.)

I suspect San Pedro himself had second thoughts about the subset condition, since he offered an argument in support of the conjecture, the falsity of which we have seen to follow directly from the subset condition. It might be interesting, then, to consider the conjecture without assuming SC.

3.1 Without the subset condition: flaw in the argument

Before we tackle the conjecture itself, let us look at the argument for it given by San Pedro on p. 153. Consider the following excerpt:

A violation of measurement independence entails that (...)

$$ P(C_{ij}^{ab} \wedge L_{i} ) \neq P(C_{ij}^{ab}) \cdot P(L_{i}), $$
(18)
$$ P(C_{ij}^{ab} \wedge R_{j} ) \neq P(C_{ij}^{ab}) \cdot P(R_{j}), $$
(19)

which entails that

$$ P(C_{ij}^{ab} \wedge L_{i} \wedge R_{j}) \neq P(C_{ij}^{ab}) \cdot P(L_{i} \wedge R_{j}), $$
(20)

as long as we assume that L i and R j are probabilistically independent.

Notice again that in the presence of SC (20) would need no argument. If we forget about SC, though, the entailment does not hold: it is possible to make (18) and (19) true while making (20) false, while keeping L i and R j probabilistically independent. Consider the probability space with Ω=[0,1]×[0,1], the event algebra \(\mathcal {F}\) being the smallest σ−algebra containing all “rectangles” of the form “a Borel subset of [0,1] × a Borel subset of [0,1]”, and the measure being the unique extension to the whole \(\mathcal {F}\) of the function P r defined on the rectangles so that P r(A×B)=L(A)⋅L(B), where L is the Lebesgue measure. (In other words, consider a space defined over a square of area 1, with the probability of its measurable subsets being their area.) Now define the following:Footnote 3

$$\begin{array}{@{}rcl@{}} L_{1} & = & [0,1] \times [{1}/{2},1]; \\ L_{2} & = & {\Omega}\, \backslash\, L_{1}; \\ R_{1} & = & [0, {1}/{2}] \times [0,1]; \\ R_{2} & = & {\Omega}\, \backslash\, R_{3}; \\ C_{12}^{++} & = & [0, {1}/{2}] \times [{1}/{4}, {1}/{2}] \cup [0, {1}/{2}] \times [{7}/{8}, 1] \cup \\ && \cup \; ({1}/{2}, 1] \times [{1}/{8}, {1}/{2}] \; \cup \; ({1}/{2}, 1] \times [{3}/{8}, 1]. \end{array} $$

The Reader may check that in this situation, after substituting L 1 for L i , R 2 for R j , and \(C_{12}^{++}\) for \(C_{ij}^{ab}\), inequalities (18) and (19) hold, “ L i and R j are probabilistically independent”, but inequality (20) is false.

Of course, San Pedro only intended the argument to inspire more confidence in the truth of the conjecture, without giving an exact proof (he would not call it a “conjecture” in that case!); nonetheless, I think it is fitting when arguing against a conjecture supported by an argument to also point to a flaw in the argument. I can now turn to presenting the method of constructing counterexamples to San Pedro’s conjecture.

3.2 Without the subset condition: falsity of the conjecture

In this section I will show how, given a probability space modelling some set of EPR correlations, that is, containing the events L i , R j , \({L_{i}^{a}}\) and \({R_{j}^{b}}\), to add to it events \(C_{ij}^{ab}\) so that MI is violated while PI is satisfied. The construction will also satisfy OI. For an easier argument (saving us from an additional argument by cases), assume the (anti-)correlations are not perfect, i.e. that any combination of \({L_{i}^{a}} \wedge {R_{j}^{b}}\) occurs with a non-zero probability.

We can without loss of generality assume the probability space is atomless; that is, given an event A with P(A)>0, we can always find an event B such that BA and P(A)>P(B)>0. We do not lose any generality because any classical probability space can be embedded in an atomless classical probability space (see chapter 4 of Wroński 2010for a short proof).

I will also again assume that we have two possible settings at each of the detectors. This is just for brevity, the reasoning goes through for any finite number of settings. I will label the directions for the left wing “1” and “2”, and the other two “3” and “4”. Therefore the events of selecting the measurement directions are L 1, L 2, R 3 and R 4. We can think of the sample space as divided into four “quadrants” of the form L i R j . So to define each \(C_{ij}^{ab}\) we need to set its intersection with each quadrant, making sure the resulting event satisfies OI and PI. For example, for any a,b∈{+,−}, \(C_{13}^{ab}\) should screen off all the (anti-)correlations between \({L_{1}^{c}}\) and \({R_{3}^{d}}\) (for any c,d∈{+,−}); also, the chance of \({L_{1}^{c}}\) should be the same conditional on \(L_{1} \wedge C_{13}^{ab}\) as conditional on \(L_{1} \wedge R_{3} \wedge C_{13}^{ab}\), and so on.

The key observation is the following: for any \(C_{ij}^{ab}\), making sure it satisfies OI and PI only requires paying attention to its intersection with three quadrants. For example, in the case of \(C_{13}^{++}\), we need to specify its intersection with L 1R 3 (for OI to work) and with L 1R 4 and L 2R 3 (for PI to work). Its intersection with the last remaining quadrant, L 2R 4, is irrelevant for the purposes of establishing OI and PI. Once the intersections of \(C_{13}^{++}\) with three quadrants are fixed, MI might already be violated. But if it is not, then there is at most one number fit for the probability of \(P(C_{13}^{++} \wedge L_{2} \wedge R_{4})\) if MI is to be sustained; label the number r. But since we are (due to atomlessness) at our complete liberty when it comes to choosing the probability of \(P(C_{13}^{++} \wedge L_{2} \wedge R_{4})\), we can choose the event so that its probability does not equal r, therefore violating MI. (In all but some very exceptional casesFootnote 4 setting the intersection \(C_{13}^{++} \cap L_{2} \cap R_{4}\) to be the empty set will be enough to the trick.) And so what is really needed is only the way of defining \(C_{13}^{++}\)’s intersection with the three abovementioned quadrants so that OI and PI are satisfied.

This we will achieve in the following way. For each pair of detector settings i,j we will consider four events: \(C_{ij}^{++}\), \(C_{ij}^{-+}\), \(C_{ij}^{+-}\), and \(C_{ij}^{--}\). The intuitive meaning is that \(C_{ij}^{ab}\) should guaranteeFootnote 5, if we set the left detector to i and the right detector to j, the occurence of \({L_{i}^{a}} \wedge {R_{j}^{b}}\). In the particular case of \(C_{13}^{++}\) we will achieve this (which means that OI is satisfied), as well as the satisfaction of PI, by setting the intersection of \(C_{13}^{++}\) with the three relevant quadrants in the following way:

$$\begin{array}{@{}rcl@{}} C_{13}^{++} \cap L_{1} &:=& \left(L_{1}^{+} \cap R_{3}^{+}\right) \cup \left(L_{1}^{+} \cap R_{4}^{+}\right), \\ C_{13}^{++} \cap L_{2} \cap R_{3} &:=& L_{2}^{+} \cap R_{3}^{+}. \end{array} $$

With \(C_{13}^{+-}\) we set the intersections as follows:

$$\begin{array}{@{}rcl@{}} C_{13}^{+-} \cap L_{1} &:=& \left(L_{1}^{+} \cap R_{3}^{-}\right) \cup \left(L_{1}^{+} \cap R_{4}^{-}\right), \\ C_{13}^{+-} \cap L_{2} \cap R_{3} &:=& L_{2}^{+} \cap R_{3}^{-}, \end{array} $$

with the remaining two cases dealt with analogously. It is elementary that the \(C_{13}^{ab}\)’s defined in this way satisfy OI and PI no matter what their intersection with L 2R 4 is.

We now need to ensure the violation of MI. Notice that MI requires that

$$P(C_{13}^{++}) \cdot P(L_{2}) = P(C_{13}^{++} \wedge L_{2} \wedge R_{3}) + P(C_{13}^{++} \wedge L_{2} \wedge R_{4}),$$

which we can transform into

$$P(C_{13}^{++} \wedge L_{2} \wedge R_{4}) = \frac{ \left(P(C_{13}^{++} \wedge R_{3})+ P(C_{13}^{++} \wedge L_{1} \wedge R_{4})\right)\cdot P(L_{2})- P(C_{13}^{++} \wedge L_{2} \wedge R_{3})}{P(L_{1})}\hfill \ast\ast $$

Notice that the values of all the terms on the right hand side of the equality are known. Label the value of the fraction r. We now know that if we set the intersection \(C_{13}^{++} \cap L_{2} \cap R_{4}\) so that its probability is different than r, MI will be violated. Thanks to atomlessness, there are infinitely many ways of doing so.Footnote 6

At this point we have secured the violation of MI. As I have already stated, all \(C_{13}^{ab}\)’s satisfy OI and PI. Now all that needs to be done is to repeat the above construction for \(C_{14}^{ab}\)’s, \(C_{23}^{ab}\), and \(C_{24}^{ab}\), changing the role of the quadrants accordingly.

Notice also that, for any i and j, nothing prevents us from setting the events in the “irrelevant” quadrant so that the \(C_{ij}^{ab}\)’s form a partition of the sample space.

We now know that if we remove the subset condition, the conjecture is false. I have presented the construction for imperfect (anti-)correlations, but it should be evident how to transform it into one fit for that special case.

4 Conclusions

In the previous section I argued that San Pedro’s conjecture, taken as a mathematical statement (since its proponent himself gives a mathematical argument in its support) is false. I showed that if we keep San Pedro’s definitions, the conjecture fails immediately; while if we remove one of the conditions, it is still possible to provide a method for obtaining (infinitely many) counterexamples. Now, the counterexamples are mathematical constructions. It would be interesting if there were philosophical reasons for rejecting them, especially if some of the reasons could arguably result in additional formal requirements which my construction fails to satisfy. I would welcome such a development; as it stands, though, San Pedro’s conjecture is false.