Skip to main content
Log in

Interventionist counterfactuals

  • Published:
Philosophical Studies Aims and scope Submit manuscript

Abstract

A number of recent authors (Galles and Pearl, Found Sci 3 (1):151–182, 1998; Hiddleston, Noûs 39 (4):232–257, 2005; Halpern, J Artif Intell Res 12:317–337, 2000) advocate a causal modeling semantics for counterfactuals. But the precise logical significance of the causal modeling semantics remains murky. Particularly important, yet particularly under-explored, is its relationship to the similarity-based semantics for counterfactuals developed by Lewis (Counterfactuals. Harvard University Press, 1973b). The causal modeling semantics is both an account of the truth conditions of counterfactuals, and an account of which inferences involving counterfactuals are valid. As an account of truth conditions, it is incomplete. While Lewis’s similarity semantics lets us evaluate counterfactuals with arbitrarily complex antecedents and consequents, the causal modeling semantics makes it hard to ascertain the truth conditions of all but a highly restricted class of counterfactuals. I explain how to extend the causal modeling language to encompass a wider range of sentences, and provide a sound and complete axiomatization for the extended language. Extending the truth conditions for counterfactuals has serious consequences concerning valid inference. The extended language is unlike any logic of Lewis’s: modus ponens is invalid, and classical logical equivalents cannot be freely substituted in the antecedents of conditionals.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Notes

  1. Although this is the standard account of the relationship between selection and truth at a point, it is not the the only possible account. We might say instead that \(\phi \,\square{\kern-4.1pt}\rightarrow\, \psi\) is true at α if ψ is true at every point in f(ϕ, α), false at α if ψ is false at every point in f(ϕ, α), and indeterminate otherwise. We would then need a three-valued logic for assigning truth values to compounds of counterfactuals. Or we might say that \(\phi \,\square{\kern-4.1pt}\rightarrow\, \psi\) is true at α if ϕ is true at most points in f(ϕ α), and false otherwise. For simplicity’s sake, I will ignore these alternatives and assume the standard account.

  2. To ensure that we can use the concept of closeness to define a selection function, we need the Limit Assumption, which forbids infinite sequences of closer and closer ϕ worlds to w with none closest. I will assume that the Limit Assumption holds; see Stalnaker (1981a, pp. 96–98) for a defense of it.

  3. See Woodward (2003, pp. 98–114) for a detailed discussion of the concept of an intervention.

  4. One can make pragmatic use of either approach without reading it as a complete account of the truth conditions of counterfactuals. For instance, one might think the similarity account is heuristically useful, but mistaken whenever it conflicts with the causal modeling account. Or one might accept (Hall 2007, p. 110)’s claim that the causal modeling framework is “a useful means for selectively representing aspects of an antecedently understood counterfactual structure”, but does not provide truth conditions for all counterfactuals of interest. In this article, however, I take both accounts at face value.

  5. Lewis (1986b, p. 111) glosses Human Supervenience this way: “if two worlds match perfectly in all matters of particular fact, they match perfectly in all other ways too—in modal properties, laws, causal connections, chances.”

  6. Some versions of the similarity account also limit the range of permissible antecedents. One way of defining the selection function, proposed by Lewis (1979), involves mapping each antecedent A onto a stretch of time t A , where A “is entirely about” events that happen in t A . Another type of proposal, defended by Schaffer (2004) and Edgington (2004), defines the selection function in terms of causal independence. Not all propositions are about particular stretches of time, and not all propositions describe events that are eligible to stand in relations of causal dependence or independence. Where A 1 and A 2 are each about particular stretches of time, A 1A 2 need not be. Likewise, where A 1 and A 2 each describe events that stand in relations of causal dependence and independence, A 1A 2 need not.

  7. I use the axiomatization in (Lewis 1981), since it is the one that has been taken up in the causal modeling literature.

  8. Astute readers will notice similarities between my counterexample to modus ponens and the counterexamples famously proposed by McGee (1985). McGee formulates counterexamples for both an indicative interpretation of the conditional—in which points are probability functions, truth at a point for nonconditional sentences is equated with high probability, and f(AP) is the probability function that results from conditionalizing P on A—and a counterfactual interpretation of the conditional—where the semantics agrees with Lewis’s in all relevant respects except centering. On McGee’s two interpretations of the conditional, and on mine, whenever A is true at a point α, f(A, α) must contain a point β that assigns the same truth values to all nonconditional sentences as α. But the features of β that ground the selection function are slightly different from the features of α that ground the selection function; thus α and β assign different truth values to conditionals. Although modus ponens is valid for conditionals with nonconditional consequents, it is not valid for conditionals in general.

  9. I’ve made few modifications to Fine’s rules. I appeal only to verification conditions, whereas Fine appeals to both verification conditions and falsification conditions. And I’ve added verification conditions for atomic sentences and the material conditional.

  10. I owe much to the axiomatization given by Halpern (2000). The first eight axioms are similar in spirit to Halpern’s axioms; the last three concern counterfactuals with complex antecedents and consequents, and are entirely my own.

  11. (CM9) ensures that CM validates the controversial rule Simplification of Disjunctive Antecedents (SDA) \((\phi \vee \psi) \,\square{\kern-4.1pt}\rightarrow\, \theta \vdash (\phi \,\square{\kern-4.1pt}\rightarrow\, \theta) \wedge (\psi \,\square{\kern-4.1pt}\rightarrow\, \theta). \) Although Nute (1978) defends SDA, most authors reject the principle on the grounds that it leads to undesirable consequences when taken together with the substitution of classical equivalents (see Fine 1975; see Loewer 1976). But the concept of exact equivalence gives us independent reason to reject the substitution of classical equivalents.

References

  • Edgington, D. (2004). Counterfactuals and the benefit of hindsight. In: P. Dowe & P. Noordhof (Eds.), Cause and chance: Causation in an indeterministic world. Routledge.

  • Elga, A. (2001). Statistical mechanics and the asymmetry of counterfactual dependence. Philosophy of Science, 68S, S313–S324.

    Article  Google Scholar 

  • Fine, K. (1975). Critical notice. Mind, 84(1), 451–458.

    Article  Google Scholar 

  • Fine, K. (forthcoming). Counterfactuals without possible worlds. The Journal of Philosophy.

  • Fodor, J. (1974). Special sciences, or the disunity of science as a working hypothesis. Synthese, 28, 97–115.

    Article  Google Scholar 

  • Galles, D., & Pearl, J. (1998). An axiomatic characterization of causal counterfactuals. Foundations of Science, 3(1), 151–182.

    Article  Google Scholar 

  • Gundersen, L. B. (2004). Outline of a new semantics for counterfactuals. Pacific Philosophical Quarterly, 85(1), 1–20.

    Article  Google Scholar 

  • Hall, N. (2007). Structural equations and causation. Philosophical Studies, 132(1), 109–136.

    Article  Google Scholar 

  • Halpern, J. (2000). Axiomatizing causal reasoning. Journal of Artificial Intelligence Research, 12, 317–337.

    Google Scholar 

  • Handfield, T., Twardy, C. R., Korb, K. B., & Oppy, G. (2008). The metaphysics of causal models: Where’s the biff? Erkenntnis, 68(2), 149–68.

    Article  Google Scholar 

  • Hawthorne, J. (2005) Chance and counterfactuals. Philosophy and Phenomenological Research, 70(2), 396–405.

    Article  Google Scholar 

  • Hiddleston, E. (2005). A causal theory of counterfactuals. Noûs, 39(4), 232–257.

    Google Scholar 

  • Hitchcock, C. (2001). The intransitivity of causation revealed in arrows and graphs. The Journal of Philosophy, 98(6), 273–299.

    Article  Google Scholar 

  • Hüber, F. Structural equations and beyond. manuscript.

  • Kment, B. (2006). Counterfactuals and explanation. Mind, 115(458), 261–310.

    Article  Google Scholar 

  • Krasner, D., & Heller, M. (1994). The miracle of counterfactuals: Counterexamples to lewis’s world ordering. Philosophical Studies, 76(1), 27–43.

    Article  Google Scholar 

  • Lewis, D. (1973a). Causation. The Journal of Philosophy, 70(17), 556–567.

    Article  Google Scholar 

  • Lewis, D. K. (1973b). Counterfactuals. Harvard University Press.

  • Lewis, D. (1979) Counterfactual dependence and time’s arrow. Noûs, 13(4), 455–476.

    Article  Google Scholar 

  • Lewis, D. (1981). Counterfactuals and comparative possibility. In W. L. Harper, R. Stalnaker, & G. Pearce (Eds.), Ifs, volume 15 of The University of Western Ontario series in philosophy of science (pp. 57–85). Dordrecht: D. Reidel.

  • Lewis, D. (1986a). Introduction. In: Philosophical papers: Volume II. New York: Oxford University Press.

  • Lewis, D. (1986b). A subjectivist’s guide to objective chance. In Philosophical papers (Vol. II). New York: Oxford University Press.

  • Lewis, D. (1994). Humean supervenience debugged. Mind, 103, 473–490.

    Article  Google Scholar 

  • Loewer, B. (1976). Counterfactuals with disjunctive antecedents. The Journal of Philosophy, 73(16), 531–537.

    Article  Google Scholar 

  • McGee, V. (1985). A counterexample to modus ponens. The Journal of Philosophy, 82(9), 462–471.

    Article  Google Scholar 

  • Menzies, P. (2004). Causal models, token causation, and processes. Philosophy of Science, 71(5):820–832.

    Article  Google Scholar 

  • Nute, D. (1978). Simplification and substitution of counterfactual antecedents. Philosophia, 7(2), 317–325.

    Article  Google Scholar 

  • Pearl, J. (2000). Causality: Models, reasoning, and inference. Cambridge: University Press.

    Google Scholar 

  • Penczek, A. (1997). Counterfactuals with true components. Erkenntnis, 46(1), 79–85.

    Article  Google Scholar 

  • Sartorio, C. (2006). Disjunctive causes. Journal of Philosophy, 103

  • Schaffer, J. (2004). Counterfactuals, causal independence and conceptual circularity. Logic and Analysis, 64(4), 299–308.

    Google Scholar 

  • Stalnaker, R. (1981a). A defense of conditional excluded middle. In W. L. Harper, R. Stalnaker, & G. Pearce (Eds.), Ifs, volume 15 of The University of Western Ontario series in philosophy of science (pp. 87–104). Dordrecht: D. Reidel.

  • Stalnaker, R. (1981b). A theory of conditionals. In W. L. Harper, R. Stalnaker, & G. Pearce (Eds.), Ifs, volume 15 of The University of Western Ontario series in philosophy of science (pp. 41–55). Dordrecht: D. Reidel.

  • Wasserman, R. (2006). The future similarity objection revisited. Synthese, 150(1), 57–67.

    Article  Google Scholar 

  • Woodward, J. (2003). Making things happen. Oxford: Oxford University Press.

    Google Scholar 

  • Woodward, J., & Hitchcock, C. (2003). Explanatory generalizations, part I: A counterfactual account. Noûs, 37(1), 1–24.

    Article  Google Scholar 

Download references

Acknowledgments

I am thankful for the helpful questions and comments of audience members at the 2011 Carolina Metaphysics Workshop, the 2010 Formal Epistemology Workshop, and various workshops and departmental seminars in Australia. Special thanks are due to Delia Graff Fara and Peter Menzies for extensive commentary and discussion.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rachael Briggs.

Additional information

Earlier versions of this paper were presented at the 2011 Carolina Metaphysics Workshop, the 2010 Formal Epistemology Workshop, and various workshops and departmental seminars around Australia.

Appendix

Appendix

1.1 Disjunctive Normal Form

Every Boolean sentence is exactly equivalent to a sentence in disjunctive normal form; i.e., a disjunction of conjunctions of atomic sentences. This disjunctive normal form can be derived in two steps.

The first step is to generate a sentence whose subsentences are all positive—i.e., all atomic sentences, negated atomic sentences, conjunctions, or disjunctions. Given a sentence ϕ n with non-positive subsentences, one can generate a sentence ϕ n+1 by the following procedure. First, find ϕ n ’s longest non-positive subsentence ψ. (If two or more of these subsentences are tied for longest, let ψ be leftmost.)

  • If ψ is of the form \(\neg \neg \theta\), then ϕ n+1 is the sentence that results from replacing ψ with θ in ϕ n . (By \((\neg)\), this preserves exact equivalence.)

  • If ψ is of the form \(\theta_1 \supset \theta_2\), then ϕ n+1 is the sentence that results from replacing ψ with \(\neg \theta_1 \vee \theta_2\) in ϕ n . (By \((\supset^{+})\), this preserves exact equivalence.)

  • If ψ is of the form \(\neg(\theta_1 \supset \theta_2)\), then ϕ n+1 is the sentence that results from replacing ψ with \( \theta_1 \wedge \neg \theta_2\) in ϕ n . (By \((\supset^{-})\), this preserves exact equivalence.)

  • If ψ is of the form \(\neg(\theta_1 \wedge \theta_2)\), then ϕ n+1 is the sentence that results from replacing ψ with \(\neg \theta_1 \vee \neg \theta_2\) in ϕ n . (By (∧), this preserves exact equivalence.)

  • If ψ is of the form \(\neg(\theta_1 \vee \theta_2)\), then ϕ n+1 is the sentence that results from replacing ψ with \(\neg \theta_1 \wedge \neg \theta_2\) in ϕ n . (By (∨), this preserves exact equivalence.)

Since this procedure replaces each non-positive subsentence of ϕ with a positive sentence, and generates either no new non-positive sentences, or one positive sentence which is shorter than the one replaced, it is guaranteed to terminate in a sentence with only positive subsentences. Since each step in the procedure preserves exact equivalence, the positive sentence derived at the end is exactly equivalent to the original sentence.

The next step is to “push the conjunctions inward”. Begin with a sentence ϕ0, all of whose subsentences are positive. Given a sentence ϕ n , all of whose subsentences are positive, but not necessarily in disjunctive normal form, one can generate a sentence ϕ n+1 by the following procedure. First, find ϕ n ’s longest subsentence ψ that is a conjunction of disjunctions \((\theta^{1}_{1} \vee \ldots \vee \theta^{1}_{f(1)})\vee \ldots \vee (\theta^{n}_{1} \vee \ldots \vee \theta^{n}_{f(n)}). \) Consider all the conjunctions of θ i j s that have at least one conjunct of the form θ k g(k) for each \(k \in \{1\ldots n\}\) (where g(k) is an arbitrary number between 1 and f(k) inclusive). Let ψ′ be the disjunction of all these conjunctions. Let ϕ n+1 be the sentence that results from replacing ψ with the disjunction of all sentences of the form ψ′ in ϕ n . By (∧+) and (∨+), replacing ϕ n with ϕ n+1 preserves exact equivalence.

Each step of the second procedure eliminates the longest disjunction inside the scope of a conjunction, and without introducing any new disjunction inside the scope of a conjunction. Each step keeps the sentence positive. After a finite number of steps, the length of the longest disjunction inside the scope of a conjunction must equal 1, at which point ϕ = ϕ n+1. At this stage, the sentence will be entirely positive, and have no disjunctions inside the scope of conjunctions: it will be in disjunctive normal form.

1.2 Soundness

The validity of (R1) and (CM1) is ensured by the definitions of the Boolean connectives.

(R2)’s valdity is ensured by the selection semantics for \(\,\square{\kern-4.1pt}\rightarrow\,: \) Suppose \((\psi_1 \wedge \ldots \wedge \psi_n) \supset \theta\) is true in every model, and \(v_M((\phi \,\square{\kern-4.1pt}\rightarrow\, \psi_1) \wedge \ldots \wedge (\phi \,\square{\kern-4.1pt}\rightarrow\, \psi_n)) = 1\). It remains to show that \(v_M(\phi \,\square{\kern-4.1pt}\rightarrow\, \theta) = 1\). The semantic rule for ∧ ensures that for each of \(\psi_i \in \{\psi_1 \ldots \psi_n\}, v_M (\psi_i) = 1\). By the semantic rule for \(\,\square{\kern-4.1pt}\rightarrow\,\), for every \(\psi_i \in \{\psi_1 \ldots \psi_n\}\) and \(M' \in f(\phi, M), v_{M'}(\psi_i) = 1\). By hypothesis, \(v_{M'}((\psi_1 \wedge \ldots \wedge \psi_n) \supset \theta) = 1\) for every \(M' \in f(\phi, M)\). By tautological reasoning \({v_{M}^{\prime}}(\theta)\) v M ′(θ) = 1 for every \(M' \in f(\phi, M)\), and so \(v_M(\phi \,\square{\kern-4.1pt}\rightarrow\, \theta) = 1\).

The validity of (CM2) and (CM3) is ensured by the behaviour of variables. Every variable takes on some value or other (hence (CM2)), and no variable takes on two values at once (hence (CM3)).

(CM4) is valid because every sentence of the form A is made true by a single state, and each state corresponds to exactly one intervention. So f(AM) contains exactly one model M′. For every sentence ϕ, either \({v_{M}^{\prime}}(\theta)\) = 1, in which case \(v_M({\bf A} \,\square{\kern-4.1pt}\rightarrow\, \phi) = 1\), or \({v_{M}^{\prime}}(\theta)\) (ϕ) = 0, in which case \(v_M({\bf A} \,\square{\kern-4.1pt}\rightarrow\, \neg \phi) = 1\). In either case \(v_M(({\bf A} \,\square{\kern-4.1pt}\rightarrow\, \phi) \vee ({\bf A} \,\square{\kern-4.1pt}\rightarrow\, \neg \phi)) = 1\).

(CM5) is valid because every sentence of the form A is made true by some state or other, and each state corresponds to an intervention. Therefore, there is at least one submodel in f(AM). Since \(\bot\) is true in no model, \(\bot\) is not true in every submodel in f(AM). Hence, \(v_M(\neg({\bf A}\,\square{\kern-4.1pt}\rightarrow\, \bot)) =1\).

(CM6) is valid because the order of interventions on a causal model does not affect the end result of those interventions, provided one doesn’t intervene twice on the same variable. To generate M A V = v from M A , one intervenes on M A to set V = v. Suppose v M_A (V = v) = 0. Then by the semantic rules for material conditional and counterfactual, \(v_M({\bf A} \,\square{\kern-4.1pt}\rightarrow\, (V = v \supset B)) = 1. \) By tautological reasoning \(v_M((({\bf A} \wedge V = v) \,\square{\kern-4.1pt}\rightarrow\, B) \supset ({\bf A} \,\square{\kern-4.1pt}\rightarrow\, (V = v \supset B))) = 1. \) On the other hand, suppose that in M A V = v. Intervening to change V’s structural equation will not affect the values of V’s nondescendants, for their values do not depend on V’s value. Nor (by hypothesis) will it affect V’s value. Nor will it affect the values of V’s descendants, since in a recursive model, their values are determined by the value of V and V’s nondescendants. Ergo, v M_A (V′ = v′) = v M_A V = v(V′ = v′) for all \(V' \in \mathcal{V}. \) Hence, where B is any Boolean combination of atomic sentences, v M_A (B) = v M_A V = v(B), and so once again, \(v_M((({\bf A} \wedge V = v) \,\square{\kern-4.1pt}\rightarrow\, B) \supset ({\bf A} \,\square{\kern-4.1pt}\rightarrow\, (V = v \supset B))) = 1. \)

(CM7) is valid because any intervention that makes A true also generates a model in which A is true. This holds where A is an atomic sentence—intervening to set V = v generates a model in which V = v—or a negated atomic sentence—intervening to set V = v′ generates a model in which V ≠ v, so long as v′ ≠ v. The conditions on truthmaking ensure that if it holds for ϕ and ψ, it also holds for \(\phi \vee \psi, \neg( \phi \vee \psi), \phi \wedge \psi, \neg ( \phi \wedge \psi), \phi \supset \psi\) and \(\neg(\phi \supset \psi). \)

(CM8) is valid for the class of recursive models. To show this, we must appeal to a notion of counterfactual dependence among variables. Say that Vcounterfactually depends on V iff for some variables \(V_1\ldots V_m\), all distinct from V and V′, some values \(v_1\ldots v_m\) of \(V_1 \ldots V_m\), some values v and v * of V, and some value v′ of V′, the intervention setting \(V_1= v_1, \ldots V_m = v_m\), and V = v generates a submodel in which V′ = v′, while the intervention setting \(V_1= v_1, \ldots V_{m} = v_{m}\), and V = v * generates a submodel in which V′ ≠ v′.

We can show that V′ stands in the ancestral of counterfactual dependence to V iff V is an ancestor of V′. First, the left-to-right direction: If V′ counterfactually depends on V, then the structural equations ensure that V′’s value depends non-trivially on V’s value. Hence, V is an ancestor of V′. Since the relation of being an ancestor is transitive, it follows that if V′ stands in the ancestral of counterfactual dependence to V, then V is an ancestor of V′. Next, the right-to-left direction: if V 1 is an ancestor of V n , there is a sequence of variables \(V_1, \ldots V_n\) such that V 1 is the parent of V 2, and \(\ldots V_{n-1}\) is the parent of V n . Since each variable counterfactually depends on each of its parents, it follows that V′ stands in the ancestral of counterfactual dependence to V.

Furthermore, it follows from the definition of the selection function and the semantic rule for the counterfactual that V′ counterfactually depends on V iff there is a sentence of the form d(VV′) such that v M (d(V, V′)) = 1. As a corollary, if there exist sentences \(d_1(V_1, V_2)\ldots d_{n-2}(V_{n-2}, V_{n-1})\) such that \(v_M( d_1(V_1, V_2) \wedge \ldots \wedge d_{n-1}(V_{n-1}, V_{n})) =1\), then V n stands in the ancestral of counterfactual dependence to V 1, and so V 1 is an ancestor of V n .

In order to show the validity of (CM8), we can assume that \(v_M( d_1(V_1, V_2) \wedge \ldots \wedge d_{n-1}(V_{n-1}, V_{n})) = 1\), and show that there is no sentence of the form d n (V n , V 1) such that v M (d n (V n , V 1)) = 1. By our supposition, in MV n stands in the ancestral of the counterfactual dependence relation to V 1. Therefore, V 1 is an ancestor of V n . On the assumption that M is recursive, V n-1 is not an ancestor or V 1, and so cannot counterfactually depend on V n-1. Therefore, no sentence of the form d n (V n-1, V 1) can be true in M; QED.

(CM9) is sound according to the recursive conditions for truthmaking. \(v_M((A \vee B) \,\square{\kern-4.1pt}\rightarrow\, \phi) = 1\) iff v M ′(ϕ) = 1 for every submodel M′ generated by an intervention on M corresponding to a state that makes AB true. But according to condition \((\vee^{+}), s\, \vDash\, A \vee B\) iff \(s\, \vDash\, A\) or \(s\, \vDash\, B\) or \(s = t \sqcup r\) where \(t \,\vDash\, A\) and \(r\, \vDash\, B\). In other words, rephrasing the third disjunct in terms of condition \((\wedge^{+}), s \,\vDash\, A \vee B\) iff \(s \,\vDash\, A\) or \(s\, \vDash\, B\) or \(s \vdash A \wedge B\). So \(v_M((A \vee B) \,\square{\kern-4.1pt}\rightarrow\, \phi) = 1\) iff v M ′(ϕ) = 1 for every submodel M′ generated by an intervention on M corresponding to a state that makes A true, and v M ′(ϕ) = 1 for every submodel M′ generated by an intervention on M corresponding to a state that makes B true, and v M ′(ϕ) = 1 for every submodel M′ generated by an intervention on M corresponding to a state that makes AB true. In other words, \(v_M((A \vee B) \,\square{\kern-4.1pt}\rightarrow\, \phi) = 1\) iff \(v_M(A \,\square{\kern-4.1pt}\rightarrow\, \phi) = 1\), and \(v_M(B \,\square{\kern-4.1pt}\rightarrow\, \phi) = 1\), and \(v_M((A \wedge B) \,\square{\kern-4.1pt}\rightarrow\, phi) = 1\). The semantic rules for Boolean connectives ensure that \(v_M(((A \vee B) \,\square{\kern-4.1pt}\rightarrow\, \phi) \equiv ((A \,\square{\kern-4.1pt}\rightarrow\, \phi) \wedge (B \,\square{\kern-4.1pt}\rightarrow\, \phi) \wedge ((A \wedge B) \,\square{\kern-4.1pt}\rightarrow\, \phi))) = 1\).

(CM10) is valid because any exact equivalent A′ of A is made true by the same states as A; hence f(AM) = f(A′, M).

(CM11) is valid because where \(\mathcal{V}_{\bf A}\) is the set of variables mentioned in \({\bf A}, \mathcal{V}_{\bf B}\) is the set of variables mentioned in B and B′, and \(\mathcal{V}_{\bf C}\) is the set of variables mentioned in C, the procedure of performing the interventions sanctioned by each of AB, and C has the same results as the procedure by performing the interventions sanctioned by A and B′, then performing the interventions sanctioned by B (thereby reversing the interventions sanctioned by B′) and performing the interventions sanctioned by C.

1.3 Completeness

A proof system is complete over a class of models iff for every consistent set of sentences \(\Upgamma\), there is a model in which every sentence in \(\Upgamma\) is true. Suppose, then, that we have a consistent set of sentences \(\Upgamma_0\). If we can specify a procedure for constructing a model in which every member of \(\Upgamma_0\) is true, then we will have proved completeness. We begin by using Lindenbaum’s Lemma to construct a maximal consistent set \(\Upgamma\) such that \(\Upgamma_0 \subseteq \Upgamma\). We can use \(\Upgamma\) to build a causal model \(M = \langle\mathcal{V},\mathcal{S}, \mathcal{A}\rangle\), by the following procedure.

Let \(\mathcal{V}\) contain one variable V i for each variable V i for which the language contains a name.

To define \(\mathcal{S}\), we must show that for every \(V_i \in \mathcal{V}\) and partial assignment function \(\mathcal{A}^{-}\), where \(\mathcal{A}^{-}(V_j) \in \mathcal{R}(V_j)\) for the \(V_j \in \mathcal{V} - V_i, \Upgamma\) contains exactly one sentence of the form \(\wedge_{V_j \in \mathcal{V} -V_{i}} (V_{j} = \mathcal{A}^{-}(V_{j})) \,\square{\kern-4.1pt}\rightarrow\, V_i = v_i\). To show this, we must prove both an existence claim and a uniqueness claim.

The existence claim: for every \(V_i \in \mathcal{V}\) and partial assignment function \(\mathcal{A}^{-}, \Upgamma\) contains at least one sentence of the form \(\wedge_{j \in \mathcal{V} -V_i} (V_j = \mathcal{A}^{-}(V_{j})) \,\square{\kern-4.1pt}\rightarrow\, V_i = v_i\). This can be shown by assuming otherwise, and proving a contradiction. Since \(\Upgamma\) is maximal, if the existence claim is false, then for every \(v_i \in \mathcal{R}(V_i)\),

$$ \neg (\wedge_{V_j \in {\mathcal{V}} -V_i} (V_j = {\mathcal{A}}^{-}(V_j)) \,\square{\kern-4.1pt}\rightarrow\, V_i = v_i \in \Upgamma) $$

Since \(\Upgamma\) contains every instance of every axiom schema, and (CM4) is an axiom, for each \(v_i \in \mathcal{R}(V_i)\),

$$ (\wedge_{V_j \in {\mathcal{V}} -V_i} (V_j = {\mathcal{A}}^{-}(V_j)) \,\square{\kern-4.1pt}\rightarrow\, V_i = v_i) \vee (\wedge_{V_j \in {\mathcal{V}} -V_i} (V_j = {\mathcal{A}}^{-}(V_j)) \,\square{\kern-4.1pt}\rightarrow\, V_i \neq v_i) \in \Upgamma $$

Since \(\Upgamma\) is closed under tautological consequence, for each \(v_i \in \mathcal{R}(V_i)\),

$$ \wedge_{V_j \in {\mathcal{V}} -V_i} (V_j = {\mathcal{A}}^{-}(V_j)) \,\square{\kern-4.1pt}\rightarrow\, V_i \neq v_i \in \Upgamma $$

By (R2) and the closure of \(\Upgamma\) under logical consequence,

$$ \wedge_{V_j \in {\mathcal{V}} -V_i} (V_j = {\mathcal{A}}^{-}(V_j)) \,\square{\kern-4.1pt}\rightarrow\, \wedge_{v \in {\mathcal{R}}(V_i)} V_i \neq v\in \Upgamma $$

By (CM1), (CM2), and (R1),

$$ \wedge_{v \in {\mathcal{R}}(V_i)} V_i \neq v \vdash \bot $$

and so by (R2), and the closure of \(\Upgamma\) under logical consequence,

$$ \wedge_{V_j \in {\mathcal{V}} -V_i} (V_j = {\mathcal{A}}^{-}(V_j)) \,\square{\kern-4.1pt}\rightarrow\, \bot \in \Upgamma $$

But since every instance of (CM5) is an axiom,

$$ \neg (\wedge_{V_j \in {\mathcal{V}} -V_i} (V_j = {\mathcal{A}}^{-}(V_j)) \,\square{\kern-4.1pt}\rightarrow\, \bot) \in \Upgamma $$

We have derived a contradiction, thus completing the existence proof.

Now, the uniqueness claim: for every \(V_i \in \mathcal{V}\) and partial assignment function \(\mathcal{A}^{-}, \Upgamma\) contains at most one sentence of the form \(\wedge_{j \in \mathcal{V} -V_i} (V_j = \mathcal{A}^{-}(V_j)) \,\square{\kern-4.1pt}\rightarrow\, V_i = v_i. \) Again, this can be shown by assuming otherwise and proving a contradiction. We will suppose the uniqueness claim is false, i.e., that \(\Upgamma\) contains sentences of the form

$$ \wedge_{j \in {\mathcal{V}} -V_i} (V_j = {\mathcal{A}}^{-}(V_j)) \,\square{\kern-4.1pt}\rightarrow\, V_i = v_i $$

and

$$ \wedge_{j \in {\mathcal{V}} -V_i} (V_j = {\mathcal{A}}^{-}(V_j)) \,\square{\kern-4.1pt}\rightarrow\, V_i = v'_i $$

where v i  ≠ v i . And we will prove a contradition.

By (R2) and the closure of \(\Upgamma\) under entailment, we can infer that

$$ \wedge_{V_j \in {\mathcal{V}} -V_i} (V_j = {\mathcal{A}}^{-}(V_j)) \,\square{\kern-4.1pt}\rightarrow\, (V_i = v_i \wedge V_i = v'_i) \in \Upgamma $$

But by (CM3)

$$ V_i = v_i \wedge V_i = v'_i \vdash \bot $$

and so by (R2),

$$ \wedge_{V_j \in {\mathcal{V}} -V_i} (V_j = {\mathcal{A}}^{-}(V_j)) \,\square{\kern-4.1pt}\rightarrow\, \bot \in \Upgamma $$

But since every instance of (CM5) is an axiom,

$$ \neg (\wedge_{V_j \in {\mathcal{V}} -V_i} (V_j = {\mathcal{A}}^{-}(V_j)) \,\square{\kern-4.1pt}\rightarrow\, \bot) \in \Upgamma $$

We have derived a contradiction, thus completing the uniqueness proof.

Now we can define \(\mathcal{S}\). Since there are many possible partial assignment functions for each set of variables, let us number each possible \(\mathcal{A}^{-}\) with a subscript. First, let us write, for each V i , an equation of the form

$$ f_{i} = v_{k} \begin{array}{l} \hbox{if} (V_{j} = {\mathcal{A}}^-_k(j)) \hbox{for each} V_{j} \in {\mathcal{V}} - \{V_{i}\}\\ \hbox{where} \wedge_{j \in {\mathcal{V}} -V_{i}} (V_{j} = {\mathcal{A}}^{-}_k(j)) \,\square{\kern-4.1pt}\rightarrow\, V_{i} = v_{k} \in \Upgamma \\ \end{array} $$

By our existence and uniqueness claim, there is exactly one such v k for every i and k. Now, let us replace each of these equations with an equation that has the same solutions in all cases, but does not mention any redundant variables. (If all variables are redundant, delete the equation altogether.) The set of equations generated this way is \(\mathcal{S}.\,\mathcal{S}\) will contain at most one structural equation for each variable.

The last step in defining M is to define \(\mathcal{A}\). (CM2) and (CM3) ensure that for each \(V \in \mathcal{V}, \Upgamma\) contains exactly one sentence of the form V = v. Let \(\mathcal{A}(V_i)\) be the v i such that \(V_i = v_i \in \Upgamma\).

Having defined M, we must show that it is recursive. (CM8) ensures that for any two variables V and V′, if V′ stands in the ancestral of counterfactual dependence to V, then V does not counterfactually depend on V′. I have shown in the soundness portion of the appendix that the ancestral of counterfactual dependence is the ancestor relation. It follows that for any two variables V and V′, if V′ is an ancestor of V, then V is not counterfactually dependent on V′. Furthermore, since all variables counterfactually depend on their parents, we can infer that for any two variables V and V′, if V′ is an ancestor of V, then V is not parent of V′. This entails that no variable is its own descendant. For if a variable V were its own descendant, it would have a parent V′ that was also V’s descendant. Thus, no variable in M can be its own descendant, and so M is recursive.

The last step of the completeness proof is to show that if \(\phi \in \Upgamma\), then v M (ϕ) = 1. This can be proved in two steps. First, we prove that every sentence is equivalent to a Boolean compound of atomic sentences and simple conditionals of the form \({\bf A} \,\square{\kern-4.1pt}\rightarrow\, B\). Next, we show that for any such Boolean compound, \(\phi \in \Upgamma \,\, \hbox{iff} \,\, v_{M}(\phi) = 1\).

I will prove that every sentence is equivalent to a Boolean compound of atomic sentences and simple conditionals of the form \({\bf A} \,\square{\kern-4.1pt}\rightarrow\, B\) by providing a translation procedure.

The first part of the translation procedure eliminates all complex antecedents. Begin with a sentence ϕ0. Let ψ n be ϕ n ’s shortest subsentence of the form \(A \,\square{\kern-4.1pt}\rightarrow\, \theta\). By (CM10) and the disjunctive normal form theorem, θ is equivalent to a sentence of the form \({\bf A}_1 \vee \ldots \vee {\bf A}_n \,\square{\kern-4.1pt}\rightarrow\, \psi\). By (CM9), ψ n is equivalent to a conjunction of the form \(({\bf A}_1 \,\square{\kern-4.1pt}\rightarrow\, \psi) \wedge \ldots \wedge ({\bf A}_n \,\square{\kern-4.1pt}\rightarrow\, \psi)\). Let ϕ n+1 be the result of replacing Replace ψ with this equivalent sentence. Repeated applications of (R2) ensure ϕ n+1 is equivalent to ϕ n . Each step eliminates a counterfactual with a complex antecedent without setting up a new one in its place, and so the procedure will terminate when there are no remaining subsentences with complex antecedents.

The second part of the translation procedure is to eliminate counterfactual consequents. Begin with a sentence ϕ0, all of whose conditional subsentences have antecedents of the form A. Let ψ n be ϕ n ’s shortest subsentence of the form \({\bf A} \,\square{\kern-4.1pt}\rightarrow\, \theta\) where θ is not of the form B. Let \((\theta_1^1 \wedge \ldots \wedge \theta_1^{f(1)}) \vee \ldots \vee (\theta_n^1 \wedge \ldots \wedge \theta_n^{f(n)})\) be the (classical) disjunctive normal form of θ. By (R2), ψ′ is equivalent to \({\bf A} \,\square{\kern-4.1pt}\rightarrow\, ((\theta_1^1 \wedge \ldots \wedge \theta_1^{f(1)}) \vee \ldots \vee (\theta_n^1 \wedge \ldots \wedge \theta_n^{f(n)})), \) and by (CM4) and (R2) together, ϕ′ is equivalent to \((({\bf A} \,\square{\kern-4.1pt}\rightarrow\, \theta_1^1) \wedge \ldots \wedge ({\bf A} \,\square{\kern-4.1pt}\rightarrow\, \theta_1^{f(1)})) \vee \ldots \vee (({\bf A} \,\square{\kern-4.1pt}\rightarrow\, \theta_n^1) \wedge \ldots \wedge ({\bf A} \,\square{\kern-4.1pt}\rightarrow\, \wedge \theta_n^{f(n)})). \) Each θ i j is either atomic or of the form \({\bf B} \,\square{\kern-4.1pt}\rightarrow\, C. \) If θ j i is of the form \({\bf B}^j_i \,\square{\kern-4.1pt}\rightarrow\, C^j_i, \) then (CM11) ensures that \({\bf A} \,\square{\kern-4.1pt}\rightarrow\, \theta^j_i\) is equivalent to \({\bf A}^j_i \wedge {\bf B}^j_i \,\square{\kern-4.1pt}\rightarrow\, C^j_i, \) where A j i the conjunction that results from removing any conjuncts from A that share variables with B j i . Finally, let A j i  = A and C j i  = θ j i if θ j i is atomic. Thus, ψ n is equivalent to \((({\bf A}_1^1 \,\square{\kern-4.1pt}\rightarrow\, C_{1}^{1}) \wedge \ldots \wedge ({\bf A}^{f(1)}_1 \,\square{\kern-4.1pt}\rightarrow\, C_{1}^{f(1)})) \vee \ldots \vee (({\bf A}^{1}_{n} \,\square{\kern-4.1pt}\rightarrow\, C_n^1) \wedge \ldots \wedge ({\bf A}^{f(n)}_{n} \,\square{\kern-4.1pt}\rightarrow\, \wedge C_n^{f(n)}))\) (call this sentence ψ′ n . Let ϕ n+1 be the sentence that results from replacing ψ n with ψ′ n in ϕ n . Repeated applications of (R2) ensure that ϕ n+1 is equivalent to ϕ n . Each step eliminates a counterfactual consequent without setting up a new one in its place, and so the procedure will terminate when there are no remaining subsentences with counterfactual consequents.

All that remains now is to show that if ϕ is a Boolean compound of atomic sentences and sentences of the form \({\bf A} \,\square{\kern-4.1pt}\rightarrow\, B\), then v M (ϕ) = 1. In the base case, where ϕ is an atomic sentence, this follows straightforwardly from the definition of \(\mathcal{A}\). Suppose it holds for all sentences shorter than ϕ. Then we can show that it holds for ϕ as well.

  • Suppose ϕ is of the form V ≠ v and \(\phi \in \Upgamma. \) We have already shown that if \(\phi \in \Upgamma, \) there is some v′ ≠ v such that \(V = v' \in \Upgamma\). By the definition of \(\mathcal{A}, \mathcal{A}(V) = v'\). By the semantic rule for atomic sentences v M (V = v) = 0, and so by the semantic rule for negation, v M (ϕ) = 1.

  • Where ϕ is of the form \(\psi_1 \vee \psi_2, \neg (\psi_1 \vee \psi_2), \psi_1 \wedge \psi_2, \neg(\psi_1 \wedge \psi_2), \psi_1 \supset \psi_2, \) or \(\neg( \psi_1 \supset \psi_2), \) we can use the usual definitions of truth functional connectives to prove on the basis of the inductive hypothesis that \(\phi \in \Upgamma\).

  • Suppose ϕ is of the form \({\bf A} \,\square{\kern-4.1pt}\rightarrow\, V =v\) and \(\phi \in \Upgamma. \) We can prove by induction on the number of variables in A (written ‘|A|’) that v M (ϕ) = 1. (CM7) and (CM5) ensure that this hypothesis holds for the case of \(|{\bf A}| = |\mathcal{V}|; \) the definition of the selection function ensures that it holds where \(|{\bf A}| = |\mathcal{V}| -1. \) Suppose it holds for \(|{\bf A}| > |\mathcal{V}| - n. \) We can show that it holds for \(|{\bf A}| = |\mathcal{V}| - n. \) For any two distinct variables V and V′ not mentioned in A, there will be a v and a v′ such that \(\Upgamma\) contains sentences of the form \(({\bf A} \wedge V' = v') \,\square{\kern-4.1pt}\rightarrow\, V = v\) and \(({\bf A} \wedge V = v) \,\square{\kern-4.1pt}\rightarrow\, V' = v'\).

  • (We know there must be such sentences by (CM4), (CM2), (R2), and (CM8). (CM4), (CM2), and (R2) ensure that \(\Upgamma\) contains sentences of the form \(({\bf A} \wedge V' = v') \,\square{\kern-4.1pt}\rightarrow\, V = v_1\) and \(({\bf A} \wedge V = v) \,\square{\kern-4.1pt}\rightarrow\, V' = v'_1\) for each \(v \in \mathcal{R}(V)\) and \(v' \in \mathcal{R}(V'). \) Since M is recursive, one of the variables is a nondescendant of the other. Let us assume without loss of generality that V is a nondescendant of V′. There must be a v such that \(({\bf A} \wedge V' =v'_i) \,\square{\kern-4.1pt}\rightarrow\, V = v \in \Upgamma\) for every \(v'_i \in\mathcal{R}(V'). \) We can simply choose for our v′ the \(v \in \mathcal{R}(V)\) such that \(({\bf A} \wedge V= v) \,\square{\kern-4.1pt}\rightarrow\, V' = v'. \))

  • By the inductive hypothesis, \(v_M(({\bf A} \wedge V' = v') \,\square{\kern-4.1pt}\rightarrow\, V = v) = v_M(({\bf A} \wedge V = v) \,\square{\kern-4.1pt}\rightarrow\, V' = v') = 1. \) Since (we’ve assumed without loss of generality) V is a nondescendant of \(V', ({\bf A} \wedge V' = v'_i) \,\square{\kern-4.1pt}\rightarrow\, V = v \in \Upgamma\) for every \(v'_i \in \mathcal{R}(V')\). By (CM6), \({\bf A} \,\square{\kern-4.1pt}\rightarrow\, (V' = v'_i \supset V = v) \in \Upgamma\) for every \(v'_i \in \mathcal{R}(V')\). By (R2) and (CM2), \({\bf A} \,\square{\kern-4.1pt}\rightarrow\, V = v \in \Upgamma. \)

  • We know that \(v_M({\bf A} \,\square{\kern-4.1pt}\rightarrow\, V = v) = 1, \) since \({\bf A} \,\square{\kern-4.1pt}\rightarrow\, V =v\) follows validly from other sentences that are true in M, and all our axioms and rules are sound. (R2), (CM2), and (CM5) ensure that \(\Upgamma\) contains no sentence of the form \({\bf A} \,\square{\kern-4.1pt}\rightarrow\, V =v^*\) for v * ≠ v.

  • Thus, for arbitrary V and \(v \in \mathcal{R}(V), \) if \(|{\bf A}| \leq |\mathcal{V} -(n+1), \) if \({\bf A} \,\square{\kern-4.1pt}\rightarrow\, V =v \in \Upgamma, \) then \(v_M(A \,\square{\kern-4.1pt}\rightarrow\, V =v) = 1. \)

  • Suppose ϕ is of the form \({\bf A} \,\square{\kern-4.1pt}\rightarrow\, B. \) (CM4) ensures that for every atomic subsentence V i  = v i of B, either \({\bf A} \,\square{\kern-4.1pt}\rightarrow\, V_i = v_i \in \Upgamma\) or \({\bf A} \,\square{\kern-4.1pt}\rightarrow\, V_i \neq v_i \in \Upgamma. \) (R2) and (CM5) entail that the conjunction of the \(V_i = v_i \in \Upgamma\) and the \(V_i \neq v_i \in \Upgamma\) is compatible with B; therefore this conjunction entails B. By our inductive hypothesis, \(v_M({\bf A} \,\square{\kern-4.1pt}\rightarrow\, V_i = v_i) = 1\) for all V i such that \({\bf A} \,\square{\kern-4.1pt}\rightarrow\, V_i = v_i \in \Upgamma, \) and \(v_M({\bf A} \,\square{\kern-4.1pt}\rightarrow\, V_i \neq v_i) = 1\) for all V i such that \({\bf A} \,\square{\kern-4.1pt}\rightarrow\, V_i \neq v_i \in \Upgamma. \) By the selection semantics for the counterfactual, \(v_M({\bf A} \,\square{\kern-4.1pt}\rightarrow\, B) = 1\).

Rights and permissions

Reprints and permissions

About this article

Cite this article

Briggs, R. Interventionist counterfactuals. Philos Stud 160, 139–166 (2012). https://doi.org/10.1007/s11098-012-9908-5

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11098-012-9908-5

Keywords

Navigation