1 Introduction

The notion of proportionality has attracted the interest of metaphysicians working on theories of causation in general, and the problem of mental causation in particular. However, a technical use of the term is fairly novel in philosophy. Wide-spread mention of the word has been present in the philosophical literature only since Yablo published his seminal paper on mental causation in (1992). Analyzing the relationship of supervenient and subvenient causes, he says “(...) it seems clear that faced with a choice between two candidate causes, normally the more proportional candidate is to be preferred.” (277) Causes are “proportional” if they “(...) incorporate a good deal of causally important material but not too much that is causally unimportant.” (274)

Questions about the possibility of level-bound or macro-causation had been around for some time before Yablo’s work (cf. Schumacker & Lomax, 2004). However, the idea that proportionality is a constraint on causation seems to have been Yablo’s discovery—or perhaps “invention”. Yablo’s paper has since had a large impact in the philosophy of mind (according to GoogleScholar, 981 citations on 22nd August 2021), and the proportionality constraint has made its career in metaphysics and the philosophy of mind accordingly (cf. among others McGrath, 1998; Shoemaker, 2003b; McLaughlin, 2007; Schröder, 2007; Crane, 2008; Woodward, 2008b; List & Menzies, 2009; Weslake, 2013; Harbecke, 2008, 2013, 2014a; Harbecke & Atmanspacher, 2012; Bernstein, 2014).

In this paper, I am concerned with the content, status, and application of the proportionality constraint in the context of some of the most prominent theories of causation. In particular, I investigate two general versions of proportionality, which I call “horizontal” and “vertical” proportionality, and the constraint’s role in three popular classes of theories of causation.

As a very first hint, two events can be described as “vertically proportional” if they display the same degree in “graininess” and “detail”. An example might be Biden’s winning of the election in November 2020 and Biden’s actually becoming president of the United States on 20th January 2021, both of which are macroscopic social events and comparable in their graininess and detail. This is in contrast to Biden’s winning of the election in November 2020 and the vastly complex molecular event underlying Biden’s actually becoming president of the United States on 20th January, which are very different in their degree of complexity and their amount of detail.

Two events can be described as “horizontally proportional” if one is “sufficient, but not redundant” with respect to the other. An example might be Suzie’s throwing of the stone and the shattering of the bottle—in contrast to the state of the entire universe at the moment when Suzie throws the stone which is not horizontally proportional to the shattering of the bottle (the former contains much more than was necessary for the shattering to occur and was therefore redundant with respect to the latter).

The selected set of prominent theories of causation which I discuss in order to determine their relationships to the different versions of proportionality are interventionist theories, Humean theories, and power theories of causation. I have chosen this particular set of theories mainly because they are either among the currently most popular and widely discussed theories themselves, are or they closely related to further popular theories such as structural equation approaches or probabilistic causation.Footnote 1

I also shed some light on the question whether proportionality is considered as an ontic or as an epistemic, i.e. explanatory, constraint on causation in the context of these theories. Both positions can be found in the mental causation literature, and their defense has been a matter of controversy.

As an answer to these questions I will defend the following main claimsFootnote 2

  1. 1.

    The horizontal (HP) and the vertical version (VP) of the proportionality constraint are logically independent; they do not entail each other or the other’s negation.

  2. 2.

    HP is implied by some of the currently most popular theories of causation.

  3. 3.

    None of the currently most popular theories of causation contradicts either HP or VP. They could embrace HP and VP as further constraints, without rendering the underlying theory of causation incoherent.

  4. 4.

    HP and VP are not ontic or epistemic principles as such; rather, whether they are ontic or epistemic depends on the theories chosen plus background assumptions about the existence of higher-level causes and their non-identity to lower-level ones.

The paper is structured as follows. In a first step (Sect. 2), I review some contributions in the philosophical literature that explicitly discuss, and in some cases adopt, Yablo’s proportionality constraint. Some of these will be characterized as interpreting the constraint as an ontic, some as an epistemic concept. Afterwards (Sect. 3), I offer a preliminary definition of the notion and I distinguish HP and VP more specifically. In a next step, I clarify the status of the proportionality constraint from the perspective of three popular but fundamentally different theories of causation, namely interventionist, Humean, and power theories of causation (Sects. 46). Section 7 summarizes the results and points to some puzzles in the context of the proportionality constraint that will have to be left for future research.

2 Proportionality in the Philosophy of Mind

As mentioned in the previous section, Yablo (1992) believed that from the set \({\mathcal {C}}:=\{C_1, C_2, \ldots, C_n\}\) of all sufficient causes of a given effect E, usually only a subset is, or should be, considered the actual or genuine cause(s), namely the cause(s) that is (are) proportional to E. Sufficient causes are proportional to a given effect if they “(...) incorporate a good deal of causally important material but not too much that is causally unimportant.” (274) Yablo himself explicates this idea on the basis of four conditions specifying a specific kind of counterfactual dependency of an effect E on candidate causes: A cause C is proportional with respect to an effect E, if and only if, E is contingent upon CFootnote 3, C is adequate for E, C is required for E, and C is enough for E. Contingency amounts to a kind of counterfactual dependence of E on C. Adequacy implies a counterfactual sufficiency of C for E. Requiredness demands that any impoverishment in detail of C would have led to the non-occurrence of E. Finally, enoughness declares that the genuine cause C should mark the upper bound of all events that satisfy requiredness with respect to E.

The earliest explicit response to this idea is found in a paper by Green and Vervaeke (1996), who agree with Yablo that “(...) given different effects of the same precipitating event, different aspects of the precipitator will move to the fore as more plausible candidates for the title of ‘cause’.” (205) Critical responses have been elaborated, among others, by McGrath (1998) and Gillett and Rives (2005). Whereas McGrath believes that “(...) the notions of proportionality and of properties as determinables cannot be used, jointly, to solve the problems raised by BELOW [also called the “exclusion problem”] and WITHIN” (170)Footnote 4, Gillett and Rives simply deny that there are such determinable properties that could be subject to proportionality.

A more recent version of the proportionality constraint has been offered by Shoemaker (2003b), who develops a theory of mental causation on the basis of his power theory of properties. About the alleged competition of two properties \(P_1\) and \(P_2\), where \(P_2\) contains a subset of the powers of \(P_1\), he says:

Yablo’s notion of proportionality can be applied here. Where the only causal features of property \(P_1\) that play a role in producing an effect are ones that belong to property \(P_2\), of which \(P_1\) is a determinate or realizer property, there seems a good sense in which considerations of proportionality favor the instantiation of \(P_2\) over the instantiation of \(P_1\) as a cause of the effect. (Shoemaker, 2003b, 436)

Whereas Shoemaker believes that proportionality is an ontic feature of causation since, in his view, properties are objective features of the real world, Thomas Bontly points out that it is mostly a pragmatic phenomenon within our use of causal language. Consequently, he denies that non-reductive physicalism can claim support from the constraint: “(...) proportionality is not in fact a constraint on the causal relation but rather a pragmatic feature of our use of causal language, derived from general principles of language use. If so, it turns out that proportionality has little if anything to do with the nature of causation itself and cannot save nonreductive materialism from the exclusion problem.” (Bontly, 2005, 332)

Brian McLaughlin has similar inclinations as Bontly towards an epistemic, pragmatic, or explanatory interpretation of proportionality. As he says, “[f]or the record, I myself think that rather than a constraint on causation, proportionality is a pragmatic constraint on explanation. Too much causal detail or too little causal detail makes for a poor or misleading explanation in a context.” (McLaughlin, 2007, 15/16)

Since the work of Shoemaker, Bontly, and McLaughlin, reference to proportionality has become frequent in the philosophy of mind—both in an appreciative and a critical manner. Generally appreciative examples are Schröder (2007), Crane (2008), Woodward (2008b), and List and Menzies (2009), Weslake (2013), Harbecke (2013, 2014a), and Harbecke and Atmanspacher (2012). Another critical view has recently been voiced by McDonnell (2017), who argues that the proportionality principle cannot be a strong constraint on causation as it violates much of our practice in identifying causal relations.

In short, over the last years the notion of proportionality has received much attention in metaphysics and the philosophy of mind. At the same time, a disagreement re-occurs about the status of proportionality as an ontic or epistemic concept. This disagreement forms the background for the following sections.

3 Two Kinds of Proportionality

Apart from the ontic/epistemic disagreement mentioned at the end of Sect. 2, the debates on mental causation and the nature of causation have dealt with two very different kinds of causal proportionality, which could be characterized as “vertical” and “horizontal” proportionality.Footnote 5 As a first and non-technical approachFootnote 6, these two notions can be characterized as follows:

  • Vertical Proportionality (VP): An event \(\phi\) is vertically proportional to an event \(\psi\) if, and only if, \(\phi\) and \(\psi\) display the same degree in “graininess” and “detail” within a supervenience hierarchy, i.e. a hierarchy that involves ordered sets of events all of which occur in the same place and time, but where every event is sufficient for all events located at a higher position within the hierarchy.Footnote 7

  • Horizontal Proportionality (HP): An event \(\phi\) is horizontally proportional to an event \(\psi\) if, and only if, \(\phi\) is sufficient, but not redundant with respect to \(\psi\), where “redundancy” compares non-identical events with a similar degree of graininess which may or may not overlap in at least one subevent.

Characterizations VP and HP are intended as merely preliminary definitions. Among other things, they make use of a “similarity in graininess” relation in a supervenience hierarchy without offering a further definition of this notion. Moreover, depending on the theory of causation presupposed, VP and HP will have to receive a different formulation. Nevertheless, the two definitions may help to approach the two kinds of proportionality in a heuristic way.

Some features of VP and HP should be mentioned here already. First, note that neither VP nor HP binds proportionality to a causal relation. An event C might satisfy VP with respect to another event E without being causally connected to that event. For instance, the event “the audience’s applause at the Zurich Opera” may display a similar degree of graininess as the simultaneous event “the audience’s applause at the Berlin opera”, because the two events are both meso-scale entities that involve collections of people. But the applause in Zurich may not be causally connected in any way to the applause in Berlin. In a similar way, an event C can satisfy HP with respect to an event E without being causally related to E. Assume \(E_1\) and \(E_2\) are both effects of a single common cause CC. Assume further that CC, \(E_1\) and \(E_2\) are all part of a deterministic causal system such that whenever \(E_1\) occurs, \(E_2\) occurs (because in this case CC also occurs) and that whenever \(E_1\) does not occur, \(E_2\) does not occur too (because in this case CC does not occur). \(E_1\) is sufficient, but not redundant with respect to \(E_2\) in this scenario, but \(E_1\) is neither a cause nor an effect of \(E_2\).Footnote 8

The way vertical proportionality acts in causal situations is illustrated by Fig. 1. The two layered cylinders represent sequences of events, all of which fully overlap and which are ordered by the relation of supervenience. The arrows represent actual, or proportional, causal relationships. Consider, for instance, the following three events that (presumably) fully overlap and are ordered by supervenience: \(C_1'\) = Otto cycling to the grocery store, \(C_2'\) = Otto cycling to the grocery store fast, \(C_3'\) = Otto cycling to the grocery store fast and furiously (\(C_1'\) supervenes on \(C_2'\) supervenes on \(C_3'\)). For such vertical sequences of events all of which are sufficient (and non-redundant) for a certain effect (perhaps \(E'\) = the truck driver shouting at Otto for his way of driving on the road), VP is meant to attain a match between the specificity of the cause to the effect. For instance, it might turn out that \(C_3'\) = Otto cycling to the grocery store fast and furiously was proportional to the truck driver’s shouting and, hence, the actual cause (Otto’s mere cycling would not have sufficed for the driver to shout at him).

Fig. 1
figure 1

Vertical Proportionality

Or consider the following three events which may satisfy the same conditions of supervenience and overlap despite not being conceptually connected: \(C_1''\) = the activity of a complex neural cell network in my central brain, \(C_2''\) = the activation of my amygdala, \(C_3''\) = my mental state of sadness in response to disturbing news about the Syrian civil war (ex hypothesis: \(C_3''\) supervenes on \(C_2''\) supervenes on \(C_1''\)). And take the effect in question to be \(E''\) = my switching to a website with fun information about sunny holiday destinations. Then \(C_3''\), or possibly \(C_2''\), might turn out to be proportional to \(E''\) under the assumption that the neural network activity was by far too specific to be located on the same graininess level as \(E''\). Hence, according to VP, \(C_3''\) and \(C_2''\) are more suitable candidates for causing \(E''\) than is \(C_1''\).

Horizontal proportionality approaches the matter from a different angle. The idea is illustrated by Fig.  2. The comparison made by HP is one between two or more sufficient causes. However, in this case the non-proportional events are redundant—they occupy “too much” spacetime, or contain too many irrelevant subevents—with respect to the effect. The notion of a subevent is not metaphysically innocent here, of course. That many events contain further events as parts or subevents seems hard to deny nonetheless. WWII contains the subevent of the bombing of the Warsaw Ghetto, Cindy’s and John’s wedding contains as a subevent the exchanging of the rings, Usain Bolt’s world record run in Berlin contains as a subevent him crossing the victory line etc. In some cases, not all of the subevents of an event are causally relevant for a given effect. It is such redundant events that HP is supposed to declare as causally relevant for, but as not actually causing, the effect.

An example might help to illustrate the idea. If \(C_1^*\) is the entire macroscopic state of the universe at \(t_1\), \(C_2^*\) is the movement of my hand at \(t_1\) (such that \(C_2^*\) is contained or included in \(C_1^*\)), and \(E^*\) is the tipping over of my wine glass at \(t_2\), then it seems that \(C_2^*\) is, and \(C_1^*\) is not, horizontally proportional to \(E^*\). \(C_1^*\) may contain such things as Melania Trump’s combing of her hair, Bruno Mars giving a concert in Tokyo etc.—two subevents of the total state of the universe at \(t_1\) that, as far as we can tell, are far too remote to have an impact on my glass at \(t_2\). Consequently, \(C_1^*\) is not well-calibrated with respect to E*. It contains massive amounts of irrelevant surplus and, according to HP, is therefore disqualified as a causes of \(E^*\).

Whether HP is a justified or adequate ingredient in our theories of causation is a different question that cannot be answered in depth here. However, to offer a hint: A strong reason for believing that redundancy as targeted by HP is a problem for causation is that it violates a basic and widely accepted constraint on ontology and science. The constraint is often dubbed “Ockham’s razor” alluding to the 14th century scholar William of Ockham who famously demanded: Numquam ponenda est pluralitas sine necessitate [“Plurality must never be posited without necessity”]. The fundamental idea has received various formulations in many different philosophical and scientific contexts (cf. Baker, 2016). The principle has sometimes been justified on a priori grounds. One such defense claims that redundancy violates our aesthetic sense and our standards of rationality. Naturalistic justifications have cited as empirical evidence the patterns of acceptance and rejection of competing theories by working scientists (cf. Baker 2016, sections 3-4.)

Given that these justifications are roughly valid, it would be peculiar if parsimony constrained our ontological and scientific attitudes, but turned out to be inapplicable to our theories of causation.Footnote 9 Or, in other words, if the justifications of the parsimony principle are convincing in other fields, we probably want to let them guide us in our attempt to distinguish causes from non-causes as well. Otherwise, the number of causes of each everyday event would simply explode in an intractable fashion.

Fig. 2
figure 2

Horizontal Proportionality

The two conditions of VP and HP, both under an ontic and an epistemic reading, are logically independent. A cause can in principle satisfy both, only one, or none of the two conditions. To offer a rigorous proof for this claim requires a formal explication of VP and HP. Depending on the theory of causation presupposed, the formalizations will be different in each case. A more accessible and shorter way to support the independence claim is again by an example.

Consider the large mesoscopical event \(C_2^{*+}\), which may be the bodily movements of all party guests at \(t_1\) including \(C_2^{*}\) = the movement of my hand at \(t_1\). Since both \(C_2^{*+}\) and \(E^{*}\) = the tipping over of my wine glass at \(t_2\) are mesoscale events, and since \(C_2^{*+}\) had something to do with the occurrence of \(E^{*}\) (precisely because it contained \(C_2^*\)), \(C_2^{*+}\) can be expected to satisfy VP with respect to \(E^{*}\), at least relative to \(C_2^{*+'}\), which is the total microphysical event underlying \(C_2^{*+}\). The reason is that, on any account, \(C_2^{*+'}\) is on a much lower level than \(E^*\). At the same time, \(C_2^{*+}\) can be expected to fail HP with respect to \(E^*\), at least if \(C_2^*\) is taken into account. That is because \(C_2^{*+}\) contains much irrelevant stuff including Jenny waving Dave to come over to the bar. \(C_2^*\), in contrast, can be expected to fail VP with respect to \(E^{*'}\), which is the total microscopic state underlying \(E^*\). That is because, on any account, \(C_2^*\) is on a much higher level than the microscopic event \(E^{*'}\). However, \(C_2^{*}\) may still satisfy HP with respect to \(E^{*'}\), because a much smaller region than the one covered by \(C_2^*\) might not have been enough to secure the occurrence of \(E^{*'}\).

A better candidate for the satisfaction of VP with respect to \(E^{*'}\) might have been \(C_2^{*'}\) = the microphysical and microscopic state underlying \(C_2^{*}\). However, the event \(C_2^{*+'}\) = the microphysical and microscopic state underlying \(C_2^{*+}\) should be expected to satisfy neither HP nor VP with respect to \(E^{*}\) (= the tipping over of my wine glass at \(t_2\)). Or in short, \(sat(VP | \langle C_2^{*}, E^{*}\rangle ) = 1\) and \(sat(HP | \langle C_2^{*}, E^{*}\rangle ) = 1\); \(sat(VP | \langle C_2^{*+}, E^{*}\rangle ) = 1\) and \(sat(HP | \langle C_2^{*+}, E^{*}\rangle ) = 0\); \(sat(VP | \langle C_2^{*}, E^{*'}\rangle ) = 0\) and \(sat(HP | \langle C_2^{*}, E^{*'}\rangle ) = 1\); \(sat(VP | \langle C_2^{*+'}, E^{*}\rangle ) = 0\) and \(sat(HP | \langle C_2^{*+'}, E^{*}\rangle ) = 0\). This exhausts all logical possibilities.

How are VP and HP related to Yablo’s constraints? As mentioned above, Yablo defines causal proportionality over four conditions: A cause C is proportional with respect to an effect E, if and only if, E is contingent upon C, C is adequate for E, C is required for E, and C is enough for E. Contingency amounts to a version of counterfactual dependence of E on C. Adequacy implies a counterfactual sufficiency of C for E. Requiredness rules that less detail of C would have led to the non-occurrence of E, whilst enoughness declares that the genuine cause C should mark the upper bound of all events that satisfy requiredness with respect to E. Since counterfactual dependency and sufficiency are not sensitive to detail but simply to the occurrence or non-occurrence of events, one interpretation is that contingency and adequacy jointly embody HP. They are designed to a select an event that is “just of the right size” to be the cause. In contrast, requiredness and enoughness are explicitly designed to isolate an event that has “just the right amount of detail” to be the cause. Hence, the two latter conditions can be interpreted as jointly embodying VP.

In this sense, the application of proportionality constraints such as the one developed by Yablo or the one re-constructed by Harbecke (2014a; without Yablo’s essentialist metaphysics) can be thought of as criteria that “zoom in” on a proportional cause relative to a given effect. Partially and wholly irrelevant events are kicked out from the elected class of candidate causes, and within the supervenience hierarchy only such events pass the test that are akin to the effect in detail and specificity. Figure 3 is intended to illustrate this process of “zooming in” on the cause.

The following Sects. 46 aim to clarify what the relationship is between VP, HP, and some of the most prominent theories of causation. The aim is to show that HP is implied at least by some theories of causation, but not by others, and that some theories of causation accept versions of HP and VP as additional constraints. Moreover, it is argued that none of the currently most popular theories of causation contradicts either HP or VP as such. They could embrace HP and VP as further constraints, without rendering the underlying theory of causation incoherent. An inconsistency is attained with the help of additional assumptions only.

Fig. 3
figure 3

“Zooming in” on a cause relative to an effect

4 Interventionist Theories

The basic idea of interventionism is that a variable X is a cause of a variable Y if, and only if, there is an ideal intervention on X that brings about a change in Y without I being connected to Y through paths not involving X. Woodward offers the following definition for a (type-level) direct cause, a contributing cause, and an ideal intervention:

(M):

A necessary and sufficient condition for X to be a (type-level) direct cause of Y with respect to a variable set \({\mathbf {V}}\) is that there be a possible intervention on X that will change Y or the probability distribution of Y when one holds fixed at some value all other variables \(Z_i\) in \({\mathbf {V}}\). A necessary and sufficient condition for X to be a (type-level) contributing cause of Y with respect to variable set \({\mathbf {V}}\) is that (i) there be a directed path from X to Y such that each link in this path is a direct causal relationship (...), and that (ii) there be some intervention on X that will change Y when all other variables in \({\mathbf {V}}\) that are not on this path are fixed at some value. (Woodward, 2003, 59)

(IV):

I is an intervention variable for X with respect to Y iff “[(i)] I causes X; [(ii)] I acts as a switch for all the other variables that cause X (...); [(iii)] any directed path from I to Y goes through X (...); [(iv)] I is (statistically) independent of any variable Z that causes Y and that is on a directed path that does not go through X.” (op. cit., 98)

In Woodward’s original contributions, the notion of causal proportionality plays no explicit role. It was only when Woodward, and along with him several other authors (cf. List and Menzies, 2009; Shapiro, 2010; Shapiro & Sober, 2007; Raatikainen, 2010; Weslake, 2009) began to argue that interventionism provides a stable basis for a non-reductive theory of mental causation and higher-level causation in general that proportionality entered the picture. In the scenario discussed by Woodward (2008b, 2015) and classically presented by Kim (2003), the question asked is whether a mental event or variable \(M_1\) can cause a mental variable \(M_2\) if \(M_1\) has (by hypothesis) a physical supervenience base \(P_1\) and \(M_2\) has a physical supervenience base \(P_2\) such that \(P_1\) is the cause of \(P_2\). It seems plausible that under these conditions at least some interventions on \(M_1\) (which induce immediate changes in \(P_1\)) have an effect on \(M_2\) (because, by hypothesis, \(P_1\) influences \(P_2\), and at least some changes in \(P_2\) are correlated with changes in \(M_2\)).

Moreover, it may also be the case that some interventions on \(P_1\) sometimes lead to changes in \(M_2\), but in a less reliable way. Since, in Woodward’s view, reliability, or stability, are additional criteria that causal relations satisfy, there are reasons to believe that the relation \(M_1 \rightarrow M_2\) is a causal relation in a more substantial sense than the relation \(P_1 \rightarrow M_2\). Woodward himself observed the analogy between the stability criterion and Yablo’s proportionality constraint:

I have argued (...) that even if both \(M_1\) and \(P_1\) cause \(M_2\), there may be important differences between the \(M_1 \rightarrow M_2\) causal relationship and the \(P_1 \rightarrow M_2\) causal relationship: depending on the details of the case, the causal relationship between \(M_1\) and \(M_2\) may be more stable or better satisfy certain other desiderata, such as “proportionality” than the causal relationship between \(P_1\) and \(M_2\). (Woodward, 2015, 304)

Woodward does not make explicit what kind of proportionality he has in mind here: whether it is VP or HP. However, since his concern in the present context are “macroscopic properties that supervene on (or are ‘realized by’) other (typically microphysical) properties but that are not identical with those realizing properties” (Woodward, 2015, 303), it becomes clear that Woodward does not have in mind HP. HP is applied to single out a cause from a set of properties or events that display the same degree of graininess which may or may not overlap (cf. Sect. 3). In other words, if one out of \(M_1\) and \(P_1\) is more proportional to \(M_2\) than the other, it must be in the sense of VP.

Secondly, it is not entirely clear from this passage whether Woodward considers proportionality to be an additional constraint on causation, or whether he believes that VP is already implied by his theory. In an earlier paper, Woodward’s tendency seems have been towards the second option:

From an interventionist perspective, there are circumstances in which higher level causal claims may be more satisfactory (roughly when they figure in answers to a wider range of what-if-things-had-been-different questions or perhaps just provide answers to such questions that a lower level account does not provide) and also circumstances in which lower level claims will be more satisfactory (roughly when they involve relationships that are more stable or variables that are more well-defined for the purposes of intervention). (Woodward, 2008a, 183)

Under this interpretation, the criteria offered by interventionism already imply VP, and VP is not an additional constraint on interventionist causation.Footnote 10

In contrast, in the most recent contribution on the topic, Woodward describes proportionality as one of the criteria for “...choosing among explanations and causal representations that we are able to exhibit or formulate” (2021, 5). He offers the following new formulation of proportionality:

  • Woodward Proportionality (P\(^*\)): Suppose we are considering several different causal claims/explanations formulated in terms of different variables and representing different claims about patterns of dependency relations involving some target effect or explanandum E and where all of these satisfy some minimal interventionist condition [...]. Then, other things being equal, we should prefer those causal claims/explanations that more fully represent or exhibit those patterns of dependence that hold with respect to E. (Woodward, 2021, 11)

This suggests that Woodward considers proportionality as an additional constraint on causation that is not already implied by his original theory. At the same time, in his (2015, 305, fn. 19), Woodward emphasizes: “I do not regard proportionality as a necessary condition for causation.” Hence, whilst Woodward considers VP to play a prominent role in his theory, the status of this constraint relative to the central notions of the theory remains somewhat unclear.

What about HP? Does it play any interesting role in the context of interventionist theories of causation? The term as such clearly does not appear in Woodward’s writings. However, the problem of redundancy does appear in the debate at various places, namely as part of the problem of variable choice. Intuitively, if X represents the state of the whole universe preceding the tipping over of my wine classe Y (cf. Sect. 3), then \(X \rightarrow Y\) will probably satisfy (M). However, X will be grossly redundant (non-proportional) with respect to Y. In (2016, Sect. 11) Woodward actually discusses a version of the problem of variable choice, and he assigns criteria such as specificity and sparseness as relevant for such choices:

Causes that are non-specific figure in non-sparse causal representations in the sense that variables in such representations are connected to many other variables. Sometimes representations involving variables figuring in non-specific causal relationships can be replaced with representations involving new variables standing in more specific causal relationships (Woodward, 2016, 1070)

For instance, one may sometimes seek “to replace a representation according to which many different, disparate factors causally influence disease D with a representation in which a single factor E is the proximate cause of D.” (Woodward, 2016, 1070)

The demanded strategy of replacement of an non-sparse variable by a sparse one realizes the core idea of HP, namely that causes are generally non-redundant. Sparseness or HP does not enter Woodward’s definition of causation. Nevertheless, by recommending it as a choice criterion, Woodward seems to make a distinction between “better” and “worse” variables and, thereby “better” and “worse” causal models. Hence, the best causal models seem to observe this criterion in Woodward’s view.

And so it seems that, whilst VP and HP are not strictly implied by the most recent versions of Woodward’s theory, they (or very similar notions) do play a prominent role in his interventionist theory of causation. It remains to be asked whether the two criteria should be considered as epistemic or ontic concepts in Woodward’s framework. For answering this question, it is important to note that the interventionist theory of causation is primarily a theory about causal inference and causal explanation. It is, first and foremost, epistemic in this sense. It is meant to establish our best and perhaps pragmatically most useful causal hypotheses. This interpretation is supported by the fact that Woodward defends a non-reductive analysis of causation (Woodward, 2003, 20-22). His technical approach merely aims to clarify the interconnection of certain concepts, and not to provide an analysis.

Secondly, the relevant relata of the causal relation as defined by (M) and (IV) are variables. In his definition (M) (Woodward, 2003, 59), Woodward says “...for X to be a (type-level) direct cause of Y...”. He does not use a formulation such as “the causal claim that X causes Y is true if, and only if...”. Nevertheless, X and Y are supposed to be variables, not events (even though, of course, the former is intended to represent the latter). If variables are considered to be linguistic or epistemic tools primarily, it will seem like a category mistake to characterize variables—and the relationships holding between them—as ontic. On the other hand, variables can be understood as direct representations for abstract properties as well.Footnote 11 It may be more convenient to talk about variables than about abstract properties, but the direct reference gives variables an ontic taste. Nonetheless, even such ontically hued variables serve epistemic functions first and foremost, and the relations holding among them are primarily epistemic at least in this sense.

In conclusion, it seems that VP and HP are neither strictly implied by, nor explicitly contained in, the interventionist theory of causation. However, Woodward does recognize the importance of these criteria at least in some interventionist inferences, and they are consistent with his theory. Moreover, in his view, the two criteria work on a different level and are independent from another. In this sense, a cause that is more proportional in Woodward’s view can be either vertically, horizontally, or vertically as well as horizontally more proportional. Woodward grants that both criteria are used in search for relatively better causal models in the framework of interventionism. And since interventionism deals with causation mostly on the epistemic side, VP and HP as well become (primarily) epistemic concepts.

5 Humean Theories

David Hume famously defined a cause as “...an object followed by another, and where all the objects, similar to the first, are followed by objects similar to the second. Or, in other words, where, if the first object had not been, the second never had existed.” (1748, Sect. 7) Hume’s interpreters have seen two related but different theories of causation in this quote. According to the first one, causation consists of perceived regularities; and wherever there is a regularity observed (that fails to be falsified), there is a reason to infer the presence of a causal relationship. According to the second one, causation has essentially a modal dimension: whether an event is a cause at the actual world depends on its (non-)correlation with its potential effects in non-actual scenarios. Both theories have in common that they do not believe in necessary connections. Whether A causes B purely depends on the contingent distribution of properties and facts.

Modern regularity theorists such as Mackie (1974), May, Graßhoff, and Baumgartner (cf. Graßhoff & May, 2001; May,, 1999; Baumgartner & Graßhoff, 2004; Baumgartner, 2008) have used this fundamental idea to develop more technically sophisticated theories of causation. Mackie focused on an INUS condition which answers to the problem of monotony for first order universal conditionals.

Let ‘A’ stand for the INUS condition-in our example, the occurrence of a short-circuit at that place-and let ‘B’ and ’\(\lnot\)C’ (that is, ‘not-C’, or the absence of C) stand for the other conditions, positive and negative, which were needed along with A to form a sufficient condition of the fire-in our example, B might be the presence of inflammable material, \(\lnot\)C the absence of a suitably placed sprinkler. Then the conjunction ‘AB\(\lnot\)C’ represents a sufficient condition of the fire, and one that contains no redundant factors; that is, AB\(\lnot\)C is a minimal sufficient condition for the fire. (Mackie, 1974, 246)

Graßhoff and May eventually managed to solve the additional “Manchester Hooters Problem”Footnote 12 that still bothered Mackie by the introduction of a minimal theory (cf. Graßhoff & May, 2001, 16; cf. also Baumgartner, 2008, 340). A further extension of this theory has recently been developed by Baumgartner and Falk (2019).

The question is whether the notion of causal proportionality plays any role for these modern regularity theories. In order to see the connection, it is important to distinguish between Mackie’s causes as INUS conditions, and a complete cause, which is a conjunction of INUS conditions. It makes little sense to search for non-redundancy in individual INUS condition given the fact that, by definition, an INUS condition is insufficient for the occurrence of an effect. However, the non-redundancy or minimization constraints mentioned by Mackie (1974, 76) and introduced by May, Graßhoff, and Baumgartner, which apply to complete causes, reflect HP: a factor F is causally relevant for a factor G only if it stays within a minimally sufficient condition on any extension of the factor frame. Or put differently, a conjunction of factors is minimally sufficient for a given effect E if, and only if, it contains only INUS conditions of E. Such a conjunction satisfies HP with respect to E, because it does not contain any irrelevant stuff. And so at least in this sense, the modern regularity theories imply HP.

With respect to VP, the answer is slightly more complex. Suppose a causal structure as the one illustrated by Fig. 4. It is based on the factor frame \({\mathcal {F}}_0 := \{ A, B, C, D, E, F, G, H \}\) such that the factor set ABCD is sufficient for FG, whilst no other factors that are located on the same level as A, B, C, and D and that are not themselves causes of A, B, C, and D are required to bring about FG. Suppose further that the higher-level factor E supervenes on A, B, C, and D (conjoined) and the higher-level factor H supervenes on F and G (conjoined).

Fig. 4
figure 4

Proportionality for regularities. Dotted arrows represent causal relationships; full arrows represent supervenience relationships

Under the assumptions that the factors in question are either “switched on” or “switched off”, the structure of this kind will generate the data summarized in Table 1. Each row represents an observed coincidence of factors. Since the regularity theory can determine the direction of causation only if, for any given effect, there are at least two independent type-level causes, the table assumes the hypothetical existence of factors acE, acF , acG and acH, which are presumed to be alternative causes of F, G, and H, respectively.Footnote 13

Table 1 Coincidence data generated by a causal structure involving overlapping events of different grains

Data of this kind poses a challenge to the modern regularity theories of causation in several ways. First, from the coincidence table it can be uncovered that the factor set AB is minimally sufficient for F (row c11 shows the sufficiency, whereas rows c2–c3, c6–c10, and c12–c16 prove the minimality). The existence of a second minimal sufficient condition for F—as indicated by row c18−allows us to interpret the relationships between A and F and B and F as causal (under certain further assumptions; cf. Baumgartner, 2008). Furthermore, factor set CD is minimally sufficient for G, and the information contained in row c19 allows for a causal interpretation. Finally, E is minimally sufficient for H, and by taking into account rows c20 and c21, this relationship is causally interpretable as well.

But now something strange happens. ABCD turns out to be minimally sufficient for factor E, and the alternative cause acE of E suggests that we have uncovered a causal relationship. Moreover, the factor set FG is minimally sufficient for H, which by taking into account rows c17 and c20 is causally interpretable as well. In short, based on concidence table data, the modern regularity theories cannot distinguish between a structure as the one illustrated by Fig. 4 and structures as the one illustrated by Fig. 5, which involves only causal relationships.

Fig. 5
figure 5

Proportionality for regularities

Even if additional information about the graininess of factors is introduced by declaring, perhaps, \({\mathcal {F}}_1 := \{ A, B, C, D, F, G, acF, acG \}\) to be a fine-grained, or “lower-level”, set and \({\mathcal {F}}_2 := \{ E, H, acE, acH \}\) to be a coarse-grained, or “higher-level”, set, for the regularity theory the connection between, for instance, FG and H will still have to be interpreted as a causal relationship, and the regularity theory will accept inter-level causation. This result is perhaps even desirable. For instance, in the debate on supervenience Leuridan (2012) has argued that no meaningful distinction can be made between causation and supervenience, and so both should be considered as different versions of causation. Since a regularity framework interprets mechanistic constitution as an instance of supervenience (cf. Harbecke, 2014b), Leuridan’s claim mighty apply in an analogous way.

In contrast, if, for independent metaphysical reasons, one would like to introduce a distinction between causation and supervenience, VP as a constraint on causation would yield the desired result. Based on \({\mathcal {F}}_1\) and \({\mathcal {F}}_2\), it would characterize the relationships \(AB \rightarrow F\), \(acF \rightarrow F\), \(CD \rightarrow G\), \(acG \rightarrow G\), \(acE \rightarrow E\), \(E \rightarrow H\), and \(acH \rightarrow H\) as causal relationships, and it would declare \(ABCD \rightarrow E\) and \(FG \rightarrow H\) to be non-causal regularities. That is because VP would allow causal relationships only among members of the factor set \({\mathcal {F}}_1 := \{ A, B, C, D, F, G, acF, acG \}\) and among members of the factor set \({\mathcal {F}}_2 := \{ E, H, acE, acH \}\). All other sufficiency relationships it would (correctly) interpret as representing supervenience relationships within a supervenience hierarchy.

What this shows is that proportionality in the sense of VP plays no explicit role for the modern regularity theories of causation so far. However, it is consistent with these and can be introduced in order to make a distinction between causal and supervenience regularities for which the modern regularity theories of causation are blind otherwise.

In conclusion then, the regularity theory of causation already in some sense implies HP through its minimal sufficiency. VP is not a topic explicitly dealt with in the literature on regularity theories. However, the core idea can be integrated into the theory and may even be a natural extension of it that solves a problem faced by them.

Are HP and VP to be interpreted ontically or epistemically in the context of regularity theories of causation? The answer depends on how one conceives of regularities. Hume himself may be read as having defended a non-theory, or anti-realist theory of causation. “In reality, there is no part of matter, that does ever, by its sensible qualities, discover any power or energy, or give us ground to imagine, that it could produce any thing, or be followed by any other object, which we could denominate its effect.” (1748, Sect. 7) In Hume’s view, not even regularities are objective. Rather, they are perceived uniformities. Hence, if Hume would have accepted HP through minimization and VP as an additional constraint, he probably would have declared both to be purely epistemic principles.

Modern regularity theorists such as Mackie, May, Graßhoff, and Baumgartner are less explicitly empiricist and, in their rhetoric, seem to presuppose the objectivity of regularities. They provide both a definition of causation and principles for causal discovery (cf. Graßhoff & May, 1995; Baumgartner and Graßhoff, 2004). The latter is, of course, compatible with Hume. Hence, at least within the modern frameworks, HP and VP could in principle be interpreted ontically within a regularity framework.

Concerning the modal version of Humean causal theory, it has already been mentioned in Sect. 2 that Yablo, (1992) grounds his theory of mental causation in a counterfactual theory of causation (cf. also Harbecke,, 2014a). He introduces HP and VP as additional constraints yielding the result that mental events sometimes are causes. There have been some attempts to show that VP is actually implied by the counterfactual theory. As List and Menzies (2009) claim:

Assuming that the closest worlds in which the monkey does not have neural property N1 are ones in which it has another neural property realizing the intention I1, one can see that the second counterfactual is false: in any such world, the monkey has another neural property that realizes I1, and so performs A1.” (488)

With this assumption in place, they can show that some scenarios involve higher-level causation, namely when the higher-level events is “realization insensitive”.

The problem is that, at least from the viewpoint of the classical counterfactual theory of causation, List and Menzies have overlooked a technical challenge by ignoring the “ban on replacement” (Bennett, 2003, fn. 21; cf. also Lewis, 2000, 190 and Lewis, 1986, 211). According to this principle, the possible world to be used for the evaluation of a counterfactual should not be one in which the event referred to by the antecedent of a counterfactual is replaced by another one. Rather, the event must be deleted and gone in that possible world, as though one applied a “metaphysical hole puncher”. Otherwise, the counterfactual theory of causation will fail to identify any cause for (almost) any actual effect. If so, even if the higher-level event is realization insensitive, one does not get the result that the lower-level counterfactual is false.

If M supervenes on N1 without being “realization sensitive” (i.e. the closest non-N1 world is not a non-M world), then the closest non-N1 world relevant for the counterfactual interpretation is not one in which a replacer N2 is present. This observation shows that VP is not implied by the counterfactual theory. It would have to be added as an additional principle or constraint on counterfactual causation. This is the move that Yablo proposes in his paper.

Are HP and VP ontic or epistemic concepts under Yablo’s definition? This again depends on the interpretation of the modalities involved. In Lewis’ modal realism, causation and causal proportionality should probably be considered ontic relations. In Stalnaker’s modal anti-realism (cf. Stalnaker, 1968), for instance, it is less clear that the relationship is an ontic one. In short, both options remain open, and they depend on further background assumptions about the ontology of possible worlds.

6 Power Theories

The power theory of causation is connected to Shoemaker’s theory of properties as sets of powers. Shoemaker’s central notion is that of the bestowing of powers through properties: “A property P bestows a causal power C(K, E) upon the objects that have it just in case instances of P, in circumstances K, cause (or contribute to causing) instances of E, and the holding of K alone does not cause (or contribute to causing) instances of E.” (cf. Shoemaker, 1980).

The very notion of properties as “sets” suggests a special way in which properties can be related to one another. According to the author, “(...) sometimes the conditional powers bestowed by one property will be a proper subset of those bestowed by another.” (Shoemaker, 2003b, 431) Since higher-level properties are just that—subsets of powers of fundamental properties—the subset relation makes clear, in Shoemaker’s view, that higher-level properties do not compete with lower-level ones. As he says about the alleged competition of two properties P1 and P2, where P2 contains a subset of the powers of P1:

Yablo’s notion of proportionality can be applied here. Where the only causal features of property P1 that play a role in producing an effect are ones that belong to property P2, of which P1 is a determinate or realizer property, there seems a good sense in which considerations of proportionality favor the instantiation of P2 over the instantiation of P1 as a cause of the effect. (Shoemaker, 2003b, 436)

In other words, Shoemaker considers a proportionality constraint to be not implied by, but compatible with, his power theory of causation. However, it is not entirely clear whether the kind of proportionality considered by Shoemaker corresponds to VP, HP, or both.

If the higher-order property is a subset of a lower-level property, it seems that a principle very similar to that of HP is intended. Shoemaker might have had in mind something like the following example. If property \(P_1\) containing powers \(\{p_1, p_2, p_3 \}\) is a subset of property \(P_2\) containing powers \(\{p_1, p_2, p_3, p_4 \}\), and for some effect property \(P_3\) containing powers \(\{p_5, p_6, p_7 \}\), only powers \(p_1\), \(p_2\), and \(p_3\) are required but not \(p_4\), then \(P_1\) is, and \(P_2\) is not, proportional to \(P_3\). In other words, the proportionality constraint as intended by Shoemaker may be isomorphic to the minimality condition of the regularity theory (cf. Sect. 5).

Whether VP has any application in this context is not obvious. That is because it is not entirely clear whether Shoemaker accepts that there are some non-overlapping properties or sets of powers that are connected by supervenience and are realized in the same place and time. If the answer to this ontological question is “yes”, Shoemaker could introduce a version of proportionality in the sense of VP. However, again, his theory does not imply such a constraint.

It is also not clear whether Shoemaker intends proportionality as epistemic or ontic. Since powers in his view are objective features of the world, it seems natural to conclude that Shoemaker has an ontic view of proportionality as well. However, since his theory is in principle interpretable as a non-realism about causation, proportionality may also turn out as an epistemic concept in his overall framework. Both options seem open for the unconstrained power theory of causation.

7 Conclusion

This paper offered an investigation into the content of proportionality as a constraint on causation. It first reviewed some contributions in the philosophical literature that explicitly discuss, and in some cases adopt, Yablo’s causal proportionality constraint. Subsequently, two general versions of the constraint were distinguished, namely “horizontal” and “vertical” proportionality. Moreover, the paper analyzed the relationship between the notion of proportionality and some of the classical theories of causation. With respect to each theory, it was discussed whether proportionality should be considered as an ontic, or as an epistemic, i.e. explanatory, constraint. Both positions can be found in the mental causation literature.

The following main claims were defended explicitly:

  1. 1.

    The horizontal (HP) and the vertical version (VP) of the proportionality constraint are logically independent; they do not entail each other or the other’s negation.

  2. 2.

    HP is implied by some of the currently most popular theories of causation.

  3. 3.

    None of the currently most popular theories of causation contradicts either HP or VP. They could embrace HP and VP as further constraints, without rendering the underlying theory of causation incoherent.

  4. 4.

    HP and VP are not ontic or epistemic principles as such; rather, whether they are ontic or epistemic depends on the theories chosen plus background assumptions about the existence of higher-level causes and their non-identity to lower-level ones.

Due to lack of space, a number of further questions concerning the proportionality constraint could not be discussed in detail in the present paper.

For instance, one further important class of theories of causation was not discussed, namely process theories as defended by Wesley Salmon (1984, 1997, 1998) and Phil Dowe (1992a, b). As it turns out, Dowe (2010) is quite sympathetic to the idea of proportionality as a constraint on causation. However, a deeper analysis of the role of proportionality in process theories of causation will have to await a future contribution on the topic.

Secondly, Sect. 3 only offered a preliminary motivation for the acceptance of proportionality as a constraint on causation. Much more would have to be said if a full defense of this substantial constraint should be considered reasonable.

Moreover, whether proportionality makes sense from a scientific, and not merely an colloquial, point of view could not be investigated in depth.

Finally, a formal proof of the logical independence of the two proportionality constraints was not offered, and the adaptions of HP and VP to the context of each particular theory of causation could be offered only in a preliminary way. Answering these questions and meeting these challenges will have to be left for future research on the topic of the proportionality constraint on causation.