1 The interpretational theory of logical validity

This paper is about logical validity and not about a more generic notion of validity. I will assume throughout the paper that logical validity is a notion that has been discussed at least since Aristotle. I will also assume that this notion has been made precise in the last century or so through model theory, where it is defined as preservation of truth in every model, where models are set-theoretic constructions. However, philosophers disagree on what we are modelling through models because they disagree on what logical validity is.

In his book ‘The concept of logical consequence’, Etchemendy distinguished between a representational and an interpretational theory of logical validity (1990). According to the interpretational theory (IR), logical validity is preservation of truth across all acceptable interpretations, where an interpretation is acceptable only if compatible with the intended meaning of logical expressions.Footnote 1 According to the representational theory (RE), on the other hand, logical validity is preservation of truth in all possible worlds. IR and RE are not the only options: a hybrid view is often preferred, where validity is preservation of truth in a world and an interpretation (Hanson, 1997; Shapiro, 1998; Sher, 2001).

IR and RE are not technical definitions, but rather explications of a pre-theoretic notion, so there is no technical notion of interpretation or representation attached to them. Yet, it is natural to interpret the models of model-theory as depicting interpretations in IR and possible worlds in RE.Footnote 2 The question: ‘What is logical validity?’ can be recast as the question of what models are models of. Consider a model where something satisfies ‘is grass’ exactly when it is snow and ‘is blue’ exactly when it is white. IR suggests that the model is depicting an interpretation where ‘grass’ means snow and ‘blue’ white, and the fact that ‘Grass is blue’ is true in the model is mirroring the fact that ‘Grass is blue’ is actually true, so-and-so interpreted. RE suggests that the model is depicting a world where grass is blue, and the fact that ‘Grass is blue’ is true in the model is mirroring the fact that, under its English reading, the sentence would be true had that world been actual. In IR we keep the world fixed and change the interpretation of the language; in RE an interpretation is prefixed, and we simply ‘apply it’ to different possibilities (1990, p. 21).

2 The cardinality problem

Here is an objection against IR. Take the following schema, where n is any positive integerFootnote 3:

$$\begin{aligned} \text{ There } \text{ are } \text{ at } \text{ least } n \text{ things } \end{aligned}$$
(CN)

Logic students know that CN can be expressed using only connectives, existential quantification and identity. People argue that CN instances come out logically true, under IR. They reason as follows: according to IR, a sentence is logically true when true in all reinterpretations compatible with the intended meaning of logical expressions. By ‘intended meaning’ of an expression we mean the meaning the expression has as we are using it. The meaning of logical expressions ought to be kept fixed.Footnote 4\(\exists \), \(\lnot \) and \(=\) are all logical expressions, and since any instance of CN is composed of these logical expressions alone, it has only one acceptable interpretation: that there are n things. Thus, it is logically true if true in the intended interpretation; that is, it is logically true if true. It is true because there are n things, so it is logically true.Footnote 5

This result is considered a defeating objection for IR because CN instances are not logical truths. Why, though? In the literature there is no unique reasoning behind the claim that CN instances are not logical truths, nor is there a specific methodology that people follow when they make this claim. The best way to show that the CN schema is not valid is by showing that there is an informational asymmetry between its instances. Consider the following:

$$\begin{aligned}&\text{ There } \text{ are } \text{ at } \text{ least } \text{30 } \text{ things } \end{aligned}$$
(C30)
$$\begin{aligned}&\text{ There } \text{ are } \text{ at } \text{ least } 31 \text{ things } \end{aligned}$$
(C31)

Intuitively, C30 follows from C31, but not vice-versa. There seems to be an asymmetry in information between these sentences because what C30 says is ‘contained’ in what C31 says but not conversely. One can correctly conclude C30 solely in virtue of the information provided by C31, but not vice versa. Yet, if both C30 and C31 were logical truths, they would trivially co-entail each other. They don’t, so they are not logical truths.

Model-theory secures the desired result because different models have different domains, some with fewer than n things for any positive n. Also, whenever a model has \(n+1\) things in the domain it has n things, so C(\(n + 1\)) implies C(n) for any n. The challenge for IR, though, is how to make sense of such counterexamples if we take models to depict interpretations, since the argument above suggests that there is no interpretation compatible with the meaning of logical expressions that corresponds to any model with a finite domain.

Some might complain that we need to make clearer what we mean by ‘information-asymmetry’. A way to do it is through the notion of ‘analytic containment’: in virtue of what the existential quantifier, conjunction, negation and identity mean, there being 30 things is contained in there being 31 things, and to deny C30 on the hypothesis that C31 is a contradiction in terms. Someone who understands the meaning of logical expressions understands sentences like C30 and C31, since logical expressions are all they are made out of. And someone who understands what C31 means must conclude that C30 is true, if C31 is. Yet, the reverse does not seem to be unintelligible in the same way: even if one did conclude C31 after understanding and assuming C30, one would not do so because of what the logical expressions (and thus C30) mean.

Philosophers disagree on the definition of ‘analytic’, or on whether the notion makes sense, at all. Under Frege’s prominent definition of ‘analytic’, analytic containment is perfectly useless as an argument against the validity of the inference from C30 to C31. Frege (1884) defines an analytic statement as something that can be converted in a logical truth by swapping synonyms with synonyms, so all logical truths are trivially analytic. Under Frege’s theory, to first establish whether \(\ulcorner \)If C30 then C31\(\urcorner \) is analytic, one would need to establish if it is a logical truth, since there is no interesting swapping to be done, as the statement is purely made out of logical terms. Yet, \(\ulcorner \)If C30 then C31\(\urcorner \) is logically true only if the argument from C30 to C31 is logically valid.

Arguably, if one is sceptical about the notion of analyticity, or if someone is sympathetic to the Fregean definition, they should not reject altogether the idea that there is an asymmetry in information between C30 and C31. Rather, they should try to spell it out in different terms. One other way to spell out an asymmetry in information is to appeal to warrant in hypothetical reasoning. One seems warranted to infer C30 from C31; yet, if we start from C30, C31 does not seem to be guaranteed in the same way. In this sense, if C31 is true then C30 must be true on that basis alone, but not vice versa. C31 cannot be logically true, for a logical truth comes with a ‘self-warrant’ for its own truth, so to speak: regardless of where we start, we are always warranted to infer it. Yet, by starting from C30, we do not seem to be warranted to infer C31. Warrant in hypothetical reasoning can be useful to clarify what we mean by ‘informational asymmetry’, and does not rely on the notion of analyticity, nor is it rendered useless by the Fregean definition.

3 Attempted solutions to the cardinality problem

The cardinality problem prompted different reactions:

  1. (1)

    Some give up on a purely interpretational theory, and add a representational element to address the cardinality problem (Hanson, 1997; Shapiro, 1998; Sher, 2001).

  2. (2)

    Some bite the bullet and argue that CN instances are logically true, and we need to dispel any impression to the contrary (Williamson, 1999).

  3. (3)

    Some consider changing the set of logical expressions to stop the argument in its tracks (Quine, 1986; Etchemendy, 1990).

3.1 First and second strategy

Reaction (1) is not really relevant to our discussion, since it is an admission of defeat rather than a solution within the IR account. What is interesting to note is that mixing RE with IR does not straightforwardly fix the issue. ‘There are n things’ for any n is a mathematical truth, and mathematical truths are metaphysically necessary, so true in all possible worlds. Calling for a special, stricter sense of ‘logical’ possibility is quite unhelpful, and might be convincing only if someone has already assumed that CN instances are not logically true, which begs the question. For arguably we understand something to be logically necessary when it is logically true. Hanson acknowledges this problem, and he looks at ‘sub-worlds’ of possible worlds to find a representational counterexample to CN instances. He asks us to consider ‘just the Washington Monument and the White House’. If we focus on these two things alone, we have an actual case where it is false that there are three things (1997, p. 388). I will discuss later how we can frame Hanson’s reasoning inside a purely interpretational account where we make quantifiers context-sensitive. This shows that there was no need to depart from IR in the first place.

People who bite the bullet and admit that CN are logically true tend to focus on the simplicity and elegance of the interpretational theory. They argue that these features make it worth it to depart from the commonly accepted logical truths. After all, we are only adding validities, not subtracting them, so classical logic is safe. Williamson argues that the target of the logical enquiry are ‘general principles involving the selected logical constants, just as the target of a physical inquiry may be general principles involving only terms from a language for physics’ (2017, p. 331). Generality is all we are after. ‘Adding a second dimension of necessity, a priority, or analyticity needlessly complicates the picture, mixing together questions that our fundamental terminology should hold carefully apart so that it can represent their interrelations perspicuously’ (2017, p. 328). Thus, according to him issues of asymmetry in information are probably misguided, since logical validity has not much to do with analyticity or even a priority.

My issue with the ‘bite the bullet’ strategy is that logical validity has quite a venerable pedigree, and information-containment, formality and a priority are among its commonly accepted features. This makes the departure from a classical and traditional notion of logical validity theoretically quite expensive. By following an abductive methodology, if we actually weight this theoretical departure from the traditional notion against the simplicity and elegance of the interpretational theory, it is not so clear-cut that the departure is worth doing. The best possible outcome would be to keep the simplicity of the interpretational account and do not depart from a traditional notion of logical validity as analytically truth-preserving, a priori and and as a form of information-containment. I argue that IR paired with a context-sensitive account of quantification achieves both results, and should therefore be preferred.

3.2 Third strategy: changing the logical expressions

The third strategy relies on tampering with logical expressions. People who follow this strategy usually either deny that identity is logical or deny that the existential and universal quantifiers are logical. Quine (1986) held the first view. Before we dive into the details, my general complaint about this strategy is that it does not really address the point of the cardinality problem. Even someone who thinks that there is a definite set of ‘real’ logical expressions will agree that sometimes we simply treat an expression as logical, for the purpose at hand. For example, the factivity of knowledge is a truth of the logic for knowledge. This fact does not require us to say that the knowledge operator is a logical expression, but only that it is treated as such, in the context of epistemic logic. If so, then the cardinality problem can be recast as follows: in the logic of quantification of identity where we treat quantifiers and identity as logical expressions, we should not expect CN to come out logically true because there is an asymmetry in information between its instances. The third strategy simply cannot address the cardinality problem, when put in this way.

The most popular version of the third strategy is to deny that universal and existential quantifiers are logical. This strategy itself divides into two sub-strategies, both discussed and rejected by Etchemendy (1990, pp. 65–69). In the first sub-strategy, ‘something’ is treated as a variable (i.e. non-logical) term. Yet, its range of admissible reinterpretations is somehow restricted to generalised existential quantifiers like ‘some-dogs’, ‘some-humans’ etc. As Etchemendy later points out, this strategy looks quite ad hoc. If ‘something’ is not logical, it seems that we should be able to reinterpret it as any other (unary) quantifier, like ‘everything’. If we cannot, then we are restricting the range of legitimate interpretations. In the interpretational view, this must be because of the logical form of sentences and the meaning of logical expressions. Yet, the existential quantifier is not logical, so it cannot be the culprit and we cannot account for the restriction.

The second strategy Etchemendy suggests is a more refined version of the first one. The existential quantifier, like any other quantifier, is composed of a determiner—‘some’—and a noun-phrase—‘thing’. Maybe what is logical is the determiner alone, and not the whole quantifier. If so, then the noun-phrase can change arbitrarily, causing a shift in the range of quantification. This strategy is not directly applicable to the syntax of standard first-order logic where there are no generalised quantifiers, yet it can be implemented by exploiting the equivalence between ‘Some \(\phi \) is \(\psi \)’ and ‘Something is \(\phi \) and \(\psi \)’. Call L a language of FOL and \(L^+\) the result of adding generalised quantification to L. A determiner in \(L^+\) works like a binary function which takes two noun-phrases and gives out a sentence (Barwise & Cooper, 1981). In the syntax, we will have formulae like \({\exists (\phi , \psi )}\), where \(\phi \) is the noun-phrase that restricts the range of quantification. The idea is to uniformly translate formulae of the form \({\exists x \psi }\) of L in \(L^+\). We then reinterpret ‘thing’ in \(L^+\), to then translate the sentences back into the original L, via the equivalence of \({\exists (\phi , \psi )}\) with \({\exists x (\phi \wedge \psi )}\).Footnote 6

Etchemendy claims that the proposal under-generates because it makes existential generalisation invalid. If we interpret ‘some-thing’ as ‘some-dog’, the generalisation from ‘Socrates is a philosopher’ to ‘Something is a philosopher’ fails because nothing is a dog and a philosopher. However, I see more serious issues with the proposal. Often in the literature, when we translate L into the language of generalised quantifiers, we translate ‘Something is Socrates’ as \(\exists (x=x, x= \text{ Socrates})\). Yet, if we follow this natural translation, the solution does not work anymore since—modulo the logicality of identity—‘\(x=x\)’ has no non-trivial reinterpretations. This shows that enriching the language with generalised quantifiers is not really what makes the strategy work. The real engine of the solution is suggested by Etchemendy in a note: we need to translate existential sentences via a restriction to an unspecified, non-logical predicate U, which is reinterpreted arbitrarily (1990, p. 166). However, no grounds have been given for translating ‘Something is F’ in the language of generalised quantifiers as \(\exists (Ux, Fx)\) rather than \(\exists (x=x, Fx)\). Even if we insisted on not translating ‘thing’ as \(x=x\), we could avoid using ‘thing’ altogether and instead use a new word explicitly for \(x=x\), like ‘self-identical’. Under Etchemendy’s proposal ‘There are two self-identicals’ would still be logically true, since it should be translated as \(\exists (x=x, \exists (y=y, x\ne y))\). Even more convincingly, Etchemendy’s proposal does not work for the language of generalised quantifiers \(L^+\), for then \(\exists (x=x, \exists (y=y, x\ne y))\) would be logically true. The fact that we need to stipulate that U is non-logical for the strategy to work is also quite suspicious, since logicality is about the semantic value of a term, not about how we introduce it. Arguably, then, it seems to me that U is logical if self-identity is, since they mean the same thing. Yet the second is treated as logical in Etchemendy’s theory, while the first is not.

4 The context-sensitive account of the quantifiers

4.1 The proposal

The existential and universal quantifier \(\exists \) and \(\forall \) in the object-language should roughly be taken to mean ‘something’ and ‘everything’ respectively. A very common idea is that ‘something’ and ‘everything’ are context-sensitive. An expression is context-sensitive when an aspect of its meaning cannot be captured through linguistic means alone, but must be supplied by some contextual, non-linguistic parameter. Indexical expressions are the paradigmatic examples: I find a piece of paper on the ground, it says: ‘I am Greek’. I cannot tell what the content of the writing is, unless I am told who wrote the sentence. The claim I wish to defend is that \(\forall \) and \(\exists \) in first-order logic ought to be taken as context-sensitive, as well, roughly like ‘everything’ and ‘something’ in natural language. If so, then we have a solution to the cardinality problem: every instance C(n) of CN is made false by a context in which less than n things are relevant, so they are not logically true, after all.

Even though the discussion on the semantics of quantifiers in natural language is admittedly quite nuanced, a great deal of literature backs up the claim that ‘something’ and ‘everything’ are context-sensitive. I am at a dinner party and—sadly—we ran out of wine. When I say: ‘Every bottle is empty’, I do not mean to say that every bottle in the universe is empty, just the ones we bought for the dinner party. If I then go out to the shop to buy more, and I say: ‘Every bottle is sold out’, I am now talking about the bottles in the shop, not the empty ones we just drank. Different contexts, different meanings, hence context-sensitivity.Footnote 7 The proposal itself is compatible with different theories of context sensitivity. The way in which quantifiers can be context-sensitive, however, is limited by the expressive ability of the language of first-order logic which we are interpreting: the contextual work must be done in the meta-language, since there is no contextual parameter in the syntax of first-order logic.

My proposal to treat \(\exists \) and \(\forall \) in first-order logic as context-sensitive is natural, given the intuitive semantic identification of \(\forall \) with its natural language counterpart, and the likely context-sensitivity of the latter. Suppose we are dealing with a formal language with the syntactic expressions ‘H’ and ‘Y’, which we paraphrase as ‘here’ and ‘you’. It would be quite odd to paraphrase them in such a way, while refusing to treat ‘Y’ and ‘H’ as context-sensitive.Footnote 8

The meaning of a logical expression ought to be kept constant across all interpretations. This is correct but misleading for context-sensitive expressions because only the so-called linguistic part of their meaning ought to be kept fixed. In Kaplanian terms, we save the character, not the content: we keep constant the linguistic rule that fixes the content, not the content itself. For example, in a logic for the first person ‘I’, the character of ‘I’ is the way we fix its referent when given a context; the content of ‘I’, given a context, is the subject of the context (Kaplan, 1989). It would be silly to assume that, when treated as a logical expression, ‘I’ ought to keep its content fixed: that it ought to refer to the same person across all interpretations. Rather, we let its referent vary arbitrarily, yet we do not change its character: in every interpretation its content is always the subject of the context. In quite the same manner, what must be kept fixed across interpretations is only the character, not the content of \(\forall \). The character of \(\forall \) is a function from contexts to the content of \(\forall \) in a context. The character is fixed: in all interpretations \(\forall \) ranges over everything relevant at the context. Yet, its content is not fixed: in a context where all and only dogs are salient, the content of \(\forall \) is that of ‘every dog’, while in a context where all and only numbers are salient, it is that of ‘every number’.

The fallacy in the argument for the validity of CN in Sect. 2 is diagnosed as a confusion about contexts: ‘There are n things’ is true under the interpretation it has as we are using it, i.e. it is true in the meta-theory for any n. Yet, this only shows that CN instances are true when interpreted under the context of the meta-theory, not under any context. The fact that we must keep its interpretation fixed means that we need to keep its character fixed, but the contextual ingredient to meaning can change, triggering a shift in content.Footnote 9

The context-sensitive theory is still a version of IR because it is impossible to interpret context-sensitive expressions without specifying a context. The relevant notion of truth in the definition of validity is still ‘truth in an interpretation’: it is not that the formula is true ‘in a possibility’—a context. Rather, only in a context does a sentence mean something which can be said to be true or not. It is not that the sentence is true in an interpretation and context; rather, the context is part of the interpretation. In the example above, ‘I am Greek’ means different things in the mouth of different people. We can assess the sentence only if we are contextually given a subject. If the subject of the context is Pericles the sentence is true, if it is Caesar it is false.

One might argue that validity will itself be context sensitive if quantifiers are context-sensitive. Truth in an interpretation is context sensitive, since an interpretation specifies a context, and the same sentence can be true and false at different contexts. However, validity itself is defined as preservation of truth in all interpretations, and therefore it implies preservation of truth in all contexts, so it is not context-sensitive. However, Georgi (2015) argues that in languages with demonstratives such as ‘this’ and ‘that’, if we agree that the reference of these expressions can vary within the same context, validity and logical truth obtain or fail to obtain only relative to a context. In a context where all occurrences of ‘this’ refer to the same thing ‘This is this’ is logically true, but it is not logically true in a context where the two occurrences refer to different things. If quantifiers are ‘promiscuous’ in a similar way, and they shift range within the same context of utterance, then one could claim that we need a notion of logical consequence relative to a context, as well. I assume that one can extend the framework developed by Georgi to implement this theory.Footnote 10

4.2 Formal system

To make my proposal more precise, I now provide a set-based semantics that implements context-sensitivity. The interpretations I obtain are equivalent to models of free logic in the usual set-based model-theory.

The meta-theory I am using is Zermelo Fraenkel set-theory with choice. The language we are interpreting is a first order language L with countably many predicates \(P_1,\ldots , P_n,\ldots \) of arbitrary arity, the indentity predicate \(=\), variables \(v_1, \ldots , v_n,\ldots \), constants \(a_1, \ldots , a_n, \ldots \), the connectives \(\lnot \) and \(\wedge \), and the quantifier \(\exists \) (\(\rightarrow \), \(\vee \), \(\leftrightarrow \) and \(\forall \) are meta-abbreviations). We use \(\alpha _1, \ldots \alpha _m, \ldots \) as meta-variables for variables or constants. Where \(P_n\) has arity m, a well-formed formula is defined inductively, as: \(P_n(\alpha _1, \ldots , \alpha _m) \mid \alpha _m = \alpha _n \mid \lnot \phi \mid \phi \wedge \psi \mid \exists v \phi \).

To interpret our language we start with a universe. A universe U is any set. To provide a suitable notion of logical validity for which a common calculus for first-order logic is complete, U needs to be at least countably infinite.Footnote 11 Given a suitably large universe U, we define an interpretation I based on U as a triple \(\langle U, c_I, \llbracket \rrbracket _I \rangle \). \(c_I\) is any subset of U and \(\llbracket \rrbracket _I\) is a function defined as follows:

$$\begin{aligned}&\text{ For } \text{ any } P \text{ or } \text{ arity } n, \llbracket P \rrbracket _I \text{ is } \text{ a } \text{ set } \text{ of } n\text{-tuples } \text{ from } U \end{aligned}$$
(a)
$$\begin{aligned}&\hbox { For any}\ a,\llbracket a \rrbracket _I\in U \end{aligned}$$
(b)

We are identifying a context with the set of things relevant at that context because that is all we need contexts to specify. \(c_I\) might not be U: when I say that everything is packed, all and only the objects in my backpack are relevant, but there are other irrelevant things in the universe which are not inside the backpack, like me and the backpack.

Where \(\mathfrak {a}\) is a variable assignment (i.e. a function from the set of variables to U), x is any element of U and v and w are variables, define \(\mathfrak {a}^x_v\) as the result of switching to x the value of v:

$$\begin{aligned} \mathfrak {a}^x_v(w) = \left\{ \begin{array}{ll} x &{} \text{ if } w=v, \\ \mathfrak {a}(w) &{} \text{ otherwise. } \end{array} \right. \end{aligned}$$

We extend \(\llbracket \rrbracket \) as follows: \(\llbracket \alpha \rrbracket _{I, \mathfrak {a}}=\mathfrak {a}(\alpha )\) if \(\alpha \) is a variable, \(\llbracket \alpha \rrbracket _{I,a} =\llbracket \alpha \rrbracket _{I}\) if it is a constant.

We define a valuation function V from all formulae, variable assignments based on U and interpretations I based on U to \(\{1, 0\}\) inductively as the unique function that satisfies the following conditions:

$$\begin{aligned}&V(P(\alpha _1, \ldots , \alpha _n), I, \mathfrak {a})= 1 \quad \text{ iff } \langle \llbracket \alpha _1 \rrbracket _{I, \mathfrak {a}}, \ldots ,\llbracket \alpha _n \rrbracket _{I, \mathfrak {a}} \rangle \in \llbracket P \rrbracket _I \end{aligned}$$
(V1)
$$\begin{aligned}&V(\alpha _n = \alpha _m, I, \mathfrak {a}) = 1 \quad \text{ iff } \llbracket \alpha _n \rrbracket _{I, \mathfrak {a}} = \llbracket \alpha _m \rrbracket _{I, \mathfrak {a}} \end{aligned}$$
(V2)
$$\begin{aligned}&V(\lnot \phi , I, \mathfrak {a}) = 1 \quad \text{ iff } V(\phi , I, \mathfrak {a})=0 \end{aligned}$$
(V3)
$$\begin{aligned}&V(\phi \wedge \psi , I, \mathfrak {a})=1 \quad \text{ iff } V(\phi , I, \mathfrak {a})=V(\psi , I, \mathfrak {a})=1 \end{aligned}$$
(V4)
$$\begin{aligned}&V(\exists v \phi , I, \mathfrak {a})=1 \quad \text{ iff } \text{ for } \text{ some } x \in c_I\, V(\phi , I, \mathfrak {a}^x_v)=1 \end{aligned}$$
(V5)

We obtain the usual models (of free logic) restricted to a given set U. To obtain all the familiar models of free logic, we would need U to be the universe of sets: we can use proper-class theory to achieve this.

I now show how we can interpret V5 as deriving the truth-value of a quantified formula explicitly from the character of the quantifier, as the context-sensitive solution demands. The character of \(\exists \) given a universe U (written \(\llbracket \exists \rrbracket _U\)) is a function that takes each element \(c_I\in \wp U\) to the content of \(\exists \) in I. The content of \(\exists \) in I is the set of \(x \in \wp U\) that have an element in common with \(c_I\):

$$\begin{aligned} \llbracket \exists \rrbracket _U(c_I) = \{x\in \wp U: x\cap c_I \ne \emptyset \} \end{aligned}$$

(V5) is equivalent to the following clause:

$$\begin{aligned} V(\exists v \phi , I, \mathfrak {a})=1\ldots \text{ iff } \{x: V(\phi , I, \mathfrak {a}^x_v)=1\}\in \llbracket \exists \rrbracket _U(c_I) \end{aligned}$$
(V5*)

Note that (V5*) makes sense only because, when we apply it to any \(\exists v \phi \), we have already inductively defined the set of true and false sentences under any I and \(\mathfrak {a}\), for any \(\psi \) of complexity less than \(\exists v \phi \); one such \(\psi \) is \(\phi \).

4.3 Free logic

Logical validity is not defined by quantifying across frames with different universes because there is no ground for using different universes in an interpretational theory, but only different contexts. Rather, we fix one universe U, which models the actual universe. Then, where I ranges over all interpretations based on U and \(\mathfrak {a}\) over assignments based on U, logical validity \(\models \) is defined as follows:

figure a

The resulting logic is an inclusive free logic. Free logic is the most natural logic for IR when we pair it with my context-sensitive proposal. Existential generalisation and universal instantiation both fail. Adapting an example from Gauker (1997), suppose that in a context c all that is relevant are some wooden figurines on a table. Under c, ‘Everything is made of wood’ is true even though ‘Socrates is made of wood’ is false. Assuming compositionality, the meaning of a sentence is determined by the meaning of its constituents. The meaning of ‘Socrates is human’ is determined by the meaning of ‘Socrates’ and ‘is human’, neither of which are context-sensitive, so ‘Socrates is human’ cannot be context-sensitive, either. Thus, when we evaluate it under an interpretation that specifies a context c, c is inert.Footnote 12 I find a piece of paper on the ground, it says: ‘Socrates is human’. Modulo the assumption that the author intended to write in English, under any context the sentence means that Socrates is human, which is something true regardless. Existential generalisation fails, as well: ‘Socrates is human’ is true in an interpretation I where everything relevant is in my closet, even though—I swear—nothing is human in my closet, so ‘Something is human’ is false in I.

One might claim that ‘Socrates is human’ cannot be true because, if uttered, Socrates would become relevant. Yet something can be true even when it cannot be uttered in any given context. ‘This sentence is not uttered’ is such an example. Paraphrasing Lewis (1996), truths can be ‘elusive’, sometimes they can be only expressed sottovoce. Moreover, it is not clear that if we utter sentences about x in a context, we make x relevant. While packing I can truthfully say: ‘I have everything in my backpack’. Clearly, everything relevant in that context is in the backpack, yet I am not inside the backpack, nor is my backpack inside itself. So, even though ‘I have everything in my backpack’ is true in any interpretation that specifies a context where all that is relevant is in my backpack, ‘Everything is inside something’ is false, so interpreted. In my semantics we would say that ‘I’ and ‘backpack’ are part of the universe U, even though they are not part of what is relevant at the context \(c_I\).

The logic we obtain from the previous section is also ‘inclusive’ because there is an empty context where nothing is relevant. Under any interpretation that specifies that context, \(\exists x \phi \) is false and \(\forall x \phi \) is true, for any \(\phi \). The system can easily be changed if we think that in any context something is relevant, by setting \(c_I\) to be non-empty, for any I. As I specify in Sect. 5.3, we need to discard the empty context if we wish to recapture classical logic.

4.4 Advantages of the context-sensitive solution

I discuss in 5.2 how quantifiers are still logical expressions albeit context-sensitive. Because we are not relying on a shift in what counts as a logical expression, my proposal has none of the shortcomings of the third strategy discussed above. My solution also works when we simply treat quantifiers as logical in the scope of a logic of quantification, similarly to how we treat the knowledge operator as logical in the scope of epistemic logic.

Contrary to Etchemendy’s first proposal, the fact that in no interpretation does ‘everything’ have the content of ‘some dog’ is explained: we must keep the character fixed. Contrary to his second proposal, the fact that in an interpretation ‘Something is F’ has the content of ‘Something is U and F’ is also explained: U is what is relevant in the context of that interpretation. However, we are not interpreting ‘something’ as ‘some U’: rather, the content of ‘something’ coincides with that of ‘some U’ under this interpretation, but the two still differ in meaning, in the same way that ‘I’ and the name ‘Caesar’ do in a context where Caesar is the subject.

Hanson’s counterexample to CN can be accommodated by setting ‘being the Washington Monument or the White House’ as the relevance condition for a context. The reasoning, however, is quite different: we are not imagining a possible situation where there are only two things, and ask what would be true ‘in it’. Rather, we claim that, even after a linguistic interpretation is specified, we still do not know the content of ‘There are at least three things’, so we cannot say whether the sentence is true or not. For that we need a context; when we consider what the sentence means in a context where all that is salient are the Washington Monument and the White House, we realise that, so interpreted, the sentence is simply false. Since we can interpret Hanson’s additional representational cases as interpretations equipped with a context, there was no need to abandon the interpretational theory in the first place.

Contrary to the second strategy, there is no need to bite the bullet and argue that CN instances are logical truths after all, because we do have interpretations that falsify them. We can keep a traditional notion of logical validity as formal and analytically truth-preserving without departing from the interpretational account.

The context-sensitive version of IR also confirms both the intuitions that CN instances are in some sense contingent and in some sense as necessary as mathematics. In the context of me talking about what is in my closet, ‘There are n things’ expresses something contingent because there could have been nothing in my closet. In the context of a mathematical theory, however, ‘There are n things’ expresses something whose content is the same as a necessary truth of mathematics. Yet, Regardless of what instances of CN mean in a context, there is always a strict entailment between C31 and C30.Footnote 13

In the context-sensitive version of IR we are able to make sense of the asymmetry in information between C30 and C31. To interpret these sentences, one needs a context; regardless of the context given, if C31 is true so interpreted, so is C30. By shifting context we change the information these sentences convey: in a context where dogs are all that is salient, C30 conveys that there are 30 dogs and C31 that there are 31 dogs; in any case, the piece of information conveyed by C31 contains that conveyed by C30. If we frame asymmetry in terms of analytic containment, we would say that, in every context, C31 guarantees the truth of C30 in virtue of what the terms mean in that context (their content), even though C30 and C31 say different things in different contexts. If we frame it in terms of warrant, we would say that we are warranted to infer C30 from C31 in every context because of what the sentences mean in that context; however, the reverse does not hold, nor are we guaranteed to infer C30 from scratch, regardless of what it means in a context because we do not know whether 30 things are always relevant to any context.

5 Ramifications and objections

5.1 Meta-context and generality

Consider the following revenge-like cardinality problem. Some claim that there is an absolutely general context (Williamson, 2003). The context-sensitive theory of interpretations is consistent with this view: that there is a most general context does not imply that the most general context is the only context. Suppose that a friend of absolute generality adds to the language a quantifier which is explicitly absolutely general. Arguably, the resulting expression will not be context-sensitive because in any context it ‘points back’ at the most general context. Suppose we substitute such a quantifier to ‘there are’ in C30, obtaining the resulting sentence:

$$\begin{aligned} \text{ There } \text{(absolutely) } \text{ are } \text{ at } \text{ least } 30 \text{ things. } \end{aligned}$$
(A30)

A30 is a truth of the logic of the absolutely general quantifier iff it is true in any (linguistic) interpretation and context, iff it is true in all interpretations that specify the most general context, iff it is true in the most general context. It is true in the most general context, so it is a truth of the logic of the absolutely general quantifier. The same reasoning applies to A(n), for any n.

It is not clear, however, that A30 is subject to the same issues C30 had. A30 does not mean that there are 30 things, it means that there absolutely are 30 things. Suppose we equip the language with an arithmetical quantifier ‘there is in arithmetic’, and suppose we treat it as a logical expression. We should expect ‘There are in arithmetic 30 things’ to be a logical truth of the logic of the arithmetical quantifier. Similarly, we should expect A30 to be a logical truth of the logic of an explicitly absolutely general quantifier.

A representationalist might complain, claiming that A30 is still false in a world where my pen and my notebook are all there is. In this case, the interpretationalist should bite the bullet. When said in a context where the socks in my closet are all that is relevant, ‘There are at least 30 things’ says something contingent: there could have been nothing in the closet. On the other hand, A30 is as contingent as mathematics, and thus cannot have been false in any metaphysically possible world. The asymmetry problem also does not apply because we do not need a context to pin down what A30 means: it always means that there absolutely are 30 things, which is something I know simply by knowing what absolute quantification means. Similarly, to understand what ‘There are in arithmetic 30 things’ means, I need to know what ‘in arithmetic’ means, and if I know that, I must know that there are 30 things in arithmetic. Since I can always prove that there are 30 things in the most general context, I am always warranted to infer the truth of A30, under any interpretation. So, I am also warranted to infer any instance of A(n) from any other, as well.Footnote 14

5.2 Logicality in context

One might doubt that quantifiers can be logical albeit context-sensitive. The discussion on logicality for context-sensitive expressions is perhaps not explicitly well-developed in the literature, but it fits nicely within the already existing literature on logicality. In this section, I focus on invariance theories. Invariance theories of logicality claim that an expression is logical when its meaning is invariant under arbitrary transformations of the universe, usually bijections (Bonnay, 2014).

That universal and existential quantifiers come out logical despite context-sensitivity should not surprise us. What might trigger non-logicality, if anything, would be the fact that they change content across contexts. Yet, in standard semantic systems there already are different models with different domains; clearly, the content of \(\forall \) in a model with domain D is not that of a model with domain \(D'\) where \(D\ne D'\). As Sher claims, ‘while non-logical terms are defined within models, logical terms are defined by fixed functions over models’ (1996, p. 675). We do not expect the content of the quantifier in a model to be invariant, but rather its characteristic fixed function over models (which is structurally similar to a character). For example, the operation across models that corresponds to \(\exists \) is the (class-sized) function that take any model \(\mathfrak {M}\) with domain x to the set of non-empty elements of \(\wp x\). This function is invariant under arbitrary bijections (Sher, 1991; McGee, 1996).

Albeit the discussion is scarce, some invariance theorists have defended the logicality of some contexts-sensitive expressions. For example, in his PhD thesis, MacFarlane develops a sophisticated account of logical expressions using a categorical semantics. In Sect. 6.3 he suggests that the type for objects O is context-sensitive, since ‘what counts as a thing in some stretch of discourse is determined largely by context and conversational convention’ (2000, p. 200). MacFarlane goes on to say that ‘we must add the notion of a context and replace the old notion of “semantic value” with two new notions, content and character’, where the character is a function from contexts to contents (2000, p. 201). He then redefines invariance as invariance of character. A similar account is developed by Woods (2017); Woods defines the character and the content of an expression within a possible world semantics, and proposes an invariance test which is similar to MacFarlane’s test. \(\forall \) and \(\exists \) are logical in both theories.

I now show how to adapt the invariance test of logicality to our semantic system. Given a universe U, consider the interpretations based on U and a bijection \(\pi \) between subsets of U. We define the transformation \(\pi ^+\) induced by \(\pi \) on the type-constructions from U as follows, where \(X^n\) is the n-ary Cartesian product of X and \(n^X\) is the n-ary power-set of X:

$$\begin{aligned}&\pi ^+(x)=\pi (x) \text{ if } x \in U \\&\pi ^+(\langle x_1, \ldots , x_n \rangle )=\langle \pi ^+(x_1), \ldots , \pi ^+(x_n) \rangle \\&\forall n\forall m \text{ if } x\in m^{U^n}\quad \text{ then } \pi ^+(x)= \{y: \exists z (z\in x \wedge y=\pi ^+(z))\} \end{aligned}$$

We define invariance similarly to MacFarlane (2000, p. 204):

Definition

(Invariance) An expression x of L is bijection-invariant in U exactly when, for all \(\pi \) between subsets of U, if \(\pi ^+(c_I)=c_H\), then \(\pi ^+(\llbracket x \rrbracket _{I})=\llbracket x \rrbracket _H\)

One can readily check that \(\exists \) and \(\forall \) satisfy the test. In fact, if we apply the test to characters, we have the stronger invariance result \(\pi ^+(\llbracket \exists \rrbracket _U)=\llbracket \exists \rrbracket _U\), for any \(\pi \). If we wanted, we could strengthen the test by looking at bijections across different universes, like Woods does. This would make the result less contingent upon the initial choice of U. Similar results would be obtained in the strengthened theory.

5.3 Classical recapture

I believe that free logic is the most natural choice in view of an interpretational account of logical validity and the context-sensitivity of quantifiers. However, in this section I claim that one can recapture classical logic from within my system, contrary to what the literature argues about IR.

The fact that IR is unable to validate classical logic is one of Etchemendy’s strongest complaints against it, and the fact that RE naturally fits classical logic is for him one of its great advantages over IR. Etchemendy notes that in (classical) model-theory we usually restrict the interpretation of the expressions inside the range of the quantifiers in each model. This holistic fitting procedure makes sense in representational terms: if a model is like a picture of a possibility, then all the expressions of the language ought to be ‘squeezed’ into the picture (1990, p. 66). To achieve the same result in interpretational terms, however, one would need to ban all spurious interpretations where the value of an expression outstrips the range of quantifiers. Yet, these ‘cross-term restrictions’ are hard to make sense of, since the interpretations banned are perfectly legitimate if not for making some rules or axioms invalid, which puts the cart before the horse.

No cross-term restrictions should be adopted in my proposal because, under a context where everything relevant is in on the table, we are clearly not reinterpreting the whole language so that everything fits on the table. ‘Socrates’ clearly still refers to Socrates, regardless. Yet, we can make sense of the classical ‘holistic fitting’ without resorting to cross-term restrictions, by taking a particular stance on how contexts affect the truth-value of sentences without quantifiers. We can insist that contexts act even on sentences that do not contain quantifiers, by proposing a pervasive context-sensitivity of truth, which is not triggered only by the presence of quantification. If Socrates is not relevant at a context, ‘Socrates is human’ is not true in any interpretation that specifies that context. A common option is to make the formula not false, either: a multi-valued logic comes natural.

Gauker (1997) develops a system of validity with these characteristics under the name ‘Kaplanesque three-valued free logic’. In such a logic there are no tautologies, since any object is irrelevant at some context, so any formula about that object is neither true nor false, when interpreted under that context.Footnote 15 As explained by Gauker, the rule of universal instantiation is invalid because the fact that everything relevant at a context is \(\phi \) does not imply that Socrates \(\phi \)s, as Socrates might not be relevant at that context. On the other hand, if Socrates does satisfy some condition under a context, then he must be relevant at that context, so something relevant at it satisfies that condition: the rule of existential generalisation is valid (1997, p. 196). Contraposition fails, modulo the equivalence between \({\exists x \phi }\) and \({\lnot \forall x \lnot \phi }\). Since there are no tautologies, the deduction theorem also fails. Similarly, proofs by contradictions are invalid: \({\phi \wedge \lnot \phi }\) implies any formula and its negation, since it is not true under any interpretation; yet, \({\lnot (\phi \wedge \lnot \phi )}\) is not logically true.

What should we make of such a massive departure from classical logic? Gauker suggests that we should redefine logical validity by replacing truth and falsity in a context with assertibility and deniability in a context, respectively. If Socrates is not something relevant at the context at hand, then nothing can be asserted about him, nor can it be denied, for to deny something is to assert its negation. This is quite a conceptual shift, which Gauker acknowledges as such. However, we do not need to follow Gauker, as the semantics does not force us into Kleene validity. Instead, we can use Gauker’s semantics to recapture classical logic within IR. Even if we insist that any sentence about Socrates is neither true nor false in a context where he is not relevant, we still have different choices on how to define validity. Sure, bivalence fails, but failure of bivalence does not by itself lead to Kleene logic, nor in general to non-classical logic (Rumfitt, 2015). One way to recapture classical logic is to distinguish between a strong and a weak counterexample to the validity of a formula. A strong counterexample is given by an interpretation where the formula is false, a weak counterexample is given by an interpretation where the formula is not true. Absence of any counterexample results in Kleene validity, absence of only strong counterexamples results in classical validity. As I show in the appendix, classical validity is recaptured as follows: \({\phi }\) (classically) logically follows from \(\Gamma \) exactly when, under all interpretations and assignments based on U, if each member of \(\Gamma \) is true, \({\phi }\) is not false. The result works only if we further assume that the set of things relevant at a context is never empty, something which we quite left open so far. One could argue for it by insisting that in any context we must be talking about something, so something must always be relevant.

6 Conclusion

I argued that ‘There are n things’ is not logically true because, while from ‘There are n things’ we can logically infer ‘There are \(n-1\) things’, from ‘There are \(n-1\) things’ we cannot logically infer ‘There are n things’; yet, we should be able to infer ‘There are n things’ from any premise if it was a logical truth. An interpretational theory of validity can account for this ‘informational asymmetry’ by making quantifiers context sensitive. I argued that my proposal is superior to the ones present in the literature: it does not require us to abandon the simplicity of the interpretational theory, nor does it rely on a shift in the choice of logical notions, nor does it force us to depart from a traditional notion of logical validity as formally and analytically truth-preserving. Quantifiers are still logical notions despite being context-sensitive. Finally, I showed how, while an inclusive free logic is a natural choice for the context-sensitive version of IR, we can also recapture classical logic within IR by adopting a Kleene semantics and a non-standard notion of validity.