1 Introduction

According to one influential view of the sorites paradox, the tolerance principle—the constraint whereby if someone is tall, for example, then someone whose height is imperceptibly shorter is tall too—is an unsound rule of reasoning (see Williamson 1994). In combination with further premises that look less controversial, it leads to contradiction. We therefore ought to reject it, and not to rely on it when reasoning with vague predicates.

There are reasons to think of such a view as too drastic and as missing out on the role that such a principle plays in categorization and in ordinary judgmental and inferential practice (see van Rooij 2011b; Égré 2015). Taking a different perspective, the tolerance principle ought not to be discarded that fast, even when viewed normatively. Instead, it corresponds to what some authors have called a soft constraint (see Maudlin 2008), or a default (Cumming 2014), namely a rule that we can use legitimately in reasoning, but that cannot be used without some care.

One way to represent the tolerance principle is as a certain conditional sentence, of the form: \(Pa~\wedge ~a~\sim _{P}~b~\rightarrow ~Pb\) (where \(a\sim _{P}b\) means that a and b are similar in P-relevant respects, e.g. having similar heights relative to “tall”). Another way is as an argument, of the form: \(Pa, a\sim _{P}b \vdash Pb\). Let us consider the conditional version first. Various options have been proposed to deal with the tolerance conditional as a soft constraint. In standard fuzzy logic, for example, the idea is to think of that conditional as not perfectly true, but close-to-perfectly-true. Let us assume the conditional to be Łukasiewicz’s conditional (see Goguen 1969), and call a sentence close-to-perfectly-true provided its degree of truth is 0.9. Given an appropriate sorites sequence (for example a series of individuals from clearly short to clearly tall, whose adjacent members differ imperceptibly in height), it will be possible to have: \(Pa_{1} \rightarrow Pa_{2}\) close-to-perfectly-true, \(Pa_{2} \rightarrow Pa_{3}\) close-to-perfectly-true, without having \(Pa_{1}\rightarrow Pa_{3}\) close-to-perfectly-true. That is, each tolerance conditional is true to degree at least 0.9, but the resulting conditional itself may end up with a degree of truth less than 0.9. If validity is defined as the preservation of perfect truth (degree 1) from premises to conclusions, modus ponens remains a valid rule, but the tolerance principle, though close-to-perfectly-true, is not true enough to produce a sound argument, in the sense of a valid argument based on perfectly true premises. Another possibility therefore is to define validity as the preservation of close-to-perfect-truth, for instance having a degree at least 0.9. In that case the tolerance principle can be sound in the sense of being close-to-perfectly-true, but modus ponens is no longer valid, and the conditional is no longer transitive (Williamson 1994, p. 124).Footnote 1

Further options for the definition of validity exist. Smith (2008) proposes that an argument is valid iff when all premises are strictly greater than 0.5, the conclusion is greater or equal to 0.5. The option is very similar to the one we pursue below. In particular it relies on a standard conditional connective, and yields ordinary classical logic.Footnote 2

There is also the kind of definition of validity suggested in Edgington (1997). On this approach, an argument is valid iff the conclusion can never fall as far short of perfect truth as the sum of the extents to which the premises fall short of perfect truth. Edgington does not use Łukasiewicz’s truth-degree conditions for her connectives, and so arrives at classical logic this way. But using Łukasiewicz’s truth conditions, this would again result in a distinct logic, one that validates modus ponens but is noncontractive. We discuss noncontractive approaches to vagueness in Sect. 2.3.

An alternative to the fuzzy strategy is to view the tolerance conditional as expressing a defeasible rule. Say that \(Pa \wedge a\sim _{P}b \rightarrow Pb\) is true provided Pb is true in all \((Pa \wedge a\sim _{P}b\))-normal worlds [see Hu 2015; based on Asher and Morreau 1991]. Call a world \((\phi \wedge a\sim _{b}P)\)-normal if a is P-similar to b but is not close to a borderline case of P. From \(Pa\wedge a\sim _{P}b, Pa \wedge a\sim _{P}b \rightarrow Pb\), it need not follow that Pb, since a world may satisfy \(Pa\wedge a\sim _{P}b\) without being \((Pa\wedge a\sim _{P}b)\)-normal, precisely when b is a borderline case of P. Again, this conditional fails to satisfy modus ponens, and it is also nontransitive. But moreover it is nonmonotone, since a world that is \(Pa\wedge a\sim _{P}b\)-normal need not be \(Pa\wedge Pc\wedge a\sim _{P}b\)-normal. On that view, the tolerance conditional represents a defeasible rule, usable except in cases in which the main premise itself does not correspond to a normal world.

Both the fuzzy approach and the nonmonotonic approach have some appeal. On some understandings of fuzzy consequence, the sorites paradox is solved by saying that modus ponens is not valid. On the nonmonotonic approach, the sorites paradox is solved by saying that modus ponens is a defeasible rule: premises in sorites arguments are assertible, but the argument is not undefeasibly valid. However, the nonmonotonic approach faces a limitation, which concerns the treatment of the tolerance principle in terms of a special conditional connective. As is well known, a sorites argument can be stated using only conjunction and negation, by saying that it is not the case that there are two cases a and b that are very similar, but are such that Pa and not Pb. But a nonmonotonic treatment of the conditional does not tell us how to address that alternative version of the sorites. The situation in fuzzy logic is better. Relative to Łukasiewicz’s conditional, it is possible to define a strong conjunction & such that the degree of truth of \((A\rightarrow B)\) is always equal to the degree of truth of \( \lnot (A \& \lnot B)\).Footnote 3 However, the Łukasiewicz conditional loses some features that one might wish to retain for a conditional, such as contraction. Relative to the loss of modus ponens, this may not stand as a significant cost, but we would favor an approach that preserves more of the regular features of the standard conditional.

In this paper therefore, we are interested in accounts of vagueness that, instead of relying on a special conditional connective, propose to capture the tolerance principle by imposing specific constraints on the consequence relation, in a way that leaves intuitively desirable properties of a conditional connective in place, and in a way suitable to deal with the sorites argument in its conjunctive form as well as its conditional one. More specifically, we compare and discuss various structural choices that can be made on a consequence relation in order to make the tolerance principle valid, when represented not merely as a conditional sentence, but also as a rule, namely as an argument of form: \(Pa, a\sim _{P}b \vdash Pb\). Our main focus is on two structural approaches which mirror the nontransitive and nonmonotone conditional to some extent, but shift those properties up one level, namely to the consequence relation. The first is the nontransitive treatment of logical consequence favored in our past work, on which the principle of tolerance comes out as valid in rule form, but cannot be iterated without risk.Footnote 4 The second is the nonmonotone treatment of logical consequence, on which the principle of tolerance too can come out as valid, but in a way that is sensitive to context and to the addition of further premises (soft consequence as defeasible consequence) (see Misiuna 2010; Hu 2015).

This paper compares three notions of entailment, starting with strict-to-tolerant entailment (see Cobreros et al. 2012), which links the tolerance principle to nontransitive reasoning. We discuss two elaborations of st-entailment, based on the idea of pragmatic strengthening. Those two variants differ in that they both treat the tolerance principle via nonmonotonic reasoning. The first of those variants combines nontransitivity and nonmonotonicity, whereas the second, though nonmonotonic, remains fully transitive. Our broader agenda is to compare those three notions, and to discuss their relative merits and limitations.

In Sect. 2, we start off with an overview of various structural choices that can be made on a consequence relation in order to block the sorites paradox, that is we survey several nontransitive, nonmonotonic, and noncontractive approaches. In Sect. 3, we focus on the nontransitive approach favored in our past work on strict–tolerant validity, and highlight some motivations for the definition of a generalized notion of pragmatic entailment on that basis, of which two variants are considered, which we call Prt-entailment and PrPr-entailment. In Sect. 4, finally, we compare the three systems under discussion, and draw more general lessons for the modeling of reasoning with vague predicates.

2 Transitivity, monotonicity, contraction

In this section we consider three structural properties of particular interest in relation to soritical reasoning, namely transitivity, monotonicity, and contraction. Although noncontractive approaches will not play a role in our subsequent inquiry, they are worth considering too, in particular because of some analogies with nontransitive treatments.

2.1 Transitivity

Here, we will call a consequence relation \(\vdash \) ‘transitive’ iff whenever \(\Gamma \vdash \phi , \Delta \) and \(\Gamma ', \phi \vdash \Delta '\), then \(\Gamma , \Gamma ' \vdash \Delta , \Delta '\).Footnote 5 (We assume that \(\Gamma \) and \(\Gamma '\) are read conjunctively, while \(\Delta \) and \(\Delta '\) are read disjunctively.) Many consequence relations obey this condition (indeed, sometimes this is required as part of how ‘consequence relation’ is understood!); but in recent years, there has been increasing interest in systems that do not. (For examples, see Frankowski 2004; Hallnäs 1991; Hallnäs and Schroeder-Heister 1991; Tennant 1987; Weir 1998; Zardini 2008. Of these, only Zardini 2008 focus on vagueness.) We have ourselves explored several nontransitive systems in our past work, for example in Cobreros et al. (2012, 2013, 2015a); here, we focus on the kind of nontransitive approach to vagueness we recommend.Footnote 6

On this approach, tolerance is not merely a formula that may occur or not in an argument; its effects are internalized in the consequence relation itself. In the system st of Cobreros et al. (2012), for example, \(Pa, a \sim _P b \vdash Pb\). Note that no statement of tolerance occurs in this argument, and yet st validates it: the argument itself is a statement of tolerance. On this approach, the claim that a is P, together with the claim that a is P-similar to b, directly entails, without need for any further claims, the claim that b is P. We are most interested in systems with this feature, whatever their structural properties, and we will aim to internalise tolerance into the consequence relations we explore throughout the paper.

If we combine this kind of principle with transitivity, we can reason as follows:

figure a

In this way, we can reach \(Pa_0, a_0 \sim _P a_1, \ldots , a_{n-1} \sim _P a_n \vdash Pa_n\) for any n. Call this ‘n-fold tolerance’. n-fold tolerance seems to be a problem: the similarity premises here are exactly the setup for a sorites sequence for P, and this tells us that they entail, together with the claim that the first member of the sequence is P, that the last member is also P. That, of course, looks much like falling victim to the sorites paradox.

In a moment, we’ll see that there are ways one might choose to live with n-fold tolerance, ways that do not fall victim to the paradox—but for now, let’s assume it’s to be avoided. The only ingredient here is transitivity itself; once tolerant reasoning is internalised in this way, there is no place else to push back on the above reasoning. One possible response, then, is to reject transitivity, to say that each individual step of soritical reasoning is indeed valid, but that chaining these valid steps together does not preserve their validity.Footnote 7

This kind of approach allows appeal to tolerance, as internalised in the consequence relation, once but not more than once. This, then, is one way to understand tolerance’s soft status; it can be appealed to freely, but we cannot combine the results of multiple appeals.

So much for internalised tolerance; what about formula-level statements of tolerance, like \(\forall x \forall y ((Px \wedge x \sim _P y) \rightarrow Py)\)? If tolerance is really internalised in the consequence relation itself, you might expect such formulas to be theorems of the consequence relation: valid consequences of the empty set of premises. Indeed, this is exactly what happens in the systems we have explored elsewhere.

But you might also expect these tolerance formulas to be suppressible: to be such that, when an argument with a tolerance formula among its premises is valid, then the corresponding argument without that tolerance formula among its premises is also valid.Footnote 8 When a consequence relation is transitive, in the sense we’ve given here, all of its theorems are suppressible, since the move from \(\vdash A\) and \(\Gamma , A \vdash \Delta \) to \(\Gamma \vdash \Delta \) is just an instance of transitivity; but for nontransitive relations this need not be the case. (For the converse direction, it is reflexivity that matters; so long as each formula entails itself, suppressible formulas are theorems, since we can move from \(A \vdash A\) to \(\vdash A\) simply by suppressing A. We do not consider nonreflexive consequence relations here.)

In fact, tolerance formulas are not suppressible in the systems we have explored elsewhere; although tolerance is internalised in these consequence relations, still including a formula-level statement of tolerance as a premise can have an effect. This gives another way to understand tolerance’s soft status. A consequence relation, at least when it is nontransitive, can give two kinds of endorsement to a formula: the weaker endorsement of theoremhood and the stronger one of suppressibility. Tolerance formulas receive the weaker endorsement but not the stronger; they are given, but not completely.

2.2 Monotonicity

A consequence relation \(\vdash \) is monotonic iff whenever \(\Gamma \vdash \Delta \), then \(\Gamma , \Gamma ' \vdash \Delta , \Delta '\); that is, if adding premises and/or conclusions to a valid argument can never result in an invalid argument. Nonmonotonic logics provide one of the most usual ways to understand defeasible reasoning, since they allow for validity to depend on the absence of certain information, and so to be defeated when such information arises. A standard example for this type of reasoning is the following: if a timetable does not mention any direct flight from A to B, we routinely assume that no such flight exists, which, in turn, allows the planner to represent this information simply by not listing such a flight. This type of pragmatic reasoning is called in AI ‘Reasoning as Failure’ and is made use of in various nonmonotonic logics (e.g. McCarty’s Circumscription and various variants of Logic Programming). It has been used as well in linguistics to account, among others, for the progressive (Asher 1992), generics (Asher and Morreau 1995), discourse structure (Asher and Lascarides 2003), several types of Gricean conversational implicatures (Schulz and van Rooij 2006), and even swear words (McCready 2012) (see Thomason 2011; van Rooij and Schulz 2011, for overview). The Sorites paradox is not standardly approached from a pragmatic point of view. It is perhaps not surprising, then, that there are not so many approaches to the sorites paradox based on nonmonotonic logics. However, there are some.

Approaches to tolerance that turn on rejecting monotonicity can be found in Hu (2015) and Misiuna (2010).Footnote 9 Both Misiuna and Hu offer pictures involving new connectives to play the role of the conditional in tolerance principles, and so run afoul of the argument we’ve offered above involving conditional-free sorites.Footnote 10 However, the way in which nonmonotonicity is involved can be maintained without a special conditional.

The basic idea behind these nonmonotonic approaches is to validate reasoning according to tolerance, but only when no countervailing information is available. That is, given only the premises Pa and \(a \sim _P b\), we can validily conclude Pb. Indeed, given the premises \(Pa_0\), \(a_0 \sim _P a_1\), \(a_1 \sim _P a_2\), ..., \(a_{n - 1} \sim _P a_n\), one may validly conclude \(Pa_n\); these approaches validate n-fold tolerance.Footnote 11 However, in the presence of certain extra information—for example, the additional premises \(a_n \sim _P a_{n+1}\) and \(\lnot Pa_{n+1}\)—this conclusion would not validly follow. As Hu (2015) puts the point, “sorites reasoning is...not...inherently defective or never worth utilizing, but...unproblematic in many instances, so long as it is not carried out too far down a sorites series”.

This, then, gives a way to accept the validity of n-fold tolerance. The problem, on a nonmonotonic view, is not with concluding \(Pa_n\) on the basis of such a chain of tolerant reasoning; the problem occurs when such a conclusion is maintained in the face of countervailing evidence. Such countervailing evidence, of course, is present when we confront a sorites series; the puzzle is created precisely because we know that the last things in the series don’t satisfy the relevant predicate. This is, then, the way in which we can understand tolerance’s soft status within the nonmonotonic approach: tolerance can be appealed to repeatedly, but it gives way in the face of a sorites series.

How does this kind of nonmonotonic approach compare to the kind of nontransitive approach we have discussed above? Both agree that tolerance can be appealed to at the beginning of a sorites series, but not farther down. However, they see this in different ways. On a nontransitive approach, tolerance gives out from exhaustion as we proceed down such a series; it simply cannot go that far. On a nonmonotonic approach, it gives out only when it meets opposition; on its own it can proceed arbitrarily far.

There is another difference worth noting as well. Whereas the nontransitive approach allows for drawing conclusions that cannot be drawn on as premises, nonmonotonic approaches typically do not; for the nonmonotonician, once something cannot be drawn on as a premise it must also be withdrawn as a conclusion.Footnote 12

2.3 Contraction

We will explore the third property—contraction—in less depth in the course of this paper, but we pause to present it here nonetheless, as it certainly provides another family of approaches worth considering. A consequence relation \(\vdash \) is contractive iff: whenever \(\Gamma , \phi , \phi \vdash \Delta \), then \(\Gamma , \phi \vdash \Delta \), and whenever \(\Gamma \vdash \phi , \phi , \Delta \), then \(\Gamma \vdash \phi , \Delta \). (Good introductions to a variety of noncontractive logics can be found in Restall 2000; Paoli 2002; Bimbó 2015.)

In noncontractive logics, the number of times a premise or conclusion is appealed to may matter for validity.Footnote 13 An argument that repeats a premise three times, say, may be valid, while a similar argument that only repeats the premise twice is not. This has been interpreted in a variety of ways; the way most relevant for our purposes is perhaps that of Slaney (2010).Footnote 14

On the approach Slaney presents, the crucial failures of contraction involve precisely the tolerance principle. Note that, in a sorites argument with the tolerance principle as a premise, this principle must be appealed to many times—once for each step in the sorites series. This, then, provides us another way to think of tolerance as a soft or defeasible principle: perhaps it is a principle we may appeal to only a limited number of times in any argument, a principle that holds once (or twice, or three times) but not an unlimited number of times.

Of course, to fully make sense of this we would need to have an understanding of what it is for something to hold only once (or twice, or three times). We will not attempt to make this idea precise here. The rough idea, however, is that in the absence of contraction, we can allow for principles like tolerance, which are meant to be defeasible, to be appealed to a limited number of times. When they are defeated, in a setting like this one, it is in a battle of attrition.

These noncontractive approaches share a certain flavour with nontransitive approaches: according to both, we have something that can be appealed to without any worry or caveat, but only a limited number of times. On nontransitive approaches, it is the consequence relation itself that is like this; on noncontractive approaches, it is the tolerance principle.

Uniquely among the three kinds of structural approach we consider, there does not seem to be a way for a noncontractive approach to internalise tolerance in the consequence relation itself, at least not if the sorites is to be addressed by failures of contraction. The reason is that noncontractive consequence relations can only prevent multiple uses of things that actually appear as premises or conclusions, but tolerance itself is the only thing appealed to repeatedly in a sorites argument. If this repeated appeal is to be blocked, then, tolerance cannot be internalised; it must appear explicitly. For this reason, we set noncontractive approaches to tolerance aside for the remainder of the paper.

3 From nontransitive to nonmonotonic

In previous work we showed how can we endorse tolerance as a soft constraint within the context of non-transitive entailment. This way of endorsing tolerance has the consequence, to which we pointed out above, of making a difference between the endorsement of a formula as a theorem and as a suppressible premise. In our non-transitive framework tolerance, as explicit formula, is endorsed in the former but not in the latter sense. But there are reasons for which one would like to endorse tolerance in a stronger sense; for example, to be able to properly assert tolerance in some contexts. In a relatively independent discussion, in response to objections raised in Alxatib et al. (2013), we developed a notion of pragmatic interpretation. In this section we show how this can be used to go from our original non-transitive account of tolerance to a non-monotonic endorsement of tolerance.

3.1 The logic st

Let \(\mathcal{M} = \langle \mathcal{D},\mathcal{I} \rangle \), with \(\mathcal{I}\) a total function from atomic sentences to \(\{0,\frac{1}{2}, 1\}\). This model extends to formulas according to the strong Kleene valuation scheme:Footnote 15

  • \(\mathcal{V}_{\mathcal{M}}(\phi ) \qquad \ \ = \quad \mathcal{I}_{\mathcal{M}}(\phi )\), if \(\phi \) is atomic.

  • \(\mathcal{V}_{\mathcal{M}}(\lnot \phi ) \qquad = \quad 1 - \mathcal{V}_{\mathcal{M}}(\phi )\).

  • \(\mathcal{V}_{\mathcal{M}}(\phi \wedge \psi ) \quad = \quad min\{\mathcal{V}_{\mathcal{M}}(\phi ), \mathcal{V}_{\mathcal{M}}(\psi )\}\).

  • \(\mathcal{V}_{\mathcal{M}}(\phi \vee \psi ) \quad \ = \quad max\{\mathcal{V}_{\mathcal{M}}(\phi ), \mathcal{V}_{\mathcal{M}}(\psi )\}\).

  • \(\mathcal{V}_{\mathcal{M}}(\forall x \phi ) \qquad = \quad min\{\mathcal{V}_{\mathcal{M}}([^{\underline{d}}/_x] \phi ): d \in D\}\).

We say that \(\phi \) is strictly true in \(\mathcal{M}\) iff \(\mathcal{V}_{\mathcal{M}}(\phi ) = 1\), and that \(\phi \) is tolerantly true in \(\mathcal{M}\) iff \(\mathcal{V}_{\mathcal{M}}(\phi ) \ge \frac{1}{2}\). In terms of this semantics we can define some well-known logics: Kleene’s K3 and Priest’s LP. Both logics understand entailment as preservation of truth in all models, the difference is that while for K3 truth it means strict truth, for LP it means tolerant truth:

$$\begin{aligned} \begin{array}{c} \Gamma \vDash _{K3}\Delta \hbox { just in case for all } \mathcal {M}: \\ if \forall \gamma \in \Gamma : \mathcal{V}_{\mathcal{M}}(\gamma ) = 1, \hbox { then } \exists \delta \in \Delta : \mathcal{V}_{\mathcal{M}}(\delta ) = 1.\\ \Gamma \vDash _{LP}\Delta \hbox { just in case for all } \mathcal {M}: \\ if \forall \gamma \in \Gamma : \mathcal{V}_{\mathcal{M}}(\gamma )> 0, \hbox { then } \exists \delta \in \Delta : \mathcal{V}_{\mathcal{M}}(\delta ) > 0. \end{array} \end{aligned}$$

A fundamental idea in Cobreros et al. (2012) to define entailment from strict to tolerant:

$$\begin{aligned} \begin{array}{c} \Gamma \vDash ^{st}\Delta \hbox { just in case for all } \mathcal {M}: \\ if \forall \gamma \in \Gamma : \mathcal{V}_{\mathcal{M}}(\gamma ) = 1, \hbox { then } \exists \delta \in \Delta : \mathcal{V}_{\mathcal{M}}(\delta ) > 0. \end{array} \end{aligned}$$

Thus, although we don’t give up the idea that entailment is truth-preserving, we allow the standard of assertion of the conclusions to be weaker than the standard of assertion of the premises. A surprising feature of this logic is that although the semantics makes use of three truth-values, the consequence relation is exactly the familiar consequence relation of classical logic. This fact contrasts sharply with K3 and LP, both of which give up a good amount of classically valid arguments.

Now, despite its classicality, this new semantics makes room for tolerance without falling victim to the sorites paradox. In order to account for tolerance, we extend the language with similarity relations, \(\sim _P\), one for each predicate P. There are various ways to interpret this relation that would work for our purposes. One interpretation is the following, taken up in Cobreros et al. (2015a):

  • \( \mathcal{V}_{\mathcal{M}}(a \sim _P b) = 1\) iff \(|\mathcal{V}_{\mathcal{M}}(Pa) - \mathcal{V}_{\mathcal{M}}(Pb)| < 1\), 0 otherwise.

The resulting logic \(st^{\sim }\) is a conservative extension of classical logic, in the sense that any classically valid argument in the old vocabulary remains valid. In addition, the tolerance formula (\(\forall x,y((Px \wedge x \sim _P y) \rightarrow Py)\)) becomes valid, as does the tolerance argument: \(Pa, a \sim _P b \vDash ^{st}_{\sim } Pb\).Footnote 16 The endorsement of tolerance does not lead to paradox, however, since tolerance in \(st^{\sim }\) leads to non-transitivity:

$$\begin{aligned} \begin{array}{c} Pa, a \sim _P b \vDash ^{st}_{\sim } Pb \hbox { and } Pb, b \sim _P c \vDash ^{st}_{\sim } Pc \\ \hbox { BUT }\\ Pa, a \sim _P b, b \sim _P c \not \vDash ^{st}_{\sim } Pc. \end{array} \end{aligned}$$

We felt, and still feel, that this is a very intuitive and appealing treatment of the sorites paradox. The treatment however, comes with the limitation that we should make a distinction about the diagnosis of the sorites paradox depending on its formulation. If we look at the sorites as a step-by-step argument based on the validity of tolerance inferences then, though each tolerance inference is valid, validity breaks when we try to chain these inferences. If we consider the sorites argument with the tolerance formula (\(\forall x,y((Px \wedge x \sim _P y) \rightarrow Py)\)) as an explicit premise, then although that formula is valid, the resulting argument is valid but unsound; it fails to establish its conclusion, since tolerance cannot be freely drawn on as a premise. In short, although the tolerance formula is valid, according to the logic \(st^\sim \) we are not in a position to draw on it as a premise without further ado. However, there are contexts in which we would like to assert the tolerance formula and go on to draw conclusions from it. That is, we would like to assert it strictly. But there are other assertions we might want to make that signal only that something can be validly concluded, not that it may be drawn on as a premise. In these cases, we want to assert tolerantly. That is, when we make an assertion we would like to interpret it sometimes tolerantly and sometimes strictly.

A number of recent experiments (e.g., Alxatib and Pelletier 2011; Ripley 2011; Égré et al. 2013; Egré and Zehr 2016) show that naive speakers find a logical contradiction like ‘John is tall and John is not tall’ acceptable in cases where John is a borderline tall man. This seems to show that we need to take account of tolerant truth, since tolerant truth exhibits this exact behaviour. However, just relying on the notion of tolerant truth would mean that the assertion ‘John is tall’ would be acceptable in the same situation. The same experimental evidence shows, however, that this is not the case: ‘John is tall’ is taken to be acceptable only if John is really tall. In terms of our three-valued models this could be modeled by saying that the assertion ‘John is tall’ is acceptable only if John is strictly tall. Similarly, Serchuk et al. (2011) found that classical tautologies like ‘\(Tj \vee \lnot Tj\)’ are generally not accepted if John is borderline tall. So making use not only of tolerant, but also of strict truth (which exhibits this latter behavior) seems required.

The conclusion we draw from the previous discussion, together with considerations raised in Cobreros et al. (2012, 2015a), is that we should interpret a sentence strictly if possible, and tolerantly otherwise. This interpretation strategy is in line with a usual strategy to account for scalar implicatures: where Grice (1967) assumed that from an assertion of the form ‘\(\phi \vee \psi \)’ one can conclude that it is not the case that \(\phi \wedge \psi \) is true, because otherwise the speaker would have said so, we can assume that from assertion ‘\(\phi \)’ one can conclude that \(\phi \wedge \lnot \phi \) is not true for a similar reason. The strategy to interpret strictly if possible, and tolerantly otherwise, has just this effect.

Unfortunately, this interpretation strategy taken at face value gives rise to trouble for more complex sentences. Alxatib et al. (2013) show that this wrongly predicts that a sentence like ‘Adam is tall and not tall, or John is rich’ entails that John is strictly rich, although it should not entail this and intuitively should mean that either Adam is borderline tall or John is strictly rich. In Cobreros et al. (2015b) we responded by providing a more sophisticated pragmatic interpretation rule to strengthen the meaning of a sentence, which we review below.Footnote 17

3.2 Pragmatic interpretation

3.2.1 Truth-makers

To account for this pragmatic strengthening we will make use of truth-makers. We propose that the pragmatic interpretation of \(\phi \) makes one exact truth-maker of \(\phi \) as true as possible. To determine what the truth-makers of a sentence are, we follow van Fraassen (1969).

We think of a state of affairs as a situation that might hold or might not hold. The state of a affairs that Socrates is wise holds iff the sentence ‘Socrates is wise’ is true. The state of a affairs Socrates is not Cretan holds if the sentence ‘Socrates is not Cretan’ is true.Footnote 18\(^{,}\) Footnote 19 A fact is any non-empty set of states of affairs. We call facts that contains only a single state of affairs, like \(\{\) Socrates is wise \(\}\), an atomic fact. This atomic fact is a truth-maker for the sentence ‘Socrates is wise’ but also the conjunctive fact \(\{\) Socrates is wise, Socrates is not Cretan \(\}\) is a truth-maker for that sentence.

More generally, let SOA be the set of all states of affairs. For each state of affairs \(\mathbf{p}\in SOA\) there is a corresponding complement \(\mathbf{\overline{p}} \in SOA\) for which it holds that \(\mathbf{\overline{\overline{p}}} = \mathbf{p}\). We assume for simplicity a close correspondence between atomic sentences of the language and states of affairs: there is exactly one state of affairs corresponding to each literal. The set of facts, \(\mathcal{F}\), is \(\wp (SOA) - \emptyset \): that is, a fact is a non-empty set of states of affairs. If \(\mathbf{p}, \mathbf{q} \in SOA\), then \(\{\mathbf{p}\}\) and \(\{\mathbf{q}\}\) are atomic facts, and \(\{\mathbf{p,q}\}\) is a conjunctive fact. A truth-maker for a sentence \(\phi \) (or the proposition expressed by that sentence) is a fact that makes \(\phi \) true. We follow van Fraassen further in saying that an atomic sentence might have more than one truth-maker. An atomic sentence p is not only made true by atomic fact \(\{\mathbf{p}\}\), but also by conjunctive fact \(\{\mathbf{p,q}\}\). The former is a more minimal truth-maker than the latter. More interestingly, disjunctive sentences might have several minimal truth-makers. The disjunction \(p \vee q\), for instance, has two minimal truth-makers: \(\{\mathbf{p}\}\) and \(\{\mathbf{q}\}\). What we are after, however, is the notion of what Fine (2014) calls the exact truth-makers for \(\phi \). We say that the disjunction \(p \vee (p \wedge q)\)—although it has only \(\{\mathbf{p}\}\) as its minimal truth-maker—has two exact truth-makers: \(\{\mathbf{p}\}\) and \(\{\mathbf{p,q}\}\). Following van Fraassen (1969), we give the following simultaneous recursive definition of the set of exact truth-makers of \(\phi \), \(T(\phi )\), and the set of exact falsity-makers, \(F(\phi )\):Footnote 20

\(T(p) = \{\{\mathbf{p}\}\}\)

\(F(p) = \{\{\mathbf{\overline{p}}\}\}\) for atomic p

\(T(\lnot \phi ) = F(\phi )\)

\(F(\lnot \phi ) = T(\phi )\)

\(T(\phi \wedge \psi ) = T(\phi ) \otimes T(\psi )\)

\(F(\phi \wedge \psi ) \ = F(\phi ) \cup F(\psi )\)

= \(\{X \cup Y| X \in T(\phi ), Y \in T(\psi )\}\)

 

\(T(\phi \vee \psi ) \ = T(\phi ) \cup T(\psi )\)

\(F(\phi \vee \psi ) \ = F(\phi ) \otimes F(\psi )\)

Notice that according to these rules, \(T(p) = \{\{\mathbf{p}\}\}\), \(T(\lnot p) = \{\{\mathbf{\overline{p}}\}\}\), \(T(p \vee q) = \{\{\mathbf{p}\}, \{\mathbf{q}\}\}\) and \(T(p \wedge q) = \{\{\mathbf{p}, \mathbf{q}\}\}\). We analyse conditionals like \(\phi \rightarrow \psi \) as material implication, that is \(p \rightarrow q \equiv \lnot p \vee q\) and \(\lnot (p \wedge q) \equiv \lnot p \vee \lnot q\), and thus \(T(p \rightarrow q) = \{\{\mathbf{\overline{p}}\}, \{\mathbf{q}\}\}\) and \(T(\lnot (p \wedge q)) = \{\{\mathbf{\overline{p}}\}, \{\mathbf{\overline{q}}\}\}\).

To account for quantifiers, we first have to spell out how we interpret atomic sentences with predicates and individual terms. To do so, we assume that for each n-place predicate P the model contains states of affairs like \(\mathbf{P d_1, \ldots d_n}\), with each \(\mathbf{d_i} \in D\) an individual. We assume for simplicity that each \(\mathbf{d} \in D\) has a unique name \(\underline{d}\) in the language.

  • \(T(P\underline{d_1},\ldots , \underline{d_n}) \ \ = \ \{\{\mathbf{P d_1, \ldots , d_n}\}\}\) \(F(P\underline{d_1},\ldots , \underline{d_n}) \ \ = \ \ \{\{\mathbf{\overline{\mathbf{P d_1, \ldots , d_n}}}\}\}\)

  • \(T(\forall x \phi ) \quad = \quad \bigotimes _{d \in D}T(\phi [^x/_{\underline{d}}])\)           \(F(\forall x \phi ) \quad = \quad \bigcup _{d \in D}F(\phi [^x/_{\underline{d}}])\)

  • \(T(\exists x \phi ) \quad = \quad \bigcup _{d \in D}T(\phi [^x/_{\underline{d}}])\)           \(F(\exists x \phi ) \ \ \ = \quad \bigotimes _{d \in D}F(\phi [^x/_{\underline{d}}] )\)

Observe that \(T(\forall x Px) = T(P\underline{a}) \otimes T(P\underline{b}) = \{\{\mathbf{Pa},\mathbf{Pb}\}\}\), if \(D = \{a,b\}\). Similarly, \(T(\exists x Px) = T(P\underline{a}) \cup T(P\underline{b}) = \{\{\mathbf{Pa}\},\{\mathbf{Pb}\}\}\). Notice that facts might not only be incomplete (neither verify nor falsify a sentence), they might also be inconsistent and both verify and falsify a sentence. Indeed, we have not ruled out facts like \(\{\mathbf{Tj, \overline{Tj}}\}\). Such inconsistent facts are crucial for us to model the meaning of vague sentences, expressing in this case that John is borderline tall.

\(T(\phi )\) can be thought of as a fine-grained semantic interpretation of \(\phi \). It can be used to determine a truth-conditional meaning for \(\phi \) in terms of possible worlds, so long as we have some set W of worlds, understood as special facts:

$$\begin{aligned}{}[\![\phi ]\!] \quad {\mathop {=}\limits ^{df}} \quad \{w \in W\;|\; \exists f \in T(\phi ): f \subseteq w\}. \end{aligned}$$
(1)

It is usual, however, to suppose that not just any fact counts as a world; and depending on what restrictions are or are not imposed, this truth-conditional meaning can behave in different ways.Footnote 21 On one standard approach to worlds that fits with this setup, worlds are taken to be complete and consistent facts. That is, a world is a set w of states of affairs such that for each atomic sentence p in the language, either \(\mathbf{{p}}\in w\) or \(\mathbf{\overline{p}}\in w\) and not both. For our purposes, however, this will not work; it gives us no way to capture the difference between strict and tolerant satisfaction at a world, and so prevents us from connecting this approach to our initial three-valued setup. We will instead take worlds to be complete facts, with no requirement of consistency. That is, a world is a set w of states of affairs such that for each atomic sentence p in the language, either \(\mathbf{{p}}\in w\) or \(\mathbf{\overline{p}}\in w\) (and possibly both). This allows us to capture the difference between strict and tolerant satisfaction at a world, connecting truth-maker semantics to our three-valued st-models above. For each atomic sentence p and world w, we define:

$$\begin{aligned} \mathcal{V}_ w(p)= & {} 1 \quad \hbox {iff} \quad \mathbf{p} \in w \hbox { and } \mathbf{\overline{p}} \not \in w.\\ \mathcal{V}_w(p)= & {} 0 \quad \hbox {iff} \quad \mathbf{p} \notin w \hbox { and } \mathbf{\overline{p}} \in w.\\ \mathcal{V}_w(p)= & {} \frac{1}{2} \quad \hbox {iff} \quad \mathbf{p} \in w \hbox { and } \mathbf{\overline{p}} \in w. \end{aligned}$$

These values can then be used just as outlined above to define the tolerant and the strict semantic interpretation of any sentence.

3.2.2 Pragmatic meaning

However, we did not introduce truth-makers just to recover notions we already had. Our purpose in introducing truth-makers is to define a notion of pragmatic meaning in terms of which we can strengthen the semantic meaning of a sentence.Footnote 22 We have suggested at the end of the previous section that although we allow for inconsistencies, we can still pragmatically infer that \(\lnot p\) is not true from the fact that ‘p’ is asserted, by a reasoning analogue to the kind of reasoning involving scalar implicatures. In linguistic pragmatics it is not uncommon to use minimal models (we will not bother to distinguish models from worlds, using the terms interchangeably) to account for scalar pragmatic implicatures (e.g. Schulz and van Rooij 2006). We will do the same here to strengthen the semantic meaning, although we will think of a minimal world, in a somewhat different way. For us, a minimal world that makes \(\phi \) true is one that is minimally inconsistent: it doesn’t contain more inconsistencies than required to make \(\phi \) at least tolerantly true. It’s worth considering a few ways we might try to get to grips with this intuitive notion. The first way to model this that comes to mind is the following definition, with \(v < w\) iff\(_{df}\) \(\{x \in SOA: \{x, \overline{x}\} \subseteq v\} \subsetneq \{x \in SOA: \{ x, \overline{x}\} \subseteq w\}\):

  • \(Prag(\phi ) \quad = \quad \{w \in [\![\phi ]\!] \;|\; \lnot \exists v \in [\![\phi ]\!]: v < w\}\).

Indeed, this notion of minimally inconsistent interpretation is used in Priest (1991). Unfortunately, as discussed in Cobreros et al. (2015b), this interpretation rule won’t do for our purposes. Recall that according to Cobreros et al. (2012), to communicate that John is a borderline case of a tall man one can say ‘John is tall and not tall’. According to the above pragmatic interpretation, however, a sentence like ‘John is tall and not tall, or Mary is rich’ is incorrectly interpreted as being pragmatically equivalent with ‘Mary is rich’; the contradictory disjunct is too hastily ruled out (see Alxatib et al. 2013).

To avoid this prediction, we argued in Cobreros et al. (2015b) for the following definition of pragmatic interpretation (where \(v <_f w\) iff\(_{df}\) \( \{x \in SOA: x \in f \ \& \ \overline{x} \in v\} \subsetneq \{x \in SOA: x \in f \ \& \ \overline{x} \in w\}\)):

$$ \begin{aligned} PRAG(\phi ) \quad {=_{df}} \quad \{w \in W \ | \ \exists f \in T(\phi ): f \subseteq w\ \& \ \lnot \exists v \supseteq f: v <_f w\}. \end{aligned}$$
(2)

The function PRAG differs in two important ways from Prag: (1) \(PRAG(\phi )\) looks for minimal worlds for each exact truth-maker of \(\phi \) rather than for the whole sentence itself, and (2) for each of these exact truth-makers, it only seeks to minimize inconsistencies with respects to the atomic facts occurring in it; it doesn’t bother about other inconsistencies. It is due to the first feature that we achieve the correct prediction for ‘John is tall and not tall, or Mary is rich”; the exact truth-maker for this sentence that makes its first disjunct true is inconsistent, and so no world containing it could occur in the sentence’s Prag-value, since there is another exact truth-maker that is consistent. But PRAG allows such inconsistent worlds to occur.

PRAG gets many other predictions correct as well: (1) ‘John is tall’ is pragmatically interpreted to mean that John is strictly tall (that is, \(w \in PRAG(Tj)\) iff \(\mathcal{V}_w(Tj) = 1\)), (2) ‘John is not tall’ is predicted to mean that John is not even tolerantly tall (\(w \in PRAG(\lnot Tj)\) iff \(\mathcal{V}_w(Tj) = 0\)), (3) ‘John is tall and John is not tall’ means that John is borderline tall, i.e. tolerantly tall but not strictly tall (\(w \in PRAG(Tj \wedge \lnot Tj)\) iff \(\mathcal{V}_w(Tj) = \frac{1}{2}\)), and (4) ‘John is tall or not tall’ means that John is not borderline tall (\(w \in PRAG(Tj \vee \lnot Tj)\) iff \(\mathcal{V}_w(Tj) = 1\) or 0). Those predictions are in accordance with recent experimental results reported by Alxatib and Pelletier (2011), Ripley (2011), Serchuk et al. (2011), Égré et al. (2013) and Egré and Zehr (2016). Furthermore, ‘John is tall and not tall, and Mary is rich’ is pragmatically interpreted to mean that John is borderline tall and Mary strictly rich, which seems intuitively correct. Finally, ‘John is tall and not tall, or Mary is rich’ is correctly interpreted as saying that John is borderline tall, or Mary is strictly rich.

3.2.3 A tolerance constraint

Let us go back now to the tolerance principle. Since similarity statements like \(a \sim _P b\) are atomic, they correspond to atomic SOAs like \(\mathbf{a \sim _P b}\). At the moment, however, our worlds pay no attention to the connection between these SOAs and the SOAs they are meant to constrain, like \(\mathbf{Pa}\) and \(\mathbf{Pb}\).Footnote 23 We need to implement some kind of constraint so that tolerance can be respected. Here is our chosen constraint: we require that \(\mathcal{V}_w(a \sim _P b) = 1 - |\mathcal{V}_w(Pa) - \mathcal{V}_w(Pb)|\). That is, a and b are to count as P-similar to the extent that the value of Pa matches the value of Pb.Footnote 24 We henceforth exclude all worlds w that violate this condition.

One immediate effect of this is that the tolerance conditionals \(Pa \wedge a \sim _P b \rightarrow Pb\) come out at least tolerantly true at every world. For a world to assign such a conditional the value 0, it would need to assign both Pa and \(a \sim _P b\) the value 1 and Pb the value 0. But this is not possible compatibly with the constraint; if a world assigns 1 to \(a \sim _P b\), then it must assign the same value to Pa and Pb. These tolerance conditionals, then, can never receive the value 0 at any world.

3.3 Tolerance and inference relations

3.3.1 Pragmatic-to-tolerant entailment (Prt)

Our pragmatic interpretation rule can be included in the definition of logical consequence to try and overcome the limitations we pointed out above about the assertability of the tolerance formula in \(st^\sim \).Footnote 25 In Cobreros et al. (2015b), we introduced the following notion of pragmatic consequence, \(\vDash ^{prt}\), going from pragmatically strongest to tolerant:

  • \(\Gamma \vDash ^{prt} \phi \)    iff\(_{df}\)    \(\bigcap _{\psi \in \Gamma }PRAG(\psi ) \subseteq [\![\phi ]\!]\)

The resulting notion of consequence, however, distinguishes between separate premises and their conjunction. There are cases in which \(\phi , \psi \vDash ^{prt} \chi \) but \(\phi \wedge \psi \not \vDash ^{prt} \chi \). For example, \(p, \lnot p \vDash ^{prt} q\): worlds in PRAG(p) must make p strictly true, and worlds in \(PRAG(\lnot p)\) must make it strictly false, so \(PRAG(p) \cap PRAG(\lnot p) = \emptyset \). However, \(p \wedge \lnot p \not \vDash ^{prt} q\), as \(PRAG(p \wedge \lnot p)\) is not empty, and includes some worlds outside of \([\![q]\!]\). The trouble is that PRAG here has its effect only on individual sentences, when we might want it to apply to full collections of premises. To that end, we introduce the following variant approach. We restrict attention to finite sets of premises, and say that a finite set of premises \(\Gamma \) entails \(\phi \) provided the pragmatic meaning of the conjunction of the premises entails the tolerant meaning of the conclusion. (For empty premise sets, we stipulate that \(PRAG(\bigwedge \emptyset ) = W\); asserting nothing rules nothing out.) We call the corresponding notion of validity Prt, and in what follows, we focus on Prt rather than prt, although there is much that is similar about them:

  • \(\Gamma \vDash ^{Prt} \phi \)    iff\(_{df}\)    \(PRAG(\bigwedge \Gamma )\subseteq [\![\phi ]\!]\)

We want to call quick attention to three features of Prt. First, Prt has all tolerance formulas as theorems, since for each such formula A, \([\![A]\!] = W\), as we saw in Sect. 3.2.3. Second, \(\phi , \lnot \phi \not \vDash ^{Prt} \psi \); since there can be worlds that tolerantly satisfy \(\phi \wedge \lnot \phi \), and such worlds need not satify all \(\psi \) even tolerantly, explosion is not valid and Prt entailment is paraconsistent. Third, conjunction elimination is valid: \(\phi \wedge \psi \vDash ^{prt} \phi \) and \(\phi \wedge \psi \vDash ^{prt} \psi \), since a world must at least tolerantly satisfy something in order to pragmatically satisfy it, and tolerant satisfaction of a conjunction requires tolerant satisfaction of each conjunct.

We turn now to Prt’s structural properties. The first thing to note is that Prt is nonmonotonic, and in ways we might want for handling soritical situations. To start, Prt validates n-fold tolerance: \(Pa_0, a_0 \sim _P a_1, \ldots , a_{n - 1} \sim _P a_n \vDash ^{Prt} Pa_n\). To see this, consider a world \(w \in PRAG(Pa_0 \wedge a_0 \sim _P a_1 \wedge \cdots \wedge a_{n - 1} \sim _P a_n)\). There is only one fact in \(T(Pa_0 \wedge a_0 \sim _P a_1 \wedge \cdots \wedge a_{n - 1} \sim _P a_n)\), and it is consistent; thus, any such w must give \(\mathcal{V}_w(Pa_0) = \mathcal{V}_w(a_0 \sim _P a_1) = \cdots = \mathcal{V}_w(a_{n - 1} \sim _P a_n) = 1\). But since all the similarity facts take value 1, by our constraint on worlds, we must have \(1 = \mathcal{V}_w(Pa_0) = \mathcal{V}_w(Pa_1) = \cdots = \mathcal{V}_w(Pa_n)\). And thus, any such world tolerantly satisfies \(Pa_n\)—indeed, strictly satisfies it.

But although Prt validates n-fold tolerance, adding \(\lnot Pa_n\) to the premises of this argument results in an invalid argument. There is only one fact in \(T(Pa_0 \wedge a_0 \sim _P a_1 \wedge \cdots \wedge a_{n - 1} \sim _P a_n \wedge \lnot Pa_n)\), but this time it is not consistent; that is, our similarity restrictions have ruled out any consistent world containing this fact. Indeed, no world meeting our similarity restrictions can strictly satisfy \(Pa_0 \wedge a_0 \sim _P a_1 \wedge \cdots \wedge a_{n - 1} \sim _P a_n \wedge \lnot Pa_n\). It is still tolerantly satisfiable, however. Among the worlds in \(PRAG(Pa_0 \wedge a_0 \sim _P a_1 \wedge \cdots \wedge a_{n - 1} \sim _P a_n \wedge \lnot Pa_n)\) are worlds that tolerantly satisfy every conjunct, and strictly satisfy all conjuncts but two similarity facts \(a_i \sim _P a_{i + 1}\) and \(a_j \sim _P a_{j + 1}\), with \(i < j\).Footnote 26 But at such worlds, we can have \(Pa_k = 1\) for \(0 \le k \le i\), plus \(Pa_k = \frac{1}{2}\) for \(i < k \le j\), plus \(Pa_k = 0\) for \(j < k \le n\). In particular, then, such worlds do not even tolerantly satisfy \(Pa_n\); they show that \(Pa_0, a_0 \sim _P a_1, \ldots a_{n - 1} \sim _P a_n, \lnot Pa_n \not \vDash ^{Prt} Pa_n\), although as we have seen the same argument without its final premise remains Prt-valid.

Is Prt transitive? Answering this question requires fixing on a sense of transitivity. As many usual senses of the term are not appropriate for nonmonotonic logics, this requires a bit of care, since as we have seen Prt is nonmonotonic. We will focus on a property like the one called Cut in, for example, Giordano et al. (2009, p. 18:6): If \(\Gamma \vdash B\) and \(\Gamma , B \vdash C\), then \(\Gamma \vdash C\).Footnote 27 Does Prt have this property?

It does not. The core reason is that conclusions are held to a low standard—tolerant satisfaction—while premises are pragmatically strengthened. And so we have \(Pa, \lnot Pa \vDash ^{Prt} a \sim _P b\), plus \(Pa, \lnot Pa, a \sim _P b \vDash ^{Prt} Pb\); but \(Pa, \lnot Pa \not \vDash ^{Prt} Pb\).Footnote 28 The latter two claims are perhaps straightforward to see, but the first might be surprising. However, it does indeed hold. Any world w in \(PRAG(Pa \wedge \lnot Pa)\) must be such that \(\mathcal{V}_w(Pa) = \frac{1}{2}\). As a result, no matter what \(\mathcal{V}_w(Pb)\) is, we have \(|\mathcal{V}_w(Pa) - \mathcal{V}_w(Pb)| < 1\), and so \(1 - |\mathcal{V}_w(Pa) - \mathcal{V}_w(Pb)| > 0\). By our constraint, this requires \(\mathcal{V}_w(a \sim _P b) > 0\), and so w tolerantly satisfies \(a \sim _P b\).

3.3.2 Pragmatic-to-pragmatic entailment (PrPr)

There are reasons you might be unhappy with \(\vDash ^{Prt}\). We argued in Sects. 1 and 2 that nontransitivity and nonmonotonicity might be ways to account for vagueness. One might wonder, however, whether we need both of these properties. If either one suffices on its own to capture the situation, and if both are somehow seen as drawbacks, then we might hope for a purely nonmonotonic approach, just as st provides a purely nontransitive one. Second, if pragmatic interpretation captures what is meant by the speaker, one might wonder whether either conjunct can be inferred from the premise \(Pa \wedge \lnot Pa\). With this sentence the speaker wants to impart that a is borderline tall. But if a conjunct like ‘Pa’ is asserted alone, it is pragmatically interpreted to mean that a is strictly tall, and thus that a is not borderline tall. If we want a consequence relation capturing what can be asserted on the basis of antecedent assertions, the inference from \(Pa \wedge \lnot Pa\) to Pa should not be valid according to such a relation.

Indeed, experimental investigations of borderline cases point to participant willingness to agree with sentences of the form \(Pa \wedge \lnot Pa\) while disagreeing with Pa and \(\lnot Pa\) individually (see Alxatib and Pelletier 2011; Égré et al. 2013). However, one important thing you might think a consequence relation ought to do is to constrain patterns of agreement and disagreement in the following way: if \(\Gamma \vdash A\), then one ought not agree with everything in \(\Gamma \) while disagreeing with A.Footnote 29 But then, if the participants in these experiments are not mistaken, indeed \(Pa \wedge \lnot Pa\) does not entail Pa or \(\lnot Pa\), in any sense of entailment that bears this relation to their actions. What might such an entailment relation look like? Not Prt, since \(Pa \wedge \lnot Pa\) entails both Pa and \(\lnot Pa\) in Prt.

To account for the latter type of consequence relation we therefore define the following inference relation (from pragmatic to pragmatic interpretation), again restricting ourselves to finite sets of premises, and stipulating that \(PRAG(\bigwedge \emptyset ) = W\):

  • \(\Gamma \vDash ^{PrPr} \phi \)    iff\(_{df} \quad PRAG(\bigwedge \Gamma )\subseteq PRAG(\phi )\)

Thus, for inference we take into account what is (pragmatically) meant both by the premises and by the conclusion. It follows that \(\vDash ^{PrPr}\) does not satisfy conjunction elimination: in particular, \(p \wedge \lnot p\not \vDash ^{PrPr} p\). Because \(p \wedge \lnot p\) will have no models where p is strictly true, the pragmatic interpretation of the premise can only select models where p is tolerantly true. Obviously, p cannot be strictly true here: \(p \wedge \lnot p\not \vDash ^{PrPr} p\). However, even though \(\phi \wedge \psi \not \vDash ^{PrPr} \phi \) in general, still \(p \wedge q \vDash ^{PrPr} p\). The reason is that p and q are independent of each other, and so there are models where \(p \wedge q\) will receive value 1; these will be selected for the interpretation of the premise. In all such models, p will indeed be strictly true.

As explained in Sect. 3.2.2, the pragmatic interpretation of a sentence of the form \((p \wedge \lnot p) \vee q\), \(PRAG((p \wedge \lnot p) \vee q)\), will contain not only worlds where q has value 1, but also ones where p is tolerantly true and q is false. For this reason, \((p \wedge \lnot p) \vee q\not \vDash ^{PrPr} q\). Similarly, \(\phi \wedge \lnot \phi \nvDash ^{PrPr} \psi \); that is, explosion is not valid. PrPr-entailment is again a paraconsistent entailment relation, just like Prt.

The tolerance inference, \(Pa, a \sim _P b \vDash ^{PrPr} Pb\), remains valid. Since the premises can be jointly strictly satisfied by a world, every \(w \in PRAG(Pa \wedge a \sim _P b)\) does indeed strictly satisfy them both, and so strictly satisfy Pb as well. Indeed, for essentially the same reasons as in Prt, n-fold tolerance is PrPr-valid. (Note that the reasoning given to show this for Prt establishes the strict satisfaction of the conclusion of n-fold tolerance, assuming the pragmatic satisfaction of the premises.) However, in contrast to Prt, the conditional form of tolerance, \((Pa \wedge a \sim _P b) \rightarrow Pb\), is not a PrPr theorem. To be a theorem, it would have to follow from the empty set; that is, \(PRAG((Pa \wedge a \sim _P b) \rightarrow Pb)\) would have to be W. But this is not so.Footnote 30 This reflects the fact that, while tolerance conditionals are always at least tolerantly satisfied, asserting them need not be idle; their pragmatic strengthening still rules out some worlds.

Like Prt, PrPr too is nonmonotonic. Some examples of this nonmonotonicity were already present in Prt: \(Pa,a~\sim _P~b \vDash ^{PrPr}~Pb\), but \(Pa, a \sim _P b,\lnot Pa, \lnot a \sim _P b \not \vDash ^{PrPr} Pb\). Others are specific to PrPr: while \(p \vDash ^{PrPr} p\), still \(p, \lnot p \not \vDash ^{PrPr} p\).

Unlike Prt, however, PrPr is transitive in a sense appropriate for nonmonotonic consequence relations: if \(\Gamma \vDash ^{PrPr} B\) and \(\Gamma , B \vDash ^{PrPr} C\), then \(\Gamma \vDash ^{PrPr} C\). This follows from the fact that if \(PRAG(\bigwedge \Gamma ) \subseteq PRAG(B)\), then \(PRAG(\bigwedge \Gamma ) \subseteq PRAG(\bigwedge \Gamma \wedge B)\).Footnote 31 So if \(PRAG(\bigwedge \Gamma \wedge B) \subseteq PRAG(C)\), then \(PRAG(\bigwedge \Gamma ) \subseteq PRAG(C)\) too.

4 Sorites arguments

With the machinery introduced in Sect. 3 we have come to define three different consequence relations, which we restrict to finite sets of premises:

\(\Gamma \vDash ^{st} \phi \) :

just in case there is no model \(\mathcal {M}\) such that,

\(\mathcal {V_M}(\psi )=1\) for all \(\psi \) in \(\Gamma \) and \(\mathcal {V_M}(\phi )=0\).

\(\Gamma \vDash ^{Prt} \phi \) :

just in case there is no model \(\mathcal {M}\) such that,

\(\mathcal {M}\in PRAG(\bigwedge \Gamma )\) and \(\mathcal {V_M}(\phi )=0\).

\(\Gamma \vDash ^{PrPr} \phi \) :

just in case there is no model \(\mathcal {M}\) such that,

\(\mathcal {M}\in PRAG(\bigwedge \Gamma )\) for all \(\psi \) in \(\Gamma \) and \(\mathcal {M}\notin PRAG(\phi )\).

In Sect. 3 we presented some ways to capture similarity relations, in order to be able to express tolerance. In these logics tolerance is internalized since, in fact, the inference from ‘\(\mathsf {Red(a)}\)’ and ‘\(\mathsf {a\sim _{red} b}\)’ to ‘\(\mathsf {Red(b)}\)’ is valid in each case. In this section we review how these logics deal with sorites arguments.

4.1 The logics, side by side

4.1.1 st

The logic st is based on the idea that premises and conclusions of an argument need not be subject to equal standards of satisfaction. If the premises of an argument are true to some strict standard, it suffices for validity if the conclusion is true to some less strict standard. Intuitively, this will lead to breaches of transitivity and this is precisely what happens, according to this logic, in sorites arguments. Since pragmatic satisfaction is not involved, st remains fully monotonic.

In our presentation of the sorites, we can formulate the argument as a step-by-step argument over the similarity relation or as an argument making use of tolerance as an explicit premise. According to st all steps in the sorites argument are valid. For example, for any two colored objects \(\mathsf {a}\) and \(\mathsf {b}\) that differ imperceptibly in hue within a more extended series, we have:

$$\begin{aligned} \mathsf {Red(a)} ,\; \mathsf {a\sim _{Red} b} \vDash ^{st} \mathsf {Red(b)}, \end{aligned}$$

But it doesn’t follow that the last element of the series is red, since when we put these arguments together, validity breaks; n-fold tolerance is not st-valid. That is,

$$\begin{aligned} \begin{array}{c} \mathsf {Red(a_1)},\, \mathsf {a_1\sim _{Red} a_2} \vDash ^{st} \mathsf {Red(a_2)}{\quad }\cdots \quad \mathsf {Red(a_{n-1})},\; \mathsf {a_{n-1}\sim _{Red} a_n} \vDash ^{st} \mathsf {Red(a_n)}\\ \hbox { BUT }\\ \mathsf {Red(a_1)},\; \mathsf {a_1\sim _{Red} a_2},\ldots , \mathsf {a_{n-1}\sim _{Red} a_n}\nvDash ^{st} \mathsf {Red(a_n)} \end{array} \end{aligned}$$

Concerning the tolerance principle,

$$\begin{aligned} \mathsf {\forall x\forall y((Red(x)\wedge x\sim _{Red}y)\rightarrow Red(y))}, \end{aligned}$$

appropriate constraints on similarity render the principle st-valid.

However, st does validate

$$\begin{aligned} \begin{array}{c} \mathsf {\forall x\forall y((Red(x)\wedge x\sim _{Red}y)\rightarrow Red(y)),\; Red(a_1)},\; \mathsf {a_1\sim _{Red} a_2},\ldots , \\ \mathsf {a_{n-1}\sim _{Red} a_n}\vDash ^{st} \mathsf {Red(a_n)} \end{array} \end{aligned}$$

That is, although n-fold tolerance is not st-valid, and although tolerance is already an st-theorem, adding tolerance as a premise to n-fold tolerance results in a valid argument.

4.1.2 Prt

The logic Prt combines two features: pragmatic interpretation for the premises and tolerance for the conclusion. This results in a logic that is both nonmonotonic and nontransitive, as we’ve seen. Regarding step-by-step sorites arguments, Prt predicts all steps in the sorites argument to be valid. So for any element \(\mathsf {a}\) in the series,

$$\begin{aligned} \mathsf {Red(a)},\; \mathsf {a\sim _{Red} b} \vDash ^{Prt} \mathsf {Red(b)} \end{aligned}$$

Unlike st, Prt validates n-fold tolerance:

$$\begin{aligned} \mathsf {Red(a_1)},\; \mathsf {a_1\sim _{Red} a_2},\ldots , \mathsf {a_{n-1}\sim _{Red} a_n}\vDash ^{Prt} \mathsf {Red(a_n)} \end{aligned}$$

But this cannot be arrived at by chaining the smaller inferences together; as we’ve seen, Prt is not in general transitive, and does not allow such chaining.

Moreover, Prt’s nonmonotonicity means that similarity inferences can cease to hold in the presence of countervailing information. For example, we may have:

$$\begin{aligned} \begin{array}{c} \mathsf {Red(a)},\; \mathsf {a\sim _{Red} b} \vDash ^{Prt} \mathsf {Red(b)}\\ \hbox {BUT}\\ \mathsf {Red(a)},\; \mathsf {a\sim _{Red} b},\; \mathsf {\lnot Red(a)},\; \mathsf {\lnot a\sim _{Red} b} \nvDash ^{Prt} \mathsf {Red(b)} \end{array} \end{aligned}$$

Concerning the tolerance principle,

$$\begin{aligned} \mathsf {\forall x\forall y((Red(x)\wedge x\sim _{Red}y)\rightarrow Red(y))} \end{aligned}$$

as in the case of st, is Prt-valid. Unlike in st, however, adding this tolerance sentence as a premise to an n-fold tolerance argument makes no difference: the argument is valid both with and without such a tolerance premise.

4.1.3 PrPr

The driving idea for PrPr is that the validity of an argument should be evaluated in connection with those models that provide the highest standards of satisfaction compatible with the statements contained in the premises and in the conclusions. Just as with our other logics, according to PrPr all steps in the sorites argument are valid. So for any element \(\mathsf {a}\) in the series,

$$\begin{aligned} \mathsf {Red(a), a\sim _{Red} b} \vDash ^{PrPr} \mathsf {Red(b)} \end{aligned}$$

As we’ve seen, in contrast to either st or Prt, PrPr is transitive and, therefore, validity won’t break down when we put these arguments together. That is, since for each \(j \le n\) we have

$$\begin{aligned} \mathsf {Red(a_1)},\; \ldots ,\; \mathsf {Red(a_{j - 1})},\; \mathsf {a_1 \sim _{Red} a_2},\; \ldots ,\; \mathsf {a_{j - 1} \sim _{Red} a_j} \vDash ^{PrPr} \mathsf {Red(a_j)}, \end{aligned}$$

we can put these together by repeated appeals to transitivity to arrive at

$$\begin{aligned} \mathsf {Red(a_1)},\; \mathsf {a_1\sim _{Red} a_2},\ldots ,\mathsf {a_{n-1}\sim _{Red} a_n}\vDash ^{PrPr} \mathsf {Red(a_n)} \end{aligned}$$

Again, nonmonotonicity allows us to render the inference invalid by adding countervailing information in the premises:

$$\begin{aligned} \mathsf {Red(a_1)},\; \mathsf {a_1\sim _{Red} a_2},\ldots ,\mathsf {a_{n-1}\sim _{Red} a_n}\; ,\mathsf {\lnot Red(a_n)}\nvDash ^{PrPr} \mathsf {Red(a_n)} \end{aligned}$$

Unlike in st and Prt, tolerance sentences are not theorems in PrPr. However, like Prt (and unlike st), tolerance sentences make no difference for validity when they are added as premises to n-fold tolerance: with or without additional tolerance premises, n-fold tolerance is PrPr-valid.

4.2 Conclusion

In the course of this paper, we have outlined three distinct consequence relations: st, Prt, and PrPr. Of these, the first is monotonic but nontransitive, the second is both nonmonotonic and nontransitive, and the third is nonmonotonic but transitive. They have much in common: all validate the tolerance inference, and each allows the soft status of tolerance to be recognised by failing to obey the full budget of usual structural rules.

Each relation reveals a different aspect of the underlying space of models, and of the way those models operate in pragmatic strengthening. These models draw together tools from many-valued and nonmonotonic logics to generate appropriate predictions about what is assertible, even in the kinds of tricky cases we’ve discussed.

Since we have defined three kinds of satisfaction (strict, tolerant, pragmatic), we have the raw materials to define nine different two-sided consequence relations, of which we have explored three in a bit of detail. None of these is itself any full story; rather they all reveal different interactions between strict satisfaction, tolerant satisfaction, and the pragmatic processes we have outlined. Overall, then, this paper’s approach not only explains how tolerance can be valid without the sorites wreaking disaster, it also gives a detailed picture of the various ways in which tolerance is valid.