1 Horwichian MT and the generalisation problem

According to Horwich’s minimalism (see Horwich 1999), all the facts about truth can be explained on the basis of the so-called ‘minimal theory’ (MT). The axioms of this theory are the instances of the following disquotational T-schema:

(T)   \(<p>\) is true iff p

where the expression ‘\(<p>\)’ reads ‘the proposition that p’. Horwich claims that the minimal theory fully characterises the content of the notion of truth. Moreover, our understanding of this notion consists of our disposition to accept every (non-paradoxical) instance of (T). The final upshot is that the concept of truth becomes light and unproblematic, devoid of any deep nature for philosophers then to uncover.

One of the main concerns for the adherent of Horwichian minimalism is the (so-called) generalisation problem. How can the minimalist account for generalities involving the notion of truth? Consider, for example, the following statements:

  1. (1)

    Every proposition of the form ‘\(\varphi \rightarrow \varphi \)’ is true;

  2. (2)

    For every \(\varphi \), the negation of \(\varphi \) is true iff \(\varphi \) is not true;

  3. (3)

    Every theorem of S is true (where S is some theory which we accept).

Objections have been made that Horwich’s minimal theory is too weak to prove such generalisations (cf. Gupta 1993). The validity of this charge is not entirely clear, with the main stumbling block being that Horwich has never precisely delineated the collection of axioms of MT. Hence it is not possible to give an exact assessment of their truth-theoretic strength. Nevertheless, it is instructive in this context to see how effective disquotational theories can be in proving general statements of the envisaged sort. On the one hand, it is a well-known fact that some disquotational theories are weak in this respect. As an illustration, let \(L_T\) be the language obtained by extending \(L_{PA}\) (the language of Peano arithmetic) with a new one-place predicate ‘T(x)’. The expressions ‘\(Sent_{L_{PA}}\)’ and ‘\(Sent_{L_{T}}\)’ will be used to denote sentences of (respectively) \(L_{PA}\) and \(L_{T}\). Let \(Ind(L_T)\) be the set of all substitutions of the schema of induction ‘\(\big ( \varphi (0) \wedge \forall x~[\varphi (x) \rightarrow \varphi (x+1)] \big ) \rightarrow \forall x \varphi (x)\)’ by formulas of \(L_T\). We define:

Definition 1

  • \(TB = PA \cup \{T(\ulcorner \varphi \urcorner ) \equiv \varphi : \varphi \in { Sent}_{L_{PA}} \} \cup Ind(L_T)\),

  • \(UTB = PA \cup \{\forall x_1 \ldots x_n [T(\ulcorner \varphi (x_1 \ldots x_n) \urcorner ) \equiv \varphi (x_1 \ldots x_n)]: \varphi \in L_{PA} \} \cup Ind(L_T)\),

  • Theories like TB and UTB but with arithmetical induction only will be denoted (respectively) as \(TB^-\) and \(UTB^-\).

UTB (hence also TB) is a disquotational truth theory quite weak in proving truth-theoretic generalisations. This is the content of the following theorem.

Theorem 2

For every arithmetical formula \(\varphi (x)\), if \(UTB \vdash \forall x [\varphi (x) \rightarrow T(x)]\), then there is a natural number n such that \(PA \vdash \forall x [\varphi (x) \rightarrow Tr_n (x)]\).Footnote 1

It immediately follows that (1) is not derivable in UTB, being that formulas of the form ‘\(\varphi \rightarrow \varphi \)’ have an arbitrarily large complexity. It also follows that for no theory S, \(UTB \vdash \forall \psi [Pr_S(\psi ) \rightarrow T(\psi )]\) (with ‘\(Pr_S(\psi )\)’ being an arithmetical formula with the natural reading ‘\(\psi \) is provable in S’). The reason here is the same as before; namely, that the syntactical complexity of theorems of S will be arbitrarily large. A different argument shows that (2)—the compositional principle for negation—is not provable in UTB either.Footnote 2

On the other hand, it is not the case that all disquotational theories are truth-theoretically weak. As soon as we drop the typing restrictions, the situation changes drastically, as witnessed by the following observation due to McGee (1992).

Theorem 3

Let PAT be Peano arithmetic formulated in \(L_T\). Let \(\varphi \) be an arbitrary sentence of \(L_T\). Then there is a sentence \(\psi \) of \(L_{T}\) such that \(PAT \vdash \varphi \equiv (T(\psi ) \equiv \psi )\).

In other words, every \(L_{T}\) sentence is provably (in PAT) equivalent to some substitution of Tarski’s disquotational schema. In particular, this includes sentences of \(L_{T}\) corresponding to (1)–(3). Therefore (1)–(3) will be provable in some untyped disquotational truth theories.

At best, McGee’s result shows that disquotational theories are not doomed at the start: it is theoretically possible for some set of well-motivated disquotational axioms to be truth-theoretically strong. However, it still remains unclear whether it is anything more than a mere theoretical possibility. The crucial question remains: Is there any one set of disquotational axioms which is both well-motivated and truth-theoretically strong? It is worth observing that the untyped disquotational theories which were actually proposed in the literature (with some philosophical motivation offered) do not fare well in this respect.Footnote 3

In effect, the minimalist owes us an answer to the question of why—if at all—we are entitled or perhaps obliged to accept various generalisations involving the notion of truth. Does his disquotational truth theory prove generalisations such as (1)–(3)? And, if it does not, how can it help us to arrive at them? In a nutshell, this is the generalisation problem.

A useful way of framing the challenge has been suggested by Ketland (2005), who introduced the concept of conditional epistemic obligation. Ketland starts with the intuition that if we accept some base arithmetical theory S (formulated in \(L_{PA}\)), then we are obliged to accept various further statements, possibly unprovable in S itself. Here Ketland’s emphasis is on reflection principles; the relevant definition is introduced below. The acronyms (GR), (UR) and (LR) stand for global, uniform and local reflection respectively.

Definition 4

(GR):

\(\forall \psi \in L_{PA}~[Pr_S(\psi ) \rightarrow T(\psi )]\).

(UR):

\(\forall x~[Pr_S(\psi (x))) \rightarrow \psi (x)]\), for all \(\psi (x) \in L_{PA}\).

(LR):

\(Pr_S(\psi ) \rightarrow \psi \), for all \(\psi \in { Sent}_{L_{PA}}\).

According to Ketland, when accepting S, we are epistemically obliged to accept reflection principles for S in all three versions. Since none of the reflection principles is provable in S,Footnote 4 the natural explanation of our conditional epistemic obligation suggests itself: namely, that all three principles become theorems as soon as appropriate truth axioms are added. However, the prospects for finding well-motivated disquotational axioms producing this effect look dim. Therefore, the disquotationalist faces the dilemma. Paraphrasing Ketland, either he strengthens his axioms, thereby rejecting disquotationalism, or he offers some non-truth-theoretic analysis of the conditional epistemic obligation.Footnote 5

Indeed, it is my opinion that Ketland’s concept of conditional epistemic obligation can be fruitfully applied not just to reflection principles, but also to the compositional principles governing the behaviour of the truth predicate. Consider TB as a starting point. For every arithmetical sentence \(\psi \), we can easily establish in TB that \(T(\lnot \psi ) \equiv \lnot T(\psi )\). The recognition of this fact carries a conditional epistemic obligation: given that we accept TB, we should also accept the general compositional principle for negation (restricted, in this context, to arithmetical sentences only), even though—as it happens—it is not provable in TB itself. How can the disquotationalist account for this without compromising his philosophical standpoint? That is the question.Footnote 6

In recent years Paul Horwich has made two attempts to deal with the challenge. They will be presented and briefly discussed in the next section.

2 Horwich’s two solutions

First attempt In Horwich (1999) an attempt is made to strengthen the minimal theory in such a way that it proves by itself the desired generalisations. This strengthening involves modifying the proof techniques available to us in MT. In Horwich’s words:

It is plausible [\(\ldots \)] that there is a truth-preserving rule of inference that can take us from set of premises attributing to each proposition some property F, to the conclusion that all propositions have F. (Horwich 1999, p. 137)

It seems that Horwich proposes the introduction of a new rule, very similar to the well-known \(\omega \)-rule applied in arithmetical contexts. Assuming that we accept all the sentences obtained from \(\varphi (x)\) by substituting an arbitrary numeral for the variable x, the \(\omega \)-rule permits us to accept the general statement ‘\(\forall x \varphi (x)\)’. The quoted passage hints at a similar strategy. If for each proposition \(\varphi \), \(F(\varphi )\) can be derived in our theory, then we are entitled to conclude that \(\forall \varphi F(\varphi )\).

I will not discuss this idea in detail, referring the reader to Raatikainen (2005) for a convincing criticism. The main thrust of Raatikeinen’s remarks is that any rule invoked to solve the generalisation problem should be practical. In other words, there does not seem to be much point in generalisations being provable in a given theory of truth if we, as human beings, are never able to produce such proofs. After all, we somehow do reach generalisations about truth and any adequate explanatory account should take this fact into consideration. Unfortunately, the system with the \(\omega \)-rule does not satisfy this basic feasibility condition. The rule in question requires infinitely many premises and for this reason its practical utility is close to null.Footnote 7 Indeed, I am inclined to think that it is a very serious worry.

Second attempt Horwich’s second solution was originally proposed in Horwich (2001) and elaborated on in Horwich (2010). Unlike in the previous case, the current proposal involves leaving the proof machinery of MT intact (it remains thoroughly classical); the idea is just to use it together with a certain additional premise. Horwich emphasises that, apart from MT, the minimalist is permitted to uses additional ‘truth-free’ assumptions in his explanations. For example, we can explain why we accept ‘<Elephants have trunks> is true’ as soon as we enlarge MT with a truth-free assumption ‘Elephants have trunks’. Here is the final answer: we accept that ‘Elephants have trunks’ is true because we believe that elephants have trunks and we accept an appropriate disquotational axiom of MT.

In an attempt to generalise this strategy, Horwich proposes the following truth-free assumption:

  1. (A)

    Whenever someone is disposed to accept, for any proposition of structural type F, that it is G (and to do so for uniform reasons) then he will be disposed to accept that every F-proposition is G. (Horwich 2010, p. 45)

With this assumption at hand, Horwich promises to explain why we are inclined to accept generalisations of the (1)–(3) type. As an example, we present the Horwichian explanation below for (1).

Explanation 5

(\(P_1\)):

For every proposition of structural type ‘\(\varphi \rightarrow \varphi \)’, we are disposed to accept that it is true (and we do it for uniform reasons).

(\(P_2\)):

If \(P_1\), then we will be disposed to accept that every proposition of structural type ‘\(\varphi \rightarrow \varphi \)’ is true.

Conclusion We will be disposed to accept that every proposition of structural type ‘\(\varphi \rightarrow \varphi \)’ is true.

Clearly, the reasoning is logically valid. Indeed, we can assume that premise (\(P_1\)) describes us as users of a given disquotational theory of truth; it is also easy to observe that \(P_2\) is an instance of (A). In effect, we obtain the explanation of our acceptance of ‘\(\forall \varphi ~T(\varphi \rightarrow \varphi )\)’.

Still, some critics remained unconvinced. In particular, Bradley Armour-Garb was dissatisfied with premise \(P_2\). In his own words:

One will not be disposed to accept (the proposition) that all F-propositions are G, from the fact that, for any F-proposition, she is disposed to accept that it is G [\(\ldots \)], unless she is aware of the fact that, for any F-proposition, she is disposed to accept that it is G. (Armour-Garb 2010, p. 699)

Armour-Garb’s reservation seems fair indeed. However, as he notes himself, it is possible to take this objection into account, which generates the following modified version of Explanation 5:

Explanation 6

\(S_1\) :

For every proposition of structural type ‘\(\varphi \rightarrow \varphi \)’, we are disposed to accept that it is true.

\(S_2\) :

We are aware that \(S_1\).

\(S_3\) :

If \(S_1\) and \(S_2\), then we will be disposed to accept that every proposition of structural type ‘\(\varphi \rightarrow \varphi \)’ is true.

Conclusion We will be disposed to accept that every proposition of structural type ‘\(\varphi \rightarrow \varphi \)’ is true.

Nevertheless, Armour-Garb is dissatisfied with \(S_2\). He asks: “What is it for one to be aware of such a fact”? He then answers:

Here is a plausible answer: for one to be aware of the fact that, for every F-proposition, she is disposed to accept that it is true is for that person to be aware of the fact that she is disposed to accept that every F-proposition is true. (Armour-Garb 2010, p. 700)

If this is so, then \(S_2\) simply means ‘we are aware that the conclusion holds’ and for this reason Armour-Garb accuses Explanation 6 of being viciously circular. We just cannot explain our disposition to accept a general sentence by citing our awareness that we have such a disposition.

Still, such a dismissal of Horwichian explanations seems to me too hasty, since there are other possible interpretations of \(S_2\) which should be taken into account. The next section contains an initial sketch of what seems to me to be a more promising strategy.

3 Horwichian explanations reconsidered

We will propose here a certain reconstruction of Horwichian explanations. To enhance clarity, they will be presented in a very restricted, arithmetical framework. For starters, we are going to assume that the disquotational theory \(TB^-\) is our preferred theory of truth for the language of arithmetic. The obvious question then arises of why we are inclined to accept such general statements as (1).Footnote 8 A Horwichian explanation of our acceptance of ‘\(\forall \psi \in { Sent}_{L_{PA}}~T(\psi \rightarrow \psi )\)’ will be presented below. The explanation is carried out in a metatheory S about which we will make the following stipulations.

  1. (a)

    The language of S contains expressions ‘we are aware that ...’ and ‘we are disposed to accept ...’, predicated of sentences of \(L_{S}\).

  2. (b)

    S contains Peano arithmetic.

  3. (c)

    S contains the information that \(TB^-\) is our theory. That is, we are disposed to accept sentences in awareness that they are theorems of \(TB^-\).

  4. (d)

    S contains an axiom stating that if we are aware that (for every x, \(\varphi (x)\)), then for every x, we are aware that \(\varphi (x)\).Footnote 9

  5. (e)

    S contains the following necessitation rule: given \(\varphi \) as a theorem, we are allowed to infer ‘we are aware that \(\varphi \)’.

  6. (f)

    S contains Horwich’s rule: given a proof of ‘we are aware that for every x, we are disposed to accept \(\varphi (x)\)’, we are allowed to infer: ‘we are disposed to accept \(\forall x \varphi (x)\)’.Footnote 10

It should be noted that, as it stands, S describes us as highly idealised users of \(TB^-\).Footnote 11 Thus, for example, condition (b) together with the closure condition (e) guarantees that for every theorem \(\varphi \) of PA, S will prove ‘we are aware that \(\varphi \)’. Surely, the awareness of every single theorem of Peano arithmetic (including those never proved by anyone) is an impossibly tall order for any real-world agent, which invites the charge that any solution to the generalisation problem based on S will be as unrealistic and impractical as the recourse to \(\omega \)-rule. Nonetheless, in fact the situation is not that dire at all. In the course of explaining the dispositions of the real-world agents, we can still appeal to those concrete reasonings carried out in S, which employ only the principles known to the agents at a given time (ideally, principles for which the agents themselves have provided proofs).Footnote 12

At this point let us recall Armour-Garb’s question. What is it for one to be aware that \(\varphi \)? In my opinion, the application of (e) to the real-world agents (see the previous paragraph) provides a reasonable sufficient condition. In order to be aware that \(\varphi \) it is enough to prove \(\varphi \). After proving \(\varphi \) in our metatheory S, we are permitted to conclude ‘we are aware that \(\varphi \)’.

Below, we present a Horwich-style explanation of why we are inclined to accept that every arithmetical sentence of the form ‘\(\varphi \rightarrow \varphi \)’ is true. The explanation proceeds as follows.

Explanation 7

  1. (1)

    For every \(\varphi \in { Sent}_{L_T}\), if we are aware that \(TB^- \vdash \varphi \), then we are disposed to accept \(\varphi \). (By (c))

  2. (2)

    For every x, \(TB^- \vdash { Sent}_{L_{PA}}(x) \rightarrow T(x \rightarrow x)\). (By (b), since (2) is provable already in PA)Footnote 13

  3. (3)

    We are aware that (2). (Necessitation, applied to (2))

  4. (4)

    For every x, we are aware that \(TB^- \vdash { Sent}_{L_{PA}}(x) \rightarrow T(x \rightarrow x)\). (From (3) by (d))

  5. (5)

    For every x, \({ Sent}_{L_T}(\ulcorner { Sent}_{L_{PA}}(x) \rightarrow T(x \rightarrow x) \urcorner )\) Footnote 14 (Provable in PA)

  6. (6)

    For every x, we are disposed to accept: \({ Sent}_{L_{PA}}(x) \rightarrow T(x \rightarrow x)\). (From (1), (4) and (5), logic)

  7. (7)

    We are aware that (6). (Necessitation, applied to (6))

  8. (8)

    We are disposed to accept: for every x, \([{ Sent}_{L_{PA}}(x) \rightarrow T(x \rightarrow x)]\). (From (7) by Horwich’s rule)

The acceptance of various other truth-involving generalisations can be explained in an analogous manner.

How should we assess Explanation 7? All in all, I am inclined to think of it as of a step in the right direction. Nevertheless, I will formulate two critical remarks below, both of which indicate the need for a still further reformulation and amendment.

Before engaging into the criticism, let us note one peculiar trait of Explanation 7 (or of Horwichian explanations in general). The explanation of why we are disposed to accept a given statement could proceed by deriving this statement in a theory which we accept. Indeed, this would be the case of \(TB^-\) with \(\omega \) rule, where various truth-theoretic generalisations become provable; this is also the case when we try to explain our acceptance of the truth of ‘Elephants have trunks’.Footnote 15 However, this is not what happens here. The general statement in question, i.e. ‘\(\forall \varphi \in L_{PA} T(\varphi \rightarrow \varphi )\)’, is not derived here at all, neither in \(TB^-\) (which would be impossible, anyway), nor even in \(TB^-\) supplemented with some additional premises. What is instead derived is a statement about our disposition to accept the general sentence under discussion.

I will now formulate two objections against Explanation 7 and against Horwichian explanations in general.

Problem 1

Horwichian explanations are psychological. A psychological fact (namely, our disposition to accept a given sentence) is explained here in terms of our other dispositions and mental abilities. This in itself is not problematic. There is nothing wrong with psychological explanations as such. However, the trouble is that in Explanation 7 the normative element is completely lost and the following additional question arises: Is someone who accepts \(TB^-\) (or Horwich’s MT) committed to accept additional generalisations, unprovable in \(TB^-\)? Assume for the sake of argument that we do satisfy the description from Explanation 7, entailing that, while accepting \(TB^-\), we are also inclined to accept the general statement ‘\(\forall \varphi \in L_{PA} T(\varphi \rightarrow \varphi )\)’. But is there any reason why we should accept such independent sentences?

If the sentence in question was provable in a theory accepted by us, the difficulty would not be so acute. However, as we noticed, Explanation 7 does not contain a derivation of ‘\(\forall \varphi \in L_{PA} T(\varphi \rightarrow \varphi )\)’ in any theory accepted by us. Why then should we accept it?Footnote 16

Problem 2

Premise (1) requires some reformulation. It seems that an assumption to the effect that \(TB^-\) is a theory accepted by us, should be employed in Horwichian explanations. But what does it mean to accept a theory? The problem is that Premise (1) does not adequately express this content. For illustration, assume that my knowledge of the theory \(TB^-\) is very limited—that I do not know much more about \(TB^-\) apart from the fact that it is some theory. In such a case Premise (1) would be vacuously true; nevertheless, we would not say that in this situation I accept \(TB^-\).

In view of this, later on we are going to propose an alternative approach, which preserves some essential traits of Horwichian explanations, but gets rid of psychological concepts altogether. However, we will start with remarks about the notion of accepting a theory.

4 Accepting a theory

Given that we accept a theory Th, why should we then accept various generalisations that are not provable in Th? This question is the starting point of our investigations; this is also what we consider to be the basic challenge behind the generalisation problem. In such a formulation, the notion of accepting a theory comes out as crucial. But what does it mean to accept a theory?

For starters, we are going to consider (and reject) a few candidates for the role of an explication of ‘I accept Th’. Our criterion of assessment of the explications will be twofold. First, we are going to reject those explications which yield clearly implausible consequences.Footnote 17 Second, we will also reject those explications which are too strong (in a sense to be explained below). Here is our initial list of options. ‘I accept Th’ could mean that:

  1. (a)

    For any sentence \(\varphi \), if I believe that \(\varphi \) has a proof in Th, then I am ready to accept \(\varphi \).

  2. (b)

    For any sentence \(\varphi \), if I believed that \(\varphi \) has a proof in Th, then I would be ready to accept \(\varphi \).

  3. (c)

    I accept that all the theorems of Th are true.

  4. (d)

    I accept some truth-free version of the reflection principle for Th (the local or the uniform one).

(a) is clearly inadequate for reasons which have already been indicated (see Problem 2 above). Indeed, if I know nothing about Th, then (a) is vacuously true. Hence it would follow that I accept Th, which is hardly plausible.

(b)—a counterfactual strengthening of (a)—turns out to be inadequate as well. I submit that very few people would accept any theory Th in such a sense! For example, if I believed that Peano arithmetic proves ‘\(0 = 1\)’, then I would not be ready to accept that \(0 = 1\). I would reject PA instead.

Even though (c) sounds plausible in itself, it is too strong to be of much use in our present discussion. To give a simple example, one of the ‘conditional epistemic obligations’ (to use Ketland’s phrase again) is the consistency of Th. Why is it that when accepting Th, we should accept that Th is consistent? In a way, the explication (c) makes the issue trivial. Thus, given an arithmetical theory Th it is easy to observe that any theory of truth which includes T-sentences for the arithmetical language, proves \(Con_{Th}\) (the consistency of Th) when supplemented with ‘All theorems of Th are true’. So far so good—but where does it leave the disquotationalist? How can he accept Th if—as it may well happen—his disquotational truth theory for the language of Th does not permit him to prove ‘all theorems of Th are true’? Indeed, if (c) was the only possible way to make sense of the notion of ‘accepting a theory’, I would be inclined to see it as a strong argument against disquotationalism. But, this is not the only way.

In addition, treating (c) as an explication of ‘I accept Th’ will be very problematic in some particularly pertinent cases, namely, when Th itself is a theory of truth. For illustration, let Th be \(KF + {\textsc {CON}}\); in other words, let it be the Kripke–Feferman system supplemented with the consistency axiom ‘\(\forall \varphi \lnot (T(\varphi ) \wedge T(\lnot \varphi ))\)’.Footnote 18 What does it mean to accept such a theory? Since the system \(KF + {\textsc {CON}}\) proves the untruth of some of its own theorems, accepting \(KF + {\textsc {CON}}\) in the sense (c)—that is, introducing the information that all theorems of \(KF + {\textsc {CON}}\) are true—produces an inconsistent theory.Footnote 19 This is another reason why (c) should be deemed unsatisfactory.

We reject also (d) for similar reasons as (c)—that is, we consider it too strong for our purposes. If ‘accepting Th’ means accepting all the substitutions of (say) the local reflection principle for Th, then how can the disquotationalist accept Th in this sense if (as it may well happen) his disquotational truth theory does not prove all such substitutions? Indeed, one could try to argue that on any admissible sense of ‘accepting’, our acceptance of Th commits us to some forms of reflection for Th.Footnote 20 Still, I am inclined to view such a commitment as something to be explained. It is totally unhelpful to postulate it in advance, as a part of the meaning of ‘I accept Th’.

At this point we have considered and rejected four explications of ‘I accept Th’. So what is left?

In this paper we will work with the following explication, which is a modified version of (b).

  1. (e)

    For any sentence \(\varphi \), if I believed that \(\varphi \) has a proof in Th and I had no independent reason to disbelieve \(\varphi \), then I would be ready to accept \(\varphi \).

Observe that the objection raised earlier against (b) is no longer valid. Here, a mere possibility of the theory being inconsistent is no longer a problem: if I believed that ‘\(0=1\)’ has a proof in Th, then I would not be ready to accept that \(0=1\) because of independent reasons! In fact, the formulation (e) gives justice to the fact that we rarely—if ever—accept our theories unconditionally. The intuition is rather that we will stick to them as long as we believe that they do not yield false consequences. Still, if I accept Th, then given a new sentence \(\varphi \) (new in the sense that I neither accepted nor rejected \(\varphi \) previously), I would accept \(\varphi \) if I believed that it is a theorem of Th. Indeed, this is how the present story goes.

Throughout the rest of this paper I am going to treat (e) as my basic description of the content of ‘I accept Th’. Let us observe that, with such a choice, the generalisation challenge remains nontrivial. For example, we still have to explain why someone accepting PA in this sense should be committed to accept statements not provable in PA (the consistency statement in particular). Alternatively, one could claim that any such additional commitments are illusory.

5 An epistemic approach

5.1 Introducing believability

In the remaining part of this paper, a Horwich-style solution to the generalisation problem will be proposed, one which eschews psychological concepts altogether. Instead of ‘being aware of’ and ‘being disposed to accept’ (see Explanation 7), in our amended explanations we are going to employ a single epistemic predicate B(x) (‘x is believable’), predicated of sentences. The intuitive intended interpretation of B(x) is ‘there is a reason to accept x which is good enough to warrant rational acceptance of x, given the absence of reasons to reject x’ (later on we will abbreviate this as ‘there is a reason to accept x which is normally good enough’ or just as ‘there is a good reason to accept x’). We emphasise that this interpretation is not to be confused with ‘there is a compelling reason to accept x’ or with an unconditional ‘x should be rationally accepted’. It corresponds rather to the weak notion of theory acceptance characterised by condition (e) on p. 11 of this paper. The initial idea is that proofs carried out in a theory that we accept are treated by us exactly as such reasons: when presented with such a proof, we accept its conclusion, unless given strong reasons to the contrary.Footnote 21 In a moment we are going to characterise the new predicate by means of axioms and rules. However, I will start with a general outline of the proposed strategy.

We begin with a very simple notion of truth, characterised by purely disquotational axioms of our truth theory Th (resembling \(TB^-\) or, more ambitiously, resembling perhaps Horwich’s MT). We are convinced that our axioms fully specify the meaning of the truth predicate. Moreover, we treat the axioms as obvious, simple and epistemologically basic. Sometimes we express these convictions by slogans like ‘truth is innocent’ or ‘truth is a light notion’.

We accept our disquotational truth theory. The notion of acceptance is employed here in the sense (e) of the previous section. In practice, given a reliable information about the existence of a proof of \(\varphi \) in Th, we accept \(\varphi \). However, there is one additional element of the picture: it should be emphasised that in such cases we accept \(\varphi \) because of the proof in Th. Indeed, we consider theorems of Th believable. The key point is that our mathematical practice—that of accepting statements because of their proofs in Th (even in cases when we did not check the proofs by ourselves)—would be irrational without the underlying belief that proofs in Th function as reasons which are normally good enough to accept their conclusions.

In the next stage, we characterise our notion of believability by means of some basic axioms and rules. As we are going to see, this move brings important consequences. The final result is that we declare as believable various additional statements in the language of Th, unprovable in Th itself. In effect, our initial acceptance of disquotational theory Th, together with some basic convictions about believability, leads us to the recognition that, indeed, there is a reason to accept e.g. compositional truth axioms. In this way new commitments are generated. The point is that given such a reason, we behave rationally when—not aware of any good reasons to reject compositional principles—we finally accept them. This is the general outline of the story proposed here.

In order to make some of these ideas precise, we introduce the following definition:

Definition 8

Let K be an axiomatisable extension of PA in the language \(L_K\) (possibly richer than \(L_{PA}\)). Denote as \(L_{K, B}\) the extension of \(L_K\) with a new one-place predicate ‘B’. Let KB be a theory K formulated in the language \(L_{K, B}\).Footnote 22

  • We denote as \(Bel(K)^-\) the theory in the language \(L_{K, B}\) which extends KB with the following axioms:

    • \((A_1)\, \forall \psi \in L_{K, B}[KB \vdash \psi \rightarrow B(\psi )]\)

    • \((A_2)\, \forall \varphi , \psi \in L_{K, B}[\big ( B(\varphi ) \wedge B(\varphi \rightarrow \psi ) \big ) \rightarrow B(\psi )]\)

    In addition, the theory \(Bel(K)^-\) has the following rules of inference:

    $$\begin{aligned} {\textsc {NEC}}\quad \frac{\vdash \phi }{\vdash B (\phi )} \qquad \qquad \frac{\vdash \forall x B \phi (x)}{\vdash B \big ( \forall x \phi (x) \big ) }\quad {\textsc {GEN}}\end{aligned}$$
  • We denote as \(Bel^{Con}(K)^-\) the theory which is exactly like \(Bel(K)^-\), except that it contains the following additional consistency axiom:

    • \((A_3)\, \forall \psi \in L_{K, B}\lnot B(\psi \wedge \lnot \psi )\).

  • Bel(K) and \(Bel^{Con}(K)\) are theories which are exactly like \(Bel(K)^-\) and \(Bel^{Con}(K)^-\), except that they contain all the axioms of induction for formulas of \(L_{K, B}\).

We will view Bel(K) as a believability theory built over our initial theory K. The intended reading of ‘B(x)’ (‘x is believable’) will manifest itself in our treatment of a proof of \(B(\psi )\) in Bel(K) as showing that \(\psi \) should be rationally accepted, given our acceptance of K and given that we are not aware of any independent reasons to reject \(\psi \).Footnote 23

Axiom (\(A_1\)) is not much more than a formalisation of the assumption hidden behind our acceptance of K. We realise that proofs in K are (for us) good reasons to accept their conclusions—that is the content. The only addition is that (\(A_1\)) also expresses our acceptance of logic in the extended language, with the new predicate ‘B’ (we emphasise that in the formulation of \((A_1)\) the provability predicate for KB is used).

Axiom (\(A_2\)) expresses the following thought: if there is a good reason to accept the implication and there is a good reason to accept its antecedent, then there is a good reason to accept the consequent.

As for the rules of inference of Bel(K), the intuitive validity of \({\textsc {NEC}}\) seems to me uncontroversial. If a proof of \(\varphi \) in Bel(K) is provided, it is simply this proof itself which constitutes a good reason to accept \(\varphi \), therefore in such a situation \(\varphi \) is believable.Footnote 24

Rule \({\textsc {GEN}}\) is of crucial importance. As we are going to see, it is exactly \({\textsc {GEN}}\) which permits us to derive (in the scope of ‘B’) strong consequences, unprovable in K itself. In order to explain the idea behind \({\textsc {GEN}}\), let us compare it to the following (somewhat similar) reflection rule for the theory K Footnote 25:

$$\begin{aligned} \frac{\vdash \forall x Pr_{K} \phi (x)}{\vdash \forall x \phi (x) }\quad {\textsc {REF}}\end{aligned}$$

Let us assume that K is the theory we accept. Let us also assume that we are able to prove (perhaps in K itself) that for every x, \(\phi (x)\) is a theorem of K. Why should we then accept the general statement ‘\(\forall x \ \phi (x)\)’? The answer becomes unproblematic as soon as we assume that all theorems of K are true. Indeed, after this (with just a few basic properties of the truth predicate) the conclusion of \({\textsc {REF}}\) can be easily justified. Still, with just the disquotational notion of truth we might not have a right to such an assumption. How can we close this gap?Footnote 26

The idea of \({\textsc {GEN}}\) is to close the gap by introducing an additional epistemological element. Here, it should be noted that what we really do in such contexts involves some reasoning about reasons, not about truth. For illustration, assume that you consider the Pope an authority in matters of religious faith. You trust the Pope and the fact that a certain opinion was expressed by the Pope is treated by you as a good reason to accept this opinion. Assume in addition that you heard from a trustworthy source—possibly from the Pope himself—that every book written by a given theologian was proclaimed heretical by the Pope. You will then have a good reason to think that (for every book written by the theologian, there is a good reason to think that it contains a heresy). The idea behind \({\textsc {GEN}}\) is that this in itself constitutes a good reason to think that every book written by the theologian contains a heresy. According to the present proposal, this is how we reason about reasons.

It is worth observing that the following transformation of \({\textsc {GEN}}\) into the axiom misses the mark:

  1. (Ax)

    \(\forall \psi [\forall n B(\psi (n)) \rightarrow B(\forall n \psi (n))]\).

As a formalisation of our intuitions concerning believability, (Ax) is clearly inadequate. For every numerical instance of \(\psi \), there may exist a good reason to accept it without there being a uniform reason, one covering all the cases simultaneously. In other words, there might still not be any good reason to believe the general statement in question. We emphasise that in \({\textsc {GEN}}\) this particular flaw is eliminated; we require that the statement ‘\(\forall n B(\psi (n))\)’ is itself provable, and that we therefore have a good, uniform reason (namely, the proof of the general sentence in Bel(K)) to believe of every instance of \(\psi \) that it is believable. The intuition is that such a good reason to think that all instances are believable is also a good reason to believe a general statement. It is exactly this intuition that is expressed by \({\textsc {GEN}}\).

Axiom (\(A_3\)) is an ugly duckling and I do not present it as a part of the minimal believability theory Bel(K). In this context we should emphasise one more time that, according to the intended interpretation, believability is neither truth nor unconditional rational acceptance. Not only can false sentences be believable but there is also no automatic transition from ‘\(B(\varphi )\)’ to ‘\(\varphi \) should be rationally accepted’. Intuitively, ‘\(B(\varphi )\)’ instead means that ‘there is a reason to accept \(\varphi \) which is normally good enough’ or, in other words, ‘you should rationally accept \(\varphi \) unless you have a strong independent reason to reject \(\varphi \)’. Observe that this reading corresponds closely to the characterisation of the weak notion of theory acceptance adopted in this paper. Thus, the information that \(\varphi \) is provable in K (a theory which I accept) is normally good enough for me to accept \(\varphi \). However, my acceptance of K is not unconditional and if it turned out that \(K \vdash 0=1\), then I would not be ready to accept that \(0=1\). In our framework, this would be the situation of Bel(K) proving \(B(0=1)\). I then still have a reason to accept ‘\(0=1\)’ that is normally good enough (namely, the proof in K—a theory which I accept). However, the situation envisaged is not normal, since I am also able to prove that \(B(\lnot 0 = 1)\). In general, the intuition is that whenever we obtain both \(B(\varphi )\) and \(B(\lnot \varphi \)), there is no automatic transition to the rational acceptance of statements in the scope of the believability predicate.

All in all, one might have a good reason—that is, a reason normally good enough—to accept a sentence \(\varphi \) (for instance, a derivation of \(\varphi \) from some plausible premises) while at the same time having a good reason to accept \(\lnot \varphi \) (say, a derivation of \(\lnot \varphi \) from another set of plausible premises). This is why I reject (\(A_3\)) as an axiom formalising the properties of the believability predicate. The axiom is introduced just in order to indicate that nice models (in a sense to be explained) can be provided even for full \(Bel^{Con}(K)\) on the assumption that the initial theory K is ‘safe’ enough. Nevertheless, none of the applications of the believability theory discussed in this paper will employ \((A_3)\).

5.2 Formal properties of Bel(K)

In this section we state the main formal results about the properties of Bel(K). We omit the proofs, which can be found in the Appendix.

When analysing formal properties of Bel(K) with the intended interpretation of ‘B’ in mind, the following question looms as primary. Imagine that K is an initial theory which we accept. If K is trustworthy, just how trustworthy are the statements which are, provably in Bel(K), within the scope of B? In other words, if Bel(K) proves the believability of a given statement, will something go wrong if we accept this statement? In order to facilitate further discussion, we introduce the following notational convention:

Definition 9

\(Int_{Bel(K)} = \{\psi \in L_{K, B}: Bel(K) \vdash B(\psi ) \}\)

The set \(Int_{Bel^{Con}(K)}\) can be defined in an analogous manner.

What we would like to know is whether the elements of \(Int_{Bel(K)}\) form a nice theory. One particular interpretation of the phrase, ‘a nice theory’, will be considered here; that is, nice theories are interpretable in the standard model of arithmetic.Footnote 27 One of our main formal observation is that even \(Int_{Bel^{Con}(K)}\) is indeed nice in this sense. This is the content of the theorem formulated below.

Theorem 10

Let K be a theory in the language without the predicate ‘B’. If N (the standard model of arithmetic) is expandable to a model \(N^*\) of K, then \(N^*\) is expandable to a model of \(Int_{Bel^{Con}(K)}\).

Having checked that \(Int_{Bel(K)}\) is a nice theory, we observe that even without the axiom \((A_3)\), the theory \(Bel(TB^-)\), obtained by taking \(TB^-\) as the basic (initially accepted) theory, is already strong enough to prove the believability of various interesting generalisations unprovable in \(TB^-\). Let ‘B(CT)’ be a shorthand of ‘all the compositional truth principles are believable’. It is then possible to show that:

Theorem 11

\(Bel(TB^-) \vdash B(CT)\).

As a final observation, we remark that similar results can be obtained for some untyped disquotational truth theories taken as a starting point. Thus, in (Horsten and Leigh 2015) the language \(L_{T, F}\) has been defined in the following manner.

Definition 12

  • Terms and function symbols of \(L_{T, F}\) are exactly those of \(L_{PA}\).

  • The connectives of \(L_{T, F}\) are \(\wedge \) and \(\vee \).

  • The predicates of \(L_{T, F}\) are: \(=\), \(\ne \), T, F.

  • In \(L_{T, F}\) we have the quantifiers \(\forall \) and \(\exists \).

  • The formulas of \(L_{T, F}\) are built in the usual style.

What is important is that in \(L_{T, F}\) we do not have a symbol for negation. We will instead be using the symmetric relation of being a dual formula. Strictly speaking, this notion is first defined for primitive symbols and then generalised to cover all formulas.

Definition 13

  • \(=\)’ and ‘\(\ne \)’ are dual predicate symbols.

  • T’ and ‘F’ are dual predicate symbols.

  • \(\wedge \)’ and ‘\(\vee \)’ are dual connectives.

  • \(\forall \)’ and ‘\(\exists \)’ are dual quantifiers.

  • \(\psi \) and \(\varphi \) are dual formulas iff \(\psi \) is obtained from \(\varphi \) by replacing every symbol in \(\varphi \) with its dual.

We will use the notation \(\varphi ^d\) whenever we want to indicate that a given formula is a dual of \(\varphi \).

In this language, the following basic disquotational theory can be characterised:

Definition 14

  • \(TFB = PA \cup \{T(\ulcorner \varphi \urcorner ) \equiv \varphi : \varphi \in L_{T, F} \} \cup \{F(\ulcorner \varphi ^d \urcorner ) \equiv \varphi : \varphi \in L_{T, F} \} \cup Ind(L_{T, F})\).

  • A theory like TFB, but with arithmetical induction only, will be denoted as \(TFB^-\).

Let KF be a variant of Kripke–Feferman truth theory in the language \(L_{T, F}\).Footnote 28 Let ‘B(KF)’ be the sentence stating that all compositional axioms of KF are believable. Then it is possible to show that:

Theorem 15

\(Bel(TFB^-) \vdash B(KF)\).

In this way a Horwich-style solution to the generalisation problem is vindicated. Indeed, it turns out that if we find our initial disquotational theory (for example, \(TB^-\) or \(TFB^-\)) believable, then we will also find believable various truth-theoretic generalisations, not provable in the initial theory of our choice.

5.3 How does it help?

How attractive is the proposed epistemic strategy? Does it really permit the disquotationalist to escape the generalisation problem? We believe that it does. Nevertheless, in this final section we are going to discuss some limitations of the proposed believability framework, indicating the areas with an additional work still awaiting to be done.

For starters, let us recapitulate the proposed solution.

The disquotationalist begins with accepting some disquotational theory K, which may well be truth-theoretically weak. In particular, it might happen that various compositional principles involving truth are not provable in K. Should he accept these principles? If so, why? This is the question.

Here is the proposed answer. The disquotationalist treats proofs in K as good reasons to accept their conclusions (more exactly, he treats them as reasons which are normally good enough—see the condition (e) earlier in this paper). In this realisation, he applies the believability theory to K treating it as the base. However, it transpires that Bel(K) proves the believability of compositional principles. The initial acceptance of K, together with some basic convictions about believability, leads the disquotationalist to the recognition that there is indeed a good reason to accept compositional principles. Given such a good reason and without being aware of any good reason to reject these principles, he should accept them.

Viewed in more general terms, our endeavour has been to understand the commitments of someone who accepts a given theory K.Footnote 29 We have noticed that the answer depends on the notion of accepting a theory. After observing that some strong notions trivialise the issue, we have decided to focus on the weak notion of theory acceptance. This has forced us to adopt a notion of believability partially characterised by the axioms \((A_1)\) and \((A_2)\) but without the consistency axiom \((A_3)\). We have argued that the statement ‘All theorems of K are believable’ should be accepted once we reflect upon our mathematical practice as users of K.Footnote 30 That is, once we realise that “if we believed that \(\varphi \) has a proof in K and we had no independent reason to disbelieve \(\varphi \), then we would be ready to accept \(\varphi \)”; in other words, once we appreciate that proofs in K are normally good enough for us as reasons to accept their conclusions. The resulting theory Bel(K), with sentences declared believable even though they are unprovable in K, permits us then to recognise our commitments as users of K.

At this point it should be emphasised that our intended interpretation of ‘B’ is only partially captured by the axioms and rules of Bel(K). Apart from the formal machinery of the believability theory, the second crucial element of the picture, external to Bel(K), are the prospective bridge rules which connect believability with rational acceptance. When then, or under what conditions, are we permitted to move from ‘\(B(\varphi )\)’ to ‘\(\varphi \) should be rationally accepted’? In this paper I have advocated a modest approach. I would say that given a proof of \(B(\varphi )\) in Bel(K), we should rationally accept \(\varphi \) as long as we are not aware of any derivation of \(B(\lnot \varphi )\) in Bel(K) or in some theory \(Bel(K')\), built over a well-entrenched mathematical theory \(K'\), possibly different than K. I consider it to be a workable option, one that the deflationist can employ in practice, successfully defending himself against the objections of the critics. However, there is still a room for a more ambitious approach, aiming to explain how we ‘reason about reasons’ even in troublesome cases.

In general, when working without the consistency axiom \((A_3)\) (as I think we should), we are faced with the question of how the bridge rules are to be used in contexts with contradictory statements appearing within the scope of B. One of the main issues is the problem of logical explosion. Since by \((A_1)\) the whole of first-order logic is believable, it is easy to see that with contradictory statements in the scope of B, an arbitrary statement will become believable, provably so in our theory. Let me stress that with our weak reading of ‘B’, this result is not troublesome in itself. Indeed, the intuition is then that treating proofs in K as reasons normally good enough to rationally accept their conclusions, gives you a reason to accept an arbitrary \(\varphi \)—again, a reason which is normally good enough to warrant rational acceptance. However, with both \(B(\varphi )\) and \(B(\lnot \varphi )\) as theorems, the situation is not normal and the bridge rules should not licence the universal transition from B(x) to ‘x should be rationally accepted’. Indeed, various concrete cases indicate that the problem concerns not so much the believability rules and axioms, but K itself.Footnote 31

Nevertheless, in the more ambitious approach afore-mentioned, one could try to give justice to the fact that we do make great efforts to tolerate contradictions in the scope of B. Naturally, when given good reasons for a pair of contradictory statements, we try to resolve the contradiction and to reassess the reasons. However, the main observation to make would be that, in the meantime, we do not treat the believability of a contradiction as an argument which, for every sentence \(\varphi \) of our language, demonstrates that rational acceptance of \(\varphi \) is invalid. As I have said, one bridge rule will permit us to conclude that \(\varphi \) should be rationally accepted, given that we have \(B(\varphi )\) and we are not aware of any reason to accept the negation of \(\varphi \). However, blocking the move to ‘we should rationally accept \(\varphi \)’ whenever both \(B(\varphi )\) and \(B( \lnot \varphi )\) are provable, always and without exceptions, does not seem to be a good idea because in practice such an additional rule is not treated by us as universal. This intuition could lead to a more general, possibly paraconsistent analysis of both believability and rational acceptance. Indeed, I view it as a promising line of research. Nevertheless, here I prefer to remain noncommittal about the shape of such a general theory of believability and rational acceptance.

An additional avenue for further research corresponds to the limiting condition built into our Theorem 10: namely, to the reservation that the base theory K is to be formulated in a language without ‘B’. Indeed, it is easy to observe that removing this reservation makes Theorem 10 false. For a trivial example, let K be PA with the additional axiom ‘\(B(0=1)\)’. Then clearly the standard model of arithmetic is expandable to a model of K but just as clearly \(Int_{Bel(K)}\) is inconsistent. A non-trivial illustration is provided by the lottery paradox, introduced by Kyburg (1961). Consider a fair lottery with a large number of tickets exactly one of which will win. In what follows we assume that the tickets are numbered from 1 to k. It is highly probable that ticket 1 will not win, ticket 2 will not win ...and so on, separately for every single ticket up to k.

The following intuitively plausible premises then lead to a contradiction:

  1. 1.

    It is rational to accept propositions with very high probability assigned to them.

  2. 2.

    If you are aware that a given proposition is inconsistent, then it is not rational to accept it.

  3. 3.

    If it is rational to accept \(\varphi \) and it is rational to accept \(\psi \), then it is rational to accept \(\varphi \wedge \psi \).

Let \(R(\varphi )\) abbreviate ‘it is rational to accept \(\varphi \)’. We obtain:

  1. (a)

    R(ticket 1 will not win) and R(ticket 2 will not win) and ...R(ticket k will not win), (by 1)

  2. (b)

    R(ticket 1 will not win and ticket 2 will not win and ...ticket k will not win), (by 3)

  3. (c)

    R(ticket 1 will win or ...ticket k will win), (by the assumption that some ticket will win)

  4. (d)

    \(R\big (\)(ticket 1 will not win and ...ticket k will not win) and (ticket 1 will win or ...ticket k will win)\(\big )\) (by 3)

However, we are aware that the last sentence in the scope of R is inconsistent and in this way a contradiction with 2 is obtained.

It should be emphasised that the lottery paradox concerns rational acceptance, not believability. For believability, premise 2 simply does not hold. Believability of \(\varphi \) and \(\psi \) (separately) means, in intuitive terms, that we should rationally accept both \(\varphi \) and \(\psi \) assuming that we are not aware of any good reason to accept \(\lnot \varphi \) or \(\lnot \psi \). We should then rationally accept \((\varphi \wedge \psi )\), unless we are aware of a good reason to accept \(\lnot (\varphi \wedge \psi )\), with the point being simply that a good justification of both conjuncts is normally good enough to warrant rational acceptance of a conjunction.Footnote 32 However, if we are aware that \(\lnot (\varphi \wedge \psi )\) is a truth of logic, then the situation is not normal and the usual bridge rule leading to rational acceptance does not apply.

Nevertheless, steps (a)-(c) could be reproduced in Bel(K) assuming that the background theory K, formulated in the language with ‘B’, contains as its theorems (i) all sentences ‘B(ticket m will not win)’ for \(1 \le m \le k\), (ii) ‘B(ticket 1 will win or ...ticket k will win)’. Observe that with these assumptions about K, the rule \({\textsc {GEN}}\) is not needed at all to derive a contradiction in the scope of B, as the mere believability of propositional logic together with \((A_2)\) is quite enough to produce this effect.

As before, if a contradiction appears in the scope of ‘B’, then the bridge rules had better not block the rational acceptance of all sentences of our language; in other words, the transition to rational acceptance should be invalidated locally, not globally. Let me also stress that the present proposal is not meant to offer a solution to the lottery paradox. Indeed, I think that the various solutions proposed in the literature can be applied in our framework on the level of the bridge rules, with the believability theory remaining neutral in this respect.Footnote 33

In spite of these limitations, I think that the proposed epistemic strategy indeed offers an attractive way out for the disquotationalist who is worried about the truth-theoretic weakness of his axioms. It permits him to vindicate his philosophical position in a model case of arithmetical truth, where Theorem 10 applies. There are also good prospects for moving towards a conception of a more directly self-applicable notion of believability, with a base theory K possibly being formulated in the language containing ‘B’.Footnote 34