Abstract
In this paper, we show how to construct for a given consistent theory U a \(\varSigma ^0_1\)-predicate that both satisfies the Löb Conditions and the Kreisel Condition—even if U is unsound. We do this in such a way that U itself can verify satisfaction of an internal version of the Kreisel Condition.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
When does a predicate P in a theory U count as a provability predicate for U? There are various ideas on the market to explicate this notion. These ideas provide conditions for being a provability predicate that cater to various intuitions.Footnote 1 In the present paper, three classes of conditions will be considered: the Hilbert–Bernays–Löb Conditions, the Kreisel Condition and the Feferman Condition. We will introduce the various conditions with some care in Sect. 3. In the present paper, we will not go into the philosophical discussion about the meaning of the conditions and their relative pro’s and con’s. However, in Appendix A, we will give examples that illustrate that all three classes of conditions are independent of one another. These examples can help the reader to form her own impression of what the conditions involve and, possibly, help her to get more grip on the issues surrounding the choice between the various classes of conditions.
The aim of our paper is to study the interplay of the Hilbert–Bernays–Löb Conditions and the Kreisel Condition for the case of \(\varSigma ^0_1\)-predicates. The Kreisel condition for a provability predicate \({\vartriangle }\) for a theory U demands that \(U\vdash {\vartriangle }A\) iff \(U \vdash A\). A first question is whether we can have the Kreisel Condition for a predicate that satisfies the Löb Conditions in case our theory U is unsound. For example, what happens when our theory is \({\textsf {PA}}+{\textsf {incon}}({\textsf {PA}})\)? A second question is as follows. If \({\vartriangle }\) satisfies the Löb Conditions, the theory U, when consistent, cannot verify both the Kreisel Condition and the internal Kreisel Condition \({\vartriangle }{\vartriangle }A \leftrightarrow {\vartriangle }A\). However, can we have the next best thing, to wit: given an appropriately good provability predicate \({\square }\) for U, can we find a predicate \({\vartriangle }\), that satisfies the Löb Conditions and for which we have both \(U\vdash {\vartriangle }A\) iff \(U \vdash A\), and \(U \vdash {\square }{\vartriangle }A \leftrightarrow {\square }A\)? As we will see, the answer to the last question is yes. We can find, in many cases, a predicate \({\vartriangle }\) that satisfies the Löb Conditions, the Kreisel Condition and the internal Kreisel Condition \(U \vdash {\square }{\vartriangle }A \leftrightarrow {\square }A\).
We develop a general construction of a \(\varSigma ^0_1\)-predicate \({\vartriangle }\) that satisfies both the Löb Conditions and the external and internal Kreisel Conditions from suitable data. The internal Kreisel principle \(\vdash {\square }{\vartriangle }A \leftrightarrow {\square }A\) splits in two sub-principles, to wit, the absorption law \(\vdash {\square }{\vartriangle }A \rightarrow {\square }A\) and the emission law \(\vdash {\square }A \rightarrow {\square }{\vartriangle }A\). Our main focus will be on the absorption law.
1.1 Historical note
The following fact is due to Orey. Suppose U is an extension of PA. Then, we can find an elementary \(\alpha \) such that \(\alpha \) represents the axiom set of U over PA and \(U \nvdash {\square }_\alpha \bot \). See [3]. See also [20, Chapter 2].
A construction of a Fefermanian predicateFootnote 2 with \(\varSigma ^0_1\)-axiomatization \(\alpha \) for a theory U that extends PA, such that none of the iterated \({\square }_{\alpha }\)-inconsistency statements \({\square }^n_{\alpha }\bot \) is provable in U is given in [2]. See also [18]. The result we will prove extends the results of Orey and Beklemishev.
The predicates constructed in the present paper can be viewed as slow provability predicates. The absorption law holds for slow provability predicates over PA. Slow provability over EA was introduced and studied in [30]. Slow provability over PA was introduced and studied in [6]. Our knowledge concerning this notion was further extended in [4, 5, 11, 26].
The disadvantage of the present approach to slow provability is that the connection to proof theory and ordinal analysis is not visible. The advantage of the present approach to slow provability compared to the one of Friedman [6] is its wider scope. Moreover, as we discuss below, it is not known whether the approach of Friedman [6] works for Heyting’s Arithmetic HA, the constructive counterpart of PA, where our approach works without problems. (This does not mean that it would not be very interesting to see how to extend the methods of Friedman [6] to the constructive case.)
An alternative approach to obtain a provability style predicate that satisfies both the Löb Conditions and the Kreisel Condition can be found in Section 5 of [34]. The approach in the present paper has a number of advantages. First, it is somewhat more perspicuous. Secondly, the constructed predicates also satisfy the Hilbert–Bernays Condition. Thirdly, the construction of the predicates is fixed-point-free. Fourthly, using the present approach we can also, in a number of cases, construct predicates \({\vartriangle }\) with the desired properties that are Fefermanian (in a sense that will be further specified in the paper).
The basic idea for the predicate constructed in this paper is due to Fedor Pakhomov. He suggested considering this predicate when I asked him whether there was a non-model theoretic proof of the absorption law for slow provability. However, the proof of absorption given in this paper is quite different from the one Fedor had in mind.
1.2 Prerequisites
The reader should be familiar with basic materials from Hájek and Pudlák [10]. For certain local results there may be further prerequisites, but we will make these clear in situ.
1.3 Overview of the paper
In Sect. 2, we introduce the basic facts, notations and definitions for the rest of the paper.
Section 3 is a brief treatment of the Löb Conditions, the Kreisel Condition and the Feferman Condition.
Then, in Sect. 4, we construct a predicate \({\vartriangle }\) with the desired properties for theories U that extend Peano Arithmetic. This is, in many respects, the simplest case of the construction. In this simple case, we have the extra property that our \({\vartriangle }\) is Fefermanian. On the other hand, the construction is subject to some restrictions. It works only if we start with an elementary numeration \(\alpha \) of the axiom set of U such that \(\pi \preceq \alpha \). The construction delivers an axiom set numerated by \({\widetilde{\alpha }}\) for U. However, we will have \(\pi \not \preceq {\widetilde{\alpha }}\), so that the construction cannot be iterated. The construction in this section will also be covered by the more general construction in Sect. 5. However, we think the general treatment becomes easier to follow if one has seen Sect. 4 first.
In Sect. 5 , we give the basic construction in the abstract, starting from a predicate \(\theta \) that satisfies a list of properties. We illustrate how \({\square }_{\alpha _x}\) of Sect. 4 does satisfy the properties for \(\theta \), so that the approach of Sect. 4 is subsumed under the approach of Sect. 5.
Finally, in Sect. 6, we show, under fairly general conditions, how to construct a predicate \(\theta \) that satifies the desired properties.
In “Appendix A”, we provide separation examples between the Löb Conditions, the Kreisel Condition and the Feferman Conditions.
2 Basic conventions, notations, definitions
In this section, we introduce basic conventions and fix some notations and give some definitions.
2.1 Theories
A theory U in this paper is a theory in the signature of arithmetic.Footnote 3 A theory is given by a set X of axioms. We will generally assume that X is a recursively enumerable set. However, X is just given as a set and it is not intrinsically connected with a presentation. We will assume as a default that U extends Elementary Arithmetic EA.
Two salient theories of the paper are Elementary Arithmetic EA and Peano Arithmetic PA. The theory EA is \(\mathrm I\varDelta _0+{\textsf {exp}}\). It is finitely axiomatizable by a single axiom B. See [10]. The predicate \(x= \underline{\ulcorner B \urcorner }\) will be called \(\beta \). The theory PA has a standard elementary presentation \(\pi \) of the axiom set corresponding to the usual axiom scheme.
We will also consider the extension of EA with the \(\varSigma ^0_1\)-collection principle \(\mathrm B \varSigma ^0_1\). This principle is given by:
Here \(S_0\) is \(\varSigma ^0_1\) and may contain further parameters.
2.2 Arithmetization
We will sometimes use implementation properties of the arithmetization like monotonicity and the efficiency of syntactical operations. For this reason, we outline a few features of the Gödel coding we intend to use. We use a style of Gödel numbering that is due to Smullyan (see [27]). Our Gödel numbering is based on the length-first ordering. We enumerate the strings of our finite alphabet according to length and the strings of the same length alphabetically. The Gödel number of a string s will be the number of occurrences in this enumeration. In this ordering the arithmetical function tracing concatenation is of the order of multiplication. We can use our bijective coding of strings to implement sequences of numbers. This has the bonus that also concatenation of sequences of numbers will be of the order of multiplication.Footnote 4
We will in many cases employ modal notations. E.g., let \({\textsf {prov}}_\alpha \) be the arithmetization of provability from the axioms in \(\alpha \). We write \({\square }_\alpha A\) for \({\textsf {prov}}_\alpha (\underline{\ulcorner A \urcorner })\). Here \(\underline{\ulcorner A \urcorner }\) is the numeral of the Gödel number of A. We will sometimes quantify the sentence-variables inside a modal operator. For example, we write things like:
This stands for:
Admittedly, such notations are somewhat sloppy, but I think in practice they are very convenient. E.g., \((\dag )\) is more pleasant to read than (\(\ddag \)).
We employ the usual conventions for quantifying numerical variables into modal contexts. E.g. \({\square }_\alpha A(x)\) means \(\exists z \,({\textsf {sub}}(x,\underline{\ulcorner v_0 \urcorner },\underline{\ulcorner A(v_0) \urcorner },z) \wedge {\textsf {prov}}_\alpha (z))\).
We will employ the witness comparison notation. Suppose \(A = \exists x\, A_0(x)\) and \(B = \exists x\, B_0(x)\). We write:
-
\(A \le B\) for \(\exists x\, (A_0(x) \wedge \forall y < x\, \lnot \, B_0(y))\).
-
\(A < B\) for \(\exists x\, (A_0(x) \wedge \forall y \le x\, \lnot \, B_0(y))\).
2.3 Ordering of predicates for axioms
Let \(\gamma (x)\) and \(\delta (x)\) be formulas with only x free that EA-verifiably represent classes of arithmetical sentences. Let T be an extension of EA. We write \(\gamma \preceq _T \delta \) for
Here \({\textsf {prov}}_\alpha \) is a standard arithmetization of provability from \(\alpha \). Our default for T will be EA and we will write \(\preceq \) for \(\preceq _{\textsf {EA}}\).
It is easy to see that \(\preceq _T\) is a partial pre-ordering.
3 Conditions for provability predicates
In this section, we introduce three (classes of) conditions that aim to explicate when a predicate is a provability predicate.
3.1 The Löb conditions
To state the Löb conditions we write \({\vartriangle }A\) for \(P(\underline{\ulcorner A \urcorner })\) and \(\vdash \) for provability in U. The Löb conditions (introduced in [21]) are as follows.
- L1.:
-
If \(\vdash A\), then \(\vdash {\vartriangle }A\)
- L2.:
-
\( \vdash ({\vartriangle }A \wedge {\vartriangle }(A\rightarrow B)) \rightarrow {\vartriangle }B\)
- L3.:
-
\(\vdash {\vartriangle }A \rightarrow {\vartriangle }{\vartriangle }A\)
We obtain the Hilbert–Bernays Conditions in case we replace L3 by:
- HB.:
-
\(\vdash S \rightarrow {\vartriangle }S\), for \(\varSigma ^0_1\)-sentences S.
The usual assumption connected to the Hilbert–Bernays conditions is that P be \(\varSigma ^0_1\), so that L3 is a special case of HB. It is easy to see that if P is not \(\varSigma ^0_1\), we can have \({\textsf {L}}1,2\) and HB but not L3. E.g., we may take P to be Feferman provability for PA.
We note that, in case P is \(\exists \varSigma ^{\textsf {b}}_1\), the Löb conditions are more general than the Hilbert–Bernays Conditions. For example, in a weak theory like \({\textsf {S}}^1_2\) we do have the Löb Conditions for \({\textsf {S}}^1_2\)-provability for a standard provability predicate—assuming an efficient arithmetization—but it is unknown whether we have the Hilbert–Bernays Conditions.
Technically, the Löb Conditions constitute a superior analysis of the proof of the Second Incompleteness Theorem. The philosophical use of the Conditions is independent of their technical interest. The philosophical idea is that the Löb Conditions explicate the theoretical role that a provability predicate plays in a theory.
We note that the Löb conditions do depend on the choice of Gödel numbering and hence are still not entirely ‘coordinatefree’. For a study of this dependence and a proposal to abstract away from it, see [7].
The Löb Conditions also have a uniform and a global version. In the uniform version we allow parameters in the formulas inside the operator. For example, L2 becomes: \( \vdash \forall \vec {x}\, (({\vartriangle }A(\vec {x}\,) \wedge {\vartriangle }(A(\vec {x}\,)\rightarrow B(\vec {x}\,))) \rightarrow {\vartriangle }B(\vec {x}\,))\). In the global version, the quantifiers over sentences are not outside but inside the theory. For example, L2 becomes: \( \vdash \forall A,B\in {\textsf {sent}}\,( ({\vartriangle }A \wedge {\vartriangle }(A\rightarrow B)) \rightarrow {\vartriangle }B)\). We note that the global version is stronger than the uniform one. We will not consider the strengthened conditions in the present paper.
3.2 The Kreisel condition
The Kreisel Condition was first formulated in [16]. Its statement is as follows:
- K.:
-
\(U\vdash {\vartriangle }A\) iff \(U\vdash A\).
We note that the Kreisel Condition is of a quite different nature than the Löb Conditions. It just asks that the theory numerates its own provability by the given predicate.
One could imagine a variant of the Kreisel Condition where we just ask numerability in a base theory \(U_0\) that is a sub-theory of U.
Finally, we observe that, like the Löb Conditions, the Kreisel Condition does depend on the chosen Gödel numbering.
3.3 The Feferman condition
We explain the idea that a provability-predicate is Fefermanian. We derive this idea from the methodology introduced in [3]. The main ingredient of the idea is simply to fix a preferred arithmetization of provability and allow the choice of the predicate \(\alpha \) representing the axiom-set to be free, given that it satisfies certain adequacy conditions.
The best way to present a Fefermanian predicate is to view it as a tuple \({\langle U_0,U, \alpha \rangle }\). Here \(U_0\) is the base theory and U is the lead theory. We ask that U extends the base \(U_0\). We demand that \(\alpha \) numerates an axiom set X for U in the base theory \(U_0\). In other words, we demand that \(A\in X\) iff \(U_0 \vdash \alpha (\underline{\ulcorner A \urcorner })\).
We note that the demands on a Fefermanian predicate treat the axioms of the lead theory via a condition similar to the Kreisel Condition.
In the present paper, we will consider Fefermanian predicate modulo provability in the base theory. Thus, we will say that P is Fefermanian for U over \(U_0\) in the relaxed sense iff, there is an \(\alpha \) such that \({\langle U_0,U, \alpha \rangle }\) is Fefermanian in the strict sense and \(U_0 \vdash \forall x\, (P(x) \leftrightarrow {\textsf {prov}}_\alpha (x))\).
The reader may object that the Feferman Condition does not count as a real condition since it employs an unspecified specification of the arithmetization.Footnote 5 Of course, the reader is correct here. Feferman, in his paper, does specify a choice for a proof system and an arithmetization. However, in Feferman’s arithmetization, the Gödel number of a formula is superexponential in its length, so it is not a convenient Gödel numbering to work with within EA. Moreover, if Feferman’s specific Gödel numbering would really be the golden standard, it would be reasonable that everybody would know its specification, but, of course, that is not the case. I see the use of the Feferman idea more as dialogical. The reader is asked to take her favored good arithmetization in mind and read for prov provability according to that arithmetization. So, prov becomes context dependent like the word ‘you’. I will employ the Feferman idea in this way.
3.4 Properties of Fefermanian predicates
In this subsection we briefly consider some basic insights on Fefermanian predicates.
Let \({\textsf {A}}_U\) be the class of all \(\alpha \) in \(\varSigma ^0_1\) such that \({\langle {\textsf {EA}}, U,\alpha \rangle }\) is Fefermanian.
Theorem 1
Let U be a theory. Then \({\mathsf{A }}_U\) has a minimum w.r.t. \(\preceq \) iff U is finitely axiomatizable.
Proof
Suppose U is finitely axiomatizable, say by \(A_0,\ldots , A_{n-1}\). Let \(\alpha ^*(x) :=\bigvee _{i<n} x=\underline{\ulcorner A_i \urcorner }\). Consider any \(\alpha \) in \({\textsf {A}}_U\). We find for \(i<n\) that \(U \vdash A_i\), and, hence \({\textsf {EA}} \vdash {\square }_\alpha A_i\).
We reason in EA. Suppose p witnesses \({\square }_{\alpha ^*} A\) and \(p_i\), for \(i<n\), witnesses \({\square }_\alpha A_i\). We obtain an \(\alpha \)-proof q of A by adding the \(p_i\) ‘above’ \(A_i\) to p. (Note that we do not need \(\varSigma ^0_1\)-collection since n is standard.)
Suppose U is not finitely axiomatizable. Consider any \(\alpha \in {\textsf {A}}_U\). Clearly, for any n there is a B such that \(U\vdash B\) but the axioms in \(\alpha \) that are \(\le n\) do not prove B. Hence, \(C:= \forall x\, \exists B \, ({\square }_\alpha B \wedge \lnot \, {\square }_{\alpha _x} B)\) is true, where \(\alpha _x(y) :\leftrightarrow \alpha (y) \wedge y\le x\). Thus, \({\textsf {EA}}+C\) is consistent. Let \(\gamma (x) :\leftrightarrow \beta (x) \vee x=\underline{\ulcorner C \urcorner }\), where \(\beta \) is the standard axiomatization of EA. We define:
It is evident that \(\alpha '\preceq \alpha \). Suppose (†) \(\alpha \preceq \alpha '\). We reason inside \({\textsf {EA}}+C\). By (†), we have \(\forall B\, ({\square }_\alpha B \rightarrow {\square }_{\alpha '} B)\). Suppose p is a \(\gamma \)-proof of \(\bot \). It follows that the \(\alpha '\) axioms are below p. Consider B such that \({\square }_\alpha B\) but not \({\square }_{\alpha _p}B\). It follows that \(\lnot \, {\square }_{\alpha '}B\). A contradiction. It follows that there is no \(\gamma \)-proof of \(\bot \), in other words, \(\diamond _\gamma \top \). We leave \({\textsf {EA}}+C\).
We have shown \({\textsf {EA}}+C\vdash \diamond _\gamma \top \). But this contradicts the Second Incompleteness Theorem. Hence (†) must fail. \(\square \)
Remark 1
What happens if we replace EA in the definition of \({\textsf {A}}_U\) by another base theory T and, simultaneously, consider \(\preceq _T\) in stead of \(\preceq _{\textsf {EA}}\)? Inspection of the proof of Theorem 1 shows that we have to replace \(\beta \) in the proof by an elementary predicate numerating the axioms of T in T. This can always be arranged due to Craig’s trick. We also have to assume that T is \(\varSigma ^0_2\)-sound to be sure that \(T+C\) is consistent. We note that the application of the Second Incompleteness theorem goes through by the usual argument since the analogue of \(\gamma \) is elementary.Footnote 6
Thus, our result goes through, as long as the base theory is \(\varSigma ^0_2\)-sound. \(\lozenge \)
Theorem 2
Consider theories \(U_0\) and U where EA is a sub-theory of \(U_0\) and \(U_0\) is a sub-theory of U. Suppose:
-
a.
P numerates U in \(U_0\).
-
b.
P contains \(U_0\)-provably all predicate-logical tautologies.
-
c.
P is \(U_0\)-provably closed under finite conjunctions.
-
d.
P is \(U_0\)-provably closed under modus ponens.
Then, P is Fefermanian (in the relaxed sense) for U over \(U_0\) as witnessed by \({\langle U,U_0, P \rangle }\).
Proof
Clearly, we have \(U_0 \vdash \forall B\in P \, {\textsf {prov}}_P(B)\). Conversely, reason in \(U_0\). Suppose p is a P-proof of B. Let X be the finite set of P-axioms used in p. Then, \((\bigwedge X \rightarrow B)\) is a predicate logical tautology, so \((\bigwedge X \rightarrow B)\in P\). By closure under conjunction, we have \(\bigwedge X \in P\). Hence, by closure under modus ponens, we find \(B\in P\). \(\square \)
Theorem 3
Consider theories \(U_0\) and U where \(\mathrm I\varSigma ^0_1\) is a sub-theory of \(U_0\) and \(U_0\) is a sub-theory of U. Let P be a \(\varSigma ^0_1\)-predicate. Suppose:
-
a.
P numerates U in \(U_0\).
-
b.
P contains \(U_0\)-provably all predicate-logical tautologies.
-
c.
P is \(U_0\)-provably closed under modus ponens.
Then P is Fefermanian for U over \(U_0\) with P itself as representation of the axiom set.
Proof
Under the assumptions of the theorem, we can prove that P is closed under finite conjunctions by \(\varSigma ^0_1\)-induction. \(\square \)
Example 1
We take as base and lead theory PA. The predicate \({\square }_\pi {\square }_\pi \) is Fefermanian. Similarly, for \(\exists x\,{\square }_\pi ^{x+1}(\cdot )\). The last predicate is, modulo PA-provable equivalence, Parikh provability or fast provability. Parikh provability can be obtained by adding to an axiomatization based on \(\pi \) the Reflection Rule: \({\vdash {\square }_\pi A} \Rightarrow {\vdash A}\). See [24]. See also [9]. \(\lozenge \)
Theorem 4
Suppose U extends EA and P is Fefermanian w.r.t. a \(\varDelta _0({\mathsf{exp }})\)-presentation \(\alpha \) of the axiom set. Then, P satisfies the Löb Conditions.
Theorem 5
Suppose U extends \({\mathsf{EA }}+ \mathrm {B}\varSigma ^0_1\) and P is Fefermanian w.r.t. a \(\varSigma _1\)-presentation \(\alpha \) of the axiom set. Then, P satisfies the Löb Conditions.
Theorem 6
Suppose \({\langle U_0,U,\alpha \rangle }\) is a strict Fefermanian representation, where \(\alpha \) is \(\varSigma ^0_1\), and suppose U and \(U_0\) are sound. Then, \({\square }_\alpha \) satisfies the Kreisel Condition for U.
Proof
Since \(U_0\) is sound, we have \(\alpha ({\ulcorner A \urcorner })\) iff \(U_0 \vdash \alpha (\underline{\ulcorner A \urcorner })\). So, \(\alpha \) truly represents the axioms of U. It follows that \(U \vdash A\) iff \({\square }_\alpha A\). Since U is sound, we find \({\square }_\alpha A\) iff \(U \vdash {\square }_\alpha A\). So, we may conclude \(U\vdash A\) iff \(U \vdash {\square }_\alpha A\). \(\square \)
Finally, we look into the interaction of elementary axiomatizations, i.e., \(\varDelta _0({\textsf {exp}})\)-axiomatizations, and \(\varSigma ^0_1\)-axiomatizations. We note that an elementary formula numerates the same set in every consistent theory. This immediately gives us the following insight.
Theorem 7
Suppose \(U_0\) and \(U_1\) are consistent subtheories of U. Suppose further that \({\langle U_0,U,\alpha \rangle }\) is Fefermanian, where \(\alpha \) is elementary. Then, \({\langle U_1,U,\alpha \rangle }\) is Fefermanian.
Similarly, a \(\varSigma ^0_1\)-formula numerates the same set in all \(\varSigma ^0_1\)-sound theories. So, we have:
Theorem 8
Suppose \(U_0\) and \(U_1\) are \(\varSigma ^0_1\)-sound subtheories of U. Suppose further that \({\langle U_0,U,\alpha \rangle }\) is Fefermanian, where \(\alpha \) is \(\varSigma ^0_1\). Then, \({\langle U_1,U,\alpha \rangle }\) is Fefermanian.
We remind the reader of Craig’s trick. Let \(U_0\) be a \(\varSigma ^0_1\)-sound base theory. Suppose \(\sigma \) is a \(\varSigma ^0_1\)-formula that \(U_0\)-provably represents a set of arithmetical sentences. Suppose \(\sigma (x) = \exists y\, \sigma _0(y,x)\), where \(\sigma _0\in \varDelta _0({\textsf {exp}})\). We define
Here conj arithmetizes forming a conjunction, id arithmetizes forming an identity statement from terms, num arithmetizes the numeral function. Clearly, \(\hat{\sigma }\) is elementary. As is well known, we have:
Theorem 9
-
i.
\({\mathsf{EA }} \vdash \forall A\, ({\square }_{\hat{\sigma }} A \rightarrow {\square }_\sigma A)\).
-
ii.
\({\mathsf{EA }}+\mathrm B\varSigma _1 \vdash \forall A\, ({\square }_{\hat{\sigma }} A \leftrightarrow {\square }_\sigma A)\).
Theorem 10
Suppose \({\langle U_0,U,\sigma \rangle }\) is Fefermanian and \(U_0\) is \(\varSigma ^0_1\)-sound and \(\sigma \) is \(\varSigma ^0_1\). Then, \({\langle U_0,U,\hat{\sigma } \rangle }\) is also Fefermanian.
Proof
Let X be the set of axioms numerated by \(\sigma \) in \(U_0\) and let \(\hat{X}\) be \(\hat{\sigma }\) in \(U_0\). By the \(\varSigma ^0_1\)-soundness of \(U_0\), the set X is the set of numbers for which \(\sigma \) is true and the set \(\hat{X}\) is the set of numbers for which \(\hat{\sigma }\) is true. So, by the unformalized version of Theorem 9(ii), we find that both X and \(\hat{X}\) axiomatize the same theory, to wit U. \(\square \)
We note that it is essential that \(U_0\) is \(\varSigma ^0_1\)-sound. If it were not the Craig construction could transform a standard axiom to a non-standard axiom. The non-standard axiom would not be visible in the numeration.
3.5 Examples
We provide a list of examples for coincidence and separation of the conditions. As before \(\beta \) is the standard representation of the axiom of EA and \(\pi \) is the standard representation of the axioms set of Peano Arithmetic. We will, in our examples, prefer EA over PA, \(\varSigma ^0_1\)-predicates over more complex ones, and sound theories over unsound ones. Only in the first examples of Examples 8 and 10, perhaps, improvement is possible by finding an example that works for and over EA.
Base | Lead | P | Löb | Kreisel | Feferman | |
---|---|---|---|---|---|---|
Example 4 | EA | EA | \(\varSigma ^0_1\) | \(+\) | \(+\) | \(+\) |
Example 5 | EA | EA | \(\varSigma ^0_1\) | \(+\) | \(+\) | − |
Example 6 | EA | \({\textsf {EA}}+{\square }_\beta \bot \) | \(\varSigma ^0_1\) | \(+\) | − | \(+\) |
EA | EA | \(\varSigma ^0_2\) | ||||
Example 7 | EA | EA | \(\varSigma ^0_1\) | \(+\) | − | − |
Example 8 | PA | PA | \(\varSigma ^0_2\) | − | \(+\) | \(+\) |
EA | EA | \(\varSigma ^0_{1,1}\) | ||||
Example 9 | EA | EA | \(\varSigma ^0_1\) | − | \(+\) | − |
Example 10 | PA | PA | \(\varSigma ^0_2\) | − | − | \(+\) |
EA | \({\textsf {EA}}+{\square }_\beta {\square }_\pi \bot \) | \(\varSigma ^0_{1,1}\) | ||||
Example 11 | EA | EA | \(\varSigma ^0_1\) | − | − | − |
We will provide and verify the promised examples in “Appendix A”.
4 Extensions of Peano arithmetic
Let U be a consistent extension of PA and let \(\alpha \) be an elementary numeration of an axiom set X of U in U, such that \(\pi \preceq \alpha \).Footnote 7 We will show how to construct a \(\varSigma ^0_1\)-predicate \({\widetilde{\alpha }}\) that numerates the the axioms of U in U. Thus \({\widetilde{\alpha }}\) will be Fefermanian for U over U.
The concrete examples to keep in mind are the standard representation \(\pi \) of the axioms of PA and \(\pi (x) \vee x = \underline{\ulcorner {\square }_\pi \bot \urcorner }\) representing the axioms of \({\textsf {PA}}+{\square }_\pi \bot \).
We write \(\alpha _x\) for the \(\alpha \)-axioms \(\le x\). So, \(\alpha _x(y) :\leftrightarrow \alpha (y) \wedge y\le x\). We write \({\square }_{\alpha ,(x)}\) for provability from \(\alpha \) by a proof \(\le x\).Footnote 8 We will use \(S,S',\dots \) as variables ranging over \(\varSigma ^0_1\)-sentences.
A number x is small, or \(\mathcal S(x)\), iff \({\square }_{\alpha ,(x)}\) is \(\varSigma ^0_1\)-reflecting. The means that \(\mathcal S(x)\) iff \(\forall S\, ({\square }_{\alpha ,(x)} S \rightarrow {\textsf {true}}(S))\), where true is a standard \(\varSigma ^0_1\)-truth predicate. We note that smallness does depend on the chosen \(\alpha \). We also note that, by our assumptions on the Gödel numbering, the quantifier over S can be bounded by n. It follows, by \(\varSigma ^0_1\)-collection, that, modulo PA-provability, smallness is a \(\varSigma ^0_1\)-predicate. Finally, smallness is clearly downward closed.
It is consistent with U that not all numbers are small, since U does not prove \(\varSigma ^0_1\)-reflection for \(\alpha \)-provability. On the other hand, for every n, we have that U proves that it is small, i.e., \(U \vdash \mathcal S(\underline{n})\). The argument looks like this. (A more general argument is given in the proof of Lemma 2.) Consider a number n.
Let \(k\le n\) and \(s \le n\). If s is a code of a \(\varSigma ^0_1\)-sentence S and if k is the Gödel number of an \(\alpha \)-proof of S, then we have \(U \vdash S\), and, hence,
If s is not a code of a \(\varSigma ^0_1\)-sentence or, if s is a code of a \(\varSigma ^0_1\)-sentence S and k is not the Gödel number of an \(\alpha \)-proof of S, then we have \(U \vdash \lnot \, {\textsf {sent}}_{\varSigma ^0_1}(s) \vee \lnot \, {\textsf {proof}}_\alpha (\underline{k},s)\), and, hence, again, \(U \vdash ({\textsf {sent}}_{\varSigma _1^0}(s) \wedge {\textsf {proof}}_{\alpha }(\underline{k},s)) \rightarrow {\textsf {true}}(s)\). It follows that:
Hence, by U-reasoning, \(U \vdash \mathcal S(\underline{n})\). The above reasoning is so simple that it can be verified in PA, and so (†) \({\textsf {PA}} \vdash \forall x\, {\square }_\alpha \mathcal S(x)\). The principle (†) is a typical example of an outside-big-inside-small principle. Objects that are very big in the outer world are small in the inner world.
We define the slow provability of A or \({\vartriangle }A\) as: A is provable from small \(\alpha \)-axioms. So,
-
\({\widetilde{\alpha }}(x) :\leftrightarrow \alpha (x) \wedge \mathcal S(x)\)
-
\({\vartriangle }A :\leftrightarrow {\square }_{{\widetilde{\alpha }}}A\).
We list two formulas that are equivalent to \({\vartriangle }A\) over PA and all provide worthy ways of looking at it. Let \({\square }^*_\alpha A :\leftrightarrow \exists x\, {\square }_{\alpha _x}A\).
-
\({\vartriangle }A\) iff \(\exists x\, ({\square }_{\alpha _x}A \wedge \mathcal S(x))\),
-
\({\vartriangle }A\) iff \({\square }^*A < \exists x\, \lnot \, \mathcal S(x)\).
Remark 2
Our \({\vartriangle }\) is part of a family of closely related predicates. To make this visible we consider a slight variant of our \({\vartriangle }\) with the same good properties. We write \({\square }_\alpha ^{\varPi ^0_1}\) for provability with a \(\varPi ^0_1\)-oracle. We define \({\vartriangle }^\circ A\) as \({\square }_\alpha ^*A < {\square }_\alpha ^{\varPi ^0_1}\bot \). We note that this is equivalent to \(\exists x\, ({\square }_{\alpha _x}A \wedge \diamond _{\alpha ,(x)}^{\varPi ^0_1}\top )\).
This representation brings out the analogy with Feferman provability which can be defined as \({\square }_\alpha ^*A < {\square }_\alpha ^*\bot \) and a provability predicate studied in [12, 14], to wit \(\exists x\, ({\square }_{\alpha _{x+1}}A \wedge \diamond ^{\varPi ^0_1}_{\alpha _x}\top )\) or, alternatively, \({\square }_\alpha ^*A \le {\square }_\alpha ^{*\varPi ^0_1}\bot \). We note that, unlike \({\vartriangle }^\circ \), these predicates are not \(\varSigma ^0_1\).\(\lozenge \)
Suppose A is in X. Then, \(U \vdash \alpha (\underline{\ulcorner A \urcorner })\). Since also \(U \vdash \mathcal S(\underline{\ulcorner A \urcorner })\), we find \(U \vdash \widetilde{\alpha }(\underline{\ulcorner A \urcorner })\). Conversely, suppose \(U \vdash \widetilde{\alpha }(\underline{\ulcorner A \urcorner })\). Then, \(U \vdash \alpha (\underline{\ulcorner A \urcorner })\) and, hence, \(A \in X\). Thus \({\widetilde{\alpha }}\) numerates X in U.
Since \({\vartriangle }\) is \({\square }_{{\widetilde{\alpha }}}\) and \({\widetilde{\alpha }}\) is \(\varSigma ^0_1\), it follows, in PA by \(\varSigma ^0_1\)-collection, that that \({\vartriangle }\) is \(\varSigma ^0_1\). Hence, \({\vartriangle }\) satisfies the Löb conditions.
We show that PA verifies emission and absorption for \({\vartriangle }\). By the soundness of PA, the Kreisel Condition follows.
We first prove emission. We prove the stronger \({\textsf {PA}}\vdash {\square }_\alpha A \rightarrow {\square }_\alpha {\vartriangle }A\). We reason in PA. Suppose \({\square }_\alpha A\). Then, clearly, for some x, we have \({\square }_{\alpha _x} A\). Hence, \({\square }_\alpha {\square }_{\alpha _x}A\). Also, (†) gives us \({\square }_\alpha \mathcal S(x)\). So, \({\square }_\alpha ({\square }_{\alpha _x}A \wedge \mathcal S(x))\) and, thus, \({\square }_\alpha {\vartriangle }A\).
We prove absorption. The proof turns out to be remarkably simple. We find R such that \({\textsf {EA}} \vdash R \leftrightarrow (\exists x \, {\square }_{\alpha _x} A ) < {\square }_\alpha R\). We note that R is \(\varSigma ^0_1\).
We reason in PA. Suppose \({\square }_\alpha {\vartriangle }A\). We prove \({\square }_\alpha A\). We reason inside \({\square }_\alpha \). Since, by assumption, \({\vartriangle }A\), we have, for some x, (i) \({\square }_{\alpha _x} A\) and (ii) \(\forall S\, ({\square }_{\alpha ,(x)} S \rightarrow {\textsf {true}}(S))\). In case not \({\square }_{\alpha ,(x)} R\), by (i), we find R. If we do have \({\square }_{\alpha ,(x)} R\), we find R by (ii). We leave the \({\square }_\alpha \)-environment. We have shown \({\square }_\alpha R\). It follows, (a) that for some p, we have \({\square }_\alpha {\square }_{\alpha ,(p)} R\) and, by the fixed point equation for R, (b) \({\square }_\alpha ((\exists x\, {\square }_{\alpha _x} A) < {\square }_\alpha R)\). Combining (a) and (b), we find \({\square }_\alpha {\square }_{\alpha _p} A\), and, thus, since U, as axiomatized by \(\alpha \), is, EA-verifiably, essentially reflexive by our assumption that \(\pi \preceq \alpha \), we obtain \({\square }_{\alpha } A\), as desired. We leave PA. We have shown \({\textsf {PA}} \vdash {\square }_\alpha {\vartriangle }A \rightarrow {\square }_\alpha A\).
What happens if we drop the assumption that \(\alpha \) is elementary and work with a \(\varSigma ^0_1\)-predicate \(\sigma \)? We have the following.
Theorem 11
Suppose \({\langle \textsf {PA} ,U,\sigma \rangle }\) is Fefermanian, where \(\sigma \) is \(\varSigma ^0_1\) and \(\pi \preceq \sigma \). Then we can construct a \(\varSigma ^0_1\)-predicate \(\sigma ^*\) such that \({\langle U,U,\sigma ^* \rangle }\) is Fefermanian and \({\square }_{\sigma ^*}\) satisfies, in U, the Löb-conditions, the Kreisel condition and the pair \({\square }_{\sigma ^*}\), \({\square }_{\sigma }\) satisfies emission and absorption over U.
Proof
We take \(\sigma ^*\) to be \(\widetilde{\hat{\sigma }}\). We note that \({\langle {\textsf {PA}},U,\hat{\sigma } \rangle }\) is Fefermanian and thus \({\square }_{\sigma ^*}\) satisfies, in U, the Löb-conditions and the Kreisel condition. Moreover the pair \({\square }_{\sigma ^*}\), \({\square }_{\hat{\sigma }}\) satisfies emission and absorption over U. However, \({\square }_\sigma \) and \({\square }_{\hat{\sigma }}\) are co-extensional over PA and, hence, a fortiori, over U. So, the pair \({\square }_{\sigma ^*}\), \({\square }_{\sigma }\) also satisfies emission and absorption over U. \(\square \)
Remark 3
The arguments of this section can be extended to constructive logic. In this case we still have the representations \(\beta \) for the axiom set of \(\mathrm i\text{- }{\textsf {EA}}\) and \(\pi \) for the axiom set of HA. So the whole development remains unchanged. One just has to check that never a step was taken that is essentially classical.
The intuitionistic development has an important point. In their paper [1], Mohammad Ardeshir and Mojtaba Mojtahedi characterize the provability logic of HA for \(\varSigma ^0_1\)-substitutions. This is the most informative result on the provability logic of HA at the moment of writing. An alternative proof has been developed in [36]. This proof uses slow provability in the style of Friedman, Rathjen and Weiermann for HA. The proof works because only a restricted version of the absorption law is needed. The validity of the full absorption law is plausible but not proved. Replacement by of Friedman–Rathjen–Weiermann slow provability by slow provability in the style of the present paper (as suggested by Fedor Pakhomov) does give us full absorption.
We show that we get a strengthened version of absorption in the case of HA. The proof is intended for readers with some background in the metamathematics of constructive arithmetical theories.
Theorem 12
\(\mathrm {i} \text{- }{\mathsf{EA }} \vdash {\square }_\pi (A\vee B) \leftrightarrow {\square }_\pi (A \vee {\square }_{{\widetilde{\pi }}} B)\).
Proof
We reason in i-EA.
Suppose \({\square }_\pi (A\vee B)\). It follows by either q-realizability or the de Jongh translation that, for some x, we have \({\square }_\pi (A \vee {\square }_{\pi _x} B)\). From this, we may conclude \({\square }_\pi (A \vee {\square }_{{\widetilde{\pi }}} B)\).
Conversely, suppose \({\square }_\pi (A \vee {\square }_{{\widetilde{\pi }}} B)\). By the left-to-right case (with change of variables), we have \({\square }_\pi ({\square }_{{\widetilde{\pi }}} A \vee {\square }_{{\widetilde{\pi }}} B)\). Hence, \({\square }_\pi {\square }_{{\widetilde{\pi }}} (A \vee B)\). So, by absorption, \({\square }_\pi (A \vee B)\). \(\square \)
Thus, the alternative predicates that satisfy the absorption law also have a rich constructive life. \(\lozenge \)
5 The abstract construction
In this section we present a construction that builds an appropriate \({\vartriangle }\) from a given predicate \(\theta \) that satisfies certain good properties.
As before the variables \(S,S',\dots \) will range over (codes of) \(\varSigma ^0_1\)-sentences.
5.1 The argument
Let U be a theory. Suppose \(\alpha (x)\) is an elementary predicate that numerates the axioms of U in U. Let \(\theta (y,z)\) be a \(\varSigma ^0_1\) binary predicate. We demand that \(\theta \) is EA-verifiably, upwards persistent in y, i.e., we assume that
Let be \(\theta (y,\underline{\ulcorner A \urcorner })\). We write as long as \(\theta \) is given in the context.
As a heuristic, the reader may think of as a generalization of \({\square }_{\alpha _y}A\) as we used it in Sect. 4, studying the case where \(\pi \preceq \alpha \).
We define:
-
true is the \(\varSigma ^0_1\)-truth predicate, which is of the form \(\exists y\,{\textsf {true}}_0(y,x)\), where \({\textsf {true}}_0\) is \(\varDelta _0({\textsf {exp}})\). We write \({\textsf {true}}^z(x)\) for \(\exists y \le z\,{\textsf {true}}_0(y,x)\).
-
\({\square }_{\alpha ,(x)} A :\leftrightarrow \exists p\le x \,{\textsf {proof}}_\alpha (p,\underline{\ulcorner A \urcorner })\), where proof is the standard arithmetization of the proof predicate.
-
\(\mathcal S(x) :\leftrightarrow \exists z\, \forall S \le x\, ({\square }_{\alpha ,(x)} S \rightarrow {\textsf {true}}^z(S))\). Here the variable ‘S’ ranges over \(\varSigma ^0_1\)-sentences.
-
. We will usually write \({\vartriangle }\) for \({\vartriangle }_\theta \) suppressing the contextually given \(\theta \). We note that modulo some rewriting \({\vartriangle }_\theta \) is \(\varSigma ^0_1\).
The definition of \({\vartriangle }_\theta \) is in essence due to Fedor Pakhomov.
As explained in Sect. 2.2, we assume that we have a reasonable coding of proofs in which the code of the proof is larger than the code of the conclusion. We fix, for the moment \(\theta \) in the background.
We note that our definition of \(\mathcal S\) is slightly different from the one in Sect. 4. This is just to compensate for the lack of \(\varSigma ^0_1\)-collection. We have:
We note that we can write the right-hand-side of (†) as:
Here the witness comparison is only concerned with the outer quantifiers.
Lemma 1
.
Proof
We find R such that . We note that R is \(\varSigma ^0_1\).
We reason in . Suppose \({\square }_\alpha {\vartriangle }A\). We prove \({\square }_\alpha A\).
We reason inside \({\square }_\alpha \). Since, by assumption, \({\vartriangle }A\), we have, for some x, (i) and (ii) \(\forall S\le x\, ({\square }_{\alpha ,(x)} S \rightarrow {\textsf {true}}(S))\). In case not \({\square }_{\alpha ,(x)} R\), by (i), we find R. If we do have \({\square }_{\alpha ,(x)} R\), we find R by (ii).Footnote 9 We leave the \({\square }_\alpha \)-environment.
We have shown \({\square }_\alpha R\). It follows, (a) that for some p, we have \({\square }_\alpha {\square }_{\alpha ,(p)} R\) and, by the fixed point equation for R, (b) . Combining (a) and (b) and the upward persistence of , we find , and, thus, \({\square }_\alpha A\), as desired. We leave .
We have shown . \(\square \)
The proof of Lemma 1 deserves a few comments.
Remark 4
We note that the argument also works when we define \({\vartriangle }A\) as . The argument does not use that \({\vartriangle }\) is \(\varSigma ^0_1\). \(\lozenge \)
Remark 5
In all applications of Lemma 1, EA verifies not just for the concrete choice of of the application, but the stronger . We note that using this last condition, we may obtain the theorem without the demand that is upward persistent in y. In , we can go from \({\square }_\alpha {\square }_{\alpha ,(p)} R\) and to , and, hence, \({\square }_\alpha A\). \(\lozenge \)
Remark 6
The proof of Lemma 1 does not use exponentiation and would work in \({\textsf {S}}^1_2\). The reason is that we only use \({\textsf {true}}(R) \rightarrow R\), which is the direction of \({\textsf {true}}(R) \leftrightarrow R\) that does not require exponentiation. \(\lozenge \)
Remark 7
Let \(\mathrm i\text{- }{\textsf {EA}}\) be the constructive version of EA. Let U be a constructive theory that extends \(\mathrm i\text{- }{\textsf {EA}}\). Suppose \(\mathrm i\text{- }{\textsf {EA}} \vdash (\theta (y,z) \wedge y < y') \rightarrow \theta (y',z)\).
Then, inspection shows that the entire proof of Lemma 1 also works when we substitute \(\mathrm i\text{- }{\textsf {EA}}\) for EA. This uses the basic insight that \({\square }_{\alpha ,(x)}R\) is decidable in \(\mathrm i\text{- }{\textsf {EA}}\). So the case-splitting in the proof can be constructively justified.
Thus, we find . \(\lozenge \)
We prove the outside-big-inside-small lemma for \(\mathcal S\) as a notion of smallness in EA. The proof has to be a bit more elaborate that in the luxurious case where we had full PA to work with.
Lemma 2
\({\mathsf{EA }} \vdash \forall x\, {\square }_\alpha \mathcal S(x)\).
Proof
We work in EA. We prove the desired result by induction on x. We need a multi-exponential bound for the \({\square }_\alpha \)-proofs. We will extract the desired bound by inspecting the induction step.
The base case is trivial since there will be no \(S \le 0\). The proof witnessing the base will be given by a standard number \(\underline{n}\).
We turn to the induction step. Suppose \(p_0\) witnesses \({\square }_\alpha \mathcal S(x)\). We have two possibilities: \({\textsf {proof}}_\alpha (x+1,S^*)\), for some \(S^*\le x+1\), or \(\lnot \, {\textsf {proof}}_\alpha (x+1,S^*)\), for all \(S^*\le x+1\).
Suppose \({\textsf {proof}}_\alpha (x+1,S^*)\). Inspecting the proof of the truth-lemma for \({\textsf {true}}\) in [10, Ch. V, Section 5b, pp. 361–366], we obtain a \(p_1\) such that \({\textsf {proof}}_\alpha (p_1,S^*\rightarrow {\textsf {true}}(S^*))\). The transformation \(S^*\mapsto p_1\) is p-time. By [10, Ch. III, Lemma 3.14, p. 175], we obtain an \(\alpha \)-proof \(p_2\) of \({\textsf {proof}}_\alpha (x+1,S^*)\). The transformation \(x+1 \mapsto p_2\) is of order \(2_{\underline{k}}^{x+1}\), where k is standard and the subscript \(\underline{k}\) indicates the number of iterations of exponentiation. Working inside \({\square }_\alpha \) we can put these facts together to obtain
Let \(z_0\) be the witness of (a), let \(z_1\) be the witness of (c). Then, it is easily seen that \(z := {\textsf {max}}(z_0,z_1)\) witnesses \(\mathcal S(x+1)\).
Suppose \(\forall S^*\le x+1 \, \lnot \, {\textsf {proof}}_\alpha (x+1,S^*)\). By [10, Ch. III, Lemma 3.14, p. 175], we may find an \(\alpha \)-proof \(p_3\) of \(\forall S^*\le x+1 \, \lnot \, {\textsf {proof}}_\alpha (x+1,S^*)\) where the transformation \(x+1 \mapsto p_3\) is of order \(2_{\underline{k}}^{x+1}\). Using
inside \({\square }_\alpha \), we easily find the desired proof of \(\mathcal S(x+1)\).
We note that apart from a bit of overhead we extend \(p_0\) with at most two proofs that are estimated by \(2_{\underline{k}}^{x+1}\). So, the resulting proof will be of order \(p_0\times (2_{\underline{k}}^{x+1})^2\). Thus, after all is said and done, the proof we obtain of \(\mathcal S(x+1)\) will be estimated by \(\underline{n} \times (2_{\underline{k}}^{x+1})^{2(x+1)} = 2^{2_{\underline{k} -1}^{x+1}\times 2(x+1)} \le 2_{\underline{k}}^{2x+1}\), assuming that \(k \ge 2\). \(\square \)
Lemma 3
. Hence,
Proof
We work in EA. Let x be given. By Lemma 2, we find \({\square }_\alpha \mathcal S(x)\). Thus, . This gives us . \(\square \)
Lemma 4
.
Proof
We work in . Suppose \({\vartriangle }A\) and \({\vartriangle }(A\rightarrow B)\). It follows that, for some x, we have and \(\mathcal S(x)\) and that, for some y, we have and \(\mathcal S(y)\). Let \(z := {\textsf {max}}(x,y)\). It is easily seen that and and \(\mathcal S(z)\). Hence, and \(\mathcal S(z)\), and, thus, \({\vartriangle }B\). \(\square \)
Lemma 5
.
Proof
This is immediate by Lemma 2. \(\square \)
We formulate the obvious theorem that follows from the Lemmas. Let \({\textsf {W}}_{\alpha ,\theta }\) be EA plus the following principles:
- a.
- b.
- c.
- d.
Let \({\textsf {W}}^+_{\alpha ,\theta }\) be EA plus the following principles.
-
A.
\( {\square }_\alpha {\vartriangle }A \rightarrow {\square }_\alpha A\)
-
B.
\({\square }_\alpha A \rightarrow {\square }_\alpha {\vartriangle }A\)
-
C.
\(({\vartriangle }A \wedge {\vartriangle }(A\rightarrow B)) \rightarrow {\vartriangle }B\)
-
D.
\( \forall S\, {\square }_\alpha (S \rightarrow {\vartriangle }S)\)
Theorem 13
Let \(\alpha \) be a \(\varDelta _0({\mathsf{exp }})\)-predicate that numerates the axioms of U in EA, or, equivalently, in true arithmetic. Let \(\theta \) be a \(\varSigma ^0_1\)-predicate that satisfies \(\mathsf{EA } \vdash (\theta (y,z) \wedge y < y') \rightarrow \theta (y',z)\). Then, \({\mathsf{W }}_{\alpha ,\theta }\) implies \({\mathsf{W }}^{+}_{\alpha ,\theta }\).
The logic GLT is the bimodal propositional logic which has GL both for \({\square }\) and \({\vartriangle }\), plus the following principles.
-
\(\vdash {\vartriangle }\phi \rightarrow {\square }\phi \).
-
\(\vdash {\square }\phi \rightarrow {\vartriangle }{\square }\phi \).
-
\(\vdash {\square }\phi \rightarrow {\square }{\vartriangle }\phi \).
-
\(\vdash {\square }{\vartriangle }\phi \rightarrow {\square }\phi \).
By Theorem 13, we have:
Theorem 14
Let \(\alpha \) be a \(\varDelta _0({\mathsf{exp }})\)-predicate that numerates the axioms of U in EA, or, equivalently, in true arithmetic. Let \(\theta \) be a \(\varSigma ^0_1\)-predicate that satisfies \({\mathsf{EA }} \vdash (\theta (y,z) \wedge y < y') \rightarrow \theta (y',z)\).
Suppose \({\mathsf{W }}_{\alpha ,\theta }\) is a true theory and \(U \vdash {\mathsf{W }}_{\alpha ,\theta }\). Then GLT is arithmetically valid in U. In addition, U satisfies HB both for \({\square }_\alpha \) and for \({\vartriangle }_\theta \). Finally, \({\vartriangle }_\theta \) satisfies the Kreisel Condition in U.
5.2 Extensions of Peano arithmetic revisited
We show how the case of extensions of Peano Arithmetic, treated in Sect. 4, fits the framework of the present section.
Let U be a consistent extension of PA and let \(\alpha \) be an elementary predicate numerating the axioms of U in EA with \(\pi \preceq \alpha \). We note that, equivalently, \(\alpha \) numerates the axioms of U in true arithmetic. Let \(\alpha _z(x) :\leftrightarrow \alpha (x) \wedge x \le z\). We take \( \varTheta _\alpha (z,x) := {\textsf {prov}}_{\alpha _z}(x)\) in the role of \(\theta \). Thus, we have \( {\square }_{\alpha _z}A = \varTheta _\alpha (z,\underline{\ulcorner A \urcorner })\) in the role of and we have \( {\widetilde{{\square }}}_\alpha A := \exists z\, ({\square }_{\alpha _z}A \wedge \mathcal S(z))\) the role of \({\vartriangle }A\).
We define: \({\widetilde{\alpha }}(a) :\leftrightarrow \alpha (a) \wedge \mathcal S(a)\). We have:
Lemma 6
\({\mathsf{EA }} \vdash \forall A\, ({\widetilde{{\square }}}_\alpha A \leftrightarrow {\square }_{{\widetilde{\alpha }}} A)\).
Proof
We reason in EA.
Suppose \({\widetilde{{\square }}}_\alpha A\). Then, for some z, we have \({\square }_{\alpha _z}A\) and \(\mathcal S(z)\). Suppose p witnesses \({\square }_{\alpha _z}A\) and B is an axiom used in p. Then, \(\alpha (B)\) and \(B \le z\). Since \(\mathcal S\) is downward persistent w.r.t. \(\le \), we find \(\mathcal S(B)\), and, hence, \({\widetilde{\alpha }}(B)\).
Conversely, suppose \({\square }_{{\widetilde{\alpha }}}A\). let q be a witnessing proof. Let B be the maximal \(\alpha \)-axiom used in q. We find \(\mathcal S(B)\). Thus, \({\square }_{\alpha _B} A\) and \(\mathcal S(B)\), i.e., \({\widetilde{{\square }}}_\alpha A\). \(\square \)
Lemma 7
The predicate \({\widetilde{\alpha }}\) numerates the axioms of U in U. Hence, \({\widetilde{{\square }}}_\alpha \) is Fefermanian in U over U.
Proof
Let X be the set of axioms set of axioms numerated by \(\alpha \).
Suppose \(n \in X\). Then \(\alpha (n)\) and hence \(U \vdash \alpha (\underline{n})\). Since also, by Lemma 2, we have \(U \vdash \mathcal S(\underline{n})\), it follows that \(U \vdash {\widetilde{\alpha }}(\underline{n})\).
Suppose \(U \vdash {\widetilde{\alpha }} (\underline{n})\). Then, \(U \vdash \alpha (\underline{n})\) and, hence, \(n \in X\). \(\square \)
Lemma 8
EA verifies \({\mathsf{W }}_{\alpha ,\varTheta _\alpha }\).
Proof
The principle (a) follows by essential reflexivity. The principles (b) and (c) are trivial. The principle (d) follows since for a sufficiently large n we will have, in EA, that \({\square }_{\alpha _{\underline{n}}} B\), where B is a single axiom for EA. \(\square \)
By Theorem 14 and Lemma 8 we find:
Theorem 15
The logic GLT is arithmetically valid for \({\square }_\alpha \) and for \({\square }_{{\widetilde{\alpha }}}\) over U. In addition, we have HB both for \({\square }_\alpha \) and for \({\square }_{{\widetilde{\alpha }}}\) over U. Finally, \({\square }_{{\widetilde{\alpha }}}\) satisfies the Kreisel Condition in U.
In case U is sound, one easily sees that the pair \({\square }_\alpha \) and \({\square }_{{\widetilde{\alpha }}}\) satisfies the conditions of Theorem 16 of [11]. It follows that GLT is precisely the bi-modal propositional logic of \({\square }_\alpha \) and \({\square }_{{\widetilde{\alpha }}}\) in U, for sound U.
Remark 8
Let \({\textsf {EA}}+{\textsf {ref}}\) be EA plus sentential reflection for predicate logic. Let \(\tau \) be a standard axiomatization for \({\textsf {EA}}+{\textsf {ref}}\). Let U be an extension of \({\textsf {EA}}+{\textsf {ref}}\) and let \(\alpha \) be an elementary axiomatization of U such that \(\tau \preceq \alpha \). With these basic ingredients we can repeat the development of the present section noting that we are always looking at sentential reflection rather than uniform reflection.
In [31] we introduced the theory Peano Corto, which has many analogies to \({\textsf {EA}}+{\textsf {ref}}\). It would be interesting to see how much of our development can be repeated for the case of Peano Corto. \(\lozenge \)
6 Extensions of elementary arithmetic
In this section, we give a general construction of a \(\theta \) with the desired properties for extensions of EA.
We first take a moment to see that, in order to get the desired combinations of properties for extensions of EA, we indeed need to leave the realm of the Fefermanian predicates.
6.1 Two examples
Our first example addresses the case that we only demand that our Fefermanian predicate is Kreiselian.
Example 2
Consider the theory \(U := {\textsf {EA}}+{\square }_\beta \bot \). Here \(\beta \) numerates a single axiom for EA. Suppose there would be a Fefermanian predicate for U over EA that is Kreiselian. Say the witnessing predicate for the axiom set is \(\alpha \). Let \(\gamma (x) :\leftrightarrow \beta (x) \vee x= \underline{\ulcorner {\square }_\beta \bot \urcorner }\). We have \(\gamma \preceq \alpha \). Since, \(U \vdash {\square }_\beta \bot \), it follows that \(U \vdash {\square }_\gamma \bot \), and, hence, \(U \vdash {\square }_\alpha \bot \). So, \(U \vdash \bot \). Quod non. \(\lozenge \)
In the previous example, we needed an unsound theory. In our second example, we consider the case that our example satisfies absorption. Here we can use a sound theory.
Example 3
Let \(U := {\textsf {EA}}\). Consider a Fefermanian predicate P based on \(\alpha \) for EA over EA. We write \({\vartriangle }\) for P. We note that \(\beta \preceq \alpha \). Suppose we would have the absorption law for \({\vartriangle }\) and \({\square }_\beta \). Then, it would follow that:
So, by Löb’s Rule, \({\textsf {EA}} \vdash {\square }_\beta \bot \). Quod non. \(\lozenge \)
Question 1
We note that our examples are of finitely axiomatized theories. The construction of Sect. 5.2 gives us Fefermanian predicates for theories extending Peano Arithmetic. As pointed out in Remark 8, we can improve this to extensions of \({\textsf {EA}}+{\textsf {ref}}\). Obviously there is a big gap between our examples and counterexamples for the possibility to obtain a Fefermanian \({\vartriangle }\).
So, the question is whether we can find a larger class of theories for which we have a Fefermanian \({\vartriangle }\) that satisfies both the Löb Conditions and the Kreisel Condition and that satisfies the emission and absorption laws. \(\lozenge \)
6.2 Motivating remarks for our construction
We may construct the desired predicates in many ways. However, for didactic reasons, it good to maximize the meaningfulness of the construction.
As a first step, we note that we have the conditions of Theorems 1 and 3, for \({\square }_{\alpha ,(x)}\). So, to obtain absorption and emission, is already sufficient. The idea of our construction is simply to add closure under modus ponens and closure under HB in a minimal way to \( {\square }_{\alpha ,(x)}\).
The minimal way to obtain the addition of modus ponens is simply to close of the \(\alpha \)-theorems with proofs \(\le x\) under modus ponens. However, we can strengthen the analogy with our approach to the case of extensions of PA by working with a Hilbert system that only has modus ponens as a rule. Such deduction systems are described in [25] (first edition 1940) and in [3]. When we have such a system we can, for the definition of , consider the theorems whose proofs contain only axioms whether logical or non-logical which are \(\le x\). Thus, the main difference between our approach for the extensions of PA and the new one is that we stop treating logical and non-logical axioms as different.
What to do to obtain the Hilbert–Bernays condition? Simple: we add the true \(\varSigma ^0_1\)-sentences to our original axiom set.
There is a small technical complication, due to the lack of \(\varSigma ^0_1\)-collection, that necessitates us to stipulate a bound on the witnesses of the truth of the \(\varSigma ^0_1\) sentences involved in a proof, but this complication disappears as soon as we have \(\varSigma ^0_1\)-collection in the ambient theory.
6.3 The construction
We fix a Hilbert system \(\mathfrak H\) with as only rule modus ponens. Let \({\textsf {logic}}(x)\) be a \(\varDelta _0({\textsf {exp}})\)-formula that numerates the set axioms of \(\mathfrak H\) in EA.
We assume that a \(\varSigma ^0_1\)-sentence begins with a, possibly vacuous, existential quantifier.
We give the basic definitions for our approach. Let a theory U be given and a \(\varDelta _0({\textsf {exp}})\)-formula \(\alpha \) that numerates the axioms of U in EA (or, equivalently, in true arithmetic).
-
We define \({\textsf {ass}}^\circ (p)\) as the set of assumptions op p, where now a logical axiom also counts as an assumption. In other words, anything not proved from previous items using modus ponens counts as an assumption.
-
We write \({\textsf {proof}}^\circ _\gamma (p,x)\) for \({\textsf {proof}}(p,x) \wedge \forall y\in {\textsf {ass}}^\circ (p)\, \gamma (y)\).
-
We write for \(\exists p\, {\textsf {proof}}^\circ _\gamma (p,\underline{\ulcorner A \urcorner })\).
-
B is a direct \(\circ \)-subformula of A if A is of the form \((C\rightarrow B)\) or \((B \rightarrow C)\). The \(\circ \)-subformulas of A are the smallest set that contains A and is closed under taking direct \(\circ \)-subformulas.
-
\(\alpha ^+(a) := \alpha (a) \vee {\textsf {logic}}(a) \vee {\textsf {true}}(a)\).
Here true is a \(\varSigma ^0_1\)-truth predicate. We we take \({\textsf {true}}(a)\) to imply that a is a \(\varSigma ^0_1\)-sentence.
-
\({\widetilde{\alpha }}^+(a) := \alpha ^+(a)\wedge \mathcal S(a)\).
-
\(\alpha ^+_{x,z}(a) :\leftrightarrow (\alpha (a) \vee {\textsf {logic}}(a) \vee {\textsf {true}}^z(a)) \wedge a\le x\).
-
\(\alpha _x^+(a) :\leftrightarrow \alpha ^+(a) \wedge a\le x\).
-
.
We use \(\varTheta ^\circ _\alpha \) in the role of \(\theta \). So has the role of .
-
We define . So, has the role of \({\vartriangle }\).
In case we have \(\varSigma ^0_1\)-collection, the situation simplifies. We note that in the absence of \(\varSigma ^0_1\)-collection is not \(\varSigma ^0_1\) but \(\varSigma ^0_{1,1}\). See [31] or [32] for an explanation of \(\varSigma ^0_{1,1}\). We have:
Lemma 9
-
a.
.
-
b.
.
-
c.
.
-
d.
.
Proof
(a) is trivial. (b) is an immediate application of collection. (c) and (d) are analogous to the proof of Lemma 6, using respectively (a) and (b). \(\square \)
We start with a well-known lemma.
Lemma 10
Let \(\alpha \) be a \(\varDelta _0(\textsf {exp} )\)-predicate numerating the axiom set of U over EA. Then, \(\textsf {EA} \vdash \forall x, A \; {\square }_\alpha ({\square }_{\alpha ,(x)} A \rightarrow A)\).
Proof
We reason in EA. Suppose, for some \(p\le x\), we have \({\textsf {proof}}_\alpha (p,A)\). It clearly follows that \({\square }_\alpha A\) and, hence, a fortiori, \({\square }_\alpha ({\square }_{\alpha ,(x)} A \rightarrow A)\).
Suppose, for all \(q\le x\), we have \(\lnot \, {\textsf {proof}}_\alpha (q,A)\). If follows, by \(\varSigma ^0_1\)-completeness, that \({\square }_\alpha \forall q \le p\, \lnot \, {\textsf {proof}}_\alpha (q,A)\). In other words, \({\square }_\alpha \lnot \, {\square }_{\alpha ,(x)}A\). If follows that \({\square }_\alpha ({\square }_{\alpha ,(x)} A \rightarrow A)\). \(\square \)
The next lemma is in the spirit of the previous one, but takes a bit more work.
Lemma 11
Let \(\alpha \) be a \(\varDelta _0(\textsf {exp} )\)-predicate numerating the axiom set of U over EA. We have .
Proof
We will use a well-known fact, to wit that
This fact means that we do not have to worry that undesirable non-standard elements creep in below elements that are internally standard in EA.
We reason in EA. Let x be given.
We reason inside \({\square }_\alpha \). Suppose ($) . Let z and p witness . Keeping z fixed, we may, by the \(\varDelta _0({\textsf {exp}})\)-mimimum Principle, find a \(p_0\) that is minimal with this property.
Suppose that \(p_0\) contains a formula B twice. If B is the conclusion A of \(p_0\) we may omit the part after the first occurrence of A, obtaining a shorter proof. This contradicts the minimality of \(p_0\). If B is not the conclusion of \(p_0\), we may omit all occurrences of B after the first one, obtaining a shorter proof. This again contradicts the minimality of \(p_0\). We may conclude that all sentences in \(p_0\) occur only once in \(p_0\).
We claim that every formula that is a (sub)conclusion of \(p_0\) is a \(\circ \)-subformula of a formula in \({\textsf {ass}}^\circ (p_0)\). Suppose not. Let B be the first such formula. Clearly, B cannot be a \(\circ \)-assumption. So, it must be the conclusion of an application of modus ponens and, thus, a direct \(\circ \)-subformula of a previous formula of the form \((C\rightarrow B)\). But this formula is by assumption a \(\circ \)-subformula of \({\textsf {ass}}^\circ (p_0)\). A contradiction.
So, all sentences occurring in \(p_0\) are in \(\circ \)-subformulas of \({\textsf {ass}}^\circ (p_0)\) and occur only once. It follows that the sentences in \(p_0\) are all \(\le x\) and, hence, the number of these sentences is also \(\le x\). So, by our assumptions on coding, we find \(p_0 \approx x^x\). So, certainly \(p_0\) will be estimated by \(2^{x^2} +\underline{k}\), for a sufficiently large standard k.
It follows that \(\bigvee _{q \le 2^{x^2} +\underline{k}} \, {\textsf {proof}}^\circ _{\alpha ^+}(q,A)\) and, hence,
(Here the q are standard on the \({\square }_\alpha \)-external EA-level.)
Now, suppose (\(\ddag \)) \({\textsf {proof}}_{\alpha \cup {\textsf {true}}}(q,A)\), where \(q\le 2^{x^2} +\underline{k}\). We transform q as follows. Let \(\mathscr {S}\) be the set of the \(\varSigma ^0_1\)-sentences in \({\textsf {ass}}(q)\) that are not in \(\alpha \). It follows that all \(S\in \mathscr {S}\) are true. We transform q in two steps. First we form a proof \(q'\) from the assumptions \(({\textsf {ass}}(q) {{\setminus }} \mathscr {S} ) \cup \bigwedge \mathscr {S}\) with conclusion A. Then, we transform \(q'\) to \(q''\) with assumptions \({\textsf {ass}}(q) {\setminus } \mathscr {S}\) to \(\bigwedge \mathscr {S} \rightarrow A\). We note that the big conjunction is bounded by q and, thus, exists at the \({\square }_\alpha \)-external level.
We easily see that \(|q'|\) can be bounded by a linear term in |q|. The transformation \(q' \mapsto q''\) uses the deduction theorem. Inspection of the proof shows that here also \(|q''|\) is linear in \(|q'|\). Thus, \(q''\) is bounded by \(2^{\underline{m} x^2} +\underline{n}\), for appropriate standard m and n. We conclude that \(q''\) is also \({\square }_\alpha \)-external. We have found that \({\square }_{\alpha ,(q'')} (\bigwedge \mathscr {S} \rightarrow A)\), where \(q''\) is \({\square }_\alpha \)-external.
We apply Lemma 10 to obtain \(\bigwedge \mathscr {S} \rightarrow A\). We also have, since all elements of \(\mathcal S\) are true and \(\mathcal S\) is \({\square }_\alpha \)-external, that \(\bigwedge \{ {\textsf {true}}(S) \mid S \in \mathscr {S} \}\). From this it follows that \(\bigwedge \mathcal S\). Combining \(\bigwedge \mathscr {S} \rightarrow A\) and \(\bigwedge \mathcal S\), we find A.
By (\(\dag \)) we find A without assumption (\(\ddag \)). We now cancel ($) to obtain the sentence: .
We leave the \({\square }_\alpha \)-environment. We have shown , as desired. \(\square \)
We insert a quick corollary of Lemma 11.
Corollary 1
Let \(\alpha \) be a \(\varDelta _0({\mathsf{exp }})\)-predicate numerating the axiom set of U over EA. Then, .
Proof
We reason in EA. The left-to-right direction works as follows. We use Lemma 11.
The right-to-left direction is immediate since \({\square }_\alpha A\) implies \({\square }_{\alpha , (x)} A\), for some x, and \({\square }_{\alpha , (x)} A\) implies . \(\square \)
Lemma 12
Let \(\alpha \) be a \(\varDelta _0({\mathsf{exp }})\)-predicate numerating the axiom set of U over EA. Then, EA verifies \({\mathsf{W }}_{\alpha ,\varTheta ^\circ _\alpha }\).
Proof
The principle (a) follows by Lemma 11. The principle (b) follows by:
The principles (c) and (d) are immediate by the construction of . \(\square \)
By Theorem 14 and Lemma 12 we find:
Theorem 16
Let \(\alpha \) be a \(\varDelta _0({\mathsf{exp }})\)-predicate numerating the axiom set of U over EA. Then, the logic GLT is arithmetically valid in U for and \({\square }_\alpha \). In addition, we have HB over U both for \({\square }_\alpha \) and for \({\square }_{{\widetilde{\alpha }}}\). Finally, satisfies the Kreisel Condition in U.
Question 2
The predicate allows us to give an Orey Hájek Characterization for extensions of EA. Suppose \(\gamma \) is an elementary predicate that numerates the axioms of V over EA. Then, U is \(\varPi ^0_1\)-conservative over V iff, for all n, we have .
It seems to me that, using this characterization, many results in the work of Per Lindström and Christian Bennet should be transferable from the case of extensions of PA to the case of extensions of EA. It would be interesting to explore this.
Question 3
In our definition of , we counted instances of all proposition-logical and predicate-logical schemes as axioms. I think we only need to count the instances of a few specific schemes that are essentially predicate logical. It would be interesting to explore this.
7 An application
We first prove a very general result.
Theorem 17
Let U be any theory and suppose that \({\vartriangle }\) satisfies the Löb Conditions in U and HB, to wit, \(U \vdash S \rightarrow {\vartriangle }S\), for \(S\in \varSigma ^0_1\).Footnote 10 We have:
-
i.
U is \(\varPi _1^0\)-conservative over \(U+{\vartriangle }\bot \).
-
ii.
If \({\vartriangle }\) satisfies the Kreisel condition for U, then U is \(\varSigma _1^0\)-conservative over \(U+\lnot \, {\vartriangle }\bot \).
Proof
Suppose \({\vartriangle }\) satisfies the Löb conditions and the HB for U.
We prove (i). Let P be a \(\varPi ^0_1\)-sentence. Suppose \(U + {\vartriangle }\bot \vdash P\). Then, (a) \(U + \lnot \, P \vdash \lnot \, {\vartriangle }\bot \). Hence, \(U \vdash {\vartriangle }\lnot \, P \rightarrow {\vartriangle }\lnot \, {\vartriangle }\bot \). It follows by HB and by the formalized Second Incompleteness Theorem for \({\vartriangle }\), which follows by the Löb Conditions, that (b) \(U + \lnot \, P \vdash {\vartriangle }\bot \). Combining (a) and (b), we find \(U \vdash P\).
We prove (ii). Suppose that \({\vartriangle }\) satisfies the Kreisel Condition for U. Let S be a \(\varSigma ^0_1\)-sentence. Suppose \(U + \lnot \, {\vartriangle }\bot \vdash S\). It follows that \(U \vdash {\vartriangle }\bot \vee S\), and, hence, by the Löb Conditions and HB, \(U \vdash {\vartriangle }S\). By the Kreisel Condition, we find \(U \vdash S\). \(\square \)
The proof of (i) is ascribed by Per Lindström, in [20, p94], to Georg Kreisel in [17].
Consider any recursively enumerable theory U and let \(\alpha \) be a \(\varDelta _0({\textsf {exp}})\)-formula that numerates a set of axioms for U in EA. We note that, by Craig’s trick, we can always find such a \(\varDelta _0({\textsf {exp}})\)-formula. We take . We note that \({\vartriangle }\) fulfills the conditions of Theorem 17 for U. It follows that \({\vartriangle }_\alpha \bot \) is a \(\varSigma ^0_1\)-sentence such that U is \(\varPi ^0_1\)-conservative over \(U+{\vartriangle }_\alpha \bot \) and \(\lnot \,{\vartriangle }_\alpha \bot \) is a \(\varPi ^0_1\)-sentence such that U is \(\varSigma ^0_1\)-conservative over \(U+{\vartriangle }_\alpha \bot \).
For extensions U of Peano Arithmetic, the existence of a \(\varSigma ^0_1\)-sentence S, such that U is \(\varPi ^0_1\)-conservative over \(U+S\) and U is \(\varSigma ^0_1\)-conservative over \(U+\lnot \,S\) is a special case of a result due to Robert Solovay. See [8]. See also [20, Chapter 5].
Remark 9
We note that \({\vartriangle }_\alpha \bot \) is a fortiori a Rosser sentence for U. The resulting proof of Rosser’s Theorem is like the proof of the Second Incompleteness Theorem in the sense that the sentence under consideration is self-reference-free, but in the proof of the desired property we do use self-reference. Another example of a self-reference-free \(\varSigma ^0_1\) Rosser sentence (for extensions of PA) is due to Fedor Pakhomov. See [23]. We note that Pakhomov’s construction is, in a sense, orthogonal to ours. An essential feature of Pakhomov’s construction is that, like the ordinary Rosser sentence and its opposite, it produces \(\varSigma ^0_1\)-sentences \(S_0\) and \(S_1\), each with the Rosser property over U, such that we have \(U \vdash \lnot \, (S_0\wedge S_1)\) and \(U \vdash {\square }_\alpha \bot \leftrightarrow (S_0\vee S_1)\). It follows that e.g. \(U \vdash S_0 \rightarrow \lnot \, S_1\), but \(U\nvdash \lnot \, S_1\). So, \(S_0\) is not \(\varPi ^0_1\)-conservative. The non-\(\varPi ^0_1\)-conservativity of Pakhomov’s sentences is an important feature since it allows him to use them for his alternative proof of Solovay’s arithmetical completeneness theorem for Löb’s Logic. \(\lozenge \)
We formulate a consequence of Theorem 11.
Theorem 18
Suppose U is a recursively enumerable extension of PA. Then there is a \(\varSigma ^0_1\)-predicate \(\sigma ^*\) such that \({\langle U,U,\sigma ^* \rangle }\) is Fefermanian and \({\square }_{\sigma ^*}\) satisfies, in U, the Löb-conditions and the Kreisel condition.
Proof
Consider a recursively enumerable extension U of PA. We can easily construct a \(\varSigma ^0_1\)-formula \(\sigma \) such that \({\langle {\textsf {PA}},U,\sigma \rangle }\) is Fefermanian and \(\pi \preceq \sigma \). We now apply Theorem 11 to obtain the desired \(\sigma ^*\). \(\square \)
It follows from Theorems 17(ii) and 18 that:
Theorem 19
Suppose U is a recursively enumerable extension of PA. Then, there is a \(\varSigma ^0_1\)-predicate \(\sigma ^*\) such that \({\langle U,U,\sigma ^* \rangle }\) is Fefermanian and \(\diamond _{\sigma ^*}\top \) is \(\varSigma ^0_1\)-conservative over U.
Theorem 19 stands in interesting contrast to a result due to Craig Smoryński that is reported in Example 1.6 of [8].
Theorem 20
(Smoryński). Suppose U is an extension of PA such \({\langle {\mathsf{PA }},U,\sigma \rangle }\) is Fefermanian and \(\alpha \) is \(\varDelta _1^0({\mathsf{PA }})\). Then, \(\diamond _\alpha \top \) is \(\varSigma ^0_1\)-conservative over U iff U is \(\varSigma ^0_1\)-sound.
Inspection of the proof shows that no special properties of PA are used (except its \(\varSigma ^0_1\)-soundness when it occurs in the role of base theory), so, in fact, we have a far more general result. However, we will not pursue that line here. We just present a variation of Smoryński’s result for extensions of PA.
Theorem 21
Suppose \({\langle {\mathsf{PA }},U,\sigma \rangle }\) is Fefermanian, where \(\sigma \) is \(\varSigma ^0_1\). Then, the following are equivalent.
-
i.
U is \(\varSigma ^0_1\)-sound.
-
ii.
\({\square }_\sigma \) satisfies the Kreisel condition for U.
-
iii.
\(\diamond _{\sigma }\top \) is \(\varSigma ^0_1\)-conservative over U.
Proof
Suppose \({\langle {\textsf {PA}},U,\sigma \rangle }\) is Fefermanian, where \(\sigma \) is \(\varSigma ^0_1\).
(i) \(\Rightarrow \) (ii). Suppose U is \(\varSigma ^0_1\)-sound. We note that the fact that \({\langle {\textsf {PA}},U,\sigma \rangle }\) is Fefermanian, implies that \(\sigma \) truly enumerates the Gödel codes of an axiom set of U. Suppose \(U \vdash {\square }_\sigma A\). Then, \({\square }_\sigma A\) is true, and, hence \(U \vdash A\).
(ii) \(\Rightarrow \) (iii). This is immediate by Theorem 17.
(iii) \(\Rightarrow \) (i). Suppose \(U+\diamond _{\sigma }\top \) is \(\varSigma ^0_1\)-conservative over U. By Theorem 9, we have \(U\vdash \forall A\, ({\square }_{\hat{\sigma }}A \leftrightarrow {\square }_\sigma A)\). So, \(U+\diamond _{\hat{\sigma }}\top \) is \(\varSigma ^0_1\)-conservative over U. By Theorem 10, the triple \({\langle {\textsf {PA}},U, \hat{\sigma } \rangle }\) is Fefermanian. Moreover, \(\hat{\sigma }\) is elementary and, hence, a fortiori, \(\varDelta ^0_1({\textsf {PA}})\). So, by Theorem 20, we find that U is \(\varSigma ^0_1\)-sound. \(\square \)
The contrast between Theorems 19 and 21, illustrates well how sensitive the choice of the base theory may be already in the case where we consider \(\varSigma ^0_1\)-axiomatizations. Moreover, this contrast may be a warning against uncritical use of Craig’s trick of the form we may always replace a \(\varSigma ^0_1\)-axiomatization by a \(\varDelta _0({\textsf {exp}})\)-axiomatization. This is only unproblematic when we have a \(\varSigma ^0_1\)-sound base theory and consider extensions of \({\textsf {EA}}+\mathrm B\varSigma ^0_1\). In all other cases, some care is needed.
Notes
This means, roughly, that we use a standard arithmetization of syntax and proofs, but tinker just with the representation of the axiom set. See Sect. 3.3, for a detailed explanation.
Everything in the paper lifts to the more general case where a theory of arithmetic is interpretable in the given theory. However, it is pleasant to avoid the extra notational burden of the more general case. The notational burdens of the present paper seem to be sufficiently heavy already.
Usually, there is some overhead in defining sequences since we want to add some materials to make the definition of the projection function easy. However, the uses of sequences to define syntax and proofs usually only require that we can determine whether something occurs in a sequence before something else. For this one does not need the extra material.
As remarked above the other conditions suffer, admittedly to a lesser degree, from the same defect.
If we consider a \(\varSigma ^0_1\)-axiomatization in a context without \(\varSigma ^0_1\)-collection, the Löb conditions may fail. However, even in such Löbless cases, the Second Incompleteness Theorem holds. See [35].
Our argument also works under the weaker assumption that \(\pi \preceq _{\textsf {PA}} \alpha \).
I use the round brackets to distinguish the intended notion from \({\square }_{\alpha ,x}\) which is used in some of the literature for \({\square }_{\alpha _x}\), where \(\alpha _x(y) :\leftrightarrow \alpha (x) \wedge y \le x\).
We note that, in this step, we use that (the code of) the conclusion is smaller than (the code of) the proof.
We note that in the present general context the Löb Conditions and HB are mutually independent.
References
Ardeshir, M., Mojtahedi, M.: The \({\Sigma }_1\)-provability logic of HA. Ann. Pure Appl. Log. 169(10), 997–1043 (2018)
Beklemishev, L.D.: On the classification of propositional provability logics. Math. USSR-Izv. 35(2), 247–275 (1990)
Feferman, S.: Arithmetization of metamathematics in a general setting. Fundam. Math. 49, 35–92 (1960)
Freund, A.: Slow reflection. Ann. Pure Appl. Log. 168(12), 2103–2128 (2017)
Freund, A., Pakhomov, F.: Short proofs for slow consistency. Notre Dame J. Formal Log. 61(1), 31–49 (2020)
Friedman, S.D., Rathjen, M., Weiermann, A.: Slow consistency. Ann. Pure Appl. Log. 164(3), 382–393 (2013)
Grabmayr, B.: On the invariance of Gödel’s Second Theorem with regard to numberings. (2018). ArXiv preprint, arXiv:1803.08392
Guaspari, D.: Partially conservative extensions of arithmetic. Trans. Am. Math. Soc. 254, 47–68 (1979)
Henk, P.: Nonstandard provability for Peano arithmetic, a modal perspective. Ph.D thesis, ILLC, Amsterdam (2016). http://www.illc.uva.nl/Research/Publications/Dissertations/DS/
Hájek, P., Pudlák, P.: Metamathematics of First-Order Arithmetic, Perspectives in Mathematical Logic. Springer, Berlin (1993)
Henk, P., Pakhomov, F.: Slow and ordinary provability for Peano arithmetic (2016). ArXiv preprint arXiv:1602.01822
Henk, P., Visser, A.: Interpretability suprema in Peano arithmetic. Arch. Math. Log. 56(5–6), 555–584 (2017)
Halbach, V., Visser, A.: Self-reference in arithmetic I. Rev. Symb. Log. 7, 671–691 (2014)
Henk, P., Yu Shavrukov, V.: A Solovay function for the least 1-inconsistent subtheory of PA. ILLC e-prints, ILLC (2016). https://eprints.illc.uva.nl/567/
Kalsbeek, M.B.: Towards the interpretability logic of \({\rm I}{\Delta }_0+{\sf EXP}\). Logic Group Preprint Series 61, Faculty of Humanities, Philosophy, Utrecht University (1991). https://lgps.sites.uu.nl
Kreisel, G.: On a problem of Henkin’s. Indag. Math. 15, 405–406 (1953)
Kreisel, G.: On weak completeness of intuitionistic predicate logic. J. Symb. Log. 27(2), 139–158 (1962)
Kurahashi, T.: Arithmetical completeness theorem for modal logic K. Stud. Log. 106(2), 219–235 (2018)
Kurahashi, T.: Arithmetical soundness and completeness for \(\Sigma _2\)-numerations. Stud. Log. 106(6), 1181–1196 (2018)
Lindström, P.: Aspects of Incompleteness. Lecture Notes in Logic, vol. 10. ASL/A.K. Peters, Natick (2003)
Löb, M.H.: Solution of a problem of Leon Henkin. J. Symb. Log. 20, 115–118 (1955)
Montagna, F.: On the algebraization of a Feferman’s predicate (the algebraization of theories which express Theor; X). Stud. Log. 37, 221–236 (1978)
Pakhomov, F.: Solovay’s completeness without fixed points. In: Kennedy, J., de Queiroz, R.J.G.B. (eds.) Logic, Language, Information, and Computation, pp. 281–294. Springer, Berlin, Heidelberg (2017)
Parikh, R.: Existence and feasibility in arithmetic. J. Symb. Log. 36(3), 494–508 (1971)
Quine, W.V.: Mathematical Logic, Revised edn. Harvard University Press, Cambridge (1996)
Rathjen, M.: Long sequences of descending theories and other miscellanea on slow consistency. J. Log. Appl. 4(4), 1411–1426 (2017). Special issue dedicated to Gregori Mints
Smullyan, R.M.: Theory of Formal Systems, Annals of Mathematics Studies, vol. 47. Princeton University Press, Princeton (1961)
Visser, A.: Peano’s smart children: a provability logical study of systems with built-in consistency. Notre Dame J. Formal Log. 30(2), 161–196 (1989)
Visser, A.: Interpretability logic. In: Petkov, P.P. (ed.) Mathematical Logic, Proceedings of the Heyting 1988 summer school in Varna, Bulgaria, pp. 175–209. Plenum Press, Boston (1990)
Visser, A.: The second incompleteness theorem and bounded interpretations. Stud. Log. 100(1–2), 399–418 (2012). https://doi.org/10.1007/s11225-012-9385-z
Visser, A.: Peano Corto and Peano Basso: a study of local induction in the context of weak theories. Math. Log. Q. 60(1–2), 92–117 (2014)
Visser, A.: Oracle bites theory. In: Gosh, S., Szymanik, J. (eds.) The Facts Matter, Essays on Logic and Cognition in Honour of Rineke Verbrugge, pp. 133–147. College Publications, London (2015)
Visser, A.: The second incompleteness theorem: reflections and ruminations. In: Horsten, L., Welch, P. (eds.) Gödel’s Disjunction: The Scope and Limits of Mathematical Knowledge, pp. 67–90. Oxford University Press, Oxford (2016)
Visser, A.: Transductions in arithmetic. Ann. Pure Appl. Log. 167(3), 211–234 (2016)
Visser, A.: Another look at the second incompleteness theorem. Rev. Symb. Log. 13(2), 1–27 (2019)
Visser, A., Zoethout, J.: Provability logic and the completeness principle. Ann. Pure Appl. Log. 170(6), 718–753 (2019)
Wilkie, A.J., Paris, J.B.: On the scheme of induction for bounded arithmetic formulas. Ann. Pure Appl. Log. 35, 261–302 (1987)
Yu Shavrukov, V.: A smart child of Peano’s. Notre Dame J. Formal Log. 35, 161–185 (1994)
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
I am grateful to Fedor Pakhomov for sharing his ideas about slow provability and for a number of corrections to an earlier version of this paper. I thank Volodya Shavrukov for asking good questions and for pointing me to some references. I am grateful to Michael Beeson for pointing out a mistake and an ambiguity. I thank the anonymous referee for excellent suggestions.
Examples
Examples
For convenience, we repeat our overview of the examples.
Base | Lead | P | Löb | Kreisel | Feferman | |
---|---|---|---|---|---|---|
Example 4 | EA | EA | \(\varSigma ^0_1\) | \(+\) | \(+\) | \(+\) |
Example 5 | EA | EA | \(\varSigma ^0_1\) | \(+\) | \(+\) | − |
Example 6 | EA | \({\textsf {EA}}+{\square }_\beta \bot \) | \(\varSigma ^0_1\) | \(+\) | − | \(+\) |
EA | EA | \(\varSigma ^0_2\) | ||||
Example 7 | EA | EA | \(\varSigma ^0_1\) | \(+\) | − | − |
Example 8 | PA | PA | \(\varSigma ^0_2\) | − | \(+\) | \(+\) |
EA | EA | \(\varSigma ^0_{1,1}\) | ||||
Example 9 | EA | EA | \(\varSigma ^0_1\) | − | \(+\) | − |
Example 10 | PA | PA | \(\varSigma ^0_2\) | − | − | \(+\) |
EA | \({\textsf {EA}}+{\square }_\beta {\square }_\pi \bot \) | \(\varSigma ^0_{1,1}\) | ||||
Example 11 | EA | EA | \(\varSigma ^0_1\) | − | − | − |
Example 4
\({+}{+}{+}\): We take \(U_0 := U := {\textsf {EA}}\) and \(P := {\textsf {prov}}_\beta \). Clearly, this P satisfies all three conditions for EA. We note that our example satisfies HB too. \(\lozenge \)
Example 5
\({+}{+}{-}\): By Theorem 16, there is a \(\varSigma ^0_1\)-predicate P for \(U := {\textsf {EA}}\), that satisfies Löb Conditions and the absorption principle in combination with \({\square }_\beta \). By Example 3, the predicate P cannot be Fefermanian.
Here is a second example. We consider \({\textsf {cfprov}}_\beta (x)\), which stands for cut-free provability in EA. Let’s write \({\square }^{\textsf {cf}}_\beta A\) for \({\textsf {cfprov}}(\underline{\ulcorner A \urcorner })\). We have Löb’s Logic for \({\square }^{\textsf {cf}}_\beta \). See [29] and [15]. Also we easily see that \({\square }^{\textsf {cf}}_\beta \) satisfies the Kreisel condition. However, \({\square }^{\textsf {cf}}_\beta \) cannot be Fefermanian for EA over EA. If it were Fefermanian, we would have \({\textsf {EA}} \vdash {\square }_\beta \bot \rightarrow {\square }_\beta ^{\textsf {cf}}\bot \). To prove that this is impossible is outside the scope of the present article. We just give the outline of the proof, so that the reader can see the basic idea.
Suppose \({\textsf {EA}} \vdash {\square }_\beta \bot \rightarrow {\square }_\beta ^{\textsf {cf}}\bot \). It follows that \({\textsf {EA}} \vdash \diamond ^{\textsf {cf}}_\beta \top \rightarrow \diamond _\beta \top \). Then, by a meta-theorem from [37], it follows that (a) \({\textsf {S}}^1_2 + \diamond ^{\textsf {cf}}_\beta \top \vdash \diamond _\beta ^J\top \), for a definable cut J. We also have that (b) EA interprets \({\textsf {S}}^1_2 + \diamond ^{\textsf {cf}}_\beta \top \). Combining (a) and (b), we find that EA interprets \({\textsf {S}}^1_2+\diamond _\beta \top \). But this contradicts the Second Incompleteness Theorem.
We note that our examples also satisfy HB. A disadvantage is that they do not work for the global version of the Löb Conditions, where the quantifiers over sentences for L2 and L3 are inside the theory. It would be interesting to have an example for this case. \(\lozenge \)
Example 6
\({+}{-}{+}\): Here is an example of a Fefermanian predicate that does satisfy the Löb Conditions and does not satisfy the Kreisel Condition. Let \(U_0:= {\textsf {EA}}\), \(U:={\textsf {EA}}+{\square }_\beta \bot \). Let \(P := {\textsf {prov}}_{\gamma }\), where \(\gamma (x) :\leftrightarrow \beta (x) \vee x = \underline{\ulcorner {\square }_\beta \bot \urcorner }\).
We have \(U\vdash {\square }_\gamma \bot \), but \(U\nvdash \bot \), so the Kreisel condition fails for P and U.
We note that our example also works for \(U_0 := U:={\textsf {EA}}+{\square }_\beta \bot \).
We provide a second example, where the base and the lead theories are sound. By Theorem 6, the predicate that represents the axioms cannot be \(\varSigma ^0_1\).
We take \(U_0 := U := {\textsf {EA}}\). We define: \( \delta (x) :\leftrightarrow \beta (x) \vee (\diamond _\beta \top \wedge x = \underline{\ulcorner \bot \urcorner })\). We note that \(\delta \) numerates \(\{ B \}\) in EA, where B is the single axiom for EA. We find:
So \({\textsf {EA}} \vdash {\square }_\delta \bot \). It follows that \({\textsf {prov}}_\delta \) is not Kreiselian and satisfies the Feferman Conditions and the Löb Conditions. We note that \({\textsf {prov}}_\delta \) is \(\varSigma ^0_2\). \(\lozenge \)
Example 7
\({+}{-}{-}\): Let \(U_0 := U := {\textsf {EA}}\) and let P be \(x=x\). Clearly, P satisfies the Löb conditions in EA, but P is not Kreiselian. Since EA is sound and P is \(\varSigma ^0_1\), a Fefermanian P must be Kreiselian. So, P is also not Fefermanian. \(\lozenge \)
Example 8
\({-}{+}{+}\): The case of Fefermanian predicates that do not satisfy the Löb Conditions is among the most interesting of our cases. The study of the possibilities for such predicates for the case of extensions of Peano Arithmetic has been taken up by Taishi Kurahashi in great depth. See [18, 19].
A classical example of such a predicate is Feferman Provability. We define \(\pi ^\star (y) :\leftrightarrow \exists x \,(\pi _x(y) \wedge \diamond _{\pi _x}\top )\). Let \(P := {\textsf {prov}}_{\pi ^\star }\). This predicate was introduced by Solomon Feferman in his classical paper [3]. By the essential reflexivity of PA, one finds that \({\square }_{\pi ^\star }\) is Fefermanian for PA over PA. For closely related reasons \({\square }^\star \) is Kreiselian. However, \({\square }^\star \) does not satisfy the Löb Conditions. The bimodal provability logic of \({\square }_\pi \) and \({\square }_{\pi ^\star }\) has been characterized by Volodya Shavrukov in [38]. For some earlier work, see [22, 28]. We note that \({\square }_{\pi ^\star }\) is \(\varSigma ^0_2\).
An example of quite different flavor uses the fact that EA does not verify \(\varSigma ^0_1\)-collection. We refer the reader to [35, Subsection 6.2]. This example provides a \(\varSigma ^0_1\)-axiomatization \(\sigma \). As a consequence \({\textsf {prov}}_\sigma \) is \(\varSigma ^0_{1,1}\). We refer the reader to e.g. [31] for a further explanation of the relevant formula hierarchy. \(\lozenge \)
Example 9
\({-}{+}{-}\): Here is an example of a P that satisfies the Kreisel Condition but not the Löb Conditions and the Feferman Condition. Let \(U_0 := U:= {\textsf {EA}}\). Let \(P(x) := ({\textsf {prov}}_\pi (x) \wedge x \ne \, \underline{\ulcorner \bot \urcorner })\).
We note that L2 fails for P over EA. This shows that P does not satisfy the Löb Conditions and, hence, cannot be Fefermanian. \(\lozenge \)
Example 10
\({-}{-}{+}\): The examples are adaptations of the predicates and theories in Example 8. We just add something to make the examples non-Kreiselian. We use the notations of Example 8.
We give our first example of a non-Kreiselian Fefermanian predicate that does not satisfy the Löb Conditions. Let \(U_0:= U := {\textsf {PA}}\). We take:
Let \(P := {\textsf {prov}}_{\pi ^\circ }\).
It is easily seen that \(\pi ^\circ \) numerates the axioms of PA in PA. We have, using the fact that we have HB for \({\square }_{\pi ^\star }\):
It follows that \({\textsf {PA}} \vdash {\square }_{\pi ^\circ }{\square }_\pi \bot \). However, \({\textsf {PA}} \nvdash {\square }_\pi \bot \), so \({\textsf {prov}}_{\pi ^\circ }\) is not Kreiselian. We have:
So \({\textsf {PA}} \vdash \lnot \, {\square }_{\pi ^\circ } \bot \). Thus, \({\square }_{\pi ^\circ }\) cannot satisfy the Löb Conditions. Finally, \({\square }^\circ \) is clearly \(\varSigma ^0_2\).
Here is our second example. The presentation of our example presupposes that the reader has [35, Subsection 6.2] at hand. The predicate \(\sigma \) is imported here from that paper. We take \(U_0:={\textsf {EA}}\), \(U := {\textsf {EA}}+{\square }_\beta {\square }_\pi \bot \). We define
Clearly, \(\sigma ^\circ \) numerates the axioms of U in EA. We take \(P(x) := {\textsf {prov}}_{\sigma ^\circ }(x)\). Evidently, P is Fefermanian for \({\textsf {EA}}+{\square }_\beta {\square }_\pi \bot \) over EA.
Since, we have \({\textsf {EA}} \vdash {\square }_\beta C \rightarrow {\square }_\sigma C\) and \({\textsf {EA}} \vdash {\square }_\sigma C \rightarrow {\square }_{\sigma ^\circ } C\). We find \(U \vdash {\square }_{\sigma ^\circ } {\square }_\pi \bot \). Suppose we would have \(U \vdash {\square }_\pi \bot \). In would follow that \({\textsf {EA}} + {\square }_\beta {\square }_\pi \bot \vdash {\square }_\pi \bot \), and, hence, \({\textsf {EA}} \vdash {\square }_\pi \bot \). Quod non. Thus \(U \nvdash {\square }_\pi \bot \). So, P is not Kreiselian.
We note that over U we have, by \(\varSigma ^0_1\)-completeness, that \({\square }_\sigma \) and \({\square }_{\sigma ^\circ }\) coincide also in iterated \({\square }_\sigma \)-contexts. Suppose \({\square }_{\sigma ^\circ }\) satisfies the Löb Conditions over U. It follows that \({\square }_{\sigma }\) also satisfies the Löb Conditions over U. So, a fortiori, we find \(U \vdash {\square }_\sigma \diamond _\sigma \top \rightarrow {\square }_\sigma \bot \). By Lemma 6.11 and Lemma 6.12 of [35, Subsection 6.2], we find:
It follows that \({\textsf {EA}} +{\square }_\pi \bot + {\textsf {S}}^\star \vdash {\square }_\beta {\square }_\beta \bot \). However, we can construct a model of \({\textsf {EA}}+{\square }_\pi \bot + {\textsf {S}}^\star + \lnot {\square }_\beta {\square }_\beta \bot \) using the construction described in [35, Subsection 6.2]. \(\lozenge \)
Example 11
\({-}{-}{-}\): We take \(U_0 := U := {\textsf {EA}}\) and \(P(x) := \bot \). It is clear that P does not satisfy the Löb Conditions. Nor is it Kreiselian or Fefermanian. \(\lozenge \)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Visser, A. The absorption law. Arch. Math. Logic 60, 441–468 (2021). https://doi.org/10.1007/s00153-020-00752-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00153-020-00752-5