1 Introduction

Deductions in normal form in Gentzen’s formalisation of intuitionist logic in natural deduction [2, 186] satisfy the subformula property. Precise definitions will be given in the next section, but for this preliminary discussion, the following explanations of terminology suffice. A deduction is in normal form if it contains no maximal formula,Footnote 1 that is a formula that is the conclusion of an introduction rule and the major premise of an elimination rule for its main connective. A deduction has the subformula property if any formula that occurs on it is a subformula of either an undischarged assumption or of the conclusion. The normalisation theorem for intuitionist logic states that any deduction can be brought into normal form, and so for any deduction of A from assumptions \(\Gamma \) in intuitionist logic, there is a deduction of A from some of the assumptions in \(\Gamma \) that satisfies the subformula property. This was shown by Prawitz [8, Ch IV].Footnote 2

Systems of natural deduction for classical propositional logic in which any deduction converts into one with the subformula property have proved to be more elusive. In general, deductive systems for classical logic with this property relinquish some of the features of natural deduction for intuitionist logic. The obvious route is to abandon single conclusions and adopt Gentzen’s sequent calculus. Deductions in normal form in Prawitz’s system of natural deduction for classical logic, for instance, only satisfy a restricted version of the subformula property: allowance must be made for assumptions of the form \(\lnot A\) that are discharged by classical reductio ad absurdum and formulas \(\bot \) concluded from them [8, 42].

The main result of this paper is a proof that in a system of natural deduction for classical propositional logic \(\mathbf{c} \) put forward by Milne [5], every deduction converts into one in normal form, and every deduction in normal form satisfies the subformula property without any restrictions. The system has some rather original features. It couples the well-known general elimination rules for \(\vee \), \(\wedge \), \(\supset \)Footnote 3 with introduction rules that are unusual in that, rather than permitting the conclusion of a formula with the connective they govern as main operator, they permit the discharge of such a formula. These were called ‘general introduction rules’ by Negri and von Plato, who formalised a system of intuitionist logic with such rules [6, 217]. While Negri and von Plato use \(\bot \) as primitive, and define negation in terms of it, Milne has general introduction and elimination rules for a primitive negation operator. Milne’s general introduction rules for implication are also unexpected and original. Negri and von Plato’s system is the subject of an independent investigation [3], in which normalisation and subformula property are proved for this system, too and general introduction and elimination rules for identity are also considered.

Milne’s system is closely related to a formalisation of classical logic by Michel Parigot [7]. The main difference is that Parigot’s system is formalised in sequent calculus, but translating one framework into the other, they are, mutatis mutandis, the same. A brief section of this paper compares the two systems.

Milne observes that, while adding rules for the universal quantifier to his system cannot produce one in which any deduction can be transformed into one that satisfies the subformula property, this is not so for the existential quantifier. The penultimate section of the present paper sheds some further light on this result: if the universal quantifier is added, the reduction procedures for maximal formulas with negation as main operator, and one of the reduction procedures for maximal formulas with implication as main operator, are no longer applicable. Thus deductions in the system do not normalise. The problem does not arise when the existential quantifier is added: any deduction in C plus \(\exists \) converts into one in normal form, and deductions in normal form have the subformula property.

Finally, the conclusion considers transposing general introduction rules back into the more usual form of introduction.

Milne gives a model-theoretical and non-constructive proof that in his system, if, whenever all of \(\Gamma \) are true, then A is true, then there is a deduction of A from \(\Gamma \) that has the subformula property [4]. By soundness it follows that for any deduction in his system, there is one that satisfies the subformula property. Here this result is proved proof-theoretically and constructively as a consequence of the normalisation theorem for the system.

The results of this paper are of some philosophical interest. Normalisation and subformula property of deductions in normal form are commonly regarded as necessary conditions for harmony.Footnote 4 Harmony consists in a certain balance in the rules governing a logical constant: an elimination rule for a logical connective should not license the deduction of more conclusions from a formula with it as main operator than are justified by its introduction rules. Prawitz’s formalisation of classical logic of [8], for instance, lacks the subformula property for normal deductions and its rules for negation are not considered to be harmonious. The disappearance of a negation from the assumptions and conclusions of a deduction in those cases where the strict subformula property is violated establish that too much has been derived from the discharged assumptions: Prawitz’s rules for classical negation license the deduction of more conclusions from formulas of the form \(\lnot A\) than are justified by the introduction rule governing this connective. A more detailed discussion of the philosophical implications of the results presented here goes beyond the purposes of this paper.

2 A system of classical propositional logic

The definition of the language of C is standard.

Definition 1

(Connective, atomic formula, degree of a formula) \(\lnot \), \(\supset \), \(\wedge \) and \(\vee \) are the connectives. An atomic formula is one that contains no connective. The degree of a formula is the number of connectives occurring in it.

Occasionally, the discussion mentions \(\bot \). Being a connective, it is not an atomic formula, but a formula of degree 1.

Deductions in C have the familiar tree shape, with the (discharged or undischarged) assumptions at the top-most nodes or leaves and the conclusion at the bottom-most node or root. The conclusion of a deduction is said to depend on the undischarged assumptions of the deduction. Similar terminology is applied to subdeductions of deductions.

Every assumption in a deduction belongs to an assumption class, marked by a natural number, different numbers for different assumption classes. Formula occurrences of different typesFootnote 5 must belong to different assumption classes. Formula occurrences of the same type may, but do not have to, belong to the same assumption class. Discharge of assumptions is marked by a square bracket around the formula: \([A]^i\), i being the assumption class to which A belongs, with the same label of the assumption class also occurring at the application of the rule at which the assumption is discharged. Assumption classes are chosen in such a way that if one assumption of an assumption class is discharged by an application of a rule, then it discharges all assumptions of said assumption class. Empty assumption classes are permitted: they are used in vacuous discharge, when a rule that allows for the discharge of assumptions is applied with no assumptions being discharged.

Upper case Greek letters \(\Sigma \), \(\Pi \), \(\Xi \), possibly with subscripts or superscripts, denote deductions. In general some of the assumptions and the conclusion of the deduction are mentioned explicitly at the top and bottom of \(\Sigma \), \(\Pi \), \(\Xi \). Using the same designation more than once to denote subdeductions of a deduction means that these subdeductions are exact duplicates of each other apart from, possibly, the labels for assumption classes: the deductions have the same structure and at every node formulas of the same type are premises and conclusions of applications of the same rules.

Definition 2

(Deduction in C)

  1. (i)

    The formula occurrence \(A^n\), where n marks an assumption class, is a deduction in C of A from the undischarged assumption A.

  2. (ii)

    If \(\Sigma \), \(\Pi \), \(\Xi \) are deductions in C, then following are deductions of C in C from the undischarged assumptions in \(\Sigma \), \(\Pi \), \(\Xi \) apart from those in the assumption classes i and j, which are discharged:

figure a
  1. (iii)

    Nothing else is a deduction in C.

In writing deductions we may suppress the label indicating the rule applied, but the labels indicating discharge must always be present.

A few words on the rules for implication will be in order. In Negri and von Plato’s intuitionist system with general introduction and elimination rules, the introduction rule for \(\supset \) permits the discharge of an assumption A above its first premise B. Thus letting C be \(A\supset B\) in their rule derives the usual introduction rule for \(\supset \), which licenses the deduction to \(A\supset B\) from a deduction of B from A. TR is named after Tarski: Milne calls it Tarski’s Rule. Following the patterns of the other rules, it is treated as a second general introduction rule for \(\supset \). The usual introduction rule for \(\supset \) is derived in C in the following way:

figure b

The usual negation introduction rule, reductio ad absurdum, licenses the deduction of \(\lnot A\) from a deduction of a contradiction from A. It is derived like this in C:

figure c

Classical reductio ad absurdum, which licenses the deduction of A from a deduction of a contradiction from \(\lnot A\), is derived by interchanging A and \(\lnot A\). Conversely, Milne’s general introduction rules are clearly valid rules of inference of classical logic. Thus we have a system of classical propositional logic.

When added to the system, \(\bot \) is governed by a sole elimination rule, ex falso quodlibet. It licenses the deduction of any formula from \(\bot \). The rule may be restricted to atomic conclusions. It is the limiting case of a general elimination rule where there are no side-deductions with discharged assumptions corresponding to subformulas of the major premise above them. Having no introduction rule, it is trivially governed only by rules of the same form as the other connectives.

Notice that \(\lnot E\) has the form of a general elimination rule. This can be seen by treating \(\lnot A\) as \(A\supset \bot \) and replacing \(\bot \) for B in \(\supset E\). As everything follows from \(\bot \), a side-deduction showing that the conclusion of the application of the rule follows from \(\bot \) is redundant.

For recording from which assumptions a conclusion has been derived, it suffices to list the assumption classes to which the undischarged assumptions of the deduction belong. This will be a multiset. If a set is preferred, delete multiple occurrences of assumption classes of formulas of the same type. We can write \(\Gamma \vdash _{\mathbf {C}} A\) if there is a deduction of (the formula occurrence) A from (occurrences of) some of the formulas in \(\Gamma \) in C.

The premise A of \(\supset E\) and \(\lnot A\) and C in all four elimination rules are normally called the minor premises, but in the current system it is useful to have terminology that allows to distinguish them.

Definition 3

(Terminology for premises and discharged assumptions)

  1. (i)

    In applications of the elimination rules, formula occurrences taking the places of \(A\wedge B\), \(A\supset B\), \(A\vee B\) and \(\lnot A\) to the very left above the line are the major premises; formula occurrences taking the places of C are the arbitrary premises, and a formula occurrence taking the place of A in an application of \(\supset E\) and in \(\lnot E\) is the minor premise of those rules.

  2. (ii)

    In applications of \(\wedge I\), \(\vee I\), \(\supset I\), formula occurrences taking the places of A and B to the very left above the line are the specific premises (both in case of \(\wedge I\)), and those taking the place of C to their right are also called the arbitrary premises; formula occurrences taking the places of the discharged assumptions \(A\wedge B\), \(A\vee B\), \(A\supset B\) and \(\lnot A\) are the major assumptions discharged by applications of \(\wedge I\), \(\vee I\), \(\supset I\), TR and \(\lnot I\), and those taking the places of the discharged assumptions A in \(\lnot I\) and TR are the minor assumptions discharged by applications of those rules.

Like \(\bot E\), \(\lnot I\) has no arbitrary premises.

Vacuous discharge happens when in an application of a rule that permits discharge no formula is in fact discharged. In systems of natural deduction with the rules of Gentzen and Prawitz, applications of rules with vacuous discharge above arbitrary premises are removed from deductions by what is often called simplification conversionsFootnote 6 The procedure is obvious, and I will give no details here. In C, all vacuous discharge happens above arbitrary premises, and there it is clearly superfluous: instead of applying a rule and discharging vacuously, we might as well go on with the deduction straight from the arbitrary premise. In C there is no need for vacuous discharge at all, and in the following, I will assume that any deduction is cleaned up so as to contain no vacuous discharge above arbitrary premises; in particular, I will assume that this is done should vacuous discharge above an arbitrary premise arise as a result of the conversions of deductions to be given in Sect. 4. In \(\wedge E\), it is of course often necessary to make use of the option of discharging only one assumption.

3 Comparison with Parigot’s free deduction

The feature of Milne’s system that all conclusions of rules are identical to an arbitrary premise and hence no logical constants appear below the inference line in any of the general statements of its rules has a close parallel in Michel Parigot’s system of free deduction [7]. This is a system of multiple conclusion sequent calculus for classical logic where it is also the case that no formulas with logical constants appear in the conclusions of the general statement of the rules:

figure d

\(\Gamma , \Delta , \Pi , \Sigma \) are sets of formulas. Weakening and contraction may be added either in their usual form as explicit rules or in an implicit form through the conventions that active formulas need not occur in a premise and that the conclusions of rules are contracted.

Parigot explains how to obtain common systems of sequent calculus and natural deduction for classical logic from free deduction. In both cases, some of the premises of the rules of free deduction are replaced by axioms and then suppressed [7, 367f]. Parigot gives the examples of a common formulation of the classical sequent calculusFootnote 7 and a system of natural deduction with multiple conclusions.

Milne’s system can be obtained from Parigot’s by using replacing its right conjunction rules by a variant [7, 364]:

figure e

Instead of replacing premises by axioms and suppressing them, we simply restrict the succedents of sequents of free deduction to single conclusions in the most obvious way. Right rules turn into general elimination rules. Left rules turn into general introduction rules. Not carrying an active formula of a rule of free deduction to the left of \(\vdash \) into the conclusion corresponds to the discharge of that formula; active formulas to the right of \(\vdash \) turn into premises of the rules:

\(\wedge I\): take \(L\wedge \), let \(\Delta =\{C\}\) and let \(\Sigma =\Sigma '=\varnothing \).

\(\wedge E\): take \(R\wedge '\), let \(\Sigma =\{C\}\) and let \(\Delta =\varnothing \).

\(\vee I\): take \(L\vee \), let \(\Delta =\{C\}\) and let \(\Sigma =\varnothing \).

\(\vee E\): take \(R\vee \), let \(\Delta =\varnothing \) and let \(\Sigma =\Sigma '=\{C\}\).

\(\lnot I\): take \(L\lnot \), let \(\Delta =\Sigma =\{C\}\).

\(\lnot E\): take \(L\lnot \), let \(\Delta =\Sigma =\varnothing \) in the premise and \(\{C\}\) in the conclusion.

TR: take the first \(L\supset \), let \(\Delta =\Sigma =\{C\}\)

\(\supset I\): take the second \(L\supset \) and let \(\Delta =\{C\}\) and let \(\Sigma =\varnothing \).

\(\supset E\): take \(R\supset \) and let \(\Delta =\Sigma =\varnothing \) and let \(\Sigma '=\{C\}\).

The result is Milne’s system when written in natural deduction in sequent calculus style.

At one point the correspondence is not quite so perfect: in \(\lnot E\), the succedents of the premises and the conclusion are treated differently. But this is a minor blemish. It could be remedied by adding \(\bot \) to the system and treating \(\lnot \) as defined in terms of it and \(\supset \).

A closer comparison of the proof-theoretic properties of Milne’s system with Parigot’s free deduction and other systems of classical logic found in the literature merits its own investigation is left for another occasion.

4 Normalisation for C

We begin by adjusting the common notion of a maximal formula to deductions in C:

Definition 4

(Maximal formula) A maximal formula in a deduction in C with main operator \(*\) is an occurrence of a formula \(A*B\) or \(*A\) (i.e. \(*\) is \(\lnot \)) that is the major premise of an application of \(*E\) and the major assumption discharged by an application of \(*I\) or TR in case \(*\) is \(\supset \).

Maximal formulas are removed from deductions by procedures to be given shortly. But first an observation about negation.

It would be possible to restrict the conclusion of \(\lnot E\) to formulas not having \(\supset \), \(\wedge \) or \(\vee \) as main connectives. Similar constructions that show that \(\bot E\) can be restricted to atomic conclusions are easily adapted to \(\lnot E\) of C. We cannot, however, restrict the conclusion to atomic formulas nor to literals: they sometimes need to be of the form \(\lnot C\), where C may be a complex formula.

An attempt to avoid deriving \(\lnot C\) by \(\lnot E\) while also avoiding the creation of a maximal formula would need to appeal to \(\lnot I\) where the major assumption discharged by the application of the rule is also the right arbitrary premise and of the same shape as the conclusion. Hence we would need to derive \(\lnot C\) from C to gain the left arbitrary premise using the premises \(\lnot A\) and A of the application of \(\lnot E\) that concluded \(\lnot C\). The only way to do so would be to derive \(\lnot C\) from A and \(\lnot A\), which was to be avoided.

\(\lnot E\) will therefore be treated like all the other elimination rules and it will be shown that its conclusion need never be the major premise of an elimination rule.

Applications of general introduction and elimination rules except \(\lnot E\) require deductions of arbitrary premises C which also provide the conclusion of the application of the rule. They form part of a sequence of formula occurrences of the same shape:Footnote 8

Definition 5

(Segment) A segment is a sequence of formula occurrences \(C_1\ldots C_n\) in a deduction such that either

  1. (i)

    \(n>1\) and for all \(i<n\), \(C_i\) is an arbitrary premise of an application of a rule and \(C_{i+1}\) is its conclusion, and \(C_n\) is not an arbitrary premise of an application of a rule; or

  2. (ii)

    \(n\ge 1\) and \(C_1\) is the conclusion of \(\lnot E\) and for all \(i<n\), \(C_i\) is an arbitrary premise of an application of a rule and \(C_{i+1}\) is its conclusion, and \(C_n\) is not an arbitrary premise of an application of a rule.

The length of a segment is the number of formula occurrences of which it consists, its degree the degree of any such formula. As \(C_1\ldots C_n\) are all of the same shape, I will speak of the formula (as a type) constituting the segment.

Every conclusion of \(\lnot E\) forms part of a segment of length 1. The motivation for classifying conclusions of \(\lnot E\) as giving rise to segments is that this rule is like a crossing between \(\supset E\) and \(\bot E\): it has a minor premise, like \(\supset E\), but lacks arbitrary premises, like \(\bot E\).

Due to the ban on vacuous discharge above arbitrary premises, in all other cases the first formula of a segment is an arbitrary premise discharged by an introduction rule the conclusion of which is the second formula of the segment.

The major, minor and specific premises of rules are either assumptions or the last formulas of segments.

Definition 6

(Maximal segment) A maximal segment is a segment the last formula of which is the major premise of an elimination rule.

Maximal segments are removed from deductions by procedures to be given shortly. It will be a consequence of their successive application that maximal segments the first formula of which is the conclusion of \(\lnot E\) are removed from deductions.

Definition 7

(Normal form) A deduction is in normal form if it contains neither maximal formulas nor maximal segments.

It will be shown that any deduction can be brought into normal form by successive and systematic application of the procedures to be given now, which remove maximal segments and formulas from deductions. I will call the deduction to which such a procedure is applied the initial deduction and the result the reduced deduction.

First we need to take care of a minor issue. In the initial deduction, more assumptions of the form \(A*B\) or \(*A\) than the maximal formula to be removed by the procedure may be discharged above the arbitrary premise of the introduction rule. There are several ways of dealing with this issue. A fuller discussion of the options is reserved for the companion piece to the present paper which covers intuitionist logic [3]. Here I will simply adopt the easiest solution to the problem, which consists in the observation that any application of a general introduction rule can be transformed into one that discharges exactly one major assumption. An advantage besides simplicity is that the normalisation theorem for C with rules so restricted transposes directly to a system of classical logic with the more common introduction rules in the style of Gentzen and Prawitz. This will be established in the conclusion to the present paper.

Suppose, for instance, one wanted to discharge n formula occurrences of the type \(A\vee B\) by an application of \(\vee I\):

figure f

Then instead of making this one application of \(\vee I\), one can apply it n times:

figure g

The cases for the other connectives are similar.

There are now two options: either make the restriction part of the construction of deductions, or first transform a given deduction into one that satisfies the restriction before the reduction procedures are applied. Either option works, as the system with the unrestricted introduction rules and the system with their restricted versions are evidently equivalent. Obviously any application of a restricted introduction rule is also a correct application of the unrestricted version, and the converse holds in virtue of the following:

Lemma 1

Any deduction can be transformed into one in which every application of a general introduction rules discharges exactly one major assumption.

Proof

By the ban on vacuous discharge above arbitrary premises, the transformations indicated above and an induction over a suitable measure of the complexity of deductions, e.g. the number of applications of introduction rules discharging multiple formula occurrences of highest degree in a deduction. Take such an application such that no other such application stands above it in the deduction. Applying the transformation reduces the measure. \(\square \)

Moreover, any sequence of applications of introduction rules as in the example above can be collapsed into one application, so one could, after maximal formulas have been removed from a deduction, also simplify it again in that respect.

As it lends itself to a very straightforward normalisation proof, unless otherwise stated, in the following I assume what I call the unique discharge convention on introduction rules: every application of an introduction rule for \(*\) discharges exactly one formula occurrence of the form \(A*B\) or \(*\,A\).

Now for the procedures to remove maximal segments and formulas from deductions. Concerning the procedures to remove maximal formulas of the form \(A\supset B\) and \(\lnot A\), observe once more that all discharge happens above arbitrary premises, and so they can be used to conclude any formula. This observation is used to ensure that assumptions that may have been discharged in the initial deduction are also discharged in the reduced deduction: we will reuse certain applications of rules that discharge assumptions in the initial deduction with new arbitrary premises and conclusions in the reduced deduction for the purpose of ensuring the conclusion of the reduced deduction does not depend on more formulas than the conclusion of the initial deduction.

Reduction Procedures for Maximal Formulas

Maximal formulas are removed from deductions by applying the following reduction procedures for maximal formulas, where \(\Pi , \Sigma \) above [A], [B] indicate that these deductions are used to conclude each formula occurrence in the assumption class to which AB belong (assumption class markers are deleted):

1. The maximal formula is of the form \(A\wedge B\). Convert the deduction on the left into the deduction on the right:

figure h

2. The maximal formula is of the form \(A\vee B\). Convert the deduction on the left into the deduction on the right:

figure i

Similarly for the case where the premise of \(\vee I\) is B concluded by \(\Sigma _2\).

3. The major premise of \(\supset E\) is discharged by \(\supset I\). Convert the deduction on the left into the deduction on the right:

figure j

4. The major premise of \(\supset E\) is discharged by TR. Convert the deduction on the left into the deduction on the right, where \(\Xi ^*\) is constituted by applications of rules in \(\Xi \) that discharge assumptions in \(\Pi _1\):

figure k

\(\Xi ^*\) is constructed in the following way. Let \(\rho _1\ldots \rho _n\) be the sequence of applications of rules in \(\Xi \) that discharge assumptions in \(\Pi _1\) from top to bottom. Let \(\rho _1^*\) be an application of the same rule as \(\rho _1\), with its major, minor and specific premises concluded as in \(\rho _1\) and its arbitrary premises and conclusion replaced by D. If \(\rho _1^*\) has only one arbitrary premise, conclude it with the deduction ending in the upper D in the schematic representation of the reduction procedure (i.e. it is concluded by \(\Sigma \), in turn the continuation of \(\Pi _1\) through A). If \(\rho _1^*\) has two arbitrary premises, then one is concluded as described previously, and to conclude the other, observe that in that case \(\Xi \) contains a subdeduction of D from the conclusion E of \(\rho _1\): append it to the deduction concluding the other arbitrary premise of \(\rho _1\) to conclude the other arbitrary premise of \(\rho _1^*\), deleting redundant applications of rules (i.e. those discharging assumptions that do not stand above that arbitrary premise of \(\rho _1\)). Continue in the same way with \(\rho _2\) until you reach \(\rho _n\).

Examples of applications of the reduction procedures will make this more concrete, and I will give two shortly. For now, notice that the reduction procedure removes the maximal formula \(A\supset B\) without introducing any new ones, as it only uses (some of the) material already present in \(\Xi \).

5. The maximal formula is of the form \(\lnot A\). Convert the deduction on the left into the deduction on the right, where \(\Xi ^*\) is constructed from all those applications of rules in \(\Xi \) that discharge assumptions in \(\Pi _1\) as in the previous reduction procedure:

figure l

This completes the reduction procedures for maximal formulas.

Examples

Let’s illustrate the method of reusing rules that discharge assumptions in the reduced assumption with two examples. First, suppose there is a sole application of a rule below the major premise discharged by \(\lnot I\) and above its right arbitrary premise that discharges an assumption above the minor premise of \(\lnot E\), and let it be \(\vee E\):

figure m

In this case any applications of rules in \(\Xi _2\) can only be of \(\lnot E\), and any application of a rule in \(\Xi _4\) that discharges assumptions (i.e. rules other than \(\lnot E\)) discharge them in \(\Xi _2\). \(\Xi \) of the schematic representation of the reduction procedure for maximal formulas of the form \(\lnot A\) is constituted by \(\Xi _1, \Xi _2, \Xi _3, \Xi _4\) in the example. \(\Xi ^*\) consists of an application of \(\vee E\) with its major premise derived by \(\Xi _1\), its first arbitrary premise derived by \(\Pi \) and \(\Sigma \), and the second arbitrary premise derived by \(\Xi _3\) and applications of those rules of \(\Xi _4\), call them \(\Xi _4^*\), that do not discharge assumptions in \(\Xi _2\) (these become redundant). The reduced deduction is:

figure n

As all discharge happens above arbitrary premises, we could also first permute the application of \(\vee E\) in the initial deduction downwards to conclude with F. This would duplicate \(\Xi _4\) and in most cases create redundant applications of rules in the copy of \(\Xi _4\) under \(\Xi _3\), which are to be removed. Then apply the reduction procedure. The result would be the same in this example, but in the general case this method is not enough, as applying the reduction procedure removes \(\Xi _2\) from the deduction, and some of it may be needed to discharge assumptions in \(\Pi \): these rules would still have to be reused somehow.

Now consider the case in which there are more than one applications of rules discharging assumptions above the minor premise A of \(\lnot E\) in the initial deduction, and let \(\vee E\) be the top-most such rule. Then all other applications of such rules are in \(\Xi _4\), and \(\Xi _2\) can disappear in the reduction procedure without loss. After concluding F by \(\vee E\), the deduction needs to continue with those rules of \(\Xi _4\) that discharge assumptions in \(\Pi \). This poses no further problem: as discharge happens above arbitrary premises, we can apply the relevant rules of \(\Xi _4\) with F as the arbitrary premises and conclusions, deriving the major, minor, specific and remaining arbitrary premise (if any) as in \(\Xi \).

As a second example, suppose TR is the top most application of a rule below the major premise discharged by \(\lnot I\) and above its right arbitrary premise that discharges an assumption above the minor premise of \(\lnot E\):

figure o

Then \(\Xi \) of the schematic representation of the reduction procedure for maximal formulas of the form \(\lnot A\) is constituted by \(\Xi _1, \Xi _2, \Xi _3\). \(\Xi ^*\) consists of an application of TR with its left arbitrary premise concluded by \(\Sigma , \Pi \), and its right arbitrary premise concluded by \(\Xi _2\) and those applications of \(\Xi _3\), call them \(\Xi _3^{*_1}\), needed to derive F from D, and with its conclusion followed by applications of the rules of \(\Xi _3\), call them \(\Xi _3^{*_2}\), that discharge assumptions in \(\Pi \). The reduced deduction is:

figure p

Once more, as all discharge happens above arbitrary premises, constructing \(\Xi _3^{*_2}\) from \(\Xi _3\) poses no further problem.

C has 10 rules, 4 of which are elimination rules. So there are 40 permutative reduction procedures. Most of them are handled by permuting the application of the elimination rule upwards. The 4 cases where the major premise of an elimination rule is concluded by \(\lnot E\) do not really involve any permutation upwards, but I will class them with the permutative reduction procedures. None of this presents any difficulty, so I will only give some of the permutativereduction procedures as examples, the others being similar.

Premutative Reduction Procedures for Maximal Segments

1. The major premise of \(\supset E\) is derived by \(\vee I\). Convert the deduction on top into the deduction below:

figure q

2. The major premise of \(\supset E\) is derived by \(\supset I\). Convert the deduction on top into the deduction below:

figure r

3. The major premise of \(\supset E\) is derived by \(\wedge E\). Convert the deduction on top into the deduction below:

figure s

4. The major premise of \(\lnot E\) is derived by TR. Convert the deduction to the left of \(\leadsto \) into the one on its right:

figure t

5. In the case a major premise of an elimination rule is concluded by \(\lnot E\), remove the application of the elimination rule and use \(\lnot E\) to conclude the discharged assumption(s) of (one of) the side deductions concluding with an arbitrary premise instead. For instance, convert the deduction to the left of \(\leadsto \) into the one on its right:

figure u

Alternatively, we could have concluded C by\(\lnot E\) and used \(\Sigma _2\) to conclude D. The normal form of deductions is therefore not unique.

This completes the permutative reduction procedures.

Repeated application of a permutative reduction procedure reduces the length of a maximal segment by permuting applications of elimination rules upwards in the deduction. As noted earlier, the first formula of a segment can only be one discharged by an introduction rule, and so repeated application of a permutative reduction procedure turns a maximal segment into a maximal formula. At the top of every maximal segment, there stands a maximal formula, so to speak.

Definition 8

(Rank of deductions) The rank of a deduction \(\Pi \) is the pair \(\langle d, l\rangle \), where d is the highest degree of a maximal formula or maximal segment in \(\Pi \) or 0 if there is none, and l is the sum of sum of the lengths of maximal segments of highest degree and number of maximal formulas in \(\Pi \). \(\langle d, l\rangle < \langle d', l'\rangle \) iff either (i) \(d<d'\) or (ii) \(d=d'\) and \(l<l'\).

Applying reduction procedures for maximal formulas cannot introduce new maximal formulas into the reduced deduction, but it may increase the lengths of maximal segments that were in the initial deduction.Footnote 9 In the case of maximal formulas of form \(A\wedge B\), this can happen if A is concluded by a rule in \(\Sigma _1\) or B is in \(\Sigma _2\) (i.e. \(\Sigma _1\) or \(\Sigma _2\) is not empty) and some formula occurrence in the assumption class to which the formulas discharged by \(\wedge E\) belong is the major premise of an elimination rule in \(\Pi _1\). Similarly for maximal formulas of the form \(A\vee B\). In the case of maximal formulas of the form \(A\supset B\), this can happen in reduction procedure 3 if B is concluded by a rule in \(\Sigma \) (i.e. \(\Sigma \) is not empty) and some formula occurrence in the assumption class to which the formulas discharged by \(\supset E\) belong is the major premise of an elimination rule in \(\Pi _2\), and in reduction procedure 4 if A is concluded by a rule in \(\Pi _1\) (i.e. \(\Pi _1\) is not empty) and some formula occurrence in the assumption class to which the minor assumptions discharged by TR belong is the major premises of an elimination rule in \(\Sigma \). The case of maximal formulas of the form \(\lnot A\) is similar to the last one with TR.

Any maximal segment that suffers an increase in length as a result of a reduction procedure is, however, of lower degree than the maximal formula removed, as the formula that form part of the segment are subformulas of the latter. Hence applying a reduction procedure for maximal formulas cannot increase the rank of a deduction.

An application of a permutative reduction procedure reduces the length of a maximal segment by 1 or removes it entirely, if the maximal segment is of length 1 (i.e. it is the conclusion of \(\lnot E\) and major premise of an elimination rule). In the latter case, another maximal segment is also reduced by 1, namely should the conclusion D in the schematic representation of the procedure above form part of a maximal segment. The other permutative reduction procedures may increase the lengths of maximal segments that were in the initial deduction. To ensure the decrease of the rank of a deduction, the permutative reduction procedures must be applied with a strategy.

Adopting the convention that a deduction that already is in normal form converts into itself, we have:

Theorem 1

Any deduction in C can be converted into a deduction in normal form.

Proof

By Lemma 1, it suffices to consider deductions in which introduction rules discharge exactly one major assumption. The theorem follows by the considerations of the paragraphs immediately preceding the theorem and an induction over the rank of deductions. Applying reduction procedures for maximal formulas cannot increase the rank of a deduction, and as a maximal formula is removed, applying a reduction procedure to a maximal formula of highest degree decreases the rank of the deduction. Permutative reduction procedures must be applied so as to avoid an increase of a length of segments of highest degree. This can be achieved by applying one to a maximal segment of highest degree such that no maximal segment of highest degree stands above it in the deduction. This reduces the rank of the deduction. \(\square \)

The deduction in normal form has the same undischarged assumptions as the initial deduction:

Theorem 2

For any deduction in C, there is a deduction in normal form of the same conclusion from the same undischarged assumptions.

Proof

By theorem 1 and the ban on vacuous discharge. \(\square \)

This is a noteworthy difference to normalisation in Gentzen’s and Prawitz’s systems of classical intuitionist logic, where during the course of the normalisation procedure typically some undischarged assumptions of the deduction are removed together with maximal formulas above which they occur.

The form of normal deductions has other noteworthy features:

Theorem 3

If \(\Pi \) is a deduction in normal form, then all major premises of elimination rules are (discharged or undischarged) assumptions of \(\Pi \).

Proof

This is a consequence of the way the permutative reduction procedures are applied in normalisation. \(\square \)

Deductions in normal form in C are particularly perspicuous.

Definition 9

(Subformula property) A deduction \(\Pi \) of a conclusion C from the undischarged assumptions \(A_1\ldots A_n\) has the subformula property if every formula on the deduction is a subformula either of C or of \(A_1\ldots A_n\).

The crucial detail of deductions in normal form in C that ensures that they enjoy the subformula property is that the major premise \(\lnot A\) of \(\lnot E\) is not the major assumption discharged by \(\lnot I\), and because the minor assumption discharged by TR is a subformula of the major assumption, which is not the major premise of \(\supset E\). To prove the subformula property formally, we need the notion of a branch:

Definition 10

(Branch)

A branch in a deduction is a sequence of formula occurrences \(\sigma _1\ldots \sigma _n\) such that \(\sigma _1\) is an assumption of the deduction that is neither discharged by an elimination rule nor the major assumption discharged by an introduction rule other than \(\lnot I\) or TR, \(\sigma _n\) is either the conclusion of the deduction or the minor premise of \(\supset E\) or \(\lnot E\), and for each \(n>i\):

  1. (i)

    if \(\sigma _i\) is the major premise of an elimination rule other than \(\lnot E\), \(\sigma _{i+1}\) is an assumption discharged by it, and if it is the major premise of \(\lnot E\), \(\sigma _{i+1}\) is the conclusion of the rule;

  2. (ii)

    if \(\sigma _i\) is the specific premise of an introduction rule, \(\sigma _{i+1}\) is a major assumption discharged by it;

  3. (iii)

    and if \(\sigma _i\) is an arbitrary premise (of an introduction or an elimination rule rule), \(\sigma _{i+1}\) is the conclusion of the rule.

Definition 11

(Order of branches) A branch is of order 0 if its last formula is the conclusion of the deduction; it is of order \(n+1\) if its last formula is the minor premise of \(\supset E\) or of \(\lnot E\) such that its major premise is on a branch of order n.

A branch of order 0 is also called a main branch in the deduction.

Corollary 1

If any major premises of elimination rules are on a branch in a deduction in normal form, then they precede any major assumptions discharged by introduction rules that are on the branch.

Proof

By theorem 3, the major premises of elimination rules that occur on a branch in a deduction in normal form are assumptions. Hence they are not the last formulas of any segments, and in particular they are not the last formulas of any segments beginning with discharged major assumptions of introduction rules. \(\square \)

TR and \(\lnot I\) are introduction rules without specific premises. Branches move from their premises to their conclusions. A branch in a deduction in C begins with an undischarged assumption or an assumption discharged by \(\lnot I\) or TR. If an assumption discharged by TR or \(\lnot I\) was also an arbitrary premise of the rule, there would be vacuous discharge above the other arbitrary premises. But vacuous discharge was banned. Hence the assumptions discharged by TR and \(\lnot I\) can only be major, minor or specific premises of rules. In deductions in normal form, the first option does not occur in case of the major assumptions discharged. Hence the major assumptions discharged by TR and \(\lnot I\) are minor premises of \(\lnot E\), \(\supset E\) or specific premises of \(\supset I\) or \(\vee I\), and consequently subformulas of other formulas on the deduction. The minor premises discharged by TR and \(\lnot E\) are subformulas of the major premises discharged. This, in a nutshell, guarantees that deductions in normal form enjoy the subformula property.

A branch in a deduction in normal form in C can be partitioned into an E-part, the first part of the branch, which consists of major premises of elimination rules, and an I-part, which consists of sequences of segments that are the major assumptions discharged by introduction rules. Separating the two parts is the minimal formula or minimal segment. Either part may be empty: some branches in normal deductions consist of only an E-part, some of only an I-part, and in the case of a deduction that consists of a single formula A, both parts are empty and there is only a minimal formula.

For brevity we may speak of a segment being the premise, conclusion or discharged assumption of the rule of which its last or first formula is the premise, conclusion or discharged assumption.

An induction over the order of branches establishes the following result:

Corollary 2

Deductions in normal form have the subformula property.

Proof

By inspection of the rules and an induction over the order of branches. Consider a branch of order 0. If it has an E-part, it begins with a sequence of formulas and segments that are major premises of elimination rules, going from major premise to assumption discharged by the elimination rule, until it reaches a specific formula of an introduction rule, and then continues with segments discharged by introduction rules, until it reaches the conclusion of the deduction. All formulas in the latter part of the branch are subformulas of the conclusion of the deduction. All the formulas on the former part of the branch are subformulas of an assumption that remains undischarged in the deduction. If the branch does not have an E-part, it begins with specific premises of introduction rules. This is because the major assumptions discharged by TR and \(\lnot E\) form branches consisting of only a minimal formula, because due to the normal form of the deduction and the ban on vacuous discharge, they can only be the specific premises of introduction rules. In a deduction in normal form branches beginning with the major assumptions discharged by TR and \(\lnot I\) cannot contain any major premises of elimination rules. Hence the major assumptions discharged by TR and \(\lnot I\) are subformulas of the final formula of the branch, and therefore so are the minor premises discharged by these rules. The only tricky case to consider is \(\lnot E\): here the branch goes from the major premise of the form \(\lnot A\) to a conclusion C, which may not share any subformulas. But the case is really no different from applications of \(\bot E\) in Gentzen’s and Prawitz’s system of natural deduction for intuitionist logic: the crucial point is that C is not the major premise of an elimination rule, and so it is on the I-part of the branch, and hence a subformula of the last formula of the branch. Any conclusion of \(\lnot E\) on a branch of a deduction in normal form is either the last formula of the branch, or the specific premise of an introduction rule, or the first formula of a segment ending in a specific premise of an introduction rule, and hence it is a subformula of the conclusion of the deduction. This completes the basis of the induction. A branch that ends in the minor premise of \(\supset E\) or \(\lnot E\) ends in a formula that is a subformula of a branch of lower order, and hence the theorem holds by induction over the order of branches. \(\square \)

As a corollary of this corollary we have:

Corollary 3

For any deduction in C, there is a deduction of the same conclusion from the same undischarged assumptions with the subformula property.

Proof

From theorem 2 and corollary 2. \(\square \)

Finally, normalisation yields a direct proof of the consistency of C. Let a proof in C be a deduction that has no undischarged assumptions.

Corollary 4

If there is a proof of A in C, then there is one that ends with an introduction rule.

Proof

Elimination rules do not discharge assumptions above their major premises. Hence if in a deduction in normal form there is a main branch that begins with the major premises of an elimination rule and does not have an I-part, it is not a proof. Contraposing and applying theorem 1, if there is a proof of A in C, then there is one that ends with an introduction rule. \(\square \)

Corollary 5

C is consistent.

Proof

Suppose there is a proof of an atomic formula p in C. Then there is a proof in normal form of p. But as p contains no connective, it cannot end in an introduction rule, and hence it is not a proof. Contradiction. \(\square \)

A similar argument establishes that by the form of normal deductions, a proof of \(A\wedge \lnot A\) would end with an introduction rule, hence both A and \(\lnot A\) would need to have been derived from no assumptions, which is impossible.

5 Adding the quantifiers

To close this paper, let’s consider what happens when C is extended to quantificational logic. The language is extended in standard fashion to contain the connectives \(\forall \), \(\exists \), constants, function and predicate symbols and variables. The language has two disjoint sets of variables, the parameters \(a, b, c\ldots \) playing the role of free variables, and the variables to be bound by the quantifiers \(x, y, z\ldots \), which do not occur free in formulas. The terms of the language are built up from the parameters, constant symbols and function symbols. We may call an expression that is like a formula or a term, but containing free variables instead of parameters, a pseudo-formula or a pseudo-term.

Let \(A_x^a\) be the result of substituting the variable x for the parameter a in A. Let \(A^x_t\) be the result of substituting all occurrences of the variable x in A by the term t, where it is assumed that t is free for x in A, i.e. no variable in the pseudo-term t becomes bound as a result of the substitution.

The usual elimination rule for the existential quantifier already has the form of general elimination rules. The general elimination rule for the universal quantifier has the same form with a different use of terms:

figure v

where in \(\exists E\), parameter a does not occur in \(\exists xA\), nor in C, nor in any formulas undischarged in \(\Pi \) except those of the assumption class \([A^x_a]\).

The following are the general introduction rules for the quantifiers:

figure w

where in \(\forall I\), parameter a does not occur in any undischarged assumption of \(\Sigma \).Footnote 10

It is worth remarking that the rules for both quantifiers have the same form and differ only with respect to the occurrences of terms and parameters and consequently where restrictions on parameters are imposed.

Milne observes that adding rules for both quantifiers upsets the subformula. It is instructive to see why. Consider the classically, but not intuitionistically, valid inference of \(\forall x(Fx\vee A)\vdash \forall x Fx \vee A\), (x not free in A). It may be derived in the following way:

figure x

\(\lnot A\) is a maximal formula. An attempt to remove it from the deduction by applying the reduction procedure for maximal formulas of this form gives the following:

figure y

The application of \(\forall I\) is now incorrect.

The problem is that \(\Xi \) in the schematic representation of the reduction procedure for maximal formulas of the form \(\lnot A\) may contain applications of rules that discharge formulas in which parameters of applications of \(\forall I\) lower down in the deduction occur, but in \(\Xi ^*\), these applications become incorrect as they are permuted upwards.

So, for instance in the first example of an application for the reduction procedure for maximal formulas of the form \(\lnot A\) given in the previous section, there may be an application of \(\forall I\) in \(\Xi _4\) that is indispensable for the deduction of F from D (i.e. if they are different), where E contains the parameter of that application: this application becomes incorrect in \(\Xi _4^*\).

A similar problem occurs in case the maximal formula is the major assumption discharged by TR.

The situation is different with applications of \(\exists E\), as there the restriction on parameters is imposed on a deduction above an arbitrary premise. If there is an application of \(\exists E\) in \(\Xi _4\), we apply it after the application of \(\vee E\), and the problem does not arise.

Let \(\mathbf {C^\exists }\) be C plus the rules for the existential quantifier. We will give reduction procedures for maximal formulas of the form \(\exists xA\) and prove a normalisation theorem for this system.

As we have an unlimited amount of parameters at our disposal, we may adopt the convention that every application of \(\exists E\) has its own parameter, so that the parameter of an application of \(\exists E\) occurs only in its discharged assumption and formulas derived from it. Consequently, the parameter occurs only above the application of the rule in a deduction, and any application of \(\exists E\) below it has a different parameter. Call this the parameter convention.Footnote 11

Inspection of the reduction procedures for the propositional connectives shows that, if the parameter convention is upheld, then any correct application of \(\exists E\) in the initial deduction remains correct in the reduced deduction. The same holds for the following reduction procedure for maximal formulas of the form \(\exists x A\), where \(\Xi _t^a\) is the result of substituting the term t for the parameter a throughout \(\Xi \). We add to the list of reduction procedures of the previous section:

6. The maxima formula is of the form \(\exists xA\). Convert the deduction on the left into the deduction on the right:

figure z

The additional permutative reduction procedures are evident and pose no further difficulty. Neither does extending the proof of theorem 1 to \(\mathbf {C^\exists }\), and we have:

Theorem 4

Deductions in \(\mathbf {C^\exists }\) normalise.

Consequently, for the standard notion of a subformula applicable to quantificational logic, where all substitution instances of a formula \(\exists x A\) are counted amongst its subformulas, we have:

Corollary 6

Deductions in normal form \(\mathbf {C^\exists }\) have the subformula property.

Finally, as vacuous discharge remains banned in \(\mathbf {C^\exists }\), we have:

Corollary 7

For any deduction in \(\mathbf {C^\exists }\), there is a deduction of the same conclusion from the same undischarged assumptions with the subformula property.

Proof

From theorem 4 and corollary 6. \(\square \)

6 Conclusion

Having adopted the unique discharge convention meant that, for the purposes of normalisation, consideration was restricted to applications of general introduction rules that discharge exactly one major assumption. After normalisation, it would be possible to collapse certain sequences of applications of general introduction rules into one. On the other hand, the unique discharge convention allows for an easy transposition of C into a more conventional system. The conclusion of an application of an introduction rule in Gentzen’s system obviously occurs exactly once in a deduction, so if the unique discharge convention is upheld, there is a simple correspondence between deductions in Gentzen’s system and in the present system with general introduction rules. Instead of assuming, concluding and discharging the major assumption \(A*B\), we only conclude it, without making and discharging the assumption. This works for \(\wedge \), \(\vee I\), \(\supset I\) and \(\exists I\), but of course not for \(\lnot I\) and TR: them we must leave as they are. Furthermore, although it is not necessary for a complete system of classical logic to allow the discharge of an assumption A above B in \(\supset I\), as this rule is derivable from \(\supset I\) and TR, doing so does not upset the subformula property. The following is therefore a complete system of classical quantificational logic in which for every deduction, there is a deduction that satisfies the subformula property:

figure aa

Finally, we could also define \(\lnot \) in terms of \(\bot \) and replace the two negation rules with one rule:

figure ab

This rule can be restricted to atomic conclusions, and certain economies ensue in the number of reduction procedures required to prove normalisation.