1 Introduction

Resource aware logics have been object of passionate study for quite some time now. The motivations for this passion vary: resource consciousness are adequate for modeling steps of computation; logics have interesting algebraic semantics; calculi have nice proof theoretic properties; multi-modalities allow for the specification of several behaviors; there are many interesting applications in linguistics, etc.

With this variety of subjects, applications and views, it is not surprising that different groups developed different systems based on different principles. For example, the Lambek calculus (\(\mathsf {L}\)) [29] was introduced for mathematical modeling of natural language syntax, and it extends a basic categorial grammar [3, 4] by a concatenation operator. Linear logic (\(\mathsf {LL}\)) [16], originally discovered by Girard from a semantical analysis of the models of polymorphic \(\lambda \)-calculus, turned out to be a refinement of classical and intuitionistic logic, having the dualities of the former and constructive properties of the latter. The key point is the presence of the modalities \(\mathop {!},\mathord {?}\), called exponentials in \(\mathsf {LL}\). In the intuitionistic version of \(\mathsf {LL}\), denoted by \(\mathsf {ILL}\), only the \(\mathop {!}\) exponential is present.

\(\mathsf {L}\) and \(\mathsf {LL}\) were compared in [2], when Abrusci showed that Lambek calculus coincides with a variant of the non-commutative, multiplicative version of \(\mathsf {ILL}\) [41]. This correspondence can be lifted for considering also the additive connectives: Full (multiplicative-additive) Lambek calculus \(\mathsf {FL}\) relates to non-commutative multiplicative-additive version of \(\mathsf {ILL}\), here denoted by \(\mathsf {cLL}\).

In this paper we propose the sequent based system \(\mathsf {acLL}_\varSigma \), a conservative extension of \(\mathsf {cLL}\), where associativity is allowed only for formulas marked with a special kind of modality, determined by a subexponential signature \(\varSigma \). The notation adopted is modular, uniform and scalable, in the sense that many well known systems will appear as fragments or special cases of \(\mathsf {acLL}_\varSigma \), by only modifying the signature \(\varSigma \). The core fragment of \(\mathsf {acLL}_\varSigma \) (i.e., without the subexponentials) corresponds to the non-associative version of full Lambek calculus, \(\mathsf {FNL}\) [8].Footnote 1

The language of \(\mathsf {acLL}_\varSigma \) consists of a denumerable infinite set of propositional variables \(\{p, q, r, \ldots \}\), the unities \(\{1,\top \}\), the binary connectives for additive conjunction and disjunction \( \{\mathbin { \& },\oplus \}\), the non-commutative multiplicative conjunction \(\otimes \), the non-commutative linear implications \(\{\rightarrow , \leftarrow \}\), and the unary subexponentials \(\mathop {!}\nolimits ^{i}\), with i belonging to a pre-ordered set of labels \((I,\preceq )\).

Roughly speaking, subexponentials [13] are substructural multi-modalities. In \(\mathsf {LL}\), \(\mathop {!}A\) indicates that the linear formula A behaves classically, that is, it can be contracted and weakened. Labeling \(\mathop {!}\) with indices allows moving one step further: The set I can be partitioned so that, in \(\mathop {!}\nolimits ^{i}A\), A can be contracted and/or weakened. This allows for two other types of behavior (other than classical or linear): affine (only weakening) or relevant (only contraction). Pre-ordering the labels (together with an upward closeness requirement) guarantees cut-elimination [42]. But then, why consider only weakening and contraction? Why not also take into account other structural properties, like commutativity or associativity? In [20, 21] commutativity was added to the picture, so that in \(\mathop {!}\nolimits ^{i}A\), A can be contracted, weakened, classical or linear, but it may also commute with the neighbor formula. In this work we consider the last missing part: Associativity.

Smoothly extending \(\mathsf {cLL}\) to allow consideration of the non-associative case is non trivial. This requires a structural recasting/reframing of sequents: we pass from sets/multisets to lists in the non-commutative case, onto trees in the case of non-associativity [28]. As a consequence, the inference rules should act deeply over formulas in tree-structured sequents, which can be tricky in the presence of modalities [17].

On the other side, the multi-modal Lambek calculus introduced in [35, 45] and extended/compiled/implemented in [18, 36,37,38]Footnote 2 use different families of connectives and contexts, distinguished by means of indices, or modes. Contexts are indexed binary trees, with formulas built from the indexed adjoint connectives \(\{\rightarrow _i,\leftarrow _i\}\) and \(\otimes _i\) (e.g. \((A\rightarrow _i B,(C\otimes _j D,H)^k)^i\)). Each mode has its own set of logical rules (following the same rule scheme), and different structural features can be combined via the mode information on the formulas. This gives to the resulting system a multi-modal flavor, but it also results in a language of binary connectives, determined by the modes. This forces an unfortunate second level synchronization between implications and tensor, and modalities act over whole sequents, not on single formulas.

In order to attribute particular resource management properties to individual resources, in [27, 33] explicit (classical) multi-modalities \(\Diamond _i,\Box _i\) were proposed. While such unary modalities were inspired in \(\mathsf {LL}\) exponentials, the resemblance stops there. First of all, the logical connectives come together with structural constructors for contexts, which turns \(\Diamond _i,\Box _i\) into truncated forms of product and implication.

Second, \(\Diamond _i,\Box _i\) have a temporal behavior, in the sense that \( \Diamond \Box F\Rightarrow F\) and \(F \Rightarrow \Box \Diamond F \), which are not provable in \(\mathsf {LL}\) using the “natural interpretation” \(\Diamond = \mathord {?}\), \(\Box =\mathop {!}\).

In this paper, multi-modality is totally local, given by the subexponentials. The signature \(\varSigma \) contains the pre-ordered set of labels, together with a function stating which axioms, among weakening, contraction, exchange and associativity, are assumed for each label. Sequents will have a nested structure, corresponding to trees of formulas. And rules will be applied deeply in such structures. This not only gives the \(\mathsf {LL}\) based system a more modern presentation (based on nested systems, like e.g. in [10, 15]), but it also brings the notation closer to the one adopted by the Lambek community, like in [25]. Finally, it also uniformly extends several \(\mathsf {LL}\) based systems present in the literature, as Example 8 in the next section shows.

Designing a good system serves more than simple pure proof-theoretic interests: Well behaved, neat proof systems can be used in order to approach several important problems, such as interpolation, complexity and decidability. And decidability of extensions/variants/fragments of \(\mathsf {L}\) and \(\mathsf {LL}\) is a fascinating subject of study, since the presence or absence of substructural properties/connectives may completely change the outcome. Indeed, it is well known that \(\mathsf {LL}\) is undecidable [32], but adding weakening (affine \(\mathsf {LL}\)) turns the system decidable [24], while removing the additives (\(\mathsf {MELL}\) – multiplicative, exponential \(\mathsf {LL}\)) reaches the border of knowledge: It is a long standing open problem [50]. Non-associativity also alters decidability and complexity: \(\mathsf {L}\) is NP-complete [47], while \(\mathsf {NL}\) is decidable in polynomial time [1, 6]. Finally, the number of subexponentials also plays a role in decision problems: \(\mathsf {MELL}\) with two subexponentials is undecidable [9].

In this work, we will present two undecidability results, all orbiting (but not encompassing) \(\mathsf {MELL}/\mathsf {FNL}\). First, we show that \(\mathsf {acLL}_\varSigma \) containing the multiplicatives \(\otimes ,\rightarrow \), the additive \(\oplus \) and one classical subexponential (allowing contraction and weakening) is undecidable. This is a refinement of the unpublished result by Tanaka [51], which states that \(\mathsf {FNL}\) plus one fully-powered subexponential is undecidable.

In the second undecidability result, we keep two subexponentials, but with a minimalist configuration: the implicational fragment of the logic plus two subexponentials: the “main” one allowing for contraction, exchange, and associativity (weakening is optional), and an “auxiliary” one allowing only associativity. This is a variation of Chaudhuri’s result (in the non-associative, non-commutative case), making use of fewer connectives (tensor is not needed) and less powerful subexponentials.

The rest of the paper is organized as follows: Sect. 2 presents the system \(\mathsf {acLL}_\varSigma \), showing that it has the cut-elimination property and presenting an example in linguistics; Sect. 3 shows the undecidability results; and Sect. 4 concludes the paper.

We have placed, in Table 1, the acronyms for and decidability of all considered systems. Decidability for the cases marked with “−” depends on the signature \(\varSigma \).

Table 1. Acronyms/decidability of systems mentioned in the paper.

2 A Nested System for Non-associativity

Similar to modal connectives, the exponential \(\mathop {!}\) in \(\mathsf {ILL}\) is not canonical [13], in the sense that if \(i\not = j\) then \(\mathop {!}\nolimits ^{i}F\not \equiv \mathop {!}\nolimits ^{j}F\). Intuitively, this means that we can mark the exponential with labels taken from a set I organized in a pre-order \(\preceq \) (i.e., reflexive and transitive), obtaining (possibly infinitely-many) exponentials (\(\mathop {!}\nolimits ^{i}\) for \(i\in I\)). Also as in multi-modal systems, the pre-order determines the provability relation: for a general formula F, \(\mathop {!}\nolimits ^{b}F\) implies \(\mathop {!}\nolimits ^{a}F\) iff \(a \preceq b\).

The algebraic structure of subexponentials, combined with their intrinsic structural property allow for the proposal of rich linear logic based frameworks. This opened a venue for proposing different multi-modal substructural logical systems, that encountered a number of different applications. Originally [42], subexponentials could assume only weakening and contraction axioms:

$$ \mathsf {C}:\;\; \mathop {!}\nolimits ^{i} F \rightarrow \mathop {!}\nolimits ^{i} F \otimes \mathop {!}\nolimits ^{i} F \qquad \mathsf {W}:\;\; \mathop {!}\nolimits ^{i} F \rightarrow 1$$

This allows the specification of systems with multiple contexts, which may be represented by sets or multisets of formulas [44], as well as the specification and verification of concurrent systems [43], and biological systems [46]. In [20, 21], non-commutative systems allowing commutative subexponentials were presented:

$$ \mathsf {E}:\;\; (\mathop {!}\nolimits ^{i} F)\otimes G \equiv G\otimes (\mathop {!}\nolimits ^{i} F) $$

and this has many applications, e.g., in linguistics [21].

In this work, we will present a non-commutative, non-associative linear logic based system, and add the possibility of assuming associativityFootnote 3

$$ \mathsf {A1}:\;\; \mathop {!}\nolimits ^{i} F\otimes (G\otimes H) \equiv (\mathop {!}\nolimits ^{i} F\otimes G)\otimes H \qquad \mathsf {A2}:\;\; (G\otimes H)\otimes \mathop {!}\nolimits ^{i} F \equiv G\otimes (H\otimes \mathop {!}\nolimits ^{i} F) $$

as well as commutativity and other structural properties.

We start by presenting an adaption of simply dependent multimodal linear logics (\(\mathsf {SDML}\)) appearing in [31] to the non-associative/commutative case.

The language of non-commutative \(\mathsf {SDML}\) is that of (propositional intuitionistic) linear logic with subexponentials [21] supplied with the left residual; or similarly, that of \(\mathsf {FL}\) with subexponentials. Non-associative contexts will be organized via binary trees, here called structures.

Definition 1

(Structured sequents). Structures are formulas or pairs containing structures:

$$\begin{array}{ll} \varGamma ,\varDelta&:= F \mid (\varGamma ,\varGamma ) \end{array} $$

where the constructors may be empty but never a singleton.

An n-ary context is a context that contains n pairwise distinct numbered holes \(\{\,\}\) wherever a formula may otherwise occur. Given n contexts , we write for the context where the k-th hole in has been replaced by (for \(1 \le k \le n\)). If the hole is removed.

A structured sequent (or simply sequent) has the form \(\varGamma \Rightarrow F\) where \(\varGamma \) is a structure and F is a formula.

Example 2

Structures are binary trees, with formulas as leaves and commas as nodes. The structure \(\mathop {!}\nolimits ^{i}A,(B,C)\) represents the tree below left, while \((\mathop {!}\nolimits ^{i}A,B),C\) represents the tree below right

figure j

Definition 3

(SDML). Let \(\mathcal {A}\) be a set of axioms. A (non-associative/commutative) simply dependent multimodal logical system (\(\mathsf {SDML}\)) is given by a triple \(\varSigma =(I,\preccurlyeq ,f)\), where I is a set of indices, \((I,\preccurlyeq )\) is a pre-order, and f is a mapping from I to \(2^{\mathcal {A}}\).

If \(\varSigma \) is a \(\mathsf {SDML}\), then the logic described by \(\varSigma \) has the modality \(\mathop {!}\nolimits ^{i}\) for every \(i \in I\), with the rules of \(\mathsf {FNL}\) depicted in Fig. 1, together with rules for the axioms f(i) and the interaction axioms \(\mathop {!}\nolimits ^{j} A\rightarrow \mathop {!}\nolimits ^{i} A\) for every \(i,j \in I\) with \(i \preccurlyeq j\). Finally, every \(\mathsf {SDML}\) is assumed to be upwardly closed w.r.t. \(\preceq \), that is, if \(i\preceq j\) then \(f(i)\subseteq f(j)\) for all \(i,j\in I\).

Figure 2 presents the structured system \(\mathsf {acLL}_\varSigma \), for the logic described by the \(\mathsf {SDML}\) determined by \(\varSigma \), with \(\mathcal {A}=\{\mathsf {C},\mathsf {W},\mathsf {A1},\mathsf {A2},\mathsf {E}\}\) where, in the subexponential rule for \(\mathsf {S}\in \mathcal {A}\), the respective \(s\in I\) is such that \(\mathsf {S}\in f(s)\) (e.g. the subexponential symbol e indicates that \(\mathsf {E}\in f(e)\)). We will denote by \(\mathop {!}\nolimits ^{\mathsf {Ax}}\varDelta \) the fact that the structure \(\varDelta \) contains only banged formulas as leaves, each of them assuming the axiom \(\mathsf {Ax}\).

As an economic notation, we will write \({\uparrow ^{}\!\!\,{i}}\) for the upset of the index i, i.e., the set \(\{j \in I : i \preccurlyeq j\}\). We extend this notation to structures in the following way. Let \(\varGamma \) be a structure containing only banged formulas as leaves. If such formulas admit the multiset partition

$$ \{ \mathop {!}\nolimits ^{j}F\in \varGamma : i \preccurlyeq j\}\cup \{ \mathop {!}\nolimits ^{k}F \in \varGamma : i \not \preccurlyeq k \text{ and } \mathsf {W}\in f(k)\} $$

then \(\varGamma ^{{\uparrow ^{}\!\!\,{i}}}\) is the structure obtained from \(\varGamma \) by easing the formulas in the second component of the partition (equivalently, the substructure of \(\varGamma \) formed with all and only formulas of the first component of the partition). Otherwise, \(\varGamma ^{{\uparrow ^{}\!\!\,{i}}}\) is undefined.

Example 4

Let \(\varGamma =(\mathop {!}\nolimits ^{i}A,(\mathop {!}\nolimits ^{j}B,\mathop {!}\nolimits ^{k}C))\) be represented below left, \(i\preceq j\) but \(i\not \preceq k\), and \(\mathsf {W}\in f(k)\). Then \(\varGamma ^{{\uparrow ^{}\!\!\,{i}}}=(\mathop {!}\nolimits ^{i}A,\mathop {!}\nolimits ^{j}B)\) is depicted below right

figure k

Observe that, if \(\mathsf {W}\notin f(k)\), then \(\varGamma ^{{\uparrow ^{}\!\!\,{i}}}\) cannot be built. In this case, any derivation of \(\varGamma \Rightarrow \mathop {!}\nolimits ^{i}(A\otimes B)\) cannot start with an application of the promotion rule \(\mathop {!}\nolimits ^{i} R\) (similarly to how promotion in \(\mathsf {ILL}\) cannot be applied in the presence of non-classical contexts). In this case, if AB are atomic, this sequent would not be provable.

Example 5

The use of subexponentials to deal with associativity can be illustrated by the prefixing sequent \(A\rightarrow B\Rightarrow (C\rightarrow A)\rightarrow (C\rightarrow B)\): It is not provable for an arbitrary formula C, but if \(C=\mathop {!}\nolimits ^{a} C'\), then

Fig. 1.
figure 1

Structured system \(\mathsf {FNL}\) for non-associative, full Lambek calculus.

Fig. 2.
figure 2

Structured system \(\mathsf {acLL}_\varSigma \) for the logic described by \(\varSigma \).

2.1 Cut-Elimination

When it comes to the proof of cut-elimination for \(\mathsf {acLL}_\varSigma \), the cut reductions for the propositional connectives follow the standard steps for similar systems such as, e.g., Moot and Retoré’s system \(\mathsf {NL}\Diamond \) in [38, Chapter 5.2.2]. The case of structural rules, on the other hand, should be treated with care.

Theorem 6

If the sequent \(\varGamma \Rightarrow F\) is provable in \(\mathsf {acLL}_\varSigma \), then it has a proof with no instances of the rule \(\mathsf {mcut}\).

Proof

The most representative cases of cut reductions involving subexponentials are detailed next. In order to simplify the notation, when possible, the \(\mathsf {mcut}\) rule is presented in its simple form, with an 1-ary context.

Case \(\mathop {!}\nolimits ^{a}\)::

Suppose that

Since axioms are upwardly closed w.r.t. \(\preceq \), it must be the case that \(\varDelta _1^{{\uparrow ^{}\!\!\,{a}}}\) contains only formulas marked with subexponentials allowing associativity. All the other formulas in \(\varDelta _1\) can be weakened; this is guaranteed by the application of the rule \(\mathop {!}\nolimits ^{a}R\) in \(\pi _1\). Hence the derivation above reduces to

Case \(\mathop {!}\nolimits ^{c}\)::

Suppose that

Since \(\varDelta ^{{\uparrow ^{}\!\!\,{c}}}\) contains only formulas marked with subexponentials allowing contraction, the derivation above reduces to

Observe that here, as usual, the multicut rule is needed in order to reduce the cut complexity.

Case \(\mathop {!}\nolimits ^{i}R\)::

Suppose that

If \(j\not \preceq i\), then it should be the case that \(\mathsf {W}\in f(i)\) and , since \(\mathop {!}\nolimits ^{i}F\) will be weakened in the application of rule \(\mathop {!}\nolimits ^{j}R\). Hence, all formulas in \(\varDelta \) can be weakened as well and the reduction is

On the other hand, if \(j\preceq i\), by transitivity all the formulas in \(\varDelta ^{{\uparrow ^{}\!\!\,{i}}}\) also have this property (implying that \(\varDelta ^{{\uparrow ^{}\!\!\,{i}}}\) is a substructure of \(\varDelta ^{{\uparrow ^{}\!\!\,{j}}}\)), and the rest of formulas of \(\varDelta \) can be weakened. Hence the derivation above reduces to

The other cases for subexponentials are similar or simpler.    \(\square \)

The next examples illustrate what we mean by \(\mathsf {acLL}_\varSigma \) being a “conservative extension” of subsystems and variants. Indeed, although we remove structural properties of the core \(\mathsf {LL}\), subexponentials allow them to be added back, either locally or globally.

Example 7

(Structural variants of \(\mathsf {iMALL}\)). Adding combinations of contraction \(\mathsf {C}\) and / or weakening \(\mathsf {W}\) for arbitrary formulas to additive-multiplicative intuitionistic linear logic (\(\mathsf {iMALL}\)) yields, respectively, propositional intuitionistic logic \(\mathsf {ILP}= \mathsf {iMALL}+ \{\mathsf {C},\mathsf {W}\}\), and the intuitionistic versions of affine linear logic \(\mathsf {aLL}= \mathsf {iMALL}+ \mathsf {W}\) and relevant logic \(\mathsf {R}= \mathsf {iMALL}+ \mathsf {C}\). For the sake of presentation we overload the notation and use the connectives of linear logic also for these logics. In order to embed the logics above into \(\mathsf {acLL}_\varSigma \), let \(\alpha \in \{ \mathsf {ILP}, \mathsf {aLL},\mathsf {R}\}\) and consider modalities \(\mathop {!}\nolimits ^{\alpha }\)  with \(f(\alpha ) = \{ \mathsf {E},\mathsf {A1},\mathsf {A2}\} \cup \mathcal {A}\) where \(\mathcal {A} \subseteq \{ \mathsf {C},\mathsf {W}\}\) is the set of axioms whose corresponding rules are in \(\alpha \). The translation \(\tau _\alpha \) prefixes every subformula with the modality \(\mathop {!}\nolimits ^{\alpha }\). For \(\mathcal {L} \in \{ \mathsf {ILP}, \mathsf {aLL}, \mathsf {R}\}\) it is then straightforward to show that a structured sequent S is cut-free derivable in \(\mathcal {L}\) iff its translation \(\tau _\alpha (S)\) is cut-free derivable in the logic described by \((\{\alpha \}, \preccurlyeq , f)\) with \(\preccurlyeq \) the obvious relation, and f as given above.

Example 8

(Structural variants of \(\mathsf {FNL}\)). Following the same script as above and starting from \(\mathsf {FNL}\):

  • considering \(f(\alpha ) =\mathcal {A} \subseteq \{ \mathsf {E},\mathsf {A1},\mathsf {A2}\}\);

    • If \(\mathcal {A} = \{\mathsf {A1},\mathsf {A2}\} \), then we obtain the system \(\mathsf {FL}\);

    • If \(\mathcal {A} =\{ \mathsf {E},\mathsf {A1},\mathsf {A2}\}\) then the resulting system corresponds to \(\mathsf {iMALL}\).

    • Adding \(\mathsf {C},\mathsf {W}\) as options to \(\mathcal {A}\) will result the affine/relevant versions of the systems above.

  • in a pre-order \((I,\preceq )\), if \(f(i) = \{ \mathsf {A1},\mathsf {A2}\} \cup \mathcal {A}_i\) where \(\mathcal {A}_i \subseteq \{ \mathsf {E},\mathsf {C},\mathsf {W}\}\) for each \(i\in I\), then the resulting system corresponds to \(\text{ SMALC}_{\varSigma }\) in [21] (that is, the extension of \(\mathsf {FL}\) with subexponentials).

2.2 An Example in Linguistics

Since its inception, Lambek calculus [29] has been applied to the modeling of natural language syntax by means of categorial grammars. In a categorial grammar, each word is assigned one or several Lambek formulas, which serve as syntactic categories. For a simple example, John and Mary are assigned np (“noun phrase”) and loves gets \((np \rightarrow s) \leftarrow np\). Here s stands for “sentence”, and loves is a transitive verb, which lacks noun phrases on both sides to become a sentence. Grammatical validity of “John loves Mary” is supported by derivability of the sequent \(np, (np \rightarrow s) \leftarrow np, np \Rightarrow s\). Notice that this derivability keeps valid also in the non-associative setting, if the correct nested structure is provided: \((np, ((np \rightarrow s) \leftarrow np, np)) \Rightarrow s\).

The original Lambek calculus \(\mathsf {L}\) is associative. In some cases, however, associativity leads to over-generation, i.e., validation of grammatically incorrect sentences. Lambek himself realized this and proposed the non-associative calculus \(\mathsf {NL}\) in [30]. We will illustrate this issue with the example given in [38, Sect. 4.2.2]. The syntactic category assignment is as follows (where n stands for “noun”):

figure l

With this assignment, sentences “The Hulk is green” and “The Hulk is incredible” are correctly marked as valid, by deriving the sequent

$$ (np \leftarrow n, n), ((np \rightarrow s) \leftarrow (n \leftarrow n), n \leftarrow n) \Rightarrow s $$

However, in the associative setting the sequent for the phrase “The Hulk is green incredible,” which is grammatically incorrect, also becomes derivable:

$$ np \leftarrow n, n, (np \rightarrow s) \leftarrow (n \leftarrow n), n \leftarrow n, n \leftarrow n \Rightarrow s, $$

essentially due to derivability of \(n \leftarrow n, n \leftarrow n \Rightarrow n \leftarrow n\).

In other situations, however, associativity is useful. Standard examples include handling of dependent clauses, e.g., “the girl whom John loves,” which is validated as a noun phrase by the following derivable sequent:

$$ np \leftarrow n, n, (n \rightarrow n) \leftarrow (s \leftarrow np), np, (np \rightarrow s) \leftarrow np \Rightarrow np $$

Here \((n \rightarrow n) \leftarrow (s \leftarrow np)\) is the syntactic category for who.

Our subexponential extension of \(\mathsf {NL}\), however, handles this case using local associativity instead of the global one. Namely, the category for whom now becomes \((n \rightarrow n) \leftarrow (s \leftarrow {!}^a np)\), where \({!}^a\) is a subexponential which allows the \(\mathsf {A2}\) rule, and the following sequent is happily derivable:

$$ np \leftarrow n, (n, ((n \rightarrow n) \leftarrow (s \leftarrow {!}^a np), (np, (np \rightarrow s) \leftarrow np))) \Rightarrow np $$

The necessity of this more fine-grained control of associativity, instead of a global associativity rule, is seen via a combination of these examples. Namely, we talk about sentences like “The superhero whom Hawkeye killed was incredible” and “... was green”. With \({!}^a\), each of them is handled in the same way as the previous examples:

$$\begin{aligned}&(np \leftarrow n, (n, ((n \rightarrow n) \leftarrow (s \leftarrow {!}^a np), (np, (np \rightarrow s) \leftarrow np)))), \\&\qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \quad \,\,\,\, ((np \rightarrow s) \leftarrow (n \leftarrow n), n \leftarrow n) \Rightarrow s. \end{aligned}$$

On one hand, without \({!}^a\) this sequent cannot be derived in the non-associative system. On the other hand, if we make the system globally associative, it would validate incorrect sentences like “The superhero whom Hawkeye killed was green incredible.”

3 Some Undecidability Results

Non-associativity makes a significant difference in decidability and complexity matters. For example, while \(\mathsf {L}\) is NP-complete [47], \(\mathsf {NL}\) is decidable in polynomial time [1, 14].

For our system \(\mathsf {acLL}_\varSigma \), its decidability or undecidability depends on its signature \(\varSigma \). In fact, we have a family of different systems \(\mathsf {acLL}_\varSigma \), with \(\varSigma \) as a parameter. Recall that the subexponential signature \(\varSigma \) controls not just the number of subexponentials and the preorder among them. More importantly, it dictates, for each subexponential, which structural rules this subexponential licenses. If for every \(i\in I\) we have \(\mathsf {C}\notin f(s)\), that is, no subexponential allows contraction, then \(\mathsf {acLL}_\varSigma \) is clearly decidable, since the cut-free proof search space is finite. Therefore, for undecidability it is necessary to have at least one subexponential which allows contraction.

For a non-associative system with only one fully-powered exponential modality s (that is, \(f(s) = \{ \mathsf {E}, \mathsf {C}, \mathsf {W}, \mathsf {A1}, \mathsf {A2} \}\)), undecidability was proven in a preprint by Tanaka [51], based on Chvalovský’s [11] result on undecidability of the finitary consequence relation in \(\mathsf {FNL}\).

In this section, we prove two undecidability results. The first one is a refinement of Tanaka’s result: We establish undecidability with at least one subexponential which allows contraction and weakening (commutativity/associativity are optional), in a subsystem containing only the additive connective \(\oplus \) and the multiplicatives \(\otimes \) and \(\rightarrow \).

The second undecidability result is for the minimalistic, purely multiplicative fragment, which includes only \(\rightarrow \) (not even \(\otimes \)). As a trade-off, however, it requires two subexponentials: the “main” one, which allows contraction, exchange, and associativity (weakening is optional), and an “auxiliary” one, which allows only associativity.

It should be noted that this undecidability result is orthogonal to Tanaka’s [51], and the proof technique is essentially different. Indeed, Chvalovský’s undecidability theorem does not hold for the non-associative Lambek calculus without additives, where the consequence relation is decidable [7].

Finally, we observe that if the intersection of these systems is decidable (which is still an open question), then our two undecidability results are incomparable: we have two undecidable fragments of \(\mathsf {acLL}_\varSigma \), but their common part, which includes only divisions and one exponential, would be decidable.

3.1 Undecidability with Additives and One Subexponential

We are going to derive the next theorem from undecidability of the finitary consequence relation in \(\mathsf {FNL}\) [11]. Recall that \(\mathsf {FNL}\) is, in fact, the fragment of \(\mathsf {acLL}_\varSigma \) without subexponentials (that is, with an empty I).

Theorem 9

If there exists such \(s \in I\) that \(f(s) \supseteq \{ \mathsf {C}, \mathsf {W} \}\), then the derivability problem in \(\mathsf {acLL}_\varSigma \) is undecidable. Moreover, this holds for the fragment with only \(\otimes \), \(\rightarrow \), \(\oplus , {!}^s\).

In fact, using \(\mathsf {C}\) and \(\mathsf {W}\), one can also derive \(\mathsf {A1}\), \(\mathsf {A2}\), \(\mathsf {E1}\), and \(\mathsf {E2}\). Therefore, if \(f(s) \supseteq \{ \mathsf {C}, \mathsf {W} \}\), then \(!^s\) is actually a full-power exponential modality. (In the proof of Theorem 9 below, we use only \(\mathsf {W}\) and \(\mathsf {C}\) rules, in order to avoid confusion.) However, Theorem 9 does not directly follow from undecidability of propositional linear logic [32], because here the basic system is non-associative and non-commutative, while linear logic is both associative and commutative. Thus, we need a different encoding for undecidability.

Let \(\mathbf {\Phi }\) be a finite set of \(\mathsf {FNL}\) sequents. By \(\mathsf {FNL}(\mathbf {\Phi })\) let us denote \(\mathsf {FNL}\) extended by adding sequents from \(\mathbf {\Phi }\) as additional (non-logical) axioms. In general, \(\mathsf {FNL}(\mathbf {\Phi })\) does not enjoy cut-elimination, so \(\mathsf {mcut}\) is kept as a rule of inference in \(\mathsf {FNL}(\mathbf {\Phi })\). A sequent \(\varGamma \Rightarrow F\) is called a consequence of \(\mathbf {\Phi }\) if this sequent is derivable in \(\mathsf {FNL}(\mathbf {\Phi })\).

Theorem 10

(Chvalovský [11]). The consequence relation in \(\mathsf {FNL}\) is undecidable, that is, there exists no algorithm which, given \(\mathbf {\Phi }\) and \(\varGamma \Rightarrow F\), determines whether \(\varGamma \Rightarrow F\) is a consequence of \(\mathbf {\Phi }\). Moreover, undecidability keeps valid when \(\mathbf {\Phi }\) and \(\varGamma \Rightarrow F\) are built from variables using only \(\otimes \) and \(\oplus \).

Now, in order to prove Theorem 9, we internalize \(\mathbf {\Phi }\) into the sequent using \({!}^s\), assuming \(f(s) \supseteq \{ \mathsf {C}, \mathsf {W} \}\).

First we notice that we may suppose, without loss of generality, that all sequents in \(\mathbf {\Phi }\) are of the form \(\Rightarrow A\), that is, have empty antecedents. Namely, each sequent of the form \(\varPi \Rightarrow B\) can be replaced by \(\Rightarrow (\bigotimes \varPi ) \rightarrow B\), where \(\bigotimes \varPi \) is obtained from \(\varPi \) by replacing each comma with \(\otimes \). Indeed, these sequents are derivable from one another: from \(\varPi \Rightarrow B\) to \(\Rightarrow (\bigotimes \varPi ) \rightarrow B\) we apply a sequence of \(\otimes L\) followed by \(\rightarrow R\), and for the other direction we apply a series of cuts, first with \( (\bigotimes \varPi , (\bigotimes \varPi ) \rightarrow B) \Rightarrow B\), and then with \((F, G) \Rightarrow F \otimes G\) several times, for the corresponding subformulas of \(\bigotimes \varPi \). The following embedding lemma (“modalized deduction theorem”) holds.

Lemma 11

The sequent \(\varGamma \Rightarrow F\) is a consequence of \(\mathbf {\Phi } = \{ {} \Rightarrow A_1, \ldots , {} \Rightarrow A_n \}\) if and only if the sequent \(\bigl ((\ldots (({!}^s A_1, {!}^s A_2), {!}^s A_3), \ldots , {!}^s A_n), \varGamma \bigr ) \Rightarrow F\) is derivable in \(\mathsf {acLL}_\varSigma \).

Proof

Let us denote \((\ldots (({!}^s A_1, {!}^s A_2), {!}^s A_3), \ldots , {!}^s A_n)\) by \(!\varPhi \). Notice that \(\mathsf {C}\) and \(\mathsf {W}\) can be applied to \(!\varPhi \) as a whole; this is easily proven by induction on n.

For the “only if” direction let us take the derivation of \(\varGamma \Rightarrow F\) in \(\mathsf {FNL}(\mathbf {\Phi })\) (with cuts) and replace each sequent of the form \(\varDelta \Rightarrow G\) in it with \((!\varPhi , \varDelta ) \Rightarrow G\), and each sequent of the form \(\Rightarrow G\) with \(!\varPhi \Rightarrow G\). The translations of non-logical axioms from \(\mathbf {\Phi }\) are derived as follows:

Translations of axioms \(\mathsf {init}\) and 1R are derived from the corresponding original axioms by \(\mathsf {W}\), n times; \(\top R\) remains valid.

Rules \(\otimes L\), \(\oplus L\), \(\oplus R_i\), \( { \& } L_i\), \( { \& } R\), and 1L remain valid. For \(\rightarrow L\), \(\leftarrow L\), and \(\mathsf {mcut}\) we contract \(!\varPhi \) as a whole:

For \(\otimes R\), \(\rightarrow R\), and \(\leftarrow R\), we combine contraction and weakening:

Notice that our original derivation was in \(\mathsf {FNL}(\mathbf {\Phi })\), so it does not include rules operating subexponentials.

For the “if” direction we take a cut-free proof of \(({!}\varPhi , \varGamma ) \Rightarrow F\) in \(\mathsf {acLL}_\varSigma \) and erase all formulas which include the subexponential. In the resulting derivation tree all rules and axioms, except those which operate \({!}^s\), remain valid. Structural rules for \({!}^s\) trivialize (since the !-formula was erased). The \({!}^s R\) rule could not have been used, since we do not have positive occurrences of \({!}^s F\), and our proof is cut-free.

Finally, \(\mathsf {der}\) translates into

This is modeled by cut with one of the sequents from \(\mathbf {\Phi }\):

Thus, we get a correct derivation in \(\mathsf {FNL}(\mathbf {\Phi })\).    \(\square \)

Theorem 10 and Lemma 11 immediately yield Theorem 9.

3.2 Undecidability Without Additives and with Two Subexponentials

Theorem 12

If there are \(a,c \in I\) such that \(f(a) = \{ \mathsf {A1}, \mathsf {A2} \}\) and \(f(c) \supseteq \{ \mathsf {C}, \mathsf {E}, \mathsf {A1}, \mathsf {A2} \}\), then the derivability problem in \(\mathsf {acLL}_\varSigma \) is undecidable. Moreover, this holds for the fragment with only \(\rightarrow \), \({!}^a\), and \({!}^c\).

Remember from Example 8 that \(\mathsf {SMALC}_\varSigma \) [21] denotes the extension of \(\mathsf {FL}\) with subexponentials. The undecidability theorem above is proved by encoding the one-division fragment of \(\mathsf {SMALC}_\varSigma \) containing one exponential c such that \(f(c) \supseteq \{ \mathsf {C}, \mathsf {E} \}\). It turns out that that such a system is undecidable.

Theorem 13

(Kanovich et al. [22, 23]). If there exists such \(c \in I\) that \(f(c) \supseteq \{ \mathsf {C}, \mathsf {E} \}\), then the derivability problem in \(\mathsf {SMALC}_\varSigma \) is undecidable. Moreover, this holds for the fragment with only \(\rightarrow \) and \({!}^c\).

Observe that \(\mathsf {SMALC}_\varSigma \) can be obtained from \(\mathsf {acLL}_\varSigma \) by adding “global” associativity rules:

The usual formulation of \(\mathsf {SMALC}_\varSigma \), of course, uses sequences of formulas instead of nested structures as antecedents. The alternative formulation, however, would be more convenient for us now. It will be also convenient for us to regard all subexponentials in \(\mathsf {SMALC}_\varSigma \) to be associative, that is, \(f(s) \supseteq \{ \mathsf {A1}, \mathsf {A2} \}\) for each \(s \in I\).

In order to embed \(\mathsf {SMALC}_\varSigma \) into \(\mathsf {acLL}_\varSigma \), we define two translations, \(A^{!-}\) and \(A^{!+}\), by mutual recursion:

$$ \begin{aligned}&z^{!-} = {!}^a z \qquad \qquad \qquad \qquad \quad \,\,\,\, z^{!+} = z \qquad \qquad \qquad \qquad \quad \text {where }z \text { is a variable,} 1\text {, or }\top \\&(A \rightarrow B)^{!-} = {!}^a (A^{!+} \rightarrow B^{!-}) \quad (A \rightarrow B)^{!+} = A^{!-} \rightarrow B^{!+} \\&(B \leftarrow A)^{!-} = {!}^a (B^{!-} \leftarrow A^{!+}) \quad (B \leftarrow A)^{!+} = B^{!+} \leftarrow A^{!-} \\&(A \circledast B)^{!-} = {!}^a (A^{!-} \circledast B^{!-}) \quad \,\,\,\, (A \circledast B)^{!+} = A^{!+} \circledast B^{!+} \qquad \text {where }\circledast \in \{ \otimes , \oplus , \& \}\\&({!}^s A)^{!-} = {!}^s (A^{!-}) \qquad \qquad \qquad ({!}^s A)^{!+} = {!}^s (A^{!+}) \end{aligned}$$

Informally, our translation adds a \({!}^a\) over any formula (not only over atoms) of negative polarity, unless this formula was already marked with a \({!}^s\). Thus, all formulae in antecedents would begin with either the new subexponential \({!}^a\) or one of the old subexponentials \({!}^s\), and all these subexponentials allow associativity rules \(\mathsf {A1}\) and \(\mathsf {A2}\).

Lemma 14

A sequent \(A_1, \ldots , A_n \Rightarrow B\) is derivable in \(\mathsf {SMALC}_\varSigma \) if and only if its translation \(( \ldots (A_1^{!-}, A_2^{!-}), \ldots , A_n^{!-}) \Rightarrow B^{!+}\) is derivable in \(\mathsf {acLL}_\varSigma \).

Proof

For the “only if” part, let us first note that each formula \(A_i^{!-}\) is of the form \({!}^s F\) and \(\mathsf {A1}, \mathsf {A2} \in f(s)\). Indeed, either s is an “old” subexponential label (for which we added \(\mathsf {A1},\mathsf {A2}\)) or \(s = a\). Thus brackets can be freely rearranged in the antecedent.

Now we take a cut-free proof of \(A_1, \ldots , A_n \Rightarrow B\) in \(\mathsf {SMALC}_\varSigma \) and replace each sequent in it with its translation. Right rules for connectives other than subexponentials, i.e., \(\otimes R\), \(\oplus R_i\), \( { \& } R\), \(\rightarrow R\), and \(\leftarrow R\), remain valid as they are, up to rearranging brackets in antecedents. For \({!}^i R\), we notice that the translation of a formula of the form \({!}^j F\), where \(j \preceq i\), is also a formula of the form \({!}^j F'\). Thus, this rule also remains valid. The same holds for the dereliction rule \(\mathsf {der}\), because \(({!}^i F)^{!-}\) is exactly \({!}^i (F^{!-})\). Finally, the “old” structural rules (exchange, contraction, weakening) also remain valid (up to rearranging of brackets), since \({!}^i F\) gets translated into \({!}^i (F^{!-})\), which enjoys the same structural rules.

For the other left rules, we need to derelict \({!}^a\) first, and then perform the corresponding rule application. Rearrangement of brackets, if needed, is performed below dereliction or above the application of the rule in question.

The “if” part is easier. Given a derivation of \(( \ldots (A_1^{!-}, A_2^{!-}), \ldots , A_n^{!-}) \Rightarrow B^{!+}\) in \(\mathsf {acLL}_\varSigma \), we erase \({!}^a\) everywhere, and consider it as a derivation in \(\mathsf {SMALC}_\varSigma \). Associativity rules for the erased \({!}^a\) (which are the only structural rules for this subexponential) keep valid, because now associativity is global. Dereliction and right introduction for \({!}^a\) trivialize. All other rules, which do not operate \({!}^a\), remain as they are. Thus, we get a derivation of \(A_1, \ldots , A_n \Rightarrow B\) in \(\mathsf {SMALC}_\varSigma \), since erasing \({!}^a\) makes our translations just identical.    \(\square \)

4 Related Work and Conclusion

In this paper, we have presented \(\mathsf {acLL}_\varSigma \), a sequent-based system for non-associative, non-commutative linear logic with subexponentials. Starting form \(\mathsf {FNL}\), we modularly and uniformly added rules for exchange, associativity, weakening and contraction, which can be applied with the subexponentials having with the respective features. This allows for the application of structural rules locally, and it conservatively extends well known systems in the literature, continuing the path of controlling structural properties started by Girard himself [16].

Another approach to combining associative and non-associative behavior in Lambek-style grammars is the framework of the Lambek calculus with brackets by Morrill [39, 40] and Moortgat [34]. The bracket approach is dual to ours: there the base system is associative, and brackets, which are controlled by bracket modalities, introduce local non-associativity. Both the associative Lambek calculus and the non-associative Lambek calculus can be embedded into the Lambek calculus with brackets: the former is just by design of the system and the latter was shown by Kurtonina [26] by constructing a translation.

From the point of view of generative power, however, the (associative) Lambek calculus with brackets is weaker than the non-associative system with subexponentials, which is presented in this paper. Namely, as shown by Kanazawa [19], grammars based on the Lambek calculus with brackets can generate only context-free languages. In contrast, grammars based on our system with subexponentials go beyond context-free languages, even when no subexponential allows contraction (subexponentials allowing contraction may lead to undecidability, as shown in the last section).

As a quick example, let us consider a subexponential \({!}^{ae}\) which allows both associativity (\(\mathsf {A1}\) and \(\mathsf {A2}\)) and exchange (\(\mathsf {E}\)). If we put this subexponential over any (sub)formula, the system becomes associative and commutative. Using this system, one can describe the non context-free language \(\text {MIX}_3\), which contains all non-empty words over \(\{a,b,c\}\), in which the numbers of a, b, and c are equal. Indeed, \(\text {MIX}_3\) is the permutation closure of the language \(\{ (abc)^n \mid n \ge 1 \}\). The latter is regular, therefore context-free, and therefore definable by a Lambek grammar. The ability of our system to go beyond context-free languages is important from the point of view of applications, since there are known linguistic phenomena which are essentially non-context-free [49].

Regarding decidability, let us compare our results with the more well-known associative non-commutative and associative commutative cases.

In the associative and commutative case the situation is as follows. In the presence of additives, the system is known to be undecidable with one exponential modality [32]. Without additives, we get \(\mathsf {MELL}\), the (un)decidability of which is a well-known open problem [50]. However, with two subexponentials \(\mathsf {MELL}\) again becomes undecidable [9]. Thus, we have the same trade-off as in our non-associative non-commutative case: for undecidability one needs either additives, or two subexponentials.

Our results help to shed some light in the (un)decidability problem for the spectrum of logical systems surrounding \(\mathsf {MELL}/\mathsf {FNL}\), allowing for a fine-grained analysis of the problem, specially the trade-offs on connectives and subexponentials for guaranteeing (un)decidability.

There is a lot to be done from now on. First of all, we would like to analyze better the minimalist fragment of \(\mathsf {acLL}_\varSigma \) containing only implication and one fully-powered subexponential, as it seems to be crucial for understanding the lower bound of undecidability (or the upper bound of decidability). Second, one should definitely explore more the use of \(\mathsf {acLL}_\varSigma \) in modeling natural language syntax. The examples in Sect. 2.2 show how to locally combine sentences with different grammatical characteristics, and the \(\text {MIX}_3\) example above illustrates how that can be of importance. That is, it would be interesting to have a formal study about \(\mathsf {acLL}_\varSigma \) and categorial grammars. Third, we plan to investigate the connections between our work and Adjoint logic [48] as well as with Display calculus [5, 12]. Finally, we intend to study proof-theoretic properties of \(\mathsf {acLL}_\varSigma \), such as normalization of proofs (e.g. via focusing) and interpolation.