1 Introduction

A semantic theory should exactly capture meaning, telling us exactly what the meanings of the object language expressions are. This entails that the theory should characterise meaning uniquely and conspicuously; it needs to provide characterisations of object language expressions that no non-synonymous expressions satisfy, and it needs to do so in a transparent way. One way of getting at the notion of exactly capturing meaning is in terms of knowledge: if a theory exactly captures meaning, then knowing the theory would suffice for knowledge of meaning. Or, to put it the other way around, if knowing the theory was insufficient for knowledge of meaning—if you could know the theory while failing to know what the expressions of the object language mean—then the theory does not manage to exactly capture meaning. Thus, we can also say that a semantic theory should be such that knowledge of the theory would suffice for knowledge of meaning.Footnote 1

A large swath of mainstream approaches fail to exactly capture meaning. This goes not only for simple extensional truth-theoretic accounts along broadly Davidsonian lines and their semantic value-based cousins, but also for intensional and model-theoretic approaches which relativise truth to possible worlds, situations, or interpretations. All of these approaches suffer from the information problem: they are too weak to exactly capture meaning and thus unable to ground knowledge of meaning. This is our negative goal in the current paper: to hammer home that the information problem remains an unsolved foundational problem.

Nevertheless, it is hard not be impressed by the progress that has been made within formal semantics. If we had to abandon this body of work, that would be a great loss. Can the foundations be repaired without jeopardising the building? This is our positive goal: to show that the information problem can be addressed in a way which satisfies the worrying philosopher without disrupting the daily business of working semanticists. To this end, we will suggest that semantic theories (and linguistic theories more broadly) should be construed as theories of constitutive linguistic essences, and that semantic knowledge (and linguistic knowledge more broadly) should be construed as knowledge of the linguistic essences of one’s language.

Before going into the details, we present a bird’s-eye view of our overall argument. Let S be the German sentence ‘Sokrates ist weise’, which means that Socrates is wise. In its simplest incarnation, the information problem applies to the proposal that a proper semantic theory merely says such things as: S is true iff Socrates is wise. Note that you could have this knowledge about sentences that do not mean that Socrates is wise, e.g. about a sentence which means that Plato is a philosopher (or any other true sentence). Hence, knowing that S is true iff Socrates is wise cannot suffice to know that it means that Socrates is wise. This, roughly, is the information problem as it applies to simple extensional truth theories. Moving to theories that employ semantic values, or that relativise truth to possible worlds, situations, or interpretations does not eliminate the basic problem. We develop these considerations in more detail in Sect. 2.

The only way out is to go beyond the information provided by standard semantic theories. But this does not mean that we have to abandon the whole framework. For it does seem right that the information that S is true iff Socrates is wise captures something essential about the meaning of the German sentence. It is just that this information by itself is not enough. How should the theory be strengthened? An analogy can provide guidance. Someone who knows merely that all moving objects continue to move unless acted upon by a force knows less than someone who knows that it is a law of physics that moving objects behave in this way. Similarly, knowing that it is essential to S to be true iff Socrates is wise goes beyond merely knowing that it is true iff Socrates is wise. This, we suggest, points the way to a solution of the information problem: linguistic theories should explicitly be construed as theories about linguistic essences. More specifically, we will argue that the information problem can be solved by embedding the axioms of standard semantic theories under an operator expressing (something closely related to) Kit Fine’s notion of relative constitutive essence. We develop our positive proposal in Sect. 3, focussing on simple predication and sentential connectives. Section 4 concludes the paper with a brief outloook, and a technical appendix shows how our approach can be applied to first-order quantification.

2 The information problem

This section introduces the information problem and illustrates how far reaching it is. We will frame the problem in terms of a condition of adequacy on semantic theories: they should exactly capture meaning so that knowledge of such a theory suffices for knowledge of meaning. The information problem consists in the fact that mainstream approaches in current semantics are unable to satisfy this condition. Section 2.1 introduces the condition and illustrates how we can show that a given theory fails to meet it. Section 2.2 indicates the scope of the problem. Section 2.3 distinguishes the information problem from two related problems: overgeneration and granularity. Finally, Sect. 2.4 prepares the development of our positive proposal by briefly addressing existing responses to the information problem.

2.1 What is the information problem?

This section develops three claims: first, that semantic theories need to exactly capture the meanings of object language expressions. Second, that in order to do so, they need to be uniquely characterising as to these meanings. And third, that the notion of exactly capturing meaning can be fruitfully construed in terms of information that would suffice for knowledge of meaning. Theories that do not provide information sufficient for knowledge of meaning suffer from what we will call the information problem.Footnote 2 Apart from making the information problem more precise by linking it to unique characterisation, one point of our paper is to emphasise that the problem extends far beyond the Davidsonian programme and afflicts a wide swath of mainstream approaches (Sect. 2.2). Another point is that, even with respect to the Davidsonian programme, it has never been addressed in a satisfying manner (Sect. 2.4).

Let \({\mathcal {T}}\) be a candidate theory for some language \({\mathcal {L}}\), and let \(\alpha \) be some \({\mathcal {L}}\)-expression. We take \({\mathcal {T}}\) to be adequate only if \({\mathcal {T}}\) exactly captures the meaning of \(\alpha \). What does it mean to exactly capture the meaning of an expression? Roughly speaking, it means that the theory should tell us exactly and transparently what the meaning of the expression is. In order to be able to show that specific theories fail to meet this demand, we point out a necessary condition for exactly capturing meaning on which we will rely in subsequent arguments. Let \(\Phi (\alpha , {\mathcal {L}})\) be the totality of information \({\mathcal {T}}\) provides about \(\alpha \). If \(\Phi (\ldots )\) could also hold of a non-synonymous expression \(\beta \) of some language \(\mathcal {L^*}\)—i.e. if \(\Phi (\beta , \mathcal {L^*})\) is true for some \(\beta \) that differs in meaning from \(\alpha \)—then the theory leaves open what \(\alpha \) means; for all the theory says, it could mean what \(\beta \) means. We will say that such a theory fails to be uniquely characterising as to meaning. In order to exactly capture the meaning of an expression, a theory has to be at least uniquely characterising.

What unique characterisation amounts to and how it can best be tested depends on the underlying conception of languages and expressions and the format of semantic theories.Footnote 3 We roughly sketch one approach, relying on an abundant conception of languages, the view that simple expressions receive individual axioms, and the view that the information provided by a theory is fully determined by the theory’s axioms. For any finitely axiomatised theory \({\mathcal {T}}\), let \(\bigwedge {\mathcal {T}}\) be the conjunction of \({\mathcal {T}}\)’s axioms. For any singular terms \(t_1, \ldots , t_n\) and \(u_1, \ldots , u_n\), let \(\bigwedge {\mathcal {T}}_{t_1/u_1, \ldots , t_n/u_n}\) be the result of replacing every occurrence of \(t_i\) in \(\bigwedge {\mathcal {T}}\) by an occurrence of \(u_i\). We assume some canonical way of denoting expressions and languages, and we will write \({\overline{\alpha }}\) and \(\overline{{\mathcal {L}}}\) for the canonical designator of an expression \(\alpha \) and language \({\mathcal {L}}\), respectively. Given this approach, a theory \({\mathcal {T}}\) for a language \({\mathcal {L}}\) will be uniquely characterising for a simple expression \(\alpha \) iff there is no language \(\mathcal {L^*}\) containing an expression \(\beta \) such that \(\alpha \) and \(\beta \) are non-synonymous while \(\bigwedge {\mathcal {T}}_{{\overline{\alpha }}/{\overline{\beta }},{\overline{\mathcal {L}}}/\overline{\mathcal {L}^{*}}}\) is true. Correspondingly, \({\mathcal {T}}\) will be uniquely characterising for a complex expression \(\alpha = {\mathcal {S}}(\alpha _1, \ldots , \alpha _n)\) built from the simple expressions \(\alpha _1, \ldots , \alpha _n\) by a syntactic operation \({\mathcal {S}}(\ldots )\) iff there is no language \({\mathcal {L}}^*\) containing an expression \(\beta = {\mathcal {S}}(\beta _1, \ldots , \beta _n)\) such that \(\alpha \) and \(\beta \) are non-synonymous while \(\bigwedge {\mathcal {T}}_{\overline{\alpha _1}/\overline{\beta _1}, \ldots , \overline{\alpha _n}/\overline{\beta _n},{\overline{\mathcal {L}}}/\overline{\mathcal {L}^{*}}}\) is true.

In Sect. 2.3 we will show that a unique but non-transparent characterisation might fail to exactly capture meaning. To sharpen the latter notion, we add the assumption that a central goal of linguistic semantics is to contribute to an explanation of knowledge of meaning. In particular, we assume that knowledge of an adequate semantics \({\mathcal {T}}\) would suffice to know, of every \({\mathcal {L}}\)-expression, what it means, and that the notion of exactly capturing meaning can be fruitfully construed in terms of knowledge of meaning: \({\mathcal {T}}\) exactly captures the meaning of \(\alpha \) iff knowledge of \({\mathcal {T}}\) would suffice to know what \(\alpha \) means. Accordingly, we will freely switch between talk of exactly capturing meaning and talk of providing information sufficient for knowledge of meaning.

To sum up: a theory \({\mathcal {T}}\) is an adequate semantic theory for a language \({\mathcal {L}}\) only if \({\mathcal {T}}\) exactly captures the meanings of \({\mathcal {L}}\)-expressions. This, in our view, comes to the same thing as saying that knowledge of \({\mathcal {T}}\) would suffice for knowing what \({\mathcal {L}}\)-expressions mean. Theories that fail this condition suffer from the information problem. One way of showing that a theory fails this condition is to show that it is not uniquely characterising.

2.2 How far reaching is the problem?

Given the previous section, we can establish the inadequacy of a candidate theory \({\mathcal {T}}\) for some language \({\mathcal {L}}_1\) by showing that, for some \({\mathcal {L}}_1\)-expression \(\alpha _1\), there is an expression \(\alpha _2\) of some language \({\mathcal {L}}_2\) such that, while \(\alpha _1\) and \(\alpha _2\) share all properties that \({\mathcal {T}}\) ascribes to \(\alpha _1\), they are non-synonymous. In order to apply this strategy, we need to add some assumptions about the availability of non-synonymous expressions. Semantic theories combine information about simple expressions, about syntactic structure of complex expressions, and about the semantic effects of syntactic composition. Thus, if \(\alpha _1\) is complex, it will not suffice that \(\alpha _2\) instantiates some central semantic property that the theory ascribes to \(\alpha _1\) (e.g. contribution to truth conditions). Rather, \(\alpha _2\) will have to be semantically isomorphic with \(\alpha _1\). For instance, let \(\alpha _1 =\) ‘equilateral triangle’ and \(\alpha _2 =\) ‘equiangular triangle’. Then \(\alpha _1\) and \(\alpha _2\) are intensionally equivalent, but non-synonymous. Nevertheless, an intensional semantic theory will still manage to provide a characterisation of \(\alpha _1\) that is not satisfied by \(\alpha _2\), given that it must construe \(\alpha _1\), roughly, as built up of morphemes ‘equi’, ‘lateral’, ‘tri’, ‘angle’, each with its own semantic properties. Hence, the easiest way of applying the above strategy is in cases where \(\alpha _1\) and \(\alpha _2\) are both simple. Accordingly, we want assumptions that ensure the availability of non-synonymous simple expressions.

We make three assumptions. First, we assume that, for every language \({\mathcal {L}}_1\) and for every complex predicate \(\alpha _1\) of \({\mathcal {L}}_1\), there is a synonymous simple predicate \(\alpha _2\) of some language \({\mathcal {L}}_2\). For instance, ‘unmarried man’ is complex and synonymous with the simple ‘bachelor’. Our assumption says that such pairings can always be found, even if not within the same language. Second, we assume that for all languages \({\mathcal {L}}_1\) and \({\mathcal {L}}_2\), containing the simple predicates \(\alpha _1\) and \(\alpha _2\), respectively, there is a language \({\mathcal {L}}_3\) which differs from \({\mathcal {L}}_1\) only in containing \(\alpha _2\) (meaning in \({\mathcal {L}}_3\) what it means in \({\mathcal {L}}_2\)) instead of \(\alpha _1\). For instance, if \({\mathcal {L}}_1\) contains ‘cordate’ (meaning creature with a heart), then there is a language which differs from \({\mathcal {L}}_1\) only in containing ‘renate’ (meaning creature with a kidney) instead of ‘cordate’. Finally, we assume a fine-grained notion of linguistic meaning. In particular, we assume that for every predicate \(\alpha _1\), there is a logically equivalent yet non-synonymous predicate \(\alpha _2\). For instance, we take it that the predicates ‘x is wise’ and ‘x is wise and if x is a molehill, then x is a mountain or a molehill’ are logically equivalent, yet not synonymous.

The first two assumptions rely on an abundant conception of languages. They could be weakened by rephrasing the following in terms of which languages could have existed, but we take them to express the plausible default position.Footnote 4 The third assumption is more substantial. We will not try to defend it here, however. If you do not agree, you may conditionalise the rest of the paper on this assumption, and if you prefer restricted versions of our three assumptions, you might then want to check whether you could accept the particular consequences we make use of in the remainder of this subsection.

On a standard notion of logical equivalence, synonymous expressions need not be logically equivalent (for instance, the predicates ‘x is a snake’ and ‘x is a serpent’ are synonymous; yet, since they belong to the non-logical vocabulary, they can receive distinct interpretations in a model). But let us say that two expressions are logically equivalent\(^*\) iff they have synonyms that are logically equivalent. Now, our three assumptions entail that, for every language \({\mathcal {L}}_1\) containing a simple predicate \(\alpha _1\), there is another language \({\mathcal {L}}_2\) that differs from \({\mathcal {L}}_1\) only in containing the simple predicate \(\alpha _2\) instead of \(\alpha _1\), where \(\alpha _1\) and \(\alpha _2\) are logically equivalent\(^*\) yet non-synonymous. Hence, in order to be uniquely characterising, a theory has to provide a semantic characterisation of a given simple predicate that is not eo ipso satisfied by every simple logically equivalent\(^*\) predicate. As can easily be checked, standard mainstream approaches fail in this respect.

First, consider clausal extensional truth theories familiar from work within the Davidsonian tradition, employing axioms such as the following:Footnote 5

$$\begin{aligned} (\mathrm{A}1)\qquad \qquad \forall N \; (T(\ulcorner N \text { ist weise}\urcorner ) \leftrightarrow R(N) \text { is wise}) \end{aligned}$$

Note that (A1) is not only true, but interpretive: the object language expression mentioned is synonymous with the metalanguage expression used. Following Lepore and Ludwig (2007, p. 32) we call a truth theory all of whose semantic axioms are interpretive an interpretive truth theory. It is easy to see that such theories are not uniquely characterising. Any simple expression that is logically equivalent\(^*\) with ‘ist weise’ is true of all and only the wise things; but many of these expressions are not synonymous with ‘ist weise’. Thus, clausal extensional truth theories do not exactly capture meaning, and suffer from the information problem.

As we pointed out above, the information problem has received some discussion with regard to clausal extensional truth theories. However, it extends much further. For instance, it is not restricted to the clausal approach, but carries directly over to an extensional semantic value-based approach as employed in Heim and Kratzer (1998). After all, when it comes to unique characterisation, nothing is gained by moving from (A1) to a lexical entry which assigns the corresponding function from objects to truth values.

Moving from extensional approaches to theories that work with possible worlds or situations significantly increases the information encapsulated in the theory. But whether we do this in the clausal format (by relativising truth, reference, and satisfaction) or in the semantic value-based framework (by adding places to our functions), this move does not yield theories that are uniquely characterising. Given that two simple predicates \(\alpha _1\) and \(\alpha _2\) are logically equivalent\(^*\), they share all properties that can be expressed in the vocabulary of possible-worlds semantics or situation semantics. Finally, augmenting extensional and intensional information with information about non-intended interpretations along model-theoretic lines will not help either.Footnote 6 For, while it is true that, under a standard notion of ‘permitted interpretation’, we can say that there are some interpretations under which \(\alpha _1\) and \(\alpha _2\) come apart extensionally, this does not allow us to conclude that they are non-synonymous; for synonymous expressions routinely come apart in this way. Thus, the upshot so far is the following: mainstream approaches along broadly truth-theoretic lines fail to be uniquely characterising, fail to exactly capture meaning, and suffer from the information problem.

The recent literature contains proposals, arguably located at the edge of truth-theoretic semantics, that go beyond the accounts considered above. Prominent examples include various impossible-worlds semantics, truthmaker semantics, procedural semantics, and semantics in terms of structured propositions.Footnote 7 Some of these are clearly not uniquely characterising, but some of them arguably are (see e.g. the account mentioned in Berto and Jago (2019)). We are sceptical that these latter accounts exactly capture meaning, but we will not enter into this discussion here. Rather, our goal in this paper is to investigate a different route that is more conservative in one way (by sticking to standard truth-theoretic machinery), but more exploratory in another way (by augmenting the language of the theory with a relativised hyperintensional operator expressing an essentialist concept). A comparison of the resulting approach to other non-standard approaches has to be left for later work.

2.3 Overgeneration and granularity

We want to distinguish the information problem from two related problems: overgeneration and granularity. Let us turn to overgeneration first. Any formal semantics aims to derive central theorems that characterise object-language sentences. In extensional theories, these will be the interpretive T-sentences ascribing (extensional) truth conditions, in possible-worlds semantics they ascribe intensional truth conditions, and so on. For simplicity, we stick to extensional theories, but the discussion carries over to other approaches.

Via the interpretive T-sentences, a truth theory relates any object-language sentence S to a metalanguage sentence that could be used to explicitly give S’s meaning. This fact lies at the heart of why truth theories have seemed promising for natural-language semantics. However, once we consider the theory’s full set of theorems, it becomes apparent that the theory by itself does nothing to identify any metalanguage sentence as the one that gives S’s meaning. Call a truth theory which employs classical logic a classical truth theory. Any classical truth theory yields many non-interpretive T-sentences such as

$$\begin{aligned} (1)\qquad \quad&T(\text {`Sokrates ist weise'}) \leftrightarrow (\text {Socrates is wise, and if Socrates is a}\\&\text {mountain, then Socrates is a mountain or a molehill}). \end{aligned}$$

We call this the problem of overgeneration.Footnote 8

Just like the information problem, the problem of overgeneration has been discussed with respect to the Davidsonian programme; but again, it arises for semantic theories in general. The standard response within the Davidsonian programme is to modify the inferential component of a classical truth theory. With respect to first-order languages, it is known that an appropriate modification to the inferential component is available. In such cases, it is possible to give what we will call a canonical truth theory: a truth theory whose deductive apparatus does not enable us to derive any non-interpretive T-sentences such as (1).Footnote 9

Note that solving overgeneration does not by itself solve the information problem. As we argued above, (classical) truth theories are not uniquely characterising. But canonical theories fail to be uniquely characterising for precisely the same reason. A canonical theory and the corresponding classical theory share their axioms, and the set of canonical theorems is a subset of the classical theorems. Thus, everything the canonical theory says, the classical theory says, too. Since what the classical theory says is not sufficient, neither is what the canonical theory says. You cannot increase informativity by saying less.

In a sense, unique characterisation is a matter of granularity: a theory that fails to be uniquely characterising fails to provide sufficiently fine-grained characterisations of object language expressions. The need for a fine-grained semantics becomes especially apparent when the object language contains sensitive constructions, such as operators ascribing propositional attitudes and explanatory vocabulary. However, the information problem should be distinguished from the problem of how to treat such constructions, and it should be acknowledged to go beyond mere matters of granularity/unique characterisation.

That the problem of how to treat sensitive constructions and the information problem are distinct should be evident from the fact that the former arises only for languages which contain such operators, while the latter arises even for fully extensional languages. More importantly, even a maximally fine-grained theory can suffer from the information problem. Consider a theory that assigns full-fledged meanings as semantic values (whatever meanings turn out to be). The theory might do so via axioms such as the following:

$$\begin{aligned}&(\mathrm{A}2)\qquad \llbracket {ist\, weise}\rrbracket =\text { the meaning of `ist weise'.} \end{aligned}$$

Or it might do so via

$$\begin{aligned}&(\mathrm{A}3)\qquad \llbracket {ist\, weise}\rrbracket =\text { Mickey,} \end{aligned}$$

where ‘Mickey’ is a directly referring name for the meaning of ‘ist weise’.Footnote 10 By definition, such a theory will be sufficiently fine-grained and uniquely characterising. But it would not solve the information problem. In particular, knowing such a theory would add next to nothing to an understanding of the object language.Footnote 11

2.4 Responding to the information problem

Before developing our own response to the information problem, we want to address other existing approaches. While remaining within a broadly truth-theoretic framework, there are at least three kinds of responses. The first is to modify the axiomatic base of a truth theory; the second is to modify the inferential component; the third is to move to a meta level, construing meaning theories as theories about truth theories.

In our view, the first kind of response seems the most promising. The information a theory provides is most naturally viewed as determined by its axioms. Hence, if a theory is not informative enough, it seems natural to include more information in its axiomatic base. One way to do this is by strengthening the axioms with some operator. Versions of this approach have been considered by Higginbotham (1992), who proposes the operator it is common knowledge among the speakers of the object language that, and by Wallace (1978), who proposes it is a matter of meaning alone that. Our own proposal falls into this category, and we will briefly indicate why we take Wallace’s closely related proposal to be insufficient.Footnote 12

Let us abbreviate Wallace’s operator as \(\Box _W\). Prefixing \(\Box _W\) to ordinary truth-theoretic axioms, we arrive at:

$$\begin{aligned} \mathrm{(W1)}\qquad&\Box _W \forall N \; (T(\ulcorner N \text { ist weise}\urcorner ) \leftrightarrow R(N) \text { is wise}).\\ \mathrm{(W2)}\qquad&\Box _W R(\text {`Sokrates'})= \text {Socrates}. \end{aligned}$$

Such axioms should enable us to derive strengthened T-sentences:

$$\begin{aligned} (2)\qquad \quad \Box _W (T(\text {`Sokrates ist weise'}) \leftrightarrow \text {Socrates is wise}). \end{aligned}$$

However, as Davies (1981, p. 246) rightly complains, the principles governing \(\Box _W\) ‘are not specified by Wallace’. That specifying such principles may not be straightforward becomes apparent when one notices that \(\Box _W\) cannot be closed under logical consequence. We adapt an example from Lepore and Ludwig (2005, 108f.). Let \(\chi \) be a predicate which means is wise and LT, where LT is some logical truth. Hence the following holds:

$$\begin{aligned} (3)\qquad \quad \Box _W \forall N \; (T(\ulcorner N \text { ist } \chi \urcorner ) \leftrightarrow (R(N) \text { is wise} \wedge LT)). \end{aligned}$$

Assuming closure under logical consequence for \(\Box _W\), (3) entails:

$$\begin{aligned} (4)\quad \qquad \Box _W \forall N \; (T(\ulcorner N \text { ist } \chi \urcorner ) \leftrightarrow R(N) \text { is wise}). \end{aligned}$$

Given that \(\chi \) and ‘ist weise’ are non-synonymous, the truth of (4) illustrates that a theory treating ‘ist weise’ solely by (W1) fails to be uniquely characterising. Lepore and Ludwig seem to assume that \(\Box _W\) has to be closed under logical consequence. More moderately, one could take the above consideration to issue a challenge: Wallace needs to amend his proposal with a plausible specification of the principles governing \(\Box _W\) which ensures both the derivability of intended theorems and unique characterisation.

There are, however, reasons to think that this challenge cannot be met, even if closure under logical consequence is taken off the table. Consider the following schema:

$$\begin{aligned} (5)\qquad \Box _W (\Phi \leftrightarrow ( \Psi \wedge \Omega )) \wedge \Box _W \Omega \; \; \; \; \rightarrow \; \; \; \; \Box _W (\Phi \leftrightarrow \Psi ). \end{aligned}$$

If it is a matter of meaning alone that a condition holds iff a conjunction of two other conditions holds, while one of those two conditions is itself a matter of meaning alone, then we should take it to be a matter of meaning alone that the first condition only depends on the remaining one. But now let \(\Omega \) be any condition that holds as a matter of meaning alone, and let \(\chi \) be a predicate which means is wise and \(\Omega \). Since we have \(\Box _W \Omega \), (3) entails (4) via (5), and hence a failure to be uniquely characterising.

The second response to the information problem takes its cue from the fact that, once overgeneration has been addressed, we can introduce a rule which allows the derivation of explicit meaning ascriptions.Footnote 13 Given that \(({\mathcal {A}}, {\mathcal {R}})\) is a canonical theory comprising a set \({\mathcal {A}}\) of axioms and a set \({\mathcal {R}}\) of inference rules, and given that a T-sentence ‘\(T(S) \leftrightarrow p\)’ is derivable from \({\mathcal {A}}\) via \({\mathcal {R}}\), the corresponding meaning ascription ‘S means that p’ is guaranteed to be true. We could hence introduce a new rule, \(r_m\), which allows the derivation of ‘S means that p’ from the corresponding canonically derived T-sentence. The resulting theory, \(({\mathcal {A}}, {\mathcal {R}}\cup \{r_m \})\), thus produces true meaning ascriptions. However, there is something awkward about this inferential approach. Note that \(r_m\) is not truth preserving. Let \({\mathcal {A}}_*\) be a set of true but non-interpretive axioms. Combining \({\mathcal {A}}_*\) with \({\mathcal {R}}\) results in a theory \(({\mathcal {A}}_*, {\mathcal {R}})\) which yields only true, albeit non-interpretive, T-sentences. Hence, combining \({\mathcal {A}}_*\) with \({\mathcal {R}}\cup \{r_m\}\) allows the derivation of false meaning ascriptions from true premisses. Thus, this approach essentially models the speaker as systematically inferring conclusions from premisses that do not warrant these conclusions. The fact that a given set of truth-theoretic axioms is interpretive cannot be gleaned from these axioms themselves; yet interpretiveness of the axioms is strictly required if the suggested inference is to be legitimate. It seems more natural to think that the lesson of the information problem is that the content of the axioms themselves must be enriched.

Versions of the third, ‘meta-level’ approach have been endorsed by Davidson and more recently by Lepore and Ludwig. We turn to Davidson first. Let ‘p’ be the conjunction of the axioms of some interpretive truth theory for a language \({\mathcal {L}}\). In Reply to Foster, Davidson suggests that what suffices for understanding \({\mathcal {L}}\) is the information that some truth theory (satisfying conditions such as charity) for \({\mathcal {L}}\) states that p. The thought here is that, by explicitly characterising the content of a truth theory as the content of a truth theory, we say more than if we just give the truth theory itself. For instance, when we say about ‘Snow is white’ that it is true iff snow is white, this also holds for ‘Grass is green’ (or any other true sentence). However, given that no truth theory satisfying the pertinent conditions entails that ‘Grass is green’ is true iff snow is white, explicit recourse to truth theories allows us to differentiate between the two. However, it is hard to see how Davidson’s proposal can avoid the overgeneration problem. The ‘p’-slot in ‘some truth theory for \({\mathcal {L}}\) states that p’ has to be filled by the truth-theoretic axioms, and there is no explicit reference to an inferential component. Hence, this approach cannot appeal to canonical proofs. But without addressing overgeneration, there is no hope of solving the information problem.

Partly motivated by considerations along these lines, Lepore and Ludwig have developed a different version of the meta-level approach.Footnote 14 On their proposal, what suffices for understanding is the meta-linguistic information expressed by ‘ “p” is an interpretative truth theory for \({\mathcal {L}}\)’ combined with: (i) a specification of how to derive canonical theorems; (ii) a statement of the fact that canonically derived theorems are interpretive; and (iii) information that, they hold, suffices for understanding the metalanguage in which the truth theory is formulated. In contrast to Davidson’s proposal, which directly targets the content of a truth theory, Lepore and Ludwig’s approach aims at truth theories construed as syntactic objects. While this allows them to make use of canonical proofs, it introduces an additional language (the language of the truth theory), semantic competence with which has to be ensured in order for the meta theory to do its work. We are sceptical that this can be done in a satisfying manner.Footnote 15 Moreover, even if it can be done, the resulting theories seem somewhat awkward, generating understanding of the object language by first generating understanding of another language, then giving a truth theory in this second language, and finally using a third language to talk about this truth theory. One wonders why there shouldn’t be a more straightforward route.

Obviously, both the inferential response and the meta-level response deserve a fuller discussion. But our main focus lies elsewhere. We merely hope that the remarks in this section suffice to motivate a fresh look at a response along the lines of the first approach: strengthening truth-theoretic axioms by embedding them under a new operator.

3 Semantic theories as theories of constitutive semantic requirements

This section presents our positive proposal. Let \(\alpha _1\) and \(\alpha _2\) be two simple predicates, the first meaning is wise, the second is wise and self-identical. Since the wise things are the self-identical wise things, vocabulary such as ‘applies to’ does not by itself enable us to express any semantic difference between \(\alpha _1\) and \(\alpha _2\), and the situation is not improved by adding possible worlds, situations, or models. But there is a crucial semantic difference between the two. Roughly speaking, for \(\alpha _1\), it is primary to apply to all and only the wise things, while applying to all and only the self-identical wise things is secondary—it is merely a consequence of the primary fact. In the case of \(\alpha _2\), it is precisely the other way around: for \(\alpha _2\), it is primary to apply to all and only the self-identical wise things, while applying to all and only the wise things is a secondary consequence. In a nutshell, our proposal is that a semantic theory should explicitly state what the primary semantic facts governing object language expressions are. By doing so, it will be uniquely characterising, and by doing it transparently, it will be exactly capturing and able to ground knowledge of meaning.

Obviously, this talk of primacy is in need of explication. We turn to Kit Fine’s essentialist framework for help. Section 3.1 introduces the framework, in particular the distinction between constitutive and consequential essence, and the notion of a semantic requirement. While we will lean quite heavily on Fine’s work, we will also suggest two novel modifications. Section 3.2 argues that we should acknowledge a notion of a constitutive semantic requirement in addition to Fine’s own consequentialist construal of semantic requirements. Moreover, Sect. 3.3 argues that Fine’s Chaining principle, which allows him to combine semantic requirements, should be rejected for constitutive requirements. Instead, we propose a new principle, Replacement, and illustrate how it can be used to derive constitutive requirements on complex expressions from the constitutive requirements on their parts. Our proposal then is that semantic theories should be construed as theories that explicitly state the constitutive semantic requirements on the object language expressions, and that these requirements can be stated by strengthening interpretive truth-theoretic axioms with an operator that expresses the notion of a constitutive semantic requirement on the expression in question.

3.1 Varieties of essence and semantic requirements

Following Fine (1994), we assume that there is a robust notion of essence, to be thought of in terms of real definitions: to give the essence of an object is to say what the object is, and to say that a certain property is essential to the object is to say that ‘the object must have that property if it is to be the object that it is’ (Fine 1994, p. 4). This notion of essence allows fine-grained distinctions. For instance, while Socrates and singleton Socrates necessarily coexist, the first is involved in the essence of the latter, but not vice versa.

There are two distinctions between different notions of essence that will be central to our proposal: the distinction between constitutive and consequential notions and between immediate and mediate notions. While a constitutive notion covers only what is part of the definition of the object, the corresponding consequential notion in addition covers all the logical consequences of the definition; and an immediate notion is only concerned with what is directly part of the definition of the object, while the corresponding mediate notion additionally takes into account what is part of the definition by virtue of being part of a definition of something that occurs in the original definition. Since both distinctions cut across each other, we end up with four notions: immediate and mediate constitutive essence; immediate and mediate consequential essence.Footnote 16

Let us illustrate these distinctions by way of examples. Suppose that being the successor of 1 is directly part of the definition of the number 2, while being the successor of 0 is directly part of the definition of 1; i.e. the following are immediately constitutively essential to 2 and 1 respectively:

$$\begin{aligned}&(6)\qquad 2 = S(1). \\&(7)\qquad 1 = S(0). \end{aligned}$$

Now consider:

$$\begin{aligned}&(8)\qquad \exists x\; 2 = S(x).\\&(9)\qquad 2 =S(S(0)).\\&(10)\qquad \exists x\; 2 = S(S(x)). \end{aligned}$$

First, since (8) is a logical consequence of (6), (8) is part of 2’s immediate consequential essence. Second, since 2 is defined as the successor of 1, while 1 is defined as the successor of 0, (9) is the result of substituting definiens for definiendum. Hence, (9) is part of 2’s mediate constitutive essence. Finally, since (10) is a logical consequence of (9), it is part of 2’s mediate consequential essence.

Focussing now on semantics, we follow Fine in drawing on a concept of semantic requirement. Fine (2007, p. 43) distinguishes between facts that are ‘semantic as to topic’ and facts that are ‘semantic as to status’. While the first class includes all facts that ‘pertain to the exemplification of semantic properties or relations’, the second class contains only facts that are semantic in a more narrow sense by belonging ‘to the semantics of a given language’.Footnote 17 For instance, the fact that ‘Schnee ist weiß’ is a true sentence of German is merely semantic as to topic, while the fact that the sentence is true in German iff snow is white may be considered to be semantic as to status. Semantic requirements are facts that are semantic as to status; as Fine (2010, p. 66) suggests, they can be thought of as laws governing the relevant language.

Some semantic requirements are requirements simpliciter, while others are relative requirements on certain expressions, holding ‘in virtue of their meaning or semantic features’, concerning them ‘as source rather than simply as subject’, as Fine (2007, p. 123) puts it. An example: if ‘he’ functions as an anaphor for ‘John’, then the requirement for these two expressions to be coreferential has ‘he’ as its source; it is a requirement on the anaphor, not on the antecedent. The relative/simpliciter distinction can thus be understood in analogy to the distinction between essence and metaphysical modality given in Fine (1994): just as it is necessary that \(\{\)Socrates\(\}\) contains Socrates, while this is essential to \(\{\)Socrates\(\}\) without being essential to Socrates, it is a semantic requirement simpliciter that ‘he’ and ‘John’ are coreferential, but only a semantic requirement on the anaphor.

Fine conceives of semantic requirements in a consequentialist spirit, taking them to be closed under a restricted form of logical consequence, what Fine (2007, p. 48) calls manifest consequence. Informally, manifest consequences can be characterised as classical consequences that would be evident to an ideal cogniser.Footnote 18 We use \(\Box \) to express consequentialist semantic requirements simpliciter, so that \(\Box \Phi \) can be read as it is a semantic requirement that \(\Phi \). Correspondingly, we use \(\Box _\alpha \) to express relative consequentialist semantic requirements, so that \(\Box _\alpha \Phi \) can be read as it is a semantic requirement on the expression \(\alpha \) that \(\Phi \). If \(\vdash ^\text {m}\) stands for the relation of manifest consequence, the closure principle on relative semantic requirements can be expressed as follows:

figure a

A main reason for Fine to focus on a consequentialist notion of semantic requirements seems to be a concern about how to derive semantic requirements concerning complex expressions. Fine (2007, p. 47) observes that ‘the compositional character of semantics requires that we should derive semantic facts concerning complex expressions from the semantic facts concerning simpler expressions.’

Fine needs, however, an additional principle to account for compositionality. For example, let us grant that the following two semantic requirements are obtainable in a Finean consequentialist approach:

$$\begin{aligned} (11)\qquad&\Box _{\text {`es regnet nicht'}} \; \big (T(\text {`es regnet nicht'}) \leftrightarrow \lnot \, T(\text {`es regnet'})\big ).\\ (12)\qquad&\Box _{\text {`es regnet'}} \; \big (T(\text {`es regnet'}) \leftrightarrow \text {it is raining}\big ). \end{aligned}$$

That is, ‘es regnet nicht’ is semantically required to be true iff ‘es regnet’ is not true, and ‘es regnet’ is semantically required to be true iff it is raining. Then, ‘es regnet nicht’ should also be semantically required to be true iff it is not raining:

$$\begin{aligned} (13)\qquad \Box _{\text {`es regnet nicht'}} \; \big (T(\text {`es regnet nicht'}) \leftrightarrow \lnot \text { it is raining}\big ) . \end{aligned}$$

The question is how (13) can be obtained from (11) and (12) by means of plausible principles for the notion of a relative semantic requirement.

Note that (13) does not follow from (11) and (12) by means of the closure principle \(\text {C}^\text {m}_{\Box _\alpha }\), since (11) and (12) give semantic requirements for different expressions, ‘es regnet nicht’ and ‘es regnet’, respectively. What is needed is a principle which allows one to infer that something is a semantic requirement on an expression \(\alpha \) from claims (at least some of) which do not concern semantic requirements on \(\alpha \). For this reason, Fine (2007, p. 123) introduces a principle he calls ‘Chaining’:

$$\begin{aligned} \frac{\Box _\alpha \, \Phi (\alpha , \beta ) \quad \Box _\beta \, \Psi (\beta )}{\Box _\alpha \, (\Phi (\alpha , \beta ) \wedge \Psi (\beta ))} \end{aligned}$$

This principle allows one to import the content of one requirement into the content of another. Applying it to (11) and (12) leads to

$$\begin{aligned}&(14)\qquad \quad \Box _{\text {`es regnet nicht'}} \; \big ( \, (T(\text {`es regnet nicht'}) \leftrightarrow \lnot \, T(\text {`es regnet'})) \nonumber \\&\qquad \qquad \qquad \wedge \, (T(\text {`es regnet'}) \leftrightarrow \text {it is raining}) \, \big ). \end{aligned}$$

And from this, (13) follows by \(\text {C}^\text {m}_{\Box _\alpha }\).

3.2 The first idea: constitutive instead of consequential semantic requirements

To obtain a semantics that exactly captures the meanings of the object-language expressions, we suggest that the truth-theoretic axioms should employ an operator expressing the notion of a semantic requirement.Footnote 19 More precisely, if the truth-theoretic axiom mentions an object language expression \(\alpha \), then it should employ an operator expressing the notion of a relative semantic requirement on \(\alpha \). However, although this Finean idea points to a solution of the information problem, it does not quite get us there. The problem is that by employing a consequentialist notion of semantic requirement, one ends up with a theory that is not uniquely characterising (as is witnessed, for example, by two simple predicates corresponding to the manifestly equivalent ‘is wise’ and ‘is wise and self-identical’).

Now, corresponding to the distinction between a consequentialist and a constitutive notion of essence, it seems natural to draw an analogous distinction within the domain of semantics, distinguishing merely consequentialist and truly constitutive requirements. Take Fine’s example of the anaphor. Surely, the difference between the following requirements

$$\begin{aligned}&(15)\qquad \qquad \qquad \Box _{\text {`He'}}\;\hbox {`He' is coreferential with `John'.} \\&(16)\qquad \qquad \qquad \Box _{\text {`He'}}\;\hbox {`He' refers to John.}\\&(17)\qquad \qquad \qquad \Box _{\text {`He'}}\;\hbox {`He' refers to John or molehills are mountains.} \end{aligned}$$

quite neatly mirrors the distinction between immediate constitutive, mediate constitutive, and consequential essence. We suggest to explicitly acknowledge constitutive semantic requirements, and to investigate how they can be put to work. More strongly, we suggest that a truth-theoretic axiom for \(\alpha \) should employ an operator expressing the notion of a relative constitutive semantic requirement on \(\alpha \). Only such a semantically fine-grained notion allows one to formulate an axiom that exactly captures the meaning of \(\alpha \) and therefore provides all the basic semantic information about this expression.

We will use \(\boxdot \) to express constitutive semantic requirements simpliciter, so that \(\boxdot \Phi \) can be read as it is a constitutive semantic requirement that \(\Phi \). Correspondingly, we will use \(\boxdot _\alpha \) to express relative constitutive semantic requirements, so that \(\boxdot _\alpha \Phi \) can be read as it is a constitutive semantic requirement on the expression \(\alpha \) that \(\Phi \). We assume that \(\boxdot _\alpha \Phi \) can only be true if \(\alpha \) is mentioned in \(\Phi \), for which we write \(\alpha \in \Phi \). How are the notion of a (constitutively) essential property and the notion of a (constitutively) semantic requirement on some expression related? Like Fine we think that these notions come down to the same thing: a semantic requirement on some expression is nothing but an essential semantic property of this expression.

Note that this claim presupposes a thick conception of linguistic expressions, according to which their meanings are essential to them (cp. Fine 2007, 135). For example, one would say that it is an essential property of ‘house’ to mean house, rather than dog. On the alternative ‘thin’ conception of linguistic expressions, one would say that a word could have a different meaning—still being the same word.Footnote 20

Many linguistic phenomena can be adequately accounted for on both conceptions, though in slightly different terms. Consider lexical change as an example:

  • (Thin) In Middle High German the word ‘witzig’ meant smart, in Standard German this same word means funny.

  • (Thick) Middle High German had a word spelled ‘witzig’ meaning smart, Standard German has a (different) word spelled ‘witzig’ which means funny.

A thick conception of expressions is compatible with the observation that it is arbitrary, perhaps conventional in the sense of Lewis (1975), which sequences of sounds/marks go with which meanings in some linguistic community. For example, instead of saying that there is a regularity among speakers of Standard German to use a (thinly conceived) expression only if they have certain attitudes towards a possible meaning—e.g. to use ‘Peter ist witzig’ only when they believe that Peter is funny—a Lewisian might alternatively say that there is a regularity among these speakers to use certain sequences of sounds/marks only if they have such attitudes, thereby bringing it about that a certain (thickly conceived) expression is part of the language of their community (where this could, but need not, be understood as involving reference to a thickly conceived language, i.e. one that has its expressions essentially, see note 3). A semantics based on thick expressions thus does not preclude a metasemantics based on thin expressions.

For many purposes, it is thus insignificant which metaphysics of expressions one adopts. Adopting a thick conception of expressions allows us to literally understand relative semantic requirements as essential properties of the relevant expressions. Without defending it any further, we will thus adopt this thick account of linguistic expressions and this essentialist understanding of relative semantic requirements.

Our proposal, then, is the following. A linguistic theory for some object language \({\mathcal {L}}\) should be concerned with the real definitions of the expressions of \({\mathcal {L}}\)—it should state what their constitutive essences are. This will at least involve specifying their syntactic essences and their semantic essences, but here we will focus on their semantic essences.Footnote 21 Adopting a truth-theoretic approach (but see Sect. 4), we suggest that the constitutive semantic essence of a lexical item can be given by an interpretive specification of the truth conditional contribution it makes to sentences.Footnote 22 Hence, we take the axioms of an interpretive truth theory to express the content of the semantic part of a real definition of the object language expression mentioned in the axiom. To strengthen simple truth theories, we propose to attach indexed sentential operators for constitutive essence to truth-theoretic axioms, so that the resulting strengthened axioms explicitly convey their definitional character. Such axioms explicitly state what the constitutive semantic requirements on the lexical items of the object language are. In the following, when we use ‘requirement’, we mean constitutive semantic requirement, unless stated otherwise.

3.3 The second idea: replacement instead of chaining

To illustrate how semantic requirements can be employed in giving a compositional semantics, we now turn to the principles governing this notion. We replace Fine’s Chaining principle, which merely allows the importation of the content of one (consequentialist) requirement into that of another, with a different principle about how (possibly mediate, but constitutive) requirements interact. As a result, Fine’s closure principle, which is not plausible for a constitutive notion, can be dispensed with; compositionally deriving requirements for complex expressions can be done more directly via our new principle.

Recall that requirements can be thought of as (the semantic component of) real definitions of expressions. Central to our proposal is the idea of the expansion of a given definition. Suppose that \(\Phi \) is defined in terms of \(\Psi \), which is itself defined in some way. In such a case, we can expand \(\Phi \)’s definition by replacing \(\Psi \) with its definiens. The result of this expansion will itself have definitional status. Here is a non-linguistic example. If knowledge is defined in terms of truth:

  • x knows p \(\Leftrightarrow _\text {def}\) p is true, and ...,

and if truth is defined as correspondence:

  • p is true \(\Leftrightarrow _\text {def}\) p corresponds to a fact,

then knowledge is also defined in terms of correspondence:

  • x knows p \(\Leftrightarrow _\text {def}\) p corresponds to a fact, and ....

Turning to linguistic cases, suppose a real definition of one expression links it to a second expression. By replacing the definiendum by the definiens of a real definition of the second expression, one will likewise obtain an expanded version of the initial definition. This points the way towards a compositional account of requirements. For a complex expression \(\alpha \), there will be a requirement on \(\alpha \) in terms of \(\alpha \)’s constituents, while these constituents will in turn come with their own requirements. Expanding the initial requirement on \(\alpha \) by filling in the requirements of \(\alpha \)’s parts, we obtain an interpretive requirement on \(\alpha \) which exactly captures \(\alpha \)’s meaning.

For purposes of illustration, we again work with clausal extensional truth theories. Moreover, we restrict ourselves to some core truth-theoretic treatments: sentential operators, predicates and singular terms (in this section), and quantifiers (in the appendix). Even given these restrictions, there is still a wide variety of techniques that have been employed in treating such constructions, and there are different possibilities for how to apply our approach. We consider these differences to be of some semantic importance, but in this paper we focus on the main idea, leaving a discussion of various options of implementation for the future.Footnote 23

The informal idea of expanding definitions leads to different formal implementations, depending on the form of the definition. Suppose an expression \(\beta \) is a constituent of an expression \(\alpha \), and a definition of \(\alpha \), \(\boxdot _\alpha \varphi \), contains the definiendum of a definition of \(\beta \). One can then replace definiendum by definiens to obtain an expanded definition of \(\alpha \), \(\boxdot _\alpha \varphi ^*\). Depending on the form the definition of \(\beta \) takes, there will be different implementations, which we all call ‘versions of Replacement’. As examples, we will treat definitions based on simple biconditionals, identities, and quantified biconditionals. Ultimately, one wants a comprehensive account (cp. Sect. 4), but here we will only present those formal implementations that are relevant to the linguistic constructions at issue.

For our first example, we consider \(\boxdot _\beta \)-biconditionals in which only one side mentions the pertinent expression \(\beta \). For specificity, we focus on cases where the expression is mentioned on the left hand side:

$$\begin{aligned} \boxdot _\beta (\psi \leftrightarrow \chi ) \text {, where } \beta \! \notin \! \chi \end{aligned}$$

We call such claims constitutive equivalences (for \(\beta \)). In such a constitutive equivalence, \(\psi \) expresses a condition on \(\beta \), and \(\chi \) expresses what \(\beta \)’s satisfying this condition consists in. An example is

$$\begin{aligned} \boxdot _\text {`Sokrates ist weise'} (T(\text {`Sokrates ist weise'}) \leftrightarrow \text {Socrates is wise}), \end{aligned}$$

stating that it is a requirement on the sentence ‘Sokrates ist weise’ to be true iff Socrates is wise. We take such constitutive equivalences to be prime examples of real definitions. In line with the idea of definition expansion, we propose the following first version of Replacement:Footnote 24

 

\(\boxdot _\alpha \varphi \)

 
 

\(\boxdot _\beta (\psi \leftrightarrow \chi )\)

\(\alpha \notin \psi , \beta \notin \chi \)

\(\Rightarrow \)

\(\boxdot _\alpha \varphi [\psi / \chi ]\)

 

In the relevant instances of this rule, the first line will state a requirement on \(\alpha \) in which a condition on \(\beta \), namely \(\psi \), occurs, and we have a requirement on \(\beta \) in which it is stated what this condition on \(\beta \) consists in, namely \(\chi \). One can then replace the condition on \(\beta \) in the original requirement by what it consists in.Footnote 25

Let us first turn to negation. The negated sentence ‘Sokrates ist nicht weise’ (\(= \alpha \)) means that Socrates is not wise. Our goal is to illustrate how to derive the relevant interpretive constitutive equivalenceFootnote 26

$$\begin{aligned} (18)\qquad \boxdot _\text {`SNW'} (T(\text {`Sokrates ist nicht weise'}) \leftrightarrow \lnot \text { Socrates is wise}). \end{aligned}$$

from the interpretive constitutive equivalence for the contained sentence ‘Sokrates ist weise’ (\(= \beta \))

$$\begin{aligned} (19)\qquad \boxdot _\text {`Sokrates ist weise'} (T(\text {`Sokrates ist weise'}) \leftrightarrow \text {Socrates is wise}). \end{aligned}$$

and an appropriate axiom for ‘nicht’. We propose the following axiom, stating that every German sentence S is such that the result of applying ‘nicht’ to S is defined to be true iff S is not true:Footnote 27

$$\begin{aligned} (\mathrm{A}4)\qquad \forall S [\text {Sent}(S) \rightarrow \boxdot _{\text {`nicht'}^\curvearrowright S}(T(\text {`nicht'}^\curvearrowright S) \leftrightarrow \lnot T(S))]. \end{aligned}$$

Instantiating for S,Footnote 28 and relying on the syntactic information that ‘Sokrates ist weise’ is a German sentence, Modus ponens yields:

$$\begin{aligned} (20)\qquad \boxdot _\text {`SNW'} (T(\text {`Sokrates ist nicht weise'}) \leftrightarrow \lnot T(\text {`Sokrates ist weise'})). \end{aligned}$$

We can now apply Replacement to (20) and (19) to arrive at (18), our target theorem.Footnote 29 Sentential operators of arity higher than 1 can be dealt with analogously.

Next, let us turn to predication. There are various options for a truth-theoretic treatment of predication. For purposes of illustration, we consider employing a term-forming operator ‘R’ (for ‘the referent of’). This allows us to illustrate that the informal idea of expanding definitions leads to a second version of Replacement if the relevant definitions are identities rather than biconditionals:

$$\begin{aligned} \boxdot _\beta (s = t) \text {, where } \beta \! \notin \! t \end{aligned}$$

We also call these constitutive equivalences. Here, t discloses the identity of the object which s, a term mentioning \(\beta \), denotes. For example:

$$\begin{aligned} (\mathrm{A}5)\qquad \boxdot _\text {`Sokrates'}(R(\text {`Sokrates'}) = \text {Socrates}). \end{aligned}$$

that is, it is a requirement on the name ‘Sokrates’ to have Socrates as its referent. For the predicate, we here use the following axiom, analogous to the one for the sentential operators:

$$\begin{aligned} (\mathrm{A}6)\qquad \forall N [\text {Name}(N) \rightarrow \boxdot _{\text {`ist weise'}^\curvearrowright N}(T(\text {`ist weise'}^\curvearrowright N) \leftrightarrow R(N) \text { is wise}) ] . \end{aligned}$$

That is, every German name N is such that the application of the predicate ‘ist weise’ to N is defined to be true iff the referent of N is wise. Instantiating for N, relying on the syntactic information that ‘Sokrates’ is a German name, Modus ponens yields:

$$\begin{aligned} (21)\qquad \boxdot _\text {`Sokrates ist weise'} (T(\text {`Sokrates ist weise'}) \leftrightarrow R(\text {`Sokrates'}) \text { is wise}). \end{aligned}$$

Now, we have to combine this with the constitutive equivalence (A5). To this end, we use a suitable version of Replacement:

 

            \(\boxdot _\alpha \varphi \)

 
 

            \(\boxdot _\beta (s = t)\)

            \(\alpha \notin s, \beta \notin t\)

 

\(\Rightarrow \)       \(\boxdot _\alpha \varphi [s / t]\)

 

Applying this rule to (A5) and (21) yields the desired interpretive theorem. Predicates of arity higher than 1 can be dealt with analogously.

Note that the axioms we have considered for sentential operators as well as for predicates do not simply specify a requirement on the expressions in question. Rather, they state a requirement on the result of combining that expression with some input (e.g. a sentence or a name). This is witnessed by the fact that the \(\boxdot \) in these axioms has a complex index (for instance \(\boxdot _{\text {`nicht'}^\curvearrowright S}\) rather than simply \(\boxdot _{\text {`nicht'}}\)). However, one might prefer that a lexical axiom for some expression \(\alpha \) should state a requirement solely on \(\alpha \), and hence be governed by \(\boxdot _\alpha \). We briefly indicate how this can be achieved for a sentential operator such as negation (the case of predication is analogous). Again, our focus here is on the main idea rather than on the details of different ways of implementation.

A simple index axiom for ‘nicht’ would state that it is a requirement on ‘nicht’ that the result of applying ‘nicht’ to a German sentence is true iff that sentence is not true:

$$\begin{aligned} \boxdot _{\text {`nicht'}} \forall S [\text {Sent}(S) \rightarrow (T(\text {`nicht'}^\curvearrowright S) \leftrightarrow \lnot T(S))]. \end{aligned}$$

Note that this would be a different kind of real definition, requiring a rule that is not simply a definiens-for-definiendum version of Replacement. Restricting attention here to the case of a unary operator/predicate \(\alpha \), building a complex expression \(\alpha ^\curvearrowright \beta \) if applied to a suitable expression \(\beta \), we propose a rule which combines restricted universal quantifier elimination with the move from a simple to a complex index:Footnote 30

  

\(\boxdot _\alpha \forall x \, (F(x) \rightarrow \psi )\)

  

\(\boxdot _\beta F(\beta )\)

 

\(\Rightarrow \)

\(\boxdot _{\alpha ^\curvearrowright \beta } \psi [x/\beta ]\)

Take an example. Since it is a requirement on ‘nicht’ that the result of applying ‘nicht’ to a sentence is true iff that sentence is not true, and since syntax tells us that it is a requirement on ‘Sokrates ist weise’ to be a sentence, we can apply this rule to derive that it is a requirement on the result of combining ‘nicht’ and ‘Sokrates ist weise’ to be true iff ‘Sokrates ist weise’ is not true. Given the appropriate theorem for ‘Sokrates ist weise’, the first version of Replacement now yields the desired interpretive theorem for the negated sentence ‘Sokrates ist nicht weise’. We take this extension of the aforementioned approach, which was based exclusively on definiens-for-definiendum versions of Replacement, to be especially interesting, because it observes an interplay of semantic and syntactic requirements—a topic we will not go into here.

Let us take stock. Our proposal is to employ the notion of relative constitutive semantic requirements to strengthen existing semantic approaches. In order to derive requirements for complex expressions on the basis of the requirements of their constituents, we have suggested drawing on the idea of definition expansion. Since there typically are various proposals for how a given object language construction should be treated in a broadly truth-theoretic framework, there is a corresponding range of possible strengthened theories. Moreover, even starting from a single approach for a certain construction, there will often be various options of how to implement our proposal (corresponding, for instance, to various possible placements of the \(\boxdot _\alpha \)-operator). The goal of this section was to illustrate the feasibility of the main idea rather than to settle the details.Footnote 31

3.4 Constitutive requirements, unique characterisation, and the information problem

This section puts forward three claims of increasing strength: (i) that our approach does not fall victim to the examples we used to show that other approaches fail to be uniquely characterising; (ii) that our approach can in fact yield theories that are uniquely characterising; (iii) that it can yield theories that exactly capture meaning and avoid the information problem.

We cannot give a rigorous proof of these claims, as this would require a substantive account of the notion of exactly capturing and a theory of synonymy; tasks that are beyond the scope of this paper. However, we do suggest that there are strong reasons for believing these claims. To see this, it is useful to make one further wrinkle explicit that we have so far left tacit: the distinction between three levels of constitutive semantic requirements. The first two, immediate and mediate, are familiar from Fine’s work. We add a third level of semantically fully expanded constitutive semantic requirements. Roughly speaking, the three levels correspond to three stages in a derivation of truth conditions: we start from immediate requirements, move through mediate ones, and the derivation terminates once every semantic definiendum has been fully expanded.

Consider a conjunction ‘\(S_1\) und \(S_2\)’. For simplicity, assume that the constituent sentences \(S_1\) and \(S_2\) are treated as simple, receiving axioms stating that \(S_1\) is required to be true iff p and that \(S_2\) is required to be true iff q. A derivation starts with the axiom for ‘und’, expressing the immediate requirement that ‘und’ forms true sentences iff combined with two true sentences. By instantiating for \(S_1\) and \(S_2\), we derive an immediate requirement on the complex expression ‘\(S_1\) und \(S_2\)’, stating that it is required to be true iff both \(S_1\) and \(S_2\) are true. We now appeal to the requirement on \(S_1\) to be true iff p and derive the mediate requirement on ‘\(S_1\) und \(S_2\)’ to be true iff p and \(T(S_2)\). Finally, appealing to the requirement on \(S_2\) to be true iff q, we derive the semantically fully expanded requirement on ‘\(S_1\) und \(S_2\)’ to be true iff p and q. This last requirement no longer defines the truth of ‘\(S_1\) und \(S_2\)’ in semantic terms, and hence our derivation terminates. We use to express that \(\Phi \) is an immediate requirement on \(\alpha \). We continue to use \(\boxdot _\alpha \Phi \) to express that \(\Phi \) is a requirement on \(\alpha \), where this is neutral between the levels of requirements.

Ad (i). Our proposal does not fall victim to logically equivalent\(^*\) but non-synonymous expressions, since it is not based on a consequentialist notion. Roughly speaking, while a consequentialist notion is closed under logical (or perhaps manifest) consequence, our constitutive notion is merely closed under definition expansion that results from replacing definienda by definentia. Consider, for example, a simple expression \(\alpha _1\) meaning is wise and a simple expression \(\alpha _2\) meaning is wise and self-identical. Now, since \(\boxdot _\alpha \) expresses a constitutive (and not just a consequential) semantic requirement on \(\alpha \), it is correct to say

$$\begin{aligned} \boxdot _{\alpha _1} \forall N [\text {Name}(N) \rightarrow (T(\alpha _1^\curvearrowright N) \leftrightarrow R(N) \text { is wise})], \end{aligned}$$

but incorrect to say

$$\begin{aligned} \boxdot _{\alpha _2} \forall N [\text {Name}(N) \rightarrow (T(\alpha _2^\curvearrowright N) \leftrightarrow R(N) \text { is wise})]. \end{aligned}$$

It is not a constitutive semantic requirement on a predicate meaning is wise and self-identical that it forms true sentences iff combined with a name of a wise thing; you cannot arrive at such a characterisation of \(\alpha _2\) by merely expanding definienda. Thus, the shift to a constitutive characterisation excludes formerly problematic cases.

Ad (ii). We suggest that our proposal provides the tools to formulate uniquely characterising theories. In a sharpening of the broad approach set out in the previous sections, we add that the lexical axioms should state immediate requirements, and hence be formulated by employing . We claim that, in the case of a simple expression \(\alpha \), the -axiom by itself is already uniquely characterising. Recall that a requirement on \(\alpha \) is immediate if it does not hold in virtue of other requirements. In our view, the only way to state immediate requirements for lexical items in broadly truth-theoretic terms is by being interpretive—that is, by stating the truth conditional contribution using a metalanguage expression synonymous with the pertinent lexical item. Assume, for simplicity, that we treat monadic predicates by axioms of the following form:

$$\begin{aligned} \ldots \leftrightarrow R(N) \text { is } \phi . \end{aligned}$$

Now let \(\alpha \) be a predicate meaning is wise. If there are any constitutive requirements of the above kind on \(\alpha \), then surely one of them must be the one obtained by replacing \(\phi \) with ‘wise’:

But it also seems clear that this is the most basic of all the constitutive requirements on \(\alpha \)—any other constitutive requirement of this form will only hold in virtue of this one. Hence, true -axioms must be interpretive.

Turning to complex expressions, note that -theories will, by virtue of their syntactic component, provide information about the syntactic structure of a given complex expression, and, by virtue of their semantic component, information about the immediate requirements governing the simple parts of the complex expression. We claim that, taken together, this provides uniquely characterising information about complex expressions. Take a simple predication, ‘Sokrates ist weise’. Assume that syntax tells us (very roughly) that ‘Sokrates ist weise’ \(=\) ‘ist weise’\(^\curvearrowright \)‘Sokrates’, while semantics tells us what the immediate requirements on ‘ist weise’ and ‘Sokrates’ are. Now, if \(\beta _1\) is synonymous with ‘Sokrates’ while \(\beta _2\) is synonymous with ‘ist weise’, then combining \(\beta _1\) and \(\beta _2\) in the way in which ‘ist weise’ and ‘Sokrates’ are combined in the relevant sentence will result in a sentence synonymous with ‘Sokrates ist weise’. Hence, building on the unique characterisation for simple expressions, the theory will also be uniquely characterising for complex ones.

In this paper, we have left the notion of meaning mostly intuitive. There are arguably various possible sharpenings of ‘(linguistic) meaning’, and the conditions of adequacy on semantic theories will depend on which sharpening the theory aims at. We think there is one natural and relevant notion of linguistic meaning that lies at the intersection of truth conditional contribution and what is definitional. Relying on such a notion, it is plausible to assume that an expression \(\alpha \) is synonymous with ‘und’, for example, if there is an immediate constitutive requirement on \(\alpha \) to form a true sentence iff it is applied to two true sentences. Arguably, this would yield a notion of synonymy according to which ‘und’ and ‘aber’ (like ‘and’ and ‘but’) are synonymous. To be sure, there are also more fine-grained notions of linguistic meaning. We think that our approach could be fleshed out in a way that allows it to discriminate between ‘und’ and ‘aber’ by explicitly incorporating information about the completeness of the requirements provided by the theory. Suppose that a theory not only provides the immediate requirement on ‘und’, but also states that this is the only requirement governing ‘und’. Since it seems plausible that ‘aber’ is governed by additional requirements regarding, for instance, felicity, such a theory could be taken to be uniquely characterising even under the assumption that ‘und’ and ‘aber’ diverge in meaning.

Ad (iii). Do theories employing -axioms exactly capture meaning? Would knowing such a theory suffice for knowing the object language? We have argued that true -axiom are guaranteed to be interpretive. Moreover, recall that the rules governing constitutive requirements only allow for definitional expansion. A fully expanded constitutive equivalence derivable from interpretive -axioms is itself guaranteed to be interpretive, where a constitutive equivalence is fully expanded if we cannot replace any more definienda by definentia via applications of Replacement. Thus, by employing an operator in the language of the theory which, when applied to lexical axioms, forces interpretiveness, and which is governed by rules that ensure an interpretive end result, we not only obtain a theory that is uniquely characterising. Rather, since these facts transparently flow from the notion expressed by the operator, the theory will be exactly capturing, and knowledge of the theory will suffice for knowledge of meaning.

4 Conclusion

What does it take to know the meaning of ‘Sokrates ist weise’? What does it take to exactly capture its meaning? It has been the guiding thought of the mainstream approach to semantics that knowledge of meaning involves knowledge of truth conditions. In its simplest form, the claim would be that knowing what ‘Sokrates ist weise’ means comes down to knowing that this sentence is true iff Socrates is wise; more sophisticated truth-theoretic machinery allows for more sophisticated versions of the claim. As the information problem illustrates, this will not do. One goal of our paper was to emphasise that the information problem is central, and remains unsolved. The second goal of this paper was to illustrate how the information problem can be overcome. To bring out what is correct about the motivating thought, we need to make explicit the definitional connection between truth and truth condition. We have proposed to do this in terms of relative constitutive requirements, and we have argued that there is a central notion of linguistic meaning which is fruitfully explicated in these terms.

Depending on which aspect of our proposal one focusses on, it can be viewed as a conservative continuation of the truth-theoretic approach, or as a radical departure from the commonly accepted path. It is conservative in the sense that the core machinery of the semantic theory is left untouched (this might be a slight simplification; see, for instance, the appendix). Given a truth theory of your choice, our proposal can more or less piggyback on this theory to produce a strengthened theory explicitly stating what the constitutive semantic requirements governing the object language are. The more radical aspect of the proposal lies in the employment of a relativised hyperintensional operator in the metalanguage. This move sets our proposal apart from other non-standard approaches such as truthmaker semantics or impossible-worlds semantics, which stick with an extensional metalanguage and instead inflate the ontology of the theory.

In this paper, we have introduced the main idea of semantic theories based on the notion of constitutive requirements, and we have begun to illustrate how such theories can be formulated. However, we have merely touched upon a number of issues that deserve a fuller investigation. In closing, we want to indicate five related areas in which further work would seem to be promising and/or necessary.

(i) In this paper, we confined our approach to fairly simple constructions (basic predication, sentential operators, first-order quantification). While we are optimistic that our approach can be combined with truth-theoretic treatments of more complicated constructions (e.g. modal operators, adverbial modification, gradable adjectives, and so on), this optimism should be substantiated by providing detailed accounts of the relevant constructions in terms of constitutive requirements.

(ii) For purposes of illustration, we have assumed a clausal format for truth theories and illustrated how, based on the respective axioms of a clausal truth theory, we can bring in the notion of constitutive requirement to produce a strengthened theory. However, as this exercise has already illustrated, we will sometimes have various options of how exactly to proceed, e.g. by choosing different possible placements for the semantic requirement operator. These various options deserve a fuller investigation, as do the options that arise when applying our approach to semantic value-based truth theories.

(iii) Our main aim in this paper was to argue that ordinary semantic theories can be strengthened with \(\boxdot _\alpha \) in order to solve the information problem. On the resulting picture, \(\boxdot _\alpha \) could be perceived as a mere add-on; the first-order semantic work of coming up with treatments for natural language constructions proceeds just as before, the results can then be strengthened with \(\boxdot _\alpha \). But once we have admitted \(\boxdot _\alpha \) into our tool kit, it is natural to ask whether it could contribute to the first-order work as well. For instance, it might be worth considering whether we can construe propositional attitude operators or other non-extensional constructions as sensitive to the constitutive requirements governing their inputs, rather than merely in terms of the truth conditional properties of their inputs. We suggest this as another potentially fruitful area for further research.

(iv) As we have briefly indicated with respect to the contrast between ‘und’ and ‘aber’, there may be reason to acknowledge additional kinds of constitutive requirements besides those concerning truth conditional contribution. Perhaps there is work to be done for a notion of constitutive requirements concerning what a standard use of a given expression conveys over and above its truth conditional contribution. Giving a linguistic theory of a particular language could then be construed as the investigation into the essences of the expressions of that language, where this involves specifying syntactic, semantic/truth conditional, and semantic/conveying requirements governing these expressions, and the way these requirements interact.

(v) Finally, while we have focussed on truth-theoretic accounts of meaning, the underlying idea might be applied to other frameworks aswell. Broadly speaking, we suggest a semantics in terms of constitutive semantic requirements on the object language expressions: immediate such requirements on the simple expressions, mediate ones on the complex expressions. It calls for further work to develop a general account of mediate constitutive requirements on expressions and, ultimately, mediate constitutive essence in general, which can be applied to alternative semantic frameworks. In particular, we assume that the proposed variants of Replacement have to be considerably supplemented if one aims at a general account of the informal idea of expanding (real) definitions.