The previous sections provide empirical evidence for the judge-index correlation. In this section, which comprises the theoretical heart of the paper, we use this correlation to adjudicate between existing theories of SP exceptionalism that postulate judges in the semantics of SPs. As we already discussed in Sect. 2.3, we categorize the existing theories into the following three classes depending on how judges and worlds are connected: (i) must associate theories, which obligatorily yoke judges and worlds together, (ii) can dissociate theories, which allow for a certain flexibility in how judge-world combinations are chosen, and (iii) must dissociate theories, which obligatorily disjoin judges and worlds.
We will argue that any theory in the first class will straightforwardly account for the data presented in the paper. The argument here is thus parallel to the one offered by Keshet (2008) for why worlds and times of evaluation need to be merged: they pattern together. We will show that while accounting for our data in theories from other classes is not impossible, it would require building in constraints that undermine the initial goal of unbundling the judge and the evaluation index. We want to emphasize again the following point. If there were no judges at all, SPs would be predicted to trivially follow Keshet’s constraints along with other ordinary predicates. However, as we discussed extensively in Sect. 2, most theories that deal with SPs postulate SP exceptionalism and among theories of SP exceptionalism most postulate judges in the semantics (with the exception of Anand 2009; Coppock 2018). Therefore, any theory with judges has to make sure that judges and worlds pattern together.
We start by presenting a basic extensional framework in Sect. 4.1, which we will further connect with an extensional theory of de re. Section 4.2 talks about must associate theories exemplified by an extensionalized version of Lasersohn (2005).Footnote 24 Section 4.3 talks about the overgeneration problem of can dissociate theories (Stephenson 2007a, b; Stojanovic 2007; Sæbø 2009). Section 4.4 discusses two must dissociate proposals, indexical contextualism along the lines of McCready (2007); Bylinina et al. (2014) (Sect. 4.4.2) and the logophoric binding approach in Pearson (2013a) (Sect. 4.4.3). We show that such theories crucially depend on a scopal view of de re—which in and of itself is problematic (Sect. 4.4.1)—to derive our cases as is, or require significant elaboration of the technology in question. Section 4.5 goes over a possible alternative implementation of the logophoric binding approach and shows that it does not solve the problem either.
Semantic assumptions
In Sect. 2.3, we introduced the basics of the extensional framework we are using in the paper. While an intensional systems treats indices as elements of evaluation sequence, we treat indices as present in the logical form and assume that all non-logical predicates require an argument of type I. Lexical entries for dog and brown in (60) illustrate.
The final I-type arguments of non-logical predicates will be filled by silent I-type bound variable pronominals present in logical form (indicated as \(i_n\) variables), and the the binders over these index variables are introduced by intensional operators such as attitude predicates as well as at the root node (Cresswell 1990; Percus 2000 and later work). The logical forms for the sentences Montmorency is brown and Mary thinks Montmorency is yellow are given below:
Expressions are evaluated relative to an assignment and a context, which we will assume for now is an element of the Cartesian type \(D_\kappa = D_e \times D_s\), corresponding to the author and world of the context. We assume the rules of Function Application, Predicate Modification, and Abstraction (Heim and Kratzer 1998):
Footnote 25
With this set of assumptions, the semantics for (61a) is as follows:
Thus, the interpretation of a declarative sentence relative to a context and assignment is a proposition, and we define truth at a context (and assignment) as the evaluation of that proposition relative to the world of the context (64).
While we assume that the index argument of a non-logical predicate in main predicate position is supplied by a bound variable high in the clause, a nominal or adjectival predicate inside a DP requires some additional assumptions to fill its index argument. We will assume for explicitness here that determiners themselves compose with index variables, which they pass on to their complements (though nothing hinges on this implementational detail). (65) is the form for the indefinite determiner a:
We now consider the interpretation of the sentence A brown dog is a yellow fox, (66a). We assume the logical form in (66b) which involves movement of the subject DP a brown dog. (67) sketches the interpretative process, where \(g' = g[0/x]\).
The resulting truth conditions of the sentence in (66a) are, as desired, contradictory, a result of the fact that dog and fox are contraries (as, presumably, are brown and yellow). Importantly, when (66a) is embedded, as in (68), the sense of a contradiction vanishes:
Because think is an intensional operator, it introduces another intensional binder, per (69).
While this yields four potential logical forms combinatorically, we assume that the two expressed in (70a) and (70b) are ruled out by empirically motivated constraints that require clausal index variables to be bound by the closest binder (Farkas 1997; Percus 2000). Note that based on the rule for intersective modification applied in (67a), even in intensional environments we don’t have mixed interpretations such that brown and dog inside the DP a brown dog are interpreted with respect to different worlds.
This leaves only the last two forms, 70c and (70d). Assuming the interpretation of think is as in (69), the logical form in (70d) attributes to Mary a belief with the contradictory truth conditions in (67d).
In contrast, the form in (70c) yields a sensible interpretation, crucially because the witness for the existential is a brown dog not in \(i'\) but in the (bound) matrix world—that is, because it is interpreted de re:
must associate theories
The framework we introduced in the previous section sets the stage for a detailed discussion of SP exceptionalism in its various versions. As we have discussed in Sect. 2, the literature treats SPs as being semantically relativized to a perspective, or a judge. There are different ways to fix judges compositionally, and in this section, we take a closer look at must associate theories.
The basic tenet of such theories, first proposed in Lasersohn (2005) is the addition of a special judge coordinate, responsible for storing the evaluative perspective, to the index of evaluation (which we previously assumed consisted simply of a world of evaluation, hence of type \(D_s\)). Thus, indices are of Cartesian type \(D_I = D_e \times D_s\) where the entity coordinate of an index \(i=\langle j,w\rangle \) encodes \(\textsc {judge}(i)\), the judge of the index. (72) illustrates the semantic difference between ordinary vs. subjective predicates.
In this way, the truth conditions of some predicates are determined “relative to” an additional component of evaluation beyond worlds, and systems that involve something like a judge coordinate of the index are now known as relativist (see MacFarlane 2009, 2014 for discussion).
At the root level, the judge coordinate is conventionally set to a value supplied by the context, in parallel to the convention for worlds, as in (64). Thus, in this system contexts are of type \(D_\kappa = D_e \times D_s \times D_e\), corresponding to the author, world, and judge of the context. Lasersohn argues that in default cases SPs in a root declarative clause have what he calls an autocentric perspective: judge(c) is set to author(c), the speaker. As we already discussed in Sects. 3.3 and 3.4, in certain circumstances a third-party or a generic perspective is also possible. For now, we confine ourselves to autocentric and reported autocentric cases.
Following Stephenson (2007a), we will assume for now that attitudes quantify over indices such that the judge argument is co-referent with the attitude holder.Footnote 26 This means that what we observed for worlds of evaluation holds for judges as well. SPs that are interpreted de dicto in the scope of an attitude predicate are evaluated with respect to the attitude holder of that predicate, just as de dicto predicates in the scope of intensional operators are evaluated with respect to the worlds introduced by those operators (73).
In contrast, if the index argument for an SP is interpreted de re (i.e., bound by a non-local binder), then the world of evaluation will switch. Correspondingly, in multi-judge sentences like (74), this account predicts patterns of felicity to align with whether the nominal containing the attributive adjective is read de re. (74) illustrates. (While in our scenario there is a specific dog, note that de re indefinites don’t have to be specific, see Fn. 19 and Sect. 4.4.1. We assume that the difference is not a matter of scopal distinctions.)
In (74), an adorable dog is read de re and evaluated with respect to \(i_0\), therefore, adorable also is evaluated with respect to \(i_0\) and the matrix judge (by assumption, the speaker). If an adorable dog were read de dicto, the SPs adorable and ugly would be evaluated at the same index, the embedded \(i_1\). This would mean that they would be evaluated with respect to the same world and the same judge (by assumption, Mary), yielding a contradiction.
In order for (74) to be non-contradictory, adorable and ugly have to be evaluated with respect to either (i) different worlds: the same object can be adorable for the speaker in the actual world and ugly for the speaker in someone’s doxastic alternatives, or (ii) different judges: the same object can be adorable for the speaker and ugly for Mary in the same world. An important empirical prediction of Lasersohn’s theory is that judges and worlds pattern together, since they are both part of the index. One cannot evaluate adorable with the speaker as the judge in (74) without evaluating it in the matrix world, which prevents mixed non-attested readings which we have seen in (40) and to which we come back in the next section. This is the judge-index correlation, and it follows naturally from the architecture of Lasersohn’s system.
Note that this type of framework also easily handles exocentric uses discussed in Sect. 3.4. For example, in line with Lasersohn (2005), we may model this by adding an argument to the denotation of attitude predicates for the embedded judge (75).
For the lexical entry in (75) we assume the judge element is a free variable implicit argument of the attitude, and hence may (a) be bound by the subject, to derive a report of an autocentric judgment, or (b) be free, to produce exocentric readings without requiring a non-local world of evaluation. Crucially, each index introduces only one judge, and all SPs relative to an index will be relativized to the judge of that index. This ensures that even exocentric readings obey the judge-index correlation.
The above logic makes use of the mechanisms of de re interpretation, and hence could be dependent on the details of a particular account. Within the extensional theory we have been developing, de re interpretation is done by non-local binding of an index variable. Another (more traditional) approach pursued within intensional approaches is the Scope Theory (Russell 1905; Montague 1973; Cresswell and von Stechow 1982). In such an approach, de re interpretation involves a logical form in which the de re nominal has moved from its base position to one above the attitude predicate, as in (76) below:
As the nominal is outside the scope of the intensional operator, its index variable cannot be bound by the intensional operator, and hence must be bound by the matrix binder. The Scope Theory thus allows us to derive the non-local interpretive effects of de re without requiring non-local binding (it is for this reason that the Scope Theory is used by many intensional systems, since many cannot capture something like non-local binding otherwise). As we shall observe, some theories of judge-dependence disallow non-local judges (particularly, the approaches in Stojanovic 2007 and Pearson 2013a), and so such theories will require some version of the Scope Theory if non-local judges are available.
Relativist frameworks in Egan (2010), MacFarlane (2014) and Bylinina (2017) work very similarly to Lasersohn (2005) with respect to our data. In these approaches, evaluation indices reserve a coordinate for the judge, which various perspectival expressions are sensitive to. Because of this bundling, such theories will straightforwardly derive the correlation we have observed, parallel to how they would derive Keshet’s (2008) original time-world correlation.Footnote 27 In the next sections, we will argue that all extant contextualist approaches fail to do the same. Our data thus provide a novel argument for a relativist semantics for subjective meaning.
can dissociate theories
A spate of more recent theories have reacted to the original relativist position by dissociating judges and evaluation indices. Here, we will consider theories that fall under the umbrella of what we will call implicit variable approaches: the mixed contextualist/relativist framework of Stephenson (2007a) and the mixed contextualist/variable-free framework of Sæbø (2009), which builds off the contextualist theory of Stojanovic (2007).
Despite different conceptual underpinnings, those accounts share the following formal property. They provide two routes for the specification of an SP judge, treated as a pronominal element with a bound interpretation and or a free interpretation. In case of the former, the judge is linked to the evaluation index, much like in Lasersohn (2005). However, in the latter, “free”, case, the bare use judge can pick up any referent. We will show momentarily that the “free” route predicts the existence of de dicto non-local judge interpretations that are systematically absent for multi-judge cases. We will thus claim that the implicit variable theories of SPs are too weak in their current form and predict unattested readings. We start by discussing the account in Stephenson (2007a) and then show that the proposal in Sæbø (2009) is the same as far as our data are concerned.
Stephenson (2007a) assumes alongside Lasersohn (2005) that indices contain a judge coordinate, which attitude verbs shift as in (69) in Sect. 4.1. In addition to that, all SPs are treated as dyadic predicates:
Two kinds of implicit variables can fill the z judge role.Footnote 28 In autocentric cases, the judge argument is filled by the distinguished pronominal PRO\(_J\) that picks out the judge coordinate of the index of evaluation. The result in (78c) is identical to Lasersohn’s (2005), given in (63).Footnote 29
However, in addition to autocentric uses, SPs also allow non-autocentric readings, as we discussed in Sect. 3.4. How such cases are handled constitutes one of the crucial differences between the approach in Lasersohn (2005) and the one in Stephenson (2007a). For Lasersohn, non-autocentricity arises purely due to pragmatics. Because SPs are semantically judge-dependent in the same way across the board, the judge, autocentric or not, is always linked to the index of evaluation, which is all that matters for our cases. For Stephenson, however, there is a semantic difference between autocentric and non-autocentric uses. In this system, non-autocentric readings arise when the z role is filled by a free variable, \(pro_i\):
Importantly for our purposes, the account in (79) overgenerates in attitude reports.Footnote 30 Due to the lack of restrictions on the distribution of \(pro_i\), nothing prevents a speaker-oriented SP from combining with a de dicto nominal in the subject position in the complement clause. However, as shown in (80) (=39), such reading is not attested (cf. also 33 in Sect. 2.3). A derivation for the non-attested mixed reading (80b), which is not ruled out by Stephenson (2007a), is given in (81):
A similar problem, albeit for a different formal reason, arises in the account originally advocated in Stojanovic (2007) and argued for in Sæbø (2009). In this account, predicative and attributive uses differ in their semantic composition in attitudinal environments, which leads to overgeneration, as we will show below.
SPs are treated as dyadic and there is a distinguished variable of the assignment reserved for the judge, e.g., \(x_{100}\). In Sæbø’s (2009) version of the proposal, the judge argument of the SP is not the first argument. Here, for compositional simplicity, we will assume it is the last, after the index argument:
This induces an interesting compositional difference for SPs depending on whether they are in mainPred position or in attributive position. In mainPred position, composition with the first argument of the SP will yield a property of SP judges, as below:Footnote 31
Attitude verbs in this framework take such a property argument, feeding in both judge and index of evaluation in succession (as we shall see, this is similar to the compositional process for Pearson 2013a):
While (84b) can be treated as a notational variant of Lasersohn’s (2005) approach, things are different for attributive SPs. In such cases, there is an obvious type clash between a dyadic SP like adorable and the monadic noun dog. To solve this clash, attributive SPs directly reference the distinguished variable.
However, the distinguished variable is not mandatorily updated in the scope of intensional operators, which creates an opportunity for two different judges to emerge in intensional environments, precisely in configurations like (80). Specifically, mainPred SPs will leave an unsaturated judge argument for the intensional operator to fill in and attributive ones will use \(x_{100}\). It is thus possible for g(100) to be autocentrically linked to the speaker while a mainPred SP is compositionally linked to the attitude holder. This, in turn, allows an unattested interpretation, such as the one in (80b) above. Like Stephenson’s, Sæbø’s system predicts that the end result of (81) is a valid interpretation of the sentence in question. We should note that Sæbø (2009: 337–338) in fact discusses such cases explicitly but does not exclude the unattested interpretation.Footnote 32
To summarize, the accounts of Stephenson (2007a), Stojanovic (2007) and Sæbø (2009), despite their differences, share a flexibility with respect to the relation between judges and intensional operators. This flexibility allows mainPred and subject SP judges to differ in attitudinal environments irrespective of whether the subject DP is read de re. By assimilating SP judges to ordinary pronouns, these accounts essentially predict that SPs with bare use judges, at least in some cases, should behave like SPs with overt judge PPs. Because such SPs are no longer judge-dependent, they should be insulated from any effects of intensional quantification. However, as we have shown in Sect. 3.3, bare use judges and overt judges do behave differently, which is a problem for implicit variable approaches.
It is worth asking what could be done to make these accounts work. As far as we can tell, the only way to do so would be to more closely link the implicit variables (\(x_{100}\) and \(pro_n\)) to the index of evaluation. One possible way to do this would be to invoke the so-called Acquaintance Inference, or the AI. Several researchers have discussed the fact that assertions of some autocentric SPs come with an evidential requirement for the judge to have a certain type of perceptual experience of the object, such as tasting a cake for asserting that it’s delicious, or viewing (some part of) a movie for asserting that it’s boring (Stephenson 2007b; Pearson 2013a; Ninan 2014; Bylinina 2017; Anand and Korotkova 2018; Muñoz 2019).Footnote 33 For Pearson (2013a), this restriction is encoded as a presupposition, which is adapted for Stephenson’s (2007a) proposal in the example below (see Muñoz 2019 for a similar constraint):
In principle, such a presupposition has the potential to restrict the possible judges for an SP in the following way. Multiple judges could be disallowed when the subject DP is read de dicto since it could be that the referent of the implicit variable fails to have acquaintance with the object in the doxastic alternatives. However, we do not believe that such a move will rescue the examples constructed above. (87) repeats 80b from above, with the LF argued to be impossible:
Given (86), in order for the LF in (87b) above to be felicitious, it must be the case that g(10) (i.e., Sue) has visual experience of the stuffed animal in question in the belief indices of Mary that are quantified over (where the actual indices quantified over may depend on assumptions of how the presupposition projects out of the attitude quantifier). But this is, in fact, quite easy to satisfy; since Mary and Sue are looking at, and discussing, the catalog together, it is natural that Mary would believe that Sue has seen the object they are discussing. Moreover, the de dicto reading is possible with an overt judge in the same scenario (Sect. 3.3). Assuming that the AI holds for overt judges as well (shown in Anand and Korotkova 2018), it seems unlikely that the lack of a de dicto reading for the bare use SP is due to an unmet presupposition.
Beyond the immediate issue of this example, it is worth noting that there are independent empirical problems with the idea that SPs semantically encode a restriction requiring the judge to have a certain perceptual experience. The AI is present in simple positive and negative assertions, but when embedded under several kinds of operators, including epistemic modals, indirect evidentials and attitudes compatible with indirect evidence, it disappears (Klecha 2014; Ninan 2014, 2020; Anand and Korotkova 2018; Cariani 2021):
It is thus possible to construct cases where the judge not only doesn’t have perceptual experience, but could not. In 89, for example, the SP object went extinct before the speaker could try it, hence violating the AI presupposition for an autocentric judge.Footnote 34
Footnote 35
While we believe that (89) is acceptable with an autocentric judge, it is admittedly possible that the acceptability stems from an exocentric judge (i.e., some group of humans contemporaneous with silphium). In (90a) and (90b), we attempt to control for this, by choosing an object that went extinct long before mammals existed (90a) and an event that ceased before life existed (90b), and hence could not be visually apprehended with current astrophysical techniques.
Footnote 36
Similarly, building on contrasts discussed by MacFarlane (2014), we can observe that SPs can be used inside counterfactuals without requiring that the judge be acquainted with the SP object in the counterfactual worlds. This is most easily observable in cases where the counterfactual premise explicitly denies any acquaintance with the SP object, as in (91a), or the existence of the judge, as in (91b):
Despite the lack of any acquaintance, in both cases in (91) we believe that there is nevertheless a salient autocentric interpretation, where the judgement of brilliance or beauty is that of the speaker. These cases thus furnish a crisp argument against treating the AI as a uniform requirement of some kind of relation between the judge and the SP object.
The few accounts of the AI that attempt to deal with the above facts end up eliminating it from embedded SPs altogether. For Ninan (2014), the AI is not hard-wired to the semantics of SPs but is due to the pragmatics of SP assertions, thus not affecting the interpretation of embedded SPs. Anand and Korotkova (2018) model the AI as a presupposition, but one that is designed to be trivially satisfied under epistemics, certain attitudes and other operators that affect evidentiary grounds for a claim (as Anand and Korotkova 2018 discuss, it is only those markers that allow or require indirect evidence, but not intensional operators across the board; pace Ninan 2020; Cariani 2021). To sum up, we should expect the AI—whatever its precise etiology—to not be operative for most attitude complements, and hence not something we could invoke to rule out multi-judge de dicto readings. We conclude that can dissociate theories do not capture our data, after all.
must dissociate theories
We now turn to two must dissociate theories, the context-shifting account sketched by McCready (2007) and Bylinina et al. (2014) and the logophoric binding account of Pearson (2013a). We observed above that can dissociate theories incorrectly predict that multi-judge sentences should allow nominals to be read de dicto (without apparent contradiction). As we will see, both of the must dissociate theories correctly prevent de dicto nominals in multi-judge sentences, principally because they do not allow non-local judges.
The question of de re nominals in multi-judge sentences is more complex. In Sect. 4.2, we highlighted two mechanisms for de re interpretation: (i) non-local binding, and (ii) scope taking at logical form. We noted there that for must associate theories, the judge-index correlation followed regardless of the mechanism for de re interpretation. But we will argue that the context-shifting and logophoric binding approaches crucially require the Scope Theory to capture our correlation. The Scope Theory has received several empirical challenges over the years, and we thus take the dependence of these theories on the Scope Theory as a sign that they cannot, in fact, derive the judge-index correlation. We will begin this argument in reverse, first reviewing the empirical evidence against the Scope Theory and then considering the two theories of SPs. We elaborate on this issue in the Appendix, where we show that must dissociate theories do not get our data in the concept generators framework, a prominent current theory of de re interpretation (see Charlow and Sharvit 2014 for discussion).
Problems for the Scope Theory
One central problem of the classical Scope Theory is that it correlates the index of evaluation for the predicates within a nominal with the scopal position of the nominal. What this means is that it predicts a correlation for quantificational nominals between de re interpretation and scope-taking with respect to other quantificational operators. There are several counterexamples to this prediction (see Keshet 2008; Schwarz 2012; Keshet and Schwarz 2019 for discussion). One prominent instance is given in (92) below.
There is a reading of (92) where Mary forms a desire to buy, let’s say, a certain kind of pyramid tent. In this case, there is no specific pyramid tent she wants to buy, just a certain type. Unbeknownst to her, the speaker happens to own that precise kind of pyramid tent. Under the Scope Theory, this kind of reading is unavailable. On the one hand, the description that the tent is like mine cannot be evaluated with respect to the local world introduced by want, because that is not part of Mary’s desires (nor, indeed, her beliefs). That will mean that a tent just like mine will need to scope above want, as in (93). But want is also a quantificational operator, and hence scoping above it will require that there is a particular tent like the speaker’s that all of Mary’s desire worlds agree upon, contrary to fact.
SP-versions of (92) show a similar ‘some or another of this type’ reading.Footnote 37 Thus, (94) has a reading which conveys that the speaker is alleging the tent to be ugly without imputing to Mary such a belief.
As for (93), the relevant readings for these sentences are not predicted to be available under the Scope Theory. In response to this difficulty, several kinds of approaches have arisen, including, most prominently, the extensional system with non-local binding that we have been using. In this system, an ugly tent can stay within the quantificational scope of want but still be evaluated relative to the matrix index, as (95) shows:
Another problem for the classic Scope Theory is so-called scope trapping, where a de re expression contains material bound by some structurally higher de dicto expression—effectively, restricting the LF site of the de re term to something relatively low. The setup is illustrated in (96) below, inspired by Bäuerle’s (1983) examples.
By design, since John does not bake his cakes on his friends’ actual birthdays, the phrase on each of his friends birthdays must be interpreted de dicto. But since this expression quantifies over the day that that day refers to, it traps that phrase (and hence the DP containing it) below it, and hence below think as well. But being in the scope of think will mean that the DP will also be interpreted de dicto and that disgusting will be interpreted relative to the local judge, like tasty. We should thus generate a contradiction in this scenario, and the absence of a contradiction is a problem for the Scope Theory (or any other scopal approach to de re, e.g. Keshet 2010).
Above we have replicated existing problems for the Scope Theory of de re in analogous sentences with SPs. If indices contain judges and we allow non-local index binding, then this is entirely as expected. De re interpretation arises via non-local binding, and the non-local index determines both the world of evaluation and the SP judge. Insofar as the literature has already shown that the Scope Theory is problematic for de re interpretation, it may seem unimportant to dwell on this fact. However, in the sections below, we will examine two theories which separate judges and worlds of evaluation and which require local judges. We will argue that those two components will force these theories to adopt the Scope Theory of de re. We will thus predict that either (i) multi-judge sentences should be wholesale unavailable (if the wide-scoping LFs required by the Scope Theory were, for some reason, systematically blocked), or that (ii) multi-judge sentences should be possible, but not with interpretations that would violate Scope Theory LFs, such as scope-trapping. The fact that we replicate the extant problems for the Scope Theory even with SPs thus furnishes a serious problem for the theories we are about to review.
Indexical contextualism
We begin by examining an indexical contextualist theory (this approach is inspired by McCready 2007 and Bylinina et al. 2014). At first blush, this theory looks like a member of the must associate class. In this approach, the judge is treated very much like a shiftable indexical along the lines first proposed in Schlenker (2003). There are two components. First, the context, and not the index, includes the judge coordinate that SPs are relativized to, \(c = \langle author, world, judge\rangle \), hence the term contextualism (as elsewhere in the paper, we supress coordinates of the context that are not relevant for us, such as time or location).
Second, SPs differ from ordinary indexicals in attitude reports. While SPs readily receive an interpretation relative to the attitude holder, English indexicals I and you typically remain speaker-oriented.Footnote 38 In order for the indexical contextualist to capture it, attitude verbs are treated as monstrous operators over contexts (in the sense of Kaplan 1989; Schlenker 2003) and uniformly shift the judge coordinate to the attitude holder, as in (98):
As attitude predicates shift the judge coordinate of the context, all judge-dependent items in the scope of an attitude should be interpreted with the attitude holder as the judge. For multi-judge sentences, this correctly derives a contradiction in case the nominal is read de dicto.
Now, let us consider what occurs if the subject is read de re. Since the attitude is what shifts the judge parameter, if the nominal is raised out of the attitude’s scope, it will be interpreted relative to the matrix judge. Thus, if we adopt the Scope Theory, de re interpretation of the nominal will allow a non-local judge, as we observed for must associate relativist theories in Sect. 4.2. However, if we use non-local index binding to derive de re readings, we will not allow a non-local judge. Instead, the judge will be the attitude holder, but the world of evaluation will be the matrix world (99).
The issue in (100) arises because there is no necessary connection between the index and context of evaluation for a particular DP, and so using a non-local index does not require using a non-shifted context.Footnote 39 The Scope Theory avoids this issue because the particular landing site of de re movement is above the attitude predicate, and hence outside the shifted context. Thus, in indexical contextualism judges are obligatorily local, relative to the attitude predicate’s syntactic scope. The sole way around this is to escape that scope. As we will show next, the logophoric binding approach in Pearson (2013a) has a similar signature, and thus also requires the Scope Theory.
Logophoric binding (Pearson 2013a)
The account in Pearson (2013a) is what Coppock (2018) calls sophisticated contextualism. The proposal aims to derive several properties of SPs without substantial appeal to any special technology. For reasons of space, we will only concentrate on some aspects of this system; see Pearson (2013a) for a detailed defense. In (101), we have provided the logical form and interpretation for a sample sentence with an SP.
In a nutshell, the approach has three components. First, SPs are argued to be individual-level predicates (see discussion in Sect. 3.2), and individual-level predicates are analyzed as inherently generic (Chierchia 1995, though see Czypionka and Lauer 2016). This is responsible for the GEN operator in (101). Second, Pearson assumes that SPs are dyadic predicates with a judge argument, although the judge is bound by the generic, as with \(x_4\) in (101c). Finally, SPs impose a restriction on the domain of GEN: they restrict GEN to a set of individuals that the speaker (or attitude holder) empathizes or identifies with.Footnote 40 To this end, Pearson proposes that SPs involve first-person genericity (cf. Moltmann 2010a, 2012), via the relation I of identification, designed to encode those individuals whose perspectives are under consideration:Footnote 41
The crucial component for present purposes is the identifier argument \(y_1\). In Pearson’s system, it is bound by the \(\lambda _1\) operator at the left periphery of the clause. As a shorthand, we will call this kind of individual binder a logophoric binder. Embedded clauses also introduce a logophoric binder, and the system provides interpretive rules so that root logophoric binders map to the speaker (to result in speaker-autocentric readings) and embedded logophoric binders map to attitude holders (to handle reported autocentric readings). Thus, there is a unified LF in both cases, with embedded and root clauses both interpreted as properties of individuals (cf. Chierchia 1989), and interpretive differences arise from how those properties are further combined.Footnote 42
Alongside the licit LFs above, there is another potential LF given in (104), where the identifier argument of an embedded SP is bound by the matrix binder, yielding a speaker-oriented SP in embedded clauses:
In order to block the logical form in (104), Pearson assumes that the identifier argument must be bound by the closest possible binder (cf. similar constraints in Anand 2006; Hacquard 2006; Percus 2000):
With this brief summary, we can now return to our test cases, repeated again in (106) below:
How does Pearson’s account fare with respect to those cases? Our interest in answering this question is ultimately about the identifier argument to the I predicate. We will thus dispense with the generic quantifier in what follows. To make this concrete, we will reformulate the LFs as ones where the SP take the identifier directly, and where the identifier is subject to the CIB.
For de dicto subjects (106a), we would obtain the following LF, where dog, adorable, and ugly are all evaluated with respect to \(w_{22}\), the world introduced by the intensional operator:
Due to the CIB, adorable and ugly will have the same binder for their identifier, \(\lambda _{21}\), which will make them both relative to the same judge, Mary. This will yields a contradiction. So the account in Pearson (2013a) does not overgenerate and correctly rules the unattested (106a) out. By the same token, it also rules out the unattested (106d): the noun will be interpreted de dicto, as in (107), and the framework does not allow non-local judges without de re.
However, this theory cannot derive the fact that de re interpretation results in a non-local judge, and this is the result of the CIB. Note first that the Scope Theory will yield non-local judges. Scoping the DP outside the embedded clause will mean that it is subject not to the \(\lambda _{21}\) logophoric binder but to the non-local \(\lambda _1\) binder, parallel to the one we observed for the approach in Lasersohn (2005) in Sect. 4.2 and for indexical contextualism in Sect. 4.4.2:Footnote 43
However, just as with indexical contextualism, it is the Scope Theory that is doing the job of linking the world and judge of the SP. Even though the world and the identifier are bound by separate binders, the particular landing site of de re movement is above both of them, and hence the world and identifier correlate. If we consider long-distance binding for de re interpretation instead, then we yield the wrong interpretation, precisely as with indexical contextualism. While the world argument of the DP is bound long-distance, the CIB forces a local judge, yielding a contradictory interpretation (109).
To recapitulate, the accounts in Pearson (2013a) and Bylinina et al. (2014) can generate felicitous interpretations of multi-judge sentences only if de re attitude ascription proceeds via movement of the subject DP outside of the attitude. But we have discussed that we can create scope paradoxical sentences like (96) with multi-judge sentences, meaning that such movements are not necessary for de re subjects in such sentences (Charlow and Sharvit 2014 discuss further problems for the Scope Theory). We thus have strong evidence against splitting judges from worlds of evaluation. We follow up on this discussion in the Appendix, where we show that when Pearson’s (2013a) account is coupled with concept generators, it still fails to predict the only reading available for attributive SPs in multi-judge scenarios under attitudes.Footnote 44
Deriving the correlation via identification
We have argued that Pearson’s logophoric binding approach does not account for multi-judge sentences because the CIB prevents long-distance logophoric binders. However, the CIB was originally advanced for main predicate SPs. It therefore may be possible that it applies only to that position, much like Percus (2000) argues for world-binding. In that case, we could suppose that attributive SPs allow any c-commanding binder, subject to whatever constraints SPs impose (cf. Sæbø’s 2009 compositional differences between main predicate vs. attributive SPs). As with the can dissociate approaches, it may become difficult to rule out contradictory mixed LFs. In Sect. 4.3, we argued against the possibility of using the acquaintance inference to restrict the potential judges for a given world. However, the logophoric binding system additionally has an identification relation, I, that restricts the generic. It may be possible to use that relation to appropriately constrain interpretations.
Let us now re-consider the kind of mixed-judge LF that the CIB was designed to block:Footnote 45
We argued that without the CIB, nothing would block this unattested mixed reading with only the nominal read de dicto. One alternative mechanism to consider is the I relation. If identification is a matter of empathic perspective-taking, then it might seem reasonable to suggest that the identifier and identifee must stand in a certain relation in \(w_{22}\), and hence must both be residents of \(w_{22}\).Footnote 46 Minimally, then, we could assume the following presupposition:
If \(y_1\) is not present in \(w_{22}\), then we can rule out the undesired LF in (110) on the basis of restrictions arising from the I relation. This is a potentially elegant way to capture the judge-index correlation within a system like Pearson’s. In what follows, we will argue that despite this elegance, there are problems with assuming that (111) is a restriction on felicitous use of SPs and that \(y_1\) is not in \(w_{22}\).
When we contemplated the restrictive powers of the acquaintance inference in Sect. 4.3, we presented counterfactuals whose premises denied the existence of the speaker in the counterfactual worlds (91). In that vein, consider what George Bailey from the movie It’s a Wonderful Life might say after being shown the alternate timeline without him:
Such examples are comprehensible and felicitous, contrary to what we would expect if the presupposition in (111) were always active. Conversely, in the case of our multi-judge scenarios, we are also skeptical of the claim that \(y_1\) doesn’t exist in \(w_{22}\), though here the discussion becomes a bit more involved.
At first, it seems clear that in some scenarios \(y_1\) is in \(w_{22}\). In particular, in the scenario in which Sue utters the sentence corresponding to (110), she exists in some form in Mary’s doxastic alternatives, since Mary and Sue are having a conversation about the animals in the catalog (and hence, should be present in each other’s doxastic alternatives). And, again, the overt judge subject DP a dog adorable to me in (113) allows the multi-judge de dicto reading, showing that the speaker can exist in the intensional indices.
This much is true for the simplified version of Pearson’s system we have been considering so far. It thus appears that there is evidence from counterfactuals against requiring the intuitive identifier from being in the local world of evaluation as well as evidence from overt judges that in our case \(y_1\) is in \(w_{22}\).
However, the interested reader may wonder how Pearson’s more complete system involving self-ascription of de se properties affects the reasoning above. In what follows, we briefly consider some possible responses, referring readers to Pearson (2013a) for justification for the details we will now assume.
In Pearson’s complete system, the logophoric binders create a property that is applied not to the speaker or attitude holder, but to their de se alternatives. In (110), the doxastic alternatives quantify over individual-world pairs, with worlds corresponding to worlds meeting Mary’s beliefs and individuals corresponding to who she self-identifies with in each belief world (as per Lewis 1979), and \(y_{21}\) picks out Mary’s self-ascribed alternative in each \(w_{22}\), not necessarily Mary herself. Similarly, \(y_1\) is not the speaker, Sue, in the actual world, but each of Sue’s de se doxastic alternatives relative to the possible \(w_2\) alternatives to the actual world. In light of this nuanced understanding, the question is whether Sue’s de se alternative \(y_1\) is present in Mary’s belief worlds.
Reasoning from the data alone, the overt judge form in (113) would suggest that \(y_1\) is present. In Pearson’s system, the overt judge me is bound by \(\lambda _1\) precisely as the identifier of a covert identification relation, since the first person indexical is interpreted de se and all de se expressions involve logophoric binding (see detailed discussion in Pearson 2013b). If the overt judge is in \(w_{22}\), then so should be the covert identifier. We then should be able to generate the same multi-judge de dicto reading.
We can think of one possible, admittedly complex, response to this. Lewis (1986) proposes a metaphysical framework in which individuals are strictly world-bound (i.e., the mapping from individuals to worlds is one-to-one). Under such a view, Sue’s de se counterparts could never be in \(w_{22}\), since they are in \(w_1\) worlds. To handle the grammaticality of (113), we would need to assign the referent of me a de re interpretation with respect to Mary’s doxastic state, producing a cross-world counterpart that actually exists in \(w_{22}\). We thus would have an expression with a de se-de re profile: it is de re with respect to the lower attitude (Mary’s thoughts) and de se with respect to the higher one (Sue’s self-ascribed statement). Then such an approach would have to stipulate that the covert identifier for an SP cannot be similarly interpreted de se-de re. Note that we cannot appeal to the fact that the covert identifier is an obligatory de se anaphor, since such anaphors are routinely interpreted de se-de re when further embedded under other attitudes (see Anand 2006 for discussion).
Stepping back a bit, any view that makes individuals inhabitants of exactly one world renders individuals strictly more informative than worlds alone. Such an approach would not need SPs to depend on individuals and worlds, nor have property abstraction over individuals and worlds. In both cases, individuals would make a suitable proxy for an individual-world index in our discussion, hence invalidating the idea of breaking apart centered worlds to begin with.
To sum up this discussion, ruling out the multi-judge de dicto reading within Pearson’s system involves three requirements: (i) that the identifier is in the world of evaluation, (ii) that individuals are world-bound, and (iii) that overt elements read de se allow de se-de re interpretations, but covert ones do not. In turn, we have shown that none of these requirements are innocent. The first seems to be empirically false in general, the second undermines the idea of separating worlds and judges, and the third is, at present, a stipulation.