1 Introduction

Recent work has advanced the idea that cases are decomposed featurally as in Table 1 (Caha 2009; McFadden 2018; Smith et al. 2019; Zompì 2019). Under this view, the Case features of the nominative form a proper subset of those of the accusative, which in turn form a proper subset of those of the dative.Footnote 1 We refer to systems such as the one in Table 1 as Strong Case Containment (SCC).Footnote 2 As will be discussed in Sect. 2 of the present paper, a benefit of adopting SCC is that, when combined with the widely assumed Subset Principle (Halle 1997; Halle and Marantz 1993), it straightforwardly captures a robust cross-linguistic generalization about Case morphology known as *ABA: no rule of exponence applies in both the nominative and the dative without also applying in the accusative (Caha 2009; McFadden 2018; Smith et al. 2019).Footnote 3,Footnote 4 *ABA for the order nomaccdat is predicted because, under SCC, it is impossible for a rule of exponence to exclusively apply in accusative contexts unless a more specific rule applies in dative contexts.

Table 1 Strong Case Containment

The success of SCC in accounting for *ABA motivates a move away from more traditional systems where the three cases are not in a proper containment relation to one another, as in Table 2, and which we will be referring to as No Case Containment (NCC).Footnote 5 NCC does not allow for a straightforward analysis of *ABA because, in this system, rules can be readily specified to apply exclusively in accusative contexts ({\(k_{1}\), \(k_{3}\)}), with no implications for the corresponding dative (or nominative).

Table 2 No Case Containment

Interestingly, opting for SCC over NCC does not come without problems. The main empirical issue with SCC that we discuss in this paper is exemplified in Table 3, which features what we call a Non-Elsewhere Nominative Stem (NENS). The distribution of the h-stem and the t-stem in the paradigm of Table 3 exemplifies, as we shall see, a pervasive pattern in Indo-European languages. A straightforward description of this pattern is that it involves one stem, here the h-stem, applying in nominative singular contexts, and another one, here the t-stem, applying elsewhere. This situation is perfectly consistent with NCC, under which we can represent the distribution of the two stems in Table 3 as the result of competition between an h-rule specified for {\(k_{0}\), \(k_{3}\)} singular contexts, and a t-rule that is not specified for any Case or Number features. Since the h-rule is specified for a proper superset of the features that the t-rule is specified for, the h-rule will be chosen over the t-rule in nominative singular contexts under any version of the Subset Principle. Under SCC, on the other hand, the pattern of Table 3 does not receive a straightforward analysis. As we will see in Sect. 3, the problem is that SCC offers no way of writing a non-elsewhere rule that will apply exclusively in nominative contexts, because all the features of the nominative are present in accusative and dative contexts as well. We will argue that the introduction of “markedness” meta-features by McFadden (2018) as a way of side-stepping this problem leads to an untenable notion of markedness and that it therefore does not provide a viable solution to the problem raised by NENSs for SCC.

Table 3 Doric Greek feminine definite determiner (Szemerényi 1996: 205)

The tension between SCC and NCC leads us to our main theoretical question: how should we represent nominative, accusative and dative in order to straightforwardly capture both *ABA and the existence of NENSs? In Sect. 4, we show that a system such as the one in Table 4, which combines properties of SCC and NCC and which we refer to as Weak Case Containment (WCC), is a system that allows us to do just that.

Table 4 Weak Case Containment

Section 5 discusses the predictions made by WCC about the derivation of AAB and ABC patterns and argues that these are in line with the empirical landscape, despite a couple of apparent problems. Section 6 concludes the paper with a discussion of a potential additional advantage of WCC over both SCC and NCC when it comes to capturing the (un)attested patterns of surface containment between nominative, accusative and dative forms.

2 *ABA: SCC over NCC

There is a history of thinking of cases like the nominative, the accusative and the dative as being universally ordered relative to each other—specifically, with the accusative sandwiched between the nominative and the dative (Blake 2001; Bobaljik 2008; Marantz 1991; Yip et al. 1987). This idea has gained new momentum as a result of a recently discovered empirical generalization, which we refer to as *ABA. According to this generalization, no rule of exponence ever applies in both the nominative and the dative without also applying in the accusative (Caha 2009; McFadden 2018; Smith et al. 2019; Zompì 2017, 2019). Take, for example, the patterns found in the distribution of stems in pronominal paradigms, illustrated in Table 5 (stems are bolded). Given the order nomaccdat, we find instances of AAA (no stem allomorphy at all), ABB (same stem for acc and dat to the exclusion of nom), AAB (same stem for nom and acc to the exclusion of dat) and ABC (a different stem for each case), but never ABA (see Smith et al. 2019 for the original cross-linguistic survey).Footnote 6 The same goes for patterns of non-accidental Case-affixal syncretism, as illustrated with Icelandic examples in Table 6 (affixes are bolded). Given the same order nomaccdat, we sometimes find the same Case (and Number) affix in all three cases (AAA), sometimes in just acc and dat (ABB), and sometimes in just nom and acc (AAB), but we never find ABA (Caha 2009; cf. Baerman et al. 2005).

Table 5 *ABA in stem suppletion for nomaccdat
Table 6 *ABA in affix syncretism for nom ≺ acc ≺ dat (Icelandic examples; Einarsson 1949: 38–44, 68)

The *ABA generalization has been taken to support a particular implementation of the idea of a universal case ordering—one whereby the nominative involves a proper subset of the features of the accusative and the accusative involves a proper subset of the features of the dative, as in Table 7.Footnote 7 We refer to this featural decomposition of the three cases as Strong Case Containment, or SCC.

Table 7 Strong Case Containment

The appeal of SCC is that it straightforwardly derives *ABA within realizational morphological frameworks that adopt the Subset Principle in (1).Footnote 8

  1. (1)

    Subset Principle (cf. Halle 1997: 128)

    1. a.

      A rule of exponence applies to a structure only if that rule is specified for a subset of the features present in that structure.

    2. b.

      If several incompatible rules of exponence may apply to the same structure, only the rule specified for the greatest number of features applies.

An exhaustive list of the derivations that become possible once we combine the Subset Principle in (1) with SCC is found in Table 8. On the far left side of the table, we have the number of possible rule inventories that lead to distinct derivations. The second column provides the rules in each of these inventories, numbered as A, B and C, along with the Case features that these rules are specified for.Footnote 9 The next three columns of the table represent the competition between the rules made available by each inventory, in each of the three contexts: nominative, accusative and dative. The lightest shade of gray indicates that the rule of that row wins the competition in the specific context, the middle shade indicates that the rule is eligible to apply but is blocked by a more specific rule, and the darkest shade indicates a rule that is not eligible to apply in that context (see also key above the table). The far right column of the table points out the resultant surface pattern for each possible derivation.

Table 8 Derivations of AAA, ABB, AAB and ABC, assuming the Subset Principle and SCC

AAA patterns are derived simply when a rule A is not specified for any Case features (∅) and there is no other rule that is. By contrast, ABB patterns will arise in case two rules A and B are specified for ∅ and {\(k_{1}\)}, respectively: in the nominative, only A will be eligible to apply and therefore will apply, whereas, in both the accusative and the dative, both A and B will be eligible to apply, but B will win over A in both cases by virtue of being specified for a greater number of features (\(\lvert \{k_{1}\}\rvert >\lvert \emptyset \rvert \)). By the same token, AAB patterns will arise in the presence of two rules A and B, respectively specified for ∅ and {\(k_{2}\)} (or for ∅ and {\(k_{1}\), \(k_{2}\)}): rule A will be the only rule eligible to apply in nominative and accusative and, therefore, will apply; both A and B will be eligible to apply in the dative but B will win due to it being more specified. Finally, ABC patterns will be derived as the result of three rules A, B and C, specified for ∅, {\(k_{1}\)} and {\(k_{1}\), \(k_{2}\)}, respectively: in the nominative, A will apply as the only eligible rule; in the accusative, both A and B will be eligible, but the latter will win; in the dative, all three rules will be eligible, but C will win. The only logically possible pattern that is not derivable in this system is ABA. This is because, for any two-rule inventory (i.e. any inventory that only includes a rule A and a rule B) where one rule applies in the nominative and the other in the accusative, the rule that applies in the accusative will also win the competition in the dative.

SCC contrasts with a more traditional decomposition of nominative, accusative and dative that we refer to as No Case Containment (or NCC), whereby none of the three cases is properly contained within either of the others. NCC has been typically adopted in terms of binary features, such as the ones used in Calabrese (2008) in Table 9.Footnote 10 In order to facilitate the comparison of this system with SCC, we have translated NCC into privative features in Table 10 by adding to SCC two additional features, namely \(k_{0}\), which is the equivalent of −motion in being a feature present only in nominative contexts, and \(k_{3}\), which is equivalent to −peripheral in being a feature that is present in nominative and accusative contexts but absent in dative contexts.

Table 9 NCC (in terms of binary features)
Table 10 NCC (in terms of privative features)

Importantly, unlike what we saw for SCC, once NCC is combined with the Subset Principle, ABA patterns are readily generable. Table 11 lists all the possible derivations under NCC and the Subset Principle, including one with the surface pattern ABA (row 2). Such patterns are generable by positing a rule A that is not specified for any Case features and a rule B specified to apply in accusative contexts via reference to the accusative-specific set of features {\(k_{1}\), \(k_{3}\)}. Given the above, if the literature is correct that ABA patterns are not attested in the languages of the world, NCC suffers from an overgeneration problem that SCC does not.Footnote 11

Table 11 Derivations of AAA, ABA, ABB, AAB and ABC, assuming the Subset Principle and NCC

As one can verify by comparing Tables 8 and 11, the possibility of deriving ABA is not the only way in which SCC and NCC differ from each other. Notably, whereas NCC allows for deriving the patterns ABB, AAB and ABC in multiple ways (three ways of deriving each of the patterns ABB and AAB, and nine ways of deriving ABC patterns), SCC predicts a specific derivational path for each of these patterns.Footnote 12 The question that arises and with which we will be concerned for the rest of the paper is whether, in addition to capturing *ABA, SCC also makes the right predictions for the derivations of ABB, AAB and ABC patterns.

3 Derivations of ABB: NCC over SCC

In the present section, we argue that SCC’s strong predictions about the derivation of ABBs face insurmountable empirical problems. This will lead us to the conclusion that, despite its success with *ABA, SCC must ultimately be rejected.

3.1 The argument against SCC

As one can verify by looking at row 2 of Table 8 (the row is repeated below as Table 12), there is only one way of deriving ABB under SCC. The derivation of this pattern involves two rules, A and B, the former not specified for Case features and the latter specified for \(\{k_{1}\}\). This leads to a prediction about the possible occurrence of A and B in other parts of the paradigm. Specifically, since \(k_{1}\) is, by hypothesis, never present in nominatives, we expect that the prediction in (2) should hold true.

  1. (2)

    For a given context \(C_{1}\) where we find the pattern ABB, there should be no context \(C_{2}\) where rule B applies in the nominative.

Table 12 Derivations of ABB, assuming the Subset Principle and SCC (from Table 8)

The prediction in (2) appears to be falsified by paradigms such as the one from Doric Greek in Table 3, repeated here as Table 13. This paradigm involves precisely what we expect not to find according to (2): an ABB pattern in the singular (i.e. h-stem, t-stem, t-stem) whose B exponent (i.e. the t-stem) is also found in nominative contexts (in the dual and plural). Perhaps a more intuitive way of formulating the problem is the following: in the most straightforward description of a paradigm such as that of Doric Greek, the t-stem appears to be the “elsewhere” stem, with the h-stem specified to only occur in nominative singular contexts, as in (3). However, the analysis in (3) is ruled out under SCC, which states that, in any ABB pattern, rule B must be more specific than rule A.

  1. (3)
    1. a.

      Insert h-stem in nominative singular contexts

    2. b.

      Insert t-stem elsewhere

Table 13 Doric Greek feminine definite determiner (Szemerényi 1996: 205)

Non-Elsewhere Nominative Stems (NENSs) such as the h-stem in Table 13 are widely attested, at least across Indo-European languages. As seen in Table 14, a NENS can even be observed in English. Examples of NENSs in other Indo-European languages that we have encountered include: the 1st-person pronoun in Modern Greek (Holton et al. 2012: 113) and Latvian (Praulin̹š 2012: 54), the demonstrative pronouns in Eastern Armenian (Dum-Tragut 2009: 130–131), Gothic (Braune and Heidermanns 2004: 134–135), Icelandic (Einarsson 1949: 70), Old English (Hogg and Fulk 2011: 192–195), Old Norse (Barnes 2008: 63–64), Sanskrit (Mayrhofer 1978: 58–59) and Tocharian (Krause and Slocum 2007–10: Sect. 3.1), the Gothic relative pronoun (Braune and Heidermanns 2004: 136), the Latvian “emphatic” pronoun pats(Praulin̹š 2012: 465)and the 3rd-person pronouns of Afrikaans (Donaldson 1993: 123), Dutch (Donaldson 2008: 66), Frisian (Tiersma 1999: 55), Gothic (Braune and Heidermanns 2004: 133) and Low German (Matras and Reershemius 2003: 22).

Table 14 English feminine singular 3rd person pronoun

Under NCC, on the other hand, no prediction is made regarding the relative specificity of rules A and B in ABB patterns. As can be seen in rows 3–5 of Table 11 (repeated below as Table 15), under NCC, ABB patterns may be derived not only by inventories where rule B is more specific than rule A (like row 5), but also by inventories where rule A is more specific than rule B (like row 4). This renders NCC perfectly consistent with an analysis like (3) for a paradigm like that of Doric Greek, and as such, NCC appears to be superior to SCC when it comes to capturing the existence of NENSs.Footnote 13

Table 15 Derivations of ABB, assuming the Subset Principle and NCC (from Table 11)

We should point out that the argument relies on a meta-theoretical assumption that we take to be fundamental to most work in morphology. It would be possible to posit rules that make reference to disjunctive contexts, as in (4), in order to satisfy SCC’s requirement that the t-stem be more richly specified than the h-stem. While positing (4) would technically solve the problem under SCC, admitting rules like (4a) (without anything else restricting possible disjunctions) opens the door to describing any type of exponent distribution under any theory of features by simply listing the contexts in which each exponent appears. Since we take this to be an extremely weak position, for the sake of restrictiveness, we reject the general approach that appeals to rules that reference disjunctive environments.Footnote 14

  1. (4)
    1. a.

      Insert t-stem in contexts that are non-singular or non-nominative

    2. b.

      Insert h-stem elsewhere

We conclude that, once both *ABA and the existence of NENSs have been considered, a tension arises between SCC and NCC: on the one hand, SCC offers a straightforward account of *ABA, while NCC does not. On the other hand, SCC makes a prediction about the relative specificity of A and B in ABBs which appears to be too strong, while NCC is appropriately flexible to accommodate the NENS pattern.

3.2 A note on the empirical basis of our argument

Before we move on, let us clarify some choices we have made in constructing the empirical basis of the argument against SCC in Sect. 3.1. Particularly problematic for SCC are cases where some exponent can be identified as making direct reference to nominative contexts, i.e. an exponent that occurs in nominative contexts but which could not reasonably be taken to be an elsewhere exponent. Though we have come to believe that exponents of this type are abundant (at least) in Indo-European languages, finding the right paradigms to make the case for their existence can be tricky if one wants to control for possible alternative analyses. Below we provide justification for why we opted for basing our argument on certain types of paradigms and not on others, as a means of controlling for alternative analyses that would not necessarily require direct reference to nominative contexts.

One type of paradigm that might seem to make the same point that we made in Sect. 3.1, but which we have intentionally excluded from our database, can be exemplified by the one in Table 16 from Old English, where an s-stem only occurs in nominative environments and a -stem has what looks like an elsewhere distribution. What prevents us, then, from safely diagnosing the s-stem here as a NENS? The property of the paradigm in Table 16 that is of potential importance is that, in all cells in which the -stem occurs in nominative contexts, the form is syncretic with the corresponding accusative form. This fact crucially opens the door to an analysis under which the s-stem is the elsewhere stem, despite appearances. Such an analysis would account for the syncretism via a feature manipulation that effectively turns nominative cells into accusative cells before rules of exponence apply.Footnote 15 Under this analysis, nominative cells that involve the -stem do not really involve nominative feature bundles, but accusative ones. Proponents of such an analysis could then claim that the -stem really is specified to occur in non-nominative contexts (i.e. \(k_{1}\)), but that this is obscured by the feature manipulation. In order to rule out possible analyses of this type, we have decided to exclude from our database paradigms like the one in Table 16 and to only include ones where at least one of the nominative cells featuring the putative elsewhere stem is not syncretic with its corresponding accusative. Paradigms that we dismissed for such reasons include the Modern and Classical (Attic) Greek definite determiner (Holton et al. 2012: 52; Kühner and Blass 1890: 604), the Albanian adjectival article (Newmark et al. 1982: 181), the Lovari definite clitic pronoun (Pobożniak 1964: 58), as well as nominal paradigms such as that of Latin homō ‘human’ (Ernout 1953: 45–46) and Slovene mati ‘mother’ (Herrity 2015: 82).Footnote 16

Table 16 Old English distal demonstrative pronoun (Hogg and Fulk 2011: 192)

For a similar reason, we avoided paradigms such as the one in Table 17. In this Latin paradigm, the affix -us appears to be a Non-Elsewhere Nominative Affix, as it were, since it only appears in nominative contexts, at the same time as a different affix, namely -um, shows up in both nominative and non-nominative contexts, i.e. it shows an elsewhere distribution. Just like in the case of the Old English pronouns, however, one might be tempted to argue that the elsewhere distribution of -um is only apparent. Given the syncretism between nominative and accusative in neuter contexts, one may assume that neuter nominatives turn into accusatives before rules of exponence apply and that, therefore, the exponent -us is, despite appearances, the elsewhere affix, with -um specified to occur in accusative contexts. Recall that our solution to the analogous problem with stems was to focus on paradigms where at least one of the nominative cells featuring the putative elsewhere stem was not syncretic with its corresponding accusative. For affixes, this turned out to be impossible, since when the affixes of two cells are identical, their stems tend to also be identical. In other words, paradigms such as the one in Table 17 were as close as we were able to get to diagnosing Non-Elsewhere Nominative Affixes. Given the possibility of treating syncretism in terms of feature manipulation, we decided to exclude affixal patterns from our database altogether.

Table 17 Latin ‘good’ (Ernout 1953: 25)

Finally, we have tried to exclude from our database paradigms featuring an apparent NENS, but in which the relevant alternation lends itself to an analysis that appeals to phonological conditioning. Take, for example, the Latin paradigm in Table 18. Here, the stem senec- shows up only in nominative singular contexts, and the stem sen- shows up everywhere else.Footnote 17 In other words, the stem senec- appears to be a NENS. However, the distribution of the two stem alternants in this paradigm could also be taken to be governed by phonological properties of the affixes: senec- shows up only in cells where the affix is consonantal, whereas sen- shows up only when the affix starts with (or perhaps simply includes) a vowel. Should this be the right analysis of the distribution of these stems, the fact that one of them appears in the nominative singular and the other one everywhere else would be completely coincidental. Notice that stem alternations that are conditioned by phonological properties of the affix(es) are independently needed to account for otherwise inexplicable alternations like the one between gir- and gīr- in the Sanskrit paradigm in Table 19, with the short-vowel stem appearing before vowels and the long-vowel stem everywhere else (Stump 2015: 72–73).Footnote 18 In order to make sure that our argument against SCC was built on clear evidence that exponence may reference nominative contexts, we decided to also exclude paradigms such as the one in Table 18, which lend themselves to analyses referencing the phonological properties of the affixes.

Table 18 Latin ‘old man’ (Ernout 1953: 60)
Table 19 Sanskrit ‘song’ (Stump 2015: 73)

This filtering of our initial database has led us to mainly consider stem alternations in pronominal (and determiner) paradigms in building our argument against SCC.Footnote 19,Footnote 20 What we hope to have established is that, even if we admit the possibility of (reasonable) feature manipulations and phonologically-conditioned stem alternations, there still exist paradigms (the ones listed in Sect. 3.1 and Sect. 3.3) that are immune to such alternative analyses. It is these paradigms that most clearly provide evidence for rules of exponence making direct reference to nominative environments and therefore the ones that most severely undermine SCC. For the rest of this section we review the viability of a proposal made in McFadden (2018) as a way of side-stepping the problem of NENSs for SCC. We argue, however, that his proposal is ultimately not viable and that, therefore, the tension between SCC and NCC remains unresolved.

3.3 Why invoking “markedness” features is not the right solution to the problem

Acknowledging the problem that NENSs pose for SCC, McFadden (2018) proposes that we could maintain SCC if we allow rules of exponence to make reference to meta-features encoding an abstract notion of “markedness.” Concretely, McFadden proposes that “markedness” be conceived of as “the presence of a feature.” Assuming SCC and this notion of markedness, nominative would be “unmarked,” while both accusative and dative would be “marked” by virtue of them having features, as in (5a).Footnote 21 Assuming further that singular and plural are also decomposable as in (5b), with dual/plural marked and singular unmarked, the Doric Greek h-t- stem alternation (Table 13) can be understood as resulting from rules of exponence sensitive to McFadden’s notion of markedness: t- is inserted in environments where marked features are present (i.e. in all environments that are non-singular or non-nominative) and h- is inserted elsewhere. Rules of exponence with this effect are given in (6).

  1. (5)
    figure b
  1. (6)
    1. a.

      Insert t-stem in “marked” contexts

    2. b.

      Insert h-stem elsewhere

We argue that McFadden’s notion of markedness faces a severe empirical problem. For any category distinguishing two or more properties, at least one of these properties would have to be “marked.” This, however, does not seem to be correct. Consider, for example, the Latvian emphatic pronoun in Table 20. Given that the exponence rule for paš- does not apply to either nominative singular masculine or nominative singular feminine, we are forced, on McFadden’s approach, to conclude that neither masculine nor feminine in Latvian have Gender features. An even more striking manifestation of the same problem is found in the Tocharian anaphoric pronouns in Table 21, where neither masculine nor feminine can involve features, but at the same time these two Genders must still be distinguished by a stem alternation. Notice, further, that Gender as a category cannot be thought to be generally exempt from McFadden’s notion of markedness, since, if it were so, the approach would be left without an analysis of the stem alternation in the English 3rd person pronoun (Table 14) and others like it.Footnote 22

Table 20 Latvian emphatic pronoun ‘self’ (Praulin̹š 2012: 465)
Table 21 Tocharian A anaphoric pronouns (Krause and Slocum 2007–10: Sect. 31)

Tocharian also instantiates an analogous problem for the domain of Deixis. In Table 22, we find, again, the characteristic L-shaped pattern replicated this time across Deixis values. By the same logic that we applied above to Latvian, a markedness approach would be forced to assume that this language’s Deixis properties (i.e. distal and proximal) should both lack features. Moreover, just as with Gender, the category Deixis cannot be thought to be generally exempt from McFadden’s notion of markedness, since, for paradigms such as the one from Old English in Table 23, proximal would have to be assumed to be “marked.”

Table 22 Tocharian A masculine demonstrative pronouns (Krause and Slocum 2007–10: Sect. 31)
Table 23 Old English singular feminine demonstrative pronoun (Hogg and Fulk 2011: 192–195)

Given the existence of categories with multiple properties that would all have to count as “unmarked” (and that, moreover, would have to do so in some languages but not in others), we conclude that McFadden’s notion of markedness can provide at best a technical solution to the problem that NENSs pose to SCC. An implication of this conclusion is that the tension between SCC and NCC that we pointed out in Sect. 3.1 remains, effectively, unresolved.

4 Weak Case Containment

In this section, we propose a solution to the tension between SCC and NCC by adopting a system that is midway between the two, which we refer to as Weak Case Containment (WCC). The system is the one represented in Table 24, where the dative feature set still properly contains the accusative one (as in SCC), but nominative and accusative do not stand in a proper containment relation to one another (as in NCC). It should be clear why we say that WCC is “midway” from NCC to SCC: WCC is like SCC and unlike NCC in lacking a feature that is common to the nominative and accusative but absent in dative contexts, namely \(k_{3}\); on the other hand, WCC is like NCC and unlike SCC in having a feature that is present exclusively in nominative contexts, namely \(k_{0}\).Footnote 23,Footnote 24 We argue that this midway featural decomposition also allows us to have the best of both worlds, in terms of empirical coverage.

Table 24 Weak Case Containment

Recall that, under NCC, one could readily specify a rule with features only present in accusative contexts, namely with the feature set {\(k_{1}\), \(k_{3}\)}, which opened the door to the derivation of the ABA pattern. Under SCC, on the other hand, this was not possible: any rule that could apply in accusative contexts could also apply in dative contexts due to proper containment of the former in the latter. Since our WCC, like SCC, involves proper containment of the accusative inside the dative, it also shares with SCC the prediction that ABA should not be a generable surface pattern. Table 25 lays out all the possible derivations of the surface patterns AAA, ABB, AAB and ABC that are available once we combine WCC and the Subset Principle.

Table 25 Derivations of AAA, AAB, ABB, ABC assuming the Subset Principle and WCC

While WCC matches SCC’s success in ruling out ABA, it is not as strict as SCC when it comes to its predictions about the relative specificity of rules of exponence in ABB patterns. As can be seen from rows 2–4 of Table 25, unlike SCC but like NCC, WCC does not make the prediction that rule B in ABBs will always be more specific than rule A. In fact, rows 2–3 specifically show that, under WCC, there are derivations of ABB where rule A is more specific than rule B. As a result, WCC also matches the success of NCC in affording a straightforward analysis of NENSs.Footnote 25

For example, the Latvian stem alternation from Table 20, singling out nominative singular across both masculine and feminine, can now easily be captured by the rules in (7). Since neither the pat-stem nor the paš-stem are specified for Gender features, the pattern that their interaction gives rise to will cut across genders, as desired.Footnote 26

  1. (7)
    1. a.

      Insert pat-stem in \(k_{0}\) singular contexts

    2. b.

      Insert paš-stem elsewhere

What we hope to have shown is that, once we consider both the absence of ABA patterns and the existence of ABB patterns where rule A is more specific than rule B, both SCC and NCC face a challenge, whereas a system like WCC does not.Footnote 27 What we still have not addressed is how this kind of reasoning may extend to other patterns. SCC, WCC and NCC make predictions about the relative specificity of rules of exponence not only in ABB patterns, but in AABs and ABCs as well. For example, WCC predicts that rule B should always be more specific than rule A in AAB patterns (cf. row 5 of Table 25) and that rule C should always be more specific than rule B in ABCs (cf. rows 6–8 of Table 25). In the next section, we address these predictions and argue that, at least within Indo-European, they are not met with any convincing counterexamples (though see Sect. 5.1 for some apparent challenges from AABs), and that therefore, WCC seems to be a promising feature decomposition of nominative, accusative and dative.

5 Possible and impossible derivations of AABs and ABCs under WCC

5.1 AABs

WCC (like SCC) is more restrictive than NCC when it comes to deriving AAB patterns. Consider the three possible derivations of AAB under NCC in Table 26. Importantly, NCC allows us to derive AAB patterns either via inventories where rule B is more specific than rule A (row 7 in Table 26) or via inventories where rule A is more specific than rule B (row 8 in Table 26). The latter derivation, however, has no counterpart under WCC (or SCC; see row 3 in Table 8), because in WCC there is no such feature as \(k_{3}\). The only possible derivation of AABs under WCC is thus the one repeated below in Table 27, where rule A is unspecified for Case features, and where rule B is specified for (at least) the dative-specific feature \(k_{2}\). It thus follows, under WCC, that rule B in any AAB pattern will not be able to apply in nominative or accusative contexts elsewhere in the paradigm, since these contexts do not involve \(k_{2}\). This prediction is laid out in (8).Footnote 28

  1. (8)

    For a given context \(C_{1}\) where we find the pattern AAB, there should be no context \(C_{2}\) where rule B applies in the nominative or the accusative.

Table 26 Derivations of AAB, assuming the Subset Principle and NCC (from Table 11)
Table 27 Derivations of AAB, assuming the Subset Principle and WCC (from Table 25)

In the rest of this section, we review the AAB patterns of stem exponence that appear to counterexemplify (8) within Indo-European. While a detailed analysis of each relevant case study would take us too far afield, here we wish to provide a proof of concept that, for virtually all the apparent counterexamples to (8) that we have come across, there are alternative analyses of the stem alternation patterns that do not rely on morphosyntactic conditioning.

The first class of apparent counterexamples to (8) is found in Indo-European athematic neuter nouns such as Latin ‘thigh’ in Table 28. Here a stem ending with -in- shows an elsewhere distribution, while a second stem, ending in -ur, appears exclusively in nominative singular and accusative singular contexts (Calabrese 2020; McFadden 2018; Moskal 2015). Importantly, this paradigm seems to counterexemplify (8), with an AAB pattern in the singular and a BBB pattern in the plural. However, the distribution of the two stems in this paradigm can be accounted for in terms of a phonological trigger: since only nominative singular and accusative singular are suffixless, one may simply assume that the allomorph femur can only show up at the right edge of the phonological word, and that femin- shows up elsewhere. The same problem arises and the same solution is available for Classical Greek neuter nouns such as ásty in Table 29. The morphophonological solution may initially seem to not be viable for paradigms such as the one in Table 30 from Slovene, where both the nominative/accusative singular and the genitive dual/plural seem to lack an overt affix and yet they receive different stems. A morphophonological account of the stem alternations is possible even here, however, if the genitive plural/dual suffix underlyingly involves a vowel (yer), as has been claimed for other Slavic languages such as Russian (Halle 1994; Bailyn and Nevins 2008).Footnote 29

Table 28 Latin ‘thigh’ (Weiss 2009: 240)
Table 29 Classical Greek ‘town’ (Kühner and Blass 1890: 440)
Table 30 Slovene ‘tribe’ (Herrity 2015: 86)

The second class of apparent counterexamples to (8) comes specifically from Classical Greek, where the AAB pattern may also be found outside neuters, as illustrated in Table 31. Here it is a stem ending in a high vowel (poli-, pēxy-) that only shows up in nominative singular and accusative singular, while a stem ending in -e (pole-, pēxe-) appears elsewhere. What makes these cases different from the ones discussed above is that the A∼B alternation cannot be blamed on the presence vs the absence of a suffix, since the nominative and accusative singular forms are marked with -s and -n, respectively. However, while these latter cells are not suffixless, they still have unique morphophonological properties within the paradigm: they are the only cells whose suffix amounts to just a coda (-s, -n), while all other suffixes are ostensibly syllabic, at least underlyingly.Footnote 30 One could thus assume that the high-vowel stem (poli-, pēxy-) only appears if it contains the last syllable of the (underlying) phonological word.Footnote 31

Table 31 Classical Greek 3rd declension nouns (Kühner and Blass 1890: 440)

Finally, the set of (cognate) paradigms in our sample that proved the most resistant to a morphophonological reanalysis was the one of 3rd-person pronouns in High Germanic. Consider the paradigm of the Old Saxon masculine 3rd-person pronoun in Table 32, where the i-stem appears only in the dative in plural contexts (AAB) but in both dative and accusative in singular contexts (CBB). Thus, by looking at the masculine paradigm, one would conclude that i-stem is the elsewhere stem, which we (just like SCC) predict to be impossible.

Table 32 Old Saxon 3rd person masculine pronoun (Fulk 2018: 190)

Crucially, however, the problem with these Germanic paradigms is more general. The stem alternations in paradigms like the one in Table 32 could not be characterized in morphosyntactic terms even if we adopted NCC. Consider Table 33, where we include both the masculine and feminine forms of the Saxon 3rd person pronoun. What is important to observe is that neither the i-stem nor the s-stem have morphosyntactically coherent distributions. Since i- and s- cannot both be distinct elsewhere stems, the s-∼i- alternation appears to not be amenable to any morphosyntactic analysis, irrespective of whether we adopt WCC, NCC or SCC. Given these facts, we take Old Saxon and other paradigms like it to be open problems, not only for the theory we are proposing, but for any theory that rejects “morphomes.”Footnote 32,Footnote 33

Table 33 Old Saxon 3rd person pronoun (Fulk 2018: 190)

5.2 ABCs

The possible derivations of ABC patterns under WCC are repeated below in Table 34. For any ABC, rule C is predicted to only ever apply in the presence of the feature \(k_{2}\). Given that under WCC this feature is only present in datives, we make the prediction in (9). The prediction is shared by SCC, though the latter additionally predicts that rule B will be more specific than rule A (see Table 11). Neither of these predictions is shared by NCC, which admits derivations of ABC from inventories where any of the three rules may be more specific than any other (see Table 11).

  1. (9)

    For a given context \(C_{1}\) where we find the pattern ABC, there should be no context \(C_{2}\) where rule C applies in the nominative or the accusative.

Table 34 Derivations of ABC, assuming the Subset Principle and WCC (from Table 25)

Unfortunately, there are only a very small number of ABC patterns available for testing the prediction in (9). The only case of ABC in stems that we have encountered in the Indo-European languages we have looked at (and which is also one of the two ABC patterns identified in Smith et al. 2019) is found in the Albanian demonstrative pronoun, in feminine singular contexts across Deixis properties, as shown in Table 35. As can be verified by looking at the rest of the paradigm, it is the t-stem (i.e. rule B) that appears to be the elsewhere rule. Crucially, the sa-stem (i.e. rule C) does not spread into accusatives or nominatives outside the feminine singular context, in compliance with (9).Footnote 34

Table 35 Albanian demonstrative pronoun (Newmark et al. 1982: 122)

6 Final remarks

6.1 Main conclusions about featural containment in the domain of Case

Different theories of featural decomposition for a particular domain make different predictions not only about the attested and unattested patterns of exponence, but also about the ways in which the attested patterns can and cannot be derived. The present paper specifically discussed this issue as it applies to the domain of Case. We showed that a tension arises between two competing theoretical approaches to the featural representation of the nominative, the accusative and the dative cases, which we referred to as SCC and NCC: while SCC fares better than NCC in capturing the fact that ABA patterns are unattested (in contrast to AAA, ABB, AAB and ABC, which are all attested), SCC proves to be too strict when it comes to its predictions about the possible and impossible derivations of attested patterns of exponence (in particular ABB). We argued that our proposed third theory, which we referred to as WCC, achieves the best of both worlds: it captures *ABA with the same ease that SCC does and the (im)possible derivations of (un)attested patterns of exponence as easily as NCC does.

We conclude the paper by entertaining some commonly held assumptions about the relation between featural containment and surface containment, and argue that, to the extent that these assumptions are correct, WCC appears to make more accurate predictions than both SCC and NCC in that domain as well.

6.2 From featural containment to surface containment

Featural containment has sometimes been observed to correlate with universal asymmetries in the attested surface-containment patterns. In this last section, we briefly consider what the consequences of adopting WCC are in terms of the attested patterns of surface containment. We conclude that, assuming a particular theory of how featural containment and surface containment relate to each other, WCC fares better than both SCC and NCC.

Bobaljik (2012) argues, based on a *ABA generalization in adjectival suppletion, that the positive degree of an adjective is featurally properly contained in its comparative, and that the comparative is in turn featurally properly contained in its superlative. Concomitantly, Bobaljik also shows that there are several languages where the positive form is regularly a substring of the corresponding comparative form (e.g. Persian or English in Table 36), as well as several languages where the comparative form is a substring of the corresponding superlative form (e.g. Persian or Czech in Table 36). There are also languages where no surface containment is observed between positives and comparatives (Czech in Table 36), or between comparatives and superlatives (English in Table 36). Importantly, there seem to be no languages (in Bobaljik’s sample) where the superlative form is a substring of the corresponding comparative, or where the comparative is a substring of the corresponding positive. Assuming Bobaljik’s analysis of *ABA in adjectival suppletion to be correct, we might entertain the general hypothesis in (10), which essentially states that featural containment and surface containment may not contradict each other.Footnote 35

Table 36 Surface containment in comparative morphology (Bobaljik 2012: 31–32)
  1. (10)

    Given two feature sets F and \(F'\) such that \(F\subset F'\), the string of exponents realizing F may form a substring of the string of exponents realizing \(F'\), but the string of exponents realizing \(F'\) may not be a substring of the string of exponents realizing F.

Coupled with the hypothesis in (10), the three case decompositions we have introduced so far make different predictions about the presence or absence of universal surface-containment asymmetries between nominative, accusative and dative. In the rest of the present section, we discuss the predictions in Table 37 against the empirical landscape. We argue that, once again, adopting SCC leads to predictions that are too strong, adopting NCC leads to predictions that are too weak, and adopting WCC leads to predictions that are just right.

Table 37 Predictions about attested and unattested surface-containment patterns once SCC, WCC and NCC are combined with (10)

Let us start with the relation between accusative and dative forms. Recall that both SCC and WCC, unlike NCC, hold that the accusative is featurally contained in the dative; so, assuming (10), they should both predict that there might be languages where accusative forms are regularly substrings of the corresponding dative forms, but not languages where dative forms are substrings of the corresponding accusatives. As far as we know, this appears to be true: while the literature reports the existence of several languages of the first type (Table 38), we do not know of any languages that clearly showcase dative forms that are substrings of the corresponding accusatives.

Table 38 Surface containment and Case (Gippert 1987 [cf. Caha 2010: 42]; Smith et al. 2019: 1037)

Since SCC posits featural containment between nominative and accusative, given (10), SCC predicts that there should not be any instances of accusative forms surface-contained in the corresponding nominatives. In other words, SCC would lead us to expect the attested surface-containment patterns between nominative, accusative and dative forms to be no different from the attested surface-containment patterns between positive, comparative and superlative as described by Bobaljik (2012). By contrast, WCC and NCC do not posit any featural containment between nominative and accusative, and, thus, do not necessarily predict a universal surface-containment asymmetry between nominative and accusative forms. What are the relevant facts? While advocates of SCC have occasionally adduced in their favor the existence of languages where nominatives are substrings of the corresponding accusatives (e.g. Kalderaš Romani and Tocharian B in Table 38, again), the fact of the matter is that the reverse surface-containment pattern is also attested, at least within Indo-European (Table 39; cf. Zompì 2017: Sect. 3.3 on Gothic, and McFadden 2018: 10 on Icelandic). While this is unexpected under SCC, it is not under WCC or NCC.

Table 39 Accusative forms surface-contained in nominative forms (Einarsson 1949: 33; Praulin̹š 2012: 28)

Finally, under SCC, nominatives may be substrings of datives but not vice versa; by contrast, under either WCC or NCC, no prediction is made about the attested patterns of surface containment between nominatives and datives, since the two cases are not in a containment relation to each other. Here, too, SCC’s predictions turn out to be too strong. Cases of nominatives being substrings of datives can be found in Table 38, but cases of datives being substrings of the corresponding nominatives may crucially also be found, as seen in Table 40.

Table 40 Dative forms surface-contained in nominative forms (Einarsson 1949: 35; Braune and Heidermanns 2004: 109)

Thus, to the extent that something like the hypothesis in (10) can be maintained, the facts appear to provide support for featural containment of the accusative within the dative, but not for featural containment of the nominative within the accusative—or, at least, no more support for nomacc or nomdat than they provide for accnom and datnom. This is precisely in line with what WCC would lead us to expect. The difference between the attested surface-containment patterns in adjectival morphology observed by Bobaljik (2012) and Case morphology can simply be seen as the result of two different ways of deriving *ABA: strict featural containment (as in SCC) for positive, comparative and superlative, but weak featural containment (i.e. WCC) for nominative, accusative and dative.