Polymorphic distributivity

This article describes a novel pattern of interpretations associated with universal determiners like ‘each’ and ‘every’. It is demonstrated that these canonically distributive quantifiers can give rise to surprising collective readings when they quantify into sub-clausal constituents, especially other Determiner Phrases. For instance, ‘two cards from each player’ can be understood to pick out a single assorted deck of cards, one whose contents co-vary with the players. Yet this deck as a whole may be said to participate in a range of collective activities (being shuffled together, being traded en masse, not fitting into a standard pack, etc.). Such examples are shown to differ from more familiar cumulative readings of the same quantifiers. A compositional analysis is offered that generalizes Krifka’s (2001) method of quantification into speech acts in order to accommodate quantification into a larger class of non-truth-denoting semantic objects, including in these cases, entities.


Introduction
The 17th Amendment to the US Constitution, ratified in 1913, begins with the sentence in (1). The register is appropriately formal, but the line is still perfectly interpretable in contemporary standard English.
(1) The Senate of the United States shall be composed of two Senators from each State, elected by the people thereof, for six years; and each Senator shall have one vote.
The sentence establishes several rules at once. In particular, it guarantees that at any given time: (i) each state will have two elected senators D. Bumford dbumford@ucla.edu 1 UCLA, Los Angeles, USA (ii) each of these senators will serve a six-year term (iii) the collection of these 100 elected officials will make up the governing body called the Senate These truth conditions would suggest a kind of inverse-linking semantics. Since senators vary with states, the universal quantifier 'each state' should scope over the object DP that contains it, 'two senators from . . . '. But the truth conditions also suggest a kind of collective predication. No pair of senators can be said to compose the Senate, so certainly the amendment does not require that for each state, its two senators compose the Senate. It is easy enough to characterize informally how the intended meaning might arise. The direct object asks us to find, for each state, two individuals who have been elected by its constituents to the appropriate office. Once we have accomplished this, we set the states aside. The rest of the sentence is concerned only with the officials we have found. If those individuals collectively perform the actions of the Senate, then the law is satisfied.
But what theory of universal quantification predicts this? Whence the "finding" and "collecting"? Canonical analyses of distributive DPs like 'each state' assume that such a DP quantifies over a property, or a set, or something that one way or another would distinguish abiding from non-abiding states. There are only two candidate properties in (1): the property of being a place that a senator is from, and the property of being a place that the Senate is composed of two senators from. Neither of these is a property that (1) requires each state to have.
More sophisticated compositional theories like those of Moltmann and Szabolcsi (1994), Heim and Kratzer (1998) (Ch. 8.6), or Büring (2004) would allow the universal to quantify directly into its host DP, creating a complex quantifier λQ. each state two senators from x Q . But this merely kicks the can down the distributive road. The composite DP still demands a property Q so as to assert what it is that each state state's senators must do. In this case, Q can only be the property of composing the Senate, but this again is not the right reading of the law; no particular state is responsible for fielding the entire Senate, much less each of them.
The same is true of yet more systematic mechanisms for recursive scopetaking, like that proposed by Barker and Shan (2014). The higher-order function λk. each state λx. k λQ. two senators from x Q leaves two continuations open for further computation, k and Q, one of which will eventually soak up the denotation of 'compose the Senate'. But both occur in the distributive scope of the universal, and so both will again erroneously require at least one Senate per state.
Indeed, the challenge that (1) poses is not one of scope, per se, but of the lexical semantics of words like 'each'. To arrive at the intended meaning, 'each state' should distribute over its host DP (so that the senators vary with states) without thereby distributing over the entire rest of the sentence (so that the composition of the Senate varies with states). Its argument therefore should be a function from entities x to whatever sort of thing 'two senators from x' denotes, and its result should be whatever sort of thing '100 senators' denotes.
There is no difficulty in concocting such a function. Ideally though, it should in some way follow from the canonical denotation for 'each', the one that contributes to the truth conditions of the final provision 'each Senator shall have one vote'. In other words, (1) does not equivocate between its first and second uses of the word 'each'.
This manuscript describes one way in which a universal denotation might be constructed to derive both clauses of (1), while hewing as close as possible to standard theories of quantification, scope, and distributivity. The strategy will be to generalize Krifka's (2001) polymorphic theory of quantification-into-speech-acts, largely following the compositional lead of Bumford 2015. Consider, for example, a pair-list question like (2).
(2) Which book did each student read? a. John read Structures; Mary read Aspects; Fred read SPE.
Like (1), this sentence seems to require its 'each'-DP to scope over something that is not a property, and to return something that is not a truth value. In this case it turns a simple question-one whose answers specify for some student which book the student has read-into a compound question-one whose answers specify for each student which book the student has read. Krifka (2001) argues that this is as expected if the semantics of 'each' simply "conjoins" the various meanings generated by its restrictor, where the relevant notion of conjunction is determined by the conjuncts. In (2), for instance, the conjuncts are questions (say, sets of propositions), and conjunction is refinement (say, pointwise set intersection).
Krifka formulates his analysis specifically as a theory of conjoint speech acts, dealing with pair-list interpretations of questions, commands, blessings, and the like. And thus the flexibility of conjunction is closely connected to the notion of sequential performance. To conjoin two acts is to perform one and then the other. To distribute a restrictor over an act-valued function is to perform actions for all of the elements of the restrictor. Bumford (2015) detaches this idea from speech act theory, adapting the pair-list vocabulary to other side-effects of "iterated conjunctions", including in ordinary declarative clauses, even when embedded in conditionals.
Here the idea is relaxed further to include quantification into entities: pair-list individuals, so to speak. The thesis is the same though: whatever can be conjoined can be distributed over. Take the schematic examples in (3) and (4).
(3) Who did each state elect? a. each state [λx who did x elect] (4) two senators from each state a. each state [λx two senators from x] As described above, in (3), the argument of 'each state' is a function from entities to sets of propositions. Thus for each state this function generates a question. Conjoining these questions is intersecting their respective possible answers pointwise. The resulting meaning is then a new set of propositions each of which entails for every state one of the answers to the question generated by that state (Alabama elected John, and Alaska elected Mary, and . . . ). In (4), the argument to 'each state' is a function from entities to sets of (plural) entities. Conjoining these indeterminate individuals is joining their respective possible witnesses pointwise. The resulting meaning is then a new set of (plural) entities each of which contains for every state one of the candidate pairs in the description generated by that state (Mary from Alabama + John from Alabama + Sue from Alaska + Bill from Alaska + . . . ).
For further lexical and compositional details, see Sect. 3.2. Before that, Sect. 2 presents a range of data to establish and circumscribe the empirical pattern that (1) exemplifies.

Data
The basic pattern represented in (1) is repeated in the following attested examples. In every case, some kind of existential determiner hosts an inversely linked 'each'-DP. And in every case, the whole inversely linked DP satisfies a collective predicate. For example, the first sentence in (5a) places the attendance of CCA Day, a single event, at somewhere above 60. The second sentence in (5a) clarifies that the facilities were all well represented in the set of attendees. Specifically, each facility was represented by at least 10 employees, and all 60 or so of these employees assembled for the event.
(5) a. More than 60 CCA employees recently had an opportunity to rub elbows with some of the state Capitol's most prestigious officials. [. . . ] At least 10 employees from each facility assembled for CCA Day, including all six wardens. b. The jobs in each set are now combined into a mega-job. c. If you don't know what an acrostic is, it's a type of poem where the first letter of each line spells out a word. Mine spells "SUPERHERO". d. But the words of each inquiry rattled and collided in my brain and spun into an unintelligible tangle. e. School performance estimates combine [three years of data from each school] to provide an estimate of the expected percent of students proficient or advanced [. . . ] f. In this analysis, feature measurements of at most 100 cells from each slide were pooled to form two groups. g. Tell you what. I'll trade you two of each kind for the King's Rock.
The other examples are similar, and present a variety of hosting determiners. In (5g), as in (1), the host is a bare numeral, while in (5a), the hosting numeral is complex. The hosts of (5b) and (5d) are definite plurals; that of (5c) a singular definite ordinal. Examples (5e) and (5f) are more complicated. (5e) is plausibly syntactically ambiguous between an outer pseudo-partitive parse ('[[three years] of [data [from each school]]]') and an inner pseudo-partitive parse ('[[[three years] of data] from each school]'). In the former case, the host determiner is null, since the host NP is the mass noun 'data'; in the latter, the host determiner is itself a pseudo-partitive. There are probably other options as well. In any case, although the sentence could in principle describe a cumulative situation in which each school provides at least part of the data, and the total time period over which data is collected is three years, my own judgment is that the distributive DP must outscope the numeral 'three' (so each school provided three years of data, and all of the data was subsequently aggregated). (5f) presents even more difficulty. However the attachments are resolved, the sentence should end up with at least one interpretation in which the scopes are totally inverted: from each slide, at most 100 cells were measured, and then all of the measurements from all of the cells from all of the slides were pooled for analysis.
What's crucial here is that universal distributors like 'each' generally resist aggregation. This, after all, is why we call them "distributors". Simplified variants of the sentences above, with vanilla 'each'-DPs, do not make much sense, since they require a bunch of atomic entities-employees, jobs, etc.-to have a bunch of collective properties-assembling, combining, etc.
(6) a. # Each employee assembled for CCA Day, including all six wardens.
b. # Each job is now combined into a mega-job. c. # An acrostic is a type of poem where each letter in the first column spells out a word. d. # Each word rattled and collided in my brain and spun into an unintelligible tangle. e. # Estimates combine each relevant data point to predict student performance. f. # Each measurement was pooled to form two groups. g. # I'll trade you each kind for the King's Rock.
The examples above all involve distributive DPs embedded in complements or adjuncts of non-distributive DPs. Distributors embedded in possessive specifiers exhibit similar interpretive possibilities. The adaptations in (7) have the same "pair-list entity" readings as the examples above.
(7) a. ✓ Each facility's ten employees assembled for CCA Day.
b. ✓ Each set's jobs are now combined into a mega-job. c. ✓ An acrostic is a type of poem where each line's first letter spells out a word. d. ✓ Each inquiry's words rattled and collided in my brain and spun into an unintelligible tangle. e. ✓ Estimates combine each school's (three years of) data to predict performance. f. ✓ Each cell's few feature measurements were pooled to form two groups. Before turning to the analysis of this pattern, let me note several empirical points. First, I have been speaking generically about "distributive universals", but the only universal quantifier to appear in (5)-(8) is 'each'. To be sure, analogous sentences with 'every' or 'all' in place of 'each' are easy to find, as exemplified in (9) and (10). However, as discussed in Sect. 3.1, 'every' and especially 'all' are known to take on definite or cumulative interpretations in certain environments. So while I see no reason to doubt that sentences like those in (9) might be analyzed along exactly the lines laid out in what follows, I concentrate on the examples with 'each' to avoid potential confounds.
(9) a. Our Favorite Wedge Sneakers Combine the Best of Every Trend. b. Our master invoice template will combine the line items of every current invoice for that company and show a combined total. c. During the next meeting you attend, add up the hourly cost of every person in the room. d. Later, in the story of the flood, Genesis says that God told Noah to gather two of every animal and take them on his ark to save them. e. As the governing body of the union, the Board of Player Representatives is comprised of one player representative from every team.
(10) a. Not all characterisation tests provided evidence of "superiority" of one mAb over another, but together the results from all characterisation studies allowed for selection of an antibody pair to be taken forward to assay development. b. Figure 1: Hypothetical trees of a community composed of two species from all monophyletic families from APG III (APG III, 2009) c. SAAC (Student-Athlete Advisory Committee) is comprised of two studentathletes from all our teams.
Second, almost all of the sentences in (5) contain a singular distributive quantifier like 'each set' embedded within a larger plural DP like 'the jobs in . . . '. The entire complex DP denotes a collection of entities, a model-theoretic plurality. In these cases, unsurprisingly, inflection on the collective predicate is plural, matching both the form and denotation of the argument. In (5c) and (7c), however, the head nominal 'first letter' is singular, though the denotation of the entire argument is still multipartite. This creates a clash between syntactic and semantic number. The author of (5c) opts for syntactic rather than semantic agreement on the predicate: 'spells' rather than 'spell'. In my own judgment, this is by far the more natural choice, though the attested (11) may give some doubts about the strength of the morphological force exerted by atomic subjects.
(11) [T]he single electron from each atom's orbital combine to form an electron pair creating a sigma bond.
On other occasions, as in (12), the multiplicity of the emergent pair-list collection seems to pull people toward pluralizing the head noun, even when the referents described by the host NP are clearly atomic: a kind of i-within-i dependent plurality.
(12) a. Just used AI to combine the faces of every single US President. b. This is what you get when you combine the male leads from every Nicholas Sparks movie. Turns out a bunch of hot guys makes one single, also hot guy.
In yet other instances of this configuration, neither morphological choice seems especially acceptable. For example, (13a)-with a plural verb form-is all but ungrammatical, but (13b)-with a singular verb form-all but disambiguates to the nonsensical reading in which there are a number of single-jersey piles. Unfortunately I do not have anything to say about the Catch-22 of inflection and agreement here. 1 (13) a. *A jersey from each team are piled up on the couch. b. ? A jersey from each team is piled up on the couch.
Third, not all collective predicates accept these pair-list sums. Dowty (1987), among many others, distinguishes between 'gather'-type predicates and 'numerous'-type predicates, largely on the basis of their acceptability with 'all'-DPs. For instance, (14b) patterns with (14c) rather than (14a) in asserting that the campers collectively gathered around the fire. But (15b), like (15a) and unlike (15c), incoherently requires that the campers are individually numerous.
(14) a. # Each camper gathered around the fire. b. ✓ All the campers gathered around the fire. c. ✓ The campers gathered around the fire.
(15) a. # Each camper is numerous. b. # All the campers are numerous. c. ✓ The campers are numerous.
At present it seems there is little agreement on what general conceptual or theoretical classes these predicates are representative of, or what diagnostics distinguish them, or even what the empirical facts are regarding those diagnostics (Champollion 2020).
In my own judgment, the collective predicates most willing to take 'all'-DPs (but not 'every' or 'each'-DPs) as arguments are predicates of joint action ('gather', 'elect') or joint use ('shuffle'), including acts of collaboration and cooperation ('work together', 'put on a play'), predicates of similarity or compatibility ('agree', 'make a good team'), predicates of constitution and composition ('comprise', 'form', 'make up'), and of combination ('add up', 'pool', 'pile'). The predicates most resistant to all universally quantified DPs, including those headed by 'all', are predicates of cardinality ('be numerous', 'outnumber'), distribution ('be denser in the middle', 'be homogeneous'), and to a lesser extent arrangement ('be arranged in a triangle').
And also in my judgment, it seems that these latter, 'numerous'-type predicates are as incompatible with the pair-list collections exemplified in (5) as they are with ordinary distributive arguments like (15a). For example, (16) cannot mean that because there are so many bands in the lineup, a collection consisting of two songs from each would be a very large collection.
(16) # Two songs by every band in the lineup are numerous.
Even when the host DP is a simple plural definite description-the sort of phrase that should have no trouble generally satisfying 'numerous'-type predicates, as in (15c)-numerosity is unassertable. That is, (17a) can only mean that each borough is populated by more than a million people, not that the boroughs have a combined population of over a million. Likewise, (17c) cannot mean that the marching bands, taken together, were arranged so that the highest concentration of bands was in the middle of the arrangement; it means that each band was arranged in a middle-heavy formation. This is perhaps surprising, since the agreement facts above suggest that the features of the host nominal and determiner can retain some influence over the predicate. Instead, these pair-list arguments appear to show neither the semantic properties of the embedded distributor nor the properties of the definite host. Rather they show the properties of 'all'-DPs. That is, they readily saturate predicates of joint activity, composition, combination, etc., but stubbornly distribute over predicates of cardinality and arrangement. It would seem that for whatever reason, a sum assembled compositionally via distributive quantification is too motley to be considered a holistic group entity and so cannot have properties like numerosity or uniformity. Such entities, like DPs headed by 'all', apparently must be construed as loose sets of individuals, acting together to some end. Again, though, given the theoretical and empirical murkiness of the 'gather'/'numerous' split, I'm afraid I have nothing more insightful to say about this contrast. The final point I wish to draw attention to is that the readings of interest here depend on a fundamentally ordinary kind of inverse scope: an embedded universal takes scope over a DP that contains it. This means the phenomenon is delimited by normal constraints of scope and binding. Accordingly, if the universal is embedded in an island, then the pair-list reading is unavailable. For instance, the rules of the Constitution would be unsatisfiable if the 17th Amendment were written as in (18), where the universal is in a relative clause, as it would require potential senators to be born in multiple states.
(18) # The Senate shall be composed of two Senators who come from each State.
As far as binding, recall that in a typical case of inverse linking, an embedded quantifier distributes over the rest of its clause in addition to its host. In (19), for instance, the cheese eating events vary with the cities, just as the cheese eaters do. And in virtue of scoping over this clausal continuation, the quantifier may also bind pronouns in it, as in May's (1985) well-rehearsed example (19b).
(19) a. Someone in every city is eating cheese right now.
b. Someone in every city hates it.
But with a collective predicate and a pair-list interpretation of its complex argument, the embedded quantifier by definition does not scope over any more of the clause other than its host DP. As a result, it should not be able to bind anything outside of that host (Barker 2012). Checking this requires consideration of rather careful artificial sentences, but the following perhaps bear out the prediction. The sentence in (20a) is a typical example of the pattern under discussion: the object of the collective predicate 'piled up' denotes a single heap containing one book per author. The minimal variant in (20b) sounds unremarkable at first blush, but doesn't actually make any sense. For instance, it doesn't mean what (20c) means, where the universal takes scope over the entire clause, because that would require accepting that a solitary book can constitute a pile. It also can't mean anything like what (20d) means, in which each author has their own book-per-author pile, which perhaps might be expected if the universal could take scope both inside the object DP 'one book from x' and over the entire clause, so as to bind the pronoun.

Analytical background
The English determiners 'each', 'every', and 'all (the)' are universal. The truth of a sentence containing one of these determiners generally depends on whether the entirety of the set denoted by its restrictor satisfies a certain property. The former two, 'each' and 'every', are also generally distributive. A sentence containing 'each', for instance, requires that the individuals in the extension of its restrictor, one at a time, satisfy the property corresponding to its scope. The latter determiner, 'all (the)', appears in both distributive and collective contexts. In collective contexts, it suffices that the set of individuals described by the restrictor, as a whole, meet the condition described by the scope. A typical sort of contrast is given in (21).
(21) a. ✓ All the students collided in the middle of the field. b. # Each student collided in the middle of the field.
There are a few recognized exceptions to this generalization, at least for 'every'. Synthetic, lexicalized universal DPs like 'everybody' and 'everywhere' can satisfy collective predicates, unlike their analytic counterparts with independent restrictors, as illustrated in (22). There are no analogous words formed from 'each'.
(22) a. ✓ Everybody circled up around the coach. b. # {Every, Each} player circled up around the coach. Beghelli and Stowell (1997) point out the contrast in (23). Descriptively speaking, the DPs in (23a) and (23b) seem to be coerced into an abstract degree-of-toughness determined by calculating the combined strengths of the students. In that sense, there is something collective about the interpretations of the clauses, though not in the usual sense that the DPs successfully saturate the argument of a collective predicate.
Notably, this total-strength interpretation is unavailable with 'each', though as far as I know, the semantics of the 'it took' construction is not well studied, and there is no theory of the difference between universals here.
(23) a. ✓ It took all the students to lift the piano. b. ✓ It took every student to lift the piano. c. # It took each student to lift the piano.
More robustly, 'all'-and 'every'-DPs are known to sometimes stand in what are called cumulative relations to other plural arguments. Kratzer (2000), following Schein (1993), discusses sentences like (24b), on the reading in which all of the mistakes were caught by (at least) one of the three editors, and all of the editors did some catching. This is quite unexpected, given that (24a), for instance, lacks an analogous reading; there is no sense in which (24a) could be true merely because three mistakes were caught in total, with some editors catching only one or two. To make matters worse, Schein observes that 'every' can accumulate with some of its co-arguments, yet distribute over others. For instance, (24c) has a reading on which three very influential video games can be held cumulatively responsible for every quarterback's having learned two (possibly distinct) plays.
(24) a. Every editor caught three mistakes. b. Three copy editors caught every mistake. c. Three video games taught every quarterback two new plays.
Schein and Kratzer both assess (24c) as an argument for the necessity of event variables, but Champollion (2010) and Brasoveanu (2013) provide event-free analyses of these sorts of split-distributivity cases. In keeping with a line of research pursued by, e.g., Szabolcsi (1997) and Matthewson (2001), Brasoveanu (2013) decomposes 'every' into a maximal, existential component and a distributive, quantificational component. The existential component accumulates with "higher" operators, while the quantificational component distributes over "lower" properties. The effect is approximated in English by replacing 'every NP' with 'the NPs each', as in (25).
(25) a. Every girl wore a dress to the party.
b. The girls each wore a dress to the party.
What exactly counts as "higher" in these contexts is somewhat contentious. In the event semantics of Kratzer (2000), (obligatory) distributivity is correlated with agency, so that 'every'-DPs playing the role of agents are necessarily distributive, while 'every'-DPs in non-agentive roles are optionally cumulative. For Brasoveanu, it is simply semantic scope that determines the potential for cumulativity; anything within the nuclear scope of the universal will also fall within its distributive scope. Champollion (2010) criticizes both of these accounts. The former makes predictions that are at odds with the judgments of Bayer (1997), given in (26), and Zweig (2008), given in (27). And the latter cannot explain why (24a) lacks a cumulative reading, given that DPs with bare numeral determiners are generally free to take arbitrarily wide scope.
(26) a. # Every screenwriter in Hollywood wrote Gone with the Wind.
b. ✓ Gone with the Wind was written by every screenwriter in Hollywood.
(27) a. ✓ The Fijians and the Peruvians won every game.
b. # Every game was won by the Fijians and the Peruvians. Zweig (2008, p. 137) conjectures that 'every' is only obligatorily distributive when it is in subject position. In other positions he expects it to exhibit cumulative 'all'-like behavior. A bit less radically, Champollion proposes that an 'every'-DP is doomed to distribute over anything it (syntactically) c-commands. This would of course largely privilege subject positions, but should extend also to lower distinctions within the thematic argument hierarchy (see Flor 2017; Chatain 2020 for some discussion of this prediction). Unfortunately, none of these analyses provide much help with the data in Sect. 2. For one, the sentences in (5) all show surprising collective interpretations, not cumulative ones. Brasoveanu (2013, pp. 40, 48) in fact explicitly denies that 'every'-DPs can support collective interpretations, in the middle of his discussion of their cumulative potential. For another, the scope-splitting mechanisms are still too coarse. In the sentences of interest like those in (5), we do not see a universal associating distributively with one constituent and cumulatively (or collectively) with another. Instead, the universal distributes over some element, and then that element, or rather the sum of elements distributed over, assumes a collective role in the rest of the sentence. We would need to split the distributive scope of the universal in two (and then do some work to pick up the distributed pieces), which would mean actually breaking the universal into three arguments, a plurality formed from the restrictor, a distributive share, and then a collection of distributees. And then most plainly, all of the sentences in (1) and (5) contain 'each', which as far as I know has never been reported to participate in the cumulative readings just discussed.
In a different strain of research concerned with the semantic flexibility of universals, many theorists have argued that interrogatives like (28) exhibit readings best accounted for by treating the universal DP as taking scope over its containing question. On the relevant interpretation, (28) is answered by any proposition that specifies for each student a book they read.
(28) Which book did every student read? a. John read Structures; Mary read Aspects; Fred read SPE.
Indeed, the closest example that I have been able to find in the literature to the Constitutional excerpt in (1) comes from Belnap's (1982) early study of question composition, given in (29).
(29) What the average grade is depends (only) on what each student receives.
In Belnap's words: "Obviously the scope of 'each' cannot be the whole declarative; for it is not true that for each student, the average grade depends (only) on what grade that student receives. Nor can the scope of 'each' be inside the what, for that would presuppose that each student received the same grade. Instead, one has to look at what grade each student received as a closed unit-an indirect question which is a complement of depend-in which 'each' is wide with respect to 'what'." Of course for this to be possible, for 'each' to scope over just the question that embeds it, its denotation would have to be capable of accepting questions as arguments and returning questions as results.
Taking this assumption as a starting point, Krifka (2001) suggests that the sentence in (28) ought to have exactly the same meaning as the sequence of sentences in (30). They are both "conjunctions" of question acts. Since a question act is not a proposition, 'conjunction' here should not be understood in the Boolean, intersective sense, but rather as a sequencing operator in some more dynamic denotational algebra. Krifka sketches what this might look like in a semantics of acts oriented around commitment states (see, e.g., Krifka 2015 for subsequent elaboration of this idea).
(30) Which book did John read? And which book did Mary read? And which book did Fred read?
a. John read Structures; Mary read Aspects; Fred read SPE. Bumford (2015) argues that reconfiguring the denotation of 'every' in terms of generalized dynamic conjunction has benefits below the level of speech acts. He demonstrates that ordinary matters of intra-sentential scope and binding can be explained in terms of "iterated conjunction". For instance, consider the so-called internal reading of (31), on which it asserts the existence of an injection from students to books within the read relation.  Bumford (2015) models this reading by assuming that 'different' means "distinct from the relevant previously mentioned entities", and that (31) has the Krifka-esque structure of (32). Crucially by dynamicizing the conjunction that the universal uses to glue together the results of evaluation, the extension of 'different' evolves as the semantic computation unfolds. Thus students evaluated later will have stricter standards for different-book-reading than the students evaluated earlier.
In the next section, the non-Boolean conjunction at the heart of Krifka's and Bumford's universals is further generalized to include sets of entities in addition to the sets of propositions generated by wh-words and indefinites.

Composition
The following formal fragment caches the previous ideas out. The metalanguage is simply-typed with ground types for entities ( ) and truth values ( ), and constructors for functions ( ) and sets ( ). Semantic combination is function application. Object-language variables and abstractions run through an assignment function, as usual. The notation v :: indicates that the variable v has type .
To fix terms, I will say that in a configuration of the form [ϕ [Λ v ψ ]], the expression ϕ "scopes over" the expression ψ . When ϕ is of type , I will sometimes say that its denotation is "indeterminate". For instance, where a definite description like 'the student', of type , might refer determinately to a particular student-say, the student = john-an indefinite description like 'a student', of type , will be said to refer indeterminately to the set of potential student referents-a student = {d | student d}. But to be clear, I do not mean to attach any technical significance to this way of talking about sets. Nor is this particular set-theoretic analysis of indefinite (35) 2 The notation (·) o is intended to signify that I generally represent the application of o to an argument by attaching o as a superscript to that argument; e.g., the squaring function might be defined by (·) 2 := λn. n × n. 3 For those keeping score at home, (·) η and (·) are the two algebraic components of the (power)set monad (Wadler 1992;Shan 2001).

Conjunction across types
With this grammar established, the only crucial theoretical notion is that of what I'll call a multiplicative type. This determines the range of semantic objects over which distributors like 'every' can scope. As a hypothesis, I assume that any sort of expression that can be conjoined by the word 'and' is also the sort of expression that can act as the nuclear scope of a distributive universal, and any operation that and can compute with two arguments is an operation that every generalizes to a set of such arguments. Since such expressions come in a variety of types, I will say that conjunctions and universals are polymorphic. 4

Let represent the family of type-indexed actions associated with conjunction.
At what types should be defined? For starters, we might include the fundamental, bitwise multiplication of (36a), the classic Boolean generalizations of this that Partee and Rooth (1983) called Generalized Conjunction ((36b) and (36c)), as well as the collectivizing actions associated with plurality symbolized in (36d).

(36)
We might also assume, again following Bumford (2015), that for any type such that is defined, there is a valid instantiation of for indeterminate analogs of , as in (37). This operation takes a set M of -valued sets m 1 through m n , and returns the pointwise product of these sets, where the product operation is appropriate to the type . Just for illustration, if did nothing more than place its arguments in a In other words, the product of a bunch of coerced values is exactly the coercion of those values' product.
If M contains non-trivial indeterminate truth values, then their product is crossed. For example, let M contain for each dog d the indeterminate truth value that d chased a cat, where truth varies with cats: Assuming indeterminate denotations correspond to judgments of truth when they contain T among their alternatives, M will be judged true whenever every dog chased at least one cat.
Most relevant to the topic at hand, consider the case when M consists of a variety of indeterminate entities, type . For example, let M contain for each dog the set of cats it chased:

Then
M is itself an indeterminate (plural) entity; for every way f of choosing a cat per dog, M contains the cat sum corresponding to the range of f : That is, M is a collection of cat clusters, each cluster containing for each dog one of the cats it chased. As a technicality, note that if any dog left the cats alone, then M = ∅; in this case there are no cat pluralities that can bear witness to the respective pursuits of the dogs. As expected from a product, if one of its factors is zero, the entire product is zero as well.

Denotations
With this in hand, (38) clarifies the lexical semantic hypothesis advanced in the introductory sections. Unsurprisingly, every is a binary wrapper for the Π operation, and is therefore a straightforward generalization of the traditional Boolean Generalized Quantifier. 6 The rest of the table spells out standard non-quantificational denotations for indefinites (e.g., Bittner 1994;Krifka 1999;Kratzer and Shimoyama 2002;Landman 2004;Charlow 2019) to complete the fragment.
In garden-variety cases, 'every' will scope over a truth-denoting constituent, and the lexical entry in (38) will reduce to Boolean infinitary conjunction. In the cases of interest, 'every' will scope over an entity-denoting constituent, and the semantics will behave as in the final cat example above. 6 "Generalization" here only in the technical sense that the polymorphic 'every' defined in (38) includes the traditional GQ denotation as a special case (when ). So in any derivation where the traditional 'every' could appear, the polymorphic 'every' may appear as well, and the derivations will have the same denotation. I make no claim as to whether the non-Boolean instances of conjunction should be thought of as generalizations of Boolean conjunction in any natural, conceptual, or psychological sense (see Schmitt 2020 for a recent overview of this widely pondered question).
(39) two representatives from every state The denotation of (39) is a set of sum entities. 7 Each sum contains for every state exactly two representatives. If any state happens to have more than two representatives, then the denotation here is properly indeterminate, containing all the ways of choosing respective pairs out of the states' representative pools. If all the states have exactly two representatives, then the denotation is singleton, containing just the one collection of pairs; in this case we would be justified in talking about the two representatives of every state. If any state has fewer than two representatives, then the denotation above will be empty, as no collection of representatives could meet the constraint that it contain two from each state.
Putting this denotation together with a collective predicate poses no subsequent challenge. The object of (40) denotes an indeterminate diversity of representatives. It takes scope over the sentence just as any ordinary indefinite like 'two cats' would. The whole sentence is predicted to be judged true if any alternative in the resulting indeterminate set of truth values is T. That is, if the Senate really does comprise 100 representatives, two per state. 7 MOD stands for the usual intersective mode of combination used in modification structures: This could certainly be generalized to support any multiplicative return type , replacing ∧ with , but the semantics of adjunction is entirely outside the scope of present concerns.
(40) The Senate is composed of two representatives from every state.
Note that, as promised in Sect. 1, the distributive universal in the example takes scope only over its own host DP. This much is similar to analyses in the tradition of May 1985, like Heim andKratzer's (1998) and Büring's (2004). But unlike those accounts, the universal quantifier here is not type-shifted in any way, the mode of combination that combines the object with its scope is simply Function Application, and the result is simply a kind of entity, with no arguments left to saturate. This is a consequence of taking the host itself to denote a kind of entity, so that distributive conjunction reduces to entity-agglomeration. Similar LFs will deliver the appropriate readings for all of the sentences in (5). In each case, a universal takes a (possibly indeterminate) entity-valued function as argument and returns a (possibly indeterminate) plurality as a result.
Still, because the universal must take scope over its host in order for the nominal witnesses to co-vary with the universal's restrictor, we predict the scope and binding effects introduced at the end of Sect. 2. If the universal is trapped inside an island within the host, a collective pair-list interpretation will be impossible. And whenever a collective pair-list interpretation does arise, the universal will not be able to bind a pronoun outside of the DP it lives in.
Before moving on to discuss possible extensions of the technique, let me try and be clear about the interplay of polymorphism and the universal's host. The denotation of (39) and the resulting truth conditions of (40) depend on both the type of the universal's scope and the instance of conjunction made available by the lexicon for that type. For instance, if the indefinite subject in (39) were analyzed as a Generalized Quantifier, with GQ conjunction defined as intersection, as in (36c), then the derivation would be effectively equivalent to the type-shifting analyses of Heim and Kratzer (1998) and Büring (2004). This is shown in (41). As expected, the resulting truth conditions are fully distributive; presumably not what the framers intended, but certainly a possible reading of the sentence.
(41) two representatives from every state But collective notions of Generalized Quantifier conjunction have also been entertained (see Winter 2001 for a careful study of some options). For instance, Hoeksema (1983Hoeksema ( , 1988 suggests that GQs might be conjoined by "lifting" them over collective arguments, as in (42). Leaving the LF in (41a) exactly as it is, but using the notion of conjunction in (42) instead of the intersective notion from (36c), would again yield the (intended) collective truth conditions of (40).
Yet another route to the collective reading arises in the context of choice-functional indefinites (see again Winter 2001 for an extensive study). For instance, let 'two' be a pronoun over pair-valued choice functions, such that for any property Q, two Q ∈ Q × Q. Then using the ordinary (determinate) conjunction for entities , the LF in (39) would denote an ordinary (determinate) sum of representative pairs, two per state. Which sum it represents would of course depend on which choice function two is resolved to, and whether the entire sentence in (40) is true would presumably depend on existentially closing over this choice.
The point here is that it doesn't much matter what particular analysis of (in)definite semantics one ascribes to the host DPs in (1) and (5), and to some extent it doesn't even matter what types one assigns them. As long as the grammar provides a collective way to combine objects of that type, the polymorphic lexical entry for universals will derive the correct collective truth conditions. That is, as long as there is some notion of conjunction under which (43a) can be made sense of, then a polymorphic 'every' will make sense of (43b) as well.
(43) a. Two representatives from Indiana and two representatives from California comprise the House Subcommittee on Technology Modernization. b. Two representatives from every state comprise the Senate.
In contrast, any grammar in which the denotational options for a universal DP like 'every state' are limited to the Generalized Quantifier λQ. ∀x. state x → Q x and perhaps the maximal collection of states ⊕{d | state d} will continue to undergenerate readings for (43b).

Extensions
A natural question to ask then is what other kinds of constituents such a polymorphic distributor can scope over. That in turn depends entirely on how many types the conjunction operation is defined for. Since the Partee and Rooth (1983) instances in (36c) define conjunction for any "type ending in ", distributors should have no trouble scoping over a predicative or relational constituent, as in (44). Because of the recursive nature of the Partee and Rooth definition, this reduces to familiar recursive generalizations of Boolean GQs as in, e.g., Keenan and Moss (2016).
But just as there are reasonable non-Boolean products of entities, there could in principle be non-Boolean products of relations. Link (1983) and Krifka (1990) offer such operations based on sentences like (45), where the individual predicates are understood to describe different parts of a single subject. Lasersohn (1995) and Winter (2001) find these cases unconvincing, arguing that they are better analyzed as instances of entity conjunction with mass terms, along the lines of (46). I set these sorts of examples aside, and leave it to scholars of mereology to decide whether it is worth hunting for similar instances with universal distributors instead of conjunctions.
(45) a. The flag is green and white. b. This is beer and lemonade.
(46) a. Green and white is my favorite combination of colors. b. Beer and lemonade is my favorite combination of drinks.
As it happens, though, Link's (1983) conception of predicate conjunction offers a useful model of the cumulative application of multiple properties across multiple entities. Recall that the surprising cumulative reading of (47), due to Schein (1993), is one on which the number of games played across quarterbacks is three, but the number of plays learned might be as many as twice the number of players.
(47) Three video games taught every quarterback two new plays.
The Schein sentence seems tantalizingly similar to the Constitution sentence that kicked things off in (1). In both cases, a distributive universal induces co-variation with a nearby indefinite, yet does not distribute over anything else in the sentence. But as discussed in Sect. 3.1, existing analyses of the former do not explain the latter. And in fact nothing said here about the Constitution predicts the relevant reading of the Schein sentence. Just as in (44), with Generalized Conjunction as the multiplicative mode for property conjunction, the two LFs in (48) are denotationally equivalent.

When
simply passes the eventual subject, the games, into each quarterback's nuclear scope, it doesn't matter whether the universal scopes above or below the trace of the subject-that is, over the whole sentence, or over just the VP. 8,9 The resulting truth conditions for both (48a) and (48b) are met whenever some trio of 8 I am assuming predicate denotations are lexically cumulative: R X Y holds iff every atomic y Y stands in the R relation to some atomic x X, and every atomic x X is stood in the R relation to by some atomic y Y . 9 The PFA mode of combination at the root of (48b) stands for Pointwise Function Application. It is an LF abbreviation: games is such that every quarterback played all three games and acquired a couple of (possibly distinctive) plays cumulatively from the three games. But the reading of interest allows quarterbacks to vary in which game(s) they learn their plays from, so long as the total number of games across quarterbacks is three.
Whether or not the relevant reading can be derived from this semantics depends on whether there is another instance of , one based on a cumulative notion of property conjunction rather than the distributive (intersective) notion underlying (36c). In fact, such an operation has recently been proposed by Haslinger and Schmitt (2018). Their analysis is ultimately aimed at just the example at hand, but builds on the interesting and extensive compositional framework for plurality in Schmitt 2017. But for the purposes here, it suffices to say, following Link (1983), that a sum of properties T applies (cumulatively) to a sum of entities x whenever x can be divided into (possibly overlapping) subpluralities each of which satisfies one of the properties in T . This is formalized in (49). The parametric operation in (37) then determines a lifted instance of to sets of such property sums.
Plugging this cumulative instance of into the LF in (48b), the denotation after 'every quarterback' has composed with its scope is given in (50), assuming the relevant set of quarterbacks is q 1 , . . . , q n . The sentence is expected to be true on this cumulative reading iff there's a game triple g such that for each quarterback q, some collection of games g < g is responsible for teaching q a couple of plays, so long as all of the games in g were useful to at least one quarterback.
(50) teach q 1 p 1 teach q 2 p 2 · · · teach q n p n p 1 , . . . , p n are play-pairs So this resolves the undergeneration issue posed by the sentences in (24b) and (24c). That is, the Link-ian definition of property conjunction is all that's needed to derive the appropriate cumulative truth conditions for the Schein sentence. The other half of the empirical puzzle is preventing the overgeneration of such readings for, e.g., distributors in subject position. I will not pursue this challenge in any depth, except to say that it is in fact not obvious to me how the offending reading of (24a) even could be generated, given what has been said so far. Consider the two conceivable LFs for (24a) depicted in (51).
Even in (51b), where the indefinite object takes scope over the distributive subject, there is no cumulativity between the two DPs. The reason is twofold. On the one hand, in both LFs, the nuclear scope of the universal traffics in truth values, be they ordinary (51b) or indeterminate (51a), and the only instance of is Boolean conjunction.
But it also wouldn't change matters here if one were to define a type-algebra to model proposition-collections or sum such. The wide scope of the indefinite guarantees that the editors were uniform in their mistake-catching. That is, for some triple of errors m, Editor 1 caught m, "and" Editor 2 caught m, "and" Editor 3 caught m, etc. Each "conjunct" relates an individual editor to m, a sum of mistakes. It is irrelevant how the truth values (or propositions) are combined, as long as the lexical semantics of 'catch' guarantees that for an individual to catch a plurality of mistakes, that individual has gotta catch 'em all. If so, the sentence will entail that every editor individually caught all three mistakes, not that the three were caught between them. It remains to be seen whether this line of analysis predicts, or is even consistent with, the full range of asymmetries in cumulativity, whatever the facts turn out to be. 10 Finally, I'd like to return to the fundamental asymmetry between distributive and plural universals, repeated in (52). 10 It should be said that Haslinger and Schmitt (2018) also derive the asymmetry in cumulative potential between subject-and object-universals. However, like Brasoveanu (2013), they attribute the asymmetry to the universal's differing semantic scope in the two syntactic configurations, but do not discuss the possibility of scope inversion. It is moreover not immediately clear how to incorporate typical scopeshifting mechanisms, given the novel mode of plural composition that they pursue.
(52) a. ✓ The students collided in the middle of the field. b. # Each student collided in the middle of the field.
The denotation of 'each' proposed here is still distributive, in the sense that it maps a function across the elements of its restrictor, one by one, and then multiplies the results. So it would seem that the basic infelicity in (52b) is still predicted. But technically-now that entities, indeterminate or otherwise, can be multiplied-it is possible for the universal to scope over nothing. In terms of LFs, this would amount to maximally "short" movement of the universal. In semantic terms, this would correspond to a trivial delimitation of the universal's continuation. The result is the total collection of campers, exactly the sort of thing that 'the campers' should denote.
So for the analysis here to have any purchase on the fundamental contrast in (52), constituents or derivations like (53) need to be ruled out. The idea is simple enough: a distributive quantifier should distribute over something (cf. Thomas and Sudo 2016). I leave it to future work to determine the precise nature that such a constraint ought to take, be it formal-perhaps something in the antilocality family-or pragmaticperhaps rooted in competition with 'the'.

Conclusion
Sentences like (1) from the US Constitution provide evidence that distributive universals do not always quantify over truth-denoting constituents. In a range of similar cases, I have argued that such quantifiers are best thought of as using their restrictors to assemble a collection of associated individuals, and that these collections behave for linguistic purposes like ordinary plural expressions. Consequently, the lexical semantics of the distributor should be polymorphic enough to accommodate such uses, but unified enough to explain what the various uses have in common. I have provided one analysis, rooted initially in dynamic semantics and speech act theory, but generalized to include a greater variety of non-Boolean conjunctions, most pressingly those of entities.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/ 4.0/.