1 Introduction: Words, Roots, and the Basis of Polysemy

The phenomenon of polysemy (a linguistic unit having multiple interrelated meanings) is typically taken to be word-based in psychology, the philosophy of language, and (until relatively recently) linguistics. Most substantive open-class words are polysemous, some highly so. For example, the noun ‘face’ has at least the following meanings: body part; facial expression (‘to put on a brave face’); apparent character (‘she has many faces’); front surface (of an object); metaphorical front surface (‘on the face of it’); sense of self/ego (‘he didn’t have the face to ask her out’). The verb ‘run’ has different but interrelated meanings in ‘run a mile’, ‘run a bath’, ‘run a business’, ‘run for president’, ‘run a tight ship’, and more. And new senses related to existing senses are emerging all the time: the noun ‘laser’, coined in 1957 as an acronym of ‘light amplification by stimulated emission of radiation’, now has a range of other uses, which could well become established as members of a polysemy family: ‘to get one’s tattoos lasered’; ‘a laser stare’ (i.e. a persistent and piercing stare); ‘to throw a laser’ (i.e. a straight and strong shot in baseball) (from Panagiotidis 2014).

What the ‘laser’ example indicates is that polysemy is cross-categorial (the meanings of the noun, verb and adjective are all interrelated, and transparently so), and this holds quite generally: ‘stone’ and ‘back’ are categorized as noun, verb, and adjective, each with multiple senses interrelated cross-categorially, as are the noun/verb pairs ‘face’ and ‘run’. Given the pragmatics of meaning extensions/modulations (Carston 2020/21), a sense of a noun in the family of related senses may be more closely related to a sense of an adjective or a verb in the family than it is to another of the noun senses. It is not sufficient, therefore, to list families of senses with specific nouns, verbs, or adjectives; we also need to account for the cross-categoriality of sense interrelatedness. In other words, lexical polysemy is grounded in something more basic than words, namely, the root shared by a family of words: √stone, √run, √laser.

As a pragmaticist (employing the relevance-theoretic framework developed by Sperber and Wilson (1986/95, 2015)), I have focused on the lexical meaning adjustments or modulations in specific communicative contexts that can result in new (non-compositional) meanings/senses and are thus a major source of polysemy. However, more recently, it became evident to me that significant work relevant to polysemy has been going on in generative syntax for quite some time and that there has been little, if any, interaction between the two (the pragmatic and the syntactic), a state of affairs I have since tried to begin to redress. So, in recent work (Carston 2022) I have argued for a view of polysemy that is root-based, drawing on ideas in current generative grammar that take roots, rather than words, as the primitives, the basic atoms, that feed the syntax. On this sort of view, what we think of as words are, in fact, phrases, that is, they are syntactic structures like any other such structure generated by the grammar. This holds for apparently simple words like ‘cat’, ‘run’ and ‘blue’ as much as it does for the more obviously complex cases like ‘nationalize’, ‘demonstration’ and ‘gaslight’ (Marantz 1995, 1997; Arad 2003; Borer 2005; Harley 2005; among many others). Words have no distinctive theoretical status on this view; it is roots that are the basic units and that are listed as syntactic primitives in the grammar.

However, as conceded by many who endorse the root-based approach to syntax, words do seem to have some sort of psychological salience for language users, and the structures that comprise words are manipulated as units of communication, on the basis that they are carriers of atomic meaning. One such theorist is Julien (2007), who denies words any theoretical status but says: “The psychological reality of words is probably a consequence of their distributional properties: since words are the minimal morpheme strings that can be used as utterances and that may be permuted more or less freely, words are the minimal linguistic units that speakers can manipulate consciously.” (ibid: 83). Certainly, it is words (and other phrases), rather than roots, that are the domain of pragmatic processes of ad hoc sense creation, such as narrowing/broadening of existing meanings and metaphorical and metonymic extensions.Footnote 1 I take it that, ultimately, a comprehensive account of language, as both a formal computational system of the mind and a powerful instrument of human communication, must accommodate both of these stances on words; that is, it will have to provide an architecture which includes both a narrowly linguistic (syntactic/computational) component and a broader user-based pragmatic component in which words are a central communicational unit.Footnote 2 The complex phenomenon of polysemy is, I maintain, a result of the interplay of the formal/syntactic and the pragmatic.

The main aim of this paper is to try to reach a reasoned position on what, if any, sort of meaning (categoryless) roots may have, as this is a currently unresolved component of the bigger syntactic-pragmatic picture of word meaning and polysemy. In Section 2, I briefly outline the ‘single engine’ root-based approach to morpho-syntax and its interface, as I see it, with the conceptual-pragmatic systems responsible for conferring non-compositional (atomic) meanings on word structures. This is intended as a backdrop to the central issue of the meaning of roots, which is discussed at length in Section 3, where three different views are presented (almost certainly not exhausting the possibilities) and their merits and shortcomings discussed. I don’t expect, by any means, to arrive at an unassailable conclusion about this issue, but rather to bring several distinct existing positions into confrontation with one another, something which, to my knowledge, has not been done before, and to assess which meshes best with the kind of account given in Section 2 (and the earlier papers cited there). Finally, in Section 4, I return to words, understood as units of communication, and consider the kind of user-based lexicon which they occupy, ending with an unresolved issue concerning compositionality and conventionality.

2 Words: Phrasal Syntax and Pragmatic Meanings

2.1 Overview of the Account

This subsection and the next are largely a summary of ideas more fully developed in Carston (2022, 2023), the aim here being to provide the wider background picture within which the focal issue of the meaning of roots is situated. The general idea is that in order to give a full account of the nature of those entities which we intuitively label “words”, the roles of two importantly distinct but interacting systems have to be delineated: the formal narrow linguistic system (syntax) and the system(s) of the broad faculty of language responsible for the meanings or Contents with which the formal elements are coupled, meanings that are used in and shaped by communication (pragmatics).Footnote 3 The main points of the account are the following:

(a) A “single engine” account of language structure is adopted, i.e. there is no separate component for generating complex words, which are treated as structurally phrasal. The terminal elements (basic units) of the syntactic system are of two types, namely, roots (e.g. √cat, √sing, √blue) and functional/grammatical items (including categorizers like n, v, a, and functors like determiners, tense and number markers). Grammatical items come with a bundle of what are called ‘synsem’ features (such as ‘past’ for tense, ‘plural’ for number, perhaps ‘event/action’ for the categorizer v, ‘entity’ for n, etc.), which are to be sharply distinguished from the conceptual meanings that come to be associated with words/roots (Embick 2015: 31–40). Roots do not carry synsem features; in particular, and crucially, they are categoryless, but whether or not they have meaning/sense of some sort, including possible polysemy, is a highly contentious issue, discussed at length in Section 3.

(b) Whatever the position taken on the meaning properties of roots, it is clear that the set of words built on any given root can have a range of distinct and unpredictable meanings. Consider, for instance, the root √(de)tect and the meanings of the nouns ‘detection’, ‘detector’, ‘detective’, ‘detectorist’, or the root √nat(ure) and the meanings of the various words based on it, including ‘natural’, ‘naturalist’, ‘naturist’, ‘naturalize’, ‘naturalization’. Many of these are non-compositional, that is, they are not a function of whatever meaning is assigned to the first category level in combination with the higher levels of functional items in their structure (e.g. ‘-ive’, ‘-ist’, ‘-ize’, ‘-ation’). If words are just phrasal structures (whose meaning is typically compositional), this fact about their semantics has to be explained somehow.

(c) The proposed explanation is two-fold, requiring an account of those structural domains that can admit unpredictable non-compositional Content, and an account of how the particular atomic meanings arise, e.g. the non-compositional meaning of ‘naturalize’ (= become a citizen) and of ‘naturist’ (= one who espouses nudity). The first part, then, is syntactic, and the second is pragmatic. The pragmatic part of the story is simply the standard relevance-theoretic account of how words acquire new (ad hoc) meanings in on-line communication, via inferential processes of narrowing/broadening, metaphor and metonymy, some of which conventionalize into established senses, i.e. settle into polysemy families, so that they are no longer pragmatically inferred but retrieved from a storage system or lexicon (Carston 2020/21, 2022).

(d) The wider relationship of interest that this approach to words reflects is between (i) language (construed narrowly as syntax and its interfaces), (ii) languages (i.e. specific hook-ups of structure and conceptual content to the articulable/perceptible vehicles for externalizing them), and (iii) the use of languages in communication (where meanings are made, adjusted, established, or lost). The fundamental distinction is between relatively rigid formal structures, on the one hand, and very flexible meanings, on the other. Being the hybrid entities they are, words are both relatively rigid pieces of structure and bearers of meaning/content that is susceptible to language users’ pragmatic adjustments, hence associated with evolving families of senses (Contents or atomic meanings), stored in a ‘communicational lexicon’ (Carston 2019, 2023).

In the next subsection, I give a sketch of how the two systems (syntax and pragmatics) work together in the making of ‘words’, using the syntactic framework of Borer (2013a, b) and my own work on lexical pragmatics. For current purposes (and in fact ultimately), I assume, along with Borer and others, that (categoryless) roots have no meaning of their own and it is only categorized structures (e.g. roots that have been nominalized or verbalized, etc.) that acquire meaning, but other views on this issue are discussed in Section 3.

2.2 Roots, Syntactic Domains of Content, and Non-Compositional Meanings

I will look here at two kinds of words that appear different on the surface but are, in fact, manifestations of the same syntax-pragmatics interaction. The first are what are known as ‘conversions’, i.e. nouns and verbs with identical phonological formFootnote 4 and clearly related (non-compositional) meanings, e.g. ‘sand’, ‘hammer’, ‘starch’, ‘dust’, ‘porch’, ‘treasure’, ‘scalp’. The second are obviously complex, wearing their structural parts on their sleeve as overtly realized categorizers, but bearing non-compositional meanings: e.g. ‘natural-ize’ (become/make someone a citizen of a country); ‘recit-al’ (solo concert); ‘transmiss-ion’ (car’s gearbox); ‘social-ist’ (supporter of Marxist doctrine); ‘flake-y’ (unreliable).

The so-called ‘conversion’ processFootnote 5 is highly productive in English; here are some recently attested cases of innovative verbs, apparently coined on-the-fly in communication and based on their established noun counterparts:

[(1) Attested examples of innovative noun-to-verb conversions, including ‘crater’ as in ‘X has cratered her authority with Y’]

An adequate account of this phenomenon requires two distinct kinds/levels of explanation working in tandem: one concerning structure (so syntactic), the other concerning meaning (so pragmatic). According to the ‘constructionist’ approach to syntax (Borer 2005, 2013a), which is adopted here, syntactic structure determines the categorial status (noun, verb, adjective, …) of a root, which is, as it were, inserted into a structural template (Borer 2005: 69–70) that has a range of properties (e.g. temporal and aspectual, mass/count, singular/plural) and relations (e.g. thematic roles of the participants in the event and sub-events). More specifically: the tense functor ‘T’ defines a complement structure which is V-equivalent, and which accommodates both items that are already verbs (e.g. ‘crystal-ize’) and items which don’t have any categorial properties, i.e. categoryless roots, so that insertion of √stone, √run, or √yellow into that structure makes them V-equivalent. The number functor ‘Num’ (singular/plural) defines a complement structure which is N-equivalent, so it accommodates both items that are already nouns (e.g. ‘govern-ment’) and roots, e.g. √stone, √run, √yellow. For notational simplicity, I will use the following for categorized roots: [N √stone], [V √stone], [Adj √stone]. Setting aside questions about the semantic and phonological properties of roots for the moment, we can assume that, at a minimum, roots consist of an index (or address) tracking occurrences across categories, in effect √127, √753, etc., and thus (potentially) tracking cross-categorial polysemy.
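To make the notational assumption concrete, here is a minimal sketch (in Python, purely for illustration; it is not part of Borer’s or any other cited formalism, and the class names and the index 127 are invented) of roots as bare indices and of categorization as a structure built on such an index. The shared index is what allows the senses of [N √stone], [V √stone] and [Adj √stone] to be tracked as one cross-categorial family.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Root:
    """A root is nothing more than an index (an 'address'); it has no
    syntactic category and, on the view assumed here, no meaning of its own."""
    index: int                      # e.g. 127, standing in for what is written informally as √stone

@dataclass(frozen=True)
class CategorizedRoot:
    """A root merged with a categorizing functor, e.g. [N √stone] or [V √stone]."""
    root: Root
    category: str                   # 'N', 'V', 'Adj', ...

STONE = Root(127)                   # hypothetical index for √stone

stone_noun = CategorizedRoot(STONE, "N")     # [N √stone]
stone_verb = CategorizedRoot(STONE, "V")     # [V √stone]
stone_adj = CategorizedRoot(STONE, "Adj")    # [Adj √stone]

def same_polysemy_family(a: CategorizedRoot, b: CategorizedRoot) -> bool:
    """Cross-categorial relatedness is tracked simply by a shared root index."""
    return a.root.index == b.root.index

assert same_polysemy_family(stone_noun, stone_verb)
```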

Turning now to the pragmatic meaning relations between the conversion pairs, I focus on the noun → verb cases,Footnote 6 which were described in a landmark study by Clark and Clark (1979) as speakers’ lexical innovations whose meanings were readily pragmatically inferred on-the-fly by hearers in appropriate communicational contexts. Bauer (2018) then added to this the idea that N → V (and V → N) conversions are metonymic shifts made by speakers in communication, on the basis (a) that the meaning relations between the members of the pairs are typical of figurative interpretations in being ‘unpredictable and unrestricted’ (ibid: 180), and (b) that the meaning shifts involved are essentially the same as those discussed as regular metonymic transfers within the nominal category, such as process for result (e.g. ‘building’, ‘painting’), location for event (e.g. ‘Waterloo’, ‘Vietnam’), and attribute/property for person (e.g. ‘The ham sandwich is getting impatient’) (ibid: 177). Taking the example of ‘crater’ as in ‘X has cratered her authority with Y’, the new verb (specifically its meaning, as the phonology doesn’t change) is inferred from accessible information in the encyclopedic entry of the concept CRATER encoded by the noun ‘crater’ (including, in particular, the damage and destruction associated with a human-made crater), and from information in the wider discourse context (here including that X has made very bad decisions causing members of her team to lose confidence in her), guided by the standard relevance-theoretic comprehension procedure (of following a path of least effort in accessing cognitive implications until an optimally relevant interpretation is reached). This new (ad hoc) verb with the sense CRATER* (= cause a massive loss) may ultimately become conventionalized and enter the lexicon of established communicational units. Thus, the noun and the verb share the root √crater, which, although meaningless in itself (according to this account), is a marker of the family of senses (the polysemy) in which the noun and the verb co-participate.Footnote 7

Moving now to the overtly complex words: these have categorizing affixes but non-compositional meanings, e.g. ‘naturalize’, ‘reactionary’, ‘recital’, ‘transmission’, ‘socialist’, ‘solicitor’, ‘flakey’. Compositional meanings are, of course, available in all cases, but in most of these cases they are in much less communicative use. On the face of it, these present a challenge to the supporters of the ‘single engine’ syntactic account of word structure, because phrasal structures generally have a compositional semantics (as in, e.g., ‘the girl who loves the brown horse’). The approach taken by some of the syntacticians who have confronted this issue is to delimit specific syntactic domains to which a non-compositional (atomic, holistic, idiosyncratic, unpredictable) meaning (or Content) can be assigned. So ‘recital’, ‘naturalize’, ‘reactionary’ have a specific kind of syntactic structure which, although it has a compositional meaning (like all syntactic structures), allows for (but does not require) assignment of a special (non-compositional) meaning.

On her account of the ‘syntactic domain of content’, Borer (2013a, b, 2014a) distinguishes two kinds of functional items, C-functors and S-functors, and the basic idea is that C-functors (i.e. categorizers) allow non-compositional meaning assignment at multiple levels, while S-functors (a more disparate group including tense, number, determiners) do not, as exemplified by the following:

[(2) A structure built from the root √nat by successive C-functors (categorizers), as in ‘nature’, ‘natural’, ‘naturalize’, ‘naturalization’, with non-compositional Content assignable at each categorization level]

At each categorization point here, a non-compositional meaning can be assigned; such content has, in fact, been assigned at the initial categorization of the root as a noun, ‘nature’, and later at the verbal categorization by ‘-ize’.Footnote 8 In contrast, structures headed by S-functors do not allow this:

[(3) Structures headed by S-functors (e.g. determiners, tense, number), whose meanings must be compositional]

In all of these cases, the meaning has to be compositional.Footnote 9 On Borer’s account, the ‘C-core’ of a syntactic structure indicates those points at which there is a search of what she calls the ‘Encyclopedia’ for a matching content. The Encyclopedia is not a component of the grammar, but belongs in the Chomskyan conceptual-intentional (semantic-pragmatic) systems with which the syntactic engine interfaces, and it is the locus of stored non-compositional meanings (atomic Contents); it seems very much like a kind of lexicon.
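As a way of making this division of labour vivid, the following sketch is my own informal gloss on the idea, with an invented toy ‘Encyclopedia’ and structures simplified to tuples of labels rather than real syntactic objects: each categorization level is treated as a point at which a stored non-compositional Content may be matched, with a compositional meaning built up wherever no match is found.

```python
# A toy 'Encyclopedia': stored non-compositional Contents, keyed by the
# categorized structure they are assigned to (entries and glosses are invented).
ENCYCLOPEDIA = {
    ("√nat", "n"): "NATURE",
    ("√nat", "n", "a[-al]", "v[-ize]"): "NATURALIZE (become a citizen)",
}

def interpret(structure):
    """Walk up the structure; at each level, search the Encyclopedia for a
    matching stored Content; otherwise compose with whatever meaning has
    already been built up. A bare root (level 1) has no Content of its own."""
    meaning = None
    for i in range(1, len(structure) + 1):
        level = structure[:i]
        if level in ENCYCLOPEDIA:
            meaning = ENCYCLOPEDIA[level]                    # non-compositional assignment
        elif meaning is not None:
            meaning = f"[{meaning} + {structure[i - 1]}]"    # compositional step
    return meaning

print(interpret(("√nat", "n", "a[-al]", "v[-ize]")))
# stored Content: 'NATURALIZE (become a citizen)'
print(interpret(("√nat", "n", "a[-al]", "v[-ize]", "n[-ation]")))
# compositional: '[NATURALIZE (become a citizen) + n[-ation]]'
```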

The task of the pragmatic part of the story is to provide an account of the non-compositional meanings of such structures (words) as ‘reactionary’, ‘transmission’ and ‘naturalize’. Their atomic meanings are semantically related (sometimes somewhat tenuously) to their compositional meaning, in much the same way as the different senses of much simpler structures, such as ‘run’, ‘face’ and ‘crater’, are related; that is, they are narrower or broader, metaphorical or metonymic. They too can be accounted for by the standard lexical pragmatic story in terms of online processes of meaning modulation (Carston 2022). For instance, ‘naturalize’ with its non-compositional meaning NATURALIZE (= make a foreigner into a citizen of a country) is a (considerable) pragmatic narrowing of the more general compositional meaning [natural + ize], roughly paraphrasable as ‘to make natural’. Many jargon terms, e.g. ‘transformation’ in linguistics and ‘transference’ in psychoanalysis, are narrowings of the general compositional meaning of the structures, [transform + ation] and [transfer + ence]. A case like ‘revolution’ with the non-compositional meaning REVOLUTION (= overturning of a social order or government) seems to involve a metaphorical extension of the literal compositional meaning [revolve + tion] to mean any major change (turning around) of an established social/psychological/behavioral configuration, with then a narrowing to the specifically social/governmental change meaning. A case that seems to involve metonymy is ‘transmission’ with the non-compositional meaning TRANSMISSION (= car’s gearbox): there was probably first a narrowing of the denotation of the compositional meaning [transmit + tion] to the specific kind of transmission process that takes place in the engine of a car, and then a metonymical shift to the object responsible for this process (the gearbox). As with the conversion cases above, the syntax module is responsible for the formal structure of the word (root plus categorizers), while language users, that is, linguistic communicators employing their general pragmatic abilities, are responsible for the particular non-compositional meanings assigned to these structures.

What this discussion shows is that the root-based syntactic approach to word structure, coupled with a pragmatic explanation of ad hoc meaning creation, can provide an account of how a particular kind of polysemy arises, that is, where a word has both a compositional meaning (as do all phrasal structures) and a non-compositional meaning. Specific delimited syntactic domains allow assignment of (or mapping to) a non-compositional meaning, and the particular sense/content is (or originally was) derived pragmatically, based on the compositional meaning and the specifics of the context of communication.Footnote 10 The compositional meanings of some complex words are not in much current use (e.g. ‘recital’, ‘reactionary’, ‘naturalize’) but are, nevertheless, always available via the first category meaning (recite, react, nature) and the syntax, while others are cases of established polysemy in live use, e.g. ‘revolution’, ‘demonstration’, ‘flakey’, their polysemy distributed across the syntax (which provides the compositional meaning) and a pragmatic (or communicational) lexicon which lists all conventionalized non-compositional meanings (but see Section 4 on the opposing pulls of compositionality and conventionality). Here, as with the conversion cases, the root shared across families of derived words (e.g. ‘nature’, ‘naturism’, ‘natural’, ‘naturalism’, ‘naturalize’, etc.) provides a formal means of tracking connections and relatedness of content across words.Footnote 11

In this section, I have followed the line taken by Borer (2013a, 2014a, 2017) and others (Acquaviva 2009; Acquaviva and Panagiotidis 2012; Harley 2014a; Panagiotidis 2014, 2020) that roots have no meaning (semantics/content) in and of themselves, and it is only categorized roots, i.e. nouns, verbs, adjectives, that are capable of bearing meaning/content. However, this is by no means a consensus position, even among those linguists who subscribe to the ‘single engine’ view that word structure just is syntactic structure. In the next section, I consider two other views about root meaning that are fairly widespread in the relevant literature, presenting arguments against both of them, ending up, therefore, by supporting the assumption made in this section that roots are indeed meaningless indices.

3 The Meanings/Semantics of Roots

Here I am undertaking the limited exercise of looking at three possible general views of root meaning: (1) roots have an inherent or basic meaning; (2) roots have grammatically-conditioned contextual meanings; (3) roots themselves have no meanings (only categorized roots can bear content). Clearly, there are various possible hybrid accounts, which would allow for some roots to have inherent meanings, others to have meanings only in a context, etc. (Thanks to an anonymous reviewer for making this point.)

3.1 Inherent/Basic/Core Meanings for Roots?

‘Roots possess an inherent meaning (at least in the typical case)’, according to Embick (2015: 47), and ‘At a minimum, there must be a theory of Root meanings, …, to account for the simple fact that Roots possess inherent meanings.’ (ibid: 50; emphasis added). This clearly stated position appears to be the exact opposite of the position I assumed in Section 2 (that roots are meaningless in and of themselves, with the minimal bearers of meaning being categorized roots). However, it can be taken in at least two different ways: (a) roots have a semantic content or sense (they encode a concept); (b) roots have a semantically underspecified meaning that underlies the (various) meanings/contents they have in different contexts. As one reads on in the text from which the quotes above are taken, it is not clear exactly which of these two positions Embick is supporting, as he also uses the terms ‘basic meaning’ and ‘core meaning’ of a root, each of which is open to several interpretations. What is made clear is that the meaning of a root can vary depending on the immediate grammatical context in which it appears: ‘categorizers (and perhaps other morphemes local to a Root) play a role in determining which meaning(s) of a Root are active in a given grammatical context.’ (ibid: 47).

Embick gives the following example: ‘In … different grammatical environments there is a shared component of interpretation that is centred on the Root’s “basic” meaning, but perhaps some context-specific differences as well. For example, when the Root √feather in English is employed as the noun ‘feather’, it refers to an object that has a number of properties. Part of what English speakers know about this Root, though, is that in another context, as the adjective ‘feather-y’, it can also have a more restricted meaning, where the weight aspect of √feather is prominent: ‘feather-y’ in this sense means “light or airy”, with no actual objects (feathers) involved … Collectively, the different facets of lexical semantic meaning that are found with Roots in different contexts constitute the phenomenon of Root polysemy …’ (ibid: 48–49; emphasis added). Note first that the conception of ‘context’ here is highly restricted, referring just to grammatical/syntactic context (and with certain further locality restrictions on how far that contextual domain can extend).Footnote 13 And, second, the idea seems to be that all the different polysemes of a root share some component of meaning that comes from the inherent/basic meaning of the root. As regards the particular example, much more could be said: rather than being a more restricted meaning, the ‘feather’ in ‘feather-y’, at least as described above, is broader in denotation, applicable both to actual feathers and to other light airy entities (a classic case of pragmatic broadening, as discussed in Section 2). Also, this is just one meaning of ‘feathery’; consider a sentence like ‘I love that feathery collage’ referring to an art-work made from feathers, whether actual feathers or pieces of wire and fluff artistically modelled into things that look like feathers. There are, doubtless, other possibilities contingent on broader discourse or communication contexts.

The idea that the basic units (the primitives) of a language have (must have) a semantically underspecified meaning has a long history, having been applied to words long before the root-based approach took hold. It is highly intuitive and, if it worked, would make for a neat and unified account of word polysemy, one in which all the polysemes of a word shared a common core of meaning, while differing in whatever other components of meaning make each of them into a fully semantic entity capable of contributing to the truth-conditional content of a sentence/utterance. Furthermore, that common core would function as a constraint on possible senses of a word; any new contextually prompted sense would have to incorporate that core meaning in order to qualify as a possible new sense of the word. On such an account, a word would have an invariant but underspecified (abstract) meaning and any fully semantically specified senses it is used to express would be a result of (broad) context and pragmatic inference fleshing out that core (skeletal) meaning. One of the strongest advocates of this view is Ruhl (1989), who maintains that what people talk of as polysemy is a purely pragmatic phenomenon, entirely context-dependent, and to be explained in those terms, while most words (where words are taken here to be the basic linguistic units) are ‘monosemic’. Ruhl carried out an extensive and thorough examination of specific cases, such as the verbs ‘bear’ and ‘run’, often cited as highly polysemous verbs, examining their different meanings in different contexts and describing their specific senses as cases of pragmatic specialization and generalization, very much at one with (and preceding) the relevance-theoretic account of lexical pragmatics (e.g. Carston 2002, Wilson & Carston 2007). However, the key difference is his commitment to the ‘monosemic principle’ that ‘a word’s semantics should concern what it contributes in all contexts’ (ibid: 87). He says that this single linguistic meaning is impossible to put into words, as it is highly abstract and formal, ‘… highly remote from all ambient contingencies’, but insists that it must exist.

The psychologists Frisson and Pickering (2001) also favour a ‘radical monosemy’ model, according to which the only element of word meaning/semantics stored in the mental lexicon is an underspecified schematic meaning, which is said to ‘encompass all related senses of a word’ (ibid: 149). This is what a hearer retrieves initially and then at a subsequent stage of processing, ‘context is actively involved in refining the interpretation of a word by changing the underspecified meaning into a specific interpretation’ (ibid: 164), that is, a specific sense is recovered through interaction of the schematic linguistic meaning with contextual information. However, they openly acknowledge that this thin core meaning does not constrain the creation of new senses in context: ‘Because the underspecified meaning is an abstraction over the features of specific senses, a novel interpretation of a word cannot be captured by the underspecified meaning.’ (ibid: 159, their emphasis). Thus, it seems that as new senses arise and become established/conventionalized, a revision, a further attenuating, of the alleged underspecified meaning would be necessary. It is extremely hard, then, to see what purpose this schematic meaning would serve or why the child learning word meanings (via communication, so involving fully semantic senses) would induce such a non-semantic core meaning for a word. In short, the radical monosemy position seems to express a strong intuition but one for which evidence is entirely lacking. As the pragmatic account of the origins of much word polysemy (which includes metaphorically and metonymically related senses) indicates, the inferences may lead to senses whose relations are those of family resemblances without any single shared core. I have argued in more detail elsewhere against any common core meaning for polysemous words (Carston 2020/21), a meaning that seems to have no role in constraining new (pragmatically-derived) senses and which would have to grow ever more schematic and abstract the more new senses that a word acquires.Footnote 14

Whether Ruhl would have embraced recent root-based approaches to syntax and attempted to find a common core meaning for the much larger families of senses that this entails, e.g. ‘run’ in its myriad noun and verb senses, I don’t know, but obviously any such endeavor would be subject to the same difficulties encountered at the word level but much magnified.

Let us move back now to roots and the various meanings they can have in ‘grammatical contexts’ (contexts considerably more restricted than those that Ruhl considered and than the notion of communicative context employed quite generally in pragmatics). Most expositions of the position that roots have an inherent core meaning use examples from Semitic languages to illustrate their point, because roots and their manifestations in different grammatical contexts are especially clear in these languages. This is due to the wholly consonantal nature of their roots, which cannot occur alone but appear in a range of different ‘templates’ (that is, sequences of consonants and vowels) as words whose meanings are often clearly related, e.g. Arabic √KTB, manifest in the verb ‘kataba’, the noun ‘kitaab’, the adjective ‘kitaabii’, among many others, all of which have meanings related to writing (Embick 2015: 48). An important study in this vein is Arad’s (2003, 2005) account of Hebrew word formation; she takes the meaning of a root to be the common ‘semantic denominator’ or the ‘kernel of meaning’ shared by the words derived from it. Consider, for instance, the following root √XŠB and the words built on it:

[(4) Hebrew verbs and nouns built on the root √XŠB, whose meanings all relate to mental activity (thought, consideration, calculation) (Arad 2003)]

Discussing this and other cases, Arad (2003: 9–10) maintains that ‘although the range of meanings in each group is quite varied, all members share a common semantic core. For example, all the words made from the root √xšb are related to some mental activity – whether thought, consideration or calculation.’ The view may seem fairly plausible in this particular case, but there are also word families, clearly built on the same root, whose semantic relatedness is much less evident, making the case for a shared core or kernel of meaning intrinsic to the root much harder to maintain. Consider the following (from Aronoff 2007):

[(5) Hebrew nouns and verbs built on the root √KBŠ, whose alleged core meaning is ‘press’, including the words for ‘highway’ and ‘pickle’ (from Aronoff 2007)]

There is also a set of adjectives built on this root, but the nouns and verbs are sufficient for the key point here concerning the alleged core meaning of the root, which is given as ‘press’. Aronoff (2007: 822) puts the point well: ‘The reflex response to problems like this is to posit an 'underspecified' core meaning for the root, which is supplemented idiosyncratically in each lexical entry. It is logically impossible to show that underspecification is wrong, but trying to find a common meaning shared by pickles and highways brings one close to empirical emptiness and this methodological danger recurs frequently in any Semitic language. In any case, there is no need to find a common meaning in order to relate the two words morphologically …’, and he goes on to show how such words can be related via ‘root alternation classes’ without any common meaning, an account that goes well beyond the issue here. With regard to the meanings of the many words formed from √KBŠ, he tells a plausible etymological story about the origins of the ‘highway’ and ‘pickle’ meanings based on how paved roads and pickles were originally made in the relevant societies, both involving ‘pressing’ of one sort or another. The connection between the meanings seems to be essentially pragmatic, involving a series of metaphoric (resemblance) and metonymic (contiguity) relations; the essential point is that ‘there is nothing left of pressing in the meanings of these Hebrew words today’ (ibid: 822) and no role for any such core meaning of the root. See also Panagiotidis (2020: 61–63) for a comparable range of cases in Greek, all based on the root √esth but with no discernible common semantic denominator, and the discussion below of English words based on bound (Latin) roots, e.g. √-ceive, √-fer, √-mit, √-flect; whatever commonalities of meaning they may once have had are long gone, as evident from consideration of, for instance, ‘refer’, ‘infer’, ‘confer’, ‘defer’.

The problem here, as with polysemous words, is that there seems to be no way to isolate any common core meaning for roots and no role for any such element of meaning; as cultural and communicative contexts evolve, leading to the coining of new senses and new words based on the same root, any alleged common core meaning gets more and more attenuated until it disappears altogether. Furthermore, the pragmatics of meaning construction is not constrained by any such core meaning, but proceeds via chains of inference in multiple directions, via narrowing/broadening, metaphor and metonymy. Recall the case of ‘laser’ mentioned above, whose meaning has rapidly proliferated from a single concrete noun meaning to various others based on (more or less fanciful) resemblances: ‘a laser stare’, ‘throw a laser’, ‘his laser precision’, etc.; recall too the multiple diverse meanings of words based on the roots √face, √run, √line, and of the noun/verb pairs ‘book’, ‘table’, ‘sand’, ‘ship’, … There just is no common core constituting the meaning of the root in each case, and that isn’t a problem, because language users coin new words and new senses (to meet their communicative needs in specific contexts) by adapting and repurposing the senses and words they already have/know, and they can rely on the pragmatic capacities of their interlocutors to understand and grasp these new senses and words.

The idea of a shared semantically-underspecified core meaning hasn’t worked for words and it doesn’t look as if it works for roots either, even when the relevant notion of ‘context’ is radically restricted, as it is here (and in the next position to be considered). Arguably, the identificatory linguistic knowledge that speakers have about the roots in their language is phonological, and any apprehension of a root as having meaning comes from knowledge of the various senses of the words based on that root, whose degree of interrelatedness can vary greatly.

3.2 Allosemy – Grammatically Conditioned Root Polysemy

The third position on root meaning that I’ll look at, in a bit more detail than the previous two (no root meaning, inherent root meaning), is known as ‘allosemy’ (or ‘root polysemy’), viewed by its advocates as a semantic counterpart to ‘allomorphy’, the phenomenon whereby roots (and other morphemes) can take different phonological forms in different grammatical contexts.

3.2.1 Allomorphy

In certain current approaches to the language faculty, in particular the approach known as Distributed Morphology (DM) (Halle and Marantz 1993; Marantz 1997, 2001, 2013; Harley 2012, 2014a; Embick 2015, 2021, i.a.), once syntax has done its computational work (that is, performed various combinatorial operations, all of them instances of the basic operation of ‘merge’), a structure is ‘spelled out’ or ‘interpreted’ phonologically and semantically. That is, on the one hand, it undergoes morpho-phonological operations, resulting in a ‘phonological form’, which interfaces with the articulatory/perceptual systems responsible for its externalization for use in communication, and, on the other hand, it undergoes certain semantic operations that result in a ‘logical form’, which interfaces with conceptual/intentional systems (within the individual’s wider cognitive system), which are responsible for providing the content or meaning of the structures. As well as the central computational (generative) engine of syntax, there are three lists of stored items: the syntactic primitives (roots and grammatical/functional items); vocabulary items (the various phonological forms that syntactic primitives may take); and the encyclopedia (which provides ‘special’, that is, non-compositional, unpredictable semantics). This is roughly summarized in the following diagram:Footnote 15

[(6) Diagram of the Distributed Morphology architecture: the syntactic engine plus the three lists (syntactic primitives, vocabulary items, the Encyclopedia), with spell-out to phonological form and logical form]
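Very roughly, and purely as an illustrative sketch of the architecture just described (not an official formalization of DM; the list contents are invented placeholders), the three lists and the dual spell-out can be pictured as follows:

```python
# The three lists of stored items (contents are invented placeholders):
SYNTACTIC_PRIMITIVES = {"√683", "√127", "n", "v", "a", "T[past]", "Num[pl]"}

VOCABULARY_ITEMS = {          # phonological forms a primitive may take
    "√683": ["/si:v/", "/sep/"],
    "Num[pl]": ["/-en/", "∅", "/-z/"],
}

ENCYCLOPEDIA = {              # stored 'special' (non-compositional) meanings
    ("re-", "√683", "v"): "RECEIVE",
}

def spell_out(structure):
    """After the syntax has merged primitives into a structure, that structure
    is interpreted on two sides at once: morpho-phonology chooses among the
    vocabulary items (the contextual choice is omitted here), and the semantic
    side consults the Encyclopedia for a special meaning, falling back on
    compositional interpretation."""
    phonological_form = [VOCABULARY_ITEMS.get(item, [item])[0] for item in structure]
    logical_form = ENCYCLOPEDIA.get(tuple(structure), "compositional meaning")
    return phonological_form, logical_form

print(spell_out(["re-", "√683", "v"]))   # (['re-', '/si:v/', 'v'], 'RECEIVE')
```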

There is some variation among theorists concerning what the syntactic domains of spell-out are, with some allowing just VPs and clauses, while others, especially within DM, include much smaller domains (essentially, those phrases that comprise words) as the syntactic domain that is spelled out (Marantz 2001). There is an assumption here that the phonological and the semantic sides of spell-out are symmetrical and each has its ‘allo-’ manifestations, so a given single root (nothing more than an index or address, as hypothesized above) may take any of several (related) morpho-phonological forms, depending on its immediate syntactic context (e.g. ‘receive’ and ‘reception’, ‘wise’ and ‘wisdom’, ‘teach’ and ‘taught’, ‘mouse’ and ‘mice’) and may have any of several (related) meanings (semanticses), also depending on its syntactic context; the first of these, known as ‘allomorphy’, is well-established, while the second, labelled ‘allosemy’, is more recent and somewhat more contentious. The further issue of whether or to what extent the syntactic domains within which these two kinds of allo-forms arise are the same is discussed and disputed within the theory (see, for instance, Marantz 2013), but this is a level of detail that I won’t consider here, my aim being to cast doubt on the very notion of allosemy.

Let’s first look at the allomorphy side of the story and start with a clear case, the English plural (taken from Embick 2015: 172); on this sort of account, the grammatical item ‘plural’ has the following vocabulary items (allomorphs):

[(7) Vocabulary items (allomorphs) for the English plural: /-en/ in the context of √ox, √child, …; zero in the context of another listed set of roots; /-z/ elsewhere]

The first of these is to be read as follows: the ‘plural’ functor manifests as /-en/ when it occurs in the context of (is suffixed to) the roots √ox, √child, and perhaps others that would need to be included in the unordered set; similarly, mutatis mutandis, for the second line, where the plural has a zero realisation. The third line, which does not specify any grammatical context, gives the ‘elsewhere’ condition or default value for the plural morpheme, that is, if neither of the other contextually specified vocabulary items applies, then the value is /-z/. (Note that this value may itself surface in several distinct forms, e.g. [-z] for ‘dog’, [-s] for ‘cat’, [əz] for ‘class’, but this is entirely a matter of phonological conditioning by the immediately preceding phoneme, e.g. the devoicing caused by the voiceless obstruent [t] of ‘cat’.)
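Read procedurally, these vocabulary items amount to a contextual lookup with a default: check the grammatically conditioned cases first, and fall back on /-z/ if none of them applies. A minimal sketch follows (root names stand in for root indices, the sets of conditioning roots are illustrative only, and the further, purely phonological choice among [-z], [-s] and [əz] is left out):

```python
# Vocabulary items for the plural morpheme (schematic, after the discussion of
# Embick 2015: 172). Contextually conditioned items are checked before the
# context-free 'elsewhere' item.
PLURAL_ALLOMORPHS = [
    ("/-en/", {"ox", "child"}),     # plural -> /-en/ in the context of √ox, √child, ...
    ("∅", {"sheep"}),               # plural -> zero in the context of a listed set of roots
    ("/-z/", None),                 # 'elsewhere' condition: the default value
]

def realize_plural(root: str) -> str:
    for form, conditioning_roots in PLURAL_ALLOMORPHS:
        if conditioning_roots is None or root in conditioning_roots:
            return form
    raise AssertionError("unreachable: the elsewhere item always applies")

assert realize_plural("child") == "/-en/"
assert realize_plural("sheep") == "∅"
assert realize_plural("dog") == "/-z/"   # default; surfaces as [-z], [-s] or [əz]
                                         # depending purely on the preceding phoneme
```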

The plural morpheme is, of course, a member of a functional/grammatical category, to do with number (singular/(dual)/plural). Now, what about the other class of basic syntactic elements, which is more central to this paper, namely, roots? On the surface at least, allomorphy seems to work here in essentially the same way,Footnote 16 so, for instance, there are two allomorphs (two vocabulary items) for the root √destroy, roughly indicated here:

[(8) Vocabulary items for the root √destroy: /destrʌk-/ in specific nominal and adjectival contexts; /destrɔɪ/ elsewhere]

That is, this root manifests as /destrʌk-/ in specific nominal and adjectival contexts (‘destruction’, ‘destructive’) and elsewhere it is /destrɔɪ/, the default value. Other cases of root allomorphy discussed in the literature are √mouse with the allomorphs /mais/ manifest in the context of plurality and /maus/ elsewhere (e.g. in the singular and in compounds like ‘mouse-trap’), √thief with the allomorphs /θi:v/ and /θi:f/, √speak with the allomorphs /spi:tʃ/, /spouk/, /spi:k/ in different grammatical contexts (see Siddiqi 2009).

Similarly for the bound root √-ceive, or √683, which manifests as /sep/ in certain nominal and adjectival contexts (as in ‘reception’ / ‘perception’ and ‘receptive’ / ‘perceptive’) and as /si:v/ elsewhere, its default value (as in ‘receive’, ‘receiver’, ‘perceive’ and ‘perceiver’). This nicely captures a generalization: the /si:v/ ~ /sep/ alternation is a property of the -ceive root itself, which is why it behaves the same way across lexical items that share this root (e.g. ‘deceive’, ‘conceive’, ‘perceive’) and in imaginary nonce items formed from -ceive (e.g. ‘acceive’, ‘acception’). Contextual allomorphy, then, is a real phenomenon, such that there are distinct vocabulary items (phonological forms) for a single morpheme X (whether a functional item or a root), dependent on the immediate grammatical context Y that the morpheme occurs in, as schematized in (9) (from Embick 2015: 178):

[(9) Schema for contextual allomorphy: morpheme X is realized as a given phonological form in the immediate grammatical context Y (from Embick 2015: 178)]
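The same format extends to roots: a vocabulary item pairs a form with a conditioning grammatical context, plus an ‘elsewhere’ default, and because the alternation is stated on the root itself it automatically generalizes to every word (and nonce word) sharing that root. A sketch for √-ceive (√683), with the conditioning contexts stated very informally (the context labels are my simplifications, not the theory’s actual locality conditions):

```python
# Contextual allomorphy for the bound root √-ceive (√683), schematic:
#   √683 <-> /sep/  in certain nominal/adjectival contexts ('reception', 'receptive')
#   √683 <-> /si:v/ elsewhere ('receive', 'deceive', 'perceiver')
CEIVE_ALLOMORPHS = [
    ("/sep/", {"n[-tion]", "a[-tive]"}),   # conditioned by the local categorizer
    ("/si:v/", None),                      # 'elsewhere': the default value
]

def realize_ceive(local_context: str) -> str:
    for form, contexts in CEIVE_ALLOMORPHS:
        if contexts is None or local_context in contexts:
            return form
    raise AssertionError("unreachable: the elsewhere item always applies")

# The generalization holds for every lexical (or nonce) item built on the root:
assert realize_ceive("n[-tion]") == "/sep/"    # reception, perception, conception
assert realize_ceive("v") == "/si:v/"          # receive, deceive, conceive, acceive
```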

There are important further components of the theory of allomorphy, in particular concerning the ‘locality conditions’ on the structures within which the morpheme X and the context Y can interact (that is, constraints on the domains within which they can ‘see each other’); I think/hope that this important and somewhat contentious issue, which I do not attempt to address here, does not affect my central concern, which is to consider the grounds for a semantic parallel to allomorphy (for discussion of these locality issues, see Marantz (2013), Embick (2015, chapt. 7)).

In the next section, the focus is on allosemy, the hypothesized semantic counterpart to allomorphy. After introducing the idea, I look at some putative examples of allosemy, drawing on work from Borer (2014b) and Panagiotidis (2020) in finding striking differences between the way these work (or, in fact, do not work) as compared with the examples of allomorphy just considered. Then, at a more general level, stepping away from specific cases, I question the underlying assumption informing this work, namely, that there is a Phon/Sem symmetry in language, arguing instead for a significant asymmetry. In short, the position taken here is that there isn’t such a phenomenon as allosemy (root polysemy), that roots themselves have no meanings (inherent or grammatically conditioned), and that the bearers of content/meaning are structural domains as defined in Section 2 (Borer’s C-core of categorizers), the established/stored non-compositional meanings of these structures (words) having been originally pragmatically inferred in specific communicative contexts.

3.2.2 ‘Allosemy’—the Idea and its ProblemsFootnote 17

Apparently paradoxically, some linguists talk of roots as being both meaningless (as discussed above) and polysemous (Marantz 2013, p.103; Panagiotidis & Nobrega forthcoming). How can this be? The idea is that roots are meaningless in isolation but have meanings in a grammatical context, often several distinct meanings, each dependent on the local grammatical context in which the root appears. Marantz characterizes the ‘theory of contextual allosemy’ as ‘the theory about what governs the choice of meaning for a polysemous root.’ (2013: 103). Note that if there are such meanings for roots, they are ‘special’ (non-compositional), as for the meanings at issue in Section 2 above; roots themselves have no structure, no parts to compose, so any such meaning(s) must be atomic. That is, as with allomorphs, allosemes are unpredictable and so have to be listed together with the syntactic contexts in which they are realized.Footnote 18

In support of the view that contextual allomorphy and contextual allosemy arise in the same local grammatical domains, Marantz (2013: 102–103) discusses the root √house, which has the two allomorphs: /hauz/ as in the verb ‘house’ and the default (elsewhere) /haus/ as in the noun ‘house’ and compounds like ‘housework’. In parallel to this, he points out, their meanings are not compositionally related: ‘no literal house nor even a literal container is implied by the verb’ (p. 102); that is, √house has two contextual allosemes (polysemes): HOUSE (the dwelling) and HOUSE* (roughly: ‘give shelter/harbour’). So far, so good (although, of course, this case is at least as well explained by the account in Section 2, on which roots themselves have no meaning, contextual or otherwise). This is not to say that the two phenomena must always occur together: consider ‘relief’/‘relieve’, ‘belief’/‘believe’ (allomorphy without allosemy), on the one hand, and the noun/verb pairs ‘field’/‘field’ (= catch or stop) and ‘pinch’/‘pinch’ (= steal) (allosemy without allomorphy), on the other hand.

Setting aside allomorphy for now, the following seem like good candidates for root allosemy (with the grammatical contexts that condition the allosemes in each case presented very informally here):

[(10) Candidate cases of root allosemy, e.g. the roots underlying ‘fetch’, ‘liquid’, ‘book’, with the conditioning grammatical contexts stated informally]

It’s worth emphasizing that what is being claimed here is that these are meanings of the root itself, albeit contextually conditioned.

What about the special non-compositional meanings of complex words such as ‘reactionary’, ‘naturalize’, and ‘recital’, discussed in Section 2? Recall that these, as well as the cases just discussed (‘fetch’, ‘liquid’, ‘book’), all fall within Borer’s ‘syntactic domain of Content’ account. Where do they fit into an allosemy account? Presumably, they simply don’t: the meaning ‘make a citizen’ isn’t an alloseme of √nature or √nat (whichever of these is the root here) and ‘backward-looking’ isn’t an alloseme of √(re)act, because in these cases the special meaning doesn’t arise in an appropriately local context of the root, there being several categories intervening between the root and the categorization at which the atomic meaning arises.Footnote 19 If that is right, then contextual allosemy is quite limited in the range of non-compositional meanings of words based on a single root that it encompasses, meanings all of which might be thought of as associated with that root.

Even allowing for the limited nature of the phenomenon, specific problems arise. Consider the following cases of allosemy for the root ‘-ceive’, or √683, as presented by Harley (2014a: 245):

[(11) Contextual allosemes of the root √-ceive (√683), e.g. “think” in the context of [con-], “get” in the context of [re-] (Harley 2014a: 245)]

The first line is to be read: the (inherently meaningless) root ‘-ceive’ (√683) gets a contextual meaning paraphraseable as “think” when it occurs within a verbal domain and is immediately preceded by the prefix ‘con-’, and the same goes, mutatis mutandis, for the other cases. As already noted, it’s important to be clear that the claim is that the Content (i.e. encyclopedic meaning) here belongs to the root ‘-ceive’ itself rather than to the whole formation ‘conceive’ or ‘deceive’, whose verbal prefixes merely constitute the domains in which it is realized. This seems problematic. As emphasized by Panagiotidis (2020: 61) in a discussion of the Hebrew case above in (5), there is no ‘elsewhere’ condition here as there is for allomorphy, that is, no default meaning: the contextual meanings of ‘-ceive’ are all, in effect, specific ‘special meanings’; they are the atomic Contents CONCEIVE, RECEIVE, DECEIVE, etc. Furthermore, the “think” alloseme realized in the context of [con-] does not apply to the “become pregnant” sense of ‘conceive’/‘conception’ and the “get” alloseme realized in the context of [re-] does not apply to the “social event” sense of ‘reception’. There is no nice generalization across cases here as there is for the allomorphy of √-ceive (i.e. /si:v/ and /sep/), which holds for all occurrences of the root in the specific syntactic contexts given by the rules. These issues arise for numerous other cases (of Latinate roots), e.g. √-mit in ‘commit’, ‘permit’, ‘transmit’, ‘emit’, etc.; √-volve in ‘revolve’, ‘devolve’, ‘involve’, ‘evolve’, etc.; √-tract in ‘detract’, ‘contract’, ‘subtract’, ‘retract’, etc. In contrast with allomorphy, allosemy seems unexplanatory.
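To put the asymmetry starkly, here is the same rule format applied to both sides (a sketch only: the context labels are informal, and only the glosses actually cited above are included). The allomorphy list for √-ceive has one conditioned form plus an ‘elsewhere’ default, whereas the putative allosemy list is just an enumeration of word-sized Contents, with no default and nothing that generalizes across the items.

```python
# Allomorphy of √-ceive: one conditioned form plus an 'elsewhere' default.
CEIVE_FORMS = [
    ("/sep/", {"n[-tion]", "a[-tive]"}),   # reception, receptive, perception, ...
    ("/si:v/", None),                      # elsewhere: receive, deceive, conceive, ...
]

# Putative allosemy of √-ceive (after the discussion of Harley 2014a): each
# 'alloseme' is tied to a single prefixal context, there is no elsewhere/default
# meaning, and the listed glosses do not even cover all senses of the words
# concerned ('conceive' = become pregnant, 'reception' = social event).
CEIVE_MEANINGS = [
    ('"think"', {"con-"}),
    ('"get"', {"re-"}),
    # ... one listed Content per prefix, with no (meaning, None) default line
]
```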

A further point of interest is that the meanings of these roots are semantically unrelated, so talk of root polysemy seems misplaced, at least if the word ‘polysemy’ is being used in its well-established sense of a single linguistic unit which has several interrelated meanings, thereby distinguishing it from homonymy (accidental coincidence of form with unrelated meanings). As far as I can tell, it is not the intention of allosemy supporters to obliterate the polysemy/homonymy distinction; I don’t think anyone is suggesting, for instance, that the root √bank has as two of its allosemes the financial and the riverside meanings. It seems, then, that what is being captured by the allosemy cases above is etymological relatedness rather than any current semantic/pragmatic relatedness.Footnote 20

Moving now to a different sort of case: the meanings of ‘conversions’ such as those discussed in Section 2 would, presumably, also be listed as allosemes of the root, so, for example:

[(12) Noun → verb conversion meanings listed as contextual allosemes of the root, e.g. √dust: the noun meaning and the verb meaning ‘remove dust from’]

But what about the sense of the verb ‘dust’ in ‘to dust the cake (with sugar)’? There is no syntactic context to distinguish this “sprinkle” sense from the “remove” sense – it is entirely a matter of discourse context, general knowledge and pragmatics. This could arise (perhaps has arisen already in some instances) for many verbs which, in principle, allow for both a removing X and an adding X meaning, e.g. ‘seed’, ‘skin’, ‘shell’, ‘pit’, ‘pebble’, ‘weed’. Similarly, it is easy to envisage a further sense for ‘sand’ arising and becoming established, as in ‘Let’s sand this area so the kids can play here’ meaning to cover (or pile up) with sand.Footnote 21 Nothing about the immediate syntactic context will distinguish the operative alloseme here. The point applies generally to any number of cases of denominal verbs, so ‘brick’ gets different meanings according to the speaker’s/hearer’s concerns, as in ‘They cleared the debris and bricked the wall’ (= make/repair something with bricks), ‘She bricked the papers so they would all fit in her bag’ (= make something into (the shape of) bricks). Unlike the √-ceive case, these noun/verb alternations are instances of polysemy (in the sense of having related meanings), but again there is no default meaning, no generalization across cases, just a list of non-compositional meanings whose realization is not a matter of grammatically specifiable contexts but of fine-grained communicative contexts.

Another issue concerns what exactly constitutes a ‘syntactic’ or ‘grammatical’ domain, especially when overt categorizers are involved. Consider, for instance, the nouns ‘fracture’ and ‘fraction’, both formed directly from the root √fract and with very different meanings (albeit both to do with ‘breaking’); these meanings would appear to be prime candidates for contextual allosemes of √fract, but how are the two local grammatical contexts to be distinguished? The functors ‘-ure’ and ‘-tion’ are both nominalizers of the root and do not seem to have any obviously different formal syntactic/semantic (syn/sem) features, both apparently realizing states, processes, and results (see Lieber (2004: 39), who talks of them as rival affixes which are semantically interchangeable). If this is right, it looks as if it will be necessary to involve phonological components, /tjur/ and /šən/, in conditioning the choice of root meaning, but this is precluded by the assumed architecture of the grammar and its interfaces (see the diagram at (6) above). There seem to be several other noun pairs built on a single root, to which this concern pertains: creature/creation; fissure/fission; denture/dentition; juncture/junction; nature/nation.Footnote 22 Other pairs that may raise the same problem but involve different pairs of affixes are ‘proposal’/‘proposition’, ‘sensory’/‘sensual’, ‘credal’/‘credible’. Another manifestation of the same issue (that is, the required ‘grammatical’ nature of the conditioning context) arises for the multiply polysemous words/roots ‘run’, ‘bear’, and ‘line’. Consider √run and its myriad meanings when in an immediate verbal context: ‘run a mile’, ‘run a business’, ‘run a bath’, ‘run a tight ship’, ‘run for president’, ‘run for New Zealand’ (in the commonwealth games), ‘run an argument/idea’, and more. It looks very unlikely that each of these different meanings for ‘run’, and hence potential allosemes of √run, can be distinguished entirely on the basis of immediate grammatical context; rather, considerations of discourse context and real world knowledge have to be brought to bear, a far more expansive notion of context than anything needed for allomorphy and one which has, therefore, been excluded also for allosemy (see Harley (2014b: 469) for brief discussion of the exclusion of ‘discourse context’ from the theory of allosemy).

In her stringent critique of Harley’s (2014a) applications of allosemy, Borer (2014b) points out further problems that the approach faces when dealing with idiomatic phrases (e.g. ‘spill the beans’, ‘kick the bucket’),Footnote 23 the key point being that these structures receive their idiomatic meanings as a whole. She extends the point to morphologically complex and simple words quite generally:

Clearly, neither kick nor bucket are assigned Content in the context of the idiom kick the bucket. Rather, what is assigned Content is the constituent as a whole. By a similar rationale, neither nat nor ceive are assigned Content, nor, for that matter, are nature and natural within naturalize when the relevant Content is naturalize (become a citizen). Rather, Content is assigned to receive or to naturalize as a whole. But if that is the case, then there is little reason to assume that roots are ever assigned Content, with or without context. Content, rather, is always associated with (labeled) syntactic constituents, at times with considerable internal complexity. To the extent that we perceive the ‘root’ realized as dog to have Content, then, this is not because the root itself has Content but rather, [N √dog] has Content, distinct, we note, from that associated with [v √dog]. (Borer, ibid: 356)

This is, of course, the position I have adopted in Section 2: given an adequate account of the syntactic domain of Content, as developed by Borer (2013a, b), together with a store of such Contents (established non-compositional meanings), and a process for matching the two, there is no need for anything like allosemy.Footnote 24 So, given the problems with specific alleged cases of allosemy and its, at best, very limited explanatory range, the Borerian account seems preferable.Footnote 25

Leaving aside the details of specific cases now, the question arises: why is it that this alleged allomorphy/allosemy symmetry is seen by many theorists, especially those working within the DM framework, as a desideratum of the theory, as it very clearly is (see, in particular, Marantz (2013: 97))? Certainly, the architecture of DM, itself reflective of the architecture of the broader minimalist program in generative grammar (syntax kept minimal with much of the action now playing out at the phon/sem interfaces) (Chomsky 1995a, 2021; Allott & Lohndal forthcoming), does seem to point to such a symmetry (see the diagram in (6)), with the syntax cyclically sending chunks of structure off simultaneously to both kinds of spell-out (interpretation). Theory-internal considerations of economy and elegance, then, appear to favour a phon/sem symmetry, and thus, given the undoubted reality of allomorphy, the existence of a parallel phenomenon of allosemy.

I think there are significant grounds for doubting that there is such a symmetry, that is, for doubting that the semantic interface works in the same way as the phonological interface. First, recall that there are really two kinds of allomorphy, syntactically conditioned (e.g. the different plurals for ‘boy’, ‘child’, ‘sheep’) and phonologically conditioned (e.g. the distinct manifestations of the plural /-z/: [-s], [-z], [iz] (Embick 2015: 173–176)). Similarly for the allomorphy of the bound root √-fer, with distinct syntactically conditioned stress patterns (as in ‘refer’ vs. ‘reference’, ‘referential’) and, arguably, phonologically conditioned allomorphs /fɚ/ and /fɚr/ as in ‘refer’ and ‘referral’. Given the symmetry assumption, one might wonder what parallel there might be to this phonologically conditioned allomorphy on the allosemy side. Apart from the (alleged) syntactically conditioned meanings, the only other ‘conditioning’ factor available on the meaning side seems to be semantic/pragmatic. But there is no parallel here. Phonologically conditioned allomorphs form a small, closed set, the predictable outcome of facts about human articulation: the plural ending on the nonsense word ‘jelit’ will be pronounced [s], the plural ending on ‘dax’ will be pronounced [iz]. ‘Semantically or pragmatically conditioned’ meanings, on the other hand, are indefinite in number (new ones arising every day for ever-evolving speaker-hearer purposes, some recurring, many one-off and transient) and unpredictable ahead of specific contexts of utterance: we cannot predict what ‘jelit’ or ‘dax’ may come to mean, nor, for that matter, the meanings that ‘mouse’, ‘swipe’, ‘remote’ or any other word may acquire in use. There is no phon/sem parallel here.
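
The contrast can be made vivid with a minimal sketch (my own, purely for illustration): the phonologically conditioned plural allomorph is a total, deterministic function of the stem-final sound, whereas nothing remotely comparable maps contexts of use onto senses.

```python
# A minimal sketch of phonologically conditioned allomorphy: the English plural.
# The choice among [s], [z], [iz] is fully determined by the stem-final sound.

SIBILANTS = {"s", "z", "ʃ", "ʒ", "tʃ", "dʒ"}
VOICELESS = {"p", "t", "k", "f", "θ"}

def plural_allomorph(final_sound: str) -> str:
    """Return the predictable plural allomorph for a stem ending in final_sound."""
    if final_sound in SIBILANTS:
        return "iz"
    if final_sound in VOICELESS:
        return "s"
    return "z"

print(plural_allomorph("t"))  # 'jelit' ends in /t/  -> [s]
print(plural_allomorph("s"))  # 'dax' ends in /ks/   -> [iz]
# No analogous function exists on the meaning side: nothing about a context lets
# us compute in advance what 'jelit', 'dax' or 'mouse' will come to mean in use.
```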

Arguments coming from a rather different direction, point to an intrinsic asymmetry between the phonology and the (conceptual) semantics of a language. As Studdert-Kennedy (2000) says: ‘… a purely semantic representation of any language is indeed impossible, because a phonetic structure is intrinsic to every word. Every language has phonologically permissible words that happen not to have been assigned a meaning (so-called nonsense words). No language has words that have meaning, but have not yet been assigned phonological form’ (ibid: 173; my emphasis).Footnote 26 The point is well illustrated by the famous (and now much-used for linguistic purposes) Jabberwocky sentences, e.g.’Twas brillig, and the slithy toves / Did gyre and gimble in the wabe / All mimsy were the borogoves / And the mome raths outgrabe.’ These are perfectly grammatical sentences of English, with a minimal formal meaning template provided by functional and other closed-class items (e.g. ‘and’, ‘the’, ‘was’, ‘all’, plural, past tense, etc.), but the point here is that the open-class content words are phonologically complete and impeccable for English while lacking Content; the reverse situation (Content but no phonology) would simply not comprise linguistic expressions of any language. In short, languages are vehicles for Content and it is the phonology of a language that provides the vehicle.

The asymmetry at issue here could be argued to have its roots in a more fundamental (evolutionary) asymmetry, according to which the computational (syntactic) system has an asymmetrical relation with the two external components, the semantic-pragmatic or conceptual-intentional (CI) and the sensorimotor (SM) (Chomsky 2010; Berwick and Chomsky 2011, 2016).Footnote 27 Adopting this view, Mendívil-Giró (2019) says: ‘the computational system would be optimized for its interaction with the CI system, while the relationship with the SM system would be ancillary or secondary. It is then implied that the computational system is coupled with the CI system to form a kind of “internal language of thought” that would be essentially homogeneous within the species and that would not be evolutionarily designed for communication, but for thought.’ (ibid: 1164). The secondary relationship, with the SM systems, comes with the externalization of language via the SM systems, making it usable for communication (articulable and perceptible), and giving rise to human languages with all the diversity and specificity of their sensorimotor properties (sounds/gestures) and of some of their structural properties.Footnote 28

In line with the claim of a phon-sem asymmetry, some theorists maintain that while roots are meaningless in and of themselves, they do have phonological properties, and are, in fact, fundamentally phonological in nature. For instance, Borer (2013a) maintains that a root is a phonological index: ‘a reference constant across all occurrences to a specific phonological information packet, where both /thief/ and /thieve/ could be accessed under the relevant syntactic circumstances, and where principles of phonological faithfulness would ensure some threshold of phonological relatedness, however defined’ (p. 381; my emphasis).Footnote 29 In support of this view, there are verb pairs whose meanings are very closely related, e.g. ‘eat’ and ‘feed’, ‘die’ and ‘kill’ (the latter pair being semantically related in much the same way as the two occurrences of the single verb ‘close’ in ‘The door closed’ and ‘Sue closed the door’), but which clearly have distinct roots: √eat and √feed, √die and √kill, rather than being allomorphs of a single root. That is, phonological form and not meaning/content is the key individuating property of a root, and in the case of the pairs ‘eat/feed’ and ‘die/kill’, the threshold of phonological relatedness for a single root is simply not met (so also for the closely semantically related pairs ‘leg/walk’, ‘round/circle’, ‘wing/fly’, and umpteen others).Footnote 30 Just like the roots √tove, √gyre, √wabe, √gimble, etc., which underlie the not (yet) contentful noun forms ‘tove’, ‘wabe’ and the verb forms ‘gyre’ and ‘gimble’, established roots for English such as √-ceive and √teach come with phonological properties, albeit underspecified so as to allow for contextual allomorphs as discussed above (e.g. /si:v/ and /sep/ for the root √-ceive, and /tiːtʃ/ and /tɔːt/ for the root √teach, which are realized in particular grammatical contexts). The point, again, is that phonological forms are in this sense basic and intrinsic to a specific language, while meanings/Contents, the stuff of thoughts and central to communicative interactions, are a distinct (probably universal) system, parts of which get hitched up to pieces of linguistic form (and are perhaps thereby transformed (Pietroski 2018, 2023; Acquaviva 2022)).
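
On this construal, a root can be modelled, very roughly, as an index over a small packet of phonologically related forms, with the local grammatical context selecting among them. The sketch below is my own simplification (the context labels are illustrative assumptions); its only point is that form, not meaning, does the individuating work.

```python
from dataclasses import dataclass

@dataclass
class Root:
    """A root as a phonological index: no Content, just a packet of related forms."""
    index: str
    forms: dict  # maps a grammatical-context label to a phonological form

    def spell_out(self, context: str = "default") -> str:
        # Fall back to the default form if no special context applies.
        return self.forms.get(context, self.forms["default"])

# Context labels ('default', 'nominalized', 'past') are illustrative only.
CEIVE = Root("√-ceive", {"default": "si:v", "nominalized": "sep"})
TEACH = Root("√teach", {"default": "tiːtʃ", "past": "tɔːt"})

print(CEIVE.spell_out())               # si:v  (as in 'receive')
print(CEIVE.spell_out("nominalized"))  # sep   (as in 'reception')
print(TEACH.spell_out("past"))         # tɔːt  (as in 'taught')
# 'eat'/'feed' or 'die'/'kill' would be distinct Root objects: however close their
# meanings, they fail any threshold of phonological relatedness for a single index.
```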

To conclude this discussion of the meaning of roots, we have seen three positions:

(1) Roots have a meaning in and of themselves, which may be (a) fully semantically specified (it is unclear whether anyone holds this view) or, more plausibly, (b) semantically underspecified (Arad 2003, 2005);

(2) Roots have no meaning in isolation, but in at least some cases, they have meanings (allosemes) that are realized in specific local grammatical contexts (Marantz 1995, 2013; Embick and Marantz 2008; Embick 2021; Harley 2014a, b);

(3) Roots themselves have no meaning, with or without context (Borer 2013a; Acquaviva 2014; Panagiotidis 2020). On some accounts taking this position, meaning (atomic Content) is assigned/matched to certain delimited phrasal structures (familiarly known as ‘words’) within which a root occurs, as detailed in Section 2.

The burden of this section has been to argue that neither of the first two options holds; rather, a root is a categoryless and meaningless index, which provides a marker of, and, once categorized, a receptacle for, inferred or stored atomic Content (sense). Such Content can only be assigned to, or matched with, specified pieces of structure, and its origin is pragmatic.Footnote 31

4 Conclusion: Words as Units of Communication

What makes words particularly interesting and revealing is that they are both formal syntactic structures and manifestations of language as an instrument of communication, comprising the phonological vehicles (articulable and perceptible) that make the communicative use of language possible and providing a hub for the accumulation of senses/contents that matter to communicators in their exchanges with others. Compositional phrasal structures tend to be replaced by structures that allow non-compositional meaning when the denotation/content has passed some threshold of usefulness or salience in the life of a language community: ‘When new tools of uncertain usefulness are invented, speakers may describe them by means of phrases, like “computing machinery,” in Turing’s seminal paper (1950), or “self-driving car” today. If their referents prove to be widely useful and become ubiquitous, such phrases are likely to be replaced by single words, like “computer” or “hybrid”.’ (Spelke forthcoming).

While the narrow language faculty (syntax and its interfaces) has roots (and a wide array of functional categories) as its basic elements (primitives), the linguistic communicator, arguably, employs words and other bearers of non-compositional meaning/content as basic communication units. As noted in the introduction, even those who disavow the scientific reality of words recognise their psychological reality for language users: “The psychological reality of words is probably a consequence of their distributional properties … words are the minimal linguistic units that speakers can manipulate consciously. It is therefore no surprise that speakers are generally aware of words” (Julien 2007: 83), and “What people learn, use, recognize, utter, forget, recall, and miss are words, not morphemes [roots, affixes, etc. (RC)]” (Mendívil-Giró 2019: 1207), due, in his view, to the fact that they are the smallest stretches of syntax that connect to the sensorimotor systems responsible for language externalization. Based on observations of this sort, it seems reasonable to hypothesize that there is a user-based communicational lexicon, quite distinct from any of the listed elements of the narrow linguistic (syntactic) system. Such a lexicon stores conventionalized pairings of phonological forms and families of senses, so it connects up items of the two interface components, the sensorimotor and the conceptual-intentional (or ‘semantic-pragmatic’). The sense in which the meanings stored here are ‘semantic’ is not just that they can contribute to the truth-conditional content of utterances (i.e. are fully specified), as can ad hoc occasion-specific senses, but that they are established/conventionalized, while their polysemy families preserve various degrees of interrelatedness that reflect their typically pragmatic origins. In Chomskian terms, this sort of lexicon is a component of the language ‘performance’ systems (see footnote 3) and as such it reflects usage properties that are wholly irrelevant to the syntactic/compositional linguistic ‘competence’ system; for instance, the frequency of use of words and of particular word meanings, and the numerous associative relations (phonological, morphological, semantic) that give rise to priming effects, both of which are important factors to be controlled for (or exploited) in psycholinguistic experiments on language processing.
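
As a rough sketch only (the field names and the example entry are my own assumptions, not a proposed formalism), an entry in such a lexicon might pair a phonological form with a family of conventionalized senses, together with the usage properties (frequency, group tags, associative links) that matter to performance but play no role in the competence system:

```python
from dataclasses import dataclass, field

@dataclass
class Sense:
    gloss: str                                       # an established, conventionalized meaning
    related_to: list = field(default_factory=list)   # links within the polysemy family

@dataclass
class LexEntry:
    form: str                                        # phonological form (sensorimotor side)
    senses: list                                     # family of senses (conceptual-intentional side)
    frequency: int = 0                               # usage property, irrelevant to competence
    communal_tags: set = field(default_factory=set)  # e.g. {'linguists', 'Londoners'}

# A toy entry; the glosses and numbers are invented for illustration.
mouse = LexEntry(
    form="maʊs",
    senses=[
        Sense("small rodent"),
        Sense("hand-held pointing device", related_to=["small rodent"]),
    ],
    frequency=5321,
    communal_tags={"general"},
)
print(mouse.form, [s.gloss for s in mouse.senses])
```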

This notion of a communicator’s lexicon is, of course, quite distinct from the old ‘lexicalist’ view of the lexicon, which prevailed in earlier generative grammar and which was a pre-syntactic lexicon, a location for the derivation and storage of words which provided the basic input to the syntax. The lexical entries of the lexicalist lexicon were repositories of considerable syntactic and semantic information, including the argument structure of verbs and so-called ‘selection restrictions’ on those arguments (e.g. the subject of ‘eat’ must be animate, the object of ‘hit’ must be concrete). Rules of word formation (affixation) were also located here before the advent of DM (Halle and Marantz 1993; Marantz 1995) and Borer’s ‘exoskeletal’ view of the lexicon as a ‘listicon’, which does not replicate syntactic information.Footnote 32 As Marantz (1995) puts it, on the ‘lexicalist’ approach, there was a computational lexicon as distinct from the computational syntax (so two language engines). The communicational lexicon is not computational, not generative – it is a repository of stored communication units, with well-established (i.e. conventionalized) atomic (unpredictable, non-compositional) meanings that cannot be generated/composed from the meanings of their parts and their syntactic structure. Individual speakers’ communicational lexicons are likely to be somewhat idiosyncratic, incomplete, and informationally scrappy, overlapping to an appreciable degree with those of other members of a speech community but not 'shared' in any strong sense; they may contain ‘communal sublexicons’, tagged for the particular subgroup with whom the individual shares them (economists, linguists, Londoners, football aficionados, etc.) (Clark 1998).

The communicational lexicon, as I conceive of it, bears considerable resemblance to the lexicon developed in Jackendoff (1997, 2002), which he sees as the ‘store of memorized elements of language’, as distinct from those aspects of language which are generated afresh on each use by combinatorial rules (syntax). On this construal, the lexicon contains not only words (with their various stored senses), but also larger multi-word items: idioms like ‘to spill the beans’, ‘The cat’s got NP’s tongue’, frozen expressions like ‘by and large’, ‘in cahoots with’, ‘kith and kin’, and even whole sentences like ‘May the force be with you’. Marantz (1995, 2001) has argued that Jackendoff’s corpus of memorized/stored pieces of language is a quite different entity from anything we would want to call a ‘linguistic lexicon’. This seems right to me; rather, it is a language user’s communicational lexicon, containing myriad memorized conventional language uses, and it may have quite a considerable degree of internal organisation (established ‘words’ and their evolving families of senses may comprise one component, frozen forms another, social formulas like ‘May the force be with you’ yet another with some distinct status, and so on).Footnote 33

An issue that arises for the communicational lexicon concerns the possible co-occurrence of compositionality and conventionality. It is clear that conventionalized non-compositional (atomic) meanings (e.g. reactionary, naturalization, transmission, detective, etc.) must be stored, but what about apparently conventionalized compositional meanings (e.g. transmit + ion, revolu + tion, kind + ness, syntac + tic, fantas + ize)? The latter can all be composed/generated from the atomic meanings of the parts and their syntactic structure, as is the case for new coinages that we are sometimes aware of producing on the fly (e.g. ‘crafty’ with the meaning ‘engaged in crafts (making things)’, or ‘machination’ with the meaning ‘the filling of a space with machines’). On the other hand, since the examples at issue involve well-established meanings/uses of these English words (i.e. they seem to be conventionalized/memorized), they appear to meet the standard criterion for storage in the lexicon and direct retrieval, as a single unit, by communicators. Some theorists working within the single syntactic engine framework posit a principle of ‘Full Decomposition’ (FD), according to which ‘No complex [linguistic] objects are stored in memory; i.e. every complex object must be derived by the grammar’, from which it follows that ‘every word and every phrase is built every time it is used’ (Embick 2015: 17; my emphasis).Footnote 34 See also Mendívil-Giró (2019: 1192), who explicitly adheres to this principle, albeit with a somewhat different view from Embick of the kind of “complex object” that words and phrases are. Full Decomposition is perhaps intended as an ideal, as a principle which would make for the most economical and elegant theoretical account: the only listed/stored items are syntactic primitives (i.e. roots, functors), vocabulary items (i.e. phonological realizations of roots and functors), and atomic Contents.
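
To make the two options concrete, here is a deliberately crude sketch (the function names and glosses are my own illustrative assumptions, not a model of either theory): under Full Decomposition the meaning of every complex form is composed afresh from its parts, whereas a communicational lexicon would allow a conventionalized complex unit to be retrieved whole, with composition as the fallback.

```python
# A schematic contrast only; glosses are invented for illustration.

STORED_UNITS = {
    # conventionalized meanings retrieved whole, whether or not they are also composable
    "transmission": "gear system of a vehicle",
    "naturalization": "process of becoming a citizen",
}

def compose(parts: list) -> str:
    """Stand-in for compositional interpretation from roots/affixes and structure."""
    return " + ".join(parts)

def full_decomposition(parts: list) -> str:
    # 'every word and every phrase is built every time it is used'
    return compose(parts)

def with_communicational_lexicon(word: str, parts: list) -> str:
    # Retrieve a ready-made unit if one is stored; otherwise compose.
    return STORED_UNITS.get(word, compose(parts))

print(full_decomposition(["transmit", "-ion"]))                            # always composed
print(with_communicational_lexicon("transmission", ["transmit", "-ion"]))  # retrieved whole
print(with_communicational_lexicon("kindness", ["kind", "-ness"]))         # composed
```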

Again, then, a certain tension arises here as the FD principle seems to be at odds with the conventionalization and storage of complex (compositional) forms and meanings, and thus with the reality and salience of words (and other communication units) for language users, ordinary folk constantly employing language for communication. I remain unsure about the best way to go on this issue, but incline to the view that communicative exigencies coupled with our undoubtedly capacious memories point in the direction of storage of these structurally and semantically complex units, notwithstanding the apparent redundancy and lack of theoretical parsimony this entails.Footnote 35 It could be argued that this is not a pernicious redundancy in that there are two distinct cognitive systems here, the computational (syntactic) and the communicational, although the former (arguably, not originally selected for a communicative functionFootnote 36) is recruited by the latter in generating the phrases and sentences we utter. Such a ‘redundancy’ in the overall architecture may make for more economical processing in utterance production and comprehension, the retrieval of ready-made communication units being, at least sometimes, less effortful than building structures and matching forms with meaning/sense. Whether or not this proves to be the right way to view the situation awaits more work on the interface of the formal/computational/syntactic and the communicational/pragmatic/performance components of the language mosaic.