Abstract
The form of a word sometimes conveys semantic information. For example, the iconic word gurgle sounds like what it means, and busy is easy to identify as an English adjective because it ends in -y. Such links between form and meaning matter because they help people learn and use language. But gurgle also sounds like gargle and burble, and the -y in busy is morphologically and etymologically unrelated to the -y in crazy and watery. Whatever processing effects gurgle and busy have in common likely stem not from iconic, morphological, or etymological relationships but from systematicity more broadly: the phenomenon whereby semantically related words share a phonological or orthographic feature. In this review, we evaluate corpus evidence that spoken languages are systematic (even when controlling for iconicity, morphology, and etymology) and experimental evidence that systematicity impacts word processing (even in lieu of iconic, morphological, and etymological relationships). We conclude by drawing attention to the relationship between systematicity and low-frequency words and, consequently, the role that systematicity plays in natural language processing.
Similar content being viewed by others
References
Abelin, Å. (1999). Phonesthemes in Swedish. Proceedings of XIV International Conference of Phonetic Sciences, 99, 1333–1336.
Abramova, E., Fernández, R., & Sangati, F. (2013). Automatic labeling of phonesthemic senses. Proceedings of the Annual Meeting of the Cognitive Science Society, 35(35).
Aitchison, J. (1987). Words in the mind: An introduction to the mental lexicon. Blackwell.
Alkire, T., & Rosen, C. (2010). Romance languages: A historical introduction. Cambridge University Press.
Amenta, S., & Crepaldi, D. (2012). Morphological processing as we know it: An analytical review of morphological effects in visual word identification. Frontiers in Psychology, 3.
Amenta, S., Marelli, M., & Sulpizio, S. (2017). From sound to meaning: Phonology-to-semantics mapping in visual word recognition. Psychonomic Bulletin & Review, 24, 887–893.
Amenta, S., Hasenäcker, J., Crepaldi, D., & Marelli, M. (2022). Prediction at the intersection of sentence context and word form: Evidence from eye-movements and self-paced readingx. Psychonomic Bulletin & Review, 1–12.
Baayen, H. (1996). The effects of lexical specialization on the growth curve of the vocabulary. Computational Linguistics, 22(4), 455–480.
Baayen, R. H., & Renouf, A. (1996). Chronicling the times: Productive lexical innovations in an English newspaper. Language, 72, 69–96.
Baayen, R. H., Chuang, Y. Y., Shafaei-Bajestan, E., & Blevins, J. P. (2019). The discriminative lexicon: A unified computational model for the lexicon and lexical processing in comprehension and production grounded not in (de) composition but in linear discriminative learning. Complexity. https://doi.org/10.1155/2019/4895891
Bergen, B. K. (2004). The psychological reality of phonaesthemes. Language, 80(2), 290–311.
Blake, B. J. (2017). Sound symbolism in English: Weighing the evidence. Australian Journal of Linguistics, 37(3), 286–313.
Blust, R. (2003). The phonestheme ŋ-in Austronesian languages. Oceanic Linguistics, 42(1), 187–212.
Bojanowski, P., Grave, E., Joulin, A., & Mikolov, T. (2017). Enriching word vectors with subword information. Transactions of the association for computational linguistics, 5, 135–146.
Bolinger, D. L. (1940). Word affinities. American Speech, 15(1), 62–73.
Bolinger, D. L. (1950). Rime, assonance, and morpheme analysis. Word, 6(2), 117–136.
Bolinger, D. L. (1953). The life and death of words. The American Scholar, 323–335.
Brand, J., Monaghan, P., & Walker, P. (2018). The changing role of sound-symbolism for small versus large vocabularies. Cognitive Science, 42, 578–590.
Brooks, P. J., Braine, M. D., Catalano, L., Brody, R. E., & Sudhalter, V. (1993). Acquisition of gender-like noun subclasses in an artificial language: The contribution of phonological markers to learning. Journal of Memory and Language, 32(1), 76–95.
Brysbaert, M., Mandera, P., & Keuleers, E. (2018). The word frequency effect in word processing: An updated review. Current Directions in Psychological Science, 27(1), 45–50.
Brysbaert, M., & New, B. (2009). Moving beyond Kučera and Francis: A critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English. Behavior Research Methods, 41(4), 977–990.
Brysbaert, M., Stevens, M., Mandera, P., & Keuleers, E. (2016). How many words do we know? Practical estimates of vocabulary size dependent on word definition, the degree of language input and the participant’s age. Frontiers in Psychology, 7, 1116.
Bullinaria, J. A., & Levy, J. P. (2007). Extracting semantic representations from word co-occurrence statistics: A computational study. Behavior Research Methods, 39, 510–526.
Calude, A. S., & Pagel, M. (2011). How do we use language? Shared patterns in the frequency of San Jose (any) word use across 17 world languages. Philosophical Transactions of the Royal Society B: Biological Sciences, 366(1567), 1101–1107.
Caplan, S., Kodner, J., & Yang, C. (2020). Miller’s monkey updated: Communicative efficiency and the statistics of words in natural language. Cognition, 205, 104466.
Carr, J. W., Smith, K., Cornish, H., & Kirby, S. (2017). The cultural evolution of structured languages in an open-ended, continuous world. Cognitive Science, 41(4), 892–923.
Cassani, G., Chuang, Y. Y., & Baayen, R. H. (2020). On the semantics of nonwords and their lexical category. Journal of Experimental Psychology: Learning, Memory, and Cognition, 46(4), 621.
Cassani, G., & Limacher, N. (2022). Not just form, not just meaning: Words with consistent form-meaning mappings are learned earlier. Quarterly Journal of Experimental Psychology, 75(8), 1464–1482.
Cassidy, K. W., & Kelly, M. H. (1991). Phonological information for grammatical category assignments. Journal of Memory and Language, 30(3), 348–369.
Cassidy, K. W., & Kelly, M. H. (2001). Children’s use of phonology to infer grammatical class in vocabulary learning. Psychonomic Bulletin & Review, 8, 519–523.
Chomsky, N., Roberts, I., & Watumull, J. (2023). The false promise of ChatGPT. The New York Times. Retrieved 1 April, 2023, from https://www.nytimes.com/2023/03/08/opinion/noam-chomsky-chatgpt-ai.html
Chuang, Y. Y., & Baayen, R. H. (2021). Discriminative learning and the lexicon: NDL and LDL. Oxford Research Encyclopedia of Linguistics. https://doi.org/10.1093/acrefore/9780199384655.013.375
Chuang, Y. Y., Vollmer, M. L., Shafaei-Bajestan, E., Gahl, S., Hendrix, P., & Baayen, R. H. (2021). The processing of pseudoword form and meaning in production and comprehension: A computational modeling approach using linear discriminative learning. Behavior research methods, 53, 945–976.
Collins, P. (1995). The indirect object construction in English: An informational approach. Linguistics, 33, 35–49.
Contreras Kallens, P., & Christiansen, M. H. (2020). Phonological cues to semantic class membership across hundreds of languages. Proceedings of the 13th International Conference on the Evolution of Language.
Cutler, A. (1993). Phonological cues to open-and closed-class words in the processing of spoken sentences. Journal of Psycholinguistic Research, 22(2), 109–131.
Cutler, A., Hawkins, J. A., & Gilligan, G. (1985). The suffixing preference: A processing explanation. Linguistics, 23, 723–758.
Dautriche, I., Mahowald, K., Gibson, E., & Piantadosi, S. T. (2017). Wordform similarity increases with semantic similarity: An analysis of 100 languages. Cognitive Science, 41(8), 2149–2169.
Dellert, J., & Jäger, G. (2017). NorthEuraLex-Lexicostatistical Database of Northern Eurasia. University of Tubingen.
Dingemanse, M. (2012). Advances in the cross-linguistic study of ideophones. Language and Linguistics compass, 6(10), 654–672.
Dingemanse, M., Blasi, D. E., Lupyan, G., Christiansen, M. H., & Monaghan, P. (2015). Arbitrariness, iconicity, and systematicity in language. Trends in Cognitive Sciences, 19(10), 603–615.
Dingemanse, M., & Thompson, B. (2020). Playful iconicity: Structural markedness underlies the relation between funniness and iconicity. Language and Cognition, 12(1), 203–224.
D’Onofrio, A. (2014). Phonetic detail and dimensionality in sound-shape correspondences: Refining the bouba–kiki paradigm. Language and Speech, 57(3), 367–393.
Drellishak, S. (2006). Statistical techniques for detecting and validating phonesthemes. Unpublished master’s thesis.
Dubossarsky, H., Weinshall, D., & Grossman, E. (2017). Outta control: Laws of semantic change and inherent biases in word representation models. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (pp. 1136–1145).
Durkin, P. (2009). The Oxford guide to etymology. Oxford University Press.
Edmiston, P., Perlman, M., & Lupyan, G. (2018). Repeated imitation makes human vocalizations more word-like. Proceedings of the Royal Society B: Biological Sciences, 285(1874), 20172709.
Fan, F. (2010). An asymptotic model for the English hapax/vocabulary ratio. Computational Linguistics, 36(4), 631–637.
Farmer, T. A., Christiansen, M. H., & Monaghan, P. (2006). Phonological typicality influences on-line sentence comprehension. Proceedings of the National Academy of Sciences, 103(32), 12203–12208.
Finch, S., & Chater, N. (1992). Unsupervised methods for finding linguistic categories. Artificial Neural networks (pp. 1365–1368). North-Holland.
Finn, P. J. (1977). Word frequency, information theory, and cloze performance: A transfer feature theory of processing in reading. Reading Research Quarterly, 508–537.
Firth, J. R. (1930). Speech. Oxford University Press.
Firth, J. R. (1935). The use and distribution of certain English sounds. English Studies, 17, 8–18.
Firth, J. R. (1957). A synopsis of linguistic theory 1930-1955. Studies in Linguistic Analysis, 1–32.
Fiske, J. (1872). Myths and myth-makers: Old tales and superstitions interpreted by comparative mythology Houghton-Mifflin.
Fitneva, S. A., Christiansen, M. H., & Monaghan, P. (2009). From sound to syntax: Phonological constraints on children’s lexical categorization of new words. Journal of Child Language, 36(5), 967–997.
Fordyce, J. F. (1988). Studies in sound symbolism with special reference to English. University of California.
Frishberg, N. (1975). Arbitrariness and iconicity: Historical change in American sign language. Language, 696–719.
Gasser, M. (2004). The origins of arbitrariness in language. In Proceedings of the Annual Meeting of the Cognitive Science Society, 26(26).
Gatti, D., Marelli, M., & Rinaldi, L. (2022). Out-of-vocabulary but not meaningless: Evidence for semantic-priming effects in pseudoword processing. Journal of Experimental Psychology: General, 152(3), 851–863.
Gibson, E., Futrell, R., Piantadosi, S. P., Dautriche, I., Mahowald, K., Bergen, L., & Levy, R. (2019). How efficiency shapes human language. Trends in Cognitive Sciences, 23(5), 389–407.
Gimenes, M., & New, B. (2016). Worldlex: Twitter and blog word frequencies for 66 languages. Behavior Research Methods, 48, 963–972.
Greenberg, J. H. (1957). Essays in linguistics. University of Chicago Press.
Günther, F., Rinaldi, L., & Marelli, M. (2019). Vector-space models of semantic representation from a cognitive perspective: A discussion of common misconceptions. Perspectives on Psychological Science, 14(6), 1006–1033.
Guo, B., Zhang, X., Wang, Z., Jiang, M., Nie, J., Ding, Y., . . . Wu, Y. (2023). How close is ChatGPT to human experts? Comparison corpus, evaluation, and detection. arXiv preprint arXiv:2301.07597.
Gutiérrez, E. D., Levy, R., & Bergen, B. (2016). Finding non-arbitrary form-meaning systematicity using string-metric learning for kernel regression. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Vo. 1: Long Papers).
Hahn, U., & Bailey, T. M. (2005). What makes words sound similar? Cognition, 97(3), 227–267.
Harris, Z. S. (1954). Distributional structure. Word, 10(2/3), 146–162.
Haslett, D. A., & Cai, Z. G. (2022a). New neighbours make bad fences: Form-based semantic shifts in word learning. Psychonomic Bulletin & Review, 29(3), 1017–1025.
Haslett, D. A., & Cai, Z. G. (2022b). Pressure for and against non-arbitrariness: Evidence from phonesthemes Proceedings of the Joint Conference on Language Evolution.
Haslett, D. A., & Cai, Z. G. (2022c). When words impinge: Overlapping sounds and meanings in low-frequency words. Architectures and Mechanisms for Language Processing, 28.
Haslett, D. A., & Cai, Z. G. (2023). Similar-sounding words flesh out fuzzy meanings. Journal of Experimental Psychology: General, 152(8), 2359–2368.
Hendrix, P., & Sun, C. C. (2021). A word or two about nonwords: Frequency, semantic neighborhood density, and orthography-to-semantics consistency effects for nonwords in the lexical decision task. Journal of Experimental Psychology: Learning, Memory, and Cognition, 47(1), 157.
Hopper, P. J. (1994). Phonogenesis. Perspectives on grammaticalization.
Howes, D. H., & Solomon, R. L. (1951). Visual duration threshold as a function of word-probability. Journal of Experimental Psychology, 41(6), 401.
Hutchins, S. S. (1998). The psychological reality, variability, and compositionality of English phonesthemes. Dissertation: Emory University.
Imai, M., Li, L., Haryu, E., Okada, H., Hirsh-Pasek, K., Golinkoff, R. M., & Shigematsu, J. (2008). Novel noun and verb learning in Chinese-, English-, and Japanese-speaking children. Child Development, 79(4), 979–1000.
Iwasaki, N., David, P. V., & Gabriella, V. (2007). What do English speakers know about Gera-Gera and yota-yota?: A cross-linguistic investigation of mimetic words of laughing and walking. Japanese Language Education Around the Globe, 17, 53–78.
Jee, H., Tamariz, M., & Shillcock, R. (2022). Exploring meaning-sound systematicity in Korean. Journal of East Asian Linguistics, 31(1), 45–71.
Johansson, N. E., Carr, J. W., & Kirby, S. (2021). Cultural evolution leads to vocal iconicity in an experimental iterated learning task. Journal of Language Evolution, 6(1), 1–25.
Joyce, J. (1922). Ulysses Shakespeare & Company.
Kantartzis, K., Imai, M., & Kita, S. (2011). Japanese sound-symbolism facilitates word learning in English-speaking children. Cognitive Science, 35(3), 575–586.
Katz, J. J. (1972). Semantic theory. Harper & Row.
Kelly, M. H. (1992). Using sound to solve syntactic problems: The role of phonology in grammatical category assignments. Psychological Review, 99(2), 349.
Kelly, M. H., & Bock, J. K. (1988). Stress in time. Journal of Experimental Psychology: Human Perception and Performance, 14(3), 389.
Kessler, B. (2005). Phonetic comparison algorithms 1. Transactions of the Philological Society, 103(2), 243–260.
Kirby, S., Cornish, H., & Smith, K. (2008). Cumulative cultural evolution in the laboratory: An experimental approach to the origins of structure in human language. Proceedings of the National Academy of Sciences, 105(31), 10681–10686.
Kirby, S., Griffiths, T., & Smith, K. (2014). Iterated learning and the evolution of language. Current Opinion in Neurobiology, 28, 108–114.
Kirby, S., Tamariz, M., Cornish, H., & Smith, K. (2015). Compression and communication in the cultural evolution of linguistic structure. Cognition, 141, 87–102.
Klima, E. S., & Bellugi, U. (1979). The signs of language. Harvard University Press.
Köhler, W. (1929). Gestalt psychology Liveright.
Kornai, A. (2002). How many words are there? Glottometrics, 4, 61–86.
Kumar, A. A., Lundin, N. B., & Jones, M. N. (2022). Mouse-mole-vole: The inconspicuous benefit of phonology during retrieval from semantic memory. Proceedings of the Annual Meeting of the Cognitive Science Society, 651–657.
Kwon, N. (2017). Empirically observed iconicity levels of English phonaesthemes. Public Journal of Semiotics, 7(2), 73–93.
Kwon, N., & Round, E. R. (2015). Phonaesthemes in morphological theory. Morphology, 25, 1–27.
Laing, C. E. (2014). A phonological analysis of onomatopoeia in early word production. First Language, 34(5), 387–405.
Landauer, T. K., & Dumais, S. T. (1997). A solution to Plato’s problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge. Psychological review, 104(2), 211.
Lapesa, G., Kawaletz, L., Plag, I., Andreou, M., Kisselew, M., & Padó, S. (2018). Disambiguation of newly derived nominalizations in context: A distributional semantics approach. Word Structure, 11(3), 277–312.
Lenci, A. (2018). Distributional models of word meaning. Annual Review of Linguistics, 4, 151–171.
Lenci, A., Sahlgren, M., Jeuniaux, P., Cuba Gyllensten, A., & Miliani, M. (2022). A comparative evaluation and analysis of three generations of distributional semantic models. Language Resources and Evaluation, 56(4), 1269–1313.
Levenshtein, V. I. (1966). Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics Doklady, 10(8), 707–710.
Lewis, M. L., & Frank, M. C. (2016). The length of words reflects their conceptual complexity. Cognition, 153, 182–195.
Li, W. (1992). Random texts exhibit Zipf's-law-like word frequency distribution. IEEE Transactions on Information Theory, 38(6), 1842–1845.
Liu, N. F., Levow, G. A., & Smith, N. A. (2018, June). Discovering phonesthemes with sparse regularization. Proceedings of the Second Workshop on Subword/Character LEvel Models (pp. 49–54).
Lund, K., & Burgess, C. (1996). Producing high-dimensional semantic spaces from lexical co-occurrence. Behavior Research Methods, Instruments, & Computers, 28(2), 203–208.
Lupyan, G., & Dale, R. (2010). Language structure is partly determined by social structure. PLOS ONE, 5(1), Article e8559.
Lupyan, G., & Winter, B. (2018). Language is more abstract than you think, or, why aren't languages more iconic? Philosophical Transactions of the Royal Society B: Biological Sciences, 373(1752), 20170137.
MacWhinney, B., Leinbach, J., Taraban, R., & McDonald, J. (1989). Language learning: Cues or rules? Journal of Memory and Language, 28(3), 255–277.
Magnus, M. (2000). What’s in a word? Evidence for phonosemantics. University of Trondheim.
Malkiel, Y. (1952). Old and new trends in Spanish linguistics. Studies in Philology, 49(3), 437–458.
Marelli, M., & Amenta, S. (2018). A database of orthography-semantics consistency (OSC) estimates for 15,017 English words. Behavior Research Methods, 50, 1482–1495.
Marelli, M., Amenta, S., & Crepaldi, D. (2015). Semantic transparency in free stems: The effect of orthography-semantics consistency on word recognition. Quarterly Journal of Experimental Psychology, 68(8), 1571–1583.
Marslen-Wilson, W., & Tyler, L. K. (1980). The temporal structure of spoken language understanding. Cognition, 8(1), 1–71.
McDonald, S. (2000). Environmental determinants of lexical processing effort. PhD thesis, University of Edinburgh.
McMahon, A. M. (1994). Understanding language change. Cambridge University Press.
Meylan, S. C., & Griffiths, T. L. (2021). The challenges of large-scale, web-based language datasets: Word length and predictability revisited. Cognitive Science, 45(6), Article e12983.
Mielke, S. J., Alyafeai, Z., Salesky, E., Raffel, C., Dey, M., Gallé, M., et al. (2021). Between words and characters: A brief history of open-vocabulary modeling and tokenization in NLP. arXiv preprint arXiv:2112.10508.
Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013a). Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781
Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., & Dean, J. (2013b). Distributed representations of words and phrases and their compositionality. Advances in Neural Information Processing Systems, 26.
Miller, D. G. (2014). English lexicogenesis. Oxford University Press.
Miller, G. A. (1957). Some effects of intermittent silence. The American Journal of Psychology, 70(2), 311–314.
Miller, G. A. (1995). WordNet: A lexical database for English. Communications of the ACM, 38(11), 39–41.
Monaghan, P., Chater, N., & Christiansen, M. H. (2005). The differential role of phonological and distributional cues in grammatical categorisation. Cognition, 96(2), 143–182.
Monaghan, P., Christiansen, M. H., & Fitneva, S. A. (2011). The arbitrariness of the sign: Learning advantages from the structure of the vocabulary. Journal of Experimental Psychology: General, 140(3), 325.
Monaghan, P., Mattock, K., & Walker, P. (2012). The role of sound symbolism in language learning. Journal of Experimental Psychology: Learning, Memory, and Cognition, 38(5), 1152.
Monaghan, P., Shillcock, R. C., Christiansen, M. H., & Kirby, S. (2014). How arbitrary is language? Philosophical Transactions of the Royal Society B: Biological Sciences, 369(1651), 20130299.
Morgan, J., Shi, R., & Allopenna, P. (1996). Perceptual bases of rudimentary grammatical categories. In J. Morgan & K. Demuth (Eds.), Signal to syntax (pp. 263–283). Erlbaum.
Nielsen, A. K. S. (2016). Systematicity, motivatedness, and the structure of the lexicon. PhD thesis, University of Edinburgh.
Nölle, J., Staib, M., Fusaroli, R., & Tylén, K. (2018). The emergence of systematicity: How environmental and communicative factors shape a novel communication system. Cognition, 181, 93–104.
OpenAI. (2023). GPT-4 technical report. 10.48550/arXiv.2303.08774
Otis, K., & Sagi, E. (2008). Phonaesthemes: A corpus-based analysis. In Proceedings of the Annual Meeting of the Cognitive Science Society, 30(30).
Pastizzo, M. J., & Feldman, L. B. (2009). Multiple dimensions of relatedness among words: Conjoint effects of form and meaning in word recognition. The Mental Lexicon, 4(1), 1–25.
Peirce, C. S. (1955). Philosophical writings of Peirce. Courier Corporation.
Pennington, J., Socher, R., & Manning, C. D. (2014, October). Glove: Global vectors for word representation. Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP) (pp. 1532–1543).
Perry, L. K., Perlman, M., & Lupyan, G. (2015). Iconicity in English and Spanish and its relation to lexical category and age of acquisition. PLOS ONE, 10(9), Article e0137147.
Perry, L. K., Perlman, M., Winter, B., Massaro, D. W., & Lupyan, G. (2018). Iconicity in the speech of children and adults. Developmental Science, 21(3), e12572.
Piantadosi, S. T. (2014). Zipf’s word frequency law in natural language: A critical review and future directions. Psychonomic Bulletin & Review, 21(5), 1112–1130.
Piantadosi, S. T., Tily, H., & Gibson, E. (2011). Word lengths are optimized for efficient communication. Proceedings of the National Academy of Sciences, 108(9), 3526–3529.
Pimentel, T., McCarthy, A. D., Blasi, D. E., Roark, B., & Cotterell, R. (2019). Meaning to form: Measuring systematicity as information. arXiv preprint arXiv:1906.05906.
Popescu, I. I., & Altmann, G. (2008). Hapax legomena and language typology. Journal of Quantitative Linguistics, 15(4), 370–378.
Postal, P. (1968). Aspects of phonological theory. Harper & Row.
Preston, K. A. (1935). The speed of word perception and its relation to reading ability. The Journal of General Psychology, 13(1), 199–203.
Radford, A., Wu, J., Child, R., Luan, D., Amodei, D., & Sutskever, I. (2019). Language models are unsupervised multitask learners. OpenAI blog, 1(8), 9.
Ramachandran, V. S., & Hubbard, E. M. (2001). Synaesthesia—A window into perception, thought and language. Journal of Consciousness Studies, 8(12), 3–34.
Rastle, K., Davis, M. H., Marslen-Wilson, W. D., & Tyler, L. K. (2000). Morphological and semantic effects in visual word recognition: A time-course study. Language and Cognitive Processes, 15(4/5), 507–537.
Rastle, K., Davis, M. H., & New, B. (2004). The broth in my brother’s brothel: Morpho-orthographic segmentation in visual word recognition. Psychonomic Bulletin & Review, 11, 1090–1098.
Raviv, L., & Arnon, I. (2018). Systematicity, but not compositionality: Examining the emergence of linguistic structure in children and adults using iterated learning. Cognition, 181, 160–173.
Raviv, L., Meyer, A., & Lev-Ari, S. (2019a). Larger communities create more systematic languages. Proceedings of the Royal Society B, 286(1907), 20191262.
Raviv, L., Meyer, A., & Lev-Ari, S. (2019b). Compositional structure can emerge without generational transmission. Cognition, 182, 151–164.
Raviv, L., de Heer Kloots, M., & Meyer, A. (2021). What makes a language easy to learn? A preregistered study on how systematic structure and community size affect language learnability. Cognition, 210, 104620.
Redington, M., Chater, N., & Finch, S. (1998). Distributional information: A powerful cue for acquiring syntactic categories. Cognitive Science, 22(4), 425–469.
Reilly, J., Hung, J., & Westbury, C. (2017). Non-arbitrariness in mapping word form to meaning: Cross-linguistic formal markers of word concreteness. Cognitive Science, 41(4), 1071–1089.
Reilly, J., Westbury, C., Kean, J., & Peelle, J. E. (2012). Arbitrary symbolism in natural language revisited: When word forms carry meaning. PLOS ONE, 7(8), article e42286. https://doi.org/10.1371/journal.pone.0042286
Rhodes, R. (1994). Aural images. In J. Ohala et al. (Eds.), Sound symbolism (pp. 276–292). Cambridge University Press.
Sahlgren, M. (2006). The word-space model: Using distributional analysis to represent syntagmatic and paradigmatic relations between words in high-dimensional vector spaces. PhD thesis, Institutionen för lingvistik.
Sahlgren, M. (2008). The distributional hypothesis. Italian Journal of Disability Studies, 20, 33–53.
Sahlgren, M., & Lenci, A. (2016). The effects of data size and frequency range on distributional semantic models. arXiv preprint arXiv:1609.08293
Salle, A., & Villavicencio, A. (2019). Why so down? The role of negative (and positive) pointwise mutual information in distributional semantics. arXiv preprint arXiv:1908.06941
Samarin, W. J. (1970). Inventory and choice in expressive language. Word, 26(2), 153–169.
Schatz, E. K., & Baldwin, R. S. (1986). Context clues are unreliable predictors of word meanings. Reading Research Quarterly, 21(4), 439–453.
Schick, T., & Schütze, H. (2020). Rare words: A major problem for contextualized embeddings and how to fix it by attentive mimicking. Proceedings of the AAAI Conference on Artificial Intelligencem, 34(05), 8766–8774.
Selten, R., & Warglien, M. (2007). The emergence of simple languages in an experimental coordination game. Proceedings of the National Academy of Sciences, 104(18), 7361–7366.
Sennrich, R., Haddow, B., & Birch, A. (2016). Neural machine translation of rare words with subword units. arXiv preprint arXiv:1508.07909.
Shannon, C. E. (1948). A mathematical theory of communication. The Bell System Technical Journal, 27(3), 379–423.
Sherman, D. (1975). Noun-verb stress alternation: An example of the lexical diffusion of sound chance in English. Linguistics, 13(159). https://doi.org/10.1515/ling.1975.13.159.43
Shi, R. (1995). Perceptual correlates of content words and function words in early language input. Brown University.
Shi, R., Werker, J., & Morgan, J. (1998). Young infants’ perception of lexical and functional categories. Infant Behavior and Development, (21), 683.
Shi, R., Werker, J. F., & Morgan, J. L. (1999). Newborn infants’ sensitivity to perceptual cues to lexical and grammatical words. Cognition, 72(2), B11–B21.
Shillcock, R., Kirby, S., McDonald, S., & Brew, C. (2001). Filled pauses and their status in the mental lexicon. ISCA tutorial and research workshop (ITRW) on Disfluency in spontaneous speech.
Sidhu, D. M., & Pexman, P. M. (2018a). Five mechanisms of sound symbolic association. Psychonomic Bulletin & Review, 25, 1619–1643.
Sidhu, D. M., & Pexman, P. M. (2018b). Lonely sensational icons: Semantic neighbourhood density, sensory experience and iconicity. Language, Cognition and Neuroscience, 33(1), 25–31.
Sidhu, D. M., Vigliocco, G., & Pexman, P. M. (2020). Effects of iconicity in lexical decision. Language and Cognition, 12(1), 164–181.
Silvey, C., Kirby, S., & Smith, K. (2015). Word meanings evolve to selectively preserve distinctions on salient dimensions. Cognitive Science, 39(1), 212–226.
Smith, C. A. (2014). The phonesthetics of blends: A lexicographic study of cognitive blends in the OED. Explorations in English Language and Linguistics, 2(1), 12–45.
Smith, C. A. (2016). Tracking semantic change in fl-monomorphemes in the Oxford English dictionary. Journal of Historical Linguistics, 6(2), 165–200.
Smith, K., Kirby, S., & Brighton, H. (2003). Iterated learning: A framework for the emergence of language. Artificial Life, 9(4), 371–386.
Strik Lievers, F., Bolognesi, M., & Winter, B. (2021). The linguistic dimensions of concrete and abstract concepts: Lexical category, morphological structure, countability, and etymology. Cognitive Linguistics, 32(4), 641–670.
St. Clair, M. C., Monaghan, P., & Ramscar, M. (2009). Relationships between language structure and language learning: The suffixing preference and grammatical categorization. Cognitive Science, 33(7), 1317–1329.
Tamariz, M. (2008). Exploring systematicity between phonological and context-cooccurrence representations of the mental lexicon. The Mental Lexicon, 3(2), 259–278.
Tamariz, M., Roberts, S. G., Martínez, J. I., & Santiago, J. (2018). The interactive origin of iconicity. Cognitive Science, 42(1), 334–349.
Tamer, N., & Widmer, P. (2022). The prevalence of systematicity in indo-European languages. Paper presented at the joint conference on language evolution. Kanazawa.
Thompson, A. L., & Do, Y. (2019). Defining iconicity: An articulation-based methodology for explaining the phonological structure of ideophones. Glossa: A Journal of General Linguistics, 4(1), 72.
Thompson, B., Perlman, M., Lupyan, G., Sehyr, Z. S., & Emmorey, K. (2020). A data-driven approach to the semantics of iconicity in American Sign Language and English. Language and Cognition, 12(1), 182–202.
Trask, R. L. (2000). The dictionary of historical and comparative linguistics. Psychology Press.
Van Heuven, W. J., Mandera, P., Keuleers, E., & Brysbaert, M. (2014). SUBTLEX-UK: A new and improved word frequency database for British English. Quarterly Journal of Experimental Psychology, 67(6), 1176–1190.
Vania, C., & Lopez, A. (2017). From characters to words to in between: Do we capture morphology?. arXiv preprint arXiv:1704.08352
Vinson, D. P., Cormier, K., Denmark, T., Schembri, A., & Vigliocco, G. (2008). The British sign language (BSL) norms for age of acquisition, familiarity, and iconicity. Behavior Research Methods, 40(4), 1079–1087.
Vinson, D., Jones, M., Sidhu, D. M., Lau-Zhu, A., Santiago, J., & Vigliocco, G. (2021). Iconicity emerges and is maintained in spoken language. Journal of Experimental Psychology: General, 150(11), 2293.
Webb, S., & Rodgers, M. P. (2009). The lexical coverage of movies. Applied Linguistics, 30(3), 407–427.
Winter, B., Lupyan, G., Perry, L. K., Dingemanse, M., & Perlman, M. (2023). Iconicity ratings for 14,000+ English words. Behavior Research Methods, 1–16. https://doi.org/10.3758/s13428-023-02112-6
Winter, B., Pérez-Sobrino, P., & Brown, L. (2019). The sound of soft alcohol: Crossmodal associations between interjections and liquor. PLOS ONE, 14(8), article e0220449.
Winter, B., & Perlman, M. (2021). Size sound symbolism in the English lexicon. Glossa: A Journal of General Linguistics, 6(1).
Winter, B., Perlman, M., Perry, L. K., & Lupyan, G. (2017). Which words are most iconic? Iconicity in English sensory words. Interaction Studies, 18(3), 443–464.
Wolf, T., Debut, L., Sanh, V., Chaumond, J., Delangue, C., Moi, A., et al. (2019). HuggingFace’s Transformers: State-of-the-art natural language processing. arXiv preprint arXiv:1910.03771.
Zipf, G. (1936). The psychobiology of language. Routledge.
Zipf, G. (1949). The principle of least effort. Hafner Publishing Company.
Zwicky, A. M., & Pullum, G. K. (1987). Plain morphology and expressive morphology. Annual Meeting of the Berkeley Linguistics Society, 13, 330–340.
Author note
Many thanks to Bodo Winter for helpful comments on an earlier draft.
This research was supported by a PhD fellowship from the Hong Kong Research Grant Council (to D.A.H.) and a General Research Fund grant (project number: 14600220) from the Hong Kong Research Grant Council (to Z.G.C.).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest
We have no conflicts of interest to disclose.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Haslett, D.A., Cai, Z.G. Systematic mappings of sound to meaning: A theoretical review. Psychon Bull Rev (2023). https://doi.org/10.3758/s13423-023-02395-y
Accepted:
Published:
DOI: https://doi.org/10.3758/s13423-023-02395-y