Abstract
The previous chapter suggested the possibility that linguistic units that are atomically inseparable, such as words, partly derive from statistical universals of language. Harris’s hypothesis, however, only indicates how words are acquired and does not say anything about their meanings. Thus far, this book has not considered the notion of meaning at all, and the question of what is meaning is one of the most difficult in human history. It nevertheless seems that when we hear a sequence of words, we apprehend an image of its meaning. Hence, this chapter considers how the statistical universals could contribute to the meanings of words.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
The original ratings in the MRC list ranged from 100 to 700, so the scores were divided by 100 here for consistency with the Amano list. Furthermore, the two lists differ in that the Amano list contains only content words, whereas the MRC list also has some functional words. Note that functional words need not be equally familiar. For example, the familiarity of “must” is clearly higher than that of “ought.” The functional words in the MRC list do not necessarily include the most frequently used functional words, such as “and” and “to.”
- 2.
This collection was crawled from the Internet in 2006 (Tanaka-Ishii and Terada, 2011). All the markup tags were eliminated, and texts in English and Japanese were extracted. For English, 265,823,502 pages were scanned to obtain 1.9 terabytes of text data without tags. For Japanese, 12,751,271 pages were scanned to obtain 69 gigabytes of text data without tags. Chapter 22 further describes these corpora.
- 3.
This figure appeared in Tanaka-Ishii and Terada (2011).
- 4.
The figure shows the Pearson correlation coefficient because it is the most well-known such coefficient. Some readers might know that this coefficient measures the linearity of the relationship between two variables. Looking at the plots, however, we can see a nonlinearity that would be better measured by Spearman’s correlation, which was 0.74 for the English data and 0.49 for the Japanese data.
- 5.
The example words were changed for clarity.
- 6.
- 7.
As Sect. 15.2 will explain, a power distribution has the characteristic of remaining a power distribution through various kinds of transformations.
References
Amano, Shigeaki and Kondo, Kimihisa (2000). On the NTT psycholinguistic databases : Lexical properties of Japanese. Journal of the Phonetic Society of Japan, 4(2), 44–50.
Baayen, R. Harald and Lieber, Rochelle (1996). Word frequency distributions and lexical semantics. Computers and the Humanities, 30, 281–291.
Beaney, Michael (1997). The Frege Reader, pages 151–180. Blackwell Publishing.
Connine, Cynthia M., Mullennix, John, Shernoff, Eve, and Yelen, Jennifer (1990). Word familiarity and frequency in visual and auditory word recognition. Journal of Experimental Psychology: Learning Memory and Cognition, 16(6), 1084–1096.
Dupoux, Emmanuel and Mehler, Jacques (1990). Monitoring the lexicon with normal and compressed speech : Frequency effects and the prelexical code. Journal of Memory and Language, 29, 316–335.
Fechner, Gustav T. (1860). Elements of psychophysics, volume 1. Holt, Rinehard and Winston. Elemente der Psychophysik Eds. Howes, D.H., Boring, E.G., translated by H.E.Adler, published in 1966.
Frege, Gottlob (1892). Über Sinn und Bedeutung, pages 25–50. Zeitschrift für Philosophie und Philosophische Kritik, Vol. 100.
Harris, Zellig S. (1954). Distributional structure. Word, 10(2–3), 146–162.
Heidelberger, Michael (1993). Nature from Within: Gustav Theodor Fechner and His Psychophysical Worldview. Vittorio Klostermann GmbH.
Marslen-Wilson, William D. (1990). Activation, Competition, and Frequency in Lexical Access. Cognitive Models of Speech Processing, pages 148–172. The MIT Press.
MRC Psycholinguistic Database (1987). http://websites.psychology.uwa.edu.au/school/MRCDatabase/uwa_mrc.htm, accessed in October 2020.
Segui, Juan, Mehler, Jacques, Frauenfelder, Uli, and Morton, John (1982). The word frequency effect and lexical access. Neuropsychologia, 20, 615–627.
Shannon, Claude E. (1951). Prediction and entropy of printed English. The Bell System Technical Journal, 30, 50–64.
Tanaka-Ishii, Kumiko and Terada, Hiroshi (2011). Word familiarity and frequency. Studia Linguistica, 65(1), 96–116.
Tian, Ran (2020). Semantic space of language. Mathematics Seminar, 701, 25–29. in Japanese.
Tian, Ran, Okazaki, Naoyuki, and Inui, Kentaro (2017). The mechanism of additive composition. Machine Learning, 106(7), 1083–1130.
Zipf, George K. (1945). The meaning-frequency relationship of words. The Journal of General Psychology, 33, 251–256.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s)
About this chapter
Cite this chapter
Tanaka-Ishii, K. (2021). Word Meaning and Value. In: Statistical Universals of Language. Mathematics in Mind. Springer, Cham. https://doi.org/10.1007/978-3-030-59377-3_12
Download citation
DOI: https://doi.org/10.1007/978-3-030-59377-3_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-59376-6
Online ISBN: 978-3-030-59377-3
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)