Word Meaning and Value

Tanaka-Ishii, Kumiko

doi:10.1007/978-3-030-59377-3_12

Kumiko Tanaka-Ishii¹¹

Part of the book series: Mathematics in Mind ((MATHMIN))

668 Accesses

Abstract

The previous chapter suggested the possibility that linguistic units that are atomically inseparable, such as words, partly derive from statistical universals of language. Harris’s hypothesis, however, only indicates how words are acquired and does not say anything about their meanings. Thus far, this book has not considered the notion of meaning at all, and the question of what is meaning is one of the most difficult in human history. It nevertheless seems that when we hear a sequence of words, we apprehend an image of its meaning. Hence, this chapter considers how the statistical universals could contribute to the meanings of words.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

eBook: USD 16.99; Price excludes VAT (USA)

Softcover Book: USD 16.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
The original ratings in the MRC list ranged from 100 to 700, so the scores were divided by 100 here for consistency with the Amano list. Furthermore, the two lists differ in that the Amano list contains only content words, whereas the MRC list also has some functional words. Note that functional words need not be equally familiar. For example, the familiarity of “must” is clearly higher than that of “ought.” The functional words in the MRC list do not necessarily include the most frequently used functional words, such as “and” and “to.”
2.
This collection was crawled from the Internet in 2006 (Tanaka-Ishii and Terada, 2011). All the markup tags were eliminated, and texts in English and Japanese were extracted. For English, 265,823,502 pages were scanned to obtain 1.9 terabytes of text data without tags. For Japanese, 12,751,271 pages were scanned to obtain 69 gigabytes of text data without tags. Chapter 22 further describes these corpora.
3.
This figure appeared in Tanaka-Ishii and Terada (2011).
4.
The figure shows the Pearson correlation coefficient because it is the most well-known such coefficient. Some readers might know that this coefficient measures the linearity of the relationship between two variables. Looking at the plots, however, we can see a nonlinearity that would be better measured by Spearman’s correlation, which was 0.74 for the English data and 0.49 for the Japanese data.
5.
The example words were changed for clarity.
6.
The proof in Tian et al. (2017) assumes multiple conditions, such as the proportionality of the co-occurrence and frequency also following a power law. Such a condition is not guaranteed to hold for natural language, in relation to the bias discussed in Chap. 5.
7.
As Sect. 15.2 will explain, a power distribution has the characteristic of remaining a power distribution through various kinds of transformations.

References

Amano, Shigeaki and Kondo, Kimihisa (2000). On the NTT psycholinguistic databases : Lexical properties of Japanese. Journal of the Phonetic Society of Japan, 4(2), 44–50.
Google Scholar
Baayen, R. Harald and Lieber, Rochelle (1996). Word frequency distributions and lexical semantics. Computers and the Humanities, 30, 281–291.
Google Scholar
Beaney, Michael (1997). The Frege Reader, pages 151–180. Blackwell Publishing.
Google Scholar
Connine, Cynthia M., Mullennix, John, Shernoff, Eve, and Yelen, Jennifer (1990). Word familiarity and frequency in visual and auditory word recognition. Journal of Experimental Psychology: Learning Memory and Cognition, 16(6), 1084–1096.
Google Scholar
Dupoux, Emmanuel and Mehler, Jacques (1990). Monitoring the lexicon with normal and compressed speech : Frequency effects and the prelexical code. Journal of Memory and Language, 29, 316–335.
Article Google Scholar
Fechner, Gustav T. (1860). Elements of psychophysics, volume 1. Holt, Rinehard and Winston. Elemente der Psychophysik Eds. Howes, D.H., Boring, E.G., translated by H.E.Adler, published in 1966.
Google Scholar
Frege, Gottlob (1892). Über Sinn und Bedeutung, pages 25–50. Zeitschrift für Philosophie und Philosophische Kritik, Vol. 100.
Google Scholar
Harris, Zellig S. (1954). Distributional structure. Word, 10(2–3), 146–162.
Article Google Scholar
Heidelberger, Michael (1993). Nature from Within: Gustav Theodor Fechner and His Psychophysical Worldview. Vittorio Klostermann GmbH.
Google Scholar
Marslen-Wilson, William D. (1990). Activation, Competition, and Frequency in Lexical Access. Cognitive Models of Speech Processing, pages 148–172. The MIT Press.
Google Scholar
MRC Psycholinguistic Database (1987). http://websites.psychology.uwa.edu.au/school/MRCDatabase/uwa_mrc.htm, accessed in October 2020.
Segui, Juan, Mehler, Jacques, Frauenfelder, Uli, and Morton, John (1982). The word frequency effect and lexical access. Neuropsychologia, 20, 615–627.
Article Google Scholar
Shannon, Claude E. (1951). Prediction and entropy of printed English. The Bell System Technical Journal, 30, 50–64.
Article MATH Google Scholar
Tanaka-Ishii, Kumiko and Terada, Hiroshi (2011). Word familiarity and frequency. Studia Linguistica, 65(1), 96–116.
Article Google Scholar
Tian, Ran (2020). Semantic space of language. Mathematics Seminar, 701, 25–29. in Japanese.
Google Scholar
Tian, Ran, Okazaki, Naoyuki, and Inui, Kentaro (2017). The mechanism of additive composition. Machine Learning, 106(7), 1083–1130.
Article MathSciNet MATH Google Scholar
Zipf, George K. (1945). The meaning-frequency relationship of words. The Journal of General Psychology, 33, 251–256.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Research Center for Advanced Science and Technology (RCAST), The University of Tokyo, Tokyo, Japan
Kumiko Tanaka-Ishii

Authors

Kumiko Tanaka-Ishii
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Tanaka-Ishii, K. (2021). Word Meaning and Value. In: Statistical Universals of Language. Mathematics in Mind. Springer, Cham. https://doi.org/10.1007/978-3-030-59377-3_12

Download citation

DOI: https://doi.org/10.1007/978-3-030-59377-3_12
Published: 02 April 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-59376-6
Online ISBN: 978-3-030-59377-3
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics