Skip to main content

Word Meaning and Value

  • Chapter
  • First Online:
Statistical Universals of Language

Part of the book series: Mathematics in Mind ((MATHMIN))

  • 668 Accesses

Abstract

The previous chapter suggested the possibility that linguistic units that are atomically inseparable, such as words, partly derive from statistical universals of language. Harris’s hypothesis, however, only indicates how words are acquired and does not say anything about their meanings. Thus far, this book has not considered the notion of meaning at all, and the question of what is meaning is one of the most difficult in human history. It nevertheless seems that when we hear a sequence of words, we apprehend an image of its meaning. Hence, this chapter considers how the statistical universals could contribute to the meanings of words.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 16.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    The original ratings in the MRC list ranged from 100 to 700, so the scores were divided by 100 here for consistency with the Amano list. Furthermore, the two lists differ in that the Amano list contains only content words, whereas the MRC list also has some functional words. Note that functional words need not be equally familiar. For example, the familiarity of “must” is clearly higher than that of “ought.” The functional words in the MRC list do not necessarily include the most frequently used functional words, such as “and” and “to.”

  2. 2.

    This collection was crawled from the Internet in 2006 (Tanaka-Ishii and Terada, 2011). All the markup tags were eliminated, and texts in English and Japanese were extracted. For English, 265,823,502 pages were scanned to obtain 1.9 terabytes of text data without tags. For Japanese, 12,751,271 pages were scanned to obtain 69 gigabytes of text data without tags. Chapter 22 further describes these corpora.

  3. 3.

    This figure appeared in Tanaka-Ishii and Terada (2011).

  4. 4.

    The figure shows the Pearson correlation coefficient because it is the most well-known such coefficient. Some readers might know that this coefficient measures the linearity of the relationship between two variables. Looking at the plots, however, we can see a nonlinearity that would be better measured by Spearman’s correlation, which was 0.74 for the English data and 0.49 for the Japanese data.

  5. 5.

    The example words were changed for clarity.

  6. 6.

    The proof in Tian et al. (2017) assumes multiple conditions, such as the proportionality of the co-occurrence and frequency also following a power law. Such a condition is not guaranteed to hold for natural language, in relation to the bias discussed in Chap. 5.

  7. 7.

    As Sect. 15.2 will explain, a power distribution has the characteristic of remaining a power distribution through various kinds of transformations.

References

  • Amano, Shigeaki and Kondo, Kimihisa (2000). On the NTT psycholinguistic databases : Lexical properties of Japanese. Journal of the Phonetic Society of Japan, 4(2), 44–50.

    Google Scholar 

  • Baayen, R. Harald and Lieber, Rochelle (1996). Word frequency distributions and lexical semantics. Computers and the Humanities, 30, 281–291.

    Google Scholar 

  • Beaney, Michael (1997). The Frege Reader, pages 151–180. Blackwell Publishing.

    Google Scholar 

  • Connine, Cynthia M., Mullennix, John, Shernoff, Eve, and Yelen, Jennifer (1990). Word familiarity and frequency in visual and auditory word recognition. Journal of Experimental Psychology: Learning Memory and Cognition, 16(6), 1084–1096.

    Google Scholar 

  • Dupoux, Emmanuel and Mehler, Jacques (1990). Monitoring the lexicon with normal and compressed speech : Frequency effects and the prelexical code. Journal of Memory and Language, 29, 316–335.

    Article  Google Scholar 

  • Fechner, Gustav T. (1860). Elements of psychophysics, volume 1. Holt, Rinehard and Winston. Elemente der Psychophysik Eds. Howes, D.H., Boring, E.G., translated by H.E.Adler, published in 1966.

    Google Scholar 

  • Frege, Gottlob (1892). Ăśber Sinn und Bedeutung, pages 25–50. Zeitschrift fĂĽr Philosophie und Philosophische Kritik, Vol. 100.

    Google Scholar 

  • Harris, Zellig S. (1954). Distributional structure. Word, 10(2–3), 146–162.

    Article  Google Scholar 

  • Heidelberger, Michael (1993). Nature from Within: Gustav Theodor Fechner and His Psychophysical Worldview. Vittorio Klostermann GmbH.

    Google Scholar 

  • Marslen-Wilson, William D. (1990). Activation, Competition, and Frequency in Lexical Access. Cognitive Models of Speech Processing, pages 148–172. The MIT Press.

    Google Scholar 

  • MRC Psycholinguistic Database (1987). http://websites.psychology.uwa.edu.au/school/MRCDatabase/uwa_mrc.htm, accessed in October 2020.

  • Segui, Juan, Mehler, Jacques, Frauenfelder, Uli, and Morton, John (1982). The word frequency effect and lexical access. Neuropsychologia, 20, 615–627.

    Article  Google Scholar 

  • Shannon, Claude E. (1951). Prediction and entropy of printed English. The Bell System Technical Journal, 30, 50–64.

    Article  MATH  Google Scholar 

  • Tanaka-Ishii, Kumiko and Terada, Hiroshi (2011). Word familiarity and frequency. Studia Linguistica, 65(1), 96–116.

    Article  Google Scholar 

  • Tian, Ran (2020). Semantic space of language. Mathematics Seminar, 701, 25–29. in Japanese.

    Google Scholar 

  • Tian, Ran, Okazaki, Naoyuki, and Inui, Kentaro (2017). The mechanism of additive composition. Machine Learning, 106(7), 1083–1130.

    Article  MathSciNet  MATH  Google Scholar 

  • Zipf, George K. (1945). The meaning-frequency relationship of words. The Journal of General Psychology, 33, 251–256.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s)

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Tanaka-Ishii, K. (2021). Word Meaning and Value. In: Statistical Universals of Language. Mathematics in Mind. Springer, Cham. https://doi.org/10.1007/978-3-030-59377-3_12

Download citation

Publish with us

Policies and ethics