Skip to main content

Grammatical Structure and Long Memory

  • Chapter
  • First Online:
Statistical Universals of Language

Part of the book series: Mathematics in Mind ((MATHMIN))

  • 675 Accesses

Abstract

The previous chapters in Part IV mainly considered the relation between the statistical universals and elements of language. This chapter instead considers the relation between long memory and the structure of sentences. Part III explained the long memory underlying language, which is characterized by clustering phenomena among linguistic events, such that words occur in a clustered manner. Such clustering phenomena have been reported in many complex systems, and their appearance in language is intuitively comprehensible as being caused by shifts in context.

The original version of this chapter was revised. The correction to this chapter is available at https://doi.org/10.1007/978-3-030-59377-3_23

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 16.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Change history

  • 29 November 2022

    The original version of the chapter “Language as a Complex System” was previously published without updating missing reference in footnote 4, page 22. This change has now been included and the chapter and the book have been updated with the change.

Notes

  1. 1.

    Symbols such as S and VP appearing in the typewriter font indicate the inner nodes of the tree and represent grammatical elements such as a sentence (S) and verb phrase (VP). The linguistic meanings of each symbol are not necessary to explain the main point of this chapter.

  2. 2.

    The “S” of basic word order and “S” of CFG use the same symbol (with different fonts), but they are different. The former signifies the subject, whereas the latter signifies a sentence.

  3. 3.

    There is a fundamental relation between a context-free grammar and a dependency grammar, via the notion of head explained in the main text. A sentence structure description by the former framework, however, has more information than that by the latter. A context-free grammar can almost be transformed into a dependency grammar (Buchholz, 2002), but the reverse is difficult (Kong et al., 2015; Fernández-González and Martins, 2015) because of the lack of grammatical information for the inner nodes of the tree. The latter grammar has the advantage of simplicity, and numerous treebanks have been constructed accordingly; the CoNLL database is a representative example.

  4. 4.

    Note that ∗ at the beginning of a linguistic example indicates that it is grammatically incorrect.

  5. 5.

    The field of linguistics thoroughly studies such regularities and irregularities of word order, as in the database presented in Haspelmath et al. (2005), called the World Atlas of Language Structures (WALS).

  6. 6.

    The original article (Lin and Tegmark, 2017) showed that \(k \propto \log \phi \), where ϕ is the second largest eigenvalue of a probabilistic transition matrix M that generates the sequence, as such a matrix’s largest eigenvalue is 1.

  7. 7.

    ε = 0.00565, with a few negative values at large s > 500.

References

  • Buchholz, Sabine N. (2002). Memory-Based Grammatical Relation Finding. Eigen beheer. Doctoral Thesis.

    Google Scholar 

  • Chomsky, Noam (1957). Syntactic Structures. Mouton & Co.

    Google Scholar 

  • Dryer, Matthew S. (1988). Object-verb order and adjective-noun order: dispelling a myth. Lingua, 74, 185–217.

    Article  Google Scholar 

  • Dryer, Matthew S. (1992). The Greenbergian word order correlations. Language, 68(1), 81–138.

    Article  Google Scholar 

  • Fernández-González, Daniel and Martins, André F. T. (2015). Parsing as reduction. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing Volume 1 : Long Papers, pages 1523–1533. Association for Computational Linguistics.

    Google Scholar 

  • Greenberg, Joseph H. (1963). Universals of Language. The MIT Press. In the chapter entitled “Some universals of grammar with particular reference to the order of meaningful elements”, pages 73–113.

    Google Scholar 

  • Haspelmath, Martin, Dryer, Matthew S., Gil, David, and Comrie, Bernard (2005). The World Atlas of Language Structures. Oxford University Press.

    Google Scholar 

  • Hawkins, John A. (1990). A parsing theory of word order universals. Linguistic Inquiry, 21(2), 223–261.

    Google Scholar 

  • Kamp, Hans and Reyle, Uwe (1993). From Discourse to Logic. Springer.

    Google Scholar 

  • Kong, Lingpeng, Rush, Alexander M., and Smith, Noah A. (2015). Transforming dependencies into phrase structures. Proceedings of the 2015 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pages 788–798.

    Google Scholar 

  • Lin, Henry W. and Tegmark, Max (2017). Critial behavior in physics and probabilistic formal languages. Entropy, 19(7):299.

    Article  Google Scholar 

  • Marcus, Mitchell, Kim, Grace, Marcinkiewicz, Mary A., Macintyre, Robert, Bies, Ann, Ferguson, Mark, Katz, Karen, and Schasberger, Britta (1994). The Penn Treebank: Annotating predicate argument structure. HLT’94 Proceedings of the Workshop on Human Language Technology, volume 6, pages 114–119.

    Google Scholar 

  • Marcus, Mitchell P., Santorini, Beatrice, and Marcinkiewicz, Mary Ann (1993). Building a large annotated corpus of English: the Penn Treebank. Computational Linguistics, 19(2), 313–330.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kumiko Tanaka-Ishii .

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s)

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Tanaka-Ishii, K. (2021). Grammatical Structure and Long Memory. In: Statistical Universals of Language. Mathematics in Mind. Springer, Cham. https://doi.org/10.1007/978-3-030-59377-3_14

Download citation

Publish with us

Policies and ethics