Abstract
Many recent studies have demonstrated the influence of sublexical frequency measures on language processing or have called for controlling sublexical measures when selecting stimulus material for psycholinguistic studies (Aichert & Ziegler, 2005). The present study discusses which measures should be controlled for in which kinds of study, and presents type and token frequency measures for orthographic and phonological syllables, dual units (bigrams and biphonemes), and single units (letters and phonemes), derived from the lemma and word form corpora of the CELEX lexical database (Baayen, Piepenbrock, & Gulikers, 1995). Additionally, we present the SUBLEX software as an adaptive tool for calculating sublexical frequency measures and discuss possible future applications. The measures and the software can be downloaded at www.psychonomic.org.
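The distinction between type and token frequency can be illustrated with a minimal sketch. It is not the SUBLEX implementation and does not read the CELEX file format: the function names, the toy lexicon, and its frequency values are hypothetical. It also assumes that a unit is counted once per word, so that type frequency is the number of words containing the unit and token frequency is the summed corpus frequency of those words.

```python
from collections import Counter


def ngrams(word, n):
    """Return all contiguous n-grams of a word (n=1: letters, n=2: bigrams)."""
    return [word[i:i + n] for i in range(len(word) - n + 1)]


def sublexical_frequencies(lexicon, n):
    """Compute type and token frequencies of n-gram units.

    lexicon: dict mapping word -> corpus frequency (hypothetical toy format).
    Assumption: each unit is counted once per word, even if it occurs
    several times within that word.
    """
    type_freq, token_freq = Counter(), Counter()
    for word, freq in lexicon.items():
        for unit in set(ngrams(word, n)):
            type_freq[unit] += 1      # one more word contains this unit
            token_freq[unit] += freq  # weight by the word's corpus frequency
    return type_freq, token_freq


# Toy lexicon with invented frequencies, for illustration only.
toy_lexicon = {"haus": 120, "maus": 30, "hase": 15}

bigram_type, bigram_token = sublexical_frequencies(toy_lexicon, 2)
print(bigram_type["au"], bigram_token["au"])  # prints: 2 150
```

The same function yields single-unit (letter) measures with n=1; phonological measures would need phonemic transcriptions, and syllable measures a syllabified lexicon, as input instead of orthographic strings.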
References
Aichert, I., & Ziegler, W. (2004). Syllable frequency and syllable structure in apraxia of speech. Brain & Language, 88, 148–159.
Aichert, I., & Ziegler, W. (2005). Is there a need to control for sublexical frequencies? Brain & Language, 95, 170–171.
Alameda, J. R., & Cuetos, F. (1995). Diccionario de frecuencia de las unidades lingüísticas del castellano (Vols. 1 & 2). Oviedo: Universidad de Oviedo.
Ans, B., Carbonnel, S., & Valdois, S. (1998). A connectionist multiple-trace memory model for polysyllabic word reading. Psychological Review, 105, 678–723.
Baayen, R. H., Dijkstra, T., & Schreuder, R. (1997). Singulars and plurals in Dutch: Evidence for a parallel dual-route model. Journal of Memory & Language, 37, 94–117.
Baayen, R. H., Piepenbrock, R., & Gulikers, L. (1995). The CELEX lexical database [CD-ROM]. Philadelphia: Linguistic Data Consortium, University of Pennsylvania.
Bailey, T. M., & Hahn, U. (2001). Determinants of wordlikeliness: Phonotactics or lexical neighborhoods? Journal of Memory & Language, 44, 568–591.
Barber, H., Vergara, M., & Carreiras, M. (2004). Syllable-frequency effects in visual word recognition: Evidence from ERPs. NeuroReport, 15, 545–548.
Carreiras, M., Álvarez, C. J., & De Vega, M. (1993). Syllable frequency and visual word recognition in Spanish. Journal of Memory & Language, 32, 766–780.
Carreiras, M., & Perea, M. (2004). Effects of syllable neighbourhood frequency in visual word recognition and reading: Cross-task comparisons. In L. Ferrand & J. Grainger (Eds.), Psycholinguistique cognitive. Bruxelles: De Boeck Université.
Cholin, J., Levelt, W. J. M., & Schiller, N. O. (2006). Effects of syllable frequency in speech production. Cognition, 99, 205–235.
Clahsen, H. (1999). Lexical entries and rules of language: A multidisciplinary study of German inflection. Behavioral & Brain Sciences, 22, 991–1060.
Coltheart, M., Rastle, K., Perry, C., Langdon, R., & Ziegler, J. (2001). DRC: A dual route cascaded model of visual word recognition and reading aloud. Psychological Review, 108, 204–256.
Conrad, M., Carreiras, M., & Jacobs, A. M. (in press-a). Contrasting effects of token and type syllable frequency in lexical decision. Language & Cognitive Processes.
Conrad, M., Carreiras, M., & Jacobs, A. M. (in press-b). Syllable frequency and orthographic redundancy evidence for two different processing mechanisms in visual word recognition. Journal of Experimental Psychology: Human Perception & Performance.
Conrad, M., Grainger, J., & Jacobs, A. (2007). Phonology as the source of syllable frequency effects in visual word recognition: Evidence from French. Memory & Cognition, 35, 974–983.
Conrad, M., & Jacobs, A. M. (2004). Replicating syllable frequency effects in Spanish in German: One more challenge to computational models of visual word recognition. Language & Cognitive Processes, 19, 369–390.
Conrad, M., Stenneken, P., & Jacobs, A. M. (2006). Associated or dissociated effects of syllable-frequency in lexical decision and naming. Psychonomic Bulletin & Review, 13, 339–345.
Davis, C., & Perea, M. (2005). BuscaPalabras: A program for deriving orthographic and phonological neighborhood statistics and other psycholinguistic indices in Spanish. Behavior Research Methods, 37, 665–671.
Dehaene, S., Cohen, L., Sigman, M., & Vinckier, F. (2005). The neural code for written words: A proposal. Trends in Cognitive Sciences, 9, 335–341.
De Jong, N. H., Schreuder, R., & Baayen, R. H. (2000). The morphological family size effect and morphology. Language & Cognitive Processes, 15, 329–365.
del Prado Martín, F. M., Ernestus, M., & Baayen, R. H. (2004). Do type and token reflect different mechanisms? Connectionist modeling of Dutch past-tense formation and final devoicing. Brain & Language, 90, 287–298.
del Prado Martín, F. M., Kostic, A., & Baayen, R. H. (2004). Putting the bits together: An information theoretical perspective on morphological processing. Cognition, 94, 1–18.
Duyck, W., Desmet, T., Verbeke, L. P. C., & Brysbaert, M. (2004). WordGen: A tool for word selection and nonword generation in Dutch, English, German, and French. Behavior Research Methods, Instruments, & Computers, 36, 488–499.
Eddington, D. (2004). Issues in modeling language processing analogically. Lingua, 114, 849–871.
Ernestus, M., & Baayen, R. H. (2001). Choosing between the Dutch past-tense suffixes -te and -de. In T. van der Wouden (Ed.), Linguistics in the Netherlands 2001 (pp. 81–93). Amsterdam: Benjamins.
Ernestus, M., & Baayen, R. H. (2003). Predicting the unpredictable: Interpreting neutralized segments in Dutch. Language, 79, 5–38.
Geyken, A. (2007). The DWDS-corpus: A reference corpus for the German language of the 20th century. In C. Fellbaum (Ed.), Collocations and idioms: Linguistic, lexicographic, and computational aspects. London: Continuum.
Goslin, J., & Frauenfelder, U. H. (2000). A comparison of theoretical and human syllabification. Language & Speech, 44, 409–436.
Goswami, U., & Ziegler, J. C. (2006). A developmental perspective on the neural code for written words. Trends in Cognitive Sciences, 10, 142–143.
Goswami, U., Ziegler, J. C., Dalton, L., & Schneider, W. (2003). Nonword reading across orthographies: How flexible is the choice of reading units? Applied Psycholinguistics, 24, 235–247.
Grainger, J., & Jacobs, A. M. (1993). Masked partial-word priming in visual word recognition: Effects of positional letter frequency. Journal of Experimental Psychology: Human Perception & Performance, 19, 951–964.
Grainger, J., & Jacobs, A. M. (1996). Orthographic processing in visual word recognition: A multiple read-out model. Psychological Review, 103, 518–565.
Grainger, J., & Whitney, C. (2004). Does the huamn mnid raed wrods as a wlohe? Trends in Cognitive Sciences, 8, 58–59.
Hutzler, F., Bergmann, J., Conrad, M., Kronbichler, M., Stenneken, P., & Jacobs, A. M. (2004). Inhibitory effects of first syllable-frequency in lexical decision: An event-related potential study. Neuroscience Letters, 372, 179–184.
Hutzler, F., Conrad, M., & Jacobs, A. M. (2005). Effects of syllable-frequency in lexical decision and naming: An eye-movement study. Brain & Language, 92, 138–152.
Jacobs, A. M. (2002). The cognitive psychology of literacy. In N. J. Smelser & P. B. Baltes (Eds.), International encyclopedia of the social and behavioral sciences (pp. 8971–8975). Amsterdam: Elsevier.
Jacobs, A. M., & Graf, R. (2005). Wortformgedächtnis als intuitive Statistik in Sprachen mit unterschiedlicher Konsistenz [Word form memory as intuitive statistics in languages with different consistency]. Zeitschrift für Psychologie, 213, 133–141.
Jacobs, A. M., Graf, R., & Kinder, A. (2003). Receiver operating characteristics in the lexical decision task: Evidence for a simple signal-detection process simulated by the multiple read-out model. Journal of Experimental Psychology: Learning, Memory, & Cognition, 29, 481–488.
Jacobs, A. M., Rey, A., Ziegler, J. C., & Grainger, J. (1998). MROM-P: An interactive activation, multiple read-out model of orthographic and phonological processes in visual word recognition. In J. Grainger & A. M. Jacobs (Eds.), Localist connectionist approaches to human cognition (pp. 147–187). Mahwah, NJ: Erlbaum.
Kuchinke, L., Jacobs, A. M., Grubich, C., Võ, M. L.-H., Conrad, M., & Herrmann, M. (2005). Incidental effects of emotional valence in single word processing: An fMRI study. NeuroImage, 28, 1022–1032.
Laganaro, M. (2005). Syllable frequency effect in speech production: Evidence from aphasia. Journal of Neurolinguistics, 18, 221–235.
Leung, M.-T., Law, S.-P., & Fung, S.-Y. (2004). Type and token frequencies of phonological units in Hong Kong Cantonese. Behavior Research Methods, Instruments, & Computers, 36, 500–505.
Levelt, W. J. M., Roelofs, A., & Meyer, A. S. (1999). A theory of lexical access in speech production. Behavioral & Brain Sciences, 22, 1–75.
Liberman, I., Liberman, A. M., Mattingly, I., & Shankweiler, D. (1980). Orthography and the beginning reader. Baltimore, OH: University Park Press.
Martin, N. (2004). Comments on Nickels and Howard (2004) “Dissociating effects of number of phonemes, number of syllables, and syllable complexity on word production in aphasia: It’s the number of phonemes that counts.” Cognitive Neuropsychology, 21, 528–530.
Massaro, D. W., & Cohen, M. M. (1994). Visual, orthographic, phonological, and lexical influences in reading. Journal of Experimental Psychology: Human Perception & Performance, 20, 1107–1128.
Mathey, S., & Zagar, D. (2002). Lexical similarity in visual word recognition: The effect of syllabic neighborhood in French. Current Psychology Letters: Behavior, Brain, & Cognition, 8, 107–121.
New, B., Brysbaert, M., Segui, J., Ferrand, L., & Rastle, K. (2004). The processing of singular and plural nouns in French and English. Journal of Memory & Language, 51, 568–585.
New, B., Pallier, C., Brysbaert, M., & Ferrand, L. (2004). Lexique 2: A new French lexical database. Behavior Research Methods, Instruments, & Computers, 36, 516–524.
Nickels, L. A., & Howard, D. (2004a). Dissociating effects of number of phonemes, number of syllables, and syllabic complexity on word production in aphasia: It’s the number of phonemes that counts. Cognitive Neuropsychology, 21, 57–78.
Nickels, L. A., & Howard, D. (2004b). Correct responses, error analyses, and theories of word production: A response to Martin. Cognitive Neuropsychology, 21, 531–536.
Novick, L. R., & Sherman, S. J. (2004). Type-based bigram frequencies for five-letter words. Behavior Research Methods, Instruments, & Computers, 36, 397–401.
Nuerk, H.-C., Rey, A., Graf, R., & Jacobs, A. M. (2000). Phonographic sublexical units in visual word recognition. Current Psychology Letters: Behavior, Brain, & Cognition, 2, 27–38.
Perea, M., & Carreiras, M. (1998). Effects of syllable frequency and syllable neighborhood frequency in visual word recognition. Journal of Experimental Psychology: Human Perception & Performance, 24, 134–144.
Peressotti, F., & Grainger, J. (1999). The role of letter identity and letter position in orthographic priming. Perception & Psychophysics, 61, 691–706.
Seidenberg, M. S. (1985). The time course of phonological code activation in two writing systems. Cognition, 19, 1–30.
Seidenberg, M. S. (1987). Sublexical structures in visual word recognition: Access units or orthographic redundancy? In M. Coltheart (Ed.), Attention and performance XII: The psychology of reading (pp. 245–263). Hillsdale, NJ: Erlbaum.
Seidenberg, M. S., & McClelland, J. L. (1989). A distributed, developmental model of word recognition and naming. Psychological Review, 96, 523–568.
Stella, V., & Job, R. (2001). Le sillabe PD/DPSS: Una base di dati sulla frequenza sillabica dell’italiano scritto. Giornale Italiano di Psicologia, 28, 633–639.
Stenneken, P., Bastiaanse, R., Huber, W., & Jacobs, A. M. (2005). Syllable structure and sonority in language inventory and aphasic neologisms. Brain & Language, 95, 280–292.
Stenneken, P., Conrad, M., Hutzler, F., Braun, M., & Jacobs, A. M. (2005). Frequency effects with visual words and syllables in a dyslexic reader. Behavioral Neurology, 16, 1–15.
Stenneken, P., Hofmann, M., & Jacobs, A. M. (2005). Patterns of phoneme and syllable frequency in jargon aphasia. Brain & Language, 95, 221–222.
Stone, G. O., Vanhoy, M., & Van Orden, G. C. (1997). Perception is a two-way street: Feedforward and feedback phonology in visual word recognition. Journal of Memory & Language, 36, 337–359.
Sumiya, H., & Healy, A. F. (2004). Phonology in the bilingual Stroop effect. Memory & Cognition, 32, 752–758.
Van Orden, G. C. (1987). A ROWS is a ROSE: Spelling, sound, and reading. Memory & Cognition, 15, 181–198.
Varley, R., & Whiteside, S. P. (2001). What is the underlying impairment in acquired apraxia of speech? Aphasiology, 15, 39–49.
Yates, M. (2005). Phonological neighbors speed visual word processing: Evidence from multiple tasks. Journal of Experimental Psychology: Learning, Memory, & Cognition, 31, 1385–1397.
Ziegler, J. C., & Ferrand, L. (1998). Orthography shapes the perception of speech: The consistency effect in auditory word recognition. Psychonomic Bulletin & Review, 5, 683–689.
Ziegler, J. C., Ferrand, L., & Montant, M. (2004). Visual phonology: The effects of orthographic consistency on different auditory word recognition tasks. Memory & Cognition, 32, 732–741.
Ziegler, J. C., & Goswami, U. (2005). Reading acquisition, developmental dyslexia, and skilled reading across languages: A psycholinguistic grain size theory. Psychological Bulletin, 131, 3–29.
Ziegler, J. C., Jacobs, A. M., & Stone, G. O. (1996). Statistical analysis of the bidirectional inconsistency of spelling and sound in French. Behavior Research Methods, Instruments, & Computers, 28, 504–515.
Ziegler, J. C., Perry, C., & Coltheart, M. (2003). Speed of lexical and nonlexical processing in French: The case of the regularity effect. Psychonomic Bulletin & Review, 10, 947–953.
Ziegler, J. C., Perry, C., Jacobs, A. M., & Braun, M. (2001). Identical words are read differently in different languages. Psychological Science, 12, 379–384.
Ziegler, J. C., Stone, G. O., & Jacobs, A. M. (1997). What is the pronunciation for -ough and the spelling for /u/? A database for computing feedforward and feedback consistency in English. Behavior Research Methods, Instruments, & Computers, 29, 600–618.
Ziegler, J. C., Van Orden, G. C., & Jacobs, A. M. (1997). Phonology can help or hurt the perception of print. Journal of Experimental Psychology: Human Perception & Performance, 23, 845–860.
Zorzi, M., Houghton, G., & Butterworth, B. (1998). Two routes or one in reading aloud? A connectionist dual-process model. Journal of Experimental Psychology: Human Perception & Performance, 24, 1131–1161.
Additional information
This research was partly supported by a grant from the Deutsche Forschungsgemeinschaft (Ja 823/3-1/Jacobs, “Zur Rolle phonologischer Prozesse beim Lesen komplexer Wörter: Ein sprachvergleichender Ansatz” [On the role of phonological processes in reading complex words: A cross-linguistic approach], Freie Universität Berlin). None of the authors has any financial interest in any of the materials presented in this article. We thank Gabriele Hofmann and Susanne Prinz for proofreading, and Andreas Böckler for answering programming questions.
About this article
Cite this article
Hofmann, M.J., Stenneken, P., Conrad, M. et al. Sublexical frequency measures for orthographic and phonological units in German. Behavior Research Methods 39, 620–629 (2007). https://doi.org/10.3758/BF03193034
DOI: https://doi.org/10.3758/BF03193034