Co-occurrence and cognitive basis of low language and low reading skills in children speaking a transparent language

Kamykowska, Joanna; Łuniewska, Magdalena; Banasik-Jemielniak, Natalia; Czaplewska, Ewa; Kochańska, Magdalena; Krajewski, Grzegorz; Maryniak, Agnieszka; Wiejak, Katarzyna; Krasowicz-Kupis, Grażyna; Haman, Ewa

doi:10.1007/s11145-024-10537-4

Open access
Published: 16 April 2024

Reading and Writing

Abstract

We investigated the comorbidity of low language and reading skills in 6- to 8-year-old monolingual Polish-speaking children (N = 962) using three different approaches: norming data to determine the prevalence of comorbid difficulties, group comparisons of profiles on key cognitive-linguistic measures, and a case series analysis examining the frequency of single versus multiple deficits. We identified four groups of children based on their oral language and reading skills: children with low oral language skills alone, low reading skills alone, comorbid low language/reading skills, and typically developing chronological-age controls. We characterized the four groups (n = 38 per group) in terms of oral language and reading skills measured with normed test batteries, and in terms of the cognitive-linguistic profiles measured by the phonological awareness test (PA), rapid automatized naming test (RAN), and nonword repetition tests (NWR). We found that 24–31% of children with one type of difficulty had comorbid difficulties in the other domain. All groups differed significantly in cognitive-linguistic profiles. For each measure, the comorbid group had the lowest results. The group of children with isolated low language skills had better results than the comorbid group in: (1) Sentence repetition (a sub-test of the oral language test), (2) discrimination-based, blending-based, and elision-based PA sub-tests, (3) RAN (both digits and letters). The group with isolated low reading skills had better results than the comorbid group in: (1) the discrimination-based PA sub-test, (2) NWR tests. The results indicate differences in cognitive-linguistic profiles between the groups with low language and/or low reading skills. They highlight the need to control for both types of difficulties in research on low language and/or reading skills, and to screen for comorbid issues when diagnosing children.

Introduction

We investigate the relationship between two types of developmental difficulties related to language: low oral language skills, the primary symptom of developmental language disorder (DLD), and low reading skills, the primary symptom of developmental dyslexia.^{Footnote 1} Low oral language and low reading skills are distinct but often co-occurring phenomena (e.g., Baird et al., 2011; Bishop et al., 2009; Catts et al., 2005; Fraser et al., 2010; Kelso et al., 2007; McArthur & Castles, 2013; Ramus et al., 2013). Estimates of comorbidity vary between studies: comorbid low reading in children with low oral language ranges from 17% (Catts et al., 2005) to 79% (McArthur & Castles, 2013), while comorbid low oral language in children with low reading skills ranges from 15% (Catts et al., 2005) to 73% (Eisenmajer et al., 2005). These differences in reported comorbidity probably result from differences in sampling methods, time of diagnosis, and specific inclusion criteria (Adlof & Hogan, 2018). The level of comorbidity might also depend on language characteristics such as orthographic transparency: only a few studies have included non-English-speaking samples, and they reported rather lower levels of comorbidity, e.g., 19–59% in Dutch (De Groot et al., 2015), 38–46% in Russian (Rakhlin et al., 2013), and 41–47% in Greek (Spanoudis et al., 2019). In any case, the comorbidity of language and reading difficulties is above the statistical chance level, suggesting that these two types of difficulties may, at least in part, have the same underlying cause.

The list of potential specific underlying deficits contributing to low oral language and low reading skills is long (Ramus et al., 2003; Vellutino & Fletcher, 2008), and research aimed at identifying a single cognitive deficit that clearly explains language and/or reading problems has been inconclusive. An alternative approach, probabilistic rather than deterministic, comes from the Multiple Deficit Model (MDM; Pennington, 2006). According to this model, specific disorders result from the atypical development of various cognitive functions, which, in turn, are affected by multiple risk and protective factors. None of these factors is necessary or sufficient, and some are shared by different disorders. Thus, comorbidity results from shared cognitive risk factors (Pennington, 2006).

Research on dyslexia within the MDM framework confirms that the picture is indeed quite complex. Although most English-speaking individuals with dyslexia fit within the MDM with various constellations of multiple cognitive deficits, approximately 25% have only a single deficit (usually a phonemic awareness deficit). On the other hand, a group of individuals with dyslexia have no deficit in phonemic awareness (Carroll et al., 2016; McGrath et al., 2020; Pennington et al., 2012). In addition, the use of less strictly defined cut-off points increases the number of cases with multiple deficits (based on Polish data—Dębska et al., 2022).

Polish is a Slavic language with a complex system of inflectional morphology. In terms of orthography, Polish uses an alphabetic writing system based on the Latin alphabet. The Polish alphabet consists of 32 letters, nine of which carry diacritical marks. These diacritics are essential for the correct pronunciation and meaning of Polish words. The spelling of Polish words is largely consistent, with generally reliable grapheme-phoneme correspondences.

Some of the most thoroughly researched cognitive-linguistic skills related to low oral language and/or low reading are skills measured by phonological awareness (PA), rapid automatized naming (RAN), and nonword repetition (NWR) tests (e.g., De Groot et al., 2015; Fraser et al., 2010; Ramus et al., 2013; Snowling et al., 2019; Vandewalle et al., 2012). These skills are also important in the Polish context: PA and RAN deficits are the most common deficits out of seven skills measured in Polish school-aged children with dyslexia (Dębska et al., 2022), while NWR tests are used in the clinical diagnosis of both dyslexia (Bogdanowicz et al., 2011) and DLD (Szewczyk et al., 2015).

Phonological awareness

Phonological awareness (PA) is the ability to discriminate, identify, and manipulate phonological segments of speech (Swanson et al., 2003). The PA deficit is a well-researched phenomenon in dyslexic individuals (large overall mean effect size, d = −1.37, compared to age-matched controls in the meta-analysis by Melby-Lervåg et al., 2012). It is also considered the most common deficit within the multifactorial model of dyslexia (Catts et al., 2017; Pennington et al., 2012), and it is stable across the age range studied (5;4 to 16;10 in Melby-Lervåg et al., 2012). With regard to orthographic transparency, the role of PA in reading is likely to be less pronounced in more transparent orthographies than in the opaque English orthography (Pfost, 2015; Ziegler et al., 2010; though see Melby-Lervåg et al., 2012). The characteristics of particular PA tasks also seem to modify the measured effect sizes. Comparisons with reading-level controls in transparent languages show that simple tasks (matching, blending and segmentation) produce slightly larger overall effect sizes than complex tasks (elision, substitution and spoonerism; meta-analysis by Parrila et al., 2020).

Within the DLD-related research, the status of PA deficits is less clear. English-based studies that take comorbidity into account show that in PA tasks, preschool children with low oral language only perform as poorly as those who are going to have low reading skills only and those who are going to have comorbid low oral language and low reading skills (Bishop et al., 2009; Catts et al., 2005; Snowling et al., 2019), but their performance changes with time spent in formal instruction. For school-age groups with low oral language only, the results show mixed patterns: children with low oral language only either outperform the comorbid low language and low reading group but not the low reading only group (De Groot et al., 2015; Fraser et al., 2010; Ramus et al., 2013; Snowling et al., 2019), or reliably outperform both low reading groups (Catts et al., 2005). This inconsistency in findings could be due to the language studied, the detailed characteristics of the stimuli (Farquharson et al., 2014) or the tasks measuring PA. For example, in a study of Dutch children, the low-language-only group performed worse on the substitution task than on the elision task, probably due to a significant short-term memory load (De Groot et al., 2015).

Rapid automatized naming

Rapid automatized naming (RAN) is the ability measured by a task in which participants name arrays of familiar objects, colours, letters (RAN_Letters), or digits (RAN_Digits) as quickly as possible. A RAN deficit is manifested by slow naming times. This task involves accessing phonological representations, integrating phonological and visual information, and allocating working memory (Norton & Wolf, 2012). The similarities between the processes involved in RAN and reading have earned RAN the title of “a microcosm of the reading system” (Norton & Wolf, 2012, p. 448). Indeed, the meta-analysis by Araújo and collaborators shows a moderate-to-strong correlation between RAN and reading performance (Araújo et al., 2014).

Given that RAN is so closely related to reading, it is not surprising that at the group level, children with low reading skills show a well-documented deficit in RAN (d = 1.19, compared to age-matched controls, Araújo & Faísca, 2019). A meta-analysis revealed that the deficit is quite universal (Araújo & Faísca, 2019): it is seen in orthographies of different complexity (Araújo & Faísca, 2019; Landerl et al., 2013), and it generalizes across different stimulus types (both alphanumeric and non-alphanumeric). On the other hand, as predicted by MDM, a multiple case study analysis shows that up to 40% of English-speaking children with dyslexia who show any of the cognitive deficits measured in the study have no deficits in RAN (they only present PA and/or oral language deficits, Pennington et al., 2012). In Polish children with dyslexia, RAN deficits were observed in 26% of children (Dębska et al., 2022).

Within the low oral language only group (9 years old, English-speaking) RAN_Digits results were similar to typically developing controls (Bishop et al., 2009). Pennington and Bishop (2009) suggest that intact RAN is a protective factor against the development of reading difficulties. In addition, some studies suggest that the results of the low oral language-only group appear to be stimulus dependent, unlike the low reading only group where performance is consistently poor across stimulus types. The results in RAN_Letters do not help to distinguish between children with low reading skills only and children with low oral language only, whereas in RAN_Digits, the performance of the low oral language only group is much better, reaching low average levels. The comorbid group score is the lowest (De Groot et al., 2015).

Nonword repetition

Nonword repetition (NWR) is a task in which a participant has to repeat single pronounceable pseudowords. The list of items usually includes pseudowords of 2–5 syllables. Modifying items’ characteristics seems to allow measuring different abilities. The level of items’ wordlikeness (Archibald & Gathercole, 2006), phonotactic frequency (Munson et al., 2005), phonological complexity (Marshall & Van Der Lely, 2009), and prosodic features (Gallon et al., 2007) influence the overall results and probably the skills needed to complete the task. In older children, NWR is mostly a measure of phonological short-term memory, although it is also affected by long-term knowledge, especially in younger children (Rispens & Baker, 2012). Lexical knowledge is better measured by more word-like items (Archibald & Gathercole, 2006).

The deficit in NWR is described in meta-analyses for English-speaking children with dyslexia (Melby-Lervåg & Lervåg, 2012, d = 1.12) and with DLD (Graf Estes et al., 2007, d = 1.27). Both meta-analyses highlight large variability in the results, which could be caused by ignoring the comorbidity effect: an NWR deficit is no longer seen in the low reading group if children with low reading skills and control samples are matched on nonverbal IQ and oral language skills (Cowan et al., 2017). This suggests that the NWR deficit is characteristic of children with low language only or with comorbid low language and low reading, rather than of the low reading only group.

Children with comorbid low language and low reading skills suffer from a cumulative effect, showing the poorest outcomes compared to the low language only or low reading only groups (Bishop et al., 2009; Catts et al., 2005; Cowan et al., 2017; Ramus et al., 2013; Rispens & Parigger, 2010; Snowling et al., 2019). However, certain task characteristics modify this pattern. The difference between children with comorbid low language and low reading skills and those with low reading skills only becomes significant when the task includes consonant clusters, but not when the task consists mostly of alternating consonants and vowels (CVCVC structure; Cowan et al., 2017). Moreover, the differences are only observed for longer pseudowords (3–4 syllables but not 1–2 syllables, Catts et al., 2005). The results also seem to be language dependent: a meta-analysis (Melby-Lervåg & Lervåg, 2012) shows that the deficit in NWR is less pronounced in more transparent languages (effect size: d = −0.56). In addition, a high level of wordlikeness should increase the difference between children with low language skills and children with low reading skills or typically developing groups. This is because only children with good lexical knowledge can benefit from the high wordlikeness of an item.

The current analysis

Our analysis aimed to describe the relationship between oral language and reading difficulties using three different methods. We used norming data to determine the prevalence rates of comorbid low language and low reading skills in Polish, i.e. the prevalence of low reading skills in children with low language skills among monolingual Polish speakers, and the prevalence of low language skills in children with low reading skills. To the best of our knowledge, there is no published empirical work on the comorbidity of low language and low reading in Polish. We predicted that comorbidity rates should be similar to those in other rather transparent languages, such as Russian (38–46%; Rakhlin et al., 2013) or Greek (41–47%; Spanoudis et al., 2019).

We also looked at the cognitive-linguistic profiles associated with low language and/or low reading using carefully matched group comparisons. We expected that this analysis would help to distinguish between groups with only low language, only low reading, and comorbid low language and low reading. We hypothesised that typically developing children would have the highest scores on the PA, RAN and NWR tests, whereas children with comorbid low language and low reading skills would have low or the lowest scores on all these measures. In addition, the low language only group would show deficits on the PA subtests that require substantial short-term memory load (i.e., elision but not blending) and on the NWR, more so on items with high wordlikeness level. The low reading only group would show deficits mainly on simple PA subtests and RAN tests, and less so on NWR tests. Furthermore, a multiple case study analysis would reveal multiple deficits in the low language, low reading and comorbid low language and low reading groups, more so for less strict deficit cut-offs.

Method

Participants

Sampling procedure

The current analysis is a secondary analysis based on data collected by the Educational Research Institute in Poland in 2014–15 within a norming study of two comprehensive tests for Polish-speaking monolingual children assessing oral language and early reading skills (Krasowicz-Kupis et al., 2015a, 2015b; Smoczyńska et al., 2015).

In total, 4771 children (50.1% girls, 49.9% boys) aged 4;0 to 8;11 participated in the norming study of the two tests. We applied the following inclusion criteria to the original database (the number of participants left in the database after applying the additional criterion is given in parentheses): complete results available for both the reading and language tests (n = 3706), 6;6 to 8;5 years old (on the day of registration to the study) and attending first grade (summer semester) or second grade (winter semester, n = 1384), Polish monolinguals (n = 1210), no reported uncorrected vision or hearing problems and no neurological disorders (n = 975), no other special educational needs recognized by special education services, including autism spectrum disorder and disorders of intellectual development (n = 963), and IQ > 70 (n = 962; 51.5% girls, 48.5% boys).

Group assignment

Children were assigned to one of four groups based on their scores in the Language and Reading tests, according to the unisex sten scale (M = 5.5, SD = 2.0). The grouping criteria were established empirically to resemble epidemiological data on dyslexia and DLD. Low reading skills were recognized if a child scored low (≤ 3 sten, corresponding to ≤ 16th percentile) in at least two of the four reading sub-tests used. Low language skills were identified if a child scored low (≤ 3 sten) in at least two of the six language sub-tests. Children who met the criteria for both low reading skills and low language skills were classified as the comorbid low language and low reading group. Participants who did not meet any of the above criteria were considered typically developing.
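
The grouping rule described above can be sketched in a few lines of Python. This is a minimal illustration rather than the procedure actually used in the study: the function and variable names are hypothetical, and scores are assumed to be already expressed as stens.

```python
LOW_STEN = 3          # a sub-test score counts as low at or below this sten
MIN_LOW_SUBTESTS = 2  # at least two low sub-tests define low skills in a domain


def is_low(stens):
    """Return True if at least two sub-test stens are at or below the cut-off."""
    return sum(s <= LOW_STEN for s in stens) >= MIN_LOW_SUBTESTS


def assign_group(reading_stens, language_stens):
    """Assign a child to one of the four groups described in the text."""
    low_reading = is_low(reading_stens)    # four reading sub-tests
    low_language = is_low(language_stens)  # six language sub-tests
    if low_reading and low_language:
        return "comorbid"
    if low_language:
        return "low language only"
    if low_reading:
        return "low reading only"
    return "typically developing"


# Hypothetical child: two low reading sub-tests, one low language sub-test.
example_group = assign_group([3, 2, 5, 6], [3, 5, 6, 7, 5, 4])
print(example_group)
```

Note that under this rule a child with a single low sub-test in each domain is still classified as typically developing.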

Matching groups

The four groups (low language, low reading, low language and low reading, typically developing) differed significantly on controlled variables such as parental education and nonverbal IQ (Table 1). To resolve this problem, for the purpose of further comparisons of the cognitive-linguistic profiles, we used a pairwise participant matching algorithm to match the groups based on age, gender, nonverbal IQ and parental education (n = 38 for each group—equal to the size of the smallest group; see Table 2).

Table 1 Sociodemographic characteristics and overall language and reading skills in the full sample

Table 2 Sociodemographic characteristics and overall language and reading skills in matched groups

Measures

The tasks used in the current analysis are listed below. A detailed description of all tests is available in the Supplementary material 1 (Table S1).

Reading skills were assessed with four sub-tests of ‘Bateria Testów Czytania BTCZ IBE’ [Battery of Reading Tests] (Krasowicz-Kupis et al., 2015a, 2015b): Letter naming (maximum score: max = 32), Timed word reading (for 60 s, max = 56), Pseudoword reading (untimed, max = 24), and Timed pseudoword reading (for 60 s, max = 56).

Language skills were assessed with six sub-tests of ‘TRJ: Test Rozwoju Językowego’ [Test of Language Development] (Smoczyńska et al., 2015): Vocabulary—comprehension (max = 28), Vocabulary—production (max = 25), Sentence repetition (max = 34), Grammar—comprehension (max = 32), Grammar—production (max = 14), and Discourse—comprehension (max = 20).

Phonological awareness was assessed with twelve sub-tests of ‘Bateria Testów Fonologicznych’ [Battery of Phonological Tests] (Krasowicz-Kupis et al., 2015a, 2015b) including Phoneme discrimination, Alliteration, Blending (syllables and phonemes), Segmenting (into syllables or phonemes within words or pseudowords), and Elision (syllables and phonemes).

RAN skills were assessed with digit-based and with letter-based RAN tasks (Fecenec et al., 2013), hence variables: RAN_Digits and RAN_Letters.

Two separate nonword repetition (NWR) tests with different levels of wordlikeness were used: ‘Zetotest II Krasowicz-Kupis’, max = 40 (Bogdanowicz et al., 2011)—created as a measure of phonological short-term memory, without considering the level of wordlikeness (hence: NWR_{Low wordlikeness}), and ‘Test Powtarzania Pseudosłów’ [Pseudoword Repetition Test], max = 27 (Szewczyk et al., 2015)—created mainly as a measure of sublexical knowledge, containing items of high wordlikeness (hence variable: NWR_{High wordlikeness}).

Nonverbal IQ was assessed with the individually administered version of the CFT 1-R (Koć-Januchta et al., 2013).

Parental education level was assessed via a sociodemographic questionnaire on an 8-point ordinal scale. For 7 participants the data come from fathers, otherwise from mothers.

Procedure

Data collection was carried out by trained psychologists in quiet locations at the children’s schools. There were four sessions (45–50 min each) separated by a maximum of one week: session I was devoted to the language test; session II to the reading test and some of the PA and RAN sub-tests; session III to reading, PA, NWR, and writing tests (not included in this paper); and session IV to the IQ test and PA sub-tests.

Data collection was conducted in accordance with the ethical standards of the Educational Research Institute at the time of data collection (2012–2014). The study was conducted in accordance with the 1964 Declaration of Helsinki and its subsequent amendments. The data analyses presented in this article were approved by the Research Ethics Committee of the Faculty of Psychology at the University of Warsaw.

Data analysis

We present the findings on the between-group differences in the matched groups analysed with a two-way ANOVA (Language × Reading). To further explain the results, we also present post-hoc between-group differences; all p-values are presented with a Holm‒Bonferroni correction for multiple comparisons. All measured variables are converted into Z-scores, i.e., M = 0, SD = 1, relative to the results of the typically developing group, to make the results of different sub-tests comparable on a single scale. Z-scores are calculated based on typically developing group normed sten scores (M = 5.5, SD = 2.0).
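
One possible reading of this standardisation, expressing each score in units of the typically developing group's mean and SD, is sketched below. The function name and the example scores are hypothetical, and the use of the sample SD is an assumption of the sketch, not something stated in the text.

```python
from statistics import mean, stdev


def z_scores_relative_to_reference(scores, reference_scores):
    """Express scores in units of the reference group's mean and SD."""
    ref_mean = mean(reference_scores)
    ref_sd = stdev(reference_scores)  # sample SD; an assumption of this sketch
    return [(s - ref_mean) / ref_sd for s in scores]


# Hypothetical sten scores for a reference (typically developing) group.
typically_developing = [4, 5, 6, 7, 8]

# By construction the reference group itself ends up with M = 0 and SD = 1.
reference_z = z_scores_relative_to_reference(typically_developing, typically_developing)

# Scores of other children are then read against that scale.
other_z = z_scores_relative_to_reference([3, 6], typically_developing)
print(other_z)
```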

First, we confirm our initial group selection by analysing the results of the Language and Reading tests that were used to create the groups. Second, we present data on cognitive-linguistic skills (PA, RAN, NWR). For the twelve sub-tests measuring PA skills, Principal Component Analysis (Supplementary material 2, Table S2) allowed us to create three theoretically driven standardized factors.

Between-group analyses were accompanied by the analysis of a multiple case study: the deficit distribution within the groups was counted for both strict (−1.65 SD) and more liberal (−1 SD) cut-off points established for each variable under scrutiny. All analyses were performed using SPSS 28 (IBM Corporation, Armonk, New York).
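
The deficit count in the multiple case study can be illustrated as follows. This is a sketch under stated assumptions: the measure names and Z-scores are hypothetical, a deficit is taken to mean a Z-score at or below the cut-off (whether the boundary is inclusive is not stated in the text), and all measures are assumed to be scaled so that lower Z-scores mean poorer performance.

```python
STRICT_CUTOFF = -1.65  # strict criterion, in SD units
LIBERAL_CUTOFF = -1.0  # more liberal criterion, in SD units


def count_deficits(z_by_measure, cutoff):
    """Count the measures on which a child's Z-score is at or below the cut-off."""
    return sum(z <= cutoff for z in z_by_measure.values())


def deficit_profile(z_by_measure, cutoff):
    """Label a child as showing no, a single, or multiple deficits."""
    n_deficits = count_deficits(z_by_measure, cutoff)
    if n_deficits == 0:
        return "no deficit"
    if n_deficits == 1:
        return "single deficit"
    return "multiple deficits"


# Hypothetical child with Z-scores on three cognitive-linguistic measures.
child = {"PA": -1.2, "RAN_Digits": -1.8, "NWR": -0.4}

strict_profile = deficit_profile(child, STRICT_CUTOFF)
liberal_profile = deficit_profile(child, LIBERAL_CUTOFF)
print(strict_profile, "/", liberal_profile)
```

The same child can move from a single-deficit to a multiple-deficit profile when the cut-off is relaxed, which is why both cut-offs are reported.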

The complete dataset used for the current analysis is available from an Open Science Framework archive: https://osf.io/6348t/?view_only=0d4a33d98aaa457c801bc80d306a9f6a.

Results

Comorbidity rates

As expected, low oral language and low reading skills co-occurred in the sample (Table 3). A total of 12.9% of the children were classified as having low oral language skills, and 16.4% were classified as having low reading skills. Among children with low reading skills, 24.1% also presented low language skills, while among children with low language skills, 30.6% also presented low reading skills. Approximately equal numbers of girls and boys were classified as typically developing and as having comorbid low language and low reading skills. There were more girls than boys in the low language only group (1.21:1), and more boys than girls in the low reading only group (1.31:1). Neither gender ratio differed significantly from 1:1: low oral language only, χ² (1, N = 86) = 0.744, p = 0.388; low reading only, χ² (1, N = 120) = 2.13, p = 0.144.
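
The rates and gender tests above can be reproduced with a short script, given here as a hedged sketch. The group sizes are taken from the reported N values (962 in total, 86 low language only, 120 low reading only, 38 comorbid). The gender counts (47:39 and 68:52) are not stated in the text; they are inferred from the reported ratios and should be treated as assumptions.

```python
from math import erfc, sqrt

N_TOTAL = 962
N_COMORBID = 38
N_LANGUAGE_ONLY = 86
N_READING_ONLY = 120

n_low_language = N_COMORBID + N_LANGUAGE_ONLY  # all children with low language
n_low_reading = N_COMORBID + N_READING_ONLY    # all children with low reading

pct_low_language = 100 * n_low_language / N_TOTAL
pct_low_reading = 100 * n_low_reading / N_TOTAL
pct_language_given_reading = 100 * N_COMORBID / n_low_reading
pct_reading_given_language = 100 * N_COMORBID / n_low_language


def chi2_equal_split(count_a, count_b):
    """Goodness-of-fit chi-square (df = 1) against a 1:1 ratio, with its p-value."""
    expected = (count_a + count_b) / 2
    chi2 = ((count_a - expected) ** 2 + (count_b - expected) ** 2) / expected
    p_value = erfc(sqrt(chi2 / 2))  # upper tail of the chi-square distribution, df = 1
    return chi2, p_value


# Inferred (not reported) gender counts: girls vs boys, then boys vs girls.
chi2_language, p_language = chi2_equal_split(47, 39)
chi2_reading, p_reading = chi2_equal_split(68, 52)

print(round(pct_language_given_reading, 1), round(pct_reading_given_language, 1))
print(round(chi2_language, 3), round(chi2_reading, 2))
```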

Table 3 Overlap between children meeting criteria for low language and/or low reading skills, with gender ratios

If low language and low reading skills co-occurred only by chance, the probability of a child showing both would equal the product of the two prevalence rates (12.9% × 16.4%), i.e. 2.12%. In fact, these difficulties co-occurred significantly more often: 3.95%, χ² (1, N = 962) = 15.6, p < 0.001.
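
The chance-level comparison can be checked numerically. The sketch below assumes that the reported statistic comes from a one-sample (goodness-of-fit) chi-square test comparing the observed number of comorbid cases with the number expected under independence; this is an inference from the reported figures, not a procedure described in the text. The counts of 124 and 158 children are derived from the reported group sizes.

```python
from math import erfc, sqrt

N_TOTAL = 962
N_LOW_LANGUAGE = 124  # 86 low language only + 38 comorbid
N_LOW_READING = 158   # 120 low reading only + 38 comorbid
N_COMORBID = 38

# Under independence, the joint probability is the product of the two base rates.
expected_rate = (N_LOW_LANGUAGE / N_TOTAL) * (N_LOW_READING / N_TOTAL)
observed_rate = N_COMORBID / N_TOTAL

# Two-category goodness-of-fit test: comorbid vs not comorbid (df = 1).
expected_comorbid = expected_rate * N_TOTAL
expected_other = N_TOTAL - expected_comorbid
observed_other = N_TOTAL - N_COMORBID

chi2 = ((N_COMORBID - expected_comorbid) ** 2 / expected_comorbid
        + (observed_other - expected_other) ** 2 / expected_other)
p_value = erfc(sqrt(chi2 / 2))  # upper tail of the chi-square distribution, df = 1

print(round(100 * expected_rate, 2), round(100 * observed_rate, 2), round(chi2, 1))
```

A 2 × 2 test of independence on the same counts would give a different statistic, so the match with the goodness-of-fit version is what motivates the assumption above.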

Language and reading skills

Language

The main effect of Language was present for all six sub-tests (large effect sizes, see Supplementary material 3, Table S3). Additionally, the main effect of Reading was present in the Sentence repetition task (small effect size), but no interaction was observed (see Fig. 1). As expected based on group selection, children with low reading skills only did not differ significantly from the typically developing group in any of the sub-tests. In most sub-tests, children with low oral language only performed as poorly as the comorbid low language and low reading group (ranging from −1.17 to −2.22 Z-scores relative to the typically developing group). Only in Sentence repetition did the comorbid low oral language and low reading group perform significantly worse than the low language only group. The two-way ANOVA implies that the Sentence repetition result stems from two additive main effects: Language and Reading skills.

Reading

The main effect of Reading was present in the results of all four sub-tests (large effect sizes, Table S4 and Fig. 2). Additionally, a main effect of Language and an interaction effect were observed in Timed Word Reading (small effect sizes). In all four sub-tests, the low reading only group scored as low as children with low oral language and low reading skills. In Timed Word Reading, children with low language only had worse results than typically developing children, but they did as well as the typically developing group in all other Reading sub-tests: Letter naming, Pseudoword reading, and Timed pseudoword reading. Within the two-way ANOVA, the Timed Word Reading result is explained by the main effects of language and reading skills and by their interaction: low reading skills determine low results regardless of the level of language skills, while typical reading skills are associated with lower results only when accompanied by low language skills.