Introduction

In order to become a competent citizen of the society and to acquire the necessary skills for full civic participation, one needs to be able to read and comprehend different types of texts. The PISA (The Programme for International Student Assessment) studies with 15-year-old Finnish students have shown a high level of reading performance compared to their peers in other countries (The Organisation for Economic Co-operation and Development [OECD], 2014). Despite the high ranking of Finnish youth’s PISA performance, a worrying proportion of up to 11% of Finnish 15-year-olds is estimated to fail to achieve the level of reading skills commended for being able to be proficient in education and society (Vettenranta et al., 2016). To target these students for early support, it is important to identify the factors and developmental mechanisms contributing to their less than optimal reading development. In the present study, we use Kindergarten (age 6) data to predict adolescent reading comprehension in Grade 9 (age 15). We draw on the comprehensive, widely used reading comprehension test from the PISA Reading test battery developed in association with the comparative assessments managed by OECD (2016). Although the status of this assessment tool of 15-year-old students’ reading comprehension and its impact on educational policymaking are widely acknowledged, the research concerning pre-reading skills, predictors and correlates of the PISA Reading, however, is still surprisingly scarce internationally.

In addition to early identification, this paper has a focus on gender differences. The Finnish 15-year-olds manifest a gender difference in reading comprehension which is the largest among OECD countries and the ninth largest among all participating countries in PISA Reading (Brozo et al., 2014; OECD, 2016). In general, the proposed explanations for relatively large gender differences in reading skills favouring girls include school-age individual factors such as an interest in and the extent of leisure reading, or factors related to the school such as “girl-friendly school practices” (see Stoet & Geary, 2013). There is evidence, however, pointing at potential early roots of these differences as some studies have shown that girls outperform boys in language and cognitive skills already before school entry (Lange, Euler, & Zaretsky, 2016; Palejwala & Fine, 2015). These findings lead us to examine whether the gender difference in 15-year-olds’ PISA Reading performance can be explained by differences in pre-reading skills between girls and boys that emerge already prior to school entry.

Early predictors of reading comprehension

According to the Simple View of Reading (Florit & Cain, 2011; Gough & Tunmer, 1986; Hoover & Gough, 1990; Kirby & Savage, 2008), comprehension of written text presupposes, first, learning to decode—that is, acquiring the ability to decipher letter strings to words that appear in spoken language and, second, proficiency in linguistic comprehension. The relative importance of these two components, decoding and comprehension, changes across reading development. When decoding is not yet fully automatized, it has a stronger impact on reading comprehension than later when a sufficient decoding ability has been achieved (e.g., Florit & Cain, 2011). Once decoding becomes fluent enough, linguistic comprehension displays a more prominent role than decoding in determining the level of reading comprehension skills (e.g., Perfetti, 1985). Despite this diminishing trend for the predictive role of decoding across age, reading fluency continues to have a significant role in adolescent PISA Reading in Grade 9, explaining 8–15% of the variance in the PISA score (Artelt, Schiefele, & Schneider, 2001; Eklund, Torppa, Sulkunen, Niemi, & Ahonen, 2018; Torppa, Eklund, Sulkunen, Niemi, & Ahonen, 2018). In the analysis of Kindergarten predictors of PISA Reading, we thus need to include measures of both linguistic comprehension and the known predictors of decoding and reading fluency.

The key early predictors of decoding and reading fluency among Finnish children and adolescents have repeatedly been reported to include letter knowledge, phonological awareness, and rapid naming (e.g., Kairaluoma, Torppa, Westerholm, Ahonen, & Aro, 2013; Puolakanaho et al., 2007; Torppa, Eklund, van Bergen, & Lyytinen, 2015). Similar findings have been found in other languages (e.g., Elbro, Borstrøm, & Petersen, 1998; Gallagher, Frith, & Snowling, 2000; Ziegler et al., 2010). However, studies in transparent orthographies such as Finnish, suggest that the association between phonological awareness and reading fluency gets weaker after Grades 1 or 2 whereas in less transparent orthographies the association remains stronger (de Jong & van der Leij, 2002; Landerl & Wimmer, 2000; Torppa et al., 2015). After the acquisition of basic decoding skills, individual differences in reading performance are primarily seen in speed as opposed to accuracy of decoding (e.g., Bekebrede, van der Leij, & Share, 2009). Kindergarten rapid naming is a strong predictor of reading fluency (e.g., Georgiou, Parrila, & Papadopoulos, 2008; Savage & Frederickson, 2005) and in transparent orthographies it is one of the most powerful predictors of reading skill already in the early grades (e.g., Puolakanaho et al., 2007). This applies to Finnish because of its almost perfect grapheme-to-phoneme correspondence which contributes to fast reading acquisition (Aro, 2017). More than 95% of Finnish children are accurate decoders after a few months of reading instruction in Grade 1 (Holopainen, Ahonen, & Lyytinen, 2001; Lerkkanen, Rasku-Puttonen, Aunola, & Nurmi, 2004).

As Finnish children on average become relatively fluent readers early in their schooling, the listening comprehension skills rather than decoding accuracy explain a growing portion of individual differences in reading comprehension in elementary grades (Dufva, Niemi, & Voeten, 2001; Eklund et al., 2018; Lerkkanen et al., 2004). In their SVR model, Gough and Tunmer (1986) suggested that measures of listening comprehension can preferably be used to assess linguistic comprehension, but later research has shown that vocabulary measures are reliable predictors of reading comprehension as well (e.g., Cain, Oakhill, & Bryant, 2004; Manolitsis, Georgiou, & Parrila, 2011; Ouellette, 2006; Torppa et al., 2006). Weak vocabulary knowledge can form a bottleneck particularly when reading comprehension requires comprehending complex words in order to make inferences (Cain, 2016).

Pre-reading skills as predictors of PISA Reading comprehension

The PISA Reading test, the outcome measure of the present study, assesses comprehension of various genres such as texts, tables, and graphs. It thus includes a broader array of tasks than many commonly used reading comprehension assessments that typically involve only continuous texts. PISA Reading tasks aim to test skills essential for full participation in modern societies (OECD, 2017a). The test protocol (OECD, 2017b, 51) uses the term Reading Literacy to indicate the broad coverage of the test assessing “understanding, using, reflecting on and engaging with written texts, in order to achieve one’s goals, to develop one’s knowledge and potential, and to participate in society.” The test (containing always a certain number of repeated link items to retain comparability between reading scores from different years) is used every three years in the comparative assessments of educational quality.

Research on skills predicting PISA Reading is scarce. Arnbak (2012) showed that concurrently measured word recognition and vocabulary explained 41% of the PISA Reading scores among Danish students, whereas Artelt et al. (2001) reported that concurrently measured decoding speed explained 13% of the variance in PISA Reading among German students. Using another Finnish sample than in the current study, Torppa et al. (2018) reported that concurrently assessed reading fluency predicted up to 15% of the variance in PISA Reading among Finnish students. To our knowledge, there is only one longitudinal study that has examined early predictors of PISA Reading, that by Eklund et al. (2018) involving a sample of adolescents with family risk for dyslexia (n = 88) and another sample without family risk for dyslexia (n = 70). The results showed that oral language skills (e.g., vocabulary) assessed prior to school entry predicted 53% of the PISA Reading variance among the high-risk group and 31% among the low-risk group. The pre-reading predictors of decoding (phonological awareness, rapid naming, letter knowledge) predicted 15% of the PISA Reading variance among the high-risk group and 13% among the low-risk group via reading fluency. The samples in the study by Eklund et al. were, however, relatively small.

Gender differences in pre-reading skills and PISA Reading

Gender differences in reading favouring girls is a robust finding across languages (e.g., OECD, 2014) as well as the overrepresentation of boys among struggling readers (e.g., Quinn & Wagner, 2015), often found already for young readers (for a recent meta-analysis, see Quinn, 2018). In PISA 2015, for instance, girls outperformed boys in reading comprehension in every participating country (OECD, 2016) although the size of the gender gap in reading varies considerably between countries (OECD, 2010a). Among Finnish students, it has been reported to be relatively large with a medium effect size (Cohen’s d = .50, OECD, 2016) making Finland belong to the group of the participating countries with a pronounced gender difference (Brozo et al., 2014; OECD, 2016). In 2015, Finnish girls outperformed girls from other countries and scored best in PISA Reading (551 points) while Finnish boys ranked seventh (504 points) among boys from other countries (Vettenranta et al., 2016). Although also Finnish boys performed relatively well, the difference of 47 points is estimated to correspond to a one-year gap in the mastery of the basic education syllabus (Vettenranta et al., 2016). In PISA Reading, there are seven levels of reading proficiency (OECD, 2017b). In 2015, 16% of Finnish boys and 7% of Finnish girls did not achieve Level 2, which one should reach to be able for active participation in an information society (Vettenranta et al., 2016).

However, the literature is equivocal with respect to gender differences in reading skills and difficulties, because there are also studies showing no clear gender differences in reading skills and reading motivation (e.g., McGeown, Goodwin, Henderson, & Wright, 2012), or in the occurrence of reading difficulties (e.g., Jimenez et al., 2011; Moll, Kunze, Neuhoff, Bruder, & Schulte-Körne, 2014). Moreover, in a longitudinal study using another Finnish sample, no differences in terms of reading difficulties were found between boys and girls in Grade 2, whereas in Grade 8, 65% of struggling readers were boys (Torppa et al., 2015). Better reading skills of girls have been suggested to be due to a more positive attitude toward and interest in reading among girls than boys (e.g., OECD, 2010a). For example, Finnish girls participating in PISA Reading in 2009 reported enjoying reading more than boys did, girls spent notably more time on reading, and reported reading more diverse texts than boys (Brozo et al., 2014). Many studies have shown that reading comprehension correlates with leisure reading (Mol & Bus, 2011; Torppa, Niemi, Vasalampi, Poikkeus, & Lerkkanen, 2020), and, girls’ more extensive voluntary reading could, thus, result in better reading comprehension.

Despite evidence of early differences in reading interest and other motivational factors favouring girls, the possibility remains that the gender gap in reading performance can be traced back to early language skills preceding learning to read. Gender differences favouring girls have been reported already in early language development (e.g., Berglund, Eriksson, & Westerlund, 2005; Fenson et al., 1994). For instance, in a German study, girls aged 3–6 years were shown to have better language competence than boys (Lange et al., 2016). In addition, better early processing speed of English-speaking girls aged 4–7 years is suggested to contribute also to their better reading and writing skills (Palejwala & Fine, 2015). In search for explanations for the gender difference in PISA Reading, it is important to examine whether gender differences in skills emerge already in Kindergarten when schooling-related factors do not yet have an influence.

Summary of aims

The aims of the study were as follows:

  1. (1)

    to analyse the extent to which Kindergarten pre-reading skills—phonological awareness, rapid naming, letter knowledge, vocabulary, and listening comprehension—predict PISA Reading comprehension in Grade 9;

  2. (2)

    to determine whether the predictive models are similar among boys and girls;

  3. (3)

    to examine whether there are gender differences in the Kindergarten pre-reading skills. Provided that there are gender differences in the skills, the final aim is

  4. (4)

    to explore whether the gender difference in PISA Reading can be traced back to them that is, whether the gender-related difference in PISA Reading is fully mediated via the Kindergarten pre-reading skills.

Methods

Participants

The data of this study were drawn from a longitudinal study, the First Steps Study, (see Lerkkanen et al., 20062016) in which a community sample of about 2000 children were followed from Kindergarten to the end of Grade 9. Children were born in the year 2000 and came from four municipalities, two in Central Finland, one in Western Finland and one in Eastern Finland. Altogether 1839 children (873 girls; 966 boys) in the First Steps Study participated in Kindergarten assessments of pre-reading skills. Of the longitudinal sample, 1010 adolescents (480 girls; 530 boys) participated in the PISA Reading assessment in Grade 9. At the time of the Kindergarten assessments (spring 2006) the mean age of children was 6.1 years (SD = .29) and in the Grade 9 assessment (fall 2017) the average age was 15.3 years (SD = .33). The First Steps Study has been reviewed by the University of Jyväskylä Ethical Committee. An informed written consent was given by the participants’ parents (Kindergarten) and the participants themselves (Grade 9).

Because of attrition across time, we examined whether missingness was random. Little’s MCAR test, χ2(28) = 86.85, p < .001, suggested that the participants attending assessments in Grade 9 and in Kindergarten differed in some study measures from those not participating. The independent t test comparisons of the two groups (participating vs. not participating in the Grade 9 assessment) in the Kindergarten skills are presented in Table 1. The participants had slightly better Kindergarten performance in all assessed skills than the group of students not participating in Grade 9. However, the effect sizes were small (Table 1), reflecting the fact that attrition was mainly caused by children and their families moving away from the study municipalities. Furthermore, the Grade 9 assessment did not include small classrooms with students having special needs. The participants of this study, N = 1010 (480 girls; 530 boys), had no missing data in any of the measures used in the analyses.

Table 1 Comparison of children participating and not participating in the Grade 9 assessment with independent samples t test

Measures

Kindergarteners’ phonological awareness, rapid naming, letter knowledge, vocabulary and listening comprehension were assessed in April 2006 by trained testers in individual test sessions. Note that formal reading instruction starts in Finland in Grade 1, and the Kindergarten curriculum does not contain formal reading instruction (i.e., teaching of decoding or spelling), but kindergarteners are engaged in playful activities involving letters, words and numbers. Assessments were carried out by trained testers who were either university researchers or education or psychology graduate-stage students. Assessments in Grade 9 took place in November 2015 in the classrooms using group-administered tests.

Phonological awareness

The initial phoneme identification test (Lerkkanen, Poikkeus, & Ketonen, 2006) included 10 items. In each of them the experimenter first named a row of four pictures of objects, gave the instruction “At the beginning of which word do you hear the sound/?/”, and asked the child to point out the correct picture. All sounds were single phonemes. The score was the number of correct responses (max. = 10). The Cronbach’s alpha reliability coefficient was .76.

Rapid naming

Rapid naming of objects was assessed using the standard procedure (Denckla & Rudel, 1976). The children were asked to name as fast as possible a series of five pictures of objects arranged in semi random order in five rows of 10. There was a practice trial before the test to ensure the child’s familiarity with names of the objects. The score was the total time in seconds that it took for the child to name all stimuli. Only a few errors occurred and for this reason, only total naming time was used in the analyses.

Letter knowledge

The children were asked to name all 29 Finnish letters which were shown on a sheet and arranged in three rows (Lerkkanen et al., 2006). The letters were named one row at the time, while the other rows were covered. The score was the number of correctly named letters (max. = 29). The Cronbach’s alpha reliability coefficient was .94.

Vocabulary

Receptive language was assessed using a 30-item shortened version of the Peabody Picture Vocabulary Test-Revised (PPVT-R, Form L; Dunn & Dunn, 1981). The children were asked to point out the picture from four alternatives which correctly represents the word spoken by the experimenter. The items for the shortened version were selected based on data from the full-scale administration of the PPVT-R in the Jyväskylä Longitudinal Study of Dyslexia (Lyytinen et al., 2004). The score was the number of correct responses (max. = 30). The Cronbach’s alpha reliability coefficient was .61.

Listening comprehension

Listening comprehension was assessed using a group-administered test developed at the Centre for Learning Research, University of Turku. The children heard a story with 130 words twice in the classroom setting after which they answered six multiple-choice questions (in four of the questions there were three choices, and in two questions there were four choices) which were based on the story and accompanied by pictures. The children were asked to select a picture that would best fit the story. Two points were given for each correct answer (max. = 12). The Cronbach’s alpha reliability coefficient was .30. Torppa et al. (2016) reported that the standardized beta for the prediction of another listening comprehension measure a year later was .45.

Programme for international student assessment reading

The students had 60 min to complete the battery of reading tasks in the classroom group administered setting. These tasks formed the PISA Reading link items which are used repeatedly in each cycle of the PISA survey to ensure that the measurement remains comparable across time (OECD, 2010b). The booklet included eight different texts which students were asked to read and then answer questions. The reading materials consisted of texts, tables, graphs and figures. There were 15 multiple-choice questions and 16 questions, which required written responses. Of the questions, 12 required the students to access and retrieve information, 12 to integrate and interpret information and 7 to reflect and evaluate information. The total score was calculated by adding up the item scores (1 point given for each correct response). The maximum score was 31. The Cronbach’s alpha reliability coefficient was .75.

Statistical Analysis

First, the distributions of the measures were examined (see Table 2). Kindergarten vocabulary and listening comprehension approached the normal distribution. Phonological awareness and letter knowledge were left-skewed. Rapid naming was somewhat right-skewed. One extreme outlier in rapid naming was moved to the end of the right-tail of the distribution. Due to the skewness of the measures, all path models were estimated using a robust maximum likelihood estimator (MLR) provided by Mplus 8.4 (Muthén & Muthén, 1998–2017). Two types of path models were estimated. First, to examine if the Kindergarten skills predict Grade 9 PISA Reading similarly among boys and girls, multigroup models were estimated using within-gender standardized variables. The model estimates were compared between boys and girls by fixing each estimate equal one-by-one and examining the model fit. Second, to examine if the effect of gender on Grade 9 PISA Reading is direct or mediated via Kindergarten cognitive skills, a model where gender was set to predict reading comprehension both directly and indirectly via Kindergarten cognitive skills was estimated. The goodness-of-fit of the estimated models was evaluated using the following indicators: χ2-test, Comparative Fit Index (CFI), Root Mean Square Error of Approximation (RMSEA), and Standardized Root Mean Square Residual (SRMR). Good model fit is indicated by a small, preferably non-significant χ2, CFI > .95, RMSEA < .06, and SRMS < .08 (Hu & Bentler, 1999).

Table 2 Descriptive statistics of the measures

Results

Prediction of PISA Reading among boys and girls

First, we examined the extent to which the Kindergarten skills predicted Grade 9 PISA Reading (RQ1) and whether the predictive models were similar among boys and girls (RQ2). Pearson correlation coefficients between the measures for boys and girls are presented in Table 3. All Kindergarten skills correlated significantly with the PISA Reading score for both genders. The multigroup path modeling (Fig. 1) suggested that the path from letter knowledge to PISA Reading was stronger for boys than girls as the model with equal path estimate did not fit the data well, χ2(1) = 5.04, p < .05. All the other regression paths were equal for boys and girls. Furthermore, the correlations between phonological awareness and letter knowledge, vocabulary, and rapid naming as well as with letter knowledge and vocabulary were not equal for boys and girls (see Table 3).

Table 3 Pearson correlation coefficients
Fig. 1
figure 1

The final multigroup path model (single estimates when boys’ and girls’ estimates were equal and with separate estimates (boys/girls) for the path from letter knowledge where boys’ and girls’ estimates were not equal)

In the final model (Fig. 1), the path from letter knowledge to PISA Reading and the unequal correlations were freely estimated and the other paths and correlations were fixed equal for boys and girls. The model fitted the data well, χ2(10) = 4.84, p = .90, RMSEA = .00, CFI = 1, SRMR = .01. In the model for boys, all kindergarten skills but phonological awareness predicted PISA Reading significantly, and 27% of the variance in PISA Reading was explained by the model. In the model for girls, rapid naming, vocabulary, and listening comprehension predicted PISA Reading, and 18% of the variance in PISA Reading was explained by the model.

Mediation path models

Second, we examined if there were gender differences in the kindergarteners’ pre-reading skills (phonological awareness, rapid naming, letter knowledge, vocabulary, and listening comprehension) (RQ3), and if the gender-related differences in PISA Reading were fully mediated via these measures (RQ4). The descriptives, t test results and effect sizes (Cohen’s d) are reported in Table 4. Statistically significant gender differences emerged for the means of Kindergarten phonological awareness, letter knowledge, and vocabulary, and for Grade 9 PISA Reading. Girls outperformed boys in phonological awareness, letter knowledge, and PISA Reading while boys outperformed girls in vocabulary. The effect sizes were, however, either small or negligible except for PISA Reading performance, in which it was moderate (Cohen, 1992). Gender differences for rapid naming and listening comprehension were not statistically significant.

Table 4 Gender comparison with independent sample t test

In the mediator path model (Fig. 2), of interest was if the effect of gender on PISA reading is mediated via the Kindergarten measures. The model predicted a total of 28% of the PISA Reading variance. The model suggested that the effect of gender on the PISA Reading score was mostly direct and only to a limited extent mediated via the Kindergarten skills. Three small but statistically significant indirect effects from gender to PISA Reading were identified and each one predicted 1–3% of the PISA Reading score (altogether 5%), one through letter knowledge (.03***), one through rapid naming (.01*), and one through listening comprehension (.01*). In other words, the gender differences in the Kindergarten skills did not fully explain the gender difference in the PISA Reading scores. The final model is saturated and thus no model fit indices are available.

Fig. 2
figure 2

The mediator path model

Discussion

The aims of the present study were to examine the extent to which the Kindergarten pre-reading skills (phonological awareness, rapid naming, letter knowledge, vocabulary, and listening comprehension) among Finnish kindergarteners predict their PISA Reading comprehension performance at the age of 15 years. Moreover, we wanted to determine whether the predictive models are similar among boys and girls, whether there are gender differences in these Kindergarten pre-reading skills and whether the gender difference in PISA Reading can be explained by the pre-reading skills.

The Kindergarten pre-reading skills predicted 27% of the variance in reading comprehension assessed by the PISA Reading test among boys and 18% among girls. The finding is in accordance with earlier results in another, smaller Finnish sample with a high proportion of children with reading disability (Eklund et al., 2018). The same Kindergarten skills were significant predictors among both genders, except for letter knowledge, which was a significant predictor only for boys. The strong role of letter knowledge and rapid naming as significant predictors of reading comprehension 9 years later can be understood through their widely reported associations with reading fluency (e.g., Eklund et al., 2018; Georgiou et al., 2008; Kairaluoma et al., 2013; Savage & Frederickson, 2005). The finding for letter knowledge extends previous literature by showing a long-term prediction of letter knowledge to early adolescence reading comprehension (e.g., Elbro et al., 1998; Pennington & Lefly, 2001; Puolakanaho et al., 2007; Snowling, Gallagher, & Frith, 2003). It is possible that letter knowledge was not a significant predictor among girls because their letter knowledge was already at a high level and there was too little variation at the end of Kindergarten. Rapid naming has been shown to be the best early predictor of reading fluency (e.g., Georgiou et al., 2008; Savage & Frederickson, 2005; Torppa, Georgiou, Salmi, Eklund, & Lyytinen, 2012) and it is likely that the effect of rapid naming on reading comprehension documented in the present study is mediated via reading fluency (e.g., Eklund et al., 2018; van Viersen et al., 2018). Prior findings attest that difficulties in rapid naming and letter knowledge predict persistent reading difficulties beyond early grades among Finnish readers (e.g., Eklund, Torppa, Aro, Leppänen, & Lyytinen, 2015; Korhonen, 1995; Torppa et al., 2015). Although reading difficulties were not the focus of the present study, it is likely that a subgroup of the children with poor Kindergarten rapid naming and letter knowledge will become slow readers, which hampers reading comprehension, particularly when assessed using a measure with multiple texts like PISA Reading.

In addition to the known strong predictors of reading, letter knowledge and rapid naming, also Kindergarten vocabulary and listening comprehension predicted PISA Reading performance. This supports the findings of the previous studies on skills which are needed in PISA Reading (Arnbak, 2012) and of Kindergarten skills that predict reading comprehension during elementary grades (e.g., Eklund et al., 2018; Leppänen, Aunola, Niemi, & Nurmi, 2008; Manolitsis et al., 2011; Torppa et al., 2016). Phonological awareness, on the other hand, did not add to the prediction which is in line with the previous similar findings in Finnish studies (e.g., Holopainen et al., 2001), and can be understood based on the nature of the Finnish language being situated at the far end of the continuum of transparency of orthography with almost perfect grapheme-phoneme correspondence and, consequently, kindergarteners being almost at the ceiling on initial phoneme identification.

The final aim of the present study was to examine whether the gender difference in favour of girls in PISA Reading can be explained by gender-related differences in pre-reading skills found in Kindergarten. As expected, a statistically significant gender difference in favour of girls emerged in PISA Reading which is in line with previous research on the gender gap in PISA Reading internationally (OECD, 2016) as well as in the Finnish context (Brozo et al., 2014; Vettenranta et al., 2016). In the national PISA Reading sample, the effect size for the gender difference was .50 (OECD, 2016) which is close to the effect size of .45 in the current sample. Quinn & Wagner (2015), on the other hand, reported that boy:girl gender ratio at different levels of reading difficulties (percentiles 30, 15, 5, and 3) varies between 1.15–2.56:1 depending on reading measure and percentile. The gender ratios for PISA Reading percentiles in this sample were larger: 1.99:1 for percentile 30, 2.83:1 for percentile 15, 8.75:1 for percentile 5, and 7.67:1 for percentile 3. Quinn & Wagner (2015) measures focused on reading fluency, however. These results suggest that gender differences are larger for PISA Reading comprehension than for reading fluency difficulties. With respect to gender differences in kindergarteners’ pre-reading skills, we found that girls outperformed boys in phonological awareness and letter knowledge although the effect sizes suggested that the difference was small. The finding is in line with the previous studies reporting better pre-reading skills in Kindergarten for girls (Eklund et al., 2018; Lange et al., 2016; Palejwala & Fine, 2015). Somewhat surprisingly, there were no differences in listening comprehension and boys outperformed girls in vocabulary, but the effect size was negligible. The findings are in general agreement with studies showing gender differences in reading-related skills and early reading (e.g., Berninger, Nielsen, Abbott, Wijsman, & Radskind, 2008; Quinn & Wagner, 2015; Stoet & Geary, 2013).

The mediator model suggested that gender is a significant direct predictor of PISA Reading performance even after taking into account the effects of the Kindergarten skills. The effect of gender on PISA Reading was only partially mediated through letter knowledge, listening comprehension, and rapid naming explaining altogether 5% of the PISA Reading variance. The results thus suggest that the gender difference in PISA Reading do not derive from the Kindergarten pre-reading skills. Instead, our results call for other explanations, for example, girls getting more exposure to reading practice, reading more or broader types of texts compared to boys (Brozo et al., 2014), girls being more interested and engaged in reading (OECD, 2010a), or having a more favourable attitude toward reading in Grades 1–6 (McKenna, Kear, & Ellsworth, 1995). In addition to the individual-level explanations in skills or motivation, also explanations at the level of cultural values, parenting practices, school environment factors, or pedagogical approaches are plausible (e.g., Stoet & Geary, 2013).

The findings of the predictive analysis suggest that despite the many strengths of the Finnish education system, it does not fully alleviate or compensate for the differences in pre-reading skills existing between children in Kindergarten. In addition, the present findings also provide some support to the notion that the gender gap in reading tends to increase across the school years. As gender differences in reading comprehension mostly seem to develop during basic education, it is crucial that reading of boys would be supported more strongly in homes, schools, and in the society (see also Torppa et al., 2020). Notwithstanding the avid discussion on the gender gap in Finland, it is noteworthy that in an international comparison on average, also Finnish boys are good readers (e.g., OECD, 2016; Vettenranta et al., 2016). The debate rather reflects the egalitarian ethos of the Finnish comprehensive school; it is difficult to digest that an excellent reading performance seems to be associated with gender.

Certain limitations of the present study need to be considered. First, there was only one pre-reading measure per domain. A more extensive assessment might have yielded a stronger prediction. The assessment of listening comprehension in particular was not optimal as it was carried out in classrooms with only one task with few items. We cannot rule out the possibility that with more comprehensive measures, the mediation analysis of gender might have provided different results. Secondly, the measures of letter knowledge and phonological awareness were negatively skewed which is typical of Finnish kindergarteners. An earlier assessment would have yielded better psychometric properties but not necessarily a stronger predictive effects as also the gap between assessments would have increased. The Kindergarten assessment is also in alignment with the pedagogical practice in Finland. Thirdly, it is possible that readers with the poorest skills were underrepresented in the data because students in special-needs education classes were not included in the sample (however, children with special needs who attended regular classrooms were included). Fourthly, there was attrition in the sample. Students participating in the PISA Reading test had slightly better pre-reading skills in Kindergarten, albeit the effect sizes were small. The findings could be more robust if all students had participated in the Grade 9 assessment. In future research, other early predictors of PISA Reading performance should be considered, for example, kindergarteners’ print exposure, motivational measures such as task-focused behaviour as well as school engagement during the elementary grades.

In conclusion, considering the time-gap between the ages of 6 and 15 years, the amount of variance (27%/18%) of PISA Reading predicted by the Kindergarten pre-reading skills is remarkable. In Finland, schools as a rule provide uniformly high-quality instruction by qualified teachers following the national core curriculum in collaboration with special needs teachers. Yet, close to a quarter of variance in reading comprehension scores at the age of 15 can be predicted by Kindergarten measures assessed prior to any reading instruction. This suggests some degree of stability in individual factors affecting reading development and, consequently, also the need for more careful early targeting of support. However, the gender-related gap in adolescent years appears to emerge along other pathways than differences in pre-reading skills related to gender. Therefore, the present results underscore the possibility that pathways of reading development through the school years appear to differ between girls and boys, much to the disadvantage of boys. If replicated across school systems, the findings would point up a profound challenge for the national curriculum.