Introduction

Literacy has the power to transform lives, particularly among the world’s most vulnerable populations. In countries with great linguistic diversity, however, the path to literacy acquisition is complex. India is home to over 300 million individuals who speak two or more of the approximately 462 languages (Simons & Fennig, 2018). The educational policy for language instruction reflects the prevalence of multilingualism, requiring children to learn to read in three languages upon the completion of secondary school, of which two are typically Hindi and English (Joshi et al., 2017). However, there is great variation across the nation in the languages taught, their order, and the time at which they are introduced in school (see Menganathan, 2011). Seen as a language of opportunity, schools are increasingly offering English as a medium of instruction starting in Grade 1. Children in these schools are not only expected to learn to read in the instructional medium (e.g., English) but they are often simultaneously taught one of the Indic languages (e.g., Hindi), requiring the mastery of two distinct writing systems upon school entry (Joshi et al., 2017).

There exists a large and sound body of research on biliteracy acquisition. One of the major findings to emerge from this work has been that phonological skills in the first language (L1) can and do transfer to the second language (L2), as well as vice versa, facilitating reading in both languages (e.g. see Gottardo et al., 2021 for review). However, meta-analyses have revealed variations in these associations based on the languages examined, tasks used, age of the participants, and their instructional/linguistic experiences (Branum-Martin et al., 2012; Melby-Lervåg & Lervåg, 2011), highlighting a need to study these associations in diverse populations and language pairs. Understanding these relations are critical as they may aid in the early identification of children at risk of reading difficulties and can help in the design of effective instruction and remediation.

Studies examining biliteracy acquisition among children in India are greatly limited. Those which have been conducted have largely ignored emergent readers, despite the fact that an increasing number of children are expected to learn to read in two languages upon school entry. Furthermore, existing studies have not examined bidirectional transfer despite evidence that in some languages, phonological processing skills in both L1 and L2 are related to reading in both languages (e.g., LaFrance & Gottardo, 2005). Finally, there are no studies to our knowledge which have specifically examined Hindi–English biliteracy despite these being two widely spoken languages in the country (Office of the Registrar General & Census Commissioner India, 2011). In light of this, the present study aimed to fill some of the gaps in the existing literature base on biliteracy acquisition among children in India by examining the role of L1 and L2 PA on reading in a sample of emergent Hindi–English bilinguals.

The role of phonological awareness in reading

Reading, regardless of language, requires using the sounds of the spoken language to process the written script (e.g., Perfetti, 2003; Perfetti et al., 2005). More specifically, phonological awareness (PA), the ability to recognize, discriminate, blend, and manipulate the sounds in language, has long been identified as a critical skill for reading acquisition (Adams, 1990; Goswami & Bryant, 1990; Wagner & Torgesen, 1987). Prior to formal literacy instruction, PA develops through children’s experiences with oral language (e.g., Carroll, 2001; Cooper et al., 2002). As children begin to engage in formal literacy instruction, their phonological sensitivity becomes more refined through instruction and increased exposure to language and print (Anthony & Lonigan, 2004). PA is believed to develop hierarchically, with children in the early stages developing sensitivity to large phonological units (e.g., syllables and rhymes), and over time becoming increasingly more sensitive to smaller units (e.g., phonemes) (Goswami & Bryant, 1990; Treiman & Zukowski, 1991; Ziegler & Goswami, 2005; see Pufpaff, 2009 for review). According to the Psycholinguistic Grain Size Theory, the rate at which these skills develop varies across languages as a result of orthographic differences which affects the availability of sounds in the spoken language, the consistency in how spoken language maps onto written language, and the granularity of the writing system (Goswami, 2010; Ziegler & Goswami, 2005, 2006).

Many of the scripts used across the Indian subcontinent emerged from the Brahmi-derived writing system in which the basic unit of writing is the akshara (see Kandhadai & Sproat, 2010 for a detailed description). While languages such as Hindi, Kannada, Oriya, Telugu, etc., have symbol sets that vary in appearance, they all share common psycholinguistic features (Nag, 2011). Each akshara represents either a vowel or a consonant–vowel syllable (e.g. ). In their primary form, consonants are encoded with an inherent schwa vowel which is retained when the vowel appears in the word initial position. Otherwise, the schwa vowel can be replaced with other vowel sounds by placing vowel diacritics before, after, above, or below the consonant or syllable cluster (e.g., - /ku:/ = ; - /mi:/ = ) (Rao et al., 2021). Thus, although each distinct akshara represents sound at the syllable-level, each syllable consists of distinct phonemic units. As a result, akshara reading requires sensitivity to both syllable and phoneme-level information for decoding (Share & Daniels, 2016; Vaid & Gupta, 2002). Given the availability of distinct phoneme markers, the akshara orthographies are transparent. However, they are highly extensive with a symbol set of over 400 akshara and have great visuo-spatial complexity due to the non-linear arrangement of the symbols (Nag, 2007). Consequently, it has been found that akshara learning continues well into the elementary grades (e.g., Grades 4–5) (Nag, 2011).

The role of phonological awareness in akshara reading

Studies on akshara reading development have been reflective of the dual importance of PA, at both the syllable and phoneme-levels, in decoding (Mishra & Stainthorp, 2007; Nag, 2007; Nag & Snowling, 2012; Nakamura et al., 2018; Reddy & Koda, 2013; Singh & Sumathi, 2019). Singh and Sumathi (2019), examined reading development in 230 children across Grades 1–5 (ages 5–10) who were learning to read Hindi or Marathi. PA was assessed through a rhyme oddity task and a syllable replacement task in each language, and an average score was used for the analysis. PA significantly predicted word reading across grades and for children in Grades 3–5, PA emerged as the strongest unique predictor of both word and pseudoword reading, reflecting the transparent nature of akshara-sound mapping. Unfortunately, due to the use of a composite score, it was not possible to tease apart the various contributions of rhyme versus syllable-level PA.

In a study which examined children across Grades 1–4 it was seen that although both syllable and phoneme-level PA play a role in akshara decoding, syllable-level awareness plays a more dominant role, particularly in the early grades (Nag, 2007). Once children reached Grade 3, and had developed more advanced reading skills, phoneme awareness emerged as a slightly stronger predictor. Nevertheless, syllable awareness has been found to remain significant over time, including among older children (Grades 4–6). The salience of the syllable and its emphasis in reading instruction likely contributes to the stability of syllable-level awareness as a predictor of akshara reading (Nag & Snowling, 2012). A more recent study which examined PA and reading in Kannada and Telugu across children in Grades 1–5 corroborated previous findings (Nakamura et al., 2018). While there were unique contributions of both syllable and phoneme awareness to word reading, children’s syllable-level awareness seems to become sharpened over time and eventually subsumes phoneme-level awareness due to the saliency of the syllable in the akshara orthographies.

Bilingual reading and cross-language transfer of phonological awareness skills

PA, viewed as a universally required skill for reading, has received much attention in studies of biliteracy. A major finding to emerge from these studies is that L1 and L2 PA skills are highly correlated, and L1 and L2 PA can predict L1 and L2 reading (Gottardo et al., 2021; Melby-Lervåg & Lervåg, 2011). This finding has since held true across a variety of language pairs such as English–Spanish, English–French, English–Korean, English–Chinese, English–Japanese, English–Kannada, and English–Kiswahili (e.g., Chen et al., 2010; Cisero & Royer, 1995; Comeau et al., 1999; Durgunoğlu et al., 1993; Gottardo et al., 2001; Kim, 2009; Kuo et al., 2016; Reddy & Koda, 2013; Wang et al., 2006; Wawire & Kim, 2018). According to Koda’s Transfer Facilitation Model (Koda, 2007, 2008), non-language specific aspects of PA, once developed in the L1, should be available for reading in the L2. As a result, we would expect PA in the two languages to be closely related. However, transfer would be expected to play a larger role when two languages share similar linguistic and orthographic properties which could be shared (Koda, 2008). Therefore, it is likely that different shared skills are transferred depending on the language pair.

Past studies have in fact revealed that there are differences in the facets of PA that transfer as a function of orthographic differences between languages. Bruck and Genesee (1995) found that English-speaking children in Grade 1 who were attending a French-immersion school had more advanced syllable-awareness skills in English than their monolingual peers, due to the saliency of the syllable in French. Chow et al. (2005) examined Chinese kindergarten students who were learning to read English and similarly found that Chinese syllable awareness significantly predicted English reading reflecting the role of the syllable as the basic phonological unit in Chinese. Interestingly, Chen et al. (2010) who examined Chinese children attending an intensive English program as compared to a regular English program found that children in the intensive group showed faster and higher growth on measures of Chinese rime and phonemic awareness, reflective of the phonological features of English. Thus, although PA is considered to be a language-universal construct, there is clear evidence that the pattern of transfer is reflective of the phonological features of the languages involved (see Branum-Martin et al., 2015), and that this is not only true for transfer from L1 to L2 but also from L2 to L1.

Cross-language transfer of phonological awareness between the akshara languages and English

Few studies have examined cross-language associations among akshara-English bilinguals. Mishra and Stainthorp (2007) conducted one of the first studies and examined 9-year-old (Grade 5) native Oriya speakers. The children attended schools which were either Oriya-medium (n = 48) or English-medium (n = 51) and had studied the other language as a subject starting at Grade 2. PA in English was assessed using the Test of Phonological Awareness (Hatcher et al., 1994) and an analogous test developed for Oriya. PA in English and Oriya were found to be highly correlated, however, the cross-language transfer of PA was not symmetrical between children in the two types of schools. For children who attended Oriya-medium school (Oriya is the first literacy language), PA in Oriya predicted decoding in both languages; a finding in line with previous studies of cross-language transfer. In addition, PA in English was found to predict word reading in English indicating that for children in Oriya-medium school, there is evidence of bidirectional transfer.

For children who were attending English-medium school (English is the first literacy language), PA in Oriya predicted pseudoword reading in Oriya and word reading in English and PA in English was found to only predict English decoding. Thus, while there was a role of transfer from the L1 to the L2, there was no bidirectional effect for children in English-medium school. Recognizing the impact of orthographic differences, the authors also examined the role of Oriya syllables versus Oriya phonemes. It was found that for those in Oriya-medium schools, awareness of the syllable made a significant unique contribution to Oriya decoding. Whereas for those in English-medium schools, it was awareness of the phoneme, and not the syllable, that predicted Oriya decoding. Authors concluded that this difference may be an effect of English phonemic awareness facilitating phonemic awareness in Oriya among the English-medium students. Unfortunately, these differential facets of PA were not examined across languages.

In a second study, Reddy and Koda (2013) specifically examined the effects of orthography-specific demands on decoding development among 10–14-year-olds (n = 52) who had received about four years of literacy instruction in Kannada and one year of literacy instruction in English. PA in both languages was assessed using analogous syllable-deletion and phoneme-deletion tasks. They found that while both syllable and phoneme-level awareness in Kannada contributed to Kannada decoding, only English phoneme-level awareness contributed to English decoding; reflecting the differential roles played by the syllable and phoneme in the two languages. Furthermore, it was seen that syllable and phoneme-level awareness in Kannada significantly contributed to English decoding, but this relationship was mediated by English phoneme awareness.

Purpose of the study

In the present study, we aimed to expand the current understanding of cross-language transfer of PA to include Grade 1 and Grade 2 Hindi speaking children who were attending an English-medium school. Specifically, we examined the strengths of the associations between the following:

  1. 1.

    PA in Hindi (L1) and decoding in Hindi (L1),

  2. 2.

    PA in English (L2) and decoding in English (L2),

  3. 3.

    PA in Hindi (L1) and decoding in English (L2),

  4. 4.

    PA in English (L2) and decoding in Hindi (L1).

We attempt to make three contributions to the gaps in the current research base on cross-language transfer of PA among akshara-English bilinguals. The first is by studying the role of PA in emergent readers. Previous studies conducted in India have all examined children in Grade 4 and above. While it is clear that at least among these older children with sufficiently developed literacy skills we see evidence of transfer, we do not know if the same is true for emergent readers. PA skill is well known to be the most powerful predictor of early reading (e.g., Melby-Lervåg et al., 2012). As students mature, however, PA and its relation to reading becomes less consistent (Scarborough, 2005). An increasing number of children are expected to learn to read in two languages upon school entry when literacy skills in both languages may still be rudimentary, thus it is important to explore patterns of transfer among these populations. Previous studies examining Chinese–English kindergarteners who were receiving literacy instruction in both languages, found a positive effect of L1 PA, specifically syllable awareness, on both L1 and L2 word reading (McBride-Chang et al., 2008). Although these young readers were still developing their PA in their L1, there still seems to be a meaningful role of L1 PA in L2 reading among this population.

The second contribution is the examination of cross-language transfer of PA at multiple grain sizes and bidirectionally. The examination of PA at multiple grain sizes is important as meta-analyses have indicated that composite tasks yield higher cross-language correlations than tasks only assessing PA at the syllable level, implying greater cross-language relations (Branum-Martin et al., 2012). The examination of PA at multiple levels will also allow us to better understand whether different phonological units have differential associations with word reading across the two languages. Although some previous studies in India have used multiple PA tasks (e.g. syllable-level and phoneme-level), these studies have not examined the transfer of these units both from the L1 to the L2 and vice versa. The final contribution is the addition of a new language pair to the research base. Although previous studies have examined akshara-English bilinguals, we believe it is important to replicate this work specifically examining Hindi–English bilinguals given the great amount of linguistic heterogeneity in India.

Method

Participants

The data for this study was collected as part of a larger study (see  Patel et al., 2021) which included 143 students in Grades 1 (n = 70; Mage = 5.73, SD = 0.45) and 2 (n = 73; Mage = 6.68, SD = 0.47) who were attending an English-medium government school in Delhi, India. Prior to the start of the study, parents/guardians were invited to the school and taken through a consent form provided to them in both Hindi and English to ensure they were fully informed of their child's participation. The sample was made up of 43 boys and 27 girls from three Grade 1 classrooms and 31 boys and 42 girls from three Grade 2 classrooms.

The children attended school for approximately six hours per day, five days a week, during which all subject material was presented in English and children learned Hindi for about an hour per day. Hindi literacy instruction focused on the teaching of whole symbol blocks, with children learning the syllables that each akshara in the basic symbol register represents (see for e.g. Nag, 2011). English literacy instruction included some teaching of letter-sound correspondences and word families; however, teachers were heavily reliant on rote reading and copy-writing as is common in many English classrooms across India (see for e.g. Dutta & Bala, 2012).

Information on children’s out-of-school language use was collected through a questionnaire provided to the families. Out of 142 respondents, 118 families reported using only Hindi in the home, 19 families reported using both Hindi and English in the home, and 5 families reported using Hindi and/or another Indian language in the home. For children in India, another major source of language input is through private after school tutoring, which a majority of children attend (see Dongre & Tewary, 2014 for a discussion on the impact of private tutoring in India). In our sample, 109 children (76.2%) attended after school tutoring out of which 42 attended Hindi-medium centers, 6 attended English-medium centers, and 60 attended centers where both Hindi and English were used.

Measures

Children’s literacy skills were assessed using the Dyslexia Assessment Battery from the Dyslexia Assessment for Languages of India (DALI-DAB) (Singh, 2015). The DALI-DAB is one of the first standardized and validated tools for the assessment of literacy skills in both Hindi and English, designed along similar lines as the Phonological Assessment Battery (PhAB; Frederickson et al., 1997), specifically for use in India (see Rao et al., 2021 for validation details). The DALI contains a separate battery for children in Grades 1–2 and Grades 3–5 to ensure that the tasks are developmentally appropriate, and the tasks were developed keeping in mind the orthographic properties of the respective language. The data used for this study was collected using the semantic fluency, rapid automatized naming (RAN), phonological awareness, and word reading subtests from the Grade 1–2 battery of the DALI in both Hindi and English. An additional word reading task as well as a pseudoword reading task developed for this population, but not a part of the DALI, were also used (Cherodath & Singh, 2015).

Semantic fluency

In the semantic fluency task, participants were given 30 s to name objects in two given categories (i.e., animals and vegetables for Hindi and fruits and vegetables for English). For each language, a sum score of the correctly named items across both categories was used.

Rapid automatized naming (RAN)

RAN was measured using an object naming task from the DALI in which participants were presented with line drawings of five common objects: shoe, flower, house, chair, and key for Hindi and cup, bird, clock, van, and pencil for English, which were randomly arranged in 5 rows of 10 objects each. The participants were asked to name all of the objects in order as quickly as possible and the time taken was measured in seconds.

Phonological awareness (PA)

Hindi syllable replacement

In line with the orthographic properties of Hindi in which each akshara represents a syllable unit, PA in Hindi was assessed at the syllable-level using a syllable (akshara) replacement task. The task consisted of two training trials and 10 experimental trials. In each trial, children were orally presented with a word and asked to replace the initial akshara (syllable unit) with a given akshara. (e.g., (Gar) with <  > , response =  (Sar)). A score of one was given for every correctly formed new word. Cronbach’s alpha reliability was 0.89.

English phoneme replacement

PA in English was assessed as at the phoneme-level through a phoneme replacement task. The task consisted of two training trials and 10 experimental trials. In each trial, children were orally presented with a word and asked to replace the initial phoneme with a given phoneme (e.g., replace /c/ in cot with /g/, response = got). A score of one was given for every correctly reproduced word. Cronbach’s alpha reliability was 0.89.

Rhyme oddity

PA in both languages was also assessed using a rhyme oddity task consisting of two training trials and 12 experimental trials. In each trial, children were orally presented with three words out of which they were asked to identify the rhyming pair (e.g., Hindi: (naam) – (kaam) – (neel), English: made—hide—fade). A score of one was given for every correct response. Cronbach’s alpha reliability for the Hindi and English rhyme oddity task was 0.80 and 0.83, respectively.

Word reading

Hindi word reading was measured using two separate word lists. The first list was from the Hindi DALI and consisted of 25 words collated from 1st and 2nd grade textbooks, arranged in order of increasing difficulty. The second word reading list consisted of 20 items taken from Grade 1–3 textbooks (Cherodath & Singh, 2015). Children were instructed to read all items and a score of one was given for every correctly read word.

English word reading was also measured using two separate word lists. The first word reading list was taken from the English DALI and consisted of 25 words collated from 1st and 2nd grade textbooks arranged in order of increasing difficulty. The second word reading list consisted of 20 items taken from Grade 1–3 textbooks (Cherodath & Singh, 2015). Children were instructed to read all items and a score of one was given for every correctly read word.

In both Hindi and English, the two word readings lists were found to be highly correlated (Hindi: r = 0.93, English: r = 0.86), and therefore the averages of the two scores in each language were used for analysis. Cronbach’s alpha reliability of the combined word reading list was 0.99 for both Hindi and English.

Pseudoword reading

Pseudoword reading was assessed in both languages using a pseudoword list developed by Cherodath and Singh (2015). The English list consisted of 20 items in which a single letter was replaced in a real word to create a legally pronounceable string in English. Similarly, the Hindi list consisted of 20 items in which a single akshara was replaced to create a legally pronounceable string in Hindi. A score of one was given for every correctly read pseudoword. Cronbach’s alpha reliability for the Hindi and English pseudoword reading tasks was 0.94 and 0.91, respectively.

Procedure

Children were brought into a quiet room within the school where they were assessed one-on-one across two sessions, one for Hindi and the other for English, each lasting about 20 min. The assessment sessions were conducted class-by-class, either in Hindi first or English first. Once all six classrooms had been assessed, children were brought back and assessed in the other language, thereby ensuring that children had a sufficient break between the two assessment sessions. The Hindi tasks were administered by three research assistants who were bilingual English–Hindi speakers. The English tasks were administered by the lead researcher who is a native English speaker and the Hindi–English bilingual research assistants. All of the assessors had a minimum of a bachelor’s degree and prior experience working with children. Prior to the start of the study, a training session was held with the assessors on the administration and scoring of the data collection tools to ensure full understanding and consistency.

Data analysis

The data was analyzed using IBM SPSS and R Studio. First, a pairwise t-test was conducted to compare children’s performance on the Hindi and English measures. Hierarchical linear regression was then employed to examine the unique effects of the various sub-skills in predicting word and pseudoword reading within and across the two languages. In hierarchical regression, predictor variables are added in steps, thereby allowing us to examine the unique effect of each variable(s) over and above the previously entered variable(s). In other words, this method allows us to examine the effect of a variable(s) after controlling for the effects of other variable(s). Effect size was measured using Cohen’s f2 according to which a value of 0.02 is a small effect, a value of 0.15 is a medium effect, and a value of 0.35 is a large effect (Cohen, 1992).

In all the models, children’s age, and performance on the semantic fluency and rapid automatized naming (RAN) tasks were entered into the model first, allowing us to control for the variance emerging from differences in age and children’s oral language/naming skills. Semantic fluency, while typically used as a measure of executive function, is also affected by vocabulary (Kavé, 2006). RAN, which assesses the ability to rapidly access names from visual symbols, has long been identified as a strong predictor of reading across a variety of languages including English (see for e.g., Compton, 2003) and Kannada (Nag & Snowling, 2012). The phonological awareness measures were then entered into the model in alternating blocks, allowing us to examine the unique contribution of each phonological awareness task over and above the other.

Commonality analysis was conducted to determine the variance contributed by the PA tasks within and across the languages. Commonality analysis allows us to decompose the regression effect into unique and common effects, allowing a more accurate interpretation of the results, particularly when there is multicollinearity among the predictor variables (Kraha et al., 2012; Nimon, 2010). The analysis was conducted using the ‘yhat’ package (Nimon et al., 2020) in R Studio. We also calculated the squared structure coefficients allowing us to examine the percentage of the regression effect explained by each predictor (Thompson, 2006).

Results

Descriptive statistics for the Hindi and English tasks and paired sample t-test results comparing children’s skills in the two languages are reported in Table 1. Results showed that children performed significantly better on the Hindi tasks, reflecting that Hindi was the more dominant language in this sample. Partial correlations, controlling for the effect of age, are presented in Table 2. The correlations between PA in Hindi and English, decoding in Hindi and English, PA in Hindi and decoding in English, and PA in English and decoding in Hindi were all moderate to high and significant.

Table 1 Descriptive statistics and paired samples t-test comparison of the Hindi and English measures
Table 2 Correlations among the Hindi and English variables controlling for age

Predicting Hindi decoding with Hindi PA

First, hierarchical regression analysis was used to examine the contribution of the Hindi PA tasks to Hindi word and pseudoword reading (see Tables 3, 4 respectively). Age was entered into the model first, followed by Hindi semantic fluency and Hindi RAN as covariates, and finally the Hindi PA tasks were entered in alternating order (Steps 3 and 4) to examine the contribution of each over and above the other.

Table 3 Hierarchical regression and commonality analysis results predicting Hindi word reading by Hindi phonological awareness
Table 4 Hierarchical regression and commonality analysis results predicting Hindi pseudoword reading by Hindi phonological awareness

In predicting Hindi word reading (see Table 3), the total variance explained by the model was 42%, with the Hindi PA tasks explaining 17% of the variance over and above the covariates. Both Hindi rhyme oddity and syllable replacement were found to be uniquely associated with Hindi word reading. Results of the commonality analysis indicated that Hindi rhyme uniquely explained 3% of the variance in Hindi word reading and made 69% of the contribution to the final regression model. Hindi syllable replacement emerged as the strongest unique predictor of Hindi word reading, explaining 6% of the variance and making 67% of the contribution to the final regression model (Table 3, Step 4).

In predicting Hindi pseudoword reading (see Table 4), the total variance explained by the model was 41%, with the PA measures explaining 20% of the variance over and above the covariates. Once again, both Hindi PA tasks were uniquely associated with Hindi pseudoword reading. Hindi rhyme oddity uniquely explained 5% of the variance and accounted for 78% of the total regression effect. Hindi syllable replacement uniquely explained 6% of the variance and accounted for 71% of the total regression effect (Table 4, Step 4).

Predicting English decoding with English PA

Next, we examined the contribution of the English PA tasks to English word and pseudoword reading (see Tables 5, 6 respectively). Once again, age was entered into the model first, followed by English semantic fluency and English RAN as covariates. The English phonological measures were then entered in alternating order to examine the contribution of each, over and above the other (Steps 3 and 4).

Table 5 Hierarchical regression and commonality analysis results predicting English word reading by English phonological awareness
Table 6 Hierarchical regression and commonality analysis results predicting English pseudoword reading by English phonological awareness

In predicting English word reading (see Table 5), the model explained 62% of the variance with the PA tasks explaining 14% of the variance over and above the covariates. When rhyme oddity was the only English PA task entered into the model, it was uniquely associated with English word reading (β = 0.25, t = 3.53, p < 0.001). However, once phoneme replacement was entered into the model, rhyme oddity was no longer uniquely associated with English word reading (β = 0.004, t = 0.05, p = 0.96). English phoneme replacement ultimately emerged as the strongest unique predictor of English word reading (β = 0.47, t = 6.03, p < 0.001). Commonality analysis indicated phoneme replacement uniquely explained 10% of the variance in word reading and accounted for more than 80% of the regression effect, whereas rhyme oddity has a near-zero unique contribution to the model (Table 5, Step 4).

In predicting English pseudoword reading (see Table 6), the model explained 46% of the variance, with the PA tasks explaining 18% of the variance over and above the covariates. Once again, rhyme oddity was uniquely associated with English pseudoword reading when it was the only PA task entered into the model (β = 0.29, t = 3.52, p < 0.01). However, once phoneme replacement was entered into the model, the unique contribution of the rhyme oddity task was no longer significant (β = 0.02, t = 0.25, p = 0.81). The English phoneme replacement task emerged as the strongest unique predictor of English pseudoword reading (β = 0.52, t = 5.55, p < 0.001). Commonality analysis indicated phoneme replacement uniquely explained 12% of the variance in pseudoword reading and accounted for 93% of the regression effect.

Predicting English decoding with Hindi PA

We then examined the contribution of Hindi (L1) PA to English (L2) word (Table 7) and pseudoword (Table 8) reading. Once again, hierarchical regression analysis was conducted with age, English semantic fluency, and English RAN entered first as covariates. In models 1 and 2, the Hindi phonological measures were entered to examine the contribution of L1 PA to L2 word reading. Both Hindi rhyme oddity and syllable replacement made a significant unique contribution to English word reading, with Hindi rhyme oddity emerging as the strongest predictor of English word reading in both models (β = 0.29, t = 4.08, p < 0.001). Commonality analysis indicated that Hindi rhyme uniquely explained 5% of the variance in English word reading and accounted for 63% of the total regression effect. Hindi syllable replacement uniquely explained 2% of the variance in English word reading and accounted for 49% of the total regression effect.

Table 7 Hierarchical regression and commonality analysis results predicting English word reading by Hindi and English phonological awareness
Table 8 Hierarchical regression and commonality analysis results predicting English pseudoword reading by Hindi and English phonological awareness

We then entered the Hindi PA measures after English phoneme awareness, the strongest within-language predictor of English decoding, to examine the contribution of Hindi PA over and above English PA (see Table 7, Models 3 and 4). Hindi rhyme oddity contributed a significant amount of unique variance to English word reading when it was the only Hindi PA task in the model (β = 0.17, t = 2.10, p < 0.05). Even after Hindi syllable awareness was added, the contribution of Hindi rhyme remained (β = 0.15, t = 2.00; p < 0.05). Hindi rhyme uniquely explained 1% of the variance in English word reading and accounted for 58% of the total regression effect. Hindi syllable replacement contributed near zero unique variance to English word reading.

In predicting English pseudoword reading (see Table 8), Hindi rhyme oddity contributed significant unique variance in both models 1 and 2 (β = 0.25, t = 2.88, p < 0.01). On the contrary Hindi syllable awareness added significant unique variance when entered before Hindi rhyme (see Table 8, Model 2) but not when entered after Hindi rhyme (see Table 8, Model 1). Commonality analysis indicated that Hindi rhyme uniquely explained 4% of the variance in English pseudoword reading and accounted for 63% of the total regression effect. Hindi syllable replacement, while only adding a small amount of unique variance (1%), accounted for 50% of the total regression effect. Neither Hindi rhyme nor Hindi syllable awareness added any significant unique variance to English pseudoword reading when English phoneme awareness was included in the model.

Predicting Hindi decoding with English PA

Finally, we examined the contribution of English (L2) PA to Hindi (L1) word (Table 9) and pseudoword (Table 10) reading. Hierarchical regression analysis was conducted, this time with age, Hindi semantic fluency, and Hindi RAN entered as covariates. Next, the English phonological measures were entered in alternating order to examine the contribution of L2 PA to L1 word reading (see Table 9, Models 1 and 2). English rhyme oddity added significant unique variance when entered before English phoneme replacement (Table 9, Model 1) but not when entered after English phoneme replacement (Table 9, Model 2). Commonality analysis indicated a near-zero unique contribution of English rhyme oddity. English phoneme replacement added significant unique variance in both models (β = 0.39, t = 4.08, p < 0.001), uniquely explaining 8% of the variance in Hindi reading and accounting for over 75% of the total regression effect.

Table 9 Hierarchical regression and commonality analysis results predicting Hindi word reading by Hindi and English phonological awareness

We then entered English phoneme replacement after the two Hindi PA measures to examine the contribution of English phoneme awareness over and above Hindi PA (see Table 9, Models 3, 4, and 5). English phoneme replacement added a significant amount of unique variance to Hindi word reading when entered after Hindi rhyme (Table 9, Model 3) and when entered after Hindi syllable replacement (Table 9, Model 4), but did not add any significant unique variance when added after both Hindi PA tasks (Table 9, Model 5). Commonality analysis indicated that English PA uniquely explains 1% of the variance in Hindi word reading.

In predicting Hindi pseudoword reading (see Table 10), once again English phoneme awareness contributed significant unique variance in both models 1 and 2 (β = 0.38, t = 4.12, p < 0.001). On the contrary, English rhyme oddity only added significant unique variance when added before English phoneme awareness (Table 10, Model 1). Commonality analysis indicated that English phoneme replacement uniquely explained 8% of the variance in Hindi pseudoword reading and accounted for 84% of the regression effect. English rhyme oddity also added a small amount of unique variance to the model.

Table 10 Hierarchical regression and commonality analysis results predicting Hindi pseudoword reading by Hindi and English phonological awareness

We then entered English phoneme replacement after the two Hindi PA measures to examine the contribution of English phoneme awareness over and above Hindi PA (see Table 10, Models 3, 4, and 5). English phoneme replacement added a significant amount of unique variance to Hindi pseudoword reading when entered after Hindi rhyme (Table 10, Model 3) and when entered after Hindi syllable replacement (Table 10, Model 4), but did not add any significant unique variance when added after both Hindi PA tasks (Table 10, Model 5). Commonality analysis indicated that English PA uniquely explained 3% of the variance in Hindi pseudoword reading.

Discussion

In this study we aimed to examine cross-language transfer of PA among emergent readers in India. Children in Grades 1 and 2 were simultaneously exposed to Hindi and English literacy instruction in school and were expected to learn to read in two distinct writing systems. Previous studies on cross-language transfer in India which have focused on children in Grade 4 and above have found evidence of L1 phonological transfer to L2 reading (Reddy & Koda, 2013), as well as vice versa (Mishra & Stainthorp, 2007). The results of our study extend these findings to include emergent readers in Grades 1 and 2.

In both Hindi and English, the PA measures of a language were predictive of decoding in that language, highlighting the importance of PA in reading across languages (e.g., Goswami, 2008). Both Hindi PA tasks added a significant amount of unique variance to Hindi word and pseudoword reading, a finding in line with previous studies which have indicated a critical role of PA, particularly large-unit awareness in akshara decoding (Nag, 2007; Nag & Snowling, 2012; Nakamura et al., 2018; Singh & Sumathi, 2019). As previously mentioned, although information is encoded at both the syllable and phoneme-levels in akshara orthographies, it is the syllable that is the salient unit. Early literacy instruction in the akshara languages also tends to focus on syllable-level mappings (Nag, 2007, 2011). As a result, it is perhaps unsurprising that the Hindi syllable replacement task emerged as the strongest predictor of Hindi word reading. Pseudoword reading in Hindi was highly correlated with Hindi word reading, reflective of the orthographic transparency present in akshara languages (Singh & Sumathi, 2019). Consequently, both Hindi phonological tasks were also significant unique predictors of pseudoword reading in Hindi.

When predicting English decoding, English phoneme-level awareness emerged as the single strongest predictor of English decoding. English rhyme oddity, on the other hand, only significantly contributed to English decoding when it was the only PA measure in the model but added no additional variance after controlling for the effect of phonemic awareness. This finding is in line with previous research examining Kannada–English biliteracy among older children (Reddy & Koda, 2013). Ample studies have identified phonemic awareness as one of the most important predictors of reading in alphabetic languages such as English (e.g., Melby-Lervåg et al., 2012), and many studies have shown the benefits of phonemic awareness instruction in helping children read in English (Ehri et al., 2001). The findings of our study show that even for young English language learners with an akshara-based L1, English phoneme awareness is critical for English decoding. Thus, the role of English phoneme awareness is one that cannot, and should not, be ignored in English-medium classrooms in India.

Both Hindi PA measures significantly predicted English decoding when English phoneme awareness was not included in the model, a finding once again in line with previous work examining older students in India (e.g., Reddy & Koda, 2013). When English phoneme awareness was entered into the model, Hindi rhyme oddity continued to add a small but significant amount of unique variance to English word reading. These findings extend previous studies on akshara-English biliteracy and are in line with studies in other language pairs (e.g. Chinese–English; Chow et al., 2005) to show that even in these emergent readers, PA in the L1 transfers to L2 reading. Furthermore, although English decoding is heavily reliant on awareness of the smallest unit, or that of the phoneme, syllable-level awareness is also important given that English is a highly irregular orthography (Ziegler & Goswami, 2005). Syllable awareness has long been known to be one of the earliest available phonological units, emerging even before children begin formal literacy instruction (Treiman & Zukowski, 1991), and is seen as a universal in language processing (Ziegler & Goswami, 2005). Transfer of these non-language specific aspects of PA is also in line with the Transfer Facilitation Model (Koda, 2008).

Although only the Hindi rhyme task was found to add additional variance, it is important to note that, due to the orthographic characteristics of Hindi, rhyme awareness and syllable awareness are essentially testing awareness at the same grain size, that of the syllable (see Cherodath et al., 2017). This is contrary to the English rhyme oddity task which requires onset-rime sensitivity which is considered to be a finer level of sensitivity (Goswami & Bryant, 1990; Treiman & Zukowski, 1991). However, while both Hindi tasks are testing awareness at the level of the syllable, the rhyme task is an identification task whereas the syllable replacement task is a manipulation task. Identification tasks have been recognized as being easier than manipulation tasks (Pufpaff, 2009), a finding that also holds true in this sample where children performed significantly better on the rhyme identification tasks than the manipulation tasks. The contribution of Hindi rhyme, and not syllable replacement, to English word reading may be reflective of differences in task difficulty rather than differences in the type of PA that is being transferred. However, further studies in which the distinction between Hindi syllable and Hindi sub-syllabic units is established are needed to better understand the role of Hindi syllable versus Hindi rhyme awareness and their differential contributions to English decoding.

We also found that phoneme awareness in English (L2) added a significant amount of variance to L1 decoding when L1 PA was not included in the model. However, when L1 PA was added in, there was no longer evidence of transfer from L2 PA to L1 decoding. However, commonality analysis indicated this may be due to multicollinearity such that once both L2 and L1 PA are in the model, they both lose predictive power. This finding is in line with previous studies which have identified PA as a language general skill (e.g. Cisero & Royer, 1995; Comeau et al., 1999; Kim, 2009). Mishra and Stainthorp (2007) found no evidence of L2 to L1 transfer in children who were going to English-medium school. Interestingly however, evidence of L2 to L1 transfer was found among children who were attending Oriya-medium school. Chung et al. (2013) who examined Chinese–English bilinguals similarly found a lack of L2–L1 transfer. They concluded that a lack of transfer may be due to limited L2 proficiency and that perhaps as children’s L2 proficiency increases, there may be stronger bidirectional transfer. Studies which have found evidence of L2–L1 transfer have been in line with this conclusion (e.g. LaFrance & Gottardo, 2005). Future studies are needed to untangle the effects of language proficiency as well as language of instruction in multilingual environments like India.

Limitations

There are some limitations to the present study which should be considered when interpreting the findings. First, although we attempted to control for oral language skill through the semantic fluency measure, children’s oral vocabulary was not explicitly assessed. Oral vocabulary is known to play a critical role in children’s PA development and word reading skill (e.g., Metsala & Walley, 1998), a finding which also holds true for reading in Hindi (Singh & Sumathi, 2019). In future studies, it is important to assess oral vocabulary in both languages. In addition, using equivalent measures which assess PA at multiple grain sizes (i.e. syllable, onset-rime, and phoneme-levels) in both languages will help provide more specific information on the role of cross-language transfer. Second, the cross-sectional nature of this study prevents us from understanding how associations may change as children develop over time. There is only one longitudinal study, to our knowledge, which has investigated akshara-English biliteracy development in an Indian sample (see Nakamura et al., 2014). However, the youngest group of children in this study were in Grade 3. Thus, future studies are needed in which children’s literacy development is followed starting at school entry. Third, this study only reports correlational data. Future studies which employ randomized controlled trials to study the effects of PA and reading instruction can provide valuable insights on how to improve reading development. Finally, it is important to acknowledge the characteristics of the sample when interpreting the generalizability of the results. India is a highly diverse country with an education system that differs greatly from school to school and state to state. Future research should examine cross-language associations in a wide variety of age groups, language pairs, and school types across the country to build a stronger evidence base which is applicable to the Indian population.

Implications

Despite these limitations, this study does provide meaningful insight into the cross-language relationships of young Hindi–English biliterates, a population that has been greatly understudied. While we would encourage researchers to examine this population in greater detail there are important implications, both theoretical and practical, that have emerged from this study. Theoretically, we have shown that L1 and L2 PA transfers to L1 and L2 decoding even among emergent akshara-English bilinguals. This study expands upon previous research on cross-language transfer among akshara-English bilinguals which has focused on children in Grade 4 and above, once children have well developed literacy skills. Through the examination of a younger set of students (Grade 1 and 2) who were simultaneously learning to read in both their L1 and L2 we were able to highlight the importance of PA for decoding both within and across languages. These findings have important practical implications for effective literacy instruction in India.

We see that the role that English phonemic awareness plays in children’s English decoding skills, even in this population, is one that cannot be ignored. In line with previous work, it is clear that explicit and systematic instruction that supports the development of English phonemic awareness can successfully aid English reading development (e.g., Dixon et al., 2011). We would encourage teachers and practitioners to better leverage children’s L1 literacy skills in helping them develop their L2 literacy skills. As mentioned by Reddy and Koda (2013), this could involve explicitly highlighting the sub-lexical components which are used in both languages. In a study conducted with 10-year-old Kannada-speaking children, researchers found that when an English phonics intervention was modified to include the Kannada symbols which represent the English letters, children performed significantly better on English reading as compared to children who received the English-only intervention (Nishanimut et al., 2013). These findings are promising, although future studies are needed in which such interventions are tested with emergent readers.

We would like to conclude by acknowledging those who have provided sound arguments against the introduction of English prior to the sufficient development of children’s literacy skills in their L1 (Nakamura et al., 2019). Studies have shown the role of transfer to be much more significant once children achieve a “threshold level” of literacy in their L1, and thus, researchers are encouraging practitioners and policy makers alike to not introduce English decoding instruction until children have reached the threshold level. Paying heed to these findings, India’s New National Education policy has recommended that the medium of instruction until at least Grade 5, but preferably until Grade 8, be the mother tongue or the local language (Government of India Ministry of Education, 2020). However, this remains a recommendation leaving state governments free to decide when and where it will be implemented (Vishnoi, 2020). In India, English-medium instruction starting at school entry is clearly there to stay. Therefore, it is critical that researchers continue to examine literacy development among these populations.