Introduction

There is increasing interest in the relationship between prosodic sensitivity (PS) and reading. Empirical evidence for the influence of PS on reading comes from studies comparing persons with and without dyslexia (Cuetos, Martínez-García, & Suárez-Coalla, 2018; Goswami, Gerson, & Astruc, 2010; Goswami et al., 2013; Leong, Hämäläinen, Soltész, & Goswami, 2011) as well as correlational studies (Arciuli, 2017; Clin, Wade-Woolley, & Heggie, 2009; Defior, Gutiérrez-Palma, & Cano-Marín, 2012; Goswami et al., 2010; Holliman et al., 2017; Holliman, Wood, & Sheehy, 2008, 2010a, 2010b; Jarmulowicz, Taran, & Hay, 2007). However, some studies report an effect only for a subsample of a certain age (Gutiérrez-Palma, Defior, Jiménez-Fernández, Serrano, & González-Trujillo, 2016; Lin, Wang, Newman, & Li, 2018) or a certain type of reading outcome (Wade-Woolley, 2016; Whalley & Hansen, 2006). Furthermore, several correlational studies could not confirm the influence of PS on reading (Deacon, Holliman, Dobson, & Harrison, 2018; Kim & Petscher, 2016; Schmidt, Breuer-Küppers, Göntgen, & Schabmann, 2016). As the results of correlational studies are inconsistent, the present study aims to examine the PS–reading relationship with a novel approach. Alongside existing correlational studies and examinations of an extreme group (dyslexics), a third way to examine the PS–reading relationship involves another extreme group, namely subjects with potentially excellent prosodic skills. As PS is affected by music-related, non-linguistic auditory processing skills (Goswami et al., 2010, 2013; Richards & Goswami, 2015), comparing a sample of musicians to non-musicians is a good way to further clarify the effects of PS on reading. In this paper, we compare PS and reading in a sample of adult musicians and non-musicians and try to clarify the possibility of a reciprocal influence of PS and reading.

The effect of prosodic sensitivity on reading

PS refers to sensitivity to the prosodic components of speech. According to Harrison and Wood (2016), three different components of prosody must be distinguished: variation in tone amplitude (stress; i.e., variations in loudness between syllables and words), variation in tone frequency (intonation; rising and falling pitch in speech), and the timing of words and syllables (“relative duration of syllables and vowel length”; Harrison & Wood, 2016, p. 91).

Efficient reading relies on a system enabling quick and stable access to words (Brown & Loosemore, 1994; Jackson & Coltheart, 2001). Although the exact mechanism is still not clear, it is conceivable that prosodic information might (combined with other information) serve as an additional hint for the reader when retrieving words. In the literature, several possible causal chains are discussed:

  1. 1.

    PS facilitates phoneme identification: This explanation refers to the fact that it is easier to identify phonemes (particularly vowels) in stressed than in unstressed syllables, which often consist of vocals with reduced quality, i.e., schwa (Chiat, 1983; cf. Holliman, Wood, & Sheehy, 2012; Wood, Wade-Woolley, & Holliman, 2009).

  2. 2.

    PS facilitates onset-rime boundary detection: Amplitude peak in spoken language is connected to the position and quality of the vowel sound in the word (Goswami, 2003; Scott, 1998). Therefore, persons with better sensitivity for amplitude changes in spoken language might find it easier to recognize onset-rime boundaries, which is crucial for reading (Goswami et al., 2002; cf. Holliman et al., 2012; Wood et al., 2009).

  3. 3.

    PS facilitates the development of vocabulary: PS might facilitate the identification of word boundaries. This might particularly be the case for stress-timed languages, in which most stress is on the first syllable (Cutler & Norris, 1988). This in turn affects both rhyme sensitivity and morphological awareness, which facilitates word reading (cf. Holliman et al., 2012, 2014; Holliman, 2016).

  4. 4.

    PS supports morphological clarity: PS might avoid conflicts in retrieving the mental lexicon as it gives additional information about the meaning of the word. Poor PS might hamper the differentiation of compound words, e.g., HIGHchair versus high CHAIR, in German FÜRsorge (welfare) versus für SORge (for concern; cf. Holliman, 2016; Kitzen, 2001, as cited in Whalley & Hansen, 2006; Wood et al., 2009). Furthermore, homophones can only be identified correctly via the correct assignment of stress, e.g., in German MOdern (molder) versus moDERN (modern), Übersetzen (cross over) versus überSETzen (translate). In these cases, problems in assigning stress correctly might lead to a conflict in the stored lexical representation of morphemes and thus slow reading and enhance reading errors (Sauter, Heller, & Landerl, 2012).

Empirical evidence

Empirical evidence for the influence of PS on reading comes from studies comparing persons with and without dyslexia as well as from correlational studies.

Group comparisons

Several studies have found limited prosodic skills in subjects with dyslexia (Cuetos et al., 2018; Goswami et al., 2010, 2013; Leong et al., 2011). For instance, Leong et al. (2011) tested the perception of syllable stress in English-speaking adults with dyslexia. The task was to decide whether two four-syllable words had the same stress. The results showed that adults with dyslexia performed significantly worse than adults without dyslexia. Goswami et al. (2010) analysed the PS of 56 children between 8 and 15 years old with and without dyslexia. They used an adapted version of the DEEdee task, originally developed by Kitzen (2001) to measure PS on the supra-word level. In this task, children saw a picture of a familiar celebrity or a famous movie or book character. They then heard two sequences consisting of stressed and unstressed dee-syllables with the same number of total syllables. The children had to decide which DEEdee sequence matched the target phrase in terms of stress pattern, e.g., SUperMAN corresponds to DEEdeeDEE. The performance of the children with dyslexia was significantly lower than the performance of age-matched children without dyslexia. Similar results in the DEEdee task were reported for 9-year-old children with dyslexia when compared to non-dyslexic children with the same reading level (Goswami et al., 2013). Moreover, studies examining more transparent orthographies have come to similar findings. For example, Cuetos et al. (2018) compared 32 11-year-old Spanish children with dyslexia with a group of age-matched and a group of reading-matched children. Children with dyslexia exhibited lower abilities in phonological awareness (PA), rapid automatized naming and PS than age-matched children. They also showed lower abilities than the reading-matched children in one each of the PA and PS subtests.

Correlational studies

Correlational studies in languages of different orthographic depths have compared the unique impact of PS when controlling for PA. Wade-Woolley and Heggie (2016) synthesized the findings of several studies concerning the contributions of PS and PA to word and nonword reading. In five out of seven studies, PS uniquely predicted word reading above and beyond PA in regression models (Defior et al., 2012; Goswami et al., 2010; Holliman et al., 2010a, 2010b; Whalley & Hansen, 2006). With respect to nonword reading, one out of three studies found a significant outcome for PS after controlling for PA (Jarmulowicz et al., 2007). Two studies explored the effect of PS above and beyond PA on reading composite scores, both of which reported PS to be a unique predictor (Clin et al., 2009; Holliman et al., 2008). More recent studies have delivered a similar picture. In five out of eight studies, PS uniquely predicted word reading after controlling for PA (Arciuli, 2017; Gutiérrez-Palma et al., 2016 (only for children in 3rd–5th grades); Holliman et al., 2017; Lin et al., 2018 (only for 6-year-old children); Wade-Woolley, 2016). However, Kim and Petscher (2016) could not confirm these results. In their study, PS contributed to word reading only via PA and morphological awareness. Likewise, Deacon et al. (2018) found no significant unique effects of PS on word reading. In a German study, PS had no unique effect on word and nonword reading over and above the effect of PA (Schmidt et al., 2016). Wade-Woolley (2016) also analysed nonword reading, once again finding no significant effect of PS after controlling for PA. Moreover, for the time being, the differences in the results cannot be conclusively attributed to the participants’ age (at least among children) or the PS measures used (cf. Wade-Woolley & Heggie, 2016). To exclude a possible effect of developmental aspects (e.g., different effects of PS on reading at different stages of children’s development), the present study examines adult participants. While previous studies have applied different PS measures, the present study uses a PS measure on the sentence level, which has rarely been examined in German-speaking countries (Schmidt et al., 2016).

The case of musicians

An additional approach to testing the influence of PS on reading is to examine persons with extraordinary prosodic skills, i.e., musicians (e.g., Goswami, 2011). Specifically, one can test whether a particularly pronounced PS goes hand in hand with good reading (in contrast to comparisons of poor and average readers, which at best tells us whether there is some kind of PS deficit in dyslexic children and adults). This point is also important because it sheds light on the possible benefits of prosodic and/or musical training to prevent reading difficulties.

Although directly measured information about the PS of musicians in the German-speaking world is not available to our best knowledge (see below), it is conceivable to assume good PS because of their strong non-linguistic basic auditory processing (BAP) skills like pitch sensitivity and rhythm discrimination (e.g., Banai & Ahissar, 2013; Deguchi et al., 2012; Micheyl, Delhommeau, Perrot, & Oxenham, 2006; Tsang & Conrad, 2011). In fact, musicians have been shown to outperform non-musicians in BAP tasks on a behavioural, electrophysiological and psychoacoustic basis (Banai & Ahissar, 2013; Deguchi et al., 2012; Kraus & Chandrasekaran, 2010; Magne, Schön, & Besson, 2006; Micheyl et al., 2006; Nikjeh, Lister, & Frisch, 2009; Tsang & Conrad, 2011). Even dyslexic musicians (dyslexics generally seem to have poorer BAP skills) show better performance in rise time (which reflects the “sharpness” of a rising tone), frequency, intensity and rhythm tasks than dyslexic non-musicians and equivalent BAP skills to musicians without dyslexia (Bishop-Liebler, Welch, Huss, Thomson, & Goswami, 2014). The study by Weiss, Granot, and Ahissar (2014) also revealed that dyslexic musicians showed comparable frequency and temporal-interval discrimination skills to musicians without dyslexia, in contrast to dyslexic non-musicians. Furthermore, BAP skills such as intensity, duration and frequency discrimination and amplitude rise time perception might be possible precursors of PS (Goswami et al., 2010, 2013; Richards & Goswami, 2015).

Generally, BAP skills are relevant for reading. Regression analyses have shown that the perception of note duration changes in repeating rhythms predicts text reading accuracy and word reading speed (Flaugnacco et al., 2014). Furthermore, dyslexic children exhibit low performance on BAP tasks, such as tapping in time with a metronome (Thomson & Goswami, 2008), rise time perception, pitch perception, and rhythm discrimination (Goswami, Huss, Mead, Fosker, & Verney, 2013; Huss, Verney, Fosker, Mead, & Goswami, 2011), as well as loudness perception (Huss et al., 2011). Thomson, Fryer, Maltby and Goswami (2006) reported similar results for adults with dyslexia. Likewise, Leong et al. (2011) also showed poorer rise time perception and pitch perception in adults with dyslexia.

However, studies comparing musicians and non-musicians in terms of reading have uncovered contradictory results. Moreno et al. (2009) assigned 32 8-year-old Portuguese non-musicians to either a musical training group or a painting training group. After 6 months, the music group but not the painting group exhibited significant improvements in word decoding and small pitch perception in speech. Similar results were found by Habib et al. (2016). They analysed the effect of a 6-week cognitive-musical training (CMT) on French dyslexic children. The results showed a significant improvement in word decoding after the CMT period, no difference after an untrained period before the CMT, and no significant decrease after an untrained period after the CMT. In addition, Bishop-Liebler et al. (2014) reported significantly higher word and nonword reading skills among dyslexic musicians compared to dyslexic non-musicians. Weiss et al. (2014) also found significantly higher pseudoword reading skills in dyslexic musicians than in dyslexic non-musicians. Furthermore, regression analyses have shown that music perception ability explains unique variance in early reading ability in 4- and 5-year old pre-schoolers (Anvari, Trainor, Woodside, & Levy, 2002). In contrast, Corrigall and Trainor (2011) reported that the length of musical training among 6–9-year-old normal readers predicted reading comprehension, whereas there was no association with word decoding. No significant improvements in word or nonword decoding (but significant improvements in PA) after music instruction were found by Overy (2003) with dyslexic children (mean age 8.8 years) and Gromko (2005) with kindergarten children. These contradictory results might be explained by the type of music training, the length of training and the participants’ age and reading level (Corrigall & Trainor, 2011). Nevertheless, in her metaanalysis, Standley (2008) found a small effect size of d = .32 for the benefits of music instruction on reading skills. In more recent metaanalyses, Sala and Gobet (2017) reported no significant effect of music training on literacy skills (combining reading and writing), but Cooper (2019) found a small to medium effect for musical training in schoolchildren on verbal and non-verbal cognitive skills. However, when examining only studies with high methological quality, i.e., studies that took place in a lab setting, the effects become non-significant. The author concluded that “music training was ultimately a beneficial intervention but not conclusively better than other types of trainings” (Cooper, 2019, p. 14).

Research questions

Existing studies that directly emphasize the importance of musical skills for reading are limited in several respects (Anvari et al., 2002; Bishop-Liebler et al., 2014; Habib et al., 2016; Moreno et al., 2009). Corrigall and Trainor (2011) pointed out that in numerous studies, music training was close to (or combined with) reading interventions, e.g., such as teaching reading skills using song texts (e.g., Dominguez, 1991). Standley (2008) showed that the effect of a music intervention on reading was stronger for programs that comprised direct reading interventions than for those which did not. This close connection between musical training and reading activities might impede the identification of which part of the intervention had an effect on reading: the musical training or the reading part of the training.

Furthermore, to the best of our knowledge, no previous studies have compared musicians and non-musicians on more language-related PS tasks in German. German is a shallow orthography in which grapheme-phoneme correspondences are quite transparent (e.g., Seymour, Aro, & Erskine, 2003). Due to the easiness of grapheme-phoneme mapping, it has been argued that PA might play a minor role in reading acquisition (Landerl et al., 2019; Landerl & Wimmer, 2008; Schabmann, Schmidt, Klicpera, Gasteiger-Klicpera, & Klingebiel, 2009; Verhagen, Aarnoutse, & van Leeuwe, 2008). As PS might be a predictor for PA (or at least for certain components of PA) and possibly only contributes to reading via PA (Kim & Petscher, 2016; Schmidt et al., 2016; Wade-Woolley, 2016), PS might be of only minor influence in a shallow orthography like German. Therefore, we measured PA as a control variable in our study to see if a possible effect of PS on reading remains after controlling for PA.

In this study, we expand knowledge about the effect of PS on reading by comparing a sample of musicians to a sample of non-musicians. As stated by Wade-Woolley and Heggie (2016) “the acoustic correlates of stress are duration, amplitude and frequency” (p. 6). To analyse sensitivity to stress in spoken language, the present study uses a language-related PS measure involving real sentences and nonword sentences in German played orally. Participants had to extract the stress pattern of the sentence by perceiving syllables of different stress size (e.g., different frequency, amplitude and duration). Then, they had to select a written sentence with the same stress pattern (a detailed description of the tasks is provided below). From a theoretical perspective, a high sensitivity to these prosodic components (which we expect in the musician group) should enhance the representation of words’ stress assignments in the mental lexicon, which should in turn enhance reading fluency, as prosodic information serves as an additional hint for the reader when retrieving words from the mental lexicon. Furthermore, subjects with high PS skills might find it easier to identify vowels (linked to amplitude peaks) in words (Goswami, 2003). Likewise, recognition of onset-rime boundaries in words and even non-words might be improved. This could facilitate the identification of phonological similarities to stored words and syllables and therefore enhance reading fluency. Hence, we expect higher reading fluency in the musician group.

We have two research questions:

  1. 1.

    Do musicians and non-musicians differ regarding PS and reading? We expect a difference in the two groups’ performance on both PS and reading. As musicians show higher (pre-existing) BAP skills (e.g., Bishop-Liebler et al., 2014; Deguchi et al., 2012; Schellenberg, 2020) than non-musicians and BAP skills might be possible precursors of PS (e.g., Richards & Goswami, 2015), we expect the musicians to be more sensitive in perceiving prosodic information than the non-musicians.

  2. 2.

    Can these possible differences in PS and reading performance be attributed to a reciprocal effect of reading on PS, in the sense that long experience with fluent reading improves PS? If this is the case, we assume that the differences in PS would disappear when comparing musician and non-musician subgroups with matched reading levels.

Method

Participants

The sample comprised 60 native German-speaking university students, 30 (50% males) who played a classical instrument (musicians) and 30 (50% males) who did not play an instrument (non-musicians). Twenty-three of the musicians were recruited from the University of Cologne Symphony Orchestra, the others from different orchestras or ensembles. According to the participants’ self-reported testimonies, all of the musicians showed especially strong musical abilities that were already apparent in early childhood. 23 had already passed the audition for the university orchestra at the time of the study. The musicians’ ages ranged from 20 to 32 years (M = 24.83, SD = 3.36), the non-musicians’ ages from 18 to 31 years (M = 23.00, SD = 2.64). This difference was significant, t(58) = 2.35, p < .05. The participants had no known diagnosis of any kind of psychological problems or learning disability. All participants fell within the boundaries of “normal readers” according to the norms of the Salzburger Lese- und Rechtschreibtest II (Salzburg Reading and Spelling Test II, SLRT-II; Moll & Landerl, 2010; see below for a description), with the exception of four persons in the non-musician group: one person for the word reading test, two persons for the nonword reading test, and one person for both reading tests. These persons would be classified as poor readers.

Measures

Prosodic sensitivity

According to Beyermann (2013), most German disyllabic nouns are stressed on the first syllable. In the CELEX German lemma corpus, 86% of the disyllabic nouns are trochaic and only 14% have final stress. Furthermore, German has certain rules that determine whether a syllable is stressed or unstressed. With respect to prefixes, detachable verbal particles, e.g., auf- in AUFdrehen (to turn on) are always stressed, whereas nondetachable verbal particles, e.g., ver- in verKAUfen (to sell) are unstressed (Féry, 1996). In addition, the compound stress rule (primary stress is on the first constituent element, e.g., STRAßenbahn; tram; Féry, 1996) leads to an accurate pattern in 99.5% of the disyllabic German noun compounds in CELEX (Beyermann, 2013). These regularities in the German language mean that measuring of PS on the word level would be too simple for our sample.

Since German is a stress-timed language, there is higher variability in the syllable structure (Sauter et al., 2012). Some syllables in a sentence are longer and louder in almost identical intervals, which are filled in with a variable number of unstressed syllables. Consequently, measuring PS on the sentence level requires perceiving not only stress syllables but also the entire rhythm of a sentence (Schmidt et al., 2016). Therefore, we measured PS on the sentence level rather than on the word level.

PS was measured with three tasks: The first task (piano task) was taken from Sauter et al. (2012). Participants had to match written sentences to a given sequence of piano tones of constant frequency but different length and loudness. The piano sequence was played three times from a PC, then the participants had to identify the prosodically compatible sentence out of three possible written solutions. For instance, the sentence Ich RENne und HÜPfe (I run and jump) would fit the sequence given in Fig. 1. Twelve piano sequences were presented (Cronbach’s alpha = .68).

Fig. 1
figure 1

The Sauter et al. (2012) piano task for the sentence Ich renne und hüpfe (I run and jump)

The second task was similar, but the participants had to match a real sentence to a sentence consisting of nonwords. The nonword sentence was played orally three times from a PC, then the participants had to identify the prosodically equivalent sentence out of three possible written solutions, e.g., MAlako TRAmelo matches FAHren wir ACHterbahn (Let’s ride a roller coaster). Twelve nonword sentences were presented (Cronbach’s alpha = .87).

In the third task, participants had to match a heard real sentence to another prosodically equivalent real written sentence, e.g., Ich RENne und HÜPfe (I run and jump) would match Der APfel hat WÜRmer (The apple has worms). The procedure was the same as in the second task. Twelve real sentences were presented (Cronbach’s alpha = .77).

The orally presented stress pattern of the piano sequence and (nonword) sentence, respectively, had to be compared to all three written sentences to identify which could be stressed with the same pattern. For each task, the target sentences and distractors had the same number of syllables. Therefore, it was not possible to identify the solution simply by counting the syllables. In the distractor sentences, at least one unstressed syllable fell on a stressed syllable from the presented pattern, therefore allowing the distractor sentence to be excluded. Examples are provided in “Appendix A”. All three tasks loaded on one factor, with Cronbach’s alpha = .76 (for the total scale; Schmidt et al., 2016).

The orally played nonword sentences (second task) and real sentences (third task) did not differ in number of syllables, sum syllable frequency, or sum biphoneme frequency, which indicates the total number of words that contain the occurred biphonemes. However, there was a difference in the number of phonemes, t(19.37) = 3.22, p < .01, and number of biphonemes, t(22) = 2.89, p < .01. This calculation is based on the sublexical measures by Hofmann, Stenneken, Conrad, and Jacobs (2007), which were derived from the German phonological word forms in CELEX. The stimuli list is shown in “Appendix B”.

Reading

Reading was measured with the reading section of the Salzburger Lese- und Rechtschreibtest II (Salzburg Reading and Spelling Test II, SLRT-II) by Moll and Landerl (2010). One list of words and one list of nonwords had to be read aloud as quickly and correctly as possible within one minute. The score was the number of correctly read words, i.e., a composite score of reading speed and accuracy. Retest reliability for this test ranges from .90 to .98 (Moll & Landerl, 2010).

Phonological awareness

PA was measured as a control variable to see if a possible effect of PS on reading still exists after controlling for PA. We used the following tasks from the German translation (Schmidt, Schabmann, & Schiller, 2014) of the Prueba de Conciencia Fonológica (Test of Phonological Awareness) by Jiménez (1995). Segmentation: Participants had to name the phonemes of an orally given nonword (e.g., pok = /p/, /o/, /k/). Deletion: Participants had to pronounce an orally presented nonword by leaving out a target phoneme (e.g., [k]lame = lame). Blending: The phonemes of a nonword were orally presented and the participants had to pronounce the nonword (e.g., /d/, /u/, /l/ = dul). Each subtest consisted of ten items.

All tests were administered in a quiet room at the University of Cologne. All baseline items for the PS measures were presented over the PC loudspeakers. Each subject participated in an individual session, which was conducted by one of two of the authors.

Data analysis

Data analysis was undertaken using SPSS ®, version 24. As participants’ age was correlated with reading and the piano task (Table 1), we controlled for this variable when comparing reading and PS for musicians and non-musicians.

Table 1 Correlations of age, prosodic sensitivity (PS), reading (READ), and phonological awareness (PA) measures

To answer our first research question, we chose a sample of musicians who play at least one instrument on a high level. These persons should have good non-linguistic BAP skills (e.g., Banai & Ahissar, 2013; Deguchi et al., 2012). We assumed that these can hardly be attributed to a reciprocal effect of reading on PS. However, to empirically prove this last point, we additionally conducted a subgroup analysis (second research question). If reading has no (reciprocal) influence on PS, possible differences in PS should remain when comparing groups of musicians and non-musicians with the same reading level. Otherwise, they should vanish. Conversely, PS-matched groups of musicians and non-musicians should exhibit no difference in reading skills. To check whether possible differences in PS remained when reading was held constant, we matched participants according to their reading level using the following procedure: We matched participants whose reading sum scores differed by no more than one point. If more than one match was possible, we chose the persons who were best matched according to age and gender. Twenty persons were matched. These ten musicians did not differ from the ten non-musicians in terms of reading, sex or age (Table 2). The same procedure was followed to check whether significant reading differences could be found while holding PS constant. Here, 24 persons could be matched. The 12 musicians did not differ from the 12 non-musicians in terms of PS, sex or age (Table 2).

Table 2 Descriptive results of the matching criteria in the subsamples

Results

Correlation of measures

Age was correlated with all reading measures and the piano task. Sentence and nonword sentence PS tasks, PS sum score, segmentation PA task, and PA sum score were significantly correlated with all reading measures (Table 1). In particular, a significant correlation between PS sum score and reading sum score as well as PA sum score and reading sum score were uncovered (linear regression equation: − 169 + 2.21 × PS + 10.78 × PA = reading). Hierarchical regression analysis showed a unique effect of PS on reading after controlling for age and PA (Table 3). No significant correlations with reading were found for the piano task and the deletion and blending PA tasks, except for a correlation between blending and the nonword reading task (Table 1). The piano task was also insignificantly correlated with the nonword sentence PS task. There were no significant correlations between segmentation and deletion, segmentation and blending, and deletion and blending. No significant correlations with PS were found for any of the PA tasks.

Table 3 Hierarchical regression analysis predicting reading (N = 60)

Differences between musicians and non-musicians

Full sample

On average, musicians performed better on all PS tasks, reading tasks, and PA tasks than non-musicians (Table 4). To test for differences between musicians and non-musicians, we conducted MANOVAS for PS and reading scores with age as a covariate, and for PA scores without a covariate. Multivariate tests revealed that musicians outperformed non-musicians on the PS tasks, F(3, 55) = 12.25, p < .001, ηp2 = .40, the reading tasks, F(2, 56) = 5.19, p < .01, ηp2 = .16, and the PA tasks, F(3, 56) = 3.96, p < .05, ηp2 = .18. Univariate tests showed significant differences for all PS, reading, and PA measures except for the piano task and the deletion PA task (Table 4). When omitting poor readers from the dataset (leading to a sample of non-musicians: n = 26; musicians: n = 30), the same pattern of results emerged (except for the segmentation PA task), piano task: F(1, 53) = 1.70, p = .20, ηp2 = .03; sentence task: F(1, 53) = 22.78, p < .001, ηp2 = .30; nonword sentence task: F(1, 53) = 29.20, p < .001, ηp2 = .36; PS sum score: F(1, 53) = 28.56, p < .001, ηp2 = .35; word reading: F(1, 53) = 4.40, p < .05, ηp2 = .08; nonword reading: F(1, 53) = 6.15, p < .05, ηp2 = .10; reading sum score: F(1, 53) = 5.71, p < .05, ηp2 = .10; segmentation: F(1, 54) = 2.20, p = .14, ηp2 = .04; deletion: F(1, 54) = 3.06, p = .09, ηp2 = .05; blending: F(1, 54) = 6.33, p < .05, ηp2 = .11; PA sum score: F(1, 54) = 9.93, p < .01, ηp2 = .16.

Table 4 Means, standard deviations, minimums, and maximums of prosodic sensitivity (PS), reading, and phonological awareness (PA) measures for musicians and non-musicians

Reading level-matched sample

The effect was less strong in this sample. When all three PS tasks were included in the equations, the multivariate test only showed a strong tendency, F(3, 15) = 5.19, p = .05, ηp2 = .39. However, the univariate differences between musicians and non-musicians remained significant for the sentence and nonword sentence tasks as well as the sum score (Table 4).

PS level-matched sample

All differences in reading which were found for the total sample did not remain significant among persons with the same level of PS (Table 4).

Discussion

As previous correlational studies report inconsistent results regarding the relation between PS and reading, the present study aims to further clarify this relationship by examining subjects with excellent PS skills instead of subjects with poor reading skills (dyslexics), as has been done in previous studies (see above). As the existing literature indicates that musicians exhibit higher (pre-existing) BAP skills (e.g., Deguchi et al., 2012; Schellenberg, 2020) than non-musicians, and BAP skills could be possible precursors to PS (Goswami et al., 2010, Goswami, Mead et al., 2013; Richards & Goswami, 2015), we expected the musicians to be more sensitive in perceiving prosodic information than the non-musicians. Hence, the goal of this study was to test whether the German PS of adult musicians is higher than among people with no musical experience, and if this is the case, whether this is associated with reading performance. Furthermore, we tried to find evidence for whether this potential effect could be attributed to a reciprocal effect of reading on PS. As most previous studies focused on English-speaking participants, the present study aims to further clarify the PS–reading relationship in a more transparent orthography.

Our results showed that, in line with our hypothesis, musicians achieved better results in reading words and nonwords as well as two of three prosodic tasks, namely the sentence and nonword sentence tasks. The mean of the musician group was only slightly higher for the piano task. An explanation might be that the piano task was too easy for our sample (Table 4), making ceiling effects likely. In contrast to the sentence and nonword sentence tasks, the played pattern in the piano task only requires (very low) BAP skills. It is possible that this stress pattern can be analysed and stored in working memory more easily than the stress pattern of a linguistic sequence. Thus, even the poorer PS skills of the non-musicians might suffice for this task, while differences between the two groups arise in the two linguistic PS tasks.

To check for possible reciprocal effects in our adult sample (i.e., that better word reading leads to better prosodic components on the long run, which should predominantly be the case for PS tasks that are close to reading), we compared reading level-matched groups of musicians and non-musicians. Differences found in the whole sample remained stable for the reading level-matched groups, which makes it hardly probable that differences in PS can be attributed to differences in reading. Furthermore, comparing musician and non-musician groups with the same levels of PS did not reveal any difference in reading. Therefore, subgroup analysis seems to support our assumption that better PS (and BAP skills) can hardly be attributed to a reciprocal effect of reading on PS.

In summary, our data indicate that high PS likely facilitates reading achievement and thus that PS is not only an (additional) deficit found in dyslexics. However, it is not clear whether the high level of musical experience and high PS skills in our sample of musicians exist due to their musical training or rather due to good PS prerequisites (such as BAP skills) that existed prior to reading and prior to playing an instrument. Genetic predispositions (including, potentially, BAP skills) might influence whether or not a person learns to play an instrument (Schellenberg, 2020). An interaction of both reasons is also plausible. Hence, our results might not be attributable to the musicians’ musical training per se.

The reported unique effect of PS on reading after controlling for PA (even in a shallow orthography like German) is in line with the results of previous studies conducted with children in languages of different orthographic depths (e.g., Defior et al., 2012; Holliman et al., 2017), whereas it contrasts the results by Schmidt et al. (2016) in German adult participants. However, as half the sample examined in our study is part of an extreme group, considering PS as a continuous variable might risk a biased outcome. Therefore, this result should be treated with caution. Furthermore, our results showed no significant correlation between PS and PA, arguing against a strong connection between these skills in German adults. In terms of group comparisons, musicians outperformed non-musicians in two of three PA tasks. One explanation might be the higher BAP skills assumed to be present in our musician group, which are important for the development of good PA skills (Thomson & Goswami, 2010). However, the PA measures used in this study appeared to be way too easy for our adult sample. Therefore, the PA results should be treated with caution due to possible ceiling effects.

We measured PS using a test on the sentence level, which is seldom the case in German-speaking countries (Schmidt et al., 2016). Firstly, this was necessary because measuring PS on the word level would be too simple for our sample due to the regularities in the German language (see above; Beyermann, 2013; Féry, 1996). Secondly, results based on a PS measure on the sentence level might help clarify the PS–reading relationship (for German). Previous correlational studies have predominantly used PS measures on the word level. However, the results based on measures above the word level, e.g., at the phrase or sentence level (Clin et al., 2009; Goswami et al., 2010; Holliman et al., 2017; Schmidt et al., 2016; Whalley & Hansen, 2006), are likewise contradictory. Holliman et al. (2014) considered different components of PS (stress, intonation, and timing) on different linguistic levels (word, phrase, and sentence). Their results support the assumption that PS on the word level and PS on the phrase/sentence level have different relations to reading. In the sentence-level PS measures used in this study, participants had to manage two processes. Firstly, the stress pattern of the orally presented sentence had to be analysed and stored in working memory. Unstressed and stressed syllables had to be perceived regardless of the stress produced by longer or louder articulation or pitch modulation. Secondly, the stress patterns of the written sentences had to be analysed. Stored lexical stress assignments of the printed words had to be retrieved from the mental lexicon. Additionally, in contrast to word-level measures, the relation between the words had to be considered. A lexically stressed syllable (word level) might be articulated as unstressed or less stressed in a sentence due to a certain sentence accent, whereas a lexically unstressed syllable alwaysFootnote 1 remains unstressed on the sentence level. For example, the German sentence Sie vergisst ihren Hut (She forgets her hat) may have the following stress patterns: sie verGISST ihren HUT or SIE vergisst IHren hut. The monosyllable words and lexically stressed syllables (sie, -gisst, ih-, hut) can be unstressed or less stressed on the sentence level, whereas the lexically unstressed syllables (ver-, -ren) stay unstressed in all cases. The written sentences must be compared to the orally presented stress pattern in order to determine which can be stressed identically. Thus, sentence-level PS might involve a higher process than simply retrieving lexical stress assignments.

Limitations and desiderata for future research

Since our sample of musicians exhibited a high level of musical experience, we assumed good BAP skills (and PS) prior to reading acquisition. However, adding a direct measure of BAP to this research would have been helpful to confirm the presence of good BAP skills in our musician sample. Moreover, our subgroup analysis indicated no reciprocal effect of reading on PS in the sense that long experience with fluent reading improves PS. However, it would be worthwhile to explore early BAP skills and PS prior to reading acquisition and (later) reading skills in a longitudinal design.

In terms of subgroup analysis, the sample sizes are quite small, which can result in a low test power (Cohen, 1988). Hence, the possibility cannot be completely excluded that the lack of significant difference in reading in the PS level-matched subsamples is a result of low test power.

Conclusion

A novel finding is that PS seems to be not only a deficit in dyslexics but an advantage for reading fluency among people with high PS, as shown in a sample of musicians compared to non-musicians. Overall, when examining German-speaking adults with a language-related PS task on the sentence level, we can conclude the following: Firstly, our results support the assumption that being a musician (and thus probably having good pre-existing BAP skills) is associated with extraordinary PS skills. Secondly, our results are aligned with those of previous studies indicating a possible positive effect of PS skills on reading. These are important points due to the possible benefits of prosodic and/or musical or rather non-linguistic BAP training to prevent reading difficulties.