Introduction

In recent years, there has been an increasing focus on the problems of reading disabled (RD) children and children with specific language impairment (SLI) in speech perception (e.g., Godfrey, Syrdal-Lasky, Millay, & Knox, 1981; Joanisse, Manis, Keating, & Seidenberg, 2000; Manis et al., 1997; Mody, 2003; Mody, Studdert-Kennedy, & Brady, 1997; Nittrouer, 1999; Reed, 1989; Rosen, 2003; Serniclaes, Sprenger-Charolles, Carre, & Démonet, 2001; Serniclaes, Van Heghe, Mousty, Carré, & Sprenger-Charolles, 2004; Studdert-Kennedy, 2002; Tallal, 1980). Although there is a lot of controversy in the field, Rosen (2003) suggested in a review of the literature that, rather than a general auditory deficit, the deficit underlying RD is most likely linguistic and phonological in nature and that acoustic complexity should be taken into account (but see Richardson et al., 2004 for a recent account of a nonspeech-specific mechanism). In the current study, we therefore focus on phonological processing and natural speech perception while specifically comparing acoustic contrasts. We selected children with reading problems (reading disabled) rather than children who have SLI.

There is an abundant amount of evidence to suggest that reading problems are due to a phonological deficit (see for reviews Hulme & Snowling, 2009; Vellutino, Fletcher, Snowling, & Scanlon, 2004). This deficit encompasses problems in phonological awareness, short-tem memory, and possibly rapid automatized naming. The phonological deficit is often believed to be due to a problem in the specification of phonological representations. In turn, poor speech perception has been argued to be at the basis of the phonological deficit, as it might hamper the buildup of high-quality phonological representations (e.g., Hulme & Snowling, 2009; Studdert-Kennedy, 2002). Finally, it has been argued (e.g., Mody, 2003) that speech perception problems are due to poorly defined phonological categories that may directly lead to problems in phonological awareness.

It has been argued that, in orthographically transparent languages, with their relatively more consistent connections between letters and sounds than in English, RD children’s problems in phonological awareness tasks might be short-lived and are only manifest in the beginning of learning to read (e.g., Landerl & Wimmer, 2000; Wimmer, Mayringer, & Landerl, 1998). Due to the transparency of the orthography, even dyslexic children can develop adequate phonological awareness. Indeed in a longitudinal study, de Jong & van der Leij (2003) found that, at the end of first grade, Dutch RD children performed worse than normal readers on a phonological awareness task (first and last sound categorization), but this difference had disappeared by the end of sixth grade. However, in an additional study using a nonword phoneme-deletion task, it appeared that fourth graders with RD could delete a phoneme from a one-syllable nonword but had problems in the deletion of a phoneme from a more complex two-syllable nonword. de Jong and van der Leij argued that dyslexic children learning to read in a transparent orthography have access to phonemes in spoken words as simple phonological awareness tasks showed no deficit, but evidently this access is fragile or insecure as a more complex phonological awareness task poses major problems.

Such fragile or insecure access to phonemes may hamper speeded word recognition. Slow word recognition is the major characteristic of dyslexia in transparent orthographies (e.g., de Jong & van der Leij, 2003; Ziegler & Goswami, 2005). It has, for example, been suggested that normal reading children learning to read in German, Dutch, or Italian, all relatively transparent orthographies, rely mainly on highly efficient sublexical recoding at the level of grapheme–phoneme correspondences (Martens & de Jong, 2006; Spinelli et al., 2005; Ziegler & Goswami, 2005). Without efficient access to phonemes, sublexical recoding may proceed very slowly with a resulting lack of fluency in reading. Accurate and rapid access to the phonemes in phonological representations seems needed for such an efficient process.

To determine whether insecure access to phonemes may be an underlying factor for the reading fluency problems in transparent languages, the current study will focus on both speech perception accuracy and speed. Examining not just accuracy but also speed of speech perception seems especially important in transparent orthographies like Dutch where lack of reading speed is the defining feature of RD. The relationship between speech perception accuracy and reading disability has been studied extensively, yet research into the relationship between speed of speech perception and reading is still lacking.

Tallal (1980) argued that speech perception problems in children with RD or SLI are due to the rapidly changing acoustic spectra involved. Auditory memory representations for stop consonants may be less strong because their distinctive cues are of brief duration and they involve fast transitions in the speech signal (e.g., Crowder, 1971). In contrast, Studdert-Kennedy (2002) argued that the problems may well arise because the elements contrasted are acoustically very similar and have to be distinguished from one another under time pressure. Adlard and Hazan (1998) looked at the accuracy of 9–12-year-old children on perceptual tests involving speech and nonspeech sounds. They found no differences between groups on any of the psychoacoustic tests of auditory acuity on nonspeech sounds, providing no support for a general auditory processing deficit (see also Breznitz & Meyler, 2003; Farmer & Klein, 1995; Heiervang, Stevenson, & Hugdahl, 2002; Talcott, Gram, Van Ingelghem, Witton, Stein, & Toennessen, 2003). By contrast and in line with Studdert-Kennedy’s (2002) view, all groups made more errors when speech sounds differed by small degrees, suggesting that similarity complicated accurate perception for all children.

Research that has investigated the speech perception of reading disabled children has mainly focused on contrasts between stop consonants (Godfrey et al., 1981; Joanisse et al., 2000; Maassen, Groenen, Crul, Assman-Hulsmans, & Gabreëls, 2001; Manis et al., 1997). These studies used a place-of-articulation (e.g., ) or voicing continuum (e.g., ) between a limited set of stop consonants. Stop consonants have been chosen because they are assumed to be particularly problematic in view of the fast transitions involved. Most studies to date have focused on a restricted number of contrasts between stop consonants, using phonemes that differed from each other by small and equal steps on a single acoustic dimension (), creating a continuum between two endpoints (either natural or artificial speech). Stop consonants in these studies often differed on only one phonetic feature that was manipulated (e.g., on a voicing continuum). Speech continua are an interesting approach to indicate whether problems exist in speech perception and the contrasts between sounds can be carefully manipulated. In addition, they provide important information on the perception of ambiguous phones in connected speech. However, this paradigm can only be used with a restricted set of sounds, and its main focus is not on the endpoints that occur in natural language. Consequently, it is worthwhile to supplement these studies by research that investigates whether RD children have problems in the perception of the diverse set of sounds they encounter in natural language. To date, there have been few studies that have investigated natural speech, and those that exist had a sole focus on accuracy.

Adlard and Hazan (1998) used natural language contrasts to examine speech perception accuracy in English RD children. Their study involved 13 RD children, 12 reading age-, and 12 chronological age (CA)-matched controls who were between 9 and 12 years old, had English as their mother tongue, suffered no emotional problems, and had no earlier or current problems with speech production. On a phoneme discrimination task with real words (e.g., ), RD children had more errors than reading and CA controls, but these errors were not limited to stop consonant contrasts. RD children’s problems in speech perception appeared more general and not only due to fast transitions. All children had higher error rates when pairs differed in one phonetic feature (place of articulation, e.g., /spill/ -/still/) than when three features (voicing, manner, and place of articulation, e.g., /spot/-/slot/) differed, suggesting that phonetic similarity created accuracy problems in speech perception for all children. On a VCV discrimination task with pseudowords (e.g., ), RD children, however, had more errors on stop consonant contrasts. Error rates on other consonant contrasts (fricative, nasal, and approximant contrasts) also appeared higher in the RD group, yet these differences were not significant. It has to be noted that all contrasts in this task differed on one phonetic feature only, so that errors may only have been caused by phonetic similarity. Further analysis also revealed that only a subgroup (four out of 13) of the RD children was responsible for the lower accuracy scores on the speech discrimination tests. Although Adlard and Hazan’s data demonstrate that phonetic similarity can complicate accurate perception, the issue of the interaction with stop consonants remains unresolved. According to Adlard and Hazan, errors were not limited to contrasts marked by fast transitions but were also due to phonetic similarity (differences on one feature). Similarly, in a study by Treiman, Broderick, Tincoff, & Rodriguez (1998), it was found that, for preschoolers and kindergarten children, stops were not more difficult to recognize than fricative consonants. In view of the contradicting results, further research into the perception of stop consonants in natural speech seems warranted.

Currently, in most studies, contrasts have been examined in simple CV or CVC words. However, it seems important whether the consonant is part of a cluster. In Dutch, for example, there is evidence that RD children continue to have problems in reading words with consonant clusters (Struiksma, 2003), despite intensive and focused instruction. In English, initial consonant clusters are difficult to spell for some children beginning to read, with most errors being due to the omission of the second consonant (e.g., Treiman, 1991), but these problems appear transitory. In the study by Treiman (1991), some children omitted the consonants in second position. In contrast, the first consonant was rarely dropped. Children also had to name pictures and were then asked whether the name contained the target letter. When the consonant was the second in a consonant cluster, children had less correct detections than for that same consonant outside a cluster. In addition, second-grade children made more spelling errors on CCVC words than older children (Steffler, Varnhagen, Friesen, & Treiman, 1998), and very young children had difficulty in pronouncing initial consonant clusters (Treiman & Weatherston, 1992). For Dutch, it has been shown that isolating initial phonemes was more difficult in consonant clusters (Schreuder & Van Bon, 1989). It was further shown that dyslexics had more errors than regular readers in auditory recognition of target phonemes, but the pattern of errors was similar to that of regular readers, with nonwords containing consonant clusters being especially problematic (Bruck & Treiman, 1990).

Research into speech perception of consonant clusters may indicate whether problems in reading consonant clusters that persist despite intervention stem from slow speed of speech perception. In the study by Struiksma, reading problems were most apparent when children were required to read words beginning with the same consonant cluster but followed by different vowels (e.g., blaf - blik - blok [‘bark’- ‘can’ - ‘block’]). Switching between vowels in CCV sequences complicated the reading task. This finding might be explained by the fact that, in articulating the words while reading, one speech segment influences the articulation of another (coarticulation, see Daniloff & Hammarberg, 1973), so that stop consonants have to be identified on the basis of neighboring vowel transitions (e.g., Pols & Schouten, 1978). As a result, even though the spelling of the consonant cluster sounds in the Struiksma study was identical, their acoustic properties changed whenever a different vowel followed, thereby increasing task demands.

Because reading fluency problems are the main characteristic of poor readers in an orthographically transparent language like Dutch, research into their speech perception problems should take speed measures into account. In addition, research is needed to clarify whether differences between RD children and CA controls were due to fast transitions or to phonetic similarity.

In the current study, we first examined differences between RD children and their normal reading peers in the perception of natural occurring speech contrasts instead of the in-between sounds that are used in continua studies. In addition to single consonants, we also examined consonants within consonant clusters, as we expected the perception of consonants in consonant clusters to be more difficult than the perception of single consonants for RD children, especially when stop consonants were involved. We predicted the RD children to have specific problems in the fluent perception of stop consonants.

Problems in speech perception have either been ascribed to fast transitions involved in stop consonants or, alternatively, to similarity between different speech sounds. In an additional analysis, we therefore specifically focused on phonetic similarity. If fast transitions in the auditory signal caused perceptual difficulties, contrasts between different stop consonants involving fast transitions would be more difficult for RD children than contrasts between nonstop consonants as the latter do not have fast transitions. In research on speech perception, contrasts between stop consonants are used because they involve fast transitions in the speech signal. However, there is often the theoretical confound that two stop consonants do not only involve fast transitions but are also phonetically similar. To examine the importance of fast transitions, phonetic similarity needs to be controlled for. We compared stop contrasts that differed on one phonetic feature (place of articulation) with fricative contrasts that differed on one phonetic feature (place of articulation). To examine similarity, we compared stop and fricative contrasts that differed on one phonetic feature (place of articulation) with contrasts that differed on more phonetic features (e.g., voicing, place, and manner of articulation). This approach enabled us to disentangle the influence of fast transitions (stop consonants) and phonetic similarity (number of different phonetic features). Under the similarity hypothesis, discriminating between two similar sounds (only one phonetic feature different) had to be more difficult for RD children than discriminating between sounds differing on more phonetic features. In contrast, if fast transitions were a problem for RD children and not just similarity, stop consonant contrasts would be more difficult than fricative contrasts, even though both differed only on one phonetic feature.

Our aim was to select children who were in the process of learning to read monosyllabic words with initial double consonants. In a recent paper by McArthur & Bishop (2005), they argued that age might be important because an auditory processing deficit might disappear with age, possibly before the age of 9. Abnormal auditory processing could have negative effects on speech discrimination. It is therefore interesting to establish whether speech discrimination in young RD children is already problematic. By selecting beginning readers, we also hoped to limit effects of reading skill on speech perception. This is important because differences in speech perception could be the result of reading skill differences instead of a causal factor, a possibility that has been suggested by Serniclaes and his colleagues (Serniclaes et al., 2001). There is evidence that orthography started to influence auditory language tasks in later elementary grades (Ehri & Wilce, 1980) and that reading instruction changed phonological representations for words (Ziegler & Goswami, 2006). Because reading ability is prone to change at this young age, we used a selection procedure involving repeated testing of reading to make sure that apparent reading problems were not just temporary but persistent. Children were first screened for reading problems 3–5 months before the study. Second, their reading skills were tested at the time of the study, in the same test session that our speech perception measures were administered. Finally, the reading test was administered a year after the study to determine whether they still had reading problems.

In sum, the aim of the current study was to answer the following questions:

  1. 1.

    Do RD children show speed and accuracy deficits in identifying and discriminating speech contrasts?

  2. 2.

    Are stop consonants slower and less accurately perceived than nonstop consonants for RD children?

  3. 3.

    Are consonant clusters slower and less accurately perceived than nonclusters for RD children?

  4. 4.

    Do RD children show speed and accuracy deficits in the perception of fast transitions after controlling for acoustic similarity?

Method

Participants

There were 11 RD children (seven girls and four boys) and 11 CA controls (five girls and six boys) from grade 2 (mean age 7;10 years) in this study. Our selection included only those children with stable and persistent reading problems, and they were therefore a pure representation of reading disabled children at this early stage of reading acquisition. There were 22 children with reading problems in our initial sample of 285. Information from school records was used to exclude RD and CA children with documented hearing or speech problems or with an ADHD diagnosis. All children had to have average scores on receptive vocabulary and nonverbal reasoning, so that differences in speech perception could not be attributable to general language problems or low nonverbal intelligence. Of the original 22 children, four children did not meet the cutoffs for nonverbal and verbal IQ. As a result of this procedure, we collected data from 18 children with reading problems and 26 CA children. However, because the children were relatively young and their reading skills at this time are known to be unstable, we only included in our analysis children with persevering reading problems. We took into account their reading scores at three points in time: at a screening before our study, at the time of our study, and a year after the study had finished. Of the 18 children with low reading scores we collected data from, 11 children were RD children and showed persistent reading problems.

To select poor readers, we first screened the children on a test for single-word reading speed, the Drie Minuten Toets (DMT) or “Three-Minute Test” (Verhoeven, 1995). It was administered 3–5 months before the study started, depending on the test schedule of the participating schools. This is a standardized and reliable test of timed single-word reading that is commonly used in Dutch primary schools. Children have to read a list of words aloud, as quickly as possible and in a clear voice. The raw score is the number of words read correctly. Subsection 2 of this test requires the reading of monosyllabic words containing single consonants and consonant clusters and was therefore appropriate for our aims. At the time of the screening, children had received 12 months of reading instruction, and all RD children already had a delay of 7 months or more when compared to the age adequate levels (see Struiksma, Van der Leij, & Vieijra, 1995; Table 1). At the time of the study, after 15 months of reading instruction, most RD children had 8 months or more delay in reading. Three children had smaller delays (5, 6, and 6 months delay).

Table 1 Mean scores on matching variables for experimental and control groups

A year after the study had finished, we collected scores on the same test (DMT subsection 2) to establish perseverance of the reading problems. At this point, after 27 months of reading instruction, the three children with smaller delays at the time of the study all had a reading delay of 14 months. The best scoring child in the RD group still had 10 months of reading delay when compared with age adequate levels. For details of the average reading lags in months, see Table 1. These results indicate that the RD children continued to have reading problems and that their reading disabilities were stable even early in their reading acquisition.

Children in the CA control group had age adequate reading levels at the time of screening. At the time of the study, they still had age adequate reading levels. A year after the study, some CA children scored at age adequate reading levels, but a number of CA children had progressed even further and were better than the level expected at that age. It is important to note that, when the speech perception data were collected, they had average reading levels for that age.

To make sure that speech perception problems were not due to weak vocabulary skills, receptive vocabulary was measured with a standardized subtest of the Revised Amsterdam Child Intelligence Test (RAKIT; Bleichrodt, Drenth, Zaal, & Resing, 1987). There were two reasons to exclude participants with weak vocabulary skills. The first was that weaker knowledge of Dutch vocabulary and consequently less familiarity with Dutch bigram regularities may influence perception speed, even when pseudowords were involved. Another reason was that RD children should have reading problems that cannot be attributed to lack of schooling as may be evident from a lack in vocabulary knowledge. The test resembles the Peabody Picture Vocabulary Test (Dunn, 1959). A test assistant reads out a word and children have to choose the matching picture from a set of four. For this age group, the test consists of 50 items. The raw vocabulary score (number of correct choices) was transformed into a standardized vocabulary score ranging from 0 to 30, with a mean of 15 and a standard deviation of 5. Standard score means of the complete sample used for screening on the RAKIT (N = 285, M = 16.57, SD = 6.34) were close to this national standard score mean. Scores for all children in the experimental sample had to be above the standard score mean in the sample minus 1 standard deviation to minimize the number of children with wider ranging language problems. In addition, all children were in regular education suggesting they did not have multiple deficits as in The Netherlands children with multiple deficits are often referred to special education.

To correct for the influence of low nonverbal intelligence on speech perception, nonverbal reasoning was measured with the RAVEN Standard Progressive Matrices (Raven, Court, & Raven, 1988). Children completed the first 48 items (sets A, B, C, D) because only these items were assumed to cover the range of intellectual development of children of this age. Scores on the RAVEN had to be above the mean score of the complete sample (N = 285, M = 29.6, SD = 8.7) minus 1 standard deviation.

RD and CA were matched on age (for combined groups M = 7;10, SD = 0;05), receptive vocabulary (raw scores for combined groups M = 41.0, SD = 1.84), and nonverbal reasoning (for combined groups M = 31.6, SD = 5.77). Differences between RD and CA on age, receptive vocabulary, and nonverbal reasoning were nonsignificant. For descriptives of each group separately and significance tests, see Table 1.

Instrumentation

Phoneme discrimination task

Pairs of pseudowords were orally presented (AX format). Participants had to decide as fast and accurately as possible whether the stimuli within each pair (e.g., ) were the same or different by pressing one of two keys on the keyboard (green-taped Spacebar for “same” or red- taped B key for “different”). They were instructed to use their right hand for “same” responses if they were right-handed but their left hand if they were left-handed. By choosing the spacebar for a “same” response, both left- and right-handed children could rest their hands naturally. A fixation point in the center of the screen preceded each stimuli pair for a specified 1,000 ms. After the fixation point had disappeared, the first sound of the pair was played. The interstimulus interval (ISI) within sound pairs was specified to be 200 ms (actual mean duration 200.3 ms, SD = 0.67). An ISI of 200 ms was used because ISIs shorter than 200 ms may lead to mutual masking whereas longer ISIs may make discrimination too easy (for discussion, see Gerrits, 2001, p. 16). Next, the second sound within the pair was presented followed by a silent interval. Sound and silent interval were maximally 10,000 ms to enable a response. The response interval was set relatively long to make sure data from children who gave accurate responses but took in general more time to respond were not lost. However, too long responses in view of a child’s average response were removed as outliers (see the section on “Scoring” for the exact procedure). Reaction time was measured from the onset of the second word in the pair to the onset of the button press. The moment a key was pressed, the fixation point reappeared to introduce the next sound pair. All experimental trials were preceded by ten practice trials: five word pairs followed by five pseudoword pairs. Feedback was provided after each practice trial, either on correctness when an incorrect response was given or on both correctness and speed in the case of a correct response.

Stimuli generation

A 35-year-old female speaker, who was a native speaker of Dutch from the same geographic area as the children tested, was asked to produce the stimuli. She spoke standard Dutch and was a qualified speech therapist. The recording took place in a low reverberant recording room using a digital CD recorder (Pioneer, model PDR555RW) and a microphone (Sennheiser, model MKH105T). The microphone was placed at ca. 50 cm from the mouth. The speaker read from a list and was instructed to speak clearly and at a normal rate. Because this sounded overarticulated, we instructed her to speak slightly faster. This faster rate sounded more natural and was used in the study. After visually inspecting the speech waveforms for onset and offset of the stimuli using PRAAT (Boersma & Weenink, 2003), we chose the starting point 50 ms before the onset and 50 ms after the offset to make sure the whole target sound was included in the selection. Each target sound was then extracted from the digital file and saved as an individual file. The individual files were used in E-prime (Schneider, Eschman, & Zuccolotto, 2002) to set up the desired sequences.

We calibrated the intensity of the sounds generated by the notebooks and headphones to make sure all notebooks had similar intensity levels. Two headphones were connected to each notebook via a splitter. On average, our recorded sound files had intensity levels of approximately 70 dB, according to the program PRAAT (Boersma & Weenink, 2003). With this same program, we generated 70 dB noise with a spectral slope of −6 dB/octave on all three notebooks used in the experiment. We used a decibel meter to measure the acoustical output levels from each of the six headphones. All levels were close to 70 dB. In addition, we measured the output levels in volts of each notebook. Differences between notebooks were negligible. Additional attenuation caused by the added headphones was also negligible. To get exact and equal settings of the volume in Windows, we set volume and wave sliders to maximum. This changed the intensity of the 70d-B noise into 80.6 dB. In E-prime (Schneider et al., 2002), we then set the volume control to −1,500 to obtain an intensity of 66.3 dB on the decibel meter. This is slightly above the normal intensity of ordinary speech of 60 dB so the sound should be clear to the listeners and not too loud to become tiring. Stimuli were played on a Dell D600 notebook via circumaural open-back headphones (Beyerdynamic, model DT 331). A splitter connected two headphones to enable test assistants to monitor the procedure.

Speech stimuli

We selected consonant clusters with stop and nonstop consonants from a list of legitimate initial two-consonant clusters in Dutch (Van Bezooijen & Pols, 1988). Stimuli pairs consisted of monosyllabic pseudowords, constructed by changing one or two phonemes of an existing word. Pseudowords were used because they provided the possibility to control for word frequency, neighborhood density, and age of acquisition across children. To keep the influence of the vowel similar, pseudowords had an in the nucleus. To ensure that our selection of consonants yielded acceptable pseudowords and no existing real words in Dutch, position 1 consonants had the ending . In each pair of pseudowords, the words were identical except for the consonants. To make sure that different responses could be solely attributed to the difference in consonants, we chose minimal pairs with all consonants different on one phonetic feature. In the experimental trials, we also included a number of “same” fillers to keep the number of required “same” and “different” responses equal.

The stop consonants and were selected. See “Appendices 1” (“different” items) and “2” (“identical” items) for a list of consonants and contrasts used in the speed and accuracy analysis of speech perception by RD and CA children. To avoid learning effects, the same stimuli occurred only once. Target sounds appeared either as the only consonant in CV syllables (e.g., ) or with another consonant in CCVC monosyllabic pseudowords. In the CCVC pseudowords, the target stop consonants appeared in position 1 (e.g., ) of the consonant cluster. We checked consonant sounds for occurrence with the vowel in a Dutch dictionary (only did not occur with this vowel). Care was also taken to have identical stop consonants in all positions, and therefore, not all possible stop consonants in Dutch could be used. Nevertheless, it was possible to use a wider selection of sounds than the restricted set that is normally used in the continua studies. In the accuracy analysis, some children had all different and identical position 1 items correct, and including this variable would violate assumptions for MANOVA. Therefore, the identical and different items in position 1 were removed from the accuracy analysis, and we used position 2 items to test the effect of clusters. A stimuli pair contained identical sounds, repeating the same sound file, or sounds that differed on one phonetic feature, for example, the same manner of articulation but different place of articulation (e.g., bilabial vs. alveolar plosives, and ).

The nonstop consonants in position 1 for the speed items and in position 2 of the cluster for the accuracy items were chosen for the different items (see “Appendix 1”). For single consonants, and were chosen both in the speed and accuracy analysis. Sounds differed either in voicing, place of articulation, or manner of articulation (e.g., alveolar lateral approximant vs. alveolar trill, and ). For the identical items in our speed analysis (see “Appendix 2”), we chose the consonants in position 1 and single . In the accuracy analysis, we chose the consonants in position 2 and single .

In the analysis that focused on the fast auditory processing hypothesis and phonetic similarity, we examined pairs that differed on one or more phonetic features. See “Appendix 3” for a list of consonants and contrasts used. We tested the stop consonants and the nonstop consonants and . We compared three types of contrasts. The first type of contrast involved stop consonants that differed on one phonetic feature (bilabial vs. alveolar ). The second contrast contained fricatives that also differed on one phonetic feature (labiodental vs. alveolar ). The third contrast involved sounds that differed on three phonetic features: place of articulation, manner of articulation, and voicing (bilabial voiced plosive vs. velar voiceless fricative ).

Scoring

We computed proportions correct because the numbers of items were not identical across all contrasts. For our speed measure, we used raw reaction time. Average speed was computed across the “hits” (i.e., correct responses on the items). As we were interested in speed, items used in our speed analysis had to be easy and should not pose a problem for CA listeners. We therefore retained only those items that were answered correctly by 50% or a higher percentage of the CA listeners. The sole aim of excluding these items was to exclude items that were problematic for all children, due to factors outside the focus of our study (e.g., unclear pronunciation of sounds). Three items in the phoneme discrimination task were consequently removed ( and ). We checked whether children had answered “yes” or “no” too often, as this might reflect an answering strategy. The maximum proportion of “yes” responses on the 114 tested items was 0.56 (correct proportion was 0.50), so no children were removed from the analysis. For our speed analysis, incorrect responses were recoded into missing values. In addition, we checked for outliers and removed items that were too fast or slow in view of an individual child’s scores on all items. We replaced all scores slower than −3.29 standard deviations or faster than +3.29 standard deviations from a child’s mean speed score (see Tabachnick & Fidell, 1996, p. 67) by a missing value. After removing not only the RT outliers but also all RTs for incorrectly responded items, the average proportion of missing data for each item was 0.07.

General procedure

A trained test assistant tested each participant individually at two different sessions in a quiet room at their school. Test assistants were sitting side by side the child to avoid the possibility that children could take cues from the assistants. The first session took about 30 min and started with the administration of the Drie Minuten Toets, followed by two speech perception tests. The second session also took about 30 min and consisted of two other speech perception tests and the reading aloud of all the speech perception words from a list. The speech perception tests were either the phoneme discrimination test reported on in this study or a phoneme-monitoring test. To avoid sequence effects, items within tests were randomized, and order between tests was randomized across subjects. Position was blocked for each speech perception test with single and position 1 consonant items in one test and position 2 items in the other test.

Results

In the first section, we will provide evidence on our first three research questions. We will present results on speed and accuracy deficits in identifying and discriminating speech contrasts, the perception of stop consonants in comparison to nonstop consonants, and the perception of consonants in consonant clusters in comparison to their perception as a single consonant. In the second section, we will answer our fourth question and focus on the perception of stop consonants in comparison to nonstop consonants while specifically controlling for phonetic similarity.

Examining speech perception in RD children and CA controls: the effect of type of contrast, position, and type of stimulus

To test whether RD children had speech perception processing problems, we compared RD children and CA controls on stop consonant contrasts and contrasts between other consonants, both in single position and in clusters. Speed and accuracy scores were subjected to a separate 2 × 2 × 2 × 2, four-way, mixed ANOVA with reading group (RD and CA controls) as a between-subjects factor and type of stimulus (identical and different items), type of contrast (stop consonant contrast and nonstop consonant contrast), and position (in a cluster and single consonants) as within-subjects factors. Because of high accuracy scores in both groups, we used arcsine-transformed scores in our analyses (Steel & Torrie, 1982, p. 236).

Accuracy

We first tested whether RD children had accuracy deficits in identifying and discriminating speech contrasts and whether stop consonants were especially difficult for RD children. There were no significant overall differences between reading groups in accuracy, F (1, 20) < 1, p = 0.34, partial η 2 = 0.05. In addition, there was no interaction between reading group and type of contrast (stop consonant vs. nonstop consonant) with F (1, 20) < 1, p = 0.64, partial η 2 = 0.01, failing to support the idea that stop consonants were more difficult for RD children than for CA children. Mean untransformed proportion correct appears in Table 2.

Table 2 Mean proportions correct on stop and nonstop consonant contrasts

In contrast to the hypothesis that stop consonants were more difficult for RD children, there was a statistical trend on the main effect of type of contrast (stop consonant vs. nonstop consonant) with F (1, 20) = 4.09, p = 0.06, partial η 2 = 0.17, indicating that stop consonants were easier than nonstop consonants for both RD and CA children. There was also a main effect (Wilks’ lambda) of type of stimulus (identical and different items) with F (1, 20) = 14.27, p < 0.01, partial η 2 = 0.42, indicating that giving responses to identical items was easier. However, both effects were modified by an interaction effect between type of stimulus and type of contrast with F (1, 20) = 5.55, p < 0.05, partial η 2 = 0.22. The interaction indicated that stop consonants were only easier than nonstop consonants when different items were involved. It also suggested that responses to identical items were especially easier in the case of nonstop consonants. We also tested whether consonant clusters were more difficult than nonclusters. The effect of position was not significant, F (1, 20) < 1, p = 0.62, partial η 2 = 0.01, as consonants that appeared on their own (single position) had similar accuracy scores (M = 0.93, SD = 0.05) as consonants in a consonant cluster (M = 0.90, SD = 0.15). Our data therefore did not support the theory that consonants clusters were more difficult than single consonants. It also has to be noted that the accuracy levels were lower for both groups in the case of the nonstop different condition (Table 2, last column), possibly because in Dutch pronunciation the distinction between / f / and / v / is acoustically not very salient (see the nonstop items in “Appendix 1”, single position). Finally, correlational analyses showed no significant relationship between accuracy in speech perception and reading fluency, suggesting that accuracy of speed perception was not an underlying deficit for lack of reading fluency in Dutch RD children.

Speed

We first tested whether RD children had speed deficits in identifying and discriminating speech contrasts and that stop consonants were especially difficult for RD children. Our analysis used mean raw reaction times (results were identical when analyses were based on words per second and missing data were estimated with the EM algorithm). There were no significant overall differences between groups in speed, F (1, 20) = 2.65, p = 0.12, partial η 2 = 0.12 (see for the mean raw reaction times Table 3). There was no effect of type of contrast (stop consonant vs. nonstop consonant) with F (1, 20) = 1.91, p = 0.18, partial η 2 = 0.09, indicating that processing speed was not deficient for stop consonants.

Table 3 Mean reaction times (milliseconds) on stop and nonstop consonant contrasts

We also tested whether consonant clusters were processed more slowly than nonclusters. Results showed that the effect of position was not significant, F (1, 20) = 1.50, p = 0.24, partial η 2 = 0.07: Consonant clusters (M = 1,520, SD = 398) were not responded to more slowly than single consonants (M = 1,452, SD = 308).

In contrast, there was a main effect (Wilks’ lambda) of type of stimulus (identical and different items) with F (1, 20) = 6.92, p < 0.05, partial η 2 = 0.26, indicating that both RD and CA children responded faster to identical items. Interestingly, there was a significant interaction effect between type of stimulus and reading group, with F (1, 20) = 4.38, p < 0.05, partial η 2 = 0.18, suggesting that identical and different items were not processed equally fast by RD and CA children. To clarify this interaction effect, we used t tests within each group to compare speed on identical and different items. Results showed that speed on identical and different items did not differ for the RD children t(10) = 0.211, p = 0.84, d = 0.1. In contrast, CA children were faster on identical items t(10) = 3.706, p < 0.01, d = 1.0. Additional t tests showed that RD and CA did not differ in speed on different items t(20) = 0.647, p = 0.53, d = 0.3. On the other hand, CA children were significantly faster on identical items than RD children t(20) = 2.259, p < 0.05, d = 1.0. In other words, the CA control group was faster in identifying identical sounds than in distinguishing different sounds, which was not the case for the RD children. To determine whether only a subgroup of RD children was responsible for this finding, we examined the scatter plots of the average scores of both reading disabled and control children (see Fig. 1). Although there was some overlap (two RD participants scored relatively fast and three CA scored relatively slow), there was no evidence that a subgroup of reading disabled children was responsible for the group differences.

Fig. 1
figure 1

Average speed for each participant in RD and CA reading group on identifying identical items

Correlational analyses supported the relevance of the finding on the ANOVAs that RD and CA controls differed on identifying identical items. There was a modest correlation between speed on identical and on different items with r = 0.54, p < 0.01 (n = 22), showing that, in general, fast identification was in line with fast differentiation. However, only the scores on the reading fluency measure (DMT) and reaction times on identical items correlated in the expected direction with r = −0.48, p < 0.05 (n = 22). Children with higher scores on reading fluency were faster in identifying identical items. In contrast, there was no significant correlation between reading fluency and distinguishing different items with r = −0.12, p = 0.59 (n = 22). When we recall that RD and CA controls did not differ in speed of discriminating sounds, this is additional evidence that faster discrimination of speech sounds is not essential for fluent reading.

To preclude possible interactions between type of contrast and reading group due to feature type differences, we carried out an additional analysis that only involved the place of articulation contrasts / p / - / t / and / f / - / s / (to make comparison with the previous analysis and the same participants possible, the identical contrast / xa / - / xa / was retained). This analysis confirmed the previous results in that we found neither a main effect of reading group nor an interaction between type of contrast and reading group on speed. Dyslexics were overall not slower than CA children and neither were they slower on stop consonants. We no longer found a main effect of type of stimulus (identical and different items), but the interaction effect between type of stimulus and reading group was replicated with F (1, 20) = 3.96, p = 0.06, partial η 2 = 0.17 (the previous significant effect now became a statistical trend, presumably due to smaller reliability because of fewer items). We performed additional t tests for each reading group to see whether speed on identical items was faster than on different items. Results showed that speed on identical and different items did not differ for the RD children t(10) = 0.165, p = 0.87, d = 0.1. In contrast, CA children were faster on identical items t(10) = 3.721, p < 0.01, d = 0.99. Additional t tests showed that RD and CA did not differ in speed on different items t(20) = 0.418, p = 0.68, d = 0.2. On the other hand, CA children were faster on identical items than RD children t(20) = 2.026, p < 0.05, d = 0.98. In sum, when we only included place of articulation contrasts, we found the same effects as we found in our previous analysis, although differences between groups became slightly smaller. Our previous result that only the CA control group was faster in identifying identical sounds than in distinguishing different sounds but not the RD children was replicated.

Examining the fast auditory processing hypothesis: fast transitions vs. similarity

To test Tallal’s hypothesis and to establish whether RD children had speech perception problems in the case of fast transitions or in the case of similar sounds, we carried out an analysis contrasting stop consonants (fast transitions) to nonstop consonants, while controlling for the effects of similarity on speech perception. Because of high accuracy scores in both groups, we used arcsine-transformed scores in our analyses (Steel & Torrie, 1982). Both speed and accuracy scores were subjected to a separate 2 × 3, two-way, mixed ANOVA with reading group (RD and CA controls) as a between-subjects factor and number of features different (one-feature different stop consonant contrast, one-feature different nonstop consonant contrast, and three-feature different stop and nonstop consonant contrasts) as a within-subjects factor.

We specifically controlled for number and type of phonetic features that sounds in a contrast differed. To avoid confounding effects of type of phonetic feature, we only took into account place of articulation contrasts. For example, we left out voicing contrasts between fricatives like / f / and / v / because the distinction between the pronunciation of / f / and / v / is acoustically not very salient in Dutch (see Table 2, last column). We compared three types of items. The first involved stop consonants in words that only differed in one phonetic feature of the same type: place of articulation (). The second contained fricatives in words that also only differed in one phonetic feature and again in place of articulation (). The third type involved sounds in words that differed in three phonetic features: place of articulation, manner of articulation, and voicing (e.g., bilabial plosive + voice vs. velar fricative − voice). See “Appendix 3” for the items used in the phonetic feature analysis.

Accuracy

We first tested whether RD children had accuracy deficits in identifying and discriminating speech contrasts. There were no differences between groups in accuracy, F (1, 20) < 1, p = 0.63, partial η 2 = 0.01. Next, we tested whether phonetic similarity caused speech perception problems and whether this differed between groups. There was a statistical trend on number of phonetic features different (one-feature different stop consonants vs. one-feature different fricatives vs. three-feature different consonants), F (1, 25) = 3.58, p = 0.06, partial η 2 = 0.15. See for the untransformed mean proportion correct Table 4.

Table 4 Mean proportions correct on different items examining similarity

However, there was no interaction between number of phonetic features different and group with F (1, 25) < 1, p = 0.47, partial η 2 = 0.03. This result indicates that RD children did not have more errors than CA controls when phonetic similarity increased.

To test the hypothesis that the number of different features determined difficulty, we specified a contrast between, on the one hand, both one-feature different stop consonants and one-feature different nonstop consonants and, on the other hand, three-feature different stop and nonstop consonants. This contrast showed that items that differed in three phonetic features were easier for both groups than items that differed in one phonetic feature, F (1, 20) = 6.53, p < 0.05, partial η 2 = 0.25. To test whether fast transitions were problematic, we tested whether stop consonants were more difficult than nonstop consonants when the number of features different was kept constant. We specified a contrast between, on the one hand, one-feature different stop consonants and, on the other hand, one-feature different nonstop consonants. We specifically controlled for number and type of phonetic features that sounds in a contrast differed by. This contrast showed that stop consonants that differed in one phonetic feature did not yield lower accuracy levels than fricatives that differed in one feature, F (1, 20) = 2.40, p = 0.14, partial η 2 = 0.11. These findings confirmed that number of features and therefore similarity were important in achieving accuracy and that groups did not differ. To determine the nature of the feature problems, we carried out post hoc paired samples t tests. These showed that one-feature stops were more difficult than three-feature fricatives and stops with t(21) = 2.309, p < 0.05, d = 0.7. On the other hand, one-feature fricatives were only marginally more difficult than three-feature fricatives and stops with t(21) = 1.449, p = 0.08, d = 0.2. This suggests that it was not only the number of features different but the combination of number of features different and the quality of stop consonants that were determining difficulty in distinguishing sounds. Stop consonants in themselves seemed to be not the only problem as the difference between one-feature stops and one-feature fricatives (nonstops) with t(21) = 1.555, p = 0.07, d = 0.5 was only marginally significant. Comparing stops with sounds that differed on three features was not difficult either. It seems that stops were difficult only when they had to be compared with similar sounding stops. These new findings modify our previous interpretation somewhat, in the sense that accuracy seemed not based on similarity alone: Stop consonants presented special problems as well. Still, it has to be noted that, even after including only place contrasts, we did not find differences between reading groups: Distinguishing highly similar stop consonants was difficult for both RD and CA reading controls.

Speed

We first tested whether RD children had processing deficits in identifying and discriminating speech contrasts. There were no significant differences between groups in speed, F (1, 20) < 1, p = 0.99, partial η 2 = 0.00. Next, we tested whether phonetic similarity caused speech perception problems and whether this differed between groups. There were no differences on number of phonetic features different (one-feature different stop consonants vs. one-feature different fricatives vs. three-feature different consonants) with F (2, 40) < 1, p = 0.57, partial η 2 = 0.03. See for the mean reaction times Table 5.

Table 5 Mean reaction times (milliseconds) on different items examining similarity

There was no significant interaction between number of phonetic features different and group, with F (2, 40) < 1, p = 0.48, partial η 2 = 0.04. This indicates that RD children did not process speech slower than CA controls when phonetic similarity increased.

To test the hypothesis that the number of different features determined difficulty, we specified a contrast between, on the one hand, both one-feature different stop consonants and one-feature different nonstop consonants and, on the other hand, three-feature different stop and nonstop consonants. This contrast showed that there was no evidence that items that differed in three phonetic features were faster responded to than items that differed in one phonetic feature, F (1, 20) < 1, p = 0.57, partial η 2 = 0.02 . To test whether fast transitions were problematic, we tested whether stop consonants were more difficult than nonstop consonants when the number of features different was kept constant. We specified a contrast between, on the one hand, one-feature different stop consonants and, on the other hand, one-feature different nonstop consonants. This contrast specifically compared stop contrasts and fricative contrasts, after controlling for number and type of phonetic difference. Results showed that, when stops differed only on place of articulation, they were not slower processed than fricatives that differed on place of articulation with F (1, 20) < 1, p = 0.36, partial η 2 = 0.04. This result makes clear that processing of fast transitions or similar sounds was not problematic for either the RD children nor for the CA controls.

Discussion

In this study, we compared differences in the perception of natural speech contrasts between young children with persistent reading problems and their normal reading peers. Our results provided no evidence that reading disabled children had accuracy deficits in identifying and discriminating speech contrasts. In contrast, the present study showed that, when processing time was taken into account, RD children appear to have speed problems in identifying sounds rather than in discriminating sounds, suggesting a deficit in automatic access to phonetic representations. Consonant clusters were not slower or less accurately perceived than nonclusters, neither for RD children nor for controls. As to the difficulty of stop consonants, stop consonants were not processed slower, yet an unexpected finding was that stop consonants seemed more accurate for all children. However, this difference disappeared after number and type of different phonetic features was taken into account. Our results only partly confirm Tallal’s theory (e.g., Tallal, 1980) that problems with fast transitions are at the basis of speech perception deficits in natural speech. The current findings indicated that, for all children, fast transitions were only perceived less accurately when they also differed on one phonetic feature and were therefore highly similar. Contrasts that differed on more phonetic features were easier to discriminate even when stops were involved. There were no differences in speed of perception. As far as accuracy is concerned, it seems that only the combination of similarity and fast transitions was difficult to perceive. On the other hand and more important, we did not replicate Tallal’s finding that fast transitions were especially problematic for RD children. Rather, RD children and CA controls did not differ in accuracy and even though accuracy levels were in general high, results showed that highly similar stop consonants were relatively more difficult to distinguish for all readers.

To test perception skills more purely, ruling out word knowledge, we used pseudowords in a task similar to the VCV discrimination task (e.g., ) used by Adlard and Hazan (1998). Their study showed that natural stop consonant contrasts were more problematic for RD children. Adlard and Hazan already remarked that there were also sizable error rates for the RD group on minimal pairs like “smack” and “snack”. Errors occurred in the case of fricatives, nasals, and approximants, yet these failed to reach significance. Closer inspection of their data suggests, however, that the difference on stop consonants was significant only because controls had a low error percentage on stop consonant contrasts. Apparently, accurate perception of stop consonants was not more problematic for RD children than perception of other sounds; only relative to the controls these errors were more pronounced. Our data involved pseudowords and were therefore less familiar to the control group, possibly explaining the different findings. As pointed out before, we found that stop consonants were difficult for all readers, yet only when they were similar as well.

As far as accuracy is concerned, our study showed that RD children did not differ from CA controls. Although the sample size is not large, it is comparable to previous studies in the field (e.g., Adlard & Hazan, 1998) that found differences between groups, suggesting that these sample sizes are sufficient to identify these kinds of effects. Still, we have to acknowledge that it is a possibility that a larger sample size could have enhanced power to such an extent that more differences between groups would have appeared. However, in contrast to the medium to large effects that were significant with the current sample size, these effects would be rather small and consequently less meaningful. Admittedly, it is also possible that our findings differ from the previous studies (e.g., Godfrey et al., 1981) because these have used artificial speech continua. It is a possibility that, in natural speech, other acoustic cues might obviate possible problems with fast transitions. However, other acoustic cues may only play a role when comparing stop and fricative contrasts, not when stop consonants have to be compared that only differ on place of articulation. The current findings are also in line with previous research with Dutch children by Blomert and Mitterer (2004), which also suggested that accuracy differences between groups disappeared when natural speech was used. Serniclaes et al. (2001, 2004) also did not find differences in accuracy between RD and CA on continua endpoints. They suggested that this could be due to the fairly long ISI of 100 ms that was used. Mody et al. (1997) found differences between groups but reported larger differences with shorter ISIs (10 vs. 50 or 100 ms). In contrast, Breznitz (2002) found differences with an ISI of 80 ms but not with a shorter ISI of 30 ms. In our study, we used an ISI of 200 ms because ISIs shorter than 200 ms may lead to mutual masking whereas longer ISIs may make discrimination too easy (for discussion, see Gerrits, 2001, p. 16). It is a possibility that the use of the longer 200 ms ISI may have caused the lack of accuracy differences. By contrast, Adlard and Hazan (1998) used an ISI of 1 s and found differences between groups. Clearly, more systematic study into the influence of different ISIs is required.

We carefully selected participants to specifically exclude the possibility that the speech perception problems were due to comorbid problems such as ADHD or other language problems. A similar procedure was followed by Adlard and Hazan (1998), who also excluded children who had current or earlier problems with speech production. As recent definitions of RD also stress resistance to remediation and persistence of the reading problems, we have excluded children with a lack of educational opportunity expressed by low vocabulary knowledge and retested all children a year after our study took place. McArthur and Bishop (2001) have stressed the importance of examining individual differences within the group of children that is reading disabled. It may well be that only a subgroup of reading disabled children has problems in rapid auditory processing. In the present study, we found reading disabled children slower in giving responses to identical item pairs. We therefore also examined the scatter plots of the average scores of both reading disabled and control children. Although there was some overlap in scores, there was no evidence that a subgroup of reading disabled children was responsible for the group differences we found, so slower identification appeared to be a general characteristic of the children with reading problems. However, the possibility remains that a number of RD children have comorbid problems. Further research should make clear whether also in transparent orthographies like Dutch such a comorbid subgroup may have problems in speech perception accuracy.

On the basis of Treiman (1991) and Struiksma (2003), we predicted the perception of phonemes in clusters to be more difficult than in single consonants, resulting in more errors and a longer processing time. The current study showed that the discrimination of clusters was not more difficult than discriminating single consonants and they did not require more processing time. Apparently, a surrounding cluster did not hamper straightforward recognition that sounds were identical or different. In research on spelling development, however, Treiman (1991) demonstrated that consonants in second position within an initial consonant cluster were problematic. Treiman (1991) suggested that these errors were due to a lack in phoneme awareness, a failure to analyze the clusters into their component phonemes, treating the first and the second consonants as a unit. In contrast, we did not find differences in correct detection proportions between position 2 in an initial consonant cluster and consonants in single position. Our study suggests that children knew which component phonemes were in position 2 within the cluster. In a study of German dyslexics, Landerl and Wimmer (2000) also showed that, for grade 3 dyslexic children, phoneme segmentation of onset consonant clusters was not problematic. Consequently, lack of phoneme awareness appears not to be the cause of problems with consonant clusters in Dutch. This is in line with Vellutino et al. (2004), who argued that phoneme awareness may be less of a problem for Dutch reading disabled children because of the shallow orthography of Dutch (but see Patel, Snowling & de Jong, 2004 for a different view). In a shallow orthography, it is possible that orthographic knowledge feeds back into the phonological system, preventing phonological awareness problems. However, this does not mean that there is no phonological deficit; only the manifestation of the phonological deficit may have changed (e.g., de Jong and van der Leij, 2003).

In the current study, our main focus was on processing time. The CA control group was faster in identifying identical sounds than in distinguishing different sounds, yet there was no difference for the RD children. Our findings demonstrate that, even when using natural speech, the method is sensitive enough to detect speech perception problems. However, it could be argued that the differences in reaction time reflect differences in more basic speed of processing abilities such a pure motor response/preparation. Many studies point to slower RTs for RD children on a number of tasks (e.g., see Talcott et al., 2003). However, note that, in the current study, we did not find differences between RD and CA controls when children were required to distinguish between different sounds but we did find them on a similar task when identifying identical items. If basic (motor) skills had been a decisive factor, our data would have showed differences between RD and CA controls on all measures, with RD children consistently showing slower RTs. Therefore, motor skills do not seem to account for the findings on speech processing in the current study.

The results of this study suggest that specific problems for RD children in accuracy found in other studies (e.g., Adlard and Hazan, 1998) resurface in a transparent orthography through processing speed. As said, the processing speed deficit occurs in identifying sounds rather than in discriminating sounds. In line with this finding, Serniclaes et al. (2001) argued that RD children were actually better in discriminating sounds belonging to the same phonemic category. These authors suggested that the reason for inferior speech perception was not so much a deficit in the perception of sounds but rather a result of less well-established phonemic categories, preventing them from subsuming allophones under the same phonemic category. The point Serniclaes et al. made was that RD have no perception problems per se. However, they argued that their phonemic categories may be less well established as they distinguish between different sounds that differ acoustically yet belong to the same phonemic category (allophones) and therefore should not be distinguished for understanding. We assume that this uncertainty may result in processing problems and longer RTs for RD children. It could therefore be speculated that the representation of consonants was not as well developed in the case of RD children and was possibly not distinct enough to differentiate different consonant sounds (see Elbro, 1996). On the other hand, similar to the research by Serniclaes, our present data indicated that the RD children’s mental specification was sufficiently distinct to differentiate stop and nonstop consonants (e.g., the /p/ can be differentiated from the /t/ when hearing the pair /prar/ -/trar/). Consequently, we suggest that RD children are less automatic in evaluating a phonetic representation, perhaps reflecting a general problem in access to phonology. These problems do not surface when detecting differences between sounds or only when sounds involve fast transitions as predicted by Tallal’s theory. Rather, poor readers experience problems with the fast identification of all sounds. Our correlational analysis supported this finding as these correlations showed that reading related variation in identifying sounds was significant for the combined groups and distinguished RD and CA control readers. In contrast, fast discrimination of sounds was not the crucial difference between RD and CA controls. Based on data from a recent longitudinal study, de Jong and van der Leij (2003) suggested that RD children’s access to phonemes may remain relatively fragile. Similarly, it has also been suggested that normal reading German children rely mainly on highly efficient sublexical recoding at the level of grapheme–phoneme correspondences (Ziegler & Goswami, 2005). If RD children lack efficient access to phonemes, sublexical recoding may proceed very slowly with a resulting lack of fluency in reading.

To disentangle whether fast transitions or phonetic similarity between sounds was causing problems for RD children, we also examined contrasts that were either very similar, i.e., that differed in one phonetic feature, or less similar, i.e., that differed on three phonetic differences. In addition, we compared stop consonant contrasts (with fast transitions) with fricative contrasts (without fast transitions) that both differed only in one phonetic feature and were therefore just as similar. Studdert-Kennedy (2002) has argued that, rather than the occurrence of fast transitions, phonetic similarity may be the underlying reason for problems in speech perception. Our results showed that, for both groups, similar speech sounds involving fast transitions were more difficult to distinguish and that this was not a specific problem for RD children, supporting Studdert-Kennedy’s theory. Similarly, Adlard and Hazan (1998) showed that, with a test involving real words, the highest error rates were obtained when pairs differed in one feature only. Their research also indicated that children made more errors when pairs of nonspeech sounds were similar so this seems to be a general issue in the processing of sounds.

Tallal (1980) assumed that fast transitions in stop consonants were problematic for RD children. We found that this was indeed the case, yet only when sounds were highly similar. However, in contrast to Tallal, these problems did not differentiate poor readers and controls, and therefore, they do not provide an explanation for the lack of reading speed that is so central to RD children’s problems in transparent orthographies. To conclude, the major finding of the present study was that RD children were slower in identifying sounds relative to CA controls, even though there was no difference in speed of discriminating sounds, suggesting that RD and CA groups differ in identification speed. It is possible that less distinct phonemic categories underlie this lack of processing speed, which in turn may lead to a lack in reading fluency.