Introduction

Developmental dyslexia is a neurobiological disorder characterized by a difficulty in reading acquisition despite adequate intelligence, conventional education, and motivation (APA 1994). The prevailing view supports the hypothesis that dyslexia results from a specific deficit of auditory-phonological perception, representation, and phonological memory (see Vellutino et al. 2004; Ziegler and Goswami 2005 for reviews). Children and adults with dyslexia show, indeed, deficits in the representation and manipulation of phonological information (e.g., poor speech-sound awareness, slow lexical retrieval and poor phonological short-term memory; see Ramus 2003, for a review). These deficits could interfere with one of the most critical skills for successful reading acquisition, such as phonological decoding (Share 1995; Ziegler and Goswami 2005).

Apart from their phonological difficulties, dyslexic subjects often suffer from a variety of subtle sensory and motor deficits. Whether these deficits have any causal relation to the reading disorder, or are totally independent, is currently under debate. One hypothesis of a visual cause for dyslexia is that the reading disorder is caused by a deficiency in the magnocellular part (also referred to as the “transient system”) of the visual system (Stein and Walsh 1997). This hypothesis might seem controversial, since one would expect the parvo- rather than the magnocellular pathways to be largely involved with fine pattern vision and object discrimination that are essential for reading. However, studies have indicated a function for the magnocellular-dominated dorsal stream in selective attention (Motter 1993; Vidyasagar 1998).Vidyasagar and Pammer (1999) suggested that if the magnocellular system (M system) is involved in gating all visual input going through the striate cortex, a deficit in this system would also affect the parvocellular system (P system). This would become manifest in tasks in which there is intense competition for attentional resources and in which the supposed M-mediated attentional spotlight is essential for good performance. Stein (2003) also suggested that a magnocellular deficiency may cause a type of visual attention deficit in dyslexia. This has made it important to assess visual attention in dyslexic readers in more detail. A recent and particularly interesting hypothesis that links the magnocellular deficit with reading problems is that dyslexics may have “sluggish attentional shifting” (SAS) (Hari and Renvall 2001).

The present study was motivated by the idea that SAS could indeed provide a coherent framework to understand a variety of sensory problems that dyslexics encounter. The basic notion underlying SAS is that sensory input is chunked and that attention of dyslexic subjects, once engaged on a chunk, cannot be easily disengaged (Hari and Renvall 2001). This causes impairments in the shift or the focusing of attention (Hari and Renvall 2001), and it may result in a prolonged attentional dwell time and poor judgments of temporal order (Jáskowski and Rusiak 2008). SAS can affect fluent reading because the sensory input is prolonged, thereby degrading essential cortical representations (for review, see Hari and Renvall 2001). In line with this idea, Lallier et al. (2010) used an auditory and visual stream segregation task and reported that in order to process two successive stimuli separately, dyslexic participants with phonological impairments required a significantly longer inter-stimulus interval than controls regardless of sensory modality. Another important prediction from SAS is that dyslexics may profit from a transient sound because of a general alerting effect that improves the disengagement of attention.

A number of methods have been used to measure visual attention in dyslexics, but so far, none has used sounds to improve search time. In the standard visual search task, participants search for a prespecified visual target among other distracters. Previous reports are rather consistent with a visual attention deficit in dyslexics, reporting slower search times for dyslexic than normal readers (Williams et al. 1987; Ruddock 1991; Casco and Prunetti 1996; Vidyasagar and Pammer 1999; Sireteanu et al. 2008; Romani et al. 2011). In this context, Pammer and Vidyasagar (2005) and Jones et al. (2008) argued that dyslexics may suffer from an impairment in the serial allocation of attention. According to SAS, the slowness of dyslexics can be explained by an impairment in the disengagement of attention from an element in the search set.

Here we thought to add sounds to the visual search task, using the “pip-and-pop” paradigm by Van der Burg et al. (2008). These authors designed a visual search task in which a target (a horizontal or vertical line) was embedded in a cluttered display of distracters (oblique lines). The targets and distracters changed, on randomly determined times color (and, important from the perspective of the magnocellular system, also luminance) from green-to-red or red-to-green. They found that a simple auditory pip could drastically decrease search times if the pip was synchronized with the color/luminance-change of the target: the “pip” then made the target “pop-out.” Further studies have shown that a sound will only lead to benefits in visual search if the changes in the two signals are both synchronized and transient (Van der Burg et al. 2010). If this condition is met, then the effect will also resist wide spatial misalignment (Fiebelkorn et al. 2011).

In the current study, we used this paradigm to examine whether a sound would improve the visual search time of dyslexic readers more than it does in normal readers. If, as proposed in SAS, dyslexic readers have problems with the disengagement of attention from the current fixation, one expects them in the tone-absent condition (serial search) to have longer search times than normal readers, and their slope of the search time per item should be steeper (see also Vidyasagar and Pammer 1999; Sireteanu et al. 2008; Romani et al. 2011). In the sound-present condition, though, the pip can make the target pop-out (parallel search), and search time may become independent of the set size of the distracters. This should be particularly helpful for dyslexic readers, as they may have a specific difficulty with serial, but not parallel, search (Sireteanu et al. 2008). Ultimately, then, a single pip may compensate for the dyslexics’ visual attention deficit.

Method

Participants

Fifteen young adults with developmental dyslexia (five men and ten women) and 15 age-matched controls without reading difficulties (five men, ten women) were tested. The dyslexic readers had been diagnosed with developmental dyslexia based on standard exclusion criteria (APA 1994). They were all formally assessed and diagnosed by clinical and educational psychologists. Their reading achievements (accuracy and/or speed) were additionally assessed via standardized Dutch reading tests for single word and nonword reading (Brus and Voeten 1997; Van den Bos et al. 1999). Dyslexic participants were selected on the basis of (1) normal or corrected-to-normal vision and hearing; (2) absence of neurological and/or psychiatric disorders; (3) absence of attention deficit disorder with hyperactivity (because of the high comorbidity with dyslexia); (4) absence of color blindness. The controls reported no history of reading problems. The two groups were drawn from the same subject pool of university students, but were significantly different for both accuracy and speed of word and nonword reading (see Table 1 for details). Participants were tested individually, were unaware of the purpose of the experiment, and received course credits or money for their participation. Written consent was obtained from all participants according to the Declaration of Helsinki.

Table 1 Mean and standard deviation (SD) of age (in years), word and nonword reading scores (errors and speed in number of correctly read items) in dyslexics (N = 15) and age-matched normal reading controls (N = 15)

Stimuli

The stimuli were made as in Van der Burg et al. (2008). The auditory stimulus was a short white noise click of 68 ms presented at 74 dB(A) through the laptop speakers. The visual stimuli were presented on a 15-inch, 60-Hz laptop monitor (Dell Latitude E5500), controlled by E-Prime 1.2 (Psychology Software Tools, Inc.; http://www.pstnet.com/eprime). The visual search displays consisted of 24 or 48 red (20 cd m−2) and green (11 cd m−2) line segments (length 0.88° visual angle) against a dark background (0.05 cd m−2). The initial color (red or green) was randomly determined for each item. The lines were randomly placed in an invisible 10 × 10 grid (10.5° × 6.5°) centered on a white central fixation cross, with the constraint that the target was never presented at the four central positions, to avoid immediate detection. The target was a horizontal or vertical line, while for distracters line orientation deviated randomly by plus or minus 26.5° from horizontal or vertical. The distracters changed color (from red-to-green or vice versa) every 50, 100, or 150 ms. The number of distracters that changed simultaneously during a trial varied for the different set sizes; in set size 24, 1, 2 or 4 distracters changed simultaneously, while in set size 48, it was 1, 4 or 7 distracters. The target changed color every 500 or 1,000 ms, and always changed alone. Distracters did not change color from 150 ms before the target until 100 ms after the target had changed color. During the first 500 ms of a trial, the target also did not change color. A dynamic example of trials with and without sound can be found at http://www.psy.vu.nl/pippop/.

Procedure

Participants were tested in a dimly lit and sound-proof cabin and were seated approximately 65 cm in front of the laptop screen. Head movements were precluded by a chin-rest. A white fixation cross was illuminated in the center of the screen at the beginning of each trial. Participants were asked to remain focused on the fixation cross. After 150–500 ms, the display with target and distracters appeared at the screen. In the sound-present condition, a change in the color of the target was always accompanied by a simultaneously presented sound. The search display was presented until the participants made a response. Participants were instructed to search for the target and to press one of two buttons corresponding with the target orientation “-” or “|” as fast and accurately as possible. All participants were explicitly told that sounds, if present, were synced with a color change of the target, and that they thus could benefit from the sound because it signaled that the target had changed color. To encourage that participants reacted as fast and as accurately as possibly throughout the whole experiment, written feedback about accuracy and search time was given after each trial. Overall scores were also given at the end of the experiment. A practice session preceded the experimental test that stopped until 10 consecutively correct answers were given.

Design

There were two within-subject factors: set size (24 or 48) and sound (present or absent). These factors were varied randomly across trials. Target orientation was balanced and randomly mixed. The whole test consisted of 1 block of 80 experimental trials, in which each of the 4 unique conditions was presented 20 times. The experiment lasted ~15 min in total.

Results

The data of the practice session and erroneous responses were excluded from analyses. The overall mean error rate was low (5.8% for the dyslexic group and 4.4% for the control group) and did not significantly differ between groups, t(28) = 1.32, P > 0.05. No further analyses were therefore performed on error rates. Search time was measured from the onset of the search display until the response to the target. The averaged search times for each condition are presented in Fig. 1. An overall repeated-measures ANOVA was conducted with group (dyslexics versus normal readers) as between-subject factor, and set size (24 or 48 items) and sound (sound present or absent) as within-subject factors. As expected, dyslexics had longer overall search times than normal readers, F(1,28) = 5.66, P < 0.05, ηp2 = 0.17 (mean search time of 8,004 ms for dyslexics and 6,100 ms for normal readers). The search time was also faster for the small than large set size, F(1,28) = 144.94, P < 0.001, ηp2 = 0.84 (mean search time of 4,759 ms for set size 24 and 9,351 ms for set size 48), and search time of trials with sound was faster than without sound, F(1,28) = 13.58, P < 0.01, ηp2 = 0.17 (mean search time of 7,929 ms for sound-absent conditions and 6,181 ms for sound-present conditions). There was an interaction between set size and group, F(1,28) = 7.15, P < 0.05, ηp2 = 0.20, indicating that dyslexics had slower search times per item than normal readers (dyslexics: 222 ms/item; normal readers: 170 ms/item). Most importantly, there was interaction between group, sound, and set size, F(1,28) = 5.275, P < 0.05, ηp2 = 0.16. As is clearly visible in Fig. 1, and as predicted, both groups benefitted from sound, but the dyslexics profited more from sound than normal readers as their improvement in search times per set size was bigger (an improvement of 1,000 ms for set size 24, and 4,735 ms for set size 48) than that of the controls (an improvement of 281 ms for set size 24, and 976 ms for set size 48).

Fig. 1
figure 1

Mean search time (in seconds) as a function of set size and presence of sound for the dyslexic (left panel) and normal reading (right panel) group. Error bars represent 1 standard error of mean

Separate ANOVAs on the sound-absent and sound-present conditions showed that in the sound-absent conditions, the dyslexics were significantly slower than the normal readers, F(1,28) = 9.15, P < 0.01, ηp2 = 0.25, and this difference was bigger with the large set size (set size × group interaction effect, F = 9.52, P < 0.01, ηp2 = 0.25). In stark contrast, in the sound-present condition, the search time of the dyslexic group was not different from the control group (main effect of group, F < 1, set size × group, F < 1).

Discussion

Here we examined the effect of sound on visual search times of dyslexic versus normal readers, using the “pip-and-pop” task. Our results confirmed, in accordance with predictions from SAS, that (1) in the sound-absent condition, dyslexics’ search time was much longer than of normal readers, (2) in the sound-absent condition, dyslexics’ search time increased more strongly with an increase in set size than of normal readers, and (3) dyslexics’ search time improved drastically by the presence of a transient sound; in fact their search time then became as good as normal readers.

The general slowness of the dyslexic group is consistent with previous studies using visual search tasks (e.g., Vidyasagar and Pammer 1999; Sireteanu et al. 2008; Romani et al. 2011). These findings are in accordance with the “sluggish attentional shifting” account of Hari and Renvall (2001), according to which the dyslexics’ automatic attention system cannot disengage fast enough from one item to move to the next. This results in a prolonged dwell time and large effect of set size.

The improvement in performance by the presence of the sound leads to the question why a transient sound could speed up the orienting of attention of dyslexic readers as much as it does. First of all, it is important to note that the sound used in the experiment did not bias attention toward a specific direction or location. We used a static sound that was centrally presented, so the sound was not informative about the possible location of the target. The gain offered by the sound for dyslexic readers is in line with other findings, demonstrating that dyslexics have specific problems in serial, but not parallel, search. For example, Sireteanu et al. (2008) demonstrated that dyslexics, compared to normal readers, have difficulties in serial, but not parallel, search. Our results are consistent with that (and previous findings by Williams et al. 1987; Ruddock 1991; Casco and Prunetti 1996; Vidyasagar and Pammer 1999) in that we found that in the sound-absent condition, the search time of dyslexic readers increased more with the number of distracters and was significantly slower than that of the normal readers. In the sound-present condition, it has been argued that the binding of synchronized auditory-visual signals occurs rapidly, automatically, and effortlessly, with the auditory signal attaching to the visual signal relatively early in the perceptual process. Through this, a nonspatial auditory event (“pip”) can guide attention toward the location of a synchronized visual event that, without an auditory signal, is difficult (Van der Burg et al. 2008). The “pip” then makes the target “pop-out,” changing a serial search into an automatic, parallel search. This then led to a decrease in dyslexics search times up to the level of normal readers.

Although it is clear that dyslexics were better in parallel search than serial search, one can still ask whether there were—besides SAS—any other reasons why dyslexics benefited significantly more from sound than the normal readers. One clue for this may come from a visual search study by Facoetti et al. (2000). These authors found that dyslexic children had a bigger pop-out effect, which is a characteristic of parallel processing. The authors argued that dyslexia is characterized by a difficulty to narrow the attentional focus, and dyslexics therefore tended to adopt a more distributed focus of attention. In addition, Van der Burg et al. (2008) suggested that at least some distributed attention is necessary for observers to notice the synchronized event of the “pip” and “pop.” By combining these two notions, it may become understandable why dyslexics, presumably with a more distributed focus of attention, could profit more from sound than normal readers.

Plausible explanations for the substantial benefit of sound by dyslexics might also be derived from studies that examined the effect of sound on visual attention. Robertson et al. (1998) studied patients with neglect and showed that the phasic alerting by a transient nonspatial sound can overcome their spatial deficits in visual awareness. This finding provides evidence that the phasic alerting alone can directly affect the speed of perceptual processing, rather than merely affecting motor readiness. Follow-up studies by Van der Burg et al. (2008, 2011), however, found evidence for a very early multisensory interaction that ruled out that the pip-and-pop effect is due to increases of alertness, as the effect follows a time course that is quite different from alerting effects.

Doyle and Snowden (2001) also examined the effects of simple auditory signals whose onset was synchronized with that of the visual target, but provided no information about the target location. Their findings made them speculate that an auditory signal may promote attentional disengagement (for similar reasoning, see Keetels and Vroomen (2011)). Related to our study, the sound may thus have a facilitatory effect on the disengagement of attention of the dyslexic readers, making them process the visual target (much) sooner when accompanied by a sound relative to when visual information is presented alone.

A final speculation about the reason for the bigger improvement by sound in the dyslexic group may be related to crossmodal temporal deficits that have been found in dyslexia. As demonstrated by Van der Burg et al. (2008), in order to be effective, a sound needs to be presented in close temporal proximity to the visual target change. This thus demands intact crossmodal temporal integration. Various authors, though, argued that dyslexics exhibit deficits in different sensory systems, which involve alterations in temporal information processing (e.g., Tallal 1980; Kinsbourne et al. 1991; Laasonen et al. 2002). One hypothesis confirmed by Hairston et al. (2005) could be relevant. The authors found support for the idea of altered crossmodal temporal processing in dyslexia, as they reported that dyslexic subjects showed an extended temporal window for binding visual and auditory cues. These findings could be potentially relevant for our results, because an enlarged temporal window may lead to a bigger “pip-and-pop” effect in the dyslexics, as they may profit over an extended period of time of the sound. It should be noted, though, that an enlarged temporal window of integration may also lead to more spurious binding between the sound and color changes of distracters. An enlarged temporal window would then interfere rather than being of help in visual search time. Future studies that vary the SOA between sound and target and the SOA between sound and distractor change might give more detailed information on this and on the mechanisms of multisensory integration in dyslexic readers.

To summarize, our results showed that dyslexic readers have problems with visual serial search. This is evidence for visual attentional abnormalities in dyslexia, more specifically in the disengagement and shifting of attention. Most interestingly, though, a spatially noninformative transient sound could overcome these abnormalities. It remains for future studies with other clinical populations who are thought to have difficulties with disengaging and shifting attention (e.g., high-functioning autistic individuals) to further explore the effects of sound on visual attention.